tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (3980)" <chris.a.mattm...@jpl.nasa.gov>
Subject FW: [jira] [Commented] (TIKA-1787) Include Stanford Name Entity Recognition in Tika
Date Tue, 17 Nov 2015 17:56:28 GMT
Thamme, can you have a look here:

https://builds.apache.org/job/tika-trunk-jdk1.7/887/org.apache.tika$tika-pa
rsers/testReport/junit/org.apache.tika.parser.ner/NamedEntityParserTest/tes
tParse/


Tests seem to be failing (worked for me locally maybe b/c I had
already downloaded the models?)

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++





-----Original Message-----
From: "Hudson (JIRA)" <jira@apache.org>
Date: Tuesday, November 17, 2015 at 12:48 PM
To: jpluser <chris.a.mattmann@jpl.nasa.gov>
Subject: [jira] [Commented] (TIKA-1787) Include Stanford Name Entity
Recognition in Tika

>
>    [ 
>https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.pl
>ugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15009116#comm
>ent-15009116 ] 
>
>Hudson commented on TIKA-1787:
>------------------------------
>
>UNSTABLE: Integrated in tika-trunk-jdk1.7 #887 (See
>[https://builds.apache.org/job/tika-trunk-jdk1.7/887/])
>Fix for TIKA-1787: Include Stanford Name Entity Recognition in Tika
>contributed by Thamme Gowda N and Yueheng He this closes #61 this closes
>#62 (mattmann: 
>[http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1714835])
>* trunk/.gitignore
>* trunk/CHANGES.txt
>* trunk/tika-parsers/pom.xml
>* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.j
>ava
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityPar
>ser.java
>* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNL
>PNERecogniser.java
>* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL
>PNERecogniser.java
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL
>PNameFinder.java
>* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
>* 
>trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNER
>ecogniser.java
>* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner
>* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
>* 
>trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner
>-regex.txt
>* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner
>* 
>trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityPar
>serTest.java
>* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
>* 
>trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNER
>ecogniserTest.java
>* trunk/tika-parsers/src/test/resources/org/apache/tika/parser
>* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner
>* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
>* 
>trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/M
>odelGetter.groovy
>* 
>trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/g
>et-models.sh
>* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
>* 
>trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner
>-regex.txt
>* 
>trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-conf
>ig.xml
>
>
>> Include Stanford Name Entity Recognition in Tika
>> ------------------------------------------------
>>
>>                 Key: TIKA-1787
>>                 URL: https://issues.apache.org/jira/browse/TIKA-1787
>>             Project: Tika
>>          Issue Type: Improvement
>>          Components: mime, parser
>>    Affects Versions: 1.12
>>         Environment: Java 1.8, Mac OSX 10.11
>>            Reporter: Yueheng He
>>            Assignee: Chris A. Mattmann
>>              Labels: features, newbie, test
>>             Fix For: 1.12
>>
>>   Original Estimate: 168h
>>  Remaining Estimate: 168h
>>
>> Using the Stanford Name Entity Recognition, Tika will be able to
>>extract name entities like PERSON, ORGANIZATION, LOCATION, etc from the
>>given text. The extracted name entities will be added to the metadata
>
>
>
>--
>This message was sent by Atlassian JIRA
>(v6.3.4#6332)

Mime
View raw message