tika-dev mailing list archives

From "Ann Burgess (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-1363) .mat files not parsing
Date Mon, 14 Jul 2014 22:36:09 GMT

     [ https://issues.apache.org/jira/browse/TIKA-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ann Burgess updated TIKA-1363:

    Attachment: TIKA.1363.aburgess.140614.patch.txt

Tyler P. and I determined that the MatParser was not registered in META-INF/services.  We found
this after I changed the unit test to use AutoDetectParser and the test failed.  This patch updates
the unit test and adds the MatParser to the org.apache.tika.parser.Parser file.
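
To illustrate why a missing registration entry makes a parser invisible to auto-detection, here
is a minimal sketch using the standard java.util.ServiceLoader.  It assumes Tika's default parser
discovery behaves like the standard META-INF/services lookup.  The class ServiceRegistrationSketch
and its nested Parser and MatParser types are hypothetical stand-ins, not the real Tika classes.

```java
import java.io.IOException;
import java.net.URL;
import java.net.URLClassLoader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.ServiceLoader;

public class ServiceRegistrationSketch {

    /** Hypothetical stand-in for org.apache.tika.parser.Parser. */
    public interface Parser {
        String name();
    }

    /** Hypothetical stand-in for the .mat parser; needs a public no-arg constructor. */
    public static class MatParser implements Parser {
        public MatParser() {}
        @Override public String name() { return "MatParser"; }
    }

    /** Returns the names of all Parser providers registered under root/META-INF/services. */
    public static List<String> discover(Path root) throws IOException {
        URL[] urls = { root.toUri().toURL() };
        // A fresh loader per call so that a newly written registration file is picked up.
        try (URLClassLoader loader =
                 new URLClassLoader(urls, ServiceRegistrationSketch.class.getClassLoader())) {
            List<String> names = new ArrayList<>();
            for (Parser p : ServiceLoader.load(Parser.class, loader)) {
                names.add(p.name());
            }
            return names;
        }
    }

    /** Writes the provider-configuration file: one implementation class name per line. */
    public static void register(Path root) throws IOException {
        Path services = root.resolve("META-INF").resolve("services");
        Files.createDirectories(services);
        // The file is named after the service interface; its content names the provider.
        Files.write(services.resolve(Parser.class.getName()),
                (MatParser.class.getName() + "\n").getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) throws IOException {
        Path root = Files.createTempDirectory("spi-sketch");
        System.out.println("before registration: " + discover(root));
        register(root);
        System.out.println("after registration:  " + discover(root));
    }
}
```

In this sketch the provider class is on the classpath the whole time, but it is only discovered
once the registration file exists.  That mirrors the symptom in this issue: a test that
instantiates the parser directly can pass while one that goes through AutoDetectParser fails.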

> .mat files not parsing
> ----------------------
>                 Key: TIKA-1363
>                 URL: https://issues.apache.org/jira/browse/TIKA-1363
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.6
>            Reporter: Ann Burgess
>              Labels: metadata, parser, snapshot
>         Attachments: TIKA.1363.aburgess.140614.patch.txt, test_data_1.mat
> We recently committed a parser for Matlab .mat files; however, I've just downloaded the
> most recent Tika and am not getting any --text or --metadata output for the .mat file used
> in the unit test.  The steps I've used are below.  Am I missing something at the command line?
> Can anyone else successfully get text or metadata output for a .mat file?
> Steps: 
> svn co https://svn.apache.org/repos/asf/tika/trunk tika
> setenv MAVEN_OPTS "-Xms128m -Xmx256m"
> cd tika
> mvn install
> java -jar tika-app/target/tika-app-1.6-SNAPSHOT.jar --text /Users/IGSWAHWSWBURGESS/Development/tika/tika-parsers/src/test/resources/test-documents/breidamerkurjokull_radar_profiles_2009.mat

This message was sent by Atlassian JIRA
