tika-dev mailing list archives

From "John Mastarone (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-935) TikaException thrown when trying to parse archive (*.ar) files
Date Mon, 28 May 2012 02:41:22 GMT

     [ https://issues.apache.org/jira/browse/TIKA-935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Mastarone updated TIKA-935:
--------------------------------

    Attachment: ArParserTest.java
                TIKA-935.patch

Uploaded a patch that corrects the error in *.ar file detection, along with a new unit test
class that uses the existing .ar files in the test-documents folder.  With this patch,
parsing succeeds in the latest build, and the unit tests pass.
                
> TikaException thrown when trying to parse archive (*.ar) files
> --------------------------------------------------------------
>
>                 Key: TIKA-935
>                 URL: https://issues.apache.org/jira/browse/TIKA-935
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.2
>         Environment: Windows 7
>            Reporter: John Mastarone
>         Attachments: ArParserTest.java, TIKA-935.patch
>
>
> A TikaException is thrown when either of the two .ar files from the parsers'
> test-documents folder is dropped into Tika-GUI.  Judging from the magic entries at
> http://stuff.mit.edu/afs/athena/software/cygwin/cygwin_v1.3.2/usr/share/magic.mime
> archive detection in the PackageExtractor class does not handle these files correctly,
> so a TarArchiveInputStream is incorrectly chosen by default.
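
For illustration only, here is a minimal sketch of the kind of check the description implies is missing: test for the ar magic before falling back to tar. It is not the attached patch. The class and method names (ArMagicSketch, looksLikeAr) are hypothetical, and the only fact relied on is that a Unix ar archive begins with the 8-byte global header "!<arch>\n".

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika or the TIKA-935 patch.
public class ArMagicSketch {

    // A Unix ar archive starts with the 8-byte global header "!<arch>\n".
    private static final byte[] AR_MAGIC =
            "!<arch>\n".getBytes(StandardCharsets.US_ASCII);

    /** Returns true if the first 'length' bytes of 'header' begin with the ar magic. */
    public static boolean looksLikeAr(byte[] header, int length) {
        if (header == null || length < AR_MAGIC.length || header.length < AR_MAGIC.length) {
            return false;
        }
        for (int i = 0; i < AR_MAGIC.length; i++) {
            if (header[i] != AR_MAGIC[i]) {
                return false;
            }
        }
        return true;
    }

    /**
     * Peeks at the start of a mark-supporting stream (for example a
     * BufferedInputStream) and rewinds it, so a parser can still read
     * the archive from the beginning afterwards.
     */
    public static boolean looksLikeAr(InputStream in) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        in.mark(AR_MAGIC.length);
        try {
            byte[] head = new byte[AR_MAGIC.length];
            int total = 0;
            // read() may return fewer bytes than requested, so loop until full or EOF.
            while (total < head.length) {
                int n = in.read(head, total, head.length - total);
                if (n < 0) {
                    break;
                }
                total += n;
            }
            return looksLikeAr(head, total);
        } finally {
            in.reset();
        }
    }
}
```

A detector built this way would run the ar check before defaulting to a tar stream, so .ar input is not handed to TarArchiveInputStream.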

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
