tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1096) CompressorParser: Add support for handling concatenated InputStreams
Date Sat, 23 Mar 2013 01:47:14 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13611521#comment-13611521
] 

Chris A. Mattmann commented on TIKA-1096:
-----------------------------------------

applied the patch and tested locally:

{noformat}
[INFO] 
[INFO] --- maven-remote-resources-plugin:1.2.1:process (default) @ tika ---
[INFO] 
[INFO] --- maven-site-plugin:3.0:attach-descriptor (attach-descriptor) @ tika ---
[INFO] 
[INFO] --- maven-install-plugin:2.3.1:install (default-install) @ tika ---
[INFO] Installing /Users/mattmann/src/tika/trunk/pom.xml to /Users/mattmann/.m2/repository/org/apache/tika/tika/1.4-SNAPSHOT/tika-1.4-SNAPSHOT.pom
[INFO] Installing /Users/mattmann/src/tika/trunk/target/tika-1.4-SNAPSHOT-site.xml to /Users/mattmann/.m2/repository/org/apache/tika/tika/1.4-SNAPSHOT/tika-1.4-SNAPSHOT-site.xml
[INFO] ------------------------------------------------------------------------
[INFO] Reactor Summary:
[INFO] 
[INFO] Apache Tika parent ................................ SUCCESS [1.899s]
[INFO] Apache Tika core .................................. SUCCESS [17.224s]
[INFO] Apache Tika parsers ............................... SUCCESS [52.196s]
[INFO] Apache Tika XMP ................................... SUCCESS [1.824s]
[INFO] Apache Tika application ........................... SUCCESS [12.541s]
[INFO] Apache Tika OSGi bundle ........................... SUCCESS [19.793s]
[INFO] Apache Tika server ................................ SUCCESS [15.585s]
[INFO] Apache Tika ....................................... SUCCESS [0.270s]
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 2:02.238s
[INFO] Finished at: Fri Mar 22 18:28:21 PDT 2013
[INFO] Final Memory: 36M/252M
[INFO] -----------------------------
{noformat}

Good to go, commiting!
                
> CompressorParser: Add support for handling concatenated InputStreams
> --------------------------------------------------------------------
>
>                 Key: TIKA-1096
>                 URL: https://issues.apache.org/jira/browse/TIKA-1096
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Gregory Chanan
>            Assignee: Chris A. Mattmann
>            Priority: Minor
>         Attachments: TIKA-1096.patch
>
>
> COMPRESS-220 added support for CompressorStreamFactory to return an InputStream with
decompressConcatenated set to true.  Today, Tika uses the CompressorStreamFactory without
this option, which caused me some problems parsing some gzipped files that required this option.
> Today I have to do some pre-processing on the InputStreams before I send them to Tika;
it would be great if Tika could handle this for me.
> I wrote up a quick patch that adds this option; I'll attach it soon.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message