tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dave Meikle (JIRA)" <j...@apache.org>
Subject [jira] Commented: (TIKA-149) Parser for zip files
Date Tue, 05 Aug 2008 19:22:44 GMT

    [ https://issues.apache.org/jira/browse/TIKA-149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12620011#action_12620011
] 

Dave Meikle commented on TIKA-149:
----------------------------------

Sorry, should have just manipulated the stream. Not sure about the delegate parser though,
as each file may require a different parser. I have just updated the code to use the AutoDetectParser
but if you can see the other use case it can be changed for the seperate setter method for
the delegate parser.

> Parser for zip files
> --------------------
>
>                 Key: TIKA-149
>                 URL: https://issues.apache.org/jira/browse/TIKA-149
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>         Attachments: TIKA-149.patch
>
>
> Tika should be able to parse zip files. The resulting XHTML document should be something
like this:
> <xhtml>
>   <head>...</head>
>   <body>
>     <div class="file">
>         <h1>path/to/file/inside/the/zip</h1>
>         ... (parsed contents of the file)
>     </div>
>     ...
>   </body>
> </xhtml>

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message