tika-dev mailing list archives

From "Uwe Schindler (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-847) Add regular expression support to the MagicDetector
Date Wed, 18 Jan 2012 10:55:39 GMT

    [ https://issues.apache.org/jira/browse/TIKA-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13188402#comment-13188402 ]

Uwe Schindler commented on TIKA-847:

You could use the original FSM library that Lucene's is a fork of: http://www.brics.dk/automaton/
> Add regular expression support to the MagicDetector
> ---------------------------------------------------
>                 Key: TIKA-847
>                 URL: https://issues.apache.org/jira/browse/TIKA-847
>             Project: Tika
>          Issue Type: New Feature
>          Components: mime
>    Affects Versions: 1.0
>            Reporter: Andrew Jackson
>              Labels: detection, format
> Following on from TIKA-86, we would like to add support for regular expressions to the
> MagicDetector. This would allow more signatures to be re-used from more sources (e.g. the
> file(1) command). As part of the SCAPE Project, we have added this functionality to our own
> Tika branch (e.g. https://github.com/openplanets/tika/commit/b8de9e77c8b432788e3f73a4dbccca8ea044b503)
> and are working to tidy this up to make a clean patch we can submit here.
> BTW, are there any patch submission guidelines or coding standards we should check our
> work against first?


