tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chris A. Mattmann (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content
Date Sun, 01 Jul 2012 21:00:52 GMT

     [ https://issues.apache.org/jira/browse/TIKA-819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Chris A. Mattmann updated TIKA-819:
-----------------------------------

    Fix Version/s:     (was: 1.2)
                   1.3

- push to 1.3
                
> Make Option to Exclude Embedded Files' Text for Text Content
> ------------------------------------------------------------
>
>                 Key: TIKA-819
>                 URL: https://issues.apache.org/jira/browse/TIKA-819
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>    Affects Versions: 1.0
>         Environment: Windows-7 + JDK 1.6 u26
>            Reporter: Albert L.
>             Fix For: 1.3
>
>
> It would be nice to be able to disable text content from embedded files.
> For example, if I have a DOCX with an embedded PPTX, then I would like the option to
disable text from the PPTX from showing up when asking for the text content from DOCX.  In
other words, it would be nice to have the option to get text content *only* from the DOCX
instead of the DOCX+PPTX.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message