tika-dev mailing list archives

From "Ray Gauss II (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (TIKA-979) Metadata not clean after tikaParser.parser.
Date Mon, 29 Oct 2012 00:51:12 GMT

     [ https://issues.apache.org/jira/browse/TIKA-979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ray Gauss II resolved TIKA-979.
-------------------------------

       Resolution: Won't Fix
    Fix Version/s:     (was: 1.3)

A Metadata object is meant to hold metadata specific to the input stream being parsed.

In your tests you're attempting to parse different files using the same Metadata object.

I think your solution of creating a new Metadata object for each file is more appropriate
than altering Tika to accommodate that use case.
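
The effect described above can be illustrated with a small stand-in sketch. It does not use
Tika's classes: the parse method and the Map-based metadata below are hypothetical, and only
mimic a parser that adds fields to whatever metadata object it is handed without clearing it
first. Under that assumption, it shows why a shared object carries stale fields into the next
file, while a fresh object per file does not.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Stand-in sketch only: nothing here is a Tika class. The Map plays the role
// of a metadata object and parse() plays the role of a parser call.
class MetadataReuseSketch {

    // Hypothetical parser: always records the file name, and records a
    // page count only for PDF files. It never clears the metadata it is given.
    static void parse(String fileName, Map<String, String> metadata) {
        metadata.put("resourceName", fileName);
        if (fileName.endsWith(".pdf")) {
            metadata.put("pageCount", "3");
        }
    }

    // Reuses ONE metadata object for every file and returns its final state.
    // Fields written for earlier files are still present at the end.
    static Map<String, String> parseAllShared(List<String> files) {
        Map<String, String> shared = new LinkedHashMap<>();
        for (String file : files) {
            parse(file, shared);
        }
        return shared;
    }

    // Creates a NEW metadata object per file, so each result holds only
    // the fields that belong to that file.
    static Map<String, Map<String, String>> parseAllFresh(List<String> files) {
        Map<String, Map<String, String>> results = new LinkedHashMap<>();
        for (String file : files) {
            Map<String, String> metadata = new LinkedHashMap<>();
            parse(file, metadata);
            results.put(file, metadata);
        }
        return results;
    }

    public static void main(String[] args) {
        List<String> files = Arrays.asList("report.pdf", "notes.txt");
        // The shared object still carries the PDF's page count after the text file.
        System.out.println("shared: " + parseAllShared(files));
        // The per-file objects keep each file's fields separate.
        System.out.println("fresh:  " + parseAllFresh(files));
    }
}
```

In Tika itself the corresponding change would be to construct a new Metadata object inside
the batch loop, before each parse call, rather than once outside it.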
                
> Metadata not clean after tikaParser.parser.
> -------------------------------------------
>
>                 Key: TIKA-979
>                 URL: https://issues.apache.org/jira/browse/TIKA-979
>             Project: Tika
>          Issue Type: Wish
>          Components: parser
>    Affects Versions: 1.2
>         Environment: windows server 2008 
>            Reporter: Xujunjie
>         Attachments: TikaTest.java
>
>
> We want to use Tika to process files in batch. In Tika 0.9 we could run tikaParser.parser(..)
> on one file after another to handle files in batch. But in Tika 1.2, the Metadata is not clean
> after parsing, so we have to clean the Metadata or create a new TikaMetadata object for the
> next parse.
>
> So I think we need to add a function to clean the Metadata, or just clean it after parser().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
