tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jukka Zitting (JIRA)" <j...@apache.org>
Subject [jira] Resolved: (TIKA-113) Metadata (such as title) should not be part of content
Date Thu, 10 Apr 2008 10:52:05 GMT

     [ https://issues.apache.org/jira/browse/TIKA-113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jukka Zitting resolved TIKA-113.
--------------------------------

    Resolution: Fixed
      Assignee: Jukka Zitting

Resolved in revision  646748 by implementing a BodyContentHandler class for getting just the
XHTML body content.

> Metadata (such as title) should not be part of content
> ------------------------------------------------------
>
>                 Key: TIKA-113
>                 URL: https://issues.apache.org/jira/browse/TIKA-113
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Rida Benjelloun
>            Assignee: Jukka Zitting
>             Fix For: 0.2-incubating
>
>
> Metadata (such as title)  is added in the content. In my opinion it would be preferable
 that the toString () on the writer return only the content of the document and not metadata.
The metadata  are already  stored in the metadata object
> Rida.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message