tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (Jira)" <j...@apache.org>
Subject [jira] [Assigned] (TIKA-2966) Create a tika-eval SAXHandler
Date Thu, 17 Oct 2019 11:25:00 GMT

     [ https://issues.apache.org/jira/browse/TIKA-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tim Allison reassigned TIKA-2966:
---------------------------------

    Assignee: Tim Allison

> Create a tika-eval SAXHandler
> -----------------------------
>
>                 Key: TIKA-2966
>                 URL: https://issues.apache.org/jira/browse/TIKA-2966
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Major
>
> One of the improvements coming in 1.23 is the decoupling of the text stats calculator
from the tika-eval app.  To make this even easier to use, let's add a handler that will calculate
the text stats on .endDocument() and record those stats in a metadata object.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Mime
View raw message