tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2766) Be able to extract raw values from excel, not formatted
Date Wed, 31 Oct 2018 11:29:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16669974#comment-16669974

ASF GitHub Bot commented on TIKA-2766:

jtbdevelopment opened a new pull request #256: TIKA-2766 - be able to get raw values from
URL: https://github.com/apache/tika/pull/256

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

> Be able to extract raw values from excel, not formatted
> -------------------------------------------------------
>                 Key: TIKA-2766
>                 URL: https://issues.apache.org/jira/browse/TIKA-2766
>             Project: Tika
>          Issue Type: Improvement
>          Components: core
>            Reporter: JTB Development
>            Priority: Minor
> By default, tika extracts excel values as formatted in the sheet.  It's a fine default.
> However, many times, I am asked to extract raw values as the nicely formatted sheet for
human eyes is losing precision.
> In local instances, I've cloned the tika classes in order to do so, but it's messy due
to how the code is layered (i wind up extending/copying 3-4 classes because chain of class
> I believe by adding a config option to the open office config class I can implement same
option much more cleanly. 
> I plan to issue a pull request in few weeks (doing this contribute on the side based
on professional use)

This message was sent by Atlassian JIRA

View raw message