tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2479) Handle empty cells in tables uniformly
Date Mon, 29 Oct 2018 22:22:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16667835#comment-16667835
] 

ASF GitHub Bot commented on TIKA-2479:
--------------------------------------

dameikle commented on issue #214: TIKA-2479 - Handle empty cells in XLSX
URL: https://github.com/apache/tika/pull/214#issuecomment-434102665
 
 
   It looks like Nick Burch hasn't seen this and has added a different fix for this in master.
 Does this do what you were expecting?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Handle empty cells in tables uniformly
> --------------------------------------
>
>                 Key: TIKA-2479
>                 URL: https://issues.apache.org/jira/browse/TIKA-2479
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.19
>
>         Attachments: patch.diff
>
>
> It looks like we output a <td/> for empty cells in xls, and tables in doc, docx
and pptx.  However, we don't retain empty cells in xlsx or tables in ppt.  We should make
this handling uniform.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message