tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2479) Handle empty cells in tables uniformly
Date Tue, 30 Oct 2018 00:25:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16667928#comment-16667928
] 

ASF GitHub Bot commented on TIKA-2479:
--------------------------------------

glb commented on issue #214: TIKA-2479 - Handle empty cells in XLSX
URL: https://github.com/apache/tika/pull/214#issuecomment-434128458
 
 
   @dameikle it looks like it would solve the problem in a similar way, yes. Thanks!

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Handle empty cells in tables uniformly
> --------------------------------------
>
>                 Key: TIKA-2479
>                 URL: https://issues.apache.org/jira/browse/TIKA-2479
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.0, 1.19
>
>         Attachments: patch.diff
>
>
> It looks like we output a <td/> for empty cells in xls, and tables in doc, docx
and pptx.  However, we don't retain empty cells in xlsx or tables in ppt.  We should make
this handling uniform.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message