tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1195) XLSB support
Date Wed, 19 Apr 2017 16:02:41 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15974963#comment-15974963
] 

Hudson commented on TIKA-1195:
------------------------------

SUCCESS: Integrated in Jenkins build tika-2.x #244 (See [https://builds.apache.org/job/tika-2.x/244/])
TIKA-1195 and TIKA-2329, upgrade to POI 3.16-final and add xlsb parser (tallison: rev a847a863d1e25a9ba8209cd28c3e98be153f34a5)
* (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParser.java
* (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ooxml/OOXMLParserTest.java
* (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/OOXMLExtractorFactory.java
* (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFExcelExtractorDecorator.java
* (add) tika-test-resources/src/test/resources/test-documents/testEXCEL_various.xlsb
* (edit) tika-parser-modules/pom.xml
* (edit) tika-parser-modules/tika-parser-office-module/src/test/java/org/apache/tika/parser/microsoft/ExcelParserTest.java
* (edit) CHANGES.txt
* (add) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/parser/microsoft/ooxml/XSSFBExcelExtractorDecorator.java


> XLSB support
> ------------
>
>                 Key: TIKA-1195
>                 URL: https://issues.apache.org/jira/browse/TIKA-1195
>             Project: Tika
>          Issue Type: Improvement
>          Components: general
>    Affects Versions: 1.4
>         Environment: W2008R2
>            Reporter: Frederic Ronny
>              Labels: new-parser
>             Fix For: 2.0, 1.15
>
>
> We use Manifoldcf 1.3 and Solr 4.4 to index a shared network drive, works fine for most
of our Office filetypes ( docx, xlsx,.... ) but we also have a lot of files with filetype
xlsb which are not in the supported filetypes. 
> In order to keep using this solution it is essential to us that there will be a solution
provided in the future



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message