tika-dev mailing list archives

From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-2477) Tika : Content of XLSX file extraction is not working after poi library upgrade
Date Mon, 16 Oct 2017 18:39:00 GMT

    [ https://issues.apache.org/jira/browse/TIKA-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16206374#comment-16206374 ]

Tim Allison commented on TIKA-2477:
-----------------------------------

Try a more recent version of Tika -- 1.16 -- available here: http://www.apache.org/dyn/closer.cgi/tika/tika-app-1.16.jar
and let us know if you still have the same problem. Thank you!

> Tika :  Content of XLSX file extraction is not working after poi library upgrade
> --------------------------------------------------------------------------------
>
>                 Key: TIKA-2477
>                 URL: https://issues.apache.org/jira/browse/TIKA-2477
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>            Reporter: Ramchandran
>
> Hi Team,
> I wrote a program to extract the content of a simple xlsx file. The program works
> fine with the poi-3.11 library, but I have now upgraded my POI library to poi-3.16. The program
> still runs, but the content is not displayed (after the upgrade, only the sheet name is displayed).
> Class File
> ===============
> package MSExcelParse;
> import java.io.File;
> import java.io.FileInputStream;
> import java.io.IOException;
> import org.apache.tika.exception.TikaException;
> import org.apache.tika.metadata.Metadata;
> import org.apache.tika.parser.AutoDetectParser;
> import org.apache.tika.parser.ParseContext;
> import org.apache.tika.parser.Parser;
> import org.apache.tika.sax.BodyContentHandler;
> import org.xml.sax.SAXException;
> public class MSExcelParser{
>    public static void main(final String[] args) throws IOException, TikaException, SAXException {
>       
>       // set up the parser inputs; AutoDetectParser (below) detects the file type
>       BodyContentHandler handler = new BodyContentHandler();
>       Metadata metadata = new Metadata();
>       FileInputStream inputstream = new FileInputStream(new File("C:\\JavaTest\\Student.xlsx"));
>       ParseContext pcontext = new ParseContext();
>       
>       Parser parser = new AutoDetectParser();
>       parser.parse(inputstream, handler, metadata,pcontext);
>       
>       System.out.println("Contents of the document:" + handler.toString());
>    }
> }
> .classpath file
> ============
> <?xml version="1.0" encoding="UTF-8"?>
> <classpath>
> 	<classpathentry kind="src" path="src"/>
> 	<classpathentry kind="con" path="org.eclipse.jdt.launching.JRE_CONTAINER/org.eclipse.jdt.internal.debug.ui.launcher.StandardVMType/JavaSE-1.7"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/commons-collections4-4.1.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/commons-compress-1.8.1.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/xmlbeans-2.6.0.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/poi-3.16.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/poi-ooxml-3.16.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/poi-ooxml-schemas-3.16.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/poi-scratchpad-3.16.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/tika-core-1.7.jar"/>
> 	<classpathentry kind="lib" path="C:/JavaTest/tika-parsers-1.7.jar"/>
> 	<classpathentry kind="output" path="bin"/>
> </classpath>
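
The .classpath above pairs tika-core-1.7 and tika-parsers-1.7 with the POI 3.16 jars, so before or after moving to a newer Tika it can help to confirm which jar each class is actually loaded from. The following is only a minimal diagnostic sketch using standard JDK calls. The class name ClasspathCheck and the helper locate are hypothetical, and the two class names marked "assumed" are used here as illustrative probes rather than taken from the report.

```java
import java.security.CodeSource;

// Diagnostic sketch (hypothetical helper): report which jar, if any,
// supplies each class of interest on the current classpath.
class ClasspathCheck {

    // Returns the location a class was loaded from, or a short marker when
    // the class cannot be loaded or comes from the JDK itself.
    static String locate(String className) {
        try {
            // initialize=false: only locate the class, do not run static initializers
            Class<?> c = Class.forName(className, false,
                    ClasspathCheck.class.getClassLoader());
            CodeSource src = c.getProtectionDomain().getCodeSource();
            if (src == null || src.getLocation() == null) {
                return "JDK / bootstrap class loader";
            }
            return src.getLocation().toString();
        } catch (ClassNotFoundException | LinkageError e) {
            return "not loadable (" + e.getClass().getSimpleName() + ")";
        }
    }

    public static void main(String[] args) {
        String[] names = {
            // used by the program in the report
            "org.apache.tika.parser.AutoDetectParser",
            // assumed: Tika's OOXML parser class in tika-parsers
            "org.apache.tika.parser.microsoft.ooxml.OOXMLParser",
            // assumed: a workbook class shipped in poi-ooxml
            "org.apache.poi.xssf.usermodel.XSSFWorkbook"
        };
        for (String name : names) {
            System.out.println(name + " -> " + locate(name));
        }
    }
}
```

Run with the same classpath as the failing program, this prints one line per class with the jar path it resolved to, which makes a mixed set of Tika and POI versions easier to spot.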



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
