tika-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-792) NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document
Date Mon, 05 Aug 2013 11:22:48 GMT

    [ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13729400#comment-13729400
] 

Nick Burch commented on TIKA-792:
---------------------------------

Tim - I'd suggest you add this test document to POI, then write a unit test that triggers
the same code in POI from the test that Tika is doing. Unit test should pass in POI, but will
then trigger the inclusion of these extra classes in the cut-down poi-ooxml-schemas class,
which should then fix the issue in Tika
                
> NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)"
processing a OOXML document
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-792
>                 URL: https://issues.apache.org/jira/browse/TIKA-792
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.0
>         Environment: Linux, JDK 1.6, Jetty 8.x, Tomcat 6.x
>            Reporter: Torsten Krah
>             Fix For: 1.2
>
>         Attachments: test10.docx
>
>
> Parsing some OOXML documents, this stacktrace is logged many times:
> java.lang.NoSuchMethodException: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType,
boolean)
> 	at java.lang.Class.getConstructor0(Class.java:2723)
> 	at java.lang.Class.getDeclaredConstructor(Class.java:2002)
> 	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.getJavaImplConstructor2(SchemaTypeImpl.java:1749)
> 	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedSubclass(SchemaTypeImpl.java:1886)
> 	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedNode(SchemaTypeImpl.java:1875)
> 	at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createElementType(SchemaTypeImpl.java:1021)
> 	at org.apache.xmlbeans.impl.values.XmlObjectBase.create_element_user(XmlObjectBase.java:893)
> 	at org.apache.xmlbeans.impl.store.Xobj.getUser(Xobj.java:1657)
> 	at org.apache.xmlbeans.impl.store.Cur.getUser(Cur.java:2654)
> 	at org.apache.xmlbeans.impl.store.Cur.getObject(Cur.java:2647)
> 	at org.apache.xmlbeans.impl.store.Cursor._getObject(Cursor.java:995)
> 	at org.apache.xmlbeans.impl.store.Cursor.getObject(Cursor.java:2904)
> 	at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:83)
> 	at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:145)
> 	at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:159)
> 	at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:115)
> 	at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:53)
> 	at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:180)
> 	at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:63)
> 	at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:69)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> 	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> Looking at the poi code java is right here, there is no constructor with a SchemaType
and a boolean, only with SchemaType.
> My guess is this one was missed during upgrade to poi beta4, but only a guess, anyway
needs a fix :-).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message