tika-dev mailing list archives

From "Nick Burch (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (TIKA-1168) The IWork NumbersContentHandler returns unsupported Metadata PropertyType
Date Mon, 02 Jun 2014 16:47:01 GMT

    [ https://issues.apache.org/jira/browse/TIKA-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14015538#comment-14015538 ]

Nick Burch commented on TIKA-1168:
----------------------------------

I've just tried with a recent svn snapshot build, and Tika is able to extract the metadata
from your test file without errors.

Any chance you could re-test, to see whether the metadata work done since 1.4 has solved it
for you too?

> The IWork NumbersContentHandler returns unsupported Metadata PropertyType
> -------------------------------------------------------------------------
>
>                 Key: TIKA-1168
>                 URL: https://issues.apache.org/jira/browse/TIKA-1168
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Jake Gilley
>         Attachments: testMultipleSheets.numbers
>
>
> The resolveMetadataKey function in org.apache.tika.parser.iwork.NumbersContentHandler
> returns a Property with PropertyType "SIMPLE" by default. This causes a
> PropertyTypeException("SIMPLE is not supported") when the Property is added using
> Metadata.add(Property, String). The exception is generated by line 92 in
> org.apache.tika.parser.iwork.NumbersContentHandler:
> metadata.add(metadataKey, attributes.getValue("sfa:string"));
> Caused by: org.apache.tika.metadata.PropertyTypeException: SIMPLE is not supported
> 	at org.apache.tika.metadata.Metadata.add(Metadata.java:373)
> 	at org.apache.tika.parser.iwork.NumbersContentHandler.startElement(NumbersContentHandler.java:92)
> 	at org.apache.tika.sax.ContentHandlerDecorator.startElement(ContentHandlerDecorator.java:126)
> 	at org.apache.xerces.parsers.AbstractSAXParser.startElement(Unknown Source)
> 	at org.apache.xerces.parsers.AbstractXMLDocumentParser.emptyElement(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanStartElement(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)
> 	at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)
> 	at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> 	at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)
> 	at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
> 	at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
> 	at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)
> 	at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)
> 	at javax.xml.parsers.SAXParser.parse(Unknown Source)
> 	at org.apache.tika.parser.iwork.IWorkPackageParser.parse(IWorkPackageParser.java:209)
> 	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> 	... 16 more
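
For readers unfamiliar with the failure mode in the report above, here is a minimal, self-contained sketch of it. The classes below are simplified, hypothetical stand-ins named after Tika's Property and Metadata; they are not Tika's source, and the exception type and message are only modeled on the trace. The sketch assumes that add() accepts only multi-valued property types, which is what the reported "SIMPLE is not supported" message suggests, and shows one defensive pattern (the hypothetical store() helper) that falls back to set() for single-valued properties. It is not a description of how the issue was actually resolved in Tika.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class PropertyTypeSketch {
    // Hypothetical stand-in for Tika's property types; only two are modeled.
    enum PropertyType { SIMPLE, BAG }

    // Hypothetical stand-in for org.apache.tika.metadata.Property.
    static final class Property {
        final String name;
        final PropertyType type;

        Property(String name, PropertyType type) {
            this.name = name;
            this.type = type;
        }

        boolean isMultiValuePermitted() {
            return type == PropertyType.BAG;
        }
    }

    // Hypothetical stand-in for org.apache.tika.metadata.Metadata.
    static final class Metadata {
        private final Map<String, List<String>> values = new HashMap<>();

        // Assumed behavior: add() rejects single-valued property types.
        void add(Property p, String value) {
            if (!p.isMultiValuePermitted()) {
                throw new IllegalStateException(p.type + " is not supported");
            }
            values.computeIfAbsent(p.name, k -> new ArrayList<>()).add(value);
        }

        // set() replaces any existing value, so it suits single-valued properties.
        void set(Property p, String value) {
            List<String> single = new ArrayList<>();
            single.add(value);
            values.put(p.name, single);
        }

        List<String> get(Property p) {
            return values.getOrDefault(p.name, new ArrayList<>());
        }
    }

    // Defensive pattern: pick add() or set() based on the property type.
    static void store(Metadata m, Property p, String value) {
        if (p.isMultiValuePermitted()) {
            m.add(p, value);
        } else {
            m.set(p, value);
        }
    }

    public static void main(String[] args) {
        Metadata m = new Metadata();
        Property simple = new Property("title", PropertyType.SIMPLE);
        store(m, simple, "Sheet 1"); // routed to set(), so no exception
        System.out.println(m.get(simple));
    }
}
```

In this model, calling add() directly with the SIMPLE property throws, mirroring the reported trace, while store() succeeds because it checks the property type first.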



--
This message was sent by Atlassian JIRA
(v6.2#6252)
