lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tim Allison (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-7232) DIH should extract multiple values for metadata fields extracted by Tika
Date Wed, 12 Jul 2017 20:24:01 GMT

    [ https://issues.apache.org/jira/browse/SOLR-7232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16084624#comment-16084624
] 

Tim Allison commented on SOLR-7232:
-----------------------------------

Wanted to ping on this while I came across it.  I'll try to put together a PR shortly.

> DIH should extract multiple values for metadata fields extracted by Tika
> ------------------------------------------------------------------------
>
>                 Key: SOLR-7232
>                 URL: https://issues.apache.org/jira/browse/SOLR-7232
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - DataImportHandler
>            Reporter: Tim Allison
>            Priority: Trivial
>
> The TikaEntityProcessor is currently pulling only the first value for a given metadata
key.  If there are multiple values for a given metadata key as extracted by Tika, those values
are currently being ignored.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message