lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jan Høydahl (JIRA) <j...@apache.org>
Subject [jira] [Commented] (SOLR-1526) Client Side Tika integration
Date Tue, 23 Jun 2015 07:57:00 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14597303#comment-14597303
] 

Jan Høydahl commented on SOLR-1526:
-----------------------------------

Commenting on this old issue...

Now that Tika has a REST API, it may make more sense to integrate that one on the client side.
Then SolrJ as well as other language client libraries could do it more or less the same way?

[~gsingers], there was some talk about adding more official clients to Solr, is that still
being discussed?

> Client Side Tika integration
> ----------------------------
>
>                 Key: SOLR-1526
>                 URL: https://issues.apache.org/jira/browse/SOLR-1526
>             Project: Solr
>          Issue Type: New Feature
>          Components: clients - java
>            Reporter: Grant Ingersoll
>            Priority: Minor
>             Fix For: 4.9, Trunk
>
>         Attachments: clientextraction.tar.gz
>
>
> Often times it is cost prohibitive to send full, rich documents over the wire.  The contrib/extraction
library has server side integration with Tika, but it would be nice to have a client side
implementation as well.  It should support both metadata and content or just metadata.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message