lucene-solr-user mailing list archives

From Jeroen Steggink <jer...@stegg-inc.com>
Subject Update existing documents when using ExtractingRequestHandler?
Date Wed, 09 Oct 2013 15:50:40 GMT
Hi,

In a content management system I have a document and an attachment. The 
document contains the metadata and the attachment contains the actual data.
I would like to combine the data from both in a single Solr document.

I have thought of several options:

1. Use ExtractingRequestHandler with extractOnly to extract the data, 
combine it with the metadata, and send the result to Solr.
     But this might be inefficient and increase network traffic.
2. Run a separate Tika installation and use it to extract the data and 
send it to Solr.
     This would put extra load on an already busy web server.
3. First upload the file using ExtractingRequestHandler, then use atomic 
updates to add the other fields.
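
As a rough illustration of option 3, the atomic update that follows the file upload could be built along these lines. This is only a sketch: the document id, the field names, and the helper name build_atomic_update are made up for the example, and atomic updates generally need the update log enabled and the existing fields stored so that Solr can rebuild the document.

```python
import json


def build_atomic_update(doc_id, metadata):
    """Build a Solr atomic-update document that sets each metadata field.

    Hypothetical helper: doc_id must match the id used when the file
    was uploaded through ExtractingRequestHandler.
    """
    doc = {"id": doc_id}
    for field, value in metadata.items():
        # The "set" modifier replaces the field value and leaves the
        # other fields of the existing document untouched.
        doc[field] = {"set": value}
    return [doc]


# Example field names and values are assumptions, not from a real schema.
payload = json.dumps(
    build_atomic_update("doc-1", {"title": "Annual report", "author": "Jeroen"})
)
# The payload would then be POSTed to the core's /update handler with
# Content-Type: application/json, followed by a commit.
```

This costs a second request per document, but the file itself is sent to Solr only once.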

Or is there another way? For example, first add the metadata and later 
use the ExtractingRequestHandler to add the file contents?
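
One further possibility, offered as a sketch and not as part of the options above: ExtractingRequestHandler accepts literal.<fieldname> request parameters, so the metadata can be sent in the same request as the file. The base URL, core name, field names, and the helper build_extract_url below are assumptions for illustration only.

```python
from urllib.parse import urlencode


def build_extract_url(base_url, doc_id, metadata, commit=False):
    """Build an /update/extract URL carrying metadata as literal.* params.

    Hypothetical helper: the field names in metadata must exist in the
    Solr schema for the request to succeed.
    """
    params = [("literal.id", doc_id)]
    for field, value in metadata.items():
        # Each literal.<field> parameter is indexed as-is alongside the
        # content that Tika extracts from the uploaded file.
        params.append(("literal." + field, value))
    if commit:
        params.append(("commit", "true"))
    return base_url.rstrip("/") + "/update/extract?" + urlencode(params)


# Assumed local Solr instance and core name.
url = build_extract_url(
    "http://localhost:8983/solr/collection1",
    "doc-1",
    {"title": "Annual report"},
)
# The attachment itself would be POSTed as the body of this request.
```

With this approach the metadata and the extracted text end up in one document in a single round trip, provided the metadata is available at upload time.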

Cheers,
Jeroen

-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.