lucene-solr-user mailing list archives

From Gary Taylor ...@inovem.com>
Subject Re: Extracting contents of zipped files with Tika and Solr 1.4.1 (now Solr 3.1)
Date Mon, 23 May 2011 09:38:40 GMT
Jayendra,

I cleared out my local repository, and replayed all of my steps from 
Friday and now it works.  The only difference (or the only one that's 
obvious to me) was that I applied the patch before doing a full 
compile/test/dist.  But I assumed that given I was seeing my new log 
entries (from ExtractingDocumentLoader.java) I was running the correct 
code anyway.

However, I'm very pleased that it's working now - I get the full 
contents of the zipped files indexed and not just the file names.

Thank you again for your assistance, and the patch!

Kind regards,
Gary.


On 21/05/2011 03:12, Jayendra Patil wrote:
> Hi Gary,
>
> I tried the patch on the 3.1 source code (@
> http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_1/)
> as well and it worked fine.
> @Patch - https://issues.apache.org/jira/browse/SOLR-2416, which deals
> with the Solr Cell module.
>
> You may want to verify the contents from the results by enabling the
> stored attribute on the text field.
>
> e.g. URL:
> curl "http://localhost:8983/solr/update/extract?stream.file=C:/Test.zip&literal.id=777045&literal.title=Test&commit=true"
>
> Let me know if it works. I would be happy to share the generated
> artifact you can test on.
>
> Regards,
> Jayendra
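
For anyone repeating the verification step suggested above, here is a minimal sketch of how the two requests might be assembled. It only builds URLs and sends nothing. The helper names are hypothetical, and the query side assumes a standard /select handler and a stored field called "text" (the thread mentions "the text field" but does not show the schema), so adjust both to your setup.

```python
from urllib.parse import urlencode

# Characters left unescaped so the URLs stay readable (path separators,
# the drive-letter colon, and the comma in the field list).
SAFE = "/:,"


def build_extract_url(base, file_path, doc_id, title):
    """Build the Solr Cell extract URL quoted in the thread (hypothetical helper)."""
    params = [
        ("stream.file", file_path),
        ("literal.id", doc_id),
        ("literal.title", title),
        ("commit", "true"),
    ]
    return f"{base}/update/extract?{urlencode(params, safe=SAFE)}"


def build_verify_url(base, doc_id, text_field="text"):
    """Build a query that returns the stored text for one document.

    Assumption: the field named by text_field is marked stored="true"
    in the schema; otherwise it will not appear in the response.
    """
    params = [("q", f"id:{doc_id}"), ("fl", f"id,{text_field}")]
    return f"{base}/select?{urlencode(params, safe=SAFE)}"


if __name__ == "__main__":
    base = "http://localhost:8983/solr"
    print(build_extract_url(base, "C:/Test.zip", "777045", "Test"))
    print(build_verify_url(base, "777045"))
```

If the zip contents were extracted, the stored text returned by the second URL should contain the text of the files inside the archive rather than only their names.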

