lucene-solr-user mailing list archives

From Yiannis Pericleous <Y.Pericle...@albourne.com>
Subject Solr Cell and encrypted pdf files
Date Tue, 18 May 2010 14:14:04 GMT
Hi,

I can't get Solr Cell to index password-protected PDF files.
I can't figure out how to pass the password to Tika, and looking at
ExtractingDocumentLoader, it doesn't seem to pass any PDF
password-related metadata to the Tika parser.

Whatever I do, PDFBox complains: "The supplied password does not
match either the owner or user password in the document."

If I strip the password manually before trying to index the document,
it works.

What am I missing?

thanks!

yiannis
