lucene-java-user mailing list archives

From "Ganesh" <>
Subject PDF text extracted without spaces
Date Fri, 03 Dec 2010 05:35:22 GMT
Hello all,

I know this is not the right group for this question, but I thought some of you might have

I am new to Tika and am using the latest version, 0.8. I extracted text from a PDF document
but found that spaces and newlines are missing, so indexing the data gives wrong results.
Could anyone in this group help me? I am using Tika directly to extract the contents, which
are later indexed.
