lucene-java-user mailing list archives

From Gong Li <>
Subject About PDF+Lucene
Date Sat, 19 Feb 2011 13:44:09 GMT

I use PDFBox to extract the text from a PDF and then use Lucene to index and
search it. The search gives me the context of the keyword, but only as a
plain String.
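
For illustration, the "context of the keyword as a String" step might look
roughly like the sketch below. It is only a minimal, standard-library sketch
under stated assumptions: the text is assumed to have been extracted to a
String already (for example by PDFBox), and no Lucene or PDFBox API is used.
The class name KeywordContext, the method contexts, and the window parameter
are hypothetical names chosen for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper: collects a window of characters around each keyword hit.
// Assumes plain text that was already extracted from the PDF.
public class KeywordContext {

    // Returns one snippet per occurrence of keyword, with up to `window`
    // characters of surrounding text on each side. Matching is case-insensitive.
    public static List<String> contexts(String text, String keyword, int window) {
        List<String> result = new ArrayList<>();
        if (text == null || keyword == null || keyword.isEmpty() || window < 0) {
            return result; // nothing to search for
        }
        // Lower-casing with Locale.ROOT keeps offsets aligned for typical text;
        // this is an assumption that may not hold for every Unicode character.
        String haystack = text.toLowerCase(Locale.ROOT);
        String needle = keyword.toLowerCase(Locale.ROOT);

        int from = 0;
        while (true) {
            int hit = haystack.indexOf(needle, from);
            if (hit < 0) {
                break;
            }
            int start = Math.max(0, hit - window);
            int end = Math.min(text.length(), hit + needle.length() + window);
            result.add(text.substring(start, end));
            from = hit + needle.length(); // continue after this occurrence
        }
        return result;
    }

    public static void main(String[] args) {
        String text = "alpha beta gamma delta";
        for (String snippet : contexts(text, "gamma", 5)) {
            System.out.println(snippet);
        }
    }
}
```

This yields only character snippets; it keeps none of the layout, fonts, or
page positions of the original PDF, which is exactly the part I am missing.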

Question: I need to create a new PDF that contains only the context of the
keyword, formatted like the original document. How can I do this?

