lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Litchfield <>
Subject Re: Missing pdf document title
Date Mon, 10 Nov 2003 17:13:27 GMT

I would try two things.

1)Is PDFBox getting the title from the document?
You can run this example to find out

java org.pdfbox.examples.pdmodel.PrintDocumentMetaData <input-pdf>

2)Is the lucene field getting properly set in the lucene database.  I
would use luke( to verify that lucene is
getting the field.

Other than that I would double check your code that gets the "Title" field


On Mon, 10 Nov 2003, Zhou, Oliver wrote:

> Hi,
> I'm using lucene demo with pdfbox-0.6.4 to index pdf files.
> It created the index files.  However, the pdf document title was empty when
> I did search.  Any idea on why?
> Thanks
> Oliver
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message