lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Betsey Benagh <>
Subject Re: Question about indexing PDFs
Date Thu, 25 Aug 2016 18:19:33 GMT
It looks like the metadata of the PDFs was indexed, but not the content
(which is what I was interested in).  Searches on terms I know exist in
the content come up empty.

On 8/25/16, 2:16 PM, "Betsey Benagh" <> wrote:

>Right, that¹s where I looked.  No Œcontent¹.  Which is what confused me.
>On 8/25/16, 1:56 PM, "Erick Erickson" <> wrote:
>>when you say "I don't see it in the schema for that collection" are you
>>talking schema.xml? managed_schema? Or actual documents in the index?
>>these are defined by dynamic fields and the like in the schema files.
>>Take a look at the admin UI>>schema browser>>drop down and you'll see
>>the actual fields in your index...
>>On Thu, Aug 25, 2016 at 8:39 AM, Betsey Benagh
>>> wrote:
>>> Following the instructions in the quick start guide, I imported a bunch
>>> PDF documents into my Solr 6.0 instance.  As far as I can tell from the
>>> documentation, there should be a 'content' field indexing, well, the
>>> content, but I don't see it in the schema for that collection.  Is
>>> something obvious I might have missed?
>>> Thanks!

View raw message