nutch-dev mailing list archives

From MengYing Wang <mengyingwa...@gmail.com>
Subject Re: How could I make more metadata indexed in Solr?
Date Thu, 30 Oct 2014 19:11:51 GMT
Dear Markus,

Yes, it works in most cases. In addition, to index content metadata, the
fields have to be listed in the "index.content.md" configuration
directive. Thank you!
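
For reference, a minimal sketch of what the two directives from this thread might look like in conf/nutch-site.xml. The field names are the ones from the parsechecker output quoted below. The plugin.includes value is only an assumed example of a typical Nutch 1.x setting with index-metadata added; your existing value may differ. The sketch also assumes ETag is reported as content metadata (it comes from the HTTP response) and that the other keys are parse metadata.

```xml
<!-- conf/nutch-site.xml: illustrative sketch, not a tested configuration -->
<configuration>
  <property>
    <name>plugin.includes</name>
    <!-- Assumption: start from your current value and add "metadata" to the index-(...) group -->
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor|metadata)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  </property>
  <property>
    <name>index.parse.md</name>
    <!-- Comma-separated parse metadata keys to turn into index fields -->
    <value>meta:creation-date,dcterms:modified,meta:save-date,xmpTPg:NPages</value>
  </property>
  <property>
    <name>index.content.md</name>
    <!-- Comma-separated content metadata keys to turn into index fields -->
    <value>ETag</value>
  </property>
</configuration>
```

After changing the configuration, rerunning ./nutch indexchecker on the same URL should show whether the extra fields appear. The Solr schema probably also needs matching field definitions (or a suitable dynamic field) before Solr will accept them.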

Best,
Mengying (Angela) Wang

On Mon, Oct 27, 2014 at 1:07 AM, Markus Jelsma <markus.jelsma@openindex.io>
wrote:

> Hi - enable the index-metadata plugin and specify your fields in the
> index.parse.md configuration directive.
> Markus
>
> -----Original message-----
> From: Mengying Wang<wang533@usc.edu>
> Sent: Sunday 26th October 2014 7:14
> To: dev@nutch.apache.org; solr-user@lucene.apache.org
> Cc: mattmann@apache.org
> Subject: How could I make more metadata indexed in Solr?
>
> Hi everyone,
>
> When I run the ./nutch parsechecker command on a PDF file, I see a number
> of metadata fields,
> e.g., ETag="cbf961-5aafc-41e4319014b80" meta:creation-date=2004-11-10T21:34:35Z
> dcterms:modified=2004-11-10T21:34:35Z meta:save-date=2004-11-10T21:34:35Z
> xmpTPg:NPages=10, etc. However, when I run the ./nutch indexchecker
> command, only a few of these fields appear, and those are the ones that
> will be indexed in Solr. How can I get the other metadata fields indexed
> in Solr as well? Thank you!
>
> Best,
>
> Mengying (Angela) Wang
>
>
>


