lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tony Mullins <tonymullins...@gmail.com>
Subject Re: How to re-index Solr & get term frequency within documents
Date Tue, 02 Jul 2013 06:00:22 GMT
I use Nutch as input datasource for my Solr.
So I cannot re-run all the Nutch jobs to generate data again for Solr as it
will take very long to generate that much data.

I was hoping there would be an easier way inside Solr to just re-index all
the existing data.

Thanks,
Tony


On Tue, Jul 2, 2013 at 1:37 AM, Jack Krupansky <jack@basetechnology.com>wrote:

> Or, go with a commercial product that has a single-click Solr re-index
> capability, such as:
>
> 1. DataStax Enterprise - data is stored in Cassandra and reindexed into
> Solr from there.
>
> 2. LucidWorks Search - data sources are declared so that the package can
> automatically re-crawl the data sources.
>
> But, yeah, as Otis says, "re-index" is really just a euphemism for
> deleting your Solr data directory and indexing from scratch from the
> original data sources.
>
> -- Jack Krupansky
>
> -----Original Message----- From: Otis Gospodnetic
> Sent: Monday, July 01, 2013 2:26 PM
> To: solr-user@lucene.apache.org
> Subject: Re: How to re-index Solr & get term frequency within documents
>
>
> If all your fields are stored, you can do it with
> http://search-lucene.com/?q=**solrentityprocessor<http://search-lucene.com/?q=solrentityprocessor>
>
> Otherwise, just reindex the same way you indexed in the first place.
> *Always* be ready to reindex from scratch.
>
> Otis
> --
> Solr & ElasticSearch Support -- http://sematext.com/
> Performance Monitoring -- http://sematext.com/spm
>
>
>
> On Mon, Jul 1, 2013 at 1:29 PM, Tony Mullins <tonymullins.tm@gmail.com>
> wrote:
>
>> Thanks Jack , it worked.
>>
>> Could you please provide some info on how to re-index existing data in
>> Solr, after changing the schema.xml ?
>>
>> Thanks,
>> Tony
>>
>>
>> On Mon, Jul 1, 2013 at 8:21 PM, Jack Krupansky <jack@basetechnology.com>*
>> *wrote:
>>
>>  You can write any function query in the field list of the "fl" parameter.
>>> Sounds like you want "termfreq":
>>>
>>> termfreq(field_arg,term)
>>>
>>> fl=id,a,b,c,termfreq(a,xyz)
>>>
>>>
>>> -- Jack Krupansky
>>>
>>> -----Original Message----- From: Tony Mullins
>>> Sent: Monday, July 01, 2013 10:47 AM
>>> To: solr-user@lucene.apache.org
>>> Subject: How to re-index Solr & get term frequency within documents
>>>
>>>
>>> Hi,
>>>
>>> I am using Solr 4.3.0.
>>> If I change my solr's schema.xml then do I need to re-index my solr ? And
>>> if yes , how to ?
>>>
>>> My 2nd question is I need to find the frequency of term per document in
>>> all
>>> documents of search result.
>>>
>>> My field is
>>>
>>> <field name="CommentX" type="text_general" stored="true" indexed="true"
>>> multiValued="true" termVectors="true" termPositions="true"
>>> termOffsets="true"/>
>>>
>>> And I am trying this query
>>>
>>> http://localhost:8080/solr/****select/?q=iphone&fl=AuthorX%**
>>> 2CTitleX%2CCommentX&df=****CommentX&wt=xml&indent=true&**
>>> qt=tvrh&tv=true&tv.tf=true&tv.****df=true&tv.positions&tv.****
>>> offsets=true<http://localhost:**8080/solr/select/?q=iphone&fl=**
>>> AuthorX%2CTitleX%2CCommentX&**df=CommentX&wt=xml&indent=**
>>> true&qt=tvrh&tv=true&tv.tf=**true&tv.df=true&tv.positions&**
>>> tv.offsets=true<http://localhost:8080/solr/select/?q=iphone&fl=AuthorX%2CTitleX%2CCommentX&df=CommentX&wt=xml&indent=true&qt=tvrh&tv=true&tv.tf=true&tv.df=true&tv.positions&tv.offsets=true>
>>> >
>>>
>>> Its just returning me the result set, no info on my searched term's
>>> (iphone) frequency in each document.
>>>
>>> How can I make Solr to return the frequency of searched term per document
>>> in result set ?
>>>
>>> Thanks,
>>> Tony.
>>>
>>>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message