lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrew Clegg <andrew.cl...@gmail.com>
Subject Re: Faceting within one document
Date Thu, 29 Oct 2009 19:45:35 GMT


Are you sure? I've *never* explicitly deleted a document, I only ever
rebuild the entire index with the data import handler's "full import with
cleaning" operation.


Lance Norskog-2 wrote:
> 
> 0-value facets are left behind by docs which you have deleted. If you
> optimize, there should be no 0-value facets.
> 
> On Wed, Oct 28, 2009 at 11:36 AM, Andrew Clegg <andrew.clegg@gmail.com>
> wrote:
>>
>>
>> Isn't the TermVectorComponent more for one document at a time, and the
>> TermsComponent for the whole index?
>>
>> Actually -- having done some digging... What I'm really after is the most
>> informative terms in a given document, which should take into account
>> global
>> document frequency as well as term frequency in the document at hand. I
>> think I can use the MoreLikeThisHandler to do this, with a bit of
>> experimentation...
>>
>> Thanks for the facet mincount tip BTW.
>>
>> Andrew.
>>
>>
>> Avlesh Singh wrote:
>>>
>>> For facets -
>>> http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount
>>> For terms - http://wiki.apache.org/solr/TermsComponent
>>>
>>> Helps?
>>>
>>> Cheers
>>> Avlesh
>>>
>>> On Wed, Oct 28, 2009 at 11:32 PM, Andrew Clegg
>>> <andrew.clegg@gmail.com>wrote:
>>>
>>>>
>>>> Hi,
>>>>
>>>> If I give a query that matches a single document, and facet on a
>>>> particular
>>>> field, I get a list of all the terms in that field which appear in that
>>>> document.
>>>>
>>>> (I also get some with a count of zero, I don't really understand where
>>>> they
>>>> come from... ?)
>>>>
>>>> Is it possible with faceting, or a similar mechanism, to get a count of
>>>> how
>>>> many times each term appears within that document?
>>>>
>>>> This would be really useful for building a list of top keywords within
>>>> a
>>>> long document, for summarization purposes. I can do it on the client
>>>> side
>>>> but it'd be nice to know if there's a quicker way.
>>>>
>>>> Thanks!
>>>>
>>>> Andrew.
>>>>
>>>> --
>>>> View this message in context:
>>>> http://www.nabble.com/Faceting-within-one-document-tp26099278p26099278.html
>>>> Sent from the Solr - User mailing list archive at Nabble.com.
>>>>
>>>>
>>>
>>>
>>
>> --
>> View this message in context:
>> http://www.nabble.com/Faceting-within-one-document-tp26099278p26099847.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
> 
> 
> 
> -- 
> Lance Norskog
> goksron@gmail.com
> 
> 

-- 
View this message in context: http://www.nabble.com/Faceting-within-one-document-tp26099278p26119536.html
Sent from the Solr - User mailing list archive at Nabble.com.


Mime
View raw message