lucene-java-user mailing list archives

From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: Getting position increments directly from the index
Date Thu, 23 May 2013 14:09:10 GMT
Take a look at the Term Vectors Component:
http://wiki.apache.org/solr/TermVectorComponent

-- Jack Krupansky
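
A minimal sketch of the counting step, assuming the absolute positions of one field in one document have already been read back (for example from term vectors). To my knowledge the index records positions rather than the increments themselves, so the increments have to be rebuilt by sorting the positions and taking differences. The class and method names here are hypothetical, and no Lucene calls are shown; only the arithmetic is illustrated.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Lucene: rebuilds position increments
// from absolute token positions and counts the large "sentence boundary" gaps.
public class SentenceGaps {

    /** Returns the increment of each token, in position order. */
    public static int[] toIncrements(int[] positions) {
        int[] sorted = positions.clone();
        Arrays.sort(sorted);
        int[] increments = new int[sorted.length];
        // Positions start from -1, so a first token with increment 1 sits at 0.
        int previous = -1;
        for (int i = 0; i < sorted.length; i++) {
            increments[i] = sorted[i] - previous;
            previous = sorted[i];
        }
        return increments;
    }

    /** Counts increments that are at least as large as the boundary gap. */
    public static int countBoundaries(int[] positions, int gap) {
        int count = 0;
        for (int increment : toIncrements(positions)) {
            if (increment >= gap) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Three sentences: positions jump by 1000 at each sentence start.
        int[] positions = {0, 1, 2, 1002, 1003, 2003};
        System.out.println(Arrays.toString(toIncrements(positions)));
        System.out.println(countBoundaries(positions, 1000));
    }
}
```

Summing countBoundaries over all documents would give the number of boundary increments in the index; whether that equals the number of sentences depends on whether the first sentence of each document also gets the large increment.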

-----Original Message----- 
From: Igor Shalyminov
Sent: Thursday, May 23, 2013 9:54 AM
To: java-user@lucene.apache.org
Subject: Re: Getting position increments directly from the index

Thanks, Mike and Jack!

Those are really good options.
But, just to clarify, is there a way to get, let's say, a vector of position 
increments directly from the index, without re-parsing document contents?

-- 
Best Regards,
Igor

23.05.2013, 16:13, "Jack Krupansky" <jack@basetechnology.com>:
> It might be nice to inquire as to the largest position for a field in a
> document. Is that information kept anywhere? Not that I know of, although I
> suppose it can be calculated at runtime by running through all the terms of
> the field. Then he could just divide by 1000.
>
> -- Jack Krupansky
>
> -----Original Message-----
> From: Michael McCandless
> Sent: Thursday, May 23, 2013 6:28 AM
> To: Lucene Users
> Subject: Re: Getting position increments directly from the index
>
> Do you actually index the sentence boundary as a token?  If so, you
> could just get the totalTermFreq of that token?
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
> On Wed, May 22, 2013 at 10:11 AM, Igor Shalyminov
> <ishalyminov@yandex-team.ru> wrote:
>
>>  Hello!
>>
>>  I'm storing sentence bounds in the index as position increments of 1000.
>>  I want to get the total number of sentences in the index, i.e. the number
>>  of "1000" increment values.
>>  Can I do that some other way rather than just loading each document and
>>  extracting position increments with a custom Analyzer?
>>
>>  --
>>  Best Regards,
>>  Igor Shalyminov
>>
>>  ---------------------------------------------------------------------
>>  To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>  For additional commands, e-mail: java-user-help@lucene.apache.org