lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mukherjee, Prasenjit" <P.Mukher...@corp.aol.com>
Subject RE: readVInt, what is it for?
Date Thu, 03 Jul 2008 04:13:09 GMT
The slide16 in the following ppt might be of some help. Let me know if
it helps. 

http://docs.google.com/Presentation?docid=dmsxgtg_98dbh529dn

-Prasen 

-----Original Message-----
From: Grant Ingersoll [mailto:gsingers@apache.org] 
Sent: Thursday, July 03, 2008 8:08 AM
To: java-dev@lucene.apache.org
Subject: Re: readVInt, what is it for?

I'd suggest starting with a couple of places:
http://lucene.apache.org/java/2_3_2/fileformats.html

and

http://lucene.apache.org/java/2_3_2/scoring.html

and then do as Yonik said and step through the internals, starting with
a simple TermQuery which leads to the TermScorer.

-Grant


On Jul 2, 2008, at 10:04 PM, blazingwolf7 wrote:

>
> Hmmm, I don't think I get it. How is it tracked during index time? I 
> index my file earlier. Later I will open the index and perform a 
> search.
> Shouldn't
> the frequency of each term in each document found be calculated at 
> during the searching process?
>
>
> Yonik Seeley wrote:
>>
>> The frequency is tracked at index time.  It's simply a read at query 
>> time.  See TermDocs.
>> If you really want to understand more about the code internals of 
>> Lucene, I'd suggest stepping through more example queries with a 
>> debugger.
>>
>> -Yonik
>>
>> On Wed, Jul 2, 2008 at 8:49 PM, blazingwolf7 <blazingwolf7@gmail.com>
>> wrote:
>>>
>>> Thanks, I am clear now on that. But do anyone know where is the 
>>> frequency of the term for each document calculated? I mean which 
>>> class it may be in and which method?
>>> Thanks
>>>
>>>
>>> Uwe Schindler wrote:
>>>>
>>>> A VInt is the way, how integers are stored in the index file in a 
>>>> compressed and variable length manner.
>>>>
>>>> Read here: http://lucene.apache.org/java/2_3_2/
>>>> fileformats.html#VInt
>>>>
>>>> -----
>>>> Uwe Schindler
>>>> H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de
>>>> eMail: uwe@thetaphi.de
>>>>
>>>>> -----Original Message-----
>>>>> From: blazingwolf7 [mailto:blazingwolf7@gmail.com]
>>>>> Sent: Wednesday, July 02, 2008 11:47 AM
>>>>> To: java-dev@lucene.apache.org
>>>>> Subject: readVInt, what is it for?
>>>>>
>>>>>
>>>>> Hi,
>>>>>
>>>>> I am fairly new to Lucene and is now currently going through its 
>>>>> source code. I am currently trying to determine how Lucene 
>>>>> calculate the frequency of a term in each document located.
>>>>>
>>>>> I encounter a method named readVInt() in IndexInput class. It 
>>>>> seems everytime it called this method it will be able to generate 
>>>>> the document number and the frequency of the term in each 
>>>>> document.
>>>>>
>>>>> I am wondering how it work and fail to find and information on it 
>>>>> on the Internet. Could anyone explain it to me? Thanks
>>>>> --
>>>>> View this message in context:
>>>>> http://www.nabble.com/readVInt%2C-what-is-
>>>>> it-for--tp18233802p18233802.html
>>>>> Sent from the Lucene - Java Developer mailing list archive at 
>>>>> Nabble.com.
>>>>>
>>>>>
>>>>> ------------------------------------------------------------------
>>>>> --- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>>>>> For additional commands, e-mail: java-dev-help@lucene.apache.org
>>>>
>>>>
>>>>
>>>> -------------------------------------------------------------------
>>>> -- To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>>>> For additional commands, e-mail: java-dev-help@lucene.apache.org
>>>>
>>>>
>>>>
>>>
>>> --
>>> View this message in context:
>>> http://www.nabble.com/readVInt%2C-what-is-it-for--tp18233802p1824979
>>> 0.html Sent from the Lucene - Java Developer mailing list archive at

>>> Nabble.com.
>>>
>>>
>>> --------------------------------------------------------------------
>>> - To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-dev-help@lucene.apache.org
>>>
>>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-dev-help@lucene.apache.org
>>
>>
>>
>
> --
> View this message in context: 
> http://www.nabble.com/readVInt%2C-what-is-it-for--tp18233802p18250434.
> html Sent from the Lucene - Java Developer mailing list archive at 
> Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-dev-help@lucene.apache.org
>

--------------------------
Grant Ingersoll
http://www.lucidimagination.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ








---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message