lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Karsten Konrad" <>
Subject AW: Lucene internal document number?
Date Fri, 06 Aug 2004 12:33:37 GMT



a have a short question it's regarding lucene internal document numbers: 
can you give me an idea where they are written into the index and how 
they are generated?

I am not 100% sure about the technical design, only
from my experience with Lucene:

The numbers depend on when the document was indexed. 
The "older" the document, the smaller the number. All 
documents are numbered from 0 to n-1 where n is the 
number of documents the current reader sees. There
are never any "gaps" in this numbering.

There is, to my knowledge, no explicit point where
these numbers are written in the index. Think of
positions in a list - they are not part of the
list itself. You have to take into account that
these numbers may change for documents after 
any deletions in the index.




Dr.-Ing. Karsten Konrad
Head of Artificial Intelligence Lab

Xtramind Technologies GmbH 
Stuhlsatzenhausweg 3 
D-66123 Saarbrücken

Phone +49 (681) 3 02-51 13 
Fax +49 (681) 3 02-51 09

>Besuchen Sie uns !
>DMS |  Halle 2 Stand 2705 |  07.- 09. September 2004 |  Messe Essen |

-----Ursprüngliche Nachricht-----
Von: B. Grimm [Eastbeam GmbH] [] 
Gesendet: Freitag, 6. August 2004 13:42
Betreff: Lucene internal document number?

hi there,
i looked around through the source but i dont get it. i also read the 
faq and i know that numbers are incremental for each index and start by 
0 and change when optimizing and so one...

i looked at the doc writers in lucene, but i dont get the point where 
numbers are given and written (i assume by using writeVInt() or 
something like that).

it would be very kind if anyone can tell me what line in which file i 
had to look for.

thanks in andvance and kind regards from berlin, germany.


Mit freundlichem Gruß,
Bastian Grimm

To unsubscribe, e-mail:
For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message