lucene-java-user mailing list archives

From "Lukas Vlcek" <lukas.vl...@gmail.com>
Subject Re: Wikia search goes live today
Date Tue, 08 Jan 2008 20:38:34 GMT
I should note that this technique is probably not easily applicable to
the current Lucene scoring mechanism without additional development.
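
To illustrate the side effect described in the quoted message below (an
unstarred document scoring well because it resembles highly starred ones),
here is a rough sketch. Note that it swaps the neural net for something much
simpler, a similarity-weighted average of stars using Jaccard overlap of term
sets, and it does not touch Lucene at all. Every class and method name here
is hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: NOT a neural net and NOT Lucene API.
// It only shows how stars could "leak" to similar, unstarred documents.
class StarPropagationSketch {

    // A document that users have already rated.
    static final class StarredDoc {
        final Set<String> terms;
        final int stars;

        StarredDoc(Set<String> terms, int stars) {
            this.terms = terms;
            this.stars = stars;
        }
    }

    // Jaccard similarity: |intersection| / |union| of the two term sets.
    static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0;
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        Set<String> union = new HashSet<>(a);
        union.addAll(b);
        return (double) intersection.size() / union.size();
    }

    // Estimate a star value for an unstarred document as the
    // similarity-weighted average of the stars of known documents.
    static double estimateStars(Set<String> docTerms, List<StarredDoc> starred) {
        double weightedSum = 0.0;
        double totalSimilarity = 0.0;
        for (StarredDoc d : starred) {
            double sim = jaccard(docTerms, d.terms);
            weightedSum += sim * d.stars;
            totalSimilarity += sim;
        }
        // No overlap with any rated document: no evidence, so estimate 0.
        return totalSimilarity == 0.0 ? 0.0 : weightedSum / totalSimilarity;
    }

    public static void main(String[] args) {
        List<StarredDoc> starred = Arrays.asList(
            new StarredDoc(new HashSet<>(Arrays.asList("lucene", "index", "search")), 5),
            new StarredDoc(new HashSet<>(Arrays.asList("cooking", "recipe")), 0));
        Set<String> unstarred = new HashSet<>(Arrays.asList("lucene", "search", "scoring"));
        System.out.println(estimateStars(unstarred, starred));
    }
}
```

A trained model would generalize better than plain term overlap, but the
effect on an unstarred document is the same in spirit.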

On 1/8/08, Lukas Vlcek <lukas.vlcek@gmail.com> wrote:
>
> After checking the Lucene API of ParallelReader, it seems that the star
> score could be stored in a separate index that shares the same document
> identifiers. Such an index could be small (partitioned into many small
> indices?) so the updates can be fast. Is that what you meant, Andrzej? ;-)
>
> Anyway, I remember a different technique which I once mentioned on the
> Lucene mailing list, taking inspiration from the book Programming Collective
> Intelligence <http://www.oreilly.com/catalog/9780596529321/>. The idea is
> not to store the score (maybe I should call it user preference) in the index
> but in a neural net. One useful side effect is that this technique could score
> reasonably even documents without any stars (meaning documents "similar" to
> highly starred documents could score better even if they haven't been starred
> by any user yet).
>
> Regards,
> Lukas
>
> On 1/8/08, Andrzej Bialecki <ab@getopt.org> wrote:
> >
> > Lukas Vlcek wrote:
> > > So starring will be accommodated only during the indexing phase. Does it
> > > mean it will be a pretty static value, not a dynamically changing
> > > variable... correct?
> > > In other words, if I add my stars to some document, it won't affect the
> > > scoring immediately but only after an indexing cycle. Correct?
> >
> > (I'm not involved in Wikia development). There are some ways to go about
> > it even in the pure Lucene-land, so that the updates are fast without
> > reindexing the main content. Hint: ParallelReader.
> >
> >
> > --
> > Best regards,
> > Andrzej Bialecki     <><
> >   ___. ___ ___ ___ _ _   __________________________________
> > [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> > ___|||__||  \|  ||  |  Embedded Unix, System Integration
> > http://www.sigram.com  Contact: info at sigram dot com
> >
> >
> >
> >
>
>
> --
> http://blog.lukas-vlcek.com/
>
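
The idea discussed above, keeping star scores in a small separate store that
shares document identifiers with the main index, can be sketched in plain
Java. This is only a conceptual illustration under my own assumptions: it
does not use ParallelReader or any Lucene API, the two maps merely stand in
for the large main index and the small star index, and the log-based boost
formula is an arbitrary choice. All names are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: plain maps stand in for two indices that share
// the same document identifiers. No Lucene classes are used.
class StarBoostSketch {

    // Stand-in for the large main index: base relevance per document id.
    // Rebuilding this is assumed to be slow.
    private final Map<String, Double> baseScores = new HashMap<>();

    // Stand-in for the small side index: star count per document id.
    // Updating this is cheap and does not touch the main index.
    private final Map<String, Integer> stars = new HashMap<>();

    void indexDocument(String docId, double baseScore) {
        baseScores.put(docId, baseScore);
    }

    // A user stars a document; only the small side store changes.
    void addStar(String docId) {
        stars.merge(docId, 1, Integer::sum);
    }

    // Combine both stores at query time. The boost formula is an
    // assumption for illustration: base * (1 + ln(1 + stars)).
    double score(String docId) {
        double base = baseScores.getOrDefault(docId, 0.0);
        int starCount = stars.getOrDefault(docId, 0);
        return base * (1.0 + Math.log1p(starCount));
    }

    public static void main(String[] args) {
        StarBoostSketch index = new StarBoostSketch();
        index.indexDocument("doc-1", 2.0);
        System.out.println("before starring: " + index.score("doc-1"));
        index.addStar("doc-1");
        System.out.println("after starring:  " + index.score("doc-1"));
    }
}
```

The point of the split is that addStar takes effect on the very next query,
with no reindexing of the main content.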



-- 
http://blog.lukas-vlcek.com/
