lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael Stoppelman" <stop...@gmail.com>
Subject Re: Wikia search goes live today
Date Tue, 08 Jan 2008 20:12:10 GMT
I'm surprised they aren't keeping *any* logs or so they claim. Seems foolish
to me from a data-mining prospective.

"A Wikia employee told me today that people were already asking what the
most popular search terms were. He said there was no way of finding out as
no logs are kept." [1]
[1]
http://radar.oreilly.com/archives/2008/01/why_wikia_will_change_search.html

-M

On Jan 8, 2008 12:09 PM, Dennis Kubes <kubes@apache.org> wrote:

> Star ratings are being stored but not accounted for in the score as of
> yet.  The plan is to include them in future indexing scores. :)
>
> Dennis
>
> Mike Klaas wrote:
> > On 7-Jan-08, at 11:49 PM, Lukas Vlcek wrote:
> >
> >> This would be great!
> >>
> >> I am particularly interested how they are going about customized
> >> search (if
> >> they have a plan to do it). I mean if they can reorder raw search
> results
> >> based on some kind of collective knowledge (which is probably kept
> >> outside
> >> of Lucene index - at least that is what I can see from Nutch score
> >> explanations).
> >
> > I don't think that there is anything like that yet.  It looks to me like
> > a standard disjunction over title/content/host/url + a global document
> > boost based on pagerank-y link analysis (or simply # inlinks).  If they
> > are incorporating the "star" ratings yet, it is probably folded in to
> > the global doc boost.
> >
> > -Mike
> >
> >
> >> Regards,
> >> Lukas
> >>
> >> On Jan 7, 2008 11:14 PM, Otis Gospodnetic <otis_gospodnetic@yahoo.com>
> >> wrote:
> >>
> >>> See my comment (around #45-50) on Techcrunch about that from late last
> >>> night.  There is actually one Wikia guy helping Nutch - Dennis
> >>> Kubes.  He
> >>> must have been hitting reload on that TC post, because he IMed me
> >>> quickly
> >>> after I posted my comment and clarified that he is that Wikia
> >>> developer I
> >>> was referring to in my comment.... so I'm looking forward to more
> >>> contributions from Dennis and his coworkers! :)
> >>>
> >>> Otis
> >>> --
> >>> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
> >>>
> >>> ----- Original Message ----
> >>> From: Grant Ingersoll <gsingers@apache.org>
> >>> To: java-user@lucene.apache.org
> >>> Sent: Monday, January 7, 2008 11:21:33 AM
> >>> Subject: Re: Wikia search goes live today
> >>>
> >>> One other thing to note, you can definitely see Lucene in action (or
> >>> Nutch, that is) by clicking on the score returned for a given document
> >>>
> >>> (try searching for Lucene) and you see, in all it's glory, the Lucene
> >>> explain results...  It even displays the Nutch logo, which makes me
> >>> wonder if they are misusing an ASF trademark (but, IANAL, so I don't
> >>> know) since they don't state that Nutch is a trademark of the ASF.
> >>> But, that is a discussion for somewhere else...
> >>>
> >>>
> >>> On Jan 7, 2008, at 8:13 AM, Grant Ingersoll wrote:
> >>>
> >>>>
> >>>> On Jan 7, 2008, at 7:48 AM, Lukas Vlcek wrote:
> >>>>
> >>>>> Hi,
> >>>>>
> >>>>> I noticed that Wikia search goes live today (see
> >>>>> http://www.devxnews.com/article.php/3719906).
> >>>>> Does anybody know where I could find more technical information
> >>>>> about their
> >>>>> solution? Are they going to contribute their enhancements back to
> >>>>> Lucene/Nutch/Hadoop code? My understanding is that as long as they
> >>>>> claim
> >>>>> they want to build their solution on top of open source technology
> >>>>> they
> >>>>> should be contributing back.
> >>>>
> >>>> Not sure what they have done, but nothing in the Apache license
> >>>> requires contribution back, even if it would be appreciated.
> >>>>
> >>>> Cheers,
> >>>> Grant
> >>>>
> >>>> --------------------------
> >>>> Grant Ingersoll
> >>>> http://lucene.grantingersoll.com
> >>>> http://www.lucenebootcamp.com
> >>>>
> >>>> Lucene Helpful Hints:
> >>>> http://wiki.apache.org/lucene-java/BasicsOfPerformance
> >>>> http://wiki.apache.org/lucene-java/LuceneFAQ
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> ---------------------------------------------------------------------
> >>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >>>> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>>>
> >>>
> >>> --------------------------
> >>> Grant Ingersoll
> >>> http://lucene.grantingersoll.com
> >>> http://www.lucenebootcamp.com
> >>>
> >>> Lucene Helpful Hints:
> >>> http://wiki.apache.org/lucene-java/BasicsOfPerformance
> >>> http://wiki.apache.org/lucene-java/LuceneFAQ
> >>>
> >>>
> >>>
> >>>
> >>>
> >>> ---------------------------------------------------------------------
> >>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >>> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>>
> >>>
> >>>
> >>>
> >>>
> >>> ---------------------------------------------------------------------
> >>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >>> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>>
> >>>
> >>
> >>
> >> --
> >> http://blog.lukas-vlcek.com/
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message