lucene-java-user mailing list archives

From "DECAFFMEYER MATHIEU" <MATHIEU.DECAFFMA...@fortis.lu>
Subject RE: Low hits
Date Thu, 25 Jan 2007 14:49:49 GMT
Thank you for your reply,

There is not much help available in the Regain community.

But I can see that when I type, e.g.,
title:logistics
I get a score of about 0.70,
and headlines:logistics also gives 0.70.

But when I type just logistics I get 0.02.

I do not understand this, since I added this word as title and headlines, and I
need a higher score for the titles of the web pages for the people I
develop for.

I will try Luke, but for some reason I can't install it at my company.

Can someone give me some suggestions on what I should do?

Thank you.

__________________________________

   Mathieu Decaffmeyer
   Web Developer
   Fortis Banque Luxembourg
   IS Retail Banking - Web Content Management
   Mobile : 0032  479 / 69 . 42 . 96

    

-----Original Message-----
From: Chris Hostetter [mailto:hossman_lucene@fucit.org] 
Sent: Tuesday, January 23, 2007 8:44 PM
To: java-user@lucene.apache.org
Subject: RE: Low hits



: When I index the whole website, then when I type a title of a document I
: have like 60 to 70 % as score.
: When I index only one page, then when I type the title I have like 2% as
: score.

I don't know what Regain is ... but this sounds like some issue with
how it reports the scores Lucene generates.  Lucene scores are not
percentages, they are either a "raw score" which is an absolute
calculation based on the query boosts and constants in your Similarity
class -- raw scores can be any floating point number; or they are a
"Hits semi-normalized score" which is guaranteed to be between 0 and 1,
but the "best" matching document is only guaranteed to have a score no
greater than 1 -- it is not guaranteed to have a score that *is* 1.

Perhaps Regain is translating the Hits score as a percentage -- in which
case there is nothing wrong with the top matching document having a score
of "2%" ... it's still the top matching document.
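
To make the point above concrete, here is a minimal, self-contained sketch in plain Java (no Lucene dependency). It assumes the semi-normalization rule is "divide every score by the top raw score only when that top score is greater than 1, otherwise leave the raw scores alone", which is how the description above reads. The class name HitsScoreSketch, the method names, and the sample raw scores are all hypothetical, and the percentage formatting is only a guess at what a front end such as Regain might do.

```java
public class HitsScoreSketch {

    // Assumed rule: scale scores down only when the best raw score
    // exceeds 1; otherwise the raw scores pass through unchanged.
    static float[] normalize(float[] rawScores) {
        float max = 0.0f;
        for (float s : rawScores) {
            max = Math.max(max, s);
        }
        float norm = (max > 1.0f) ? 1.0f / max : 1.0f;
        float[] out = new float[rawScores.length];
        for (int i = 0; i < rawScores.length; i++) {
            out[i] = rawScores[i] * norm;
        }
        return out;
    }

    // Hypothetical display step: show a 0..1 score as a percentage.
    static String asPercent(float score) {
        return Math.round(score * 100) + "%";
    }

    public static void main(String[] args) {
        // Made-up raw scores: the top one is above 1, so all are scaled.
        float[] large = normalize(new float[] {4.0f, 2.0f, 1.0f});
        // Made-up raw score below 1: nothing is scaled up.
        float[] small = normalize(new float[] {0.02f});

        System.out.println("top hit, large raw scores: " + asPercent(large[0]));
        System.out.println("top hit, small raw score:  " + asPercent(small[0]));
    }
}
```

Under that assumption, a top hit whose raw score is already below 1 is never scaled up, so a low displayed percentage says nothing about whether the document is the best match.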


I would suggest you start by asking some questions of the Regain user
community or, if Regain is a commercial product, ask their customer support
people about the scores.


-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org






