lucene-solr-user mailing list archives

From "scott chu" <scott....@udngroup.com>
Subject Re: Doing Shingle but also keep special single word
Date Fri, 20 Aug 2010 14:46:07 GMT
Hi, Brendan,

    Thanks for the reply. The real problem is that I can't predict when a 
new, important special word that users are interested in will appear, because 
I am building a daily news article data set. Therefore, I don't know when or 
which single words should be included in that new field. I once thought about 
manually maintaining a special-word dictionary, but it would take too much 
effort, so I gave up on that idea.

However, your suggestion still sounds like a good trade-off to me. I'll 
consider it seriously.

Scott

----- Original Message ----- 
From: "Brendan Grainger" <brendan.grainger@gmail.com>
To: <solr-user@lucene.apache.org>
Sent: Friday, August 20, 2010 10:06 PM
Subject: Re: Doing Shingle but also keep special single word


Hi Scott,

Is there a reason why you wouldn't just index these special words into 
another field and then search over both fields? That would also have the 
nice property of being able to boost on the special word field if you 
wanted.

HTH
Brendan

On Aug 20, 2010, at 6:19 AM, scott chu (朱炎詹) wrote:

> I am building an index with the Shingle filter. We know its minimum is a 
> 2-gram, but I also want to keep some special single words, e.g. IBM, 
> Microsoft, etc. That is, I want a minimum of 2-grams but also want these 
> single words in my index. Is it possible?
>
> Scott
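
For readers following the thread, the two-field idea Brendan describes can be illustrated with a minimal sketch in plain Python. It is not Solr or Lucene code; it only mimics the behaviour under stated assumptions: the field names `text_shingles` and `special_words`, the `SPECIAL_WORDS` list, whitespace tokenization, and the boost value are all hypothetical and chosen for illustration. As Scott points out, the special-word list would still have to be maintained somehow.

```python
# Hypothetical whitelist of single words to keep; in practice this list
# would need to be maintained (the difficulty raised in the thread).
SPECIAL_WORDS = {"ibm", "microsoft"}


def shingles(tokens, size=2):
    """Return word n-grams (shingles) of exactly `size` tokens."""
    return [" ".join(tokens[i:i + size]) for i in range(len(tokens) - size + 1)]


def build_fields(text):
    """Split one document into two illustrative fields.

    text_shingles: 2-gram shingles only (no single words).
    special_words: single tokens that appear in SPECIAL_WORDS.
    """
    tokens = text.lower().split()  # assumption: simple whitespace tokenizer
    return {
        "text_shingles": shingles(tokens, 2),
        "special_words": [t for t in tokens if t in SPECIAL_WORDS],
    }


def score(doc, query, special_boost=2.0):
    """Toy scoring over both fields, boosting matches in special_words."""
    q = query.lower()
    total = 0.0
    if q in doc["text_shingles"]:
        total += 1.0
    if q in doc["special_words"]:
        total += special_boost
    return total


doc = build_fields("IBM and Microsoft announce partnership")
print(doc["text_shingles"])
print(doc["special_words"])
print(score(doc, "IBM"), score(doc, "and microsoft"), score(doc, "and"))
```

In this sketch the ordinary word "and" matches nothing on its own, while "IBM" is found through the separate field and can be weighted independently of the shingle matches.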


