lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mattmann, Chris A (388J)" <chris.a.mattm...@jpl.nasa.gov>
Subject Re: Which token filter can combine 2 terms into 1?
Date Thu, 27 Dec 2012 16:12:10 GMT
Hi Guys,

I also worked on a CombiningTokenFilter, see:

https://issues.apache.org/jira/browse/LUCENE-3413


Patch has been up and available for a while.

HTH!

Cheers,
Chris


On 12/27/12 12:26 AM, "Dmitry Kan" <solrexpert@gmail.com> wrote:

>Hi,
>
>Have a look onto TokenFilter. Extending it will give you access to a
>TokenStream.
>
>Regards,
>
>Dmitry Kan
>
>On Fri, Dec 21, 2012 at 9:05 AM, Xi Shen <davidshen84@gmail.com> wrote:
>
>> Hi,
>>
>> I am looking for a token filter that can combine 2 terms into 1? E.g.
>>
>> the input has been tokenized by white space:
>>
>> t1 t2 t2a t3
>>
>> I want a filter that output:
>>
>> t1 t2t2a t3
>>
>> I know it is a very special case, and I am thinking about develop a
>>filter
>> of my own. But I cannot figure out which API I should use to look for
>>terms
>> in a Token Stream.
>>
>>
>> --
>> Regards´╝î
>> David Shen
>>
>> http://about.me/davidshen
>> https://twitter.com/#!/davidshen84
>>


Mime
View raw message