lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Michael McCandless <luc...@mikemccandless.com>
Subject Re: DocumentsWriter.checkMaxTermLength issues
Date Mon, 31 Dec 2007 23:40:53 GMT
Grant Ingersoll wrote:

>
> On Dec 31, 2007, at 12:54 PM, Michael McCandless wrote:
>
>> I actually think indexing should try to be as robust as possible.   
>> You
>> could test like crazy and never hit a massive term, go into  
>> production
>> (say, ship your app to lots of your customer's computers) only to
>> suddenly see this exception.  In general it could be a long time  
>> before
>> you "accidentally" our users see this.
>>
>> So I'm thinking we should have the default behavior, in IndexWriter,
>> be to skip immense terms?
>>
>> Then people can use TokenFilter to change this behavior if they want.
>>
> +1.  We could log it, right?

Yes, to IndexWriter's infoStream, if it's set.  I'll do that...

Mike

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Mime
View raw message