lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Simon Willnauer (JIRA)" <>
Subject [jira] Commented: (LUCENE-2102) LowerCaseFilter for Turkish language
Date Tue, 01 Dec 2009 21:31:20 GMT


Simon Willnauer commented on LUCENE-2102:

bq. Would there be opposition to making contrib/snowball depend upon contrib/analyzers so
the SnowballAnalyzer can use this filter instead of lowercase filter for the Turkish case?
(based upon Version, of course)?

i think we can arrange something like that. Since we factored out Smart-cn the jar has reasonable
size so this won't be an issue. maybe we should think about moving snowball into analyzers/snowball
- just an idea.
Anyway, this is somewhat unrelated to this particular patch but still considerable.

> LowerCaseFilter for Turkish language
> ------------------------------------
>                 Key: LUCENE-2102
>                 URL:
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>    Affects Versions: 3.0
>            Reporter: Ahmet Arslan
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1
>         Attachments: LUCENE-2102.patch
> java.lang.Character.toLowerCase() converts 'I' to 'i' however in Turkish alphabet lowercase
of 'I' is not 'i'. It is LATIN SMALL LETTER DOTLESS I.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message