lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "YOO JEONGIN (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-8772) [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
Date Fri, 19 Apr 2019 02:34:00 GMT
YOO JEONGIN created LUCENE-8772:
-----------------------------------

             Summary: [nori]  A word that is registered in advance, but the words are not
separated and recognized as 'UNKNOWN'
                 Key: LUCENE-8772
                 URL: https://issues.apache.org/jira/browse/LUCENE-8772
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 8.0, 7.7.1, 7.7, 7.6, 7.5
            Reporter: YOO JEONGIN
         Attachments: image-2019-04-19-11-32-56-310.png

In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is analyzed even
if there is a word already registered in the middle.
So here is the question.
Does nori analyze only on the left side and do not analyze from the right side?
Could this be solved?

 

ex)

input => 갊수학

Condition

dictionary registered : 수학
dictionary Unregistered : 갊

result => 갊수학

!image-2019-04-19-11-32-56-310.png!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message