lucene-dev mailing list archives

From "Mayya Sharipova (JIRA)" <j...@apache.org>
Subject [jira] [Created] (LUCENE-8100) Error on reindex using WordNet synonyms file
Date Fri, 15 Dec 2017 15:25:03 GMT
Mayya Sharipova created LUCENE-8100:
---------------------------------------

             Summary: Error on reindex using WordNet synonyms file
                 Key: LUCENE-8100
                 URL: https://issues.apache.org/jira/browse/LUCENE-8100
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 7.0.1
            Reporter: Mayya Sharipova
            Priority: Minor


Originally reported in the ES issues: https://github.com/elastic/elasticsearch/issues/27798#issuecomment-351838983

but it looks like the issue was introduced in Lucene 7.0.x.

Copying the user's issue here:

------------------------------------------------------

I'm encountering the following error on indexing when trying to use the wn_s.pl synonyms file
(which I've moved to /usr/local/etc/elasticsearch):


{code:javascript}
{
	"error": {
		"root_cause": [{
			"type": "illegal_argument_exception",
			"reason": "failed to build synonyms"
		}],
		"type": "illegal_argument_exception",
		"reason": "failed to build synonyms",
		"caused_by": {
			"type": "parse_exception",
			"reason": "Invalid synonym rule at line 2",
			"caused_by": {
				"type": "illegal_argument_exception",
				"reason": "term: physical entity analyzed to a token with posinc != 1"
			}
		}
	}
}
{code}
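
The innermost `caused_by` entry carries the underlying message. As a minimal sketch (the helper name `innermost_reason` is hypothetical, and the payload is a trimmed copy of the response above), the chain can be unwrapped like this:

```python
import json

# Trimmed copy of the error response shown above (root_cause omitted).
response = json.loads("""
{
  "error": {
    "type": "illegal_argument_exception",
    "reason": "failed to build synonyms",
    "caused_by": {
      "type": "parse_exception",
      "reason": "Invalid synonym rule at line 2",
      "caused_by": {
        "type": "illegal_argument_exception",
        "reason": "term: physical entity analyzed to a token with posinc != 1"
      }
    }
  }
}
""")


def innermost_reason(error):
    """Follow nested "caused_by" objects and return the deepest "reason"."""
    while "caused_by" in error:
        error = error["caused_by"]
    return error["reason"]


print(innermost_reason(response["error"]))
```

Reading the deepest reason first makes it easier to see that the failure concerns how the term `physical entity` was analyzed, rather than the outer "failed to build synonyms" wrapper.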

Here's the line it's objecting to:

s(100001930,1,'physical entity',n,1,0).

I'm using the WordNet Prolog synonyms file from http://wordnetcode.princeton.edu/3.0/WNprolog-3.0.tar.gz2
------------------------------------------------------
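
For reference, each `s(...)` clause in `wn_s.pl` lists a synset ID, word number, quoted word, synset type, sense number, and tag count. The following is a rough sketch of pulling those fields apart, to show that the rejected term is the two-word phrase `physical entity`. The regular expression and the `parse_s_clause` helper are illustrative assumptions, not the logic used by Lucene's WordnetSynonymParser.

```python
import re
from typing import NamedTuple


class SClause(NamedTuple):
    synset_id: int
    w_num: int
    word: str
    ss_type: str
    sense_number: int
    tag_count: int


# Assumed shape: s(synset_id,w_num,'word',ss_type,sense_number,tag_count).
# A single quote inside the word is written as two single quotes.
S_CLAUSE = re.compile(
    r"^s\((\d+),(\d+),'((?:[^']|'')*)',([a-z]),(\d+),(\d+)\)\.$"
)


def parse_s_clause(line):
    """Parse one s/6 clause; raise ValueError if the line does not match."""
    match = S_CLAUSE.match(line.strip())
    if match is None:
        raise ValueError(f"not an s/6 clause: {line!r}")
    synset_id, w_num, word, ss_type, sense_number, tag_count = match.groups()
    return SClause(
        int(synset_id),
        int(w_num),
        word.replace("''", "'"),  # undo the doubled-quote escape
        ss_type,
        int(sense_number),
        int(tag_count),
    )


# The line reported in the issue (including its trailing space).
clause = parse_s_clause("s(100001930,1,'physical entity',n,1,0). ")
print(clause.word.split())
```

The word field contains a space, so any analyzer applied to it has to handle more than one token for this entry.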

It looks like the error comes from Lucene's *WordnetSynonymParser* and *SynonymMap* classes,
and from changes introduced in Lucene 7.0.




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
