mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chris Harrington <ch...@heystaks.com>
Subject Russian Analyzer error
Date Wed, 13 Feb 2013 14:49:28 GMT
Hi all,

I'm having some trouble using a RussianAnalyzer in Mahout, I thought this would suffice

RussianAnalyzer analyzer = new RussianAnalyzer(Version.LUCENE_31); 
DocumentProcessor.tokenizeDocuments(new Path(inputDir), analyzer.getClass().asSubclass(Analyzer.class),
tokenizedPath, conf);

but it throws this error.

java.lang.IllegalStateException: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:68)
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:28)
	at org.apache.mahout.vectorizer.document.SequenceFileTokenizerMapper.setup(SequenceFileTokenizerMapper.java:58)
	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
Caused by: java.lang.NoSuchMethodException: org.apache.lucene.analysis.ru.RussianAnalyzer.<init>()
	at java.lang.Class.getConstructor0(Class.java:2706)
	at java.lang.Class.getConstructor(Class.java:1657)
	at org.apache.mahout.common.ClassUtils.instantiateAs(ClassUtils.java:62)
	... 6 more


Since this happens with various other analyzers (FrenchAnalyzer , DutchAnalyzer, etc) I figure
I must be going about this in the wrong way. 

Any advice?
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message