mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Camilo Lopez <cam...@camilolopez.com>
Subject Re: Mahout 0.5 seq2sparse gives Error: LUCENE_31
Date Tue, 21 Jun 2011 14:57:09 GMT
Sean, 

Thanks for the reply, it turned out I had 0.4 and 0.5 installed at the same time and a broken
symlink ... so yeah mixed versions problem.


On 2011-06-21, at 1:44 AM, Sean Owen wrote:

> It sounds like it doesn't have the right version of Lucene available for
> some reason, but you'd have to post more of your error. That would be
> helpful.
> 
> On Mon, Jun 20, 2011 at 6:32 PM, Camilo Lopez <camilo@camilolopez.com>wrote:
> 
>> Has anyone seen this error as well, while trying to create sparse vectors
>> from a sequenced directory.
>> 
>> 
>> First sequencing:
>> 
>> bin/mahout seqdirectory -i /user/hadoop/htmlless_articles  -o
>> /user/hadoop/htmless_articles_seq -ow
>> Running on hadoop, using HADOOP_HOME=/usr/local/hadoop/
>> No HADOOP_CONF_DIR set, using /usr/local/hadoop//src/conf
>> 11/06/20 13:31:35 WARN driver.MahoutDriver: No seqdirectory.props found on
>> classpath, will use command-line arguments only
>> 11/06/20 13:31:35 INFO common.AbstractJob: Command line arguments:
>> {--charset=UTF-8, --chunkSize=64, --endPhase=2147483647,
>> --fileFilterClass=org.apache.mahout.text.PrefixAdditionFilter,
>> --input=/user/hadoop/htmlless_articles, --keyPrefix=,
>> --output=/user/hadoop/htmless_articles_seq, --overwrite=null,
>> --startPhase=0, --tempDir=temp}
>> 11/06/20 13:31:35 INFO common.HadoopUtil: Deleting
>> /user/hadoop/htmless_articles_seq
>> 11/06/20 13:31:43 INFO driver.MahoutDriver: Program took 8298 ms
>> 
>> 
>> Then when trying to create sparse vectors:
>> 
>> 
>> bin/mahout seq2sparse -i /htmless_articles_seq -o
>> /htmless_articles_vectors_1 -wt tfidf
>> 
>> 11/06/20 13:13:20 INFO mapred.JobClient: Task Id :
>> attempt_201104261414_0625_m_000000_0, Status : FAILED
>> Error: LUCENE_31


Mime
View raw message