mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Pavan K Narayanan <pavan.naraya...@gmail.com>
Subject mahout classification error
Date Wed, 23 Oct 2013 10:04:48 GMT
I am trying to use the "mahout trainlogistic" function to train a email
spam identifier function where my input, in csv format looks as follows:

"word_freq_make:","word_freq_address:","word_freq_all:",...,"spamornot"
...
...
...

All the data here are in numeric format -- very similar to donut.csv. I
used the following command to train:

mahout trainlogistic --input
/home/hduser/mahout-distribution-0.7/bin/spambase.csv --output modelspam
--target spamornot --categories 2 --predictors word_freq_make:
word_freq_address: word_freq_all: word_freq_3d: --types numeric --features
20 --passes 10 --rate 50

I got the following error:
Exception in thread "main" java.lang.NullPointerException
at
org.apache.mahout.classifier.sgd.CsvRecordFactory.firstLine(CsvRecordFactory.java:176)
 at
org.apache.mahout.classifier.sgd.TrainLogistic.mainToOutput(TrainLogistic.java:78)
at
org.apache.mahout.classifier.sgd.TrainLogistic.main(TrainLogistic.java:64)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
 at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
 at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.main(RunJar.java:160)

grateful for any help/comments/suggestions

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message