mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jakub Stransky <>
Subject 20 news groups example
Date Mon, 01 Dec 2014 13:09:25 GMT
Hello experienced mahout users,

I am new to mahout and I am trying to run naive bayes classification
example with 20news groups categories. I do not userstand one thing which I
am unable to spot. To train categorization I need a labeled data. I don't
see the way how the label of a particular document is passed to training
the model.
I think that I understand TF and IDF etc. but simply dont see how label is

Could someone provide some insight into this?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message