mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chandra Mohan, Ananda Vel Murugan" <>
Subject significance of FEATURES in SGD
Date Wed, 03 Jul 2013 12:58:19 GMT

I am experimenting Mahout for text classification. I have 2 million training data i.e text
of approximately 20 words. They fall into 121 categories. I tried AdaptiveLogisticRegression.
When I create sparse vector of cardinality 10000, it takes hours to converge, but when I tried
with 100 it converges fast. Is this measure very significant in determining the accuracy of
the model? Please advise.


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message