mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Frank Wang <wangfan...@gmail.com>
Subject Naive Bayes vs. SGD
Date Mon, 06 Dec 2010 13:32:29 GMT
Hi,

I'm working on a text classification problem. Given a piece of content, it
will be classified into 1 or more categories.
>From my understanding, Naive Bayes model is non-parametric, so every
training requires all the cumulated sample data. However, if I were to use
SGD model with n binary logistic regression, I wouldn't need to keep the
historical sample data. Which seems will lead to faster training in the long
run.

Is this a fair logic?

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message