mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <gsing...@apache.org>
Subject Re: Question on Bayes Classifier
Date Thu, 29 Apr 2010 10:25:02 GMT
I believe, and I haven't checked my notes from way back when, that it doesn't need to be calculated
b/c P(C) is the same across all the comparisons, so P(D|C) is the only factor that matters
b/c you only need relative scores for ordering purposes. 

On Apr 29, 2010, at 2:25 AM, Gurudev Devanla wrote:

> Hello All,
> 
> This is my first post ever on any open source mailing list. So, please
> excuse me if I am not following certain standards.
> 
> I was walking through the code for Naive Bayes classifier and I notice that
> in TestClassifier.java, at the point where the document wieghts are
> calculated the probability of the class(label)  is not taken into
> consideration. My knowledge of document wt in Naive Bayes is :
> 
> Pr(C|D )  =  Pr(D|C) * P(C) , but in the implementation I have downloaded, I
> don't see Pr(C) being used in the calculation.
> 
> Any pointers would be great. I am probably overlooking this be considered
> elsewhere in the program.
> 
> Thanks
> betacoder

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search


Mime
View raw message