spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sean Owen <so...@cloudera.com>
Subject Re: how idf is calculated
Date Fri, 31 Oct 2014 10:38:38 GMT
Yes, here the base doesn't matter as it just multiplies all results by
a constant factor. Math libraries tend to have ln, not log10 or log2.
ln is often the more, er, natural base for several computations. So I
would assume that log = ln in the context of ML.

On Fri, Oct 31, 2014 at 11:31 AM, Andrejs Abele <andrejs@sindicetech.com> wrote:
> I found my problem. I assumed based on TF-IDF in  Wikipedia , that log base
> 10 is used, but as I found in this discussion, in scala it is actually ln
> (natural logarithm).

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
For additional commands, e-mail: user-help@spark.apache.org


Mime
View raw message