mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dean Jones <dean.m.jo...@gmail.com>
Subject Re: Naive bayes and character n-grams
Date Thu, 10 Oct 2013 07:14:54 GMT
Hi Suneel,

On 9 October 2013 14:27, Suneel Marthi <suneel_marthi@yahoo.com> wrote:
> an example of a Naive-Bayes classifier trained on character n-grams is the LangDetect
library.
> (see http://code.google.com/p/language-detection/)
>
> Agree with Ted that it should be relatively easy to build one.
>

Thanks. Yes, I need to (re-)train a language detector. We have an
existing system based on an earlier version of Mahout which I'm
looking to switch to using character n-grams instead of word tokens.

Dean.

Mime
View raw message