mahout-user mailing list archives

From Dean Jones
Subject Re: Naive bayes and character n-grams
Date Thu, 10 Oct 2013 07:14:54 GMT
Hi Suneel,

On 9 October 2013 14:27, Suneel Marthi wrote:
> an example of a Naive-Bayes classifier trained on character n-grams is the LangDetect
> (see
> Agree with Ted that it should be relatively easy to build one.

Thanks. Yes, I need to (re-)train a language detector. We have an
existing system, built on an earlier version of Mahout, that I'm
looking to switch from word tokens to character n-grams.
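
To make the word-token versus character n-gram distinction concrete, here is a minimal sketch of character n-gram feature extraction in plain Python. It is not Mahout's API or the LangDetect implementation; the function name char_ngrams, the lowercasing, and the single-space padding at word boundaries are all assumptions chosen for illustration.

```python
from collections import Counter


def char_ngrams(text, n_min=1, n_max=3):
    """Count character n-grams of length n_min..n_max in text.

    Hypothetical helper for illustration only. Whitespace runs are
    collapsed and the text is padded with one space on each side so
    that word-initial and word-final n-grams are captured.
    """
    padded = " " + " ".join(text.lower().split()) + " "
    counts = Counter()
    for n in range(n_min, n_max + 1):
        # Slide a window of width n across the padded string.
        for i in range(len(padded) - n + 1):
            counts[padded[i:i + n]] += 1
    return counts


# Bigram and trigram counts for a short string; these counts would
# take the place of word-token counts as classifier features.
features = char_ngrams("Hello world", n_min=2, n_max=3)
```

The resulting counts could then be fed to a Naive Bayes trainer in the same way word-token counts are; how that maps onto a particular Mahout version's vectorization pipeline is not covered by this sketch.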

