mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Dean Jones <>
Subject Naive bayes and character n-grams
Date Wed, 09 Oct 2013 10:18:54 GMT
Hello folks,

I see that it's possible to use mahout to train a naive bayes
classifier using n-grams as features (or I guess, strictly speaking,
mahout can be used to generate sequence files containing n-grams; I
suspect the naive bayes trainer is indifferent to the form of features
it trains on). Is there any facility to generate character n-grams
instead of word n-grams?



View raw message