mahout-user mailing list archives

From "Ankur Desai (ankurdes)" <>
Subject Mahout Collocation parameter
Date Wed, 02 Dec 2015 20:38:36 GMT

I am running collocation on Mahout and am having trouble understanding what the minsupport
parameter does.

I want to get the bigrams/trigrams that occur at least 5 times in the corpus.  I set the minsupport
value to 5, but I am still getting results that occur only once in the entire corpus.

Can someone please help me understand what this parameter is for, or how I can get bigrams/trigrams
that occur at least X times?
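
To make the goal concrete, here is a minimal sketch of the filtering I am after, written in
plain Python rather than Mahout. The function name ngrams_with_min_count and the sample tokens
are hypothetical and only for illustration; this is not a description of how Mahout applies
minsupport internally.

```python
from collections import Counter


def ngrams_with_min_count(tokens, n, min_count):
    """Return {ngram: count} for n-grams that occur at least min_count times.

    Hypothetical helper for illustration only; not part of Mahout.
    """
    # Slide a window of size n over the token list and count each n-gram.
    counts = Counter(
        tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
    )
    # Keep only the n-grams whose corpus-wide count meets the threshold.
    return {gram: c for gram, c in counts.items() if c >= min_count}


# Made-up sample corpus: "new york" repeated five times, then "old town".
tokens = ("new york " * 5 + "old town").split()
frequent_bigrams = ngrams_with_min_count(tokens, 2, 5)
```

With a threshold of 5, I would expect only n-grams counted five or more times across the
whole corpus to remain, and anything seen once to be dropped.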

