mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Prabhakar Srinivasan <>
Subject Outlier detection/Pruning
Date Tue, 03 Dec 2013 17:34:15 GMT
Can someone point me to some explanatory documentation for Outlier
Detection & Removal in Clustering in Mahout. I am unable to understand the
internal mechanism of outlier detection just by reading the Javadoc:
clusterClassificationThreshold Is a clustering strictness / outlier removal
parameter. Its value should be between 0 and 1. Vectors having pdf below
this value will not be clustered.

What does the pdf represent?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message