mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Donni Khan <prince.don...@googlemail.com>
Subject Re: Process UnStructured Data in Mahout for Clustering
Date Thu, 04 Dec 2014 14:30:44 GMT
Hi
it depends on the nature of data you are clustering. If you have knowledge
about your data, you can figure out the results and you can also set the
correct parameters to the clustering algorithm like number of topics or
number of clusters.

Cheers,
Donni

On Thu, Dec 4, 2014 at 2:38 PM, Shahid Shaikh <shaikhshahidg@gmail.com>
wrote:

> Hi All,
>    I have been trying mahout clustering  on unstructured data i.e human
> written data . I have tried mahout clustering algorithms like
> Kmeans,Canopy+Kmeans and LDA but the results produced are not help full .
>
> i see the problem is with the way data is written , Can some one please
> provide me some pointers on how to proceed with unstructured data  for
> clustering.
>
>
> i have written and analyzer that uses lower-Case and stop-words filter also
> .
>
> thanks :)
>
>
> Regards,
> Shaikh Shahid G .
> +91 9503954781
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message