Dear All,

I was wondering why there is both training data and test data in k-means. Shouldn't it be unsupervised learning with access to just the stream data?

I found a similar question but couldn't understand the answer:
http://stackoverflow.com/questions/30972057/is-the-streaming-k-means-clustering-predefined-in-mllib-library-of-spark-supervi

Thanks!
Ahmed