Do you need cosine distance and correlation between vectors or between variables (elements of vector)? It would be helpful if you could tell us details of your task.


On Thu, May 22, 2014 at 5:49 PM, jamal sasha <jamalshasha@gmail.com> wrote:
Hi,
  I have bunch of vectors like
[0.1234,-0.231,0.23131]
.... and so on.

and  I want to compute cosine similarity and pearson correlation using pyspark.. 
How do I do this?
Any ideas?
Thanks