spark-user mailing list archives

From Arunkumar Pillai <arunkumar1...@gmail.com>
Subject Need to use univariate summary stats
Date Thu, 04 Feb 2016 09:22:33 GMT
Hi,

I'm currently using a query like

sqlContext.sql("SELECT MAX(variablesArray) FROM " + tableName)

to extract the mean, max, and min.
Is there a better, more optimized way?

In the example I saw:

df.groupBy("key").agg(skewness("a"), kurtosis("a"))

But I don't have a key column anywhere in my data.

How can I extract the univariate summary stats from df? Please help.

-- 
Thanks and Regards
        Arun
