spark-user mailing list archives

From Arunkumar Pillai <>
Subject [Spark 1.6] Univariate Stats using apache spark
Date Thu, 04 Feb 2016 09:34:50 GMT

Currently, after creating a DataFrame, I query it once per statistic (max,
min, mean) to get each result, for example:
sqlContext.sql("SELECT MAX(variablesArray) FROM " + tableName)

Is this an efficient way to do it?
I can't find a way to get all the statistics (min, max, mean, variance,
skewness, kurtosis) directly from a DataFrame.
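
To make the question concrete, here is a rough sketch of what I have in mind:
a single query that asks for every statistic at once, so the table is scanned
one time instead of once per statistic. It is written in Python and only
builds the query string. The helper name univariate_stats_query and the table
name myTable are made up for illustration, and I am assuming that VARIANCE,
SKEWNESS and KURTOSIS are registered as SQL functions in Spark 1.6, which I
have not confirmed.

```python
# Hypothetical helper: builds one SELECT covering all the statistics.
# Assumption: the SQL context recognises VARIANCE, SKEWNESS and KURTOSIS.
AGGREGATES = ["MIN", "MAX", "AVG", "VARIANCE", "SKEWNESS", "KURTOSIS"]


def univariate_stats_query(column, table):
    """Return a single query that computes every aggregate for one column."""
    select_list = ", ".join(
        "{func}({col}) AS {alias}_{col}".format(
            func=func, col=column, alias=func.lower()
        )
        for func in AGGREGATES
    )
    return "SELECT {select_list} FROM {table}".format(
        select_list=select_list, table=table
    )


# "myTable" is a placeholder table name.
query = univariate_stats_query("variablesArray", "myTable")
print(query)

# With a live context this would be run as:
#   stats = sqlContext.sql(query).first()
```

If those functions are available, the one returned row would hold all six
values for the column.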

Please help

Thanks and Regards
