spark-dev mailing list archives

From bradc <>
Subject Re: Design document - MLlib's statistical package for DataFrames
Date Thu, 16 Feb 2017 22:21:31 GMT

While it is also missing in spark.mllib, I'd suggest adding cardinality as
part of the Simple descriptive statistics for both spark.ml and spark.mllib.
This is useful even for double-precision floating-point data, to understand
the "uniqueness" of the feature data.
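
To illustrate what I mean by cardinality, here is a minimal sketch in plain
Python. It is not Spark's API: the helper name `column_cardinality` and the
sample data are made up for illustration, and it simply counts the exact
number of distinct values in each feature column.

```python
from typing import List, Sequence


def column_cardinality(rows: Sequence[Sequence[float]]) -> List[int]:
    """Return the number of distinct values in each column of `rows`.

    Hypothetical helper for illustration only; assumes every row has the
    same number of columns.
    """
    if not rows:
        return []
    # One set of observed values per column.
    seen = [set() for _ in rows[0]]
    for row in rows:
        for values, x in zip(seen, row):
            values.add(x)
    return [len(values) for values in seen]


# Made-up feature matrix: 4 rows, 3 double-valued columns.
features = [
    [0.0, 1.5, 3.25],
    [1.0, 1.5, 7.5],
    [0.0, 1.5, 9.0],
    [1.0, 2.5, 3.25],
]

cardinality = column_cardinality(features)
print(cardinality)  # [2, 2, 3]
```

A column whose cardinality is very small relative to the row count (the
first column above has only two distinct values) is probably an indicator or
categorical feature stored as a double, which is the kind of thing this
statistic would surface. On large data an approximate distinct count would
likely be the more practical choice.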

