Sebastián Ramírez created SPARK-5026:
----------------------------------------
Summary: PySpark rdd.randomSplit() is not documented
Key: SPARK-5026
URL: https://issues.apache.org/jira/browse/SPARK-5026
Project: Spark
Issue Type: Documentation
Components: Documentation, PySpark
Affects Versions: 1.2.0
Reporter: Sebastián Ramírez
Priority: Minor
In the latest version of Spark (1.2.0), the RDD section of the Python API documentation
has no entry for rdd.randomSplit(): http://spark.apache.org/docs/latest/api/python/pyspark.html#pyspark.RDD
Nevertheless, it is used in an example in the 1.2.0 MLlib documentation: http://spark.apache.org/docs/latest/mllib-ensembles.html#regression
(It's in the Python code tab; you can *Ctrl+F* and search for "randomSplit".)
Looking at the code, the method does appear to be implemented: https://github.com/apache/spark/blob/branch-1.2/python/pyspark/rdd.py#L322
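
For readers who land here looking for the missing description: in PySpark the call takes a list of weights and an optional seed, as in rdd.randomSplit([0.7, 0.3], seed), and returns one RDD per weight. The plain-Python sketch below only illustrates that behaviour as I understand it (weights normalized to sum to 1, each element assigned to exactly one split). The helper name random_split is hypothetical, and this is not the code in rdd.py.

```python
import random
from itertools import accumulate


def random_split(items, weights, seed=None):
    """Hypothetical helper: randomly split `items` into len(weights) lists.

    Illustrates the idea only; it does not use Spark.
    """
    total = float(sum(weights))
    # Cumulative upper bound of each split's share, e.g. [0.7, 1.0].
    bounds = list(accumulate(w / total for w in weights))
    rng = random.Random(seed)
    splits = [[] for _ in weights]
    for item in items:
        u = rng.random()  # uniform draw in [0, 1)
        for i, upper in enumerate(bounds):
            if u < upper:
                splits[i].append(item)
                break
        else:
            # Guard against floating-point rounding in the last bound.
            splits[-1].append(item)
    return splits


training, test = random_split(range(1000), [0.7, 0.3], seed=42)
```

The split sizes are only approximately proportional to the weights, since each element is assigned independently; passing the same seed reproduces the same split.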