spark-user mailing list archives

From pseudo oduesp <pseudo20...@gmail.com>
Subject Labeledpoint
Date Tue, 21 Jun 2016 16:12:31 GMT
Hi,
I am a PySpark user and I want to test RandomForest.

I have a DataFrame with 100 columns.
Since I have to give an RDD or a DataFrame to the algorithm, I transformed my
DataFrame into only two columns:
label and features.

df.label   df.features
0          (517,(0,1,2,333,56 ...
1          (517,(0,11,0,33,6 ...
0          (517,(0,1,0,33,8 ...

But I have no idea how to transform this DataFrame into the input the
algorithm expects. I tested the example on the official web page without success.

Please give me an example of how to do this, especially with a test set.

Thanks
