spark-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From jamborta <>
Subject spark sql results maintain order (in python)
Date Thu, 04 Sep 2014 10:42:25 GMT

I ran into a problem with spark sql, when run a query like this "select
count(*), city, industry from table group by hour" and I would like to take
the results from the shemaRDD

1, I have to parse each line to get the values out of the dic (eg in order
to convert it to a csv)
2, The order is not kept in a python dict - I couldn't find a way to
maintain the original order (especially a problem in this case, when the
column names are derived).


View this message in context:
Sent from the Apache Spark User List mailing list archive at

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message