spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Reynold Xin <>
Subject Re: Interested in contributing to GraphX in Python
Date Mon, 04 Aug 2014 22:03:50 GMT
Thanks for your interest.

I think the main challenge is if we have to call Python functions per
record, it can be pretty expensive to serialize/deserialize across
boundaries of the Python process and JVM process.  I don't know if there is
a good way to solve this problem yet.

On Fri, Aug 1, 2014 at 11:06 AM, Rajiv Abraham <>

> Hi,
> I just saw Ankur's GraphX presentation and it looks very exciting! I would
> like to contribute to a Python version of GraphX. I checked out JIRA and
> Github but I did not find much info.
> - Are there limitations currently to port GraphX in Python? (e.g. Maybe the
> Python Spark RDD API is incomplete or not refactored for GraphX as compared
> to the Scala version)
> - If I had to start, could  I take inspiration from the Scala version and
> try to emulate it in Python?
> - Otherwise any suggestions of  starter tasks regarding GraphX in Python
> would be appreciated
> --
> Take care,
> Rajiv

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message