spark-issues mailing list archives

From "Sean Owen (JIRA)" <>
Subject [jira] [Updated] (SPARK-5654) Integrate SparkR into Apache Spark
Date Sun, 08 Feb 2015 13:23:35 GMT


Sean Owen updated SPARK-5654:
    Component/s: Project Infra

> Integrate SparkR into Apache Spark
> ----------------------------------
>                 Key: SPARK-5654
>                 URL:
>             Project: Spark
>          Issue Type: New Feature
>          Components: Project Infra
>            Reporter: Shivaram Venkataraman
> The SparkR project [1] provides a lightweight frontend for launching Spark jobs from R.
The project was started at the AMPLab around a year ago and has been incubated as its own
project to make sure it can be easily merged into upstream Spark, i.e., without introducing any
external dependencies. SparkR’s goals are similar to PySpark’s, and it shares a similar design pattern,
as described in our meetup talk [2] and Spark Summit presentation [3].
> Integrating SparkR into the Apache project will enable R users to use Spark out of the
box, and given R’s large user base, it will help the Spark project reach more users. Additionally,
work-in-progress features such as R integration with ML Pipelines and DataFrames can
be better achieved by development in a unified code base.
> SparkR is available under the Apache 2.0 License and has no external dependencies
other than requiring users to have R and Java installed on their machines. SparkR’s developers
come from many organizations, including UC Berkeley, Alteryx, and Intel, and we will support future
development and maintenance after the integration.
> [1]
> [2]
> [3]
