spark-user mailing list archives

From Sean Owen <so...@cloudera.com>
Subject Re: Spark serialization issues with third-party libraries
Date Fri, 21 Nov 2014 09:06:04 GMT
You are probably capturing UIMA objects in a closure without meaning to.
Spark then has to serialize them to send them from the driver to the
executors. You'll have to design your program so that you do not need to
ship these objects to or from the remote task workers.
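
To illustrate the point, here is a minimal sketch in plain Java, with no Spark dependency. It assumes the closure is serialized with standard Java serialization, and it uses a hypothetical `Analyzer` class as a stand-in for a non-Serializable third-party object such as a UIMA engine. The names `ClosureSketch`, `SerFn`, and `canSerialize` are made up for this example and are not Spark or UIMA APIs.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.NotSerializableException;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.function.Function;

class ClosureSketch {
    // Hypothetical stand-in for a third-party class that is NOT Serializable.
    static class Analyzer {
        String process(String text) {
            return text.toUpperCase();
        }
    }

    // A function type that is also Serializable, so lambdas of this type
    // can be written with Java serialization (assumption: this mimics how
    // a task closure gets shipped to a remote worker).
    interface SerFn extends Function<String, String>, Serializable {}

    // Returns true if Java serialization can write the object,
    // false if it hits something that is not Serializable.
    static boolean canSerialize(Object o) {
        try (ObjectOutputStream out =
                 new ObjectOutputStream(new ByteArrayOutputStream())) {
            out.writeObject(o);
            return true;
        } catch (NotSerializableException e) {
            return false;
        } catch (IOException e) {
            throw new RuntimeException(e);
        }
    }

    // Problem case: the Analyzer is built on the "driver" side and captured
    // by the closure, so it must be serialized along with the closure.
    static SerFn capturing() {
        Analyzer analyzer = new Analyzer();
        return text -> analyzer.process(text);
    }

    // Fix: nothing is captured; the Analyzer is built where the function runs.
    static SerFn creatingInside() {
        return text -> new Analyzer().process(text);
    }

    public static void main(String[] args) {
        System.out.println("capturing closure serializable: "
            + canSerialize(capturing()));
        System.out.println("self-contained closure serializable: "
            + canSerialize(creatingInside()));
        System.out.println(creatingInside().apply("hello"));
    }
}
```

Only what the closure captures has to be Serializable; objects created inside the function body never leave the worker. In a real Spark job you would probably build the expensive object once per partition (for example inside `mapPartitions`) rather than once per record as this sketch does.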

On Fri, Nov 21, 2014 at 8:39 AM, jatinpreet <jatinpreet@gmail.com> wrote:
> Hi,
>
> I am planning to use the UIMA library to process data in my RDDs. I have had
> bad experiences using third-party libraries inside worker tasks: the
> system gets plagued with serialization issues. As UIMA classes are not
> necessarily Serializable, I am not sure whether it will work.
>
> Could you explain which classes need to be Serializable and which can
> be left as they are? A clear understanding will help me a lot.
>
> Thanks,
> Jatin
>
>
>
> -----
> Novice Big Data Programmer
> --
> View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-serialization-issues-with-third-party-libraries-tp19454.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@spark.apache.org
> For additional commands, e-mail: user-help@spark.apache.org
>


