spark-issues mailing list archives

From 宿荣全 (JIRA) <j...@apache.org>
Subject [jira] [Created] (SPARK-4817) Print the specified number of data and handle all of the elements in RDD
Date Wed, 10 Dec 2014 10:37:12 GMT
宿荣全 created SPARK-4817:
--------------------------

             Summary: Print the specified number of data and handle all of the elements in RDD
                 Key: SPARK-4817
                 URL: https://issues.apache.org/jira/browse/SPARK-4817
             Project: Spark
          Issue Type: New Feature
          Components: Streaming
            Reporter: 宿荣全
            Priority: Minor


The current DStream.print function prints 10 elements and handles only 11 elements.
This issue proposes a new function, based on DStream.print, that prints a specified number
of elements while still handling all of the elements in the RDD.
Example use case:
stream -> map -> filter -> mapPartitions -> print
The filtered data must update a database in mapPartitions, so every element has to be
handled. There is no need to print every element, though; printing only the first 20 is
enough to check on the data processing.
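
The requested behaviour (handle every element, print only the first few) can be sketched in plain Scala without any Spark dependency. This is only an illustration under stated assumptions: `PrintSomeProcessAll`, `processAllKeepFirst`, and the `handle` callback are hypothetical names, not part of the Spark API, and the counter stands in for the database update mentioned above.

```scala
// Hypothetical helper, NOT part of Spark: it consumes every element of an
// iterator (so side effects such as database updates run for all of them)
// while keeping only the first `num` elements for printing.
object PrintSomeProcessAll {
  def processAllKeepFirst[T](elems: Iterator[T], num: Int)(handle: T => Unit): Vector[T] = {
    val firstN = Vector.newBuilder[T]
    var kept = 0
    elems.foreach { e =>
      handle(e) // runs for every element, not just the printed ones
      if (kept < num) {
        firstN += e
        kept += 1
      }
    }
    firstN.result()
  }

  def main(args: Array[String]): Unit = {
    var updated = 0 // assumption: stand-in for a real database update
    val filtered = (1 to 100).iterator.filter(_ % 2 == 0)
    val shown = processAllKeepFirst(filtered, 20)(_ => updated += 1)
    shown.foreach(println)
    println(s"handled $updated elements, printed ${shown.size}")
  }
}
```

In a streaming job the same idea would presumably be applied per RDD, with the handling step running on the executors and only the first elements being returned to the driver for printing; how that maps onto the actual DStream API is left open here.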




