Gabriel Reid created CRUNCH-139:
-----------------------------------
Summary: PCollection#length doesn't always reduce the count to a single value
Key: CRUNCH-139
URL: https://issues.apache.org/jira/browse/CRUNCH-139
Project: Crunch
Issue Type: Bug
Reporter: Gabriel Reid
Assignee: Gabriel Reid
Fix For: 0.5.0
PCollection#length doesn't explicitly set the number of reducers to 1, which means that the
output of the counting mappers can be partitioned. This results only a partial count (i.e.
an incorrect value) being returned in the length PObject if the input PCollection spans multiple
default reduce partitions.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
|