From dev-return-2197-apmail-crunch-dev-archive=crunch.apache.org@crunch.apache.org Mon Mar 11 11:47:16 2013 Return-Path: X-Original-To: apmail-crunch-dev-archive@www.apache.org Delivered-To: apmail-crunch-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id D1030D3CB for ; Mon, 11 Mar 2013 11:47:16 +0000 (UTC) Received: (qmail 77386 invoked by uid 500); 11 Mar 2013 11:47:16 -0000 Delivered-To: apmail-crunch-dev-archive@crunch.apache.org Received: (qmail 77327 invoked by uid 500); 11 Mar 2013 11:47:14 -0000 Mailing-List: contact dev-help@crunch.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@crunch.apache.org Delivered-To: mailing list dev@crunch.apache.org Received: (qmail 77145 invoked by uid 500); 11 Mar 2013 11:47:13 -0000 Delivered-To: apmail-incubator-crunch-dev@incubator.apache.org Received: (qmail 77137 invoked by uid 99); 11 Mar 2013 11:47:12 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Mar 2013 11:47:12 +0000 Date: Mon, 11 Mar 2013 11:47:12 +0000 (UTC) From: "Dave Beech (JIRA)" To: crunch-dev@incubator.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (CRUNCH-162) Add utility function for merging output by identity reduce MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CRUNCH-162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13598743#comment-13598743 ] Dave Beech commented on CRUNCH-162: ----------------------------------- 'Shard' could work. I don't like 'reshard' so much - it implies the data has been previously sharded. I might just have read a load of log files, and I wouldn't be able to call those 'shards' and keep a straight face! > Add utility function for merging output by identity reduce > ---------------------------------------------------------- > > Key: CRUNCH-162 > URL: https://issues.apache.org/jira/browse/CRUNCH-162 > Project: Crunch > Issue Type: Improvement > Components: MapReduce Patterns > Affects Versions: 0.4.0 > Reporter: Dave Beech > Priority: Minor > > Something I find myself doing reasonably often in mapreduce is to use > the reduce step as nothing more than a means to merge data into larger > files (using the identity reducer). > There doesn't appear to be a neat way to do this with Crunch at the moment. > Ref: http://mail-archives.apache.org/mod_mbox/incubator-crunch-user/201302.mbox/%3CCAFZSZPsXRxWT45c9w4ef7Ruij2exE4HP2CDNMjd%2BVc%3D9RWX-Jw%40mail.gmail.com%3E -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira