From: John Omernik
Date: Mon, 8 Feb 2016 12:22:38 -0600
Subject: Re: Dealing with files created in Windows
To: user

No, I do not want to reprocess files. I am sorry for the bluntness, but this
doesn't seem like something for which Drill should require an outside ETL
process to prune the files. (The data I have is large, and it just seems like
a process prone to failure.)

On Mon, Feb 8, 2016 at 12:07 PM, Abdel Hakim Deneche wrote:

> Is dos2unix an option?
>
> On Mon, Feb 8, 2016 at 9:56 AM, John Omernik wrote:
>
> > Are there any decent tricks for dealing with Windows-based text files
> > (that use \r\n as the line ending rather than just \n)?
> >
> > Right now my last field has \r showing up, and I'd like to not have that
> > there. I guess I could regex_replace it, maybe? I was hoping for a
> > performant way to handle it (without reprocessing, either).
> >
> > John
>
> --
> Abdelhakim Deneche
> Software Engineer
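
For illustration, here is a minimal sketch of the symptom described in the original question and of what the regex workaround would do to the last field. It is plain Python rather than Drill SQL. The sample data and the helper name strip_trailing_cr are hypothetical, and the idea that Drill's equivalent would be a regexp_replace call on the last column is an assumption that nobody in this thread has confirmed.

```python
import re

# Hypothetical sample: two CSV records written on Windows (CRLF line endings).
raw = "id,name\r\n1,alice\r\n"

# A reader that splits on "\n" only leaves a stray "\r" on the last field of each row.
rows = [line.split(",") for line in raw.split("\n") if line]


def strip_trailing_cr(field: str) -> str:
    """Remove a single carriage return at the very end of the field, if present."""
    return re.sub(r"\r$", "", field)


# Apply the fix to the last field only; the other fields are left untouched.
cleaned = [row[:-1] + [strip_trailing_cr(row[-1])] for row in rows]
```

The pattern is anchored to the end of the field, so a carriage return inside a value is left alone. Note that this still costs one regex evaluation per row at query time, which is the performance concern raised above.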