drill-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Omernik <j...@omernik.com>
Subject Re: Dealing with files created in Windows
Date Mon, 08 Feb 2016 18:26:58 GMT
Ya, I think I will use regex_replace once and have Drill preprocess things
(I think).  I am thinking that there are a number of JIRAs on this, and
that we should handle this better from the system level.



On Mon, Feb 8, 2016 at 12:23 PM, Nathan Griffith <ngriffith@dremio.com>
wrote:

> Was going to say my goto for this kind of issue is the 'tr' command in
> unix, but if I understand right you'd rather not have to preprocess,
> instead preferring an in-Drill solution.
>
> As I think you're hinting at, a Drill UDF tailored to the data might
> be one way to handle it.
>
> On Mon, Feb 8, 2016 at 10:07 AM, Abdel Hakim Deneche
> <adeneche@maprtech.com> wrote:
> > is dos2unix an option ?
> >
> > On Mon, Feb 8, 2016 at 9:56 AM, John Omernik <john@omernik.com> wrote:
> >
> >> Are there any decent tricks for dealing with Windows based text files
> (that
> >> use /r/n as the line ending rather than just /n)
> >>
> >> Right now my last field has /r showing up, and I'd like to not have that
> >> there, I guess I could regex_replace it maybe? I was hoping for a
> >> performant way to handle (Without reprocessing either)
> >>
> >> John
> >>
> >
> >
> >
> > --
> >
> > Abdelhakim Deneche
> >
> > Software Engineer
> >
> >   <http://www.mapr.com/>
> >
> >
> > Now Available - Free Hadoop On-Demand Training
> > <
> http://www.mapr.com/training?utm_source=Email&utm_medium=Signature&utm_campaign=Free%20available
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message