I mean to add a step, putting a Python piped script in the middle. So instead of
RDBMS -> Sqoop -> Hive
You would do
RDBMS -> some file format -> Python (replacing ctrl-a) -> modified files -> Sqoop -> Hive
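A rough sketch of what the Python step could look like, assuming the intermediate files are plain delimited text and that swapping each ctrl-a (byte 0x01) for a space is acceptable. The names replace_ctrl_a, filter_stream and strip_ctrl_a.py are made up for illustration, and the space replacement is only an assumption; use whatever character suits your data.

```python
#!/usr/bin/env python3
"""Piped filter: replace Ctrl-A (0x01) bytes on stdin, write the result to stdout."""
import sys

CTRL_A = b"\x01"
CHUNK_SIZE = 64 * 1024  # read in chunks so large exports are not held in memory


def replace_ctrl_a(data: bytes, replacement: bytes = b" ") -> bytes:
    """Return data with every Ctrl-A byte replaced (an empty replacement drops it)."""
    return data.replace(CTRL_A, replacement)


def filter_stream(src, dst, replacement: bytes = b" ") -> None:
    """Copy the binary stream src to dst, replacing Ctrl-A bytes along the way.

    Ctrl-A is a single byte, so a chunk boundary can never split a match.
    """
    for chunk in iter(lambda: src.read(CHUNK_SIZE), b""):
        dst.write(replace_ctrl_a(chunk, replacement))


if __name__ == "__main__":
    # Work on raw bytes so the file's text encoding does not matter.
    filter_stream(sys.stdin.buffer, sys.stdout.buffer)
    sys.stdout.buffer.flush()
```

You would run it as something like: python3 strip_ctrl_a.py < exported.txt > cleaned.txt (both file names are placeholders). Working on bytes rather than decoded text is a deliberate choice here, since the export's encoding is not known.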
Does that help?
From: Nitin kak [mailto:firstname.lastname@example.org]
Sent: Monday, February 04, 2013 2:53 PM
Subject: Re: String replace functionality in Sqoop import using Oozie
Didn't get you. Could you please elaborate just a bit?
On Mon, Feb 4, 2013 at 2:43 PM, Connell, Chuck <Chuck.Connell@nuance.com> wrote:
Do it first with Python??
Is there a way to replace (or drop) a character in one of the fields on the fly while importing data from an RDBMS? I basically want to replace Ctrl-A characters in the fields. I know it's possible to do that with Sqoop Hive import (--hive-delims-replacement), but Sqoop Hive import is known not to work correctly with Oozie (hope this one gets remedied soon).