sqoop-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amir Mohammad Saied <amirsa...@gmail.com>
Subject Specifying row key in sqoop-import
Date Mon, 16 Dec 2013 13:42:55 GMT
Hi,

I'm using Sqoop to import (only one column of) a table from MySQL to HDFS.
I'd like records to be stored as SequenceFiles so I can run Mahout's
"seq2sparse" to generate Vectors from them later.

I've two questions regarding the import process:

1) Dumping SequenceFiles generated by sqoop-import, I realized the row
"Key" is automatically generated by Sqoop, and is not the "id" column of
the MySQL table row. Can I ask sqoop-import to use the row's "id" field as
Key?

2) If its possible to set row "Key" (above question), can I cast it to a
specific class using sqoop-import?

Thanks,

amir

Mime
View raw message