Hi, guys,

I have a question about how Sqoop imports data in parallel. As I understand it, Sqoop first gets the min and max values of the SPLIT_BY column and then does range-based partitioning, so that each mapper consumes one range. Do we support hash-based partitioning, where each mapper ingests the rows satisfying a query like "select * from table where hash(split_by) % n = i"?
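
To make the question concrete, here is a rough sketch of the two strategies as I understand them. This is not Sqoop's actual code: the function names are made up for illustration, the range version assumes an integer split column, and hash() stands in for whatever hash function the database provides.

```python
def range_predicates(column, lo, hi, n):
    """Build n WHERE clauses that cut the integer range [lo, hi] into
    contiguous pieces, one per mapper (hypothetical helper)."""
    # n + 1 boundaries; the last one is hi + 1 so that hi itself is covered.
    bounds = [lo + (hi - lo + 1) * i // n for i in range(n + 1)]
    return [
        f"{column} >= {bounds[i]} AND {column} < {bounds[i + 1]}"
        for i in range(n)
    ]


def hash_predicates(column, n, hash_fn="hash"):
    """Build n WHERE clauses that assign each row to a mapper by hash
    value modulo n (hypothetical helper; hash_fn is a placeholder name)."""
    return [f"{hash_fn}({column}) % {n} = {i}" for i in range(n)]


if __name__ == "__main__":
    # Mapper i would run: select * from table where <clause i>
    for clause in range_predicates("id", 1, 100, 4):
        print(clause)
    for clause in hash_predicates("id", 4):
        print(clause)
```

With the range version, mapper i only reads rows between its two boundaries. With the hash version, every mapper runs the same kind of query and differs only in the remainder i.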

Thanks,
Wei