sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jarek Jarcec Cecho (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-1055) Add option to export from Hive use HQL query
Date Wed, 22 May 2013 06:17:25 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel

Jarek Jarcec Cecho updated SQOOP-1055:

    Summary: Add option to export from Hive use HQL query  (was: Add Sqoop export --query
> Add option to export from Hive use HQL query
> --------------------------------------------
>                 Key: SQOOP-1055
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1055
>             Project: Sqoop
>          Issue Type: Improvement
>            Reporter: Hari Sekhon
> Sqoop currently has a --query option for import but not for export.
> It would be nice if the export --query option supporting HiveQL could be added as users
currently have to create a temporary table and then export that as a two step process with
a full disk re-write of all the to-be-exported data to a new table before the sqoop export
command is started.
> Since Sqoop executes a distributed map-only job, I believe certain queries such as joins
that have to be done via a reduce phase will yield little performance improvement due to the
map->reduce intermediate writes needing to be written anyway. However we could save on
the final reduce phase writes and also turn this in to a more convenient one step instead
two step process.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message