sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jarek Jarcec Cecho (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-1055) Add option to export from Hive use HQL query
Date Wed, 22 May 2013 06:17:25 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-1055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jarek Jarcec Cecho updated SQOOP-1055:
--------------------------------------

    Summary: Add option to export from Hive use HQL query  (was: Add Sqoop export --query
option)
    
> Add option to export from Hive use HQL query
> --------------------------------------------
>
>                 Key: SQOOP-1055
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1055
>             Project: Sqoop
>          Issue Type: Improvement
>            Reporter: Hari Sekhon
>
> Sqoop currently has a --query option for import but not for export.
> It would be nice if the export --query option supporting HiveQL could be added as users
currently have to create a temporary table and then export that as a two step process with
a full disk re-write of all the to-be-exported data to a new table before the sqoop export
command is started.
> Since Sqoop executes a distributed map-only job, I believe certain queries such as joins
that have to be done via a reduce phase will yield little performance improvement due to the
map->reduce intermediate writes needing to be written anyway. However we could save on
the final reduce phase writes and also turn this in to a more convenient one step instead
two step process.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message