sqoop-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Sowmya Ramesh (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SQOOP-2457) Add option to automatically compute statistics after loading date into a hive table
Date Thu, 14 Jul 2016 22:22:20 GMT

     [ https://issues.apache.org/jira/browse/SQOOP-2457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Sowmya Ramesh updated SQOOP-2457:
---------------------------------
    Issue Type: Improvement  (was: Bug)

> Add option to  automatically compute statistics after loading date into a hive table
> ------------------------------------------------------------------------------------
>
>                 Key: SQOOP-2457
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2457
>             Project: Sqoop
>          Issue Type: Improvement
>    Affects Versions: 1.4.6
>            Reporter: Venkat Ranganathan
>            Assignee: Venkat Ranganathan
>             Fix For: 1.4.7
>
>
> With CBO and different execution engines like Tez depedning on statistics like row count
heavily, it is important that we provide the option to update stats on data loaded into Hive
as part of the --hive-import option.  Ideally these should be Hive managed, but there are
use cases where this is not automatic and hence this option will help in those cases
> Will be disabled by default.   Enabled by a flag 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message