hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-11777) implement an option to have single ETL strategy for multiple directories
Date Tue, 10 Nov 2015 19:36:10 GMT

    [ https://issues.apache.org/jira/browse/HIVE-11777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14999183#comment-14999183
] 

Prasanth Jayachandran commented on HIVE-11777:
----------------------------------------------

Can you add test cases for combining logic? Other than that patch looks good to me +1 

> implement an option to have single ETL strategy for multiple directories
> ------------------------------------------------------------------------
>
>                 Key: HIVE-11777
>                 URL: https://issues.apache.org/jira/browse/HIVE-11777
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Sergey Shelukhin
>            Assignee: Sergey Shelukhin
>         Attachments: HIVE-11777.01.patch, HIVE-11777.02.patch, HIVE-11777.03.patch, HIVE-11777.04.patch,
HIVE-11777.05.patch, HIVE-11777.patch
>
>
> In case of metastore footer PPD we don't want to call PPD call with all attendant SARG,
MS and HBase overhead for each directory. If we wait for some time (10ms? some fraction of
inputs?) we can do one call without losing overall perf. 
> For now make it time based.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message