hive-issues mailing list archives

From "Matt McCline (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-17994) Vectorization: Serialization bottlenecked on irrelevant hashmap lookup
Date Wed, 20 Dec 2017 21:51:02 GMT

    [ https://issues.apache.org/jira/browse/HIVE-17994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16299153#comment-16299153 ]

Matt McCline commented on HIVE-17994:
-------------------------------------

Thank you [~teddy.choi] for your review and [~gopalv] for your help.

> Vectorization: Serialization bottlenecked on irrelevant hashmap lookup
> ----------------------------------------------------------------------
>
>                 Key: HIVE-17994
>                 URL: https://issues.apache.org/jira/browse/HIVE-17994
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gopal V
>            Assignee: Matt McCline
>            Priority: Minor
>         Attachments: HIVE-17994.01.patch, HIVE-17994.02.patch, HIVE-17994.03.patch, HIVE-17994.04.patch, HIVE-17994.05.patch, HIVE-17994.06.patch, vec-serialize-hashmap.png
>
>
> On machines with slower NUMA, the hashmap lookup for TypeInfo::getPrimitiveCategory is
> the slowest part of the vectorized serialization loops. The static object references run hot
> with the NUMA access speeds penalizing half the threads.
> This lookup is done for every column, for every row - though vectorization enforces that
> this type cannot change at all.
> !vec-serialize-hashmap.png!
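
The pattern described in the issue summary above (a map lookup repeated for every column of every row, although the column type is fixed) can be illustrated with a minimal Java sketch. This is not the code from the attached patches: the class, the Category enum, the CATEGORY_BY_TYPE_NAME map, and both method names are hypothetical stand-ins chosen for illustration only. It contrasts a per-cell lookup with resolving each column's category once before the row loop.

```java
import java.util.HashMap;
import java.util.Map;

public class HoistedLookupSketch {
    // Hypothetical stand-in for a primitive type category; not Hive's actual enum.
    enum Category { INT, STRING }

    // Hypothetical shared static map, standing in for the hot lookup in the report.
    static final Map<String, Category> CATEGORY_BY_TYPE_NAME = new HashMap<>();
    static {
        CATEGORY_BY_TYPE_NAME.put("int", Category.INT);
        CATEGORY_BY_TYPE_NAME.put("string", Category.STRING);
    }

    // Writes one value in a toy text format, dispatching on its category.
    static void writeCell(StringBuilder out, Category category, Object value) {
        switch (category) {
            case INT:
                out.append("i:").append(((Integer) value).intValue());
                break;
            case STRING:
                out.append("s:").append((String) value);
                break;
            default:
                throw new IllegalStateException("unexpected category " + category);
        }
        out.append(';');
    }

    // Slow shape: one map lookup per column, per row.
    static String serializePerCell(String[] typeNames, Object[][] rows) {
        StringBuilder out = new StringBuilder();
        for (Object[] row : rows) {
            for (int c = 0; c < typeNames.length; c++) {
                Category category = CATEGORY_BY_TYPE_NAME.get(typeNames[c]);
                writeCell(out, category, row[c]);
            }
        }
        return out.toString();
    }

    // Hoisted shape: resolve each column's category once, then reuse it for all rows.
    static String serializeHoisted(String[] typeNames, Object[][] rows) {
        Category[] categories = new Category[typeNames.length];
        for (int c = 0; c < typeNames.length; c++) {
            categories[c] = CATEGORY_BY_TYPE_NAME.get(typeNames[c]);
        }
        StringBuilder out = new StringBuilder();
        for (Object[] row : rows) {
            for (int c = 0; c < categories.length; c++) {
                writeCell(out, categories[c], row[c]);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String[] typeNames = {"int", "string"};
        Object[][] rows = {{1, "a"}, {2, "b"}};
        // Both shapes produce the same bytes; only the lookup count differs.
        System.out.println(serializePerCell(typeNames, rows)
                .equals(serializeHoisted(typeNames, rows)));
    }
}
```

Both methods produce identical output. The hoisted version touches the shared map once per column instead of once per cell, which is the kind of saving the report is asking for, assuming the column types stay fixed for the whole batch.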



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
