hive-issues mailing list archives

From "Colin Ma (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-18048) Support Struct type with vectorization for Parquet file
Date Mon, 25 Dec 2017 03:51:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-18048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303047#comment-16303047 ]

Colin Ma commented on HIVE-18048:
---------------------------------

[~Ferd], the initial patch is finished. Could you help review it? Thanks.

> Support Struct type with vectorization for Parquet file
> -------------------------------------------------------
>
>                 Key: HIVE-18048
>                 URL: https://issues.apache.org/jira/browse/HIVE-18048
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Colin Ma
>            Assignee: Colin Ma
>         Attachments: HIVE-18048.001.patch
>
>
> Struct type is not supported in MapWork with vectorization; it should be supported to
> improve performance.
> The following improvements will be implemented:
> * Add fields of struct type to VectorizedRowBatchCtx.
> * Improve the VectorizedParquetRecordReader to support the struct type for parquet file.
> Note:
> * ORC files won't be supported.
> * Filter operator won't be supported.
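
For illustration only, here is a rough sketch of what a vectorized struct column involves: each struct field is stored as its own child column, so a batch of rows becomes one array per field. This is not code from the attached patch. The names ColVec, LongColVec, StringColVec, StructColVec and StructBatchSketch are hypothetical stand-ins, not Hive's classes, and the struct<id:bigint,name:string> schema is assumed.

```java
/** Hypothetical, simplified column vectors; NOT Hive's actual classes. */
abstract class ColVec { }

/** One primitive long value per row. */
final class LongColVec extends ColVec {
    final long[] vector;
    LongColVec(int size) { this.vector = new long[size]; }
}

/** One string value per row. */
final class StringColVec extends ColVec {
    final String[] vector;
    StringColVec(int size) { this.vector = new String[size]; }
}

/** A struct column is a group of child columns, one per struct field. */
final class StructColVec extends ColVec {
    final ColVec[] fields;
    StructColVec(ColVec... fields) { this.fields = fields; }
}

final class StructBatchSketch {
    /** Builds a struct<id:bigint,name:string> column from row-wise input (assumed schema). */
    static StructColVec fromRows(long[] ids, String[] names) {
        if (ids.length != names.length) {
            throw new IllegalArgumentException("field arrays must have equal length");
        }
        LongColVec idCol = new LongColVec(ids.length);
        StringColVec nameCol = new StringColVec(names.length);
        for (int row = 0; row < ids.length; row++) {
            idCol.vector[row] = ids[row];     // field 0 of every row is contiguous
            nameCol.vector[row] = names[row]; // field 1 of every row is contiguous
        }
        return new StructColVec(idCol, nameCol);
    }

    /** Sums the "id" field over the first size rows with a tight loop on one array. */
    static long sumIds(StructColVec struct, int size) {
        long[] ids = ((LongColVec) struct.fields[0]).vector;
        long sum = 0;
        for (int row = 0; row < size; row++) {
            sum += ids[row];
        }
        return sum;
    }
}
```

Reading one field of the struct then touches a single primitive array instead of materializing a row object per record, which is where the performance gain of vectorization is expected to come from.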



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
