hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Marta Kuczora (Jira)" <>
Subject [jira] [Commented] (HIVE-21215) Read Parquet INT64 timestamp
Date Fri, 10 Jan 2020 09:57:00 GMT


Marta Kuczora commented on HIVE-21215:

Hi [~zhxjdwh], no this issue is not solved yet. The Parquet update is blocked by a recently
found bug in ParquetFooterInputFromCache. (HIVE-22716). As soon as I get that fix in, we
can go forward with upgrading the Parquet version and then with this patch.

> Read Parquet INT64 timestamp
> ----------------------------
>                 Key: HIVE-21215
>                 URL:
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Karen Coppage
>            Assignee: Marta Kuczora
>            Priority: Major
> [WIP]
> This patch enables Hive to start reading timestamps from Parquet written with the new
> With Parquet version 1.11, a new timestamp LogicalType with base INT64 and the following
metadata is introduced:
> * boolean isAdjustedToUtc: marks whether the timestamp is converted to UTC (aka Instant
semantics) or not (LocalDateTime semantics).
> * enum TimeUnit (NANOS, MICROS, MILLIS): granularity of timestamp
> Upon reading, the semantics of these new timestamps will be determined by their metadata,
while the semantics of INT96 timestamps will continue to be deduced from the writer metadata.
> This feature will be behind a flag for now.

This message was sent by Atlassian Jira

View raw message