drill-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stefán Baxter <ste...@activitystream.com>
Subject Parquet drill date fields
Date Wed, 03 Feb 2016 13:15:28 GMT
Hi,

I'm converting Avro to parquest and I'm getting this log entry back for a
timestamp field:

Written 1,008,842B for [occurred_at] INT64: 591,435 values, 2,169,557B raw,
1,008,606B comp, 5 pages, encodings: [BIT_PACKED, PLAIN, PLAIN_DICTIONARY,
RLE], dic { 123,832 entries, 990,656B raw, 123,832B comp}

Can someone please tell me if this is the expected encoding for a timestamp
field.

I'm a bit surprised that it seems to be dictionary based. (Yes, I have
enabled dictionary encoding for Parquet files).

Regards,
 -Stefán

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message