Hi,
I'm converting Avro to Parquet and I'm getting this log entry back for a
timestamp field:
Written 1,008,842B for [occurred_at] INT64: 591,435 values, 2,169,557B raw,
1,008,606B comp, 5 pages, encodings: [BIT_PACKED, PLAIN, PLAIN_DICTIONARY,
RLE], dic { 123,832 entries, 990,656B raw, 123,832B comp}
Can someone please tell me whether this is the expected encoding for a timestamp
field?
I'm a bit surprised that it seems to be dictionary based. (Yes, I have
enabled dictionary encoding for Parquet files).
Regards,
-Stefán