hive-issues mailing list archives

From "ASF GitHub Bot (Jira)" <>
Subject [jira] [Work logged] (HIVE-25296) Replace parquet-hadoop-bundle dependency with the actual parquet modules
Date Tue, 06 Jul 2021 11:17:00 GMT


ASF GitHub Bot logged work on HIVE-25296:

                Author: ASF GitHub Bot
            Created on: 06/Jul/21 11:16
            Start Date: 06/Jul/21 11:16
    Worklog Time Spent: 10m 
      Work Description: kgyrtkirk commented on pull request #2288:

   Looking at the jar contents, I see:
   * some slf4j stuff in the jar
   * a parquet.thrift at the root, which is a bit odd to me
   * a thrift runtime
   * some jackson stuff

   Given the above, I would be a bit worried about the thrift runtime/compatibility. Could you
do a real-world test that launches some Tez containers and/or LLAP to see if everything is working?
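
A jar inspection like the one described in this comment can be sketched in a few lines of Java. This is only an illustration: the class name `JarContentSummary` and its methods are hypothetical and not part of Hive or Parquet, and the jar path is passed on the command line. It counts entries per two-level package prefix, so bundled slf4j, thrift, or jackson classes and root-level files such as parquet.thrift stand out. It does not replace a real-world test on Tez/LLAP.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;

// Hypothetical helper, not part of Hive or Parquet.
public class JarContentSummary {

    /** Counts file entries grouped by their first `depth` directory segments. */
    static Map<String, Integer> summarize(List<String> entryNames, int depth) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String name : entryNames) {
            if (name.endsWith("/")) {
                continue; // skip directory entries
            }
            String[] parts = name.split("/");
            int dirSegments = parts.length - 1;
            String key;
            if (dirSegments == 0) {
                // Files at the jar root (e.g. parquet.thrift) are reported by name.
                key = name;
            } else {
                int n = Math.min(depth, dirSegments);
                key = String.join("/", Arrays.copyOfRange(parts, 0, n));
            }
            counts.merge(key, 1, Integer::sum);
        }
        return counts;
    }

    /** Reads all entry names from the jar at the given path. */
    static List<String> entryNames(String jarPath) throws IOException {
        List<String> names = new ArrayList<>();
        try (JarFile jar = new JarFile(jarPath)) {
            jar.stream().map(JarEntry::getName).forEach(names::add);
        }
        return names;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java JarContentSummary <path-to-jar>");
            return;
        }
        summarize(entryNames(args[0]), 2)
            .forEach((prefix, n) -> System.out.println(prefix + "  " + n));
    }
}
```

Prefixes such as org/slf4j or a root-level parquet.thrift in the output would point to the kind of bundled content listed above.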

Issue Time Tracking

    Worklog Id:     (was: 619093)
    Time Spent: 20m  (was: 10m)

> Replace parquet-hadoop-bundle dependency with the actual parquet modules
> ------------------------------------------------------------------------
>                 Key: HIVE-25296
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Stamatis Zampetakis
>            Assignee: Stamatis Zampetakis
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>          Time Spent: 20m
>  Remaining Estimate: 0h
> The parquet-hadoop-bundle is not a real dependency but a mere packaging
> of three parquet modules to create an uber jar. The Parquet community
> created this artificial module in response to HIVE-5783, but the
> benefits, if any, are unclear.
> On the contrary, using the uber dependency has some drawbacks:
> * Parquet source code cannot be attached easily in IDEs, which makes debugging sessions difficult
> * Finding the concrete dependencies on Parquet is not possible just by inspecting the pom
> * Extra maintenance cost for the Parquet community, in the form of additional verification steps during a release.
> The goal of this JIRA is to replace the uber dependency with concrete dependencies on the respective modules:
> * parquet-common
> * parquet-column
> * parquet-hadoop
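
For illustration, the three concrete dependencies could be declared in a pom roughly as follows. This is a sketch, not the actual change from the pull request: the `${parquet.version}` property is an assumption, and any scopes or exclusions the real build may need are left out.

```xml
<!-- Sketch only: the ${parquet.version} property is assumed, not taken from the PR -->
<dependency>
  <groupId>org.apache.parquet</groupId>
  <artifactId>parquet-common</artifactId>
  <version>${parquet.version}</version>
</dependency>
<dependency>
  <groupId>org.apache.parquet</groupId>
  <artifactId>parquet-column</artifactId>
  <version>${parquet.version}</version>
</dependency>
<dependency>
  <groupId>org.apache.parquet</groupId>
  <artifactId>parquet-hadoop</artifactId>
  <version>${parquet.version}</version>
</dependency>
```

With the modules listed individually, the Parquet dependencies become visible by inspecting the pom, and IDEs can attach the matching source jars.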
