From issues-return-193725-apmail-hive-issues-archive=hive.apache.org@hive.apache.org Tue Jun 16 16:55:07 2020 Return-Path: X-Original-To: apmail-hive-issues-archive@minotaur.apache.org Delivered-To: apmail-hive-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [207.244.88.153]) by minotaur.apache.org (Postfix) with SMTP id 6E5051944A for ; Tue, 16 Jun 2020 16:55:07 +0000 (UTC) Received: (qmail 86169 invoked by uid 500); 16 Jun 2020 16:55:03 -0000 Delivered-To: apmail-hive-issues-archive@hive.apache.org Received: (qmail 86070 invoked by uid 500); 16 Jun 2020 16:55:03 -0000 Mailing-List: contact issues-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list issues@hive.apache.org Received: (qmail 86026 invoked by uid 99); 16 Jun 2020 16:55:03 -0000 Received: from mailrelay1-us-west.apache.org (HELO mailrelay1-us-west.apache.org) (209.188.14.139) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Jun 2020 16:55:03 +0000 Received: from jira-he-de.apache.org (static.172.67.40.188.clients.your-server.de [188.40.67.172]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 8490541C22 for ; Tue, 16 Jun 2020 16:55:02 +0000 (UTC) Received: from jira-he-de.apache.org (localhost.localdomain [127.0.0.1]) by jira-he-de.apache.org (ASF Mail Server at jira-he-de.apache.org) with ESMTP id 05248780519 for ; Tue, 16 Jun 2020 16:55:02 +0000 (UTC) Date: Tue, 16 Jun 2020 16:55:02 +0000 (UTC) From: "ASF GitHub Bot (Jira)" To: issues@hive.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Work logged] (HIVE-19103) Nested structure Projection Push Down in Hive with ORC MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-19103?focusedWorklogId=446685&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-446685 ] ASF GitHub Bot logged work on HIVE-19103: ----------------------------------------- Author: ASF GitHub Bot Created on: 16/Jun/20 16:54 Start Date: 16/Jun/20 16:54 Worklog Time Spent: 10m Work Description: github-actions[bot] commented on pull request #330: URL: https://github.com/apache/hive/pull/330#issuecomment-644886703 This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Feel free to reach out on the dev@hive.apache.org list if the patch is in need of reviews. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: users@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 446685) Remaining Estimate: 0h Time Spent: 10m > Nested structure Projection Push Down in Hive with ORC > ------------------------------------------------------ > > Key: HIVE-19103 > URL: https://issues.apache.org/jira/browse/HIVE-19103 > Project: Hive > Issue Type: Improvement > Components: Hive, ORC > Reporter: Ashish Sharma > Assignee: Ashish Sharma > Priority: Critical > Labels: pull-request-available > Attachments: HIVE-19103.2.patch, HIVE-19103.3.patch, HIVE-19103.patch > > Time Spent: 10m > Remaining Estimate: 0h > > Reading required columns only in nested structure schema > Example - > *Current state* - > Schema - struct,g:string>> > Query - select c.e.f from t where c.e.f > 10; > Current state - read entire c struct from the file and then filter because "hive.io.file.readcolumn.ids" is referred due to which all the children column are select to read from the file. > Conf - > _hive.io.file.readcolumn.ids = "2" > hive.io.file.readNestedColumn.paths = "c.e.f"_ > Result - > boolean[ ] include = [true,false,false,true,true,true,true,true] > *Expected state* - > Schema - struct,g:string>> > Query - select c.e.f from t where c.e.f > 10; > Expected state - instead of reading entire c struct from the file just read only the f column by referring the " hive.io.file.readNestedColumn.paths". > Conf - > _hive.io.file.readcolumn.ids = "2" > hive.io.file.readNestedColumn.paths = "c.e.f"_ > Result - > boolean[ ] include = [true,false,false,true,false,true,true,false] -- This message was sent by Atlassian Jira (v8.3.4#803005)