hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zoltan Haindrich (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-19943) Header values keep showing up in result sets
Date Tue, 26 Jun 2018 09:38:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-19943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523488#comment-16523488
] 

Zoltan Haindrich commented on HIVE-19943:
-----------------------------------------

{quote}Is there any way this could be reported to Hive to get fixed?{quote}

This ticket is exactly that :P 

I think if you could live with it: right now it would be the best option to disable vectorization
- and when the problem is better known there might be other options to workaround the issue
- or there might be a fix...

> Header values keep showing up in result sets
> --------------------------------------------
>
>                 Key: HIVE-19943
>                 URL: https://issues.apache.org/jira/browse/HIVE-19943
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 2.1.0
>         Environment: Hdinsight Hive interactivequerry
> [Components|https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-component-versioning#hadoop-components-available-with-different-hdinsight-versions]
>            Reporter: Liam De Lee
>            Priority: Major
>
> We are using the tblproperties ("skip.header.line.count"="1") when creating an external
table.
> When we do a select * from table we get it back as expected without the header present
in the result set.
> However when we do for instance a count(1) we get the header back in this count (tested
with a select * from table and paste it in notepad to find the amount of rows)
> If we also do this with a select distinct(column) from table we also get the header as
a distinct value.
> file structure:
> ||_TESTING_TYPE||
> |adf|
> |hyg|
> |abc|
>  
> *Update: 26/06/2018*
> Create statement:
> {code:java}
> -----------------------------------
> --test_type--
> -----------------------------------
> CREATE EXTERNAL TABLE IF NOT EXISTS ext.test_type_in
>   (
>     test_type      string
>     )
> ROW FORMAT DELIMITED
> FIELDS TERMINATED BY '\073'
> STORED AS TEXTFILE
> LOCATION 'adl://{adlslocation}data/data2/test'
> tblproperties ("skip.header.line.count"="1")
> {code}
>  Select statement:
> {code:java}
> select * from test_type_in;
> {code}
> Distinct statement:
> {code:java}
> select distinct test_type from test_type_in ORDER BY test_type;
> {code}
> I cannot show the exact statement because of NDA so i changed those values to test.
>  
> I can also tell you it is not just at our HDInsight but also at another company we are
working for. It does not Mather what is in the data as well. so for testing purposes:
> {code:java}
> test_type,abcg,gjeiza,aze,grriajj,gd,rrjri,vdju{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message