drill-dev mailing list archives

From "Jacques Nadeau (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-19) Build a JSON scanner that does schema discovery
Date Fri, 11 Jan 2013 00:48:13 GMT

    [ https://issues.apache.org/jira/browse/DRILL-19?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13550629#comment-13550629
] 

Jacques Nadeau commented on DRILL-19:
-------------------------------------

Nulls:
I'm generally inclined to drop nulls.  I can imagine a few scenarios where this might be an
issue.  However, by and large I think that could be handled case-by-case in the scanner (e.g.
an option for a particular scanner such as "encode-nulls-as").
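
A minimal sketch of what that could look like, assuming the scanner works on
already-parsed JSON. The function name strip_nulls and the sample record are
hypothetical; only the "encode-nulls-as" option name comes from the comment
above, and this is not Drill's actual scanner code.

```python
import json

# Sentinel meaning "no replacement configured", so nulls are dropped.
_DROP = object()


def strip_nulls(value, encode_nulls_as=_DROP):
    """Recursively drop JSON nulls, or replace them if an encoding is given."""
    if isinstance(value, dict):
        out = {}
        for key, item in value.items():
            if item is None:
                # Default behaviour: the field simply disappears.
                if encode_nulls_as is not _DROP:
                    out[key] = encode_nulls_as
                continue
            out[key] = strip_nulls(item, encode_nulls_as)
        return out
    if isinstance(value, list):
        out = []
        for item in value:
            if item is None:
                if encode_nulls_as is not _DROP:
                    out.append(encode_nulls_as)
                continue
            out.append(strip_nulls(item, encode_nulls_as))
        return out
    return value


# Hypothetical input record, used only for illustration.
record = json.loads('{"test": null, "test2": [1, 2, 3]}')
dropped = strip_nulls(record)
encoded = strip_nulls(record, encode_nulls_as="NULL")
```

With the default, the "test" field is omitted entirely; with
encode_nulls_as="NULL" it is kept and carries the replacement value.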

Index number:
The array should probably carry a subtype value, which is the type shared by all of the
array's values. If the array is heterogeneous, then the array subtype should be latebind and
the individual values should be latebind encoded (schema in place).  In your case that means
that the schema should be something like
Field{type=ARRAY, id=2, fullFieldName="test2", subType="INTEGER"}.
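
As a rough illustration of that rule, the sketch below infers the subtype of
an array during schema discovery. The helper names json_type and array_field
are hypothetical, as are all type names other than ARRAY, INTEGER and
latebind; treating an empty array as latebind is also an assumption, since
the comment does not cover that case.

```python
def json_type(value):
    """Map a parsed JSON value to a (hypothetical) schema type name."""
    # bool must be checked before int: in Python, bool is a subclass of int.
    if isinstance(value, bool):
        return "BOOLEAN"
    if isinstance(value, int):
        return "INTEGER"
    if isinstance(value, float):
        return "FLOAT"
    if isinstance(value, str):
        return "STRING"
    if isinstance(value, list):
        return "ARRAY"
    if isinstance(value, dict):
        return "MAP"
    return "NULL"


def array_field(field_id, full_field_name, values):
    """Describe an array field; mixed element types fall back to LATEBIND."""
    subtypes = {json_type(v) for v in values}
    # One shared element type becomes the subtype. Anything else
    # (heterogeneous, or empty by assumption) is resolved late.
    sub_type = subtypes.pop() if len(subtypes) == 1 else "LATEBIND"
    return {
        "type": "ARRAY",
        "id": field_id,
        "fullFieldName": full_field_name,
        "subType": sub_type,
    }


homogeneous = array_field(2, "test2", [1, 2, 3])
heterogeneous = array_field(2, "test2", [1, "two", 3.0])
```

For the homogeneous case this yields the same shape as the Field shown
above, with subType INTEGER; for the mixed case the subtype is LATEBIND and
each value would need to carry its own type.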
 
                
> Build a JSON scanner that does schema discovery
> -----------------------------------------------
>
>                 Key: DRILL-19
>                 URL: https://issues.apache.org/jira/browse/DRILL-19
>             Project: Apache Drill
>          Issue Type: New Feature
>            Reporter: Jacques Nadeau
>            Assignee: Timothy Chen
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
