arrow-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Philipp Moritz (JIRA)" <>
Subject [jira] [Created] (ARROW-4912) [C++, Python] Allow specifying column names to CSV reader
Date Sat, 16 Mar 2019 00:51:00 GMT
Philipp Moritz created ARROW-4912:

             Summary: [C++, Python] Allow specifying column names to CSV reader
                 Key: ARROW-4912
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Philipp Moritz

Currently I think there is no way to specify custom column names for CSV files. It's possible
to specify the full schema of the file, but not just column names.

See the related discussion here: ARROW-3722

The goal of this is to re-use the CSV type-inference but still allow people to specify custom
names for the columns. As far as I know, there is currently no way to set column names post-hoc,
so we should provide a way to specify them before reading the file.

Related to this, ParseOptions(header_rows=0) is not currently implemented.

Is there any current way to do this or does this need to be implmented?

This message was sent by Atlassian JIRA

View raw message