spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "M. Le Bihan (JIRA)" <>
Subject [jira] [Created] (SPARK-26968) option("quoteMode", "NON_NUMERIC") have no effect on a CSV generation
Date Fri, 22 Feb 2019 09:34:00 GMT
M. Le Bihan created SPARK-26968:

             Summary: option("quoteMode", "NON_NUMERIC") have no effect on a CSV generation
                 Key: SPARK-26968
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.4.0
            Reporter: M. Le Bihan

I have a CSV to write that has that schema :
StructType s = schema.add("codeCommuneCR", StringType, false);
s = s.add("nomCommuneCR", StringType, false);
s = s.add("populationCR", IntegerType, false);
s = s.add("resultatComptable", IntegerType, false);{code}
If I don't provide an option "_quoteMode_" or even if I set it to {{NON_NUMERIC}}, this way
ds.coalesce(1).write().mode(SaveMode.Overwrite) .option("header", "true") .option("quoteMode",
"NON_NUMERIC").option("quote", "\"") .csv("./target/out_200071470.csv");{code}
the CSV written by {{Spark}} is this one :
If I set an option "_quoteAll_" instead, like that :
ds.coalesce(1).write().mode(SaveMode.Overwrite) .option("header", "true") .option("quoteAll",
true).option("quote", "\"") .csv("./target/out_200071470.csv");{code}
it generates :
"codeCommuneCR","nomCommuneCR","populationCR","resultatComptable" "03142","LENAX","267","43"{code}
It seems that the {{.option("quoteMode", "NON_NUMERIC")}} is broken. It should generate:


This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message