spark-user mailing list archives

From Didac Gil <>
Subject Load multiple CSV from different paths
Date Wed, 05 Jul 2017 14:08:14 GMT

Do you know any simple way to load multiple CSV files (same schema) that are in different paths?
Wildcards are not a solution, as I want to load specific csv files from different folders.

I came across a solution (
that suggests something like"csv").option("header", "false")
            .option('delimiter', '\t')
            .option('mode', 'DROPMALFORMED')
However, even though it says that this approach works in Spark 2.x, I can't find an implementation
of load that accepts an Array[String] as an input parameter.
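
For reference, a minimal sketch of one way this could be written against the Scala API, assuming Spark 2.x. In Scala, DataFrameReader.load is declared with a varargs parameter (paths: String*) rather than an Array[String], so an array has to be expanded with `: _*`. The file paths, the object name and the local master setting below are hypothetical and only there for illustration.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object LoadMultipleCsv {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("load-multiple-csv")
      .master("local[*]") // assumption: local run, for illustration only
      .getOrCreate()

    // Hypothetical paths: specific files in different folders, no wildcards.
    val paths: Array[String] = Array(
      "/data/folderA/file1.csv",
      "/data/folderB/file7.csv"
    )

    // load takes String*, so the Array[String] is expanded with `: _*`.
    val df: DataFrame = spark.read
      .format("csv")
      .option("header", "false")
      .option("delimiter", "\t")
      .option("mode", "DROPMALFORMED")
      .load(paths: _*)

    df.show()
    spark.stop()
  }
}
```

All listed files are read into a single DataFrame, which is only meaningful when they share the same schema, as in the question. The same expansion should also work with the csv shortcut, i.e. spark.read.csv(paths: _*).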

Thanks in advance for your help.

Didac Gil de la Iglesia
PhD in Computer Science
Spain:     +34 696 285 544
Sweden: +46 (0)730229737
