you have 2 options
1 )Clean ->Write your own parser to through each property and create a dataset
2) Hacky but simple -> Convert to json string then read in using spark.read.json(jsonString)
Please bear in mind the second option is expensive which is why it is hacky
I wrote my own parser
here which you can use to convert between JsonObjects to StructType schemas
Regards
Sam