-1

I have a spark dataframe with a single string column storing json values.

I need to infer the json schema from that string column and create another dataframe based on that json schema with multiple columns.

Json string values have multiple levels of hierarchy.

How can this be accomplished in PySpark? LOL I am breaking my head...

Much help would be appreciated.

Best Regards

bda
  • 144
  • 3
  • 16
  • https://stackoverflow.com/help/minimal-reproducible-example – wwnde May 18 '22 at 20:48
  • 1
    please check if this [question](https://stackoverflow.com/questions/55337552/how-to-parse-json-column-in-dataframe-in-scala) helps. – Guy May 19 '22 at 05:33
  • @Guy this looks interesting. Will try this approach. Ideally need to see this in PySpark, I do not know scala, yet :) – bda May 19 '22 at 18:44

0 Answers0