I am trying to parse xml file using spark-xml. But for some reason when i print schema it only shows root instead of the hierarchy. I am using sqlcontext to read the data. I am proceeding according to this video : https://www.youtube.com/watch?v=NemEp53yGbI
The structure of xml file is somewhat like this: <books> <book> <name></name> <price></price> <orderId></orderId> </book> <book> //Some more data </book> </books> For some books there,are multiple orders i.e. large number of orders while for some it just occurs once as empty. I use the "rowtag" attribute as book. How do i proceed or is there any other way to tackle this problem? Help would be much appreciated. Thank you.