Re: XML Parsing with Spark and Scala

2017-08-11 Thread Jörn Franke
Can you specify what "is not able to load" means and what the expected results are?

> On 11. Aug 2017, at 09:30, Etisha Jain wrote:
>
> Hi
>
> I want to do xml parsing with spark, but the data from the file is not able
> to load and the desired output is also not coming.
> I am attaching a f

Re: XML Parsing

2015-07-19 Thread Ram Sriharsha
You would need to write an XML input format that can parse XML into records based on start/end tags. Mahout has an XmlInputFormat implementation you should be able to import: https://github.com/apache/mahout/blob/master/integration/src/main/java/org/apache/mahout/text/wikipedia/XmlInputFormat.java Onc
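
To illustrate the start/end-tag idea mentioned in the reply above, here is a minimal sketch in plain Scala. It is not Mahout's code and does not use Spark; the object name XmlRecordSplitter and the method splitRecords are hypothetical names chosen for this example. It only shows how scanning for a start tag and its matching end tag turns a block of XML text into one string per record, which is roughly what a tag-based input format would hand to each task.

```scala
// Hypothetical helper, for illustration only: not part of Mahout or Spark.
object XmlRecordSplitter {

  /** Returns every substring of `input` that begins with `startTag`
    * and ends with the next occurrence of `endTag`.
    * Assumes records are not nested inside one another.
    */
  def splitRecords(input: String, startTag: String, endTag: String): List[String] = {
    @annotation.tailrec
    def loop(from: Int, acc: List[String]): List[String] = {
      val start = input.indexOf(startTag, from)
      if (start < 0) acc.reverse // no more records
      else {
        val end = input.indexOf(endTag, start + startTag.length)
        if (end < 0) acc.reverse // unterminated record: drop it
        else {
          val stop = end + endTag.length
          loop(stop, input.substring(start, stop) :: acc)
        }
      }
    }
    loop(0, Nil)
  }
}
```

For example, calling XmlRecordSplitter.splitRecords(xml, "<book>", "</book>") on a document with two book elements would yield two strings, one per element, each of which could then be parsed on its own. A real input format also has to deal with records that straddle file split boundaries, which this sketch ignores.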