I have set up an AWS EMR cluster on which I can run my Spark queries without problems.
The next part of my work is to run queries that come in from a web client and show the results there. As far as I know, I can only run the Spark queries from my EMR cluster, and they don't return output instantly, since Spark runs them as batch jobs and that takes time. I would like to know what patterns and practices people usually follow when they set up an EMR-based cluster, and how to get the data out so it can be shown in some interface.