I have set up an AWS EMR cluster on which I can run my Spark queries
without problems.

The next part of my work is to run queries submitted from a web client
and show the results back in that client.
As far as I know, I can only run Spark queries from within the EMR cluster,
and they do not return output instantly, since Spark takes time to run them
(each query is a batch job).
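
To make the problem concrete, the shape I imagine is "submit a query, get a
job id back immediately, then poll until the result is ready". Below is a
minimal sketch of that idea in plain Python. The names run_spark_query and
QueryService are hypothetical, and the thread pool only stands in for
whatever would actually submit work to the cluster; this is an illustration
under those assumptions, not a statement of how EMR or Spark expose it.

```python
import time
import uuid
from concurrent.futures import ThreadPoolExecutor


def run_spark_query(sql):
    """Hypothetical stand-in for a slow Spark batch job on the cluster."""
    time.sleep(0.1)  # simulate the query taking a while
    return [("rows for", sql)]


class QueryService:
    """Accepts queries, runs them in the background, and lets clients poll."""

    def __init__(self, runner, workers=2):
        self._runner = runner
        self._pool = ThreadPoolExecutor(max_workers=workers)
        self._jobs = {}  # job id -> Future

    def submit(self, sql):
        # Return immediately with an id the web client can poll with.
        job_id = str(uuid.uuid4())
        self._jobs[job_id] = self._pool.submit(self._runner, sql)
        return job_id

    def status(self, job_id):
        future = self._jobs[job_id]
        if not future.done():
            return {"state": "RUNNING"}
        if future.exception() is not None:
            return {"state": "FAILED", "error": str(future.exception())}
        return {"state": "DONE", "result": future.result()}


# The web client would call submit() once and then status() repeatedly.
service = QueryService(run_spark_query)
job_id = service.submit("SELECT 1")
while service.status(job_id)["state"] == "RUNNING":
    time.sleep(0.05)
final = service.status(job_id)
```

In a real setup the submit and status calls would presumably sit behind
HTTP endpoints, and the results would be stored somewhere the web tier can
read rather than held in memory.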

I would like to know what patterns and practices people usually follow
when they set up an EMR-based cluster, and how they get the data out so it
can be shown in some interface.
