I have been using Athena/Presto to read the parquet files in datalake, if your are already saving data to s3 I think this is the easiest option. Then I use Redash or Metabase to build dashboards (they have different limitations), both are very intuitive to use and easy to setup with docker.
-- Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/ --------------------------------------------------------------------- To unsubscribe e-mail: user-unsubscr...@spark.apache.org