Hello Users,
I need to know whether we can run the latest Spark on the latest Hadoop version, i.e., Spark 2.2.1 (released on 1st Dec) and Hadoop 3.0.0 (released on 13th Dec).
Thanks.
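(For context, one quick way to check which Hadoop version a given Spark binary is actually linked against is from spark-shell; just a sketch, the output naturally depends on the build:)

// Run inside spark-shell: prints the Hadoop version this Spark build links against
org.apache.hadoop.util.VersionInfo.getVersion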
Here are a couple of tutorials that show how to extract structured nested data:
https://databricks.com/blog/2017/06/27/4-sql-high-order-lambda-functions-examine-complex-structured-data-databricks.html
https://databricks.com/blog/2017/06/13/five-spark-sql-utility-functions-extract-explore-complex-dat
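In case a quick example helps, here is a minimal sketch of the kind of extraction those posts describe, run from spark-shell (the JSON string and column names below are made up for illustration):

// Assumes a spark-shell session, where `spark` (SparkSession) and implicits are available
import spark.implicits._
import org.apache.spark.sql.functions._

// Hypothetical single-record nested JSON, just for illustration
val json = Seq(
  """{"name": "foo", "address": {"state": "CA", "country": "USA"}, "docs": [{"subject": "english", "year": 2016}]}"""
).toDS()

val df = spark.read.json(json)

// Nested struct fields can be pulled out with dot notation;
// the array column is flattened with explode()
df.select($"name", $"address.state", explode($"docs").as("doc"))
  .select($"name", $"state", $"doc.subject", $"doc.year")
  .show()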
Hi Kant,
I am not sure whether you have come up with a solution yet, but the following works for me (in Scala):
val emp_info = """
[
{"name": "foo", "address": {"state": "CA", "country": "USA"},
"docs":[{"subject": "english", "year": 2016}]},
{"name": "bar", "address": {"state": "OH", "c