I tried to check out what Spark SQL 1.3.0. I installed it and following the
online document here:
http://spark.apache.org/docs/latest/sql-programming-guide.html
In the example, it shows something like this:// Select everybody, but increment
the age by 1
df.select("name", df("age") + 1).show()
// name (age + 1)
// Michael null
// Andy 31
// Justin 20
But what I got on my Spark 1.3.0 is the following error:
Welcome to
____ __
/ __/__ ___ _____/ /__
_\ \/ _ \/ _ `/ __/ '_/
/___/ .__/\_,_/_/ /_/\_\ version 1.3.0
/_/
Using Scala version 2.10.4 (Java HotSpot(TM) 64-Bit Server VM, Java
1.6.0_43)scala> val sqlContext = new org.apache.spark.sql.SQLContext(sc)
sqlContext: org.apache.spark.sql.SQLContext =
org.apache.spark.sql.SQLContext@1c845f64
scala> val df = sqlContext.jsonFile("/user/yzhang/people.json")df:
org.apache.spark.sql.DataFrame = [age: bigint, name: string]scala>
df.printSchema
root
|-- age: long (nullable = true)
|-- name: string (nullable = true)scala> df.select("name", df("age") +
1).show()
<console>:30: error: overloaded method value select with alternatives:
(col: String,cols: String*)org.apache.spark.sql.DataFrame <and>
(cols: org.apache.spark.sql.Column*)org.apache.spark.sql.DataFrame
cannot be applied to (String, org.apache.spark.sql.Column)
df.select("name", df("age") + 1).show()
^
Is this a bug in Spark 1.3.0, or my build having some problem?
Thanks