toDF() is first introduced in Scala and Python (because
createDataFrame is too long), is used in lots places, I think it's
useful.
On Fri, May 8, 2015 at 11:03 AM, Shivaram Venkataraman
wrote:
> Agree that toDF is not very useful. In fact it was removed from the
> namespace in a recent change
> h
Agree that toDF is not very useful. In fact it was removed from the
namespace in a recent change
https://github.com/apache/spark/commit/4e930420c19ae7773b138dfc7db8fc03b4660251
Thanks
Shivaram
On Fri, May 8, 2015 at 1:10 AM, Sun, Rui wrote:
> toDF() is defined to convert an RDD to a DataFrame.
toDF() is defined to convert an RDD to a DataFrame. But it is just a very thin
wrapper of createDataFrame() by help the caller avoid input of SQLContext.
Since Scala/pySpark does not have toDF(), and we'd better keep API as narrow
and simple as possible. Is toDF() really necessary? Could we elim