Re: renaming SchemaRDD -> DataFrame

2015-02-12 Thread vha14
Ha CTO, Semantic Scholar http://www.quora.com/What-is-Semantic-Scholar-and-how-will-it-work -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/renaming-SchemaRDD-DataFrame-tp10271p10612.html Sent from the Apache Spark Developers List mail

Re: renaming SchemaRDD -> DataFrame

2015-02-10 Thread Reynold Xin
;>>>>>>>>>>> > >> > >>>>>>>>>>>> > >> > >>>>>>>>>>>> On Tue, Jan 27, 2015 at 6:28 AM, Dirceu Semighini Filho < > >> > >>>>>>>

Re: renaming SchemaRDD -> DataFrame

2015-02-10 Thread Koert Kuipers
gt; >> > >>>>>> removed >> > >>>>>>>>>>>>> >> > >>>>>>>>>>>>> in >> > >>>>>>>>>> >> > >>>>>>>>>> the

Re: renaming SchemaRDD -> DataFrame

2015-02-10 Thread Reynold Xin
vy > > >>>>>>>>>>>>>> > > >>>>>>>>>>>>>> lifting > > >>>>>>>>>>>>>> required for a full SQL support. It is amazing that even > > >>>>>>>>>>>>

Re: renaming SchemaRDD -> DataFrame

2015-02-10 Thread Matei Zaharia
>>>>> >>>>>>>>>>>> DataFrame? >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> With this, we don't impact in existing code

Re: renaming SchemaRDD -> DataFrame

2015-02-10 Thread Koert Kuipers
t;>>>>>>> I want to address the issue that Matei raised about the > >>> > >>> heavy > >>>>>>>>>>>>>> > >>>>>>>>>>>>>> lifting > >>>>>>>>>&g

Re: renaming SchemaRDD -> DataFrame

2015-02-01 Thread Evan Chan
;>>>>> lifting >>>>>>>>>>>>>> required for a full SQL support. It is amazing that even >>>>>>>>>>>>>> after >>>>>> >>>>>> 30 >>>>>>>>>> >>>>>>

Re: renaming SchemaRDD -> DataFrame

2015-01-29 Thread Cheng Lian
w.youtube.com/watch?v=YWppYPWznSQ From: Patrick Wendell To: Reynold Xin Cc: "dev@spark.apache.org" Sent: Monday, January 26, 2015 4:01 PM Subject: Re: renaming SchemaRDD -> DataFrame One thing potentially not clear from this e-mail, there will be

Re: renaming SchemaRDD -> DataFrame

2015-01-29 Thread Cheng Lian
ak < michaelma...@yahoo.com.invalid> wrote: And in the off chance that anyone hasn't seen it yet, the Jan. 13 Bay Area Spark Meetup YouTube contained a wealth of background information on this idea (mostly from Patrick and Reynold :-). https://www.youtube.com/watch?v=YWppYPWzn

Re: renaming SchemaRDD -> DataFrame

2015-01-29 Thread Koert Kuipers
more descriptive. > >> > >> >> >>>>> > >> > >> >> >>>>> Even if SchemaRDD's needs to rely on Spark SQL under the > >> > covers, > >> > >> >> >>>>> it > >>

Re: renaming SchemaRDD -> DataFrame

2015-01-29 Thread Evan Chan
o at least >> > choose a >> > >> >> >>> package >> > >> >> >>>>> name for it that omits "sql". >> > >> >> >>>>> >> > >> >> >>>>&

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Evan R. Sparks
15 at 5:32 PM, Matei Zaharia < > > >> >> >>> matei.zaha...@gmail.com> > > >> >> >>>>> wrote: > > >> >> >>>>> > > >> >> >>>>>> (Actually when we designed Spark SQL we thought of giving i

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Michael Armbrust
t;>>>> be more clear from a user-facing perspective to at least > >> >> >> >>>>> choose a > >> >> >> >>> package > >> >> >> >>>>> name for it that omits "sql". > &

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Evan Chan
t;> >> >> this >> >> >> >>>>> point? >> >> >> >>>>> >> >> >> >>>>> -Sandy >> >> >> >>>>> >> >> >> >>>>> On Mon, Jan 26, 2015 at 5

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Reynold Xin
aria < > >> >> >>> matei.zaha...@gmail.com> > >> >> >>>>> wrote: > >> >> >>>>> > >> >> >>>>>> (Actually when we designed Spark SQL we thought of giving it > >>

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Evan Chan
>>> name, >> >> >>>>>> like Spark Schema, but we decided to stick with SQL since that >> >> >>>>>> was >> >> >>> the >> >> >>>>> most >> >> >>>>>&

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Reynold Xin
; >> >>>>>> > >> >>>>>>> On Jan 26, 2015, at 5:31 PM, Matei Zaharia < > >> >>> matei.zaha...@gmail.com> > >> >>>>>> wrote: > >> >>>>>>> > >> >>>>>>> While it might be possible to move this concept to Sp

Re: renaming SchemaRDD -> DataFrame

2015-01-28 Thread Evan Chan
ly does require quite a bit of >> >>> the >> >>>>>> infrastructure in Spark SQL, such as query planning and columnar >> >>>> storage. >> >>>>>> The intent of Spark SQL though is to be more than a SQL server -- >> >>&

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Reynold Xin
> Matei > >>>>>>> > >>>>>>>> On Jan 26, 2015, at 4:26 PM, Koert Kuipers > >>>> wrote: > >>>>>>>> > >>>>>>>> "The context is that SchemaRDD is becoming a common data forma

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Matei Zaharia
is a library. >>>>>>> >>>>>>> Matei >>>>>>> >>>>>>>> On Jan 26, 2015, at 4:26 PM, Koert Kuipers >>>> wrote: >>>>>>>> >>>>>>>> "The context is th

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Dirceu Semighini Filho
gt; > >> "The context is that SchemaRDD is becoming a common data format >> used >> > > for >> > > > >> bringing data into Spark from external systems, and used for >> various >> > > > >> components of Spark, e.g. MLlib

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Dmitriy Lyubimov
It has been pretty evident for some time that's what it is, hasn't it? Yes that's a better name IMO. On Mon, Jan 26, 2015 at 2:18 PM, Reynold Xin wrote: > Hi, > > We are considering renaming SchemaRDD -> DataFrame in 1.3, and wanted to > get the community'

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Koert Kuipers
rk core, not sql >>> > >>> > On Mon, Jan 26, 2015 at 6:11 PM, Michael Malak < >>> > michaelma...@yahoo.com.invalid> wrote: >>> > >>> >> And in the off chance that anyone hasn't seen it yet, the Jan. 13 Bay >>> Area >&

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Reynold Xin
ground information on >> this >> >> idea (mostly from Patrick and Reynold :-). >> >> >> >> https://www.youtube.com/watch?v=YWppYPWznSQ >> >> >> >> >> >> From: Patrick Wendell >&g

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Reynold Xin
t; And in the off chance that anyone hasn't seen it yet, the Jan. 13 > > Bay > > > > Area > > > > >>> Spark Meetup YouTube contained a wealth of background information > > on > > > > this > > > > >>> idea (mostly fr

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Michael Malak
ose not immersed in data science or AI and thus may have narrower appeal. - Original Message - From: Evan R. Sparks To: Matei Zaharia Cc: Koert Kuipers ; Michael Malak ; Patrick Wendell ; Reynold Xin ; "dev@spark.apache.org" Sent: Tuesday, January 27, 2015 9:55 AM Subject: Re: renaming

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Mark Hamstra
Bay > > Area > > >> Spark Meetup YouTube contained a wealth of background information on > > this > > >> idea (mostly from Patrick and Reynold :-). > > >> > > >> https://www.youtube.com/watch?v=YWppYPWznSQ > > >> > > >>

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Koert Kuipers
> > >> From: Patrick Wendell > >> To: Reynold Xin > >> Cc: "dev@spark.apache.org" > >> Sent: Monday, January 26, 2015 4:01 PM > >> Subject: Re: renaming SchemaRDD -> DataFrame > >> > &g

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Evan R. Sparks
gt; Spark Meetup YouTube contained a wealth of background information on > this > >>> idea (mostly from Patrick and Reynold :-). > >>> > >>> https://www.youtube.com/watch?v=YWppYPWznSQ > >>> > >>> >

Re: renaming SchemaRDD -> DataFrame

2015-01-27 Thread Dirceu Semighini Filho
gt; > >> > > > >> i agree. this to me also implies it belongs in spark core, not sql > > > >> > > > >> On Mon, Jan 26, 2015 at 6:11 PM, Michael Malak < > > > >> michaelma...@yahoo.com.invalid> wrote: > > > >> > >

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Kushal Datta
t; >> > > >>> And in the off chance that anyone hasn't seen it yet, the Jan. 13 Bay > > Area > > >>> Spark Meetup YouTube contained a wealth of background information on > > this > > >>> idea (mostly from Patrick and Reynold

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Sandy Ryza
gt; Spark Meetup YouTube contained a wealth of background information on > this > >>> idea (mostly from Patrick and Reynold :-). > >>> > >>> https://www.youtube.com/watch?v=YWppYPWznSQ > >>> > >>> >

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Matei Zaharia
mostly from Patrick and Reynold :-). >> >> https://www.youtube.com/watch?v=YWppYPWznSQ >> >> >> From: Patrick Wendell >> To: Reynold Xin >> Cc: "dev@spark.apache.org" >> Sent: Monday, January 26, 2015 4:01 PM >> Subject: Re:

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Matei Zaharia
ube contained a wealth of background information on this >>> idea (mostly from Patrick and Reynold :-). >>> >>> https://www.youtube.com/watch?v=YWppYPWznSQ >>> >>> ____ >>> From: Patrick Wendell >>> To: Reynold Xin >>> Cc: "dev@s

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Koert Kuipers
tps://www.youtube.com/watch?v=YWppYPWznSQ > > > From: Patrick Wendell > To: Reynold Xin > Cc: "dev@spark.apache.org" > Sent: Monday, January 26, 2015 4:01 PM > Subject: Re: renaming SchemaRDD -> DataFrame > > > One thin

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Koert Kuipers
t;> To: Reynold Xin >> Cc: "dev@spark.apache.org" >> Sent: Monday, January 26, 2015 4:01 PM >> Subject: Re: renaming SchemaRDD -> DataFrame >> >> >> One thing potentially not clear from this e-mail, there will be a 1:1 >> correspondence wh

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Michael Malak
l To: Reynold Xin Cc: "dev@spark.apache.org" Sent: Monday, January 26, 2015 4:01 PM Subject: Re: renaming SchemaRDD -> DataFrame One thing potentially not clear from this e-mail, there will be a 1:1 correspondence where you can get an RDD to/from a DataFrame. On Mon, Jan 2

Re: renaming SchemaRDD -> DataFrame

2015-01-26 Thread Patrick Wendell
One thing potentially not clear from this e-mail, there will be a 1:1 correspondence where you can get an RDD to/from a DataFrame. On Mon, Jan 26, 2015 at 2:18 PM, Reynold Xin wrote: > Hi, > > We are considering renaming SchemaRDD -> DataFrame in 1.3, and wanted to > get the comm

renaming SchemaRDD -> DataFrame

2015-01-26 Thread Reynold Xin
Hi, We are considering renaming SchemaRDD -> DataFrame in 1.3, and wanted to get the community's opinion. The context is that SchemaRDD is becoming a common data format used for bringing data into Spark from external systems, and used for various components of Spark, e.g. MLlib's