Re: Time to cut an Apache 2.4.1 release?

2019-02-12 Thread Dongjin Lee
ably would not be >> > back-ported to 2.4.x. SPARK-26154 does look like a bug whose fix could >> > be back-ported, but that's a big change. I wouldn't hold up 2.4.1 for >> > it, but it could go in if otherwise ready. >> > >> > >> > On Mon,

Re: Time to cut an Apache 2.4.1 release?

2019-02-11 Thread Dongjin Lee
>> >> Thanks! >> >> --------- >> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org >> >> -- *Dongjin Lee* *A hitchhiker in the mathematical world.* *github: <http://goog_969573159/>

Re: Ask for reviewing on Structured Streaming PRs

2018-12-12 Thread Dongjin Lee
> to be reviewed, too. > > Thanks in advance, > Jungtaek Lim (HeartSaVioR) > > 1. > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr+author%3AHeartSaVioR+%5BSS%5D > 2. > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr

Re: welcome a new batch of committers

2018-10-03 Thread Dongjin Lee
> >>> >>> >>> >>> >>> -- >>> Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ >>> >>> - >>> To unsubscribe e-mail: dev-unsubscr...@spar

Re: from_csv

2018-09-19 Thread Dongjin Lee
ning back result of parsing and >>>> original dataset by positions is expensive and not convenient. Instead >>>> users parse CSV columns by string functions. The approach is usually error >>>> prone especially for quoted values and other special cases. >>>>

Re: MatrixUDT and VectorUDT in Spark ML

2018-05-30 Thread Dongjin Lee
ation or distribution of this message or of > any attachment is prohibited. If you have received this communication in > error, please notify us by mail and immediately destroy the > message and any attachments. Thank you. > -- > > -- *Dongjin Lee* *A hitchhiker

Re: Welcoming Hyukjin Kwon and Sameer Agarwal as committers

2017-08-08 Thread Dongjin Lee
------------- >> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org >> >> -- *Dongjin Lee* *A hitchhiker in the mathematical world.* *github: <http://goog_969573159/>github.com/dongjinleekr <http://gi

Missing config property in documentation

2017-05-03 Thread Dongjin Lee
^1]: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/TaskResultGetter.scala -- *Dongjin Lee* *A hitchhiker in the mathematical world. facebook: www.facebook.com/dongjin.lee.kr <http://www.facebook.com/dongjin.lee.kr> linkedin: kr.linkedin.com/in/do

Re: Issues: Generate JSON with null values in Spark 2.0.x

2017-03-21 Thread Dongjin Lee
, 2017 at 4:44 PM, Chetan Khatri wrote: > Exactly. > > On Sat, Mar 11, 2017 at 1:35 PM, Dongjin Lee wrote: > >> Hello Chetan, >> >> Could you post some code? If I understood correctly, you are trying to >> save JSON like: >> >> { >> "firs

Re: Spark Local Pipelines

2017-03-13 Thread Dongjin Lee
rmations without depending on Spark at all. >> >> I've written up a doc >> <https://docs.google.com/document/d/1Ha4DRMio5A7LjPqiHUnwVzbaxbev6ys04myyz6nDgI4/edit?usp=sharing> >> detailing the approach, and I'm happy to discuss alternatives. If this >>

Re: Issues: Generate JSON with null values in Spark 2.0.x

2017-03-11 Thread Dongjin Lee
> we generate JSON with null values as a DataFrame? > > Thanks. > -- *Dongjin Lee* *Software developer in Line+. So interested in massive-scale machine learning. facebook: www.facebook.com/dongjin.lee.kr <http://www.facebook.com/dongjin.lee.kr> linkedin: kr.linkedin.com/in/dongj

Re: GraphX-related "open" issues

2017-01-19 Thread Dongjin Lee
tps://spark-packages.org/)? ISTM graphx has already necessary > and sufficient APIs for these third-party ones. > > On Thu, Jan 19, 2017 at 12:21 PM, Dongjin Lee wrote: > >> Hi all, >> >> I am currently working on SPARK-15880[^1] and also have some interest >>

Re: GraphX-related "open" issues

2017-01-18 Thread Dongjin Lee
s about "Improvement" and "New >> Feature" for now. >> >> I'm just wondering about what kind of field value you want to fill in the >> `Resolution` field for those issues. >> >> Maybe, 'Later'? Or, 'Won't Fix'? >>

Re: Question about SPARK-11374 (skip.header.line.count)

2016-12-08 Thread Dongjin Lee
> Is it needed for your use cases? Or is it enough for you to remove them in a > preprocessing stage? > If this is too old and no longer appropriate, I'll close the PR and > JIRA issue as WON'T FIX. > > Thank you all in advance! > > Bests, > Dong