[New Project] sparksql-ml : Distributed Machine Learning using SparkSQL.

2023-02-27 Thread Chitral Verma
inputCol='raw', outputCol='filtered') AND WRITE AT LOCATION '/path/to/test-transformer' But a lot more can be done with this library. I was wondering if any of you find this interesting and would like to contribute to the project here, https://github.com/chitralverma/sparksql-ml Regards, Chitral Verma

Re: Slack for Spark Community: Merging various threads

2023-04-10 Thread Chitral Verma
Hi all, Thanks for starting a discussion on this super-important topic. I'm not sure if this is already considered, but Discord is also a viable option and many many open-source projects and communities are using it. - It's *mostly* free with no online user limitations like slack. - Has a b

Re: [SparkSQL, SparkUI, RESTAPI] How to extract the WholeStageCodeGen ids from SparkUI

2023-04-11 Thread Chitral Verma
try explain codegen on your DF and then pardee the string On Fri, 7 Apr, 2023, 3:53 pm Chenghao Lyu, wrote: > Hi, > > The detailed stage page shows the involved WholeStageCodegen Ids in its > DAG visualization from the Spark UI when running a SparkSQL. (e.g., under > the link > node:18088/histor

Query regarding stateless aggregations

2019-11-28 Thread Chitral Verma
(" ... ") ? Also posted here, https://stackoverflow.com/questions/59050663/is-it-possible-to-do-stateless-aggregations-using-spark-sql Any helps will be appreciated. Regards, Chitral Verma

Re: [VOTE] SPARK 2.4.0 (RC5)

2018-10-31 Thread Chitral Verma
+1 On Wed, 31 Oct 2018 at 11:56, Reynold Xin wrote: > +1 > > Look forward to the release! > > > > On Mon, Oct 29, 2018 at 3:22 AM Wenchen Fan wrote: > >> Please vote on releasing the following candidate as Apache Spark version >> 2.4.0. >> >> The vote is open until November 1 PST and passes if

[Discussion] Clarification regarding Stateful Aggregations over Structured Streaming

2018-12-16 Thread Chitral Verma
Hi Devs, For quite some time i've been looking at the structured streaming API to solve lots of use cases at my workplace, I've have some doubts I wanted to clarify regarding stateful aggregations over structured streaming. Currently, spark provides flatMapGroupWithState (FMGWS) / mapGroupWithSta

Re: [Discussion] Clarification regarding Stateful Aggregations over Structured Streaming

2018-12-16 Thread Chitral Verma
FMGWS > API (others could provide more info), but I was also looking at the API > recently thinking of an API for this: > https://issues.apache.org/jira/browse/SPARK-16738 > > Best, > Stavros > > On Sun, Dec 16, 2018 at 7:58 PM Chitral Verma > wrote: > >> Hi Devs