Re: [Proposal] Modification to Spark's Semantic Versioning Policy

2020-03-05 Thread Dongjoon Hyun
Hi, All. There is a on-going Xiao's PR referencing this email. https://github.com/apache/spark/pull/27821 Bests, Dongjoon. On Fri, Feb 28, 2020 at 11:20 AM Sean Owen wrote: > On Fri, Feb 28, 2020 at 12:03 PM Holden Karau > wrote: > >> 1. Could you estimate how many revert commits are req

Re: [DISCUSS] Remove multiple workers on the same host support from Standalone backend

2020-03-05 Thread Prashant Sharma
It was by design, one could run multiple workers on his laptop for trying out or testing spark in distributed mode, one could launch multiple workers and see how resource offers and requirements work. Certainly, I have not commonly seen, starting multiple workers on the same node as a practice so f

Re: Datasource V2 support in Spark 3.x

2020-03-05 Thread Wenchen Fan
Data Source V2 has evolved to Connector API which supports both data (the data source API) and metadata (the catalog API). The new APIs are under package org.apache.spark.sql.connector You can keep using Data Source V1 as there is no plan to deprecate it in the near future. But if you'd like to t

Re: comparable and orderable CalendarInterval

2020-03-05 Thread Enrico Minack
There is another feature missing for CalendarInterval, which is related to comparability: measure the length of an interval. Would be nice if you could access the length of an interval, than you could compute something like this: |Seq((Timestamp.valueOf("2020-02-01 12:00:00"), Timestamp.valu