subject:"Schema Evolution in Apache Spark"

Re: Schema Evolution in Apache Spark

2018-01-12 Thread Dongjoon Hyun

This is about Spark-layer test cases on **read-only** CSV, JSON, Parquet, ORC files. You can find more details and comparisons in terms of Spatk support coverage. Bests, Dongjoon. On Thu, Jan 11, 2018 at 22:19 Georg Heiler wrote: > Isn't this related to the data format used, i.e. parquet, Avro

Re: Schema Evolution in Apache Spark

2018-01-11 Thread Georg Heiler

Isn't this related to the data format used, i.e. parquet, Avro, ... which already support changing schema? Dongjoon Hyun schrieb am Fr., 12. Jan. 2018 um 02:30 Uhr: > Hi, All. > > A data schema can evolve in several ways and Apache Spark 2.3 already > supports the followings for file-based data

Schema Evolution in Apache Spark

2018-01-11 Thread Dongjoon Hyun

Hi, All. A data schema can evolve in several ways and Apache Spark 2.3 already supports the followings for file-based data sources like CSV/JSON/ORC/Parquet. 1. Add a column 2. Remove a column 3. Change a column position 4. Change a column type Can we guarantee users some schema evolution covera

Re: Schema Evolution in Apache Spark

Re: Schema Evolution in Apache Spark

Schema Evolution in Apache Spark

3 matches

Site Navigation

Mail list logo

Footer information