Re: [DISCUSS][SQL] Control the number of output files

2018-07-25 Thread lukas nalezenec
Hi, Yes, This feature is planned - Spark should be soon able to repartition output by size. Lukas Dne st 25. 7. 2018 23:26 uživatel Forest Fang napsal: > Has there been any discussion to simply support Hive's merge small files > configuration? It simply adds one additional stage to inspect size

Re: [DISCUSS][SQL] Control the number of output files

2018-08-06 Thread lukas nalezenec
ke to follow it's activity. >> thanks! >> koert >> >> On Wed, Jul 25, 2018 at 5:32 PM, lukas nalezenec >> wrote: >> >>> Hi, >>> Yes, This feature is planned - Spark should be soon able to repartition >>> output by size. >>> L