performance impact on insert for column sorting?
Hello, Does anyone have a rough idea of the performance impact of column sorting on insertion for a table that uses parquet format? Thanks, James
External Table Creation is slow/hangs
I have a dataset up on S3 in partitioned folders. I'm trying to create an external hive table pointing to the location of that data. The table schema is set up to have the column partitions matching how the folders are set up on S3. I've done this quite a few times successfully, but when the data