Wondering if anyone has thoughts on a path forward for Parquet schema migrations, especially for people (like us) who are using raw Parquet files rather than Hive.
So far we've gotten away with reading old files, converting them, and writing the results to new directories, but that obviously becomes impractical once the data gets large enough, since every schema change means rewriting the whole dataset.