Re: Clustering and Large-scale analysis of Hive Queries

2018-07-26 Thread Thai Bui
I'm not aware of any project specifically tuned for Hive that does what you described. I ran into this problem recently as the number of users and queries grew exponentially at my company, so I'm interested. We are currently collecting Hive internal metrics in order to do certain analyses (don’t know

Parquet schema evolution, column conversion not supported

2018-07-26 Thread Patrick Duin
I'm encountering errors in Hive 2.3.2 when reading sets of Parquet files whose schema has evolved. The error I'm seeing is: Failed with exception java.io.IOException:java.lang.RuntimeException: Hive internal error: conversion of string to arraynot supported yet. My schema has a top-level co