Hi, I am interested in working on a project that takes a large number of Hive queries (as well as their meta data like amount of resources used etc) and find out common sub queries and expensive query groups etc.
Are there any existing work in this domain? Happy to collaborate as well if there are shared I interests. Zheng