How to improve the concurrent query performance of spark SQL query

2021-08-26 Thread Tao Li
In high-concurrency scenarios, the query performance of Spark SQL is limited by the NameNode and the Hive Metastore. There are some caches in the code, but their effect is limited. Is there a practical and effective way to reduce the time the driver spends on concurrent queries?

Question about impersonation on Spark executor

2016-09-14 Thread Tao Li
Hi, I am new to Spark and have a quick question about end-user impersonation in the Spark executor process. Basically, I am running SQL queries through the Spark Thrift Server with doAs set to true to enable end-user impersonation. In my experiment, I was able to start session for mul

Quick question about hive-exec 1.2.1.spark2

2016-08-03 Thread Tao Li
Hi, The spark-hive module has a dependency on hive-exec module (a custom built module from "Hive on Spark" project). Can someone point me to the source code repo of the hive-exec module? Thanks. Here is the maven repo link: https://mvnrepository.com/artifact/org.spark-project.hive/hive-exec/1.