In the high concurrency scenario, the query performance of spark SQL is limited
by namenode and hive Metastore. There are some caches in the code, but the
effect is limited. Do we have a practical and effective way to solve the
time-consuming problem of driver in concurrent query?
-
Hi,
I am new to Spark and would like to have a quick question about the end user
impersonation on Spark executor process.
Basically I am running SQL queries through Spark thrift server with doAs set to
true to enable end user impersonation. In my experiment, I was able to start
session for mul
Hi,
The spark-hive module has a dependency on hive-exec module (a custom built
module from "Hive on Sparkā project). Can someone point me to the source code
repo of the hive-exec module? Thanks.
Here is the maven repo link:
https://mvnrepository.com/artifact/org.spark-project.hive/hive-exec/1.