By default, Spark uses Apache Derby (running in embedded mode, with store
content kept in local files) to host the Hive metastore. You can
externalize the metastore to a JDBC-compliant database (e.g.,
PostgreSQL) and rely on the authentication provided by that
database. The JDBC configuration must be defined in a hive-site.xml
file in the Spark conf directory. See the metastore admin guide
for more details, including an init script for setting up your metastore
(https://cwiki.apache.org/confluence/display/Hive/AdminManual+Metastore+3.0+Administration).
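As a sketch, a minimal hive-site.xml pointing the metastore at a PostgreSQL database could look like the following (the hostname, database name, and credentials are placeholders, not values from this thread):

```xml
<configuration>
  <!-- JDBC URL of the external metastore database (placeholder host/db) -->
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <value>jdbc:postgresql://metastore-host:5432/metastore_db</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <value>org.postgresql.Driver</value>
  </property>
  <!-- Credentials checked by the database's own authentication -->
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hive</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>changeme</value>
  </property>
</configuration>
```

The PostgreSQL JDBC driver jar must also be on Spark's classpath for this to work.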
On 10/20/22 4:31 AM, second_co...@yahoo.com.INVALID wrote:
Currently my pyspark code is able to connect to the hive metastore at port
9083. However, using this approach I can't put in place any security
mechanism like LDAP or sql authentication control. Is there any way to
connect from pyspark to the spark thrift server on port 10000 without
exposing the hive metastore url to pyspark? I would like to
authenticate users before allowing them to execute spark sql, and users
should only be allowed to query the databases and tables that they have
access to.
Thank you,
comet