Thank you, Bjørn Jørgensen, and thanks also to Sean Owen. DataFrame with .format("jdbc") is a good way to solve this. But for some reasons I can't use the DataFrame API; I can only use the RDD API in PySpark. ...T_T...
Thanks, everyone, for your help, but I still need a new idea to solve this. XD

javaca...@163.com

From: Bjørn Jørgensen
Sent: 2022-09-19 18:34
To: javaca...@163.com
Cc: Xiao, Alton; user@spark.apache.org
Subject: Re: Re: [how to]RDD using JDBC data source in PySpark

https://www.projectpro.io/recipes/save-dataframe-mysql-pyspark
and
https://towardsdatascience.com/pyspark-mysql-tutorial-fa3f7c26dc7

On Mon, Sep 19, 2022 at 12:29, javaca...@163.com <javaca...@163.com> wrote:

Thank you for the answer, Alton. But I see that it uses Scala to implement this. I know Java/Scala can get data from MySQL using JdbcRDD fairly well, but I want to do the same in Python Spark. Could you give me more advice? Thank you very much.

javaca...@163.com

From: Xiao, Alton
Sent: 2022-09-19 18:04
To: javaca...@163.com; user@spark.apache.org
Subject: Re: [how to]RDD using JDBC data source in PySpark

Hi javacaoyu:
https://hevodata.com/learn/spark-mysql/#Spark-MySQL-Integration
I think Spark already integrates with MySQL.

From: javaca...@163.com <javaca...@163.com>
Date: Monday, September 19, 2022 17:53
To: user@spark.apache.org <user@spark.apache.org>
Subject: [how to]RDD using JDBC data source in PySpark

Hi guys:
Is there some way to let an RDD use a JDBC data source in PySpark?
I want to get data from MySQL, but PySpark does not support JdbcRDD the way Java/Scala does, and I searched the docs on the web site and found no answer.
So I need your help. Thank you very much.

javaca...@163.com

--
Bjørn Jørgensen
Vestre Aspehaug 4, 6010 Ålesund
Norge
+47 480 94 297
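
One possible RDD-only workaround for the question above, offered as a rough sketch and not as an official PySpark feature: split a numeric key range into sub-ranges (similar in spirit to the lowerBound/upperBound/numPartitions arguments of Scala's JdbcRDD) and let each Spark task open its own connection with a plain Python DB-API driver. All names here (`split_range`, `make_fetcher`, `run_local_demo`, the `users` table) are hypothetical. The runnable part uses the standard-library `sqlite3` module as a stand-in for MySQL so it works without a cluster; using a driver such as `pymysql` on the executors is an assumption and is shown only in comments.

```python
import os
import sqlite3
import tempfile
from typing import Callable, Iterator, List, Tuple


def split_range(lower: int, upper: int, num_partitions: int) -> List[Tuple[int, int]]:
    """Split the inclusive key range [lower, upper] into contiguous inclusive sub-ranges."""
    length = upper - lower + 1
    bounds = []
    for i in range(num_partitions):
        start = lower + (i * length) // num_partitions
        end = lower + ((i + 1) * length) // num_partitions - 1
        if start <= end:  # skip empty slices when there are more partitions than keys
            bounds.append((start, end))
    return bounds


def make_fetcher(connect: Callable[[], object], query: str):
    """Return a function that reads one key sub-range over its own connection.

    `connect` must return a DB-API connection. `query` needs two placeholders
    for the lower and upper bound ("?" for sqlite3; "%s" for pymysql).
    """
    def fetch(bounds: Tuple[int, int]) -> Iterator[tuple]:
        conn = connect()  # opened where the task runs, not on the driver
        try:
            cur = conn.cursor()
            cur.execute(query, bounds)
            for row in cur.fetchall():
                yield row
        finally:
            conn.close()
    return fetch


def run_local_demo() -> List[tuple]:
    """Exercise the helpers against a temporary SQLite file, without Spark."""
    db_path = os.path.join(tempfile.mkdtemp(), "demo.db")
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany("INSERT INTO users VALUES (?, ?)",
                     [(i, f"name-{i}") for i in range(1, 11)])
    conn.commit()
    conn.close()

    fetch = make_fetcher(lambda: sqlite3.connect(db_path),
                         "SELECT id, name FROM users WHERE id >= ? AND id <= ?")
    ranges = split_range(1, 10, 3)
    # Plain-Python equivalent of sc.parallelize(ranges, len(ranges)).flatMap(fetch)
    return sorted(row for bounds in ranges for row in fetch(bounds))


if __name__ == "__main__":
    print(run_local_demo())

# With Spark and MySQL (assumption: pymysql is installed on every executor):
#   import pymysql
#   connect = lambda: pymysql.connect(host="...", user="...", password="...", database="...")
#   fetch = make_fetcher(connect, "SELECT id, name FROM users WHERE id >= %s AND id <= %s")
#   ranges = split_range(1, 1000000, 8)
#   rdd = sc.parallelize(ranges, len(ranges)).flatMap(fetch)
```

Each sub-range becomes one element of the parallelized list, so each task issues one bounded query. This only balances well when the key column is numeric and roughly evenly distributed, and every task opens its own database connection, so the number of partitions should stay within what the MySQL server can accept.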