In my understanding, the slower performance of PostgreSQL is a known behavior 
for write intensive applications. PGBouncer used for connection pooling cannot 
change/improve that. And Airflow with multiple DAGs and/or dynamic tasks with 
heavy workload will be write intensive. I have done extensive tests and have 
collected database statistics that show the bottleneck. I can share the details 
if you would like to see. I am wondering if there are any load tests or 
comparisons done so far by anyone in the community. Unless I am doing something 
totally wrong, PostgreSQL actually does not seem to be a right choice at all; 
slower performance and higher cost (due to PGBouncer process) compared to MySQL 

Jigar

> On Oct 14, 2024, at 2:41 AM, Jarek Potiuk <ja...@potiuk.com> wrote:
> 
> MySQL is not going away. You can use it if you want. We have no plans to
> remove it.
> 
> The advice did not change. Postgres is generally more stable that's why we
> recommend it. MySQL has much worse locking behaviour that is somewhat
> unpredictable and - especially when you use backfills - it is known to
> generate occasional deadlocks. This is likely why you got the advice (quite
> likely by me).
> 
> But in general it's fully supported and works and we have no plans to get
> rid of it.
> 
> Re: cost/price - it's up to you to choose and pay for services you use - we
> do not compare pricing of all possible available services.
> 
> Also if you follow the best practices for Postgres (use pgbouncer:
> https://airflow.apache.org/docs/apache-airflow/stable/howto/set-up-database.html#setting-up-a-postgresql-database),
> it's unlikely you will have slower postgres with similar machine/setup.
> 
> J.
> 
> 
>> On Mon, Oct 14, 2024 at 7:38 AM Jigar Parekh <ji...@vizeit.com> wrote:
>> 
>> Back in June’24, I had a discussion on Slack about a database issue. My
>> database backend for the Airflow instance is MySQL. It was recommended to
>> migrate to PostgreSQL to resolve such issues. I was also told that MySQL
>> may not be supported in the future versions. I configured PostgreSQL and
>> performed few tests to compare both the DBs with the type of heavy workload
>> I was expecting for my airflow instance in the production environment. The
>> test results did not show a reason to switch from MySQL to PostgreSQL. In
>> fact, PostgreSQL performed slower and airflow configuration cost more
>> compared to MySQL. I wanted to start this discussion to find out if others
>> have any similar observations about Postgres and what is Airflow community
>> planning to do about MySQL support in the future versions?
>> 
>> Jigar
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
>> For additional commands, e-mail: dev-h...@airflow.apache.org
>> 
>> 

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@airflow.apache.org
For additional commands, e-mail: dev-h...@airflow.apache.org

Reply via email to