The article referenced below assumes a worst-case scenario for
bulk loading into hash-partitioned tables: the values being inserted
are in strictly ascending or descending order with no gaps (like a
sequence number incrementing by 1), so every partition is hit in turn
before the cycle repeats. If the values being inserted are not
strictly sequential with no gaps, performance is much better.
Obviously, which parts of the tables and indexes are in memory has a
significant effect as well.
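A quick way to see the effect is to simulate the row routing. The sketch below is my own illustration, not from the article: it assumes a simple modulo hash and equal-width ranges (PostgreSQL's actual hash partitioning uses its own hash functions with a modulus/remainder), and just counts how often consecutive rows land in a different partition.

```python
NUM_PARTITIONS = 4

def count_switches(partition_ids):
    """Count how often consecutive rows land in a different partition."""
    switches = 0
    for prev, cur in zip(partition_ids, partition_ids[1:]):
        if cur != prev:
            switches += 1
    return switches

# Strictly ascending values with no gaps -- the worst case described above.
values = list(range(1000))

# Hash partitioning (simplified as modulo): sequential values cycle
# through every partition, switching on every single row.
hash_parts = [v % NUM_PARTITIONS for v in values]

# Range partitioning: sorted values fill one partition completely
# before moving on to the next.
width = len(values) // NUM_PARTITIONS
range_parts = [min(v // width, NUM_PARTITIONS - 1) for v in values]

print(count_switches(hash_parts))   # 999: one switch per row inserted
print(count_switches(range_parts))  # 3: one switch per partition boundary
```

With gapped or unordered input, the hash-side switch count drops below one per row, which matches the point above that non-sequential data performs much better.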
Regards,
Michael Vitale
Imre Samu wrote on 6/5/2020 7:48 AM:
> "Bulk loads ...",
As I see it, there is an interesting bulk-load benchmark:
"How Bulkload performance is affected by table partitioning in
PostgreSQL" by Beena Emerson (Enterprisedb, December 4, 2019 )
SUMMARY: This article covers how benchmark tests can be used to
demonstrate the effect of table partitioning on performance. Tests
using range- and hash-partitioned tables are compared and the reasons
for their different results are explained:
1. Range partitions
2. Hash partitions
3. Combination graphs
4. Explaining the behavior
5. Conclusion
"For the hash-partitioned table, the first value is inserted in the
first partition, the second number in the second partition and so on
till all the partitions are reached before it loops back to the first
partition again until all the data is exhausted. Thus it exhibits the
worst-case scenario where the partition is repeatedly switched for
every value inserted. As a result, the number of times the partition
is switched in a range-partitioned table is equal to the number of
partitions, while in a hash-partitioned table, the number of times the
partition is switched is equal to the amount of data being inserted.
This causes the massive difference in timing for the two partition
types."
https://www.enterprisedb.com/postgres-tutorials/how-bulkload-performance-affected-table-partitioning-postgresql
Regards,
Imre