Re: [D] EDW and Data marts [cloudberry]

via GitHub Mon, 10 Mar 2025 22:41:29 -0700


GitHub user 2shar-p added a comment to the discussion: EDW and Data marts


If by "segmentation" you mean the distribution of data across all the 
segments, then the short answer is yes. 
Every table is distributed by a key, which is either a single column or a 
group of columns. You can also choose to distribute randomly, which still 
spreads each table's data across all segments. This applies to every table in 
a Cloudberry deployment: in each database and each schema, every table and 
every partition that holds data is distributed across all segments, and each 
segment has its own portion of the data. Alternatively, you can make a table 
replicated instead of distributed, in which case a full copy of the table's 
data is stored on each and every segment. Either way, data lives on every 
segment of Cloudberry's "shared nothing MPP cluster". Happy to chat more; I 
am on Cloudberry Slack, mostly available during US hours. 
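
To make the three placement options above concrete, here is a small toy model 
in Python. It is only a sketch of the idea, not how Cloudberry is implemented: 
the segment count, the MD5-based hash, the function names, and the sample 
rows are all assumptions made up for illustration, and the real system uses 
its own internal hashing.

```python
import hashlib
import random

NUM_SEGMENTS = 4  # hypothetical cluster size, chosen only for this sketch


def segment_for_key(key_values, num_segments=NUM_SEGMENTS):
    """Map a distribution key (a tuple of column values) to a segment index.

    MD5 is used purely as a stable stand-in hash; it is NOT the hash
    function Cloudberry actually uses.
    """
    digest = hashlib.md5(repr(key_values).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_segments


def distribute_by_key(rows, key_columns, num_segments=NUM_SEGMENTS):
    """Key distribution: rows with equal key values land on the same segment."""
    segments = {s: [] for s in range(num_segments)}
    for row in rows:
        key = tuple(row[c] for c in key_columns)
        segments[segment_for_key(key, num_segments)].append(row)
    return segments


def distribute_randomly(rows, num_segments=NUM_SEGMENTS, seed=0):
    """Random distribution: each row goes to an arbitrary segment."""
    rng = random.Random(seed)
    segments = {s: [] for s in range(num_segments)}
    for row in rows:
        segments[rng.randrange(num_segments)].append(row)
    return segments


def replicate(rows, num_segments=NUM_SEGMENTS):
    """Replicated table: every segment holds a full copy of the rows."""
    return {s: list(rows) for s in range(num_segments)}


# Hypothetical sample table: 1000 orders spread over 7 customers.
rows = [{"order_id": i, "customer_id": i % 7} for i in range(1000)]

by_key = distribute_by_key(rows, ["customer_id"])
by_random = distribute_randomly(rows)
replicated = replicate(rows)

for name, layout in [("key", by_key), ("random", by_random),
                     ("replicated", replicated)]:
    print(name, {seg: len(part) for seg, part in layout.items()})
```

In this model, key and random distribution each store every row exactly once 
across the cluster, while replication stores every row once per segment. Key 
distribution additionally guarantees that rows sharing a key value sit on the 
same segment.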

GitHub link: 
https://github.com/apache/cloudberry/discussions/987#discussioncomment-12457564

----
This is an automatically sent email for dev@cloudberry.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@cloudberry.apache.org


