Distribute by in DML is also supported by Hive.
And it is also useful for flink.
Users can use this ability to increase cache hit rate in lookup join.
And users can use "distribute by key, rand(1, 10)” to avoid data skew problem.
And I think it is another way to solve this Flip204[1]
There is alrea
Hi dev,
I am working on advancing the flip204 project, but I have noticed an error in
the SQL syntax section.
The sql syntax in the documentation is:
SELECT /*+ SHUFFLE_HASH('Customers') */ o.order_id, o.total, c.country, c.zip
FROM Orders AS o
JOIN Customers FOR SYSTEM_TIME AS OF o.proc_time
yntax is:
SELECT /*+ SHUFFLE_HASH(‘c') */ o.order_id, o.total, c.country, c.zip
FROM Orders AS o
JOIN Customers FOR SYSTEM_TIME AS OF o.proc_time AS c
ON o.customer_id = c.id;
On 2024/01/23 04:04:32 yunfan zhang wrote:
> Hi dev,
>
> I am working on advancing the flip204 project, but I h