Andrey Mashenkov created IGNITE-24995: -----------------------------------------
Summary: Sql. Rework correlates serialization and propagation to another node. Key: IGNITE-24995 URL: https://issues.apache.org/jira/browse/IGNITE-24995 Project: Ignite Issue Type: Improvement Components: sql Affects Versions: 3.0 Reporter: Andrey Mashenkov As for now, a SharedState class for storing correlates in execution context and is used by CorrelatedNestedLoopJoinNode (CNLJN) execution node. Seems, CorrelatedNestedLoopJoinNode was designed to use batching for correlates variables, to transfer many rows at a time, but implemented in wrong way, and this just don't work. There are few related issues 1. The class implements Serializable interface and can be transferred to another node. This causes using DefaultUserObjectMarshaller for class serialization in messaging system. Despite the SharedState class contains BinaryTuple objects, they are not converted to byte[] during serialization, which is ineffective. Maybe making it Externalizable could mitigate the issue. 2. We don't need to put a whole sql row to a correlate variable, but only required row columns(projection) to reduce network pressure. It is important that all the nodes creates the same projection for the same correlate. 3. We should fix the SharedState class to make batching possible, by allowing set multiple rows for the same correlate id. Most likely, we must keep correlates hierarchy order to preserve CNLJN collation. Correlate id number doesn't have this guarantee) in case of more than one correlate. Let's improve SharedState class structure, and fix/drop broken batching, fix messaging serialization issue. -- This message was sent by Atlassian Jira (v8.20.10#820010)