Fang-Yu Rao created IMPALA-14328:
------------------------------------
Summary: Produce column lineage graph when Calcite is the planner
Key: IMPALA-14328
URL: https://issues.apache.org/jira/browse/IMPALA-14328
Project: IMPALA
Issue Type: Sub-task
Components: Frontend
Reporter: Fang-Yu Rao
Assignee: Fang-Yu Rao
We found that Impala does not produce a structurally equivalent column lineage
graph when Calcite is the planner. For instance, consider the following query from
[lineage.test|https://github.com/apache/impala/blob/master/testdata/workloads/functional-query/queries/QueryTest/lineage.test].
{code:sql}
select * from (
  select tinyint_col + int_col x from functional.alltypes
  union all
  select sum(bigint_col) y from (select bigint_col from functional.alltypes) v1
) v2
{code}
We expect Impala to produce a graph with 4 vertices and 1 edge. However, when
Calcite is the planner we get only one vertex, and the single PROJECTION edge
has no sources:
{code}
{
  "edges": [
    {
      "edgeType": "PROJECTION",
      "sources": [],
      "targets": [
        0
      ]
    }
  ],
  "endTime": 1755630445,
  "hash": "3968bd65781e9e856eaca799f4501513",
  "queryId": "fb443702ac817ecc:c432854600000000",
  "queryText": "select * from ( select tinyint_col + int_col x from
functional.alltypes union all select sum(bigint_col) y from (select
bigint_col from functional.alltypes)
  "timestamp": 1755630437,
  "user": "fangyurao",
  "vertices": [
    {
      "id": 0,
      "vertexId": "X",
      "vertexType": "COLUMN"
    }
  ]
}
{code}
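To make "structurally the same" concrete, here is a minimal sketch of how the shape of two lineage graphs could be compared. It is not part of Impala's test framework: the helper name lineage_shape and the choice of fields to compare are assumptions made for illustration, and the input is a trimmed copy of the output above with the non-structural fields (including the truncated queryText) left out.

```python
import json
from collections import Counter


def lineage_shape(lineage_json: str) -> dict:
    """Summarize the structure of a lineage graph (hypothetical helper).

    Only counts and edge shapes are kept, so that two graphs can be
    compared without depending on timestamps, hashes, or query ids.
    """
    graph = json.loads(lineage_json)
    edges = graph.get("edges", [])
    return {
        "num_vertices": len(graph.get("vertices", [])),
        "num_edges": len(edges),
        "edge_types": dict(Counter(e["edgeType"] for e in edges)),
        # (number of sources, number of targets) for each edge
        "edge_arity": sorted((len(e["sources"]), len(e["targets"])) for e in edges),
    }


# Trimmed copy of the Calcite-planner output from this issue.
observed = """
{
  "edges": [
    {"edgeType": "PROJECTION", "sources": [], "targets": [0]}
  ],
  "vertices": [
    {"id": 0, "vertexId": "X", "vertexType": "COLUMN"}
  ]
}
"""

shape = lineage_shape(observed)

# Expected counts as stated in this issue: 4 vertices and 1 edge.
matches_expected = (shape["num_vertices"], shape["num_edges"]) == (4, 1)

print(shape)
print("matches expected shape:", matches_expected)
```

For the output above, the summary reports one vertex and one PROJECTION edge with zero sources, so the comparison against the expected 4 vertices and 1 edge fails.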