[jira] [Commented] (FLINK-18835) sql using group by, duplicated group fileld appears

YHF (Jira) Tue, 11 Aug 2020 03:10:22 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-18835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17175431#comment-17175431
 ]


YHF commented on FLINK-18835:
-----------------------------

[~jark] A is the field name and 2 is the value. For the code, see the attached 
SumAnalysis.java and the result data. The result data covers only one time window.

YdScanTmp is the source type and GroupTmp is the result type.

The result only has rows whose cmtInf value is 2, but the source data has rows 
whose cmtInf value is 0 or 1.

I found that using a DataSet inside a stream time window returns the correct 
data: apply a time window on the DataStream, create a DataSet from each window's 
data in the apply function, then run the SQL and collect the result list. 
However, I do not think this is an appropriate approach.
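
The workaround above amounts to treating each window's records as a bounded 
batch and grouping them once. A minimal sketch of that idea in plain Java, with 
no Flink dependency: the Scan class and its weight field are hypothetical 
stand-ins for YdScanTmp (the real fields are in the attachment), and 
groupWindow is an illustrative name, not a Flink API.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class WindowBatchGroupBy {
    // Hypothetical stand-in for the YdScanTmp source type (assumed fields).
    static final class Scan {
        final int scanType;
        final int scanSite;
        final int cmtInf;
        final double weight;

        Scan(int scanType, int scanSite, int cmtInf, double weight) {
            this.scanType = scanType;
            this.scanSite = scanSite;
            this.cmtInf = cmtInf;
            this.weight = weight;
        }
    }

    // Groups the records of ONE window by (scanType, scanSite, cmtInf)
    // and sums the weight, so each group key appears exactly once.
    static Map<List<Integer>, Double> groupWindow(List<Scan> window) {
        Map<List<Integer>, Double> sums = new LinkedHashMap<>();
        for (Scan s : window) {
            List<Integer> key = Arrays.asList(s.scanType, s.scanSite, s.cmtInf);
            sums.merge(key, s.weight, Double::sum);
        }
        return sums;
    }

    public static void main(String[] args) {
        List<Scan> window = Arrays.asList(
                new Scan(97, 14, 0, 1.0),
                new Scan(97, 14, 0, 2.5),
                new Scan(97, 14, 1, 4.0));
        // One entry per distinct (scanType, scanSite, cmtInf) key.
        System.out.println(groupWindow(window));
    }
}
```

Because the whole window is available before grouping, no intermediate 
updates are emitted; the cost is that results only appear when the window closes.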

 

> sql using group by, duplicated group fileld appears
> ---------------------------------------------------
>
>                 Key: FLINK-18835
>                 URL: https://issues.apache.org/jira/browse/FLINK-18835
>             Project: Flink
>          Issue Type: Bug
>          Components: Table SQL / Planner
>    Affects Versions: 1.11.1
>            Reporter: YHF
>            Priority: Critical
>         Attachments: SumAnalysis.java, result.txt
>
>
> The data source is Kafka. I create a temporary view and group by 
> (fieldA, fieldB) using SQL,
> then convert the result table to a DataStream using toRetractStream and 
> print the result.
> I find duplicated (fieldA, fieldB) groups.
> See the attachment for the code.
> The query groups by (scanType, scanSite, cmtInf), but the result is below:
> (true,Otm\{, scanType=97, scanSite=14, cmtInf=24,jp=1.000000000000000000, 
> db=0E-18, dbjp=1.000000000000000000, pjWei=27.070000000000000000, 
> dbWei=0E-18, mintime=2020-07-29 11:33:57.679, maxtime=2020-07-29 
> 11:33:57.679})
> 3> (true,Otm\{, scanType=97, scanSite=14, cmtInf=24,jp=1.000000000000000000, 
> db=0E-18, dbjp=1.000000000000000000, pjWei=27.070000000000000000, 
> dbWei=0E-18, mintime=2020-07-29 11:33:57.679, maxtime=2020-07-29 
> 11:33:57.679})
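
toRetractStream encodes every change as a (flag, row) pair, where true marks 
an added row and false marks a retracted one. One way to check output like the 
above is to replay the messages and count how many rows stay live per group 
key. The sketch below is plain Java with hypothetical names (liveRowsPerKey, 
duplicatedKeys) and a string key assumed to be built from scanType, scanSite 
and cmtInf; it is not part of Flink or of the attached code.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class RetractStreamCheck {
    // Replays (isAdd, groupKey) messages in order and returns the number
    // of live rows per group key: +1 for an add, -1 for a retract.
    static Map<String, Integer> liveRowsPerKey(List<Map.Entry<Boolean, String>> messages) {
        Map<String, Integer> live = new LinkedHashMap<>();
        for (Map.Entry<Boolean, String> m : messages) {
            int delta = m.getKey() ? 1 : -1;
            live.merge(m.getValue(), delta, Integer::sum);
        }
        return live;
    }

    // Group keys that end up with more than one live row, i.e. an add
    // that was not preceded by a retract of the earlier row.
    static List<String> duplicatedKeys(List<Map.Entry<Boolean, String>> messages) {
        List<String> dups = new ArrayList<>();
        for (Map.Entry<String, Integer> e : liveRowsPerKey(messages).entrySet()) {
            if (e.getValue() > 1) {
                dups.add(e.getKey());
            }
        }
        return dups;
    }

    public static void main(String[] args) {
        List<Map.Entry<Boolean, String>> msgs = new ArrayList<>();
        // Two add messages for the same key, as in the reported output.
        msgs.add(new SimpleEntry<>(true, "97|14|24"));
        msgs.add(new SimpleEntry<>(true, "97|14|24"));
        System.out.println(duplicatedKeys(msgs)); // prints [97|14|24]
    }
}
```

If a false message for the key arrives between the two true messages, the 
count returns to 1 and the key is not flagged.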



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
