[ https://issues.apache.org/jira/browse/FLINK-18835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17175431#comment-17175431 ]
YHF commented on FLINK-18835: ----------------------------- [~jark] A is field name,2 is value, code see attachment SumAnalysis.java and result data ,the result data is only one time window, YdScanTmp is source type,GroupTmp is result type result only has cmtInf which value is 2,but the source data has cmtInf which value is 0 or 1. I found that I can use dataset in stream time window, dataset will return the correct data. use time window of data stream, create dataset from data of each window in apply function, then run sql and collect result list. but I think this is not a appropriate way. > sql using group by, duplicated group fileld appears > --------------------------------------------------- > > Key: FLINK-18835 > URL: https://issues.apache.org/jira/browse/FLINK-18835 > Project: Flink > Issue Type: Bug > Components: Table SQL / Planner > Affects Versions: 1.11.1 > Reporter: YHF > Priority: Critical > Attachments: SumAnalysis.java, result.txt > > > datasource is kafka,then create a temporary view, group by (fieldA,fieldB) > using sql, > then transform the result table to datastream using toRetractStream, then > print the result, > I find duplicated (fieldA,fieldB) > see attachment for code > group by(scanType,scanSite,cmtInf),but result is below > (true,Otm\{, scanType=97, scanSite=14, cmtInf=24,jp=1.000000000000000000, > db=0E-18, dbjp=1.000000000000000000, pjWei=27.070000000000000000, > dbWei=0E-18, mintime=2020-07-29 11:33:57.679, maxtime=2020-07-29 > 11:33:57.679}) > 3> (true,Otm\{, scanType=97, scanSite=14, cmtInf=24,jp=1.000000000000000000, > db=0E-18, dbjp=1.000000000000000000, pjWei=27.070000000000000000, > dbWei=0E-18, mintime=2020-07-29 11:33:57.679, maxtime=2020-07-29 > 11:33:57.679}) -- This message was sent by Atlassian Jira (v8.3.4#803005)