2010YOUY01 commented on issue #12114:
URL: https://github.com/apache/datafusion/issues/12114#issuecomment-2306306056

   > ### Additional context
   > We also have some great sql fuzz coverage in 
https://github.com/datafusion-contrib/datafusion-sqlancer from @2010YOUY01, but 
I think that focuses on the queries themselves, rather than the setup (block 
size, input order, etc)
   
   I agree SQLancer is not the best choice for aggregation-specific fuzzing 
(though doable), due to:
   1. It takes a lot of effort to try all possible configuration knobs on 
randomly generated data
   2. It's random SQL + random config, the randomly generated SQL will be 
complex and with deeply nested exprs, which will be hard to reduce and 
investigate
   
   So now I plan to cover more SQL features and try to find easy to identify 
and fix bugs, configuration fuzzing is less prioritized for SQLancer
   
   So I think rust-level fuzzing is better.
   
   Besides, I think we can also find some comprehensive aggregation queries to 
do some SQL level fuzzing (Fixed SQL + random config, and check under different 
config the query always gives the same result)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to