Hello, The blog post is informative but examines the performance of one single query thus it may not be enough to justify a change in the default configuration. If there are more comprehensive benchmarks that show that changing the default value of hive.optimize.reducededuplication is beneficial for the overall workload then we can definitely consider this option.
Best, Stamatis On Mon, Dec 30, 2024 at 7:12 AM lisoda <lis...@yeah.net> wrote: > Hello Sir. > > I was just trying to reference the relevant mailing list links when > submitting [HIVE-28681] Default to disable > hive.optimize.reducededuplication - ASF JIRA > <https://issues.apache.org/jira/browse/HIVE-28681>. (I pin old historical > emails by replying to them and then copy their web links.) However, whether > we should default to disabling reducededuplication is a matter that needs > to be discussed; perhaps it should be enabled by default, but the default > parameter settings related to reducededuplication are not quite appropriate. > > Furthermore, since the authors of the article are Seonggon Namgung and > Sungwoo Park, I believe we should seek their opinions. > > Seonggon Namgung > <https://issues.apache.org/jira/secure/ViewProfile.jspa?name=seonggon> & > Sungwoo > Park <https://issues.apache.org/jira/secure/ViewProfile.jspa?name=glapark>, > what do you think? > > -lisoda > > > > > 在 2024-12-30 04:44:04,"Ayush Saxena" <ayush...@gmail.com> 写道: > > If it is related to Hive we can get it posted on the Hive’s Twitter page > as well, > > If you say so & share us what you want to write along with the link to the > blog > > https://x.com/apachehive > <https://x.com/apachehive?s=21&t=Uk-WvdFllQb3Flaq-uxsLA> > > -Ayush > > On 28 Dec 2024, at 4:43 PM, lisoda <lis...@yeah.net> wrote: > > see again > > > ---- Replied Message ---- > From lisoda<lis...@yeah.net> <lis...@yeah.net> > Date 12/24/2023 00:28 > To user<user@hive.apache.org> <user@hive.apache.org> > Cc > Subject Re: Blog article 'Performance Tuning for Single-table Queries' > > > > ---- Replied Message ---- > From Sungwoo Park<glap...@gmail.com> <glap...@gmail.com> > Date 12/24/2023 00:06 > To user@hive.apache.org > Cc > Subject Blog article 'Performance Tuning for Single-table Queries' > Hello Hive users, > > I have published a new blog article 'Performance Tuning for Single-table > Queries'. It shows how to change configuration parameters of Hive and Tez > in order to make simple queries run faster than Spark. Although it > uses Hive on MR3, the technique equally applies to Hive on Tez and > Hive-LLAP. > > https://www.datamonad.com/post/2023-12-23-optimize-bi-1.8/ > > Hope you find it useful. > > Cheers, > > --- Sungwoo > >