+1, to have a checkstyle build. I am strongly against doing that big
refactor to make just checkstyle happy, such a refactor will make
backports to Hive lower branches tough and the life of folks
maintaining downstream forks quite painful.

We should enforce same kind of stuff like in Tez/Hadoop, where
checkstyle violations are highlighted and the committer before
committing can check that & decide whether that in unavoidable or not

-Ayush

On Mon, 8 Jan 2024 at 14:05, László Bodor <bodorlaszlo0...@gmail.com> wrote:
>
> thanks for the responses so far!
> I'm a bit against the one-time huge refactor commit as we don't need that
> (but I can be convinced of course), because checkstyle can be set up to
> warn only on style issues in the new/touched bits in the PR (or at least
> that's how it works in tez), that's what we need, so we don't have to make
> that huge commit to simply introduce this enforcement
>
> Butao Zhang <butaozha...@163.com> ezt írta (időpont: 2024. jan. 8., H,
> 9:28):
>
> > +1
> >
> >
> >
> > BTW, We have a independent checkstyle file under iceberg module
> > https://github.com/apache/hive/tree/master/iceberg/checkstyle . I think
> > we need to consider unifing the checkstyle in all the sub-module.
> >
> >
> > Thanks,
> > Butao Zhang
> > ---- Replied Message ----
> > | From | Zsolt Miskolczi<zsolt.miskol...@gmail.com> |
> > | Date | 1/8/2024 16:19 |
> > | To | <dev@hive.apache.org> |
> > | Subject | Re: Force coding style in hive precommit |
> > +1
> >
> > In case there is an agreement about the coding style, we can prepare a tool
> > that enforces that style at compile time. Run a tool one time to re-format
> > all the existing code once. And turn on a compile time check. Iceberg did
> > the same approach, they had one huge commit with almost 4k files changed
> > and from that point, it worked well. And there are no issues about
> > formatting.
> > I don't think putting a warning message helps at all. Also, it should be
> > enforced on compile time.
> >
> > Zsolt
> >
> > Kirti Ruge <kirtirug...@gmail.com> ezt írta (időpont: 2024. jan. 8., H,
> > 7:20):
> >
> > +1
> > As it would improve maintainability and code reviews. Sometimes small
> > indentation/styling issues would kill review cycle time and we can easily
> > avoid it before requesting review.
> > Enforcing more rules around it definitely boost guaranteeing quality. We
> > can integrate it with git hooks. If we are going for this, I can work on
> > getting it in place .
> >
> > Thanks,
> > Kirti
> >
> > On 08-Jan-2024, at 11:36 AM, Akshat m <akshatats...@gmail.com> wrote:
> >
> > +1, We do have a documentation round it as well:
> >
> >
> > https://cwiki.apache.org/confluence/display/Hive/HowToContribute#HowToContribute-CodingConventions
> > so it makes sense to enforce it as well.
> >
> > Right now we have a small section around this in documentation, We can
> > also
> > expand this to a new page and add more Java practices to it as well which
> > are followed in the project while we are at this, Will be a great
> > addition
> > to Hive 4 documentation, I can pick it up.
> >
> > I suggest we add this style check as a pre-commit git hook as well, so it
> > is enforced when the author is committing locally as well, this can save
> > the wait time for pre-commit failure in the PR for the author to realise
> > the styling issues, ideally this should be taken care of with the ide
> > style
> > configuration but in case we miss it this would error out while
> > committing the changes.
> >
> > Regards,
> > Akshat
> >
> > On Sat, Jan 6, 2024 at 10:17 AM László Bodor <bodorlaszlo0...@gmail.com>
> > wrote:
> >
> > Hi All!
> >
> > What do you think about forcing coding style in Hive precommit?
> >
> > I remember, back in the old days, precommit printed some warnings in
> > case
> > some coding style (formatting, indentation, naming convention, etc.)
> > problems were found in the patch, now it's simply not used, I guess
> > since
> > we're using GitHub PRs.
> >
> > For example: I remember I simply approved a PR a few months ago which
> > LGTM, and later just realized it's full of 4-spaces indentation, which
> > is
> > wrong if we assume that code should be formatted according to the style
> > definition here:
> >
> > https://github.com/apache/hive/blob/master/dev-support/eclipse-styles.xml
> >
> > I have just attached an example of Tez PR to open minds and start a
> > conversation.
> >
> > Regards,
> > Laszlo Bodor
> >
> >
> >
> >
> >

Reply via email to