I agree, reformatting 90% of the code base is tough.

There are two main issues:
  (1) Incompatible merges. This is hard, especially for the folks that have
to merge the pull requests ;-)

  (2) Author history: This is less of an issue, I think. "git log
<filename>" and "git show <revision> -- <filename>" will still work and one
may have to go one commit back to find out why something was changed


What I could image is to do this incrementally. Define the code style in
"flink-parent" but do not activate it.
Then start with some projects (new projects, plus some others):
merge/reject PRs, reformat, activate code style.

Piece by piece. This is realistically going to take a long time until it is
pulled through all components, but that's okay, I guess.

Stephan


On Mon, Feb 27, 2017 at 1:53 PM, Aljoscha Krettek <aljos...@apache.org>
wrote:

> Just for a bit of context, this is the output of running cloc on the Flink
> codebase:
> ------------------------------------------------------------
> -----------------------
> Language                         files          blank        comment
>     code
> ------------------------------------------------------------
> -----------------------
> Java                              4609         126825         185428
>   519096
>
> => 704,524 lines of code + comments/javadoc
>
> When I apply the google style to the Flink code base using
> https://github.com/google/google-java-format I get these commit
> statistics:
>
> 4577 files changed, 647645 insertions(+), 622663 deletions(-)
>
> That is, a change to the Google Code Style would touch roughly over 90% of
> all code/comment lines.
>
> I would like to have a well defined code style, such as the Google Code
> style, that has nice tooling and support but I don't think we will ever
> convince enough people to do this kind of massive change. Even I think it's
> a bit crazy to change 90% of the code base in one commit.
>
> On Mon, 27 Feb 2017 at 11:10 Till Rohrmann <trohrm...@apache.org> wrote:
>
> > No, I think that's exactly what people mean when saying "losing the
> commit
> > history". With the reformatting you would have to go manually through all
> > past commits until you find the commit which changed a given line before
> > the reformatting.
> >
> > Cheers,
> > Till
> >
> > On Sun, Feb 26, 2017 at 6:32 PM, Alexander Alexandrov <
> > alexander.s.alexand...@gmail.com> wrote:
> >
> > > Just to clarify - by "losing the commit history" you actually mean
> > "losing
> > > the ability to annotate each line in a file with its last commit",
> right?
> > >
> > > Or is there some other sense in which something is lost after applying
> > bulk
> > > re-format?
> > >
> > > Cheers,
> > > A.
> > >
> > > On Sat, Feb 25, 2017 at 7:10 AM Henry Saputra <henry.sapu...@gmail.com
> >
> > > wrote:
> > >
> > > > Just want to clarify what unify code style here.
> > > >
> > > > Is the intention to have IDE and Maven plugins to have the same check
> > > style
> > > > rules?
> > > >
> > > > Or are we talking about having ONE code style for both Java and
> Scala?
> > > >
> > > > - Henry
> > > >
> > > > On Fri, Feb 24, 2017 at 8:08 AM, Greg Hogan <c...@greghogan.com>
> > wrote:
> > > >
> > > > > I agree wholeheartedly with Ufuk. We cannot reformat the codebase,
> > > cannot
> > > > > pause while flushing the PR queue, and won't find a consensus code
> > > style.
> > > > >
> > > > > I think we can create a baseline code style for new and existing
> > > > > contributors for which reformatting on changed files will be
> > acceptable
> > > > for
> > > > > PR reviews.
> > > > >
> > > > > On Fri, Feb 24, 2017 at 5:01 AM, Dawid Wysakowicz <
> > > > > wysakowicz.da...@gmail.com> wrote:
> > > > >
> > > > > > The problem with code style when it is not enforced is that it
> will
> > > be
> > > > a
> > > > > > matter of luck to what parts of files / new files will it be
> > applied.
> > > > > When
> > > > > > the code style is not applied to whole file, it is pretty much
> > > useless
> > > > > > anyway. You would need to manually select just the fragments one
> is
> > > > > > changing. The benefits of having code style and enforcing it I
> see
> > > are:
> > > > > >  - being able to apply autoformatter, which speeds up writing
> code
> > > > > >  - it would make reviewing PRs easier as e.g. there would be line
> > > > length
> > > > > > limit applied etc. which will make line breaking more reader
> > > friendly.
> > > > > >
> > > > > > Though I think if a consensus is not reachable it would be good
> to
> > > once
> > > > > and
> > > > > > forever decide that we don't want a code style and checkstyle.
> > > > > >
> > > > > > 2017-02-24 10:51 GMT+01:00 Ufuk Celebi <u...@apache.org>:
> > > > > >
> > > > > > > On Fri, Feb 24, 2017 at 10:46 AM, Fabian Hueske <
> > fhue...@gmail.com
> > > >
> > > > > > wrote:
> > > > > > > > I agree with Till that encouraging a code style without
> > enforcing
> > > > it
> > > > > > does
> > > > > > > > not make a lot of sense.
> > > > > > > > If we enforce it, we need to touch all files and PRs.
> > > > > > >
> > > > > > > I think it makes sense for new contributors to have a starting
> > > point
> > > > > > > without enforcing anything (I do agree that we are past the
> point
> > > to
> > > > > > > reach consensus on a style and enforcement ;-)).
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to