Hi Gilles,

------------------ Original ------------------
From:&nbsp;"GillesSadowski"<gillese...@gmail.com&gt;;
Date:&nbsp;Wed, Feb 26, 2020 00:32 AM
To:&nbsp;"Commons Developers List"<dev@commons.apache.org&gt;;

Subject:&nbsp;Re: [math] Discuss: New feature MiniBatchKMeansClusterer



&gt;Hello.
&gt;
&gt;[Side question: Do you send a copy of your posts directly to me?
&gt;If so, that is not necessary and is even annoying because when I
&gt;hit "reply" in my mail client, the conversation continues off-list...]

Yes, I always copy to you, I found the ML never forward my email up to now
(that I subscribe the ML with my other email and never received my own email).
Sorry for annoying you, but you are the only person received my email because I 
CC to you.

&gt;&gt;Le mar. 25 févr. 2020 à 14:53, CT <chentao...@qq.com&gt; a écrit :
&gt;&gt;
&gt;&gt; Hi Gilles:
&gt;&gt; &nbsp; Sorry for my unfamiliar in contribution. I started a new PR for 
most of your suggestion:
&gt;&gt; https://github.com/apache/commons-math/pull/120
&gt;
&gt;Sorry for seemingly nit-picking but the global issue is the same
&gt;as with PR #118: It contains too many unrelated changes.
&gt;There should be *one* PR for each batch of significant changes.
&gt;More precisely, for this work, that would entail (a.o. things), one
&gt;PR for each of the following:
&gt; * Class "ClusterUtils" (design yet to be discussed)
&gt; * Factoring out whatever code is necessary for your proposed
&gt;feature (that would otherwise be duplicated)
&gt;
&gt;We try and avoid bloating the API, hence the changes which
&gt;I've suggested:
&gt; * remove the constructors with default parameters
&gt;
&gt;Overall, each PR should probably contain a single commit, in
&gt;order to ease review.
Do you mean this:
&nbsp;* For JIRA issue #MATH-1509 Start a PR with "MiniBatchKMeansClusterer", 
but without the "ClusterUtils",
despite the duplicate code between "MiniBatchKMeansClusterer" and 
"KMeansPlusPlusClusterer",
also with "CentroidInitializer" and test code with in a single commit.
&nbsp;* Suggestions like "remove the constructors with default parameters" 
should apply as a new commit of the PR above,
and tracking by a subtask of JIRA issue #MATH-1509.
&nbsp;* Fire a new JIRA issue for the duplicate code, and start another PR with 
"ClusterUtils" in,
and extract duplicate code into "ClusterUtils".


&gt;&gt;"org.apache.commons.math4.userguide.ClusterAlgorithmComparison":
&gt;&gt; I remain have one question below:
&gt;&gt;
&gt;&gt; ------------------ Original ------------------
&gt; From: "Gilles Sadowski"<gillese...@gmail.com&gt;;
&gt; Date: Mon, Feb 24, 2020 09:52 PM
&gt; To: "dev"<dev@commons.apache.org&gt;;
&gt; Subject: Re: [math] Discuss: New feature MiniBatchKMeansClusterer
&gt;
&gt; &gt;Hi.
&gt; &gt;
&gt; &gt;Le sam. 22 févr. 2020 à 14:37, CT <chentao...@qq.com&gt; a écrit :
&gt; &gt;&gt;
&gt; &gt;&gt; Hi Gilles:
&gt; &gt;&gt; &nbsp; I really appricate for your patient to help me to solve 
the mail sending problem, I try to set the only setting about charset "Use 
&gt;Unicode" for this mail.
&gt; &gt;
&gt; &gt;The contents seems fine.&nbsp; However, there is something wrong: 
replying to your
&gt; &gt;message sends to you instead of the mailing list...
&gt; &gt;
&gt; &gt;&gt; &nbsp; I have created a pull request: 
https://github.com/apache/commons-math/pull/118
&gt; &gt;
&gt; &gt;That PR fails the build:
&gt; &gt; &nbsp; 
https://travis-ci.org/apache/commons-math/builds/653293451?utm_source=github_status&amp;utm_medium=notification
&gt; &gt;
&gt; &gt;Also, the code doesn't take into account all the remarks which I
&gt; &gt;made in previous comments (e.g. file JIRA reports to allow tracking
&gt; &gt;of changes to existing code and not mix those with new code).
&gt; &gt;
&gt;
&gt; I did not get what kinds of problem should tracking by JIRA.
&gt; In my opinion all the change is related to my PR.
&gt;
&gt;Yes, they are related but they could be provided in more focused
&gt;PRs as explained above.
&gt;
&gt; The JIRA issue about Feature MiniBatch has been closed.
&gt;
&gt;It is not closed:
&gt; &nbsp; https://issues.apache.org/jira/projects/MATH/issues/MATH-1509
&gt;
&gt;&gt; Should I fire a new issue?
&gt;
&gt;You should create one JIRA issue per self-contained change.
&gt;You could also create "sub-tasks" of MATH-1509 (for changes that
&gt;are strongly related to your contribution).

&gt;&gt;
&gt;&gt; &gt;In addition, you should avoid mixing massive cosmetic changes (e.g.
&gt;&gt; &gt;Javadoc reformatting) within a commit than contains other types of
&gt;&gt; &gt;change:
&gt;&gt; &gt;&nbsp; &nbsp; 
https://github.com/apache/commons-math/pull/118/commits/47a055e6264d084547854f9290461f020f2131cf
&gt;&gt; &gt;
&gt;&gt; &gt;&gt; &nbsp; And a comparsion between KMeans and MiniBatchKMeans by 
using the 
&gt;&gt;"org.apache.commons.math4.userguide.ClusterAlgorithmComparison":
&gt;&gt; &gt;
&gt;&gt; &gt;Functionality seems to work fine, but ability to review incremental
&gt;&gt; &gt;changes is extremely important in a collaborative project.
&gt;&gt; &gt;
&gt;&gt;
&gt;&gt; Thanks for your remind, and I created a new PR that improve the 
problems one by one.

&gt;I hope that I've now fixed the misunderstanding,
&gt;Gilles

&gt;&gt; &gt; [...]

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org

Reply via email to