I'd only tweak this to perhaps not close JIRAs that have been updated recently -- even just avoiding things updated in the last month. For example this would close https://issues.apache.org/jira/browse/SPARK-27758 which was opened Friday (though, for other reasons it should probably be closed). Still I don't mind it under the logic that it has been reported against 2.1.0.
On the other hand, I'd go further and close _anything_ not updated in a long time, like a year (or 2 if feeling conservative). That is there's probably a lot of old cruft out there that wasn't marked with an Affected Version, before that was required. On Sat, May 18, 2019 at 10:48 PM Hyukjin Kwon <gurwls...@gmail.com> wrote: > Thanks guys. > > This thread got more than 3 PMC votes without any objection. I slightly > edited JQL from Abdeali's suggestion (thanks, Abdeali). > > > JQL: > > project = SPARK > AND status in (Open, "In Progress", Reopened) > AND ( > affectedVersion = EMPTY OR > NOT (affectedVersion in versionMatch("^3.*") > OR affectedVersion in versionMatch("^2.4.*") > OR affectedVersion in versionMatch("^2.3.*") > ) > ) > > > https://issues.apache.org/jira/issues/?jql=project%20%3D%20SPARK%20%0A%20%20AND%20status%20in%20(Open%2C%20%22In%20Progress%22%2C%20Reopened)%0A%20%20AND%20(%0A%20%20%20%20affectedVersion%20%3D%20EMPTY%20OR%0A%20%20%20%20NOT%20(affectedVersion%20in%20versionMatch(%22%5E3.*%22)%0A%20%20%20%20%20%20OR%20affectedVersion%20in%20versionMatch(%22%5E2.4.*%22)%0A%20%20%20%20%20%20OR%20affectedVersion%20in%20versionMatch(%22%5E2.3.*%22)%0A%20%20%20%20)%0A%20%20) > > > It means we will resolve all JIRAs that have EOL releases as affected > versions, including no version specified in affected versions - this will > reduce open JIRAs under 900. > > Looks I can use a bulk action feature in JIRA. Tomorrow at the similar > time, I will > - Label those JIRAs as 'bulk-closed' > - Resolve them via `Incomplete` status. > > Please double check the list and let me know if you guys have any concern. > > > > > > 2019년 5월 18일 (토) 오후 12:22, Dongjoon Hyun <dongjoon.h...@gmail.com>님이 작성: > >> +1, too. >> >> Thank you, Hyukjin! >> >> Bests, >> Dongjoon. >> >> >> On Fri, May 17, 2019 at 9:07 AM Imran Rashid <iras...@cloudera.com.invalid> >> wrote: >> >>> +1, thanks for taking this on >>> >>> On Wed, May 15, 2019 at 7:26 PM Hyukjin Kwon <gurwls...@gmail.com> >>> wrote: >>> >>>> oh, wait. 'Incomplete' can still make sense in this way then. >>>> Yes, I am good with 'Incomplete' too. >>>> >>>> 2019년 5월 16일 (목) 오전 11:24, Hyukjin Kwon <gurwls...@gmail.com>님이 작성: >>>> >>>>> I actually recently used 'Incomplete' a bit when the JIRA is >>>>> basically too poorly formed (like just copying and pasting an error) ... >>>>> >>>>> I was thinking about 'Unresolved' status or `Auto Closed' too. I >>>>> double checked they can be reopen as well after resolution. >>>>> >>>>> [image: Screen Shot 2019-05-16 at 10.35.14 AM.png] >>>>> [image: Screen Shot 2019-05-16 at 10.35.39 AM.png] >>>>> >>>>> 2019년 5월 16일 (목) 오전 11:04, Sean Owen <sro...@gmail.com>님이 작성: >>>>> >>>>>> Agree, anything without an Affected Version should be old enough to >>>>>> time out. >>>>>> I might use "Incomplete" or something as the status, as we haven't >>>>>> otherwise used that. Maybe that's simpler than a label. But, anything >>>>>> like >>>>>> that sounds good. >>>>>> >>>>>> On Wed, May 15, 2019 at 8:40 PM Hyukjin Kwon <gurwls...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> BTW, affected version became a required field (I don't remember when >>>>>>> exactly was .. I believe it's around when we work on Spark 2.3): >>>>>>> >>>>>>> [image: Screen Shot 2019-05-16 at 10.29.50 AM.png] >>>>>>> >>>>>>> So, including all EOL versions and affected versions not specified >>>>>>> will roughly work. >>>>>>> Using "Cannot Reproduce" as its status and 'bulk-closed' label makes >>>>>>> the best sense to me. >>>>>>> >>>>>>> Okie. I want to open this roughly for a week before taking an actual >>>>>>> action for this. If there's no more feedback, I will do as I said ^ next >>>>>>> week. >>>>>>> >>>>>>> >>>>>>> 2019년 5월 15일 (수) 오후 11:33, Josh Rosen <rosenvi...@gmail.com>님이 작성: >>>>>>> >>>>>>>> +1 in favor of some sort of JIRA cleanup. >>>>>>>> >>>>>>>> My only request is that we attach some sort of 'bulk-closed' label >>>>>>>> to issues that we close via JIRA filter batch operations (and resolve >>>>>>>> the >>>>>>>> issues as "Timed Out" / "Cannot Reproduce", not "Fixed"). Using a label >>>>>>>> makes it easier to audit what was closed, simplifying the process of >>>>>>>> identifying and re-opening valid issues caught in our dragnet. >>>>>>>> >>>>>>>> >>>>>>>> On Wed, May 15, 2019 at 7:19 AM Sean Owen <sro...@gmail.com> wrote: >>>>>>>> >>>>>>>>> I gave up looking through JIRAs a long time ago, so, big respect >>>>>>>>> for >>>>>>>>> continuing to try to triage them. I am afraid we're missing a few >>>>>>>>> important bug reports in the torrent, but most JIRAs are not >>>>>>>>> well-formed, just questions, stale, or simply things that won't be >>>>>>>>> added. I do think it's important to reflect that reality, and so >>>>>>>>> I'm >>>>>>>>> always in favor of more aggressively closing JIRAs. I think this is >>>>>>>>> more standard practice, from projects like TensorFlow/Keras, >>>>>>>>> pandas, >>>>>>>>> etc to just automatically drop Issues that don't see activity for N >>>>>>>>> days. We won't do that, but, are probably on the other hand far too >>>>>>>>> lax in closing them. >>>>>>>>> >>>>>>>>> Remember that JIRAs stay searchable and can be reopened, so it's >>>>>>>>> not >>>>>>>>> like we lose much information. >>>>>>>>> >>>>>>>>> I'd close anything that hasn't had activity in 2 years (?), as a >>>>>>>>> start. >>>>>>>>> I like the idea of closing things that only affect an EOL release, >>>>>>>>> but, many items aren't marked, so may need to cast the net wider. >>>>>>>>> >>>>>>>>> I think only then does it make sense to look at bothering to >>>>>>>>> reproduce >>>>>>>>> or evaluate the 1000s that will still remain. >>>>>>>>> >>>>>>>>> On Wed, May 15, 2019 at 4:25 AM Hyukjin Kwon <gurwls...@gmail.com> >>>>>>>>> wrote: >>>>>>>>> > >>>>>>>>> > Hi all, >>>>>>>>> > >>>>>>>>> > I would like to propose to resolve all JIRAs that affects EOL >>>>>>>>> releases - 2.2 and below. and affected version >>>>>>>>> > not specified. I was rather against this way and considered this >>>>>>>>> as last resort in roughly 3 years ago >>>>>>>>> > when we discussed. Now I think we should go ahead with this. See >>>>>>>>> below. >>>>>>>>> > >>>>>>>>> > I have been talking care of this for so long time almost every >>>>>>>>> day those 3 years. The number of JIRAs >>>>>>>>> > keeps increasing and it does never go down. Now the number is >>>>>>>>> going over 2500 JIRAs. >>>>>>>>> > Did you guys know? in JIRA, we can only go through page by page >>>>>>>>> up to 1000 items. So, currently we're even >>>>>>>>> > having difficulties to go through every JIRA. We should manually >>>>>>>>> filter out and check each. >>>>>>>>> > The number is going over the manageable size. >>>>>>>>> > >>>>>>>>> > I am not suggesting this without anything actually trying. This >>>>>>>>> is what we have tried within my visibility: >>>>>>>>> > >>>>>>>>> > 1. In roughly 3 years ago, Sean tried to gather committers and >>>>>>>>> even non-committers people to sort >>>>>>>>> > out this number. At that time, we were only able to keep >>>>>>>>> this number as is. After we lost this momentum, >>>>>>>>> > it kept increasing back. >>>>>>>>> > 2. At least I scanned _all_ the previous JIRAs at least more >>>>>>>>> than two times and resolved them. Roughly >>>>>>>>> > once a year. The rest of them are mostly obsolete but not >>>>>>>>> enough information to investigate further. >>>>>>>>> > 3. I strictly stick to "Contributing to JIRA Maintenance" >>>>>>>>> https://spark.apache.org/contributing.html and >>>>>>>>> > resolve JIRAs. >>>>>>>>> > 4. Promoting other people to comment on JIRA or actively >>>>>>>>> resolve them. >>>>>>>>> > >>>>>>>>> > One of the facts I realised is the increasing number of >>>>>>>>> committers doesn't virtually help this much (although >>>>>>>>> > it might be helpful if somebody active in JIRA becomes a >>>>>>>>> committer.) >>>>>>>>> > >>>>>>>>> > One of the important thing I should note is that, it's now >>>>>>>>> almost pretty difficult to reproduce and test the >>>>>>>>> > issues found in EOL releases. We should git clone, checkout, >>>>>>>>> build and test. And then, see if that issue >>>>>>>>> > still exists in upstream, and fix. This is non-trivial overhead. >>>>>>>>> > >>>>>>>>> > Therefore, I would like to propose resolving _all_ the JIRAs >>>>>>>>> that targets EOL releases - 2.2 and below. >>>>>>>>> > Please let me know if anyone has some concerns or objections. >>>>>>>>> > >>>>>>>>> > Thanks. >>>>>>>>> >>>>>>>>> >>>>>>>>> --------------------------------------------------------------------- >>>>>>>>> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org >>>>>>>>> >>>>>>>>>