Re: code review: splitIds from DConf '22 day 3: saving a sort and "getting performance"

user1234 via Digitalmars-d-learn Fri, 05 Aug 2022 06:52:08 -0700

On Thursday, 4 August 2022 at 13:18:40 UTC, kdevel wrote:

At DConf '22 day 3 Robert Schadek presented at around 07:22:00 in the YT video the function `splitIds`. Given an HTML page from bugzilla containing a list of issues, `splitIds` aims at extracting all bug-ids referenced within a specific url context:
```
long [] splitIds (string page)
{
   enum re = ctRegex!(`"show_bug.cgi\?id=[0-9]+"`);
   auto m = page.matchAll (re);

   return m
      .filter!(it => it.length > 0) // what is this?
      .map!(it => it.front) // whole match, it[0]
      .map!(it => it.find!(isNumber)) // searches first number
      .map!(it => it.until!(it => !it.isNumber ())) // last number
      .filter!(it => !it.empty) // again an empty check??? why?
      .map!(it => it.to!long ())
      .uniq // .sort is missing. IMHO saving at the wrong things?
      .array;
}
```
`m` contains all matches. It is a "list of lists" as one would say in Perl. The "inner lists" contain as first element ("`front`") the string which matches the whole pattern. So my first question is:
What is the purpose of the first filter call? Since the element of `m` is a match it cannot have a length of 0.
[...]

I think the first one is there to prevent calling `front()` on an empty range, except that, given the regex, that should not happen.
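
To illustrate why the emptiness checks look redundant, here is a rough sketch (not the code from the talk) that captures the digits directly with a group. The name `splitIdsSketch` and the sample page in the `unittest` are made up for illustration, and it uses a runtime `regex` instead of `ctRegex` only to keep the example light. It also sorts before `uniq`, since `uniq` only removes adjacent duplicates, which is what the quoted comment on `.uniq` hints at.

```d
import std.algorithm : map, sort, uniq;
import std.array : array;
import std.conv : to;
import std.regex : matchAll, regex;

/// Hypothetical variant of splitIds: the digits are captured by a group,
/// so neither of the two emptiness filters is needed.
long[] splitIdsSketch(string page)
{
    // Group 1 holds the digits; [0-9]+ guarantees at least one of them.
    auto re = regex(`"show_bug.cgi\?id=([0-9]+)"`);

    return page
        .matchAll(re)
        .map!(m => m[1].to!long) // m[0] is the whole match, m[1] the id
        .array                   // sort needs a random-access range
        .sort()                  // uniq only drops adjacent duplicates
        .uniq
        .array;
}

unittest
{
    // Made-up sample input with a repeated, non-adjacent id.
    auto page = `<a href="show_bug.cgi?id=42">a</a> `
        ~ `<a href="show_bug.cgi?id=7">b</a> `
        ~ `<a href="show_bug.cgi?id=42">c</a>`;
    assert(splitIdsSketch(page) == [7L, 42L]);
}
```

Without the `.sort()` the repeated id 42 would survive here, because the two occurrences are not next to each other.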

BTW I haven't watched the video, but I suppose this is related to the migration of bugzilla to GH issues. I wonder why https://bugzilla.readthedocs.io/en/5.0/api/index.html#apis is not used instead.
