"Joel Jacobson" <j...@compiler.org> writes: > In total, I scraped the first-page of some ~50k websites, > which produced 45M test rows to import, > which when GROUP BY pattern and flags was reduced > down to 235k different regex patterns, > and 1.5M different text string subjects.

This seems like an incredibly useful test dataset.
I'd definitely like a copy.

> No is_match differences were detected, good!

Cool ...

> However, there were 23 cases where what got captured differed:

I shall take a closer look at that.

Many thanks for doing this work!

			regards, tom lane