While I understand the motivation for the change, I think in this case the
label reversing used is more complex than needed. It would have been
simpler (and thus likely more efficient / less bug-prone) to reverse the
whole hostnames, storing moc.elpmaxe.www instad of com.example.www

On Mon, 11 Dec 2023 at 15:34, Amir Sarabadani <asarabad...@wikimedia.org>
wrote:

> Hi Tilman,
> Sorry for the late reply.
>
> Regarding finding the actual link from the row. The recommended way is to
> do processing in code afterwards. That's what MediaWiki does (in
> https://gerrit.wikimedia.org/g/mediawiki/core/+/80790ffc21a49fbe7709eaf5ce634b645798cf47/includes/ExternalLinks/LinkFilter.php#264)
> and you can easily replicate the logic of LinkFilter::reverseIndexes() in
> your programming language of choice. Doing all of data processing in SQL is
> not recommended.
>
> Indeed, we these changes are really necessary. For example with the
> current growth of Wikimedia Commons we will have to resort to more drastic
> actions if its database growth doesn't slow down (See
> https://phabricator.wikimedia.org/T343131 and
> https://phabricator.wikimedia.org/F37157040). Noting that database growth
> is not always about the wiki's growth, lots of times it's just high use of
> some features of mediawiki (in here, templates and external links).
>
> We will keep in mind to update documentation for further work (and thank
> you for the feedback!). The next will be pagelinks.
>
> Best
> _______________________________________________
> Cloud mailing list -- cloud@lists.wikimedia.org
> List information:
> https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/
>
_______________________________________________
Cloud mailing list -- cloud@lists.wikimedia.org
List information: 
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/

Reply via email to