Without wanting to take away from this thread--folks who respond here, could you please also have a look at T365693: MediaWiki Dumps XML - Provide attribute to indicate that user is temporary account in exported content <https://phabricator.wikimedia.org/T365693> and add your feedback? (Or just reply to this list and I'll make sure it's captured on the task.)
Thanks, Kosta On 8. Oct 2024 at 20:07:05, Sascha Brawer via Cloud < cloud@lists.wikimedia.org> wrote: > QRank <https://qrank.wmcloud.org/> uses dumps (plus access logs) to > compute a ranking signal for Wikidata items. > > — Sascha > > Am Di., 8. Okt. 2024 um 18:57 Uhr schrieb YiFei Zhu < > zhuyifei1...@gmail.com>: > >> On Tue, Oct 8, 2024 at 8:59 AM Bryan Davis <bd...@wikimedia.org> wrote: >> > >> > I was asked recently what I knew about the types of tools that use >> > data from the https://dumps.wikimedia.org/ project. I had to admit >> > that I really didn't know of many tools off the top of my head that >> > relied on dumps. Most of the use cases I have heard about are for >> > research topics like looking at word frequencies and sentence >> > complexity, or machine learning things that consume some or all of the >> > wiki corpus. >> > >> > Do you run a tool that needs data from Dumps to do its job? I would >> > love to hear some stories about how this data helps folks advance the >> > work of the movement. >> >> YiFeiBot uses dumps to find a list of pages with interlanguage links, >> for the interlanguage link removal task. It does this by processing >> each page's wikitext through a regex. >> >> > Bryan >> > -- >> > Bryan Davis Wikimedia Foundation >> > Principal Software Engineer Boise, ID USA >> > [[m:User:BDavis_(WMF)]] irc: bd808 >> > _______________________________________________ >> > Cloud mailing list -- cloud@lists.wikimedia.org >> > List information: >> https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/ >> _______________________________________________ >> Cloud mailing list -- cloud@lists.wikimedia.org >> List information: >> https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/ >> > _______________________________________________ > Cloud mailing list -- cloud@lists.wikimedia.org > List information: > https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/ >
_______________________________________________ Cloud mailing list -- cloud@lists.wikimedia.org List information: https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/