On Mon, Feb 4, 2019 at 12:09 PM Thomas Stieve <tomthirt...@email.arizona.edu> wrote: > > I would like to map IP edits to articles. However, I would like to exclude IP > address that are invalid, for example by using proxy servers. I am interested > in articles in the 2016 Wikipedia for 271 languages at the country level of > geolocation.
I looked for existing data sets related to your request, but did not find an exact match from the content of <https://meta.wikimedia.org/wiki/Research:Data> or <https://meta.wikimedia.org/wiki/Statistics>. The data in our Wiki Replica databases available in the Cloud Services environment will not have geolocation tagging done already. IP address information will only be available for anonymous edits as a side effect of MediaWiki's use of surrogate usernames for anonymous edits generated from the IP address of the editor. Gathering all IP edits from 2016 across all Wikipedias via the Wiki Replica databases will require quite a large amount of processing power. Reposting your question on the Analytics mailing list (<https://lists.wikimedia.org/mailman/listinfo/analytics>) may find more Wikimedian's who have experience in collecting this type of data for analysis. Bryan -- Bryan Davis Wikimedia Foundation <bd...@wikimedia.org> [[m:User:BDavis_(WMF)]] Manager, Technical Engagement Boise, ID USA irc: bd808 v:415.839.6885 x6855 _______________________________________________ Wikimedia Cloud Services mailing list Cloud@lists.wikimedia.org (formerly lab...@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/cloud