On Mon, Feb 4, 2019 at 12:09 PM Thomas Stieve
<tomthirt...@email.arizona.edu> wrote:
>
> I would like to map IP edits to articles. However, I would like to exclude IP 
> addresses that are invalid, for example those using proxy servers. I am interested 
> in articles in the 2016 Wikipedia for 271 languages at the country level of 
> geolocation.

I looked for existing data sets related to your request, but did not
find an exact match among those listed at
<https://meta.wikimedia.org/wiki/Research:Data> or
<https://meta.wikimedia.org/wiki/Statistics>.

The data in our Wiki Replica databases available in the Cloud Services
environment is not already tagged with geolocation information. IP
address information is only available for anonymous edits, as a side
effect of MediaWiki's use of surrogate usernames for anonymous edits
generated from the IP address of the editor. Gathering all IP edits
from 2016 across all Wikipedias via the Wiki Replica databases will
require quite a large amount of processing power.
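
As a rough illustration of the surrogate username point above, anonymous
edits can be picked out by testing whether the recorded username parses
as an IP address. This is only a minimal sketch using Python's standard
ipaddress module. The function names and the (username, page title) row
shape are hypothetical, not the actual Wiki Replica schema, and the sketch
does no geolocation or proxy detection.

```python
import ipaddress


def is_ip_username(name: str) -> bool:
    """Return True if a username string parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(name.strip())
        return True
    except ValueError:
        return False


def filter_anonymous(rows):
    """Keep only (username, page_title) pairs whose username is an IP address.

    The row shape is an assumption made for this example.
    """
    return [(user, title) for user, title in rows if is_ip_username(user)]


if __name__ == "__main__":
    sample = [
        ("192.0.2.17", "Boise"),
        ("ExampleUser", "Arizona"),
        ("2001:DB8:0:0:0:0:0:1", "Tucson"),
    ]
    for user, title in filter_anonymous(sample):
        print(user, title)
```

Rows that pass this check would still need a separate GeoIP lookup and
some proxy filtering step, neither of which is shown here.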

Reposting your question on the Analytics mailing list
(<https://lists.wikimedia.org/mailman/listinfo/analytics>) may reach
more Wikimedians who have experience in collecting this type of data
for analysis.

Bryan
-- 
Bryan Davis              Wikimedia Foundation    <bd...@wikimedia.org>
[[m:User:BDavis_(WMF)]] Manager, Technical Engagement    Boise, ID USA
irc: bd808                                        v:415.839.6885 x6855

_______________________________________________
Wikimedia Cloud Services mailing list
Cloud@lists.wikimedia.org (formerly lab...@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/cloud