[Cloud] List of users who have access to certain references

2018-12-27 Thread Huji Lee
This is an idea that came up on fawiki, and there is some merit to it. I just want to figure out the best approach to implement it and would love your input. *TL;DR: *We want to sweep through the recent edits in articles, look at each diff, see if it contains the addition of a "{{cite book}}" temp

Re: [Cloud] List of users who have access to certain references

2018-12-27 Thread John
Using a combination of pywiki and mwparserfromhell it shouldn’t be too much of an issue for a wiki. It might be hard for such a bot to keep up on say enwiki, but slower wikis shouldn’t be an issue. Pair that with a database backend, and you should be able to do it without too much issues. On Thu,

Re: [Cloud] List of users who have access to certain references

2018-12-27 Thread T Paris
Could I ask that you guys make this an “opt in” feature. Both because it’ll speed up the bot and also because once you start identifying which books people own, you start to develop a profile on people. v/r, TP Sent from Mail for Windows 10 From: Huji Lee Sent: Thursday, December 27, 2018 11:

Re: [Cloud] List of users who have access to certain references

2018-12-27 Thread Huji Lee
We will never know who "owns" which book. We only know that they have used it as a source a number of times. It could very well be that they just can easily borrow that book from a library (as is my case, with a lot of books and journals I have used as sources on Wikipedia). The profiling issue is

Re: [Cloud] List of users who have access to certain references

2018-12-27 Thread John
What’s fawiki’s edit rate? Processing a diff shouldn’t take more than 1-2 seconds especially if you optimize the logic. I’m just spitballing ideas at this point, but the logic should be easy On Thu, Dec 27, 2018 at 12:37 PM Huji Lee wrote: > We will never know who "owns" which book. We only know

Re: [Cloud] List of users who have access to certain references

2018-12-27 Thread Huji Lee
Got it. I am also looking for rough ideas at this point. The edit rate of fawiki is not that high, some 5-6K per day (and I am guessing 3-4K if restricting to article namespace). But note that we only care about edits in the last 6-12 months, by users who ha