On 12/05/2014 08:34 PM, Louis Suárez-Potts wrote:
On 05 Dec2014, at 19:41, Andreas Säger <saege...@t-online.de> wrote:

Am 05.12.2014 um 01:15 schrieb Andrew Douglas Pitonyak:
I did a scrape of the pages, and it is about 8GB last time I did it. Off
hand, I expect that a huge chunk of that is SPAM, especially since most
of the SPAMS have large graphics included. I considered writing a PERL
script to clean that based on certain search criteria, but, it just
feels like a huge annoyance to spend hours removing posts and then
trolling the rest of the files to rearrange all of the links so that
things continue to function. So, I did not start the clean-up process
from my scrape.

Hi,

Last time when I was browsing oooforum.org, there was a distinct day
when the moderators gave up. Every posting since that day is spam or an
unanswered question. Everything before that day was more or less well
moderated.
Sorry, I don't recall which day it was but it is easy to find when you
search postings of active members.

Hope this helps.
I volunteer to help with moderation. I’m a moderator for the dev (and other?) 
lists, but never do the work, as usually others do it before I get to it—I live 
in a lucky timezone, I guess. Or I’m exceptionally lazy.

-louis
Ed is currently considering his options as to what he would like to do.

In the meantime, I am happy to send a copy of my last scrape from around September 1 to anyone who wants it providing I do not have too many takers. That scrape took a few days to run based on the load and the amount of spam. The posts are mostly spam and it is only a scrape. The scrape contains links and similar, but you need to get through mostly spam to find anything useful.

After Ed decides what he wants to do a better plan can be put in place.

--
Andrew Pitonyak
My Macro Document: http://www.pitonyak.org/AndrewMacro.odt
Info:  http://www.pitonyak.org/oo.php


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org
For additional commands, e-mail: dev-h...@openoffice.apache.org

Reply via email to