On 12/05/2014 08:34 PM, Louis Suárez-Potts wrote:
On 05 Dec2014, at 19:41, Andreas Säger <saege...@t-online.de> wrote:
Am 05.12.2014 um 01:15 schrieb Andrew Douglas Pitonyak:
I did a scrape of the pages, and it is about 8GB last time I did it. Off
hand, I expect that a huge chunk of that is SPAM, especially since most
of the SPAMS have large graphics included. I considered writing a PERL
script to clean that based on certain search criteria, but, it just
feels like a huge annoyance to spend hours removing posts and then
trolling the rest of the files to rearrange all of the links so that
things continue to function. So, I did not start the clean-up process
from my scrape.
Hi,
Last time when I was browsing oooforum.org, there was a distinct day
when the moderators gave up. Every posting since that day is spam or an
unanswered question. Everything before that day was more or less well
moderated.
Sorry, I don't recall which day it was but it is easy to find when you
search postings of active members.
Hope this helps.
I volunteer to help with moderation. I’m a moderator for the dev (and other?)
lists, but never do the work, as usually others do it before I get to it—I live
in a lucky timezone, I guess. Or I’m exceptionally lazy.
-louis
Ed is currently considering his options as to what he would like to do.
In the meantime, I am happy to send a copy of my last scrape from around
September 1 to anyone who wants it providing I do not have too many
takers. That scrape took a few days to run based on the load and the
amount of spam. The posts are mostly spam and it is only a scrape. The
scrape contains links and similar, but you need to get through mostly
spam to find anything useful.
After Ed decides what he wants to do a better plan can be put in place.
--
Andrew Pitonyak
My Macro Document: http://www.pitonyak.org/AndrewMacro.odt
Info: http://www.pitonyak.org/oo.php
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org
For additional commands, e-mail: dev-h...@openoffice.apache.org