All list archives can be downloaded from many places. Once you have them all in 
a place, you can try doing nifty things like pattern analysis etc. If you're a 
real n3wb and UNIX/BSD/*nix looks fun still, you can use grep, sort, awk and 
even perl to do fun things with data stored in ASCII files from mail archives.

There are many tools to understand data. Big data is even more fascinating as 
you can pull too many things into compiling reports or patterns. 

Hadoop instances, AWS cloud options will allow you to deploy things like Micro 
Strategy based solutions that can then pull data to mine to create reports in 
real time. What I like about Micro Strategy is that they can produce complex 
reports with complex logic based on many data points. But now if you are still 
this far with me, you should consider working for the NSA. The technology is 
similar.

OpenBSD mailing lists aren't really high traffic. Neither has the IRC been 
alive much in the #OpenBSD channels across most servers. So your data points 
may not be as big..

This might get me flamed, but say if you need a data point for a 
reference...you may very well run pattern analysis based on Theo's e-mails, 
their timing, content length, words used, similar words pattern analysis etc. 
to judge (yes, there are actually modelling options like this) when he is angry 
or happy. He is not a FB person much (Are you Theo?) but in truly geek style, 
FB status updates really are the old cvs commits to HEAD or CURRENT.

So you can search those as well. 

Actually, in all honesty if you are interesting in something like this, many 
univ. offer courses on data these days. 

Happy hunting. Come back and tell us when he was the most mad!! Over the years, 
many a battle have been waged on these lists. Pop-corn worthy :)

Bruno Delbono
| Cognitive Researcher - Human Behavioural Project
| Real Sociedad Española De Antropología
| Royal Spanish Society Of Anthropology
| ☎: +1 855 253 5436 ☎: +1 424 354 4700 
| ✉: bruno.delb...@anthropology.es    | ☞: Anthropology.ES

| ✉: bruno.delb...@secure.af  | ☞: Secure.AF  | ☛: Mail.AC



-----Original Message-----
From: owner-m...@openbsd.org [mailto:owner-m...@openbsd.org] On Behalf Of Chris 
Cappuccio
Sent: September 5, 2013 2:40 PM
To: Kasper Adel
Cc: misc
Subject: Re: Data Mining/Crawling a Mailing List

The NSA has some good tools. I'd give them a call. Their contact info:

9800 Savage Rd  Fort Meade, MD 20755
(301) 688-6524

Kasper Adel [karim.a...@gmail.com] wrote:
> Hello,
> 
> A bit off topic but i was looking for a way/tool that could crawl 
> through a mailing list/news archives and try to filter most common 
> discussions and things like that, if anyone is aware of such a tool, pls let 
> me know.
> 
> Thanks,
> Kim

--
scio me nihil scire or scio me nescire

Reply via email to