Hi, > I'm currently doing research for my bachelor thesis on how to automatically > extract FAQs from unstructured data. ... > For the evaluation part I'd like to ask you for having a look at one or two > FAQs and maybe give some comments on how far the questions matched the FAQ's > title, how relevant they were etc. > Here's the direct link to the SpamAssassin FAQs: > http://faqcluster.com/spam-spamassassin-mail-rule-rules
That's pretty cool. I looked through a few, and it indeed looks helpful. Not so much on how relevant they are, but two things that I thought of after viewing a few articles: - How about a pointer to the original version, in case the reader wants to follow the whole thread? - How about a time/date stamp so users have an idea where it fits in context? Best regards, Alex