On 25.8.2010 22:47, Karsten Bräckelmann wrote:
> Jan, any chance you could provide the paragraphs or text parts
> corresponding to the seeks?
>
> Just to clarify: We do *not* require the full message, even though it
> makes things simpler. In fact, no headers (other than Subject) are ever
> used in the sought process.
Karsten,

I've been out of office (better: out-of-order ;)) for some days. Of course I'll provide you with some samples. I've spoken to some of our customers and they have agreed that I may submit the affected mails. Let me prepare this and I'll send you a link. Are you able to analyze the MS Outlook .msg format?

> Anonymizing any personal data is perfectly fine. Moreover, the ham
> corpus for sought is not available publicly, but restricted to a few SA
> developers only.
>
> The rendered and normalized body text is used to prevent seeks from
> appearing in the automatically generated rules -- strings directly
> extracted from spam. Thus, by its nature, the FP string itself cannot
> possibly be confidential. :)

Understood and agreed ;)

Thank you for your efforts and the great work on SpamAssassin!

Cheers,
Jan