This all looks really quite interesting. Though I wonder if the load of 
all the extra work now being down will kill them... (especially without 
the server being released/available).

rODbegbie wrote:
>> 2 Nilsimsa Signatures
>>
>>    Nilsimsa is a _fuzzy signature_ algorithm based on statistical models
>>    of n-gram occurrence in a piece of text. Nilsimsa disregards small
>>    changes (mutations) in text that are statistically irrelevant.
>>    Nilsimsa signatures can be compared to determine the similarity
>>    (between 0 - 100%) in source texts. Razor v2 includes support for
>>    Nilsimsa signatures.

Has there been any report on how they're doing this in the server? From 
my brief experimentation with Nilsimsa signatures it appeared that I 
needed to run the Nilsimsa compare functions against every single hash 
in the database, rather than being able to do a simple hash lookup. This 
pretty much sucked performance wise, though maybe Vipul has figured out 
a way to do it without this "full table scan" that I had to do.

Matt.


_______________________________________________________________

Don't miss the 2002 Sprint PCS Application Developer's Conference
August 25-28 in Las Vegas - 
http://devcon.sprintpcs.com/adp/index.cfm?source=osdntextlink

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to