Lars is looking into statistical analysis of spam. He has a corpus of some 2000 messages filtered by SpamAssassin over the last couple of months, and I am sharing data from my own corpus of similar size.
Other similar investigations & rants:
(source: Corante: The Silver Bullet to Kill Spam)
Update: Lars has published the first results of his investigations.
According to discussions on the SpamAssassin list, it appears that the next release of SpamAssassin (2.4) will have Bayesian filtering.
Posted by: Adam Kalsey on August 30, 2002 07:38 AM
The URLs to read more about Lars' work are:
http://www.ludvig.no/blog/archives/2002/08/statistical_analysis_of_spam_part_i.html
http://www.ludvig.no/blog/archives/2002/09/statistical_analysis_of_spam_part_ii.html
http://www.ludvig.no/blog/archives/2002/09/statistical_analysis_of_spam_part_iii.html
SpamWeed is a very effective anti-spam filter.
Posted by: L.B.S. on May 22, 2003 02:29 PM
© Anders Jacobsen [extrospection.com photography]