Good to know PHP - Spam mod

Posted on:August 21 2005

I just wrote a small spam mod for this blog software. Lets see if it works.





Comments:


Testing if comments still work.. yes! :)
niko
Quote
2005-08-21 07:49:00


Yep it works ;)
How's the work on the .NET wrapper?
Raedwulf
Quote
2005-08-21 10:51:00


"Have you got anything without spam?"
"Well, there's spam egg sausage and spam, that's not got much spam in it."
Factory
Quote
2005-08-21 11:13:00


buy your own eastern european wife online right now! 20% off, only this week!
(if this gets on, you failed)
Armen138
Quote
2005-08-21 14:30:00


better luck next time :P
Armen138
Quote
2005-08-21 14:31:00


'small' spam mod doesn't suggest it works with sophisticated AI to detect spam messages but does suggest it does something ie. if you noticed all the spam messages had the email field filled with random chars. That might help in detecting the spam-still this concept does approach ai - sorry :D
Raedwulf
Quote
2005-08-21 21:10:00


you dont neccesarily need a good AI routine to do this, three tables of 'good', 'bad' and 'ignored' words in mySQL, then just anaylse each word and create a 'score'. Then all you have to do is set a baseline score, anything above it is good, below it is spam and deleted/highlighted.
John C
Quote
2005-08-21 23:00:00


That's simple :D
Raedwulf
Quote
2005-08-22 08:12:00


Wouldn't a baysian approche do just that but instead also make a smart assumption about the score. You act on the notion that all bad words only come out in bad content. But baysian for sorting out bad and good comments in PHP sounds a little overkill. I just wanted so state that in all the papers I have read that notion of counting bad words have been proven to fail. I saw one with counting good and bad (somewhat a naive baysian classifier) can't say anything about the result.
Mikael B
Quote
2005-08-22 13:29:00


The way which i suggested (i.e. detect emails with words in them as 'good' and random chars as 'bad' ) might work but of course if you had an email like r2d2@starwars.com or c3po@starwars.com it would be caught because they don't make sense to a word detector :P
Raedwulf
Quote
2005-08-22 16:49:00


hehe :) my current anti-spam strategy is just a list with bad words combinations like 'super poker', because I assume no one will post something about that in this blog :)
niko
Quote
2005-08-22 19:59:00


Add comment:


Posted by:


Enter the missing letter in: "Internationa?"


Text:

 

  

Possible Codes


Feature Code
Link [url] www.example.com [/url]
Bold [b]bold text[/b]
Quote [quote]quoted text[/quote]
Code [code]source code[/code]

Emoticons