ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> spamassassin-users
spamassassin-users
R: Stock spam in images
by Giampaolo Tomassoni other posts by this author
Oct 2 2006 9:50AM messages near this date
RE: Stock spam in images | RE: Stock spam in images
>  You'd need some clever rules...
>  
>  As an example, the word "stock" is perfectly valid in emails, but if you
>  found it in an attached image you'd be pretty sure it was spam.

It would be perfectly valid in a, say, graph image too. SA is meant to work in the overall m
essage content. It is not that simple to discard a thesis which includes images as a content
 carrier from the SA viewpoint, I guess.


>  So you'd need two sets of rules anyhow.

Why? A spammer wouldn't send just the word "stock" in its image message...


>  It looks like SA 3.2 will let us do that in a sane manner.
>  
>  Phil
>  --
>  Phil Randal
>  Network Engineer
>  Herefordshire Council
>  Hereford, UK  
>  
>  > -----Original Message-----
>  > From: Fabien GARZIANO [mailto:fabien.garziano@[...].com] 
>  > Sent: 02 October 2006 16:11
>  > To: users@[...].org
>  > Subject: RE: Stock spam in images
>  > 
>  >  
>  > Too bad, cause I agree with Giampaolo, it would be great. 
>  > What about making a plugin including OCR components but 
>  > instead of using inner dictionnary, passing it back to 
>  > spamassassin through the MTA... Yeah, I know, the load will 
>  > increase ... But that would be nice ?
>  > 
>  > ...
>  > 
>  > ... Ok,I go back to sleep ....
>  > 
>  > -----Message d'origine-----
>  > De : Randal, Phil [mailto:prandal@[...].uk] 
>  > Envoyé : lundi 2 octobre 2006 16:19
>  > Ã? : users@[...].org
>  > Objet : RE: Stock spam in images
>  > 
>  > Giampaolo Tomassoni wrote:
>  > 
>  > > And, by the way, it seems to work!
>  > > 
>  > > Actually, the only limit I see is the own-made FuzzyOcr.words (and, 
>  > > maybe, the fact that script text may probably get undetected). 
>  > > Wouldn't it be better to inject the detected text back to SA? There 
>  > > should be enough variants of spam worlds to let SA fuzzily 
>  > catch the 
>  > > ones from images.
>  > > 
>  > > Am I wrong?
>  > 
>  > I think so.  Some of the words would be perfectly legitimate 
>  > in the text of emails but rarely found in attached legitimate images.
>  > 
>  > Quite apart from the fact that Spamassassin isn't designed 
>  > for "reinjection".
>  > 
>  > Cheers,
>  > 
>  > Phil
>  > --
>  > Phil Randal
>  > Network Engineer
>  > Herefordshire Council
>  > Hereford, UK
>  > 
Thread:
Dylan Bouterse
Brent Kennedy
Giampaolo Tomassoni
Fabien GARZIANO
Phil Randal
Giampaolo Tomassoni
Fabien GARZIANO
Phil Randal
Theo Van Dinter
Decoder
Stuart Johnston
Theo Van Dinter
Andreas Pettersson
Giampaolo Tomassoni
Giampaolo Tomassoni
Phil Randal
Decoder
Bowie Bailey
Giampaolo Tomassoni
Bret Miller
Dylan Bouterse

Privacy Policy | Email Opt-out | Feedback | Syndication
© 2004 ActiveState, a division of Sophos All rights reserved