ASPN ActiveState Programmer Network
ActiveState
/ Home / Perl / PHP / Python / Tcl / XSLT /
/ Safari / My ASPN /
Cookbooks | Documentation | Mailing Lists | Modules | News Feeds | Products | User Groups


Recent Messages
List Archives
About the List
List Leaders
Subscription Options

View Subscriptions
Help

View by Topic
ActiveState
.NET Framework
Open Source
Perl
PHP
Python
Tcl
Web Services
XML & XSLT

View by Category
Database
General
SOAP
System Administration
Tools
User Interfaces
Web Programming
XML Programming


MyASPN >> Mail Archive >> spamassassin-users
spamassassin-users
bayes_seen file of 340MB
by Federico Giannici other posts by this author
Jun 15 2005 4:10AM messages near this date
Re: Limit the size of spamd | Re: bayes_seen file of 340MB
We have a SpamAssassin installation with a single bayes database for all 
   our mailboxes (a couple thousand).

I think that the "bayes_toks" file has the expected size (around 8MB), 
but the "bayes_seen" file seems too big to me: around 340MB!
Is this size normal?
Doesn't such a dimension slow down the queries?


Here is our "local.cf" content:

use_bayes 1
bayes_path /var/spamassassin/bayes
bayes_use_hapaxes 1
bayes_auto_learn 1
bayes_learn_to_journal 1
bayes_journal_max_size 1000000
bayes_expiry_max_db_size 250000


Here is the "sa-learn --dump magic" output:

0.000          0          3          0  non-token data: bayes db version
0.000          0    5103649          0  non-token data: nspam
0.000          0    1439768          0  non-token data: nham
0.000          0     448750          0  non-token data: ntokens
0.000          0 1118530322          0  non-token data: oldest atime
0.000          0 1118832322          0  non-token data: newest atime
0.000          0 1118832323          0  non-token data: last journal 
sync atime
0.000          0 1118797752          0  non-token data: last expiry atime
0.000          0      43200          0  non-token data: last expire 
atime delta
0.000          0     186807          0  non-token data: last expire 
reduction count


Thanks.

-- 
___________________________________________________
     __
    |-                      giannici@[...].it
    |ederico Giannici      http://www.neomedia.it
___________________________________________________
Thread:
Federico Giannici
Matt Kettler

Privacy Policy | Email Opt-out | Feedback | Syndication
© ActiveState Software Inc. All rights reserved