Spamassassin: Difference between revisions

From wiki

Revision as of 06:49, 4 March 2016


Warning: This page is a work in progress and is incomplete. Important information might be missing or wrong.

Edit the file /etc/spamassassin/local.cf and set the location and file mode of the Bayes database:

bayes_path /var/lib/spamassassin/bayes_db/bayes
bayes_file_mode 0770

Create the directory that will hold the database:

# mkdir /var/lib/spamassassin/bayes_db
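
Because bayes_file_mode is 0770, the database files are meant to be readable and writable by a group, so the directory probably needs matching permissions. The following is a minimal sketch, not a tested recipe: the function name prepare_bayes_dir and the group passed to it are assumptions, and the right group depends on which accounts run SpamAssassin on your system.

```shell
#!/bin/sh
# Sketch: create the Bayes directory and make it accessible to one group,
# mirroring bayes_file_mode 0770. The group name is supplied by the caller
# and is an assumption, not something this page prescribes.

prepare_bayes_dir() {
    dir=$1      # e.g. /var/lib/spamassassin/bayes_db
    group=$2    # hypothetical group shared by the accounts running SpamAssassin
    mkdir -p "$dir" || return 1
    chgrp "$group" "$dir" || return 1
    chmod 770 "$dir"
}

# Example (run as root; "mail" is only a placeholder group name):
# prepare_bayes_dir /var/lib/spamassassin/bayes_db mail
```

Note that bayes_path is a directory plus a file name prefix: SpamAssassin creates files such as bayes_toks and bayes_seen inside bayes_db.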


Training

To be effective, SpamAssassin's Bayesian filter needs to be trained with both spam and ham (legitimate) messages. You can use your actual mailbox for that, but note the following points:

  • Make sure each folder you use for training contains only spam or only ham. If a folder contains a mix of both, SpamAssassin will learn from wrongly labelled messages and produce poor results.
  • To be effective, you need between 1000 and 5000 messages of each kind, spam and ham.
  • Train with more ham than spam. Otherwise, SpamAssassin might become biased toward spam.
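
Before training, it can help to check the folders against the guidelines above. The sketch below assumes each message is one regular file in a Maildir cur folder; the function names count_messages and check_corpus are made up for this example, and only the lower bound of 1000 messages and the ham/spam balance are checked.

```shell
#!/bin/sh
# Sketch: count the messages in a Maildir folder and compare the spam and
# ham totals with the guidelines listed above.

count_messages() {
    # Each message in a Maildir folder is stored as one regular file.
    find "$1" -maxdepth 1 -type f | wc -l | tr -d ' '
}

check_corpus() {
    spam=$1
    ham=$2
    status=0
    if [ "$spam" -lt 1000 ] || [ "$ham" -lt 1000 ]; then
        echo "warning: fewer than 1000 messages in at least one class"
        status=1
    fi
    if [ "$ham" -le "$spam" ]; then
        echo "warning: ham ($ham) does not outnumber spam ($spam)"
        status=1
    fi
    if [ "$status" -eq 0 ]; then
        echo "corpus looks usable: $spam spam, $ham ham"
    fi
    return "$status"
}

# Example, using the same folders as the training commands:
# spam=$(count_messages "/var/maildir/<username>/Maildir/.Spam/cur/")
# ham=$(count_messages "/var/maildir/<username>/Maildir/cur/")
# check_corpus "$spam" "$ham"
```

The script only counts files. It cannot tell whether a folder is cleanly sorted, so the first point in the list still has to be checked by hand.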

Feed the spam folder and the inbox to sa-learn, replacing <username> with the mailbox owner:

# sa-learn --spam --showdots --dir /var/maildir/<username>/Maildir/.Spam/cur/
# sa-learn --ham --showdots --dir /var/maildir/<username>/Maildir/cur/
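
To confirm that training had an effect, sa-learn --dump magic prints the database counters, including the number of learned spam (nspam) and ham (nham) messages. The helper below is a small sketch for pulling one counter out of that output. The name bayes_count is made up, and the assumption that the count sits in the third column should be checked against the output of your SpamAssassin version.

```shell
#!/bin/sh
# Sketch: extract a learned-message counter from the output of
# "sa-learn --dump magic", read on standard input.

bayes_count() {
    # $1 is the counter name, "nspam" or "nham". The count is assumed to be
    # the third whitespace-separated field of the line ending in that name.
    awk -v name="$1" '$NF == name { print $3 }'
}

# Example:
# sa-learn --dump magic | bayes_count nspam
# sa-learn --dump magic | bayes_count nham
```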