Difference between revisions of "Spamassassin"
From wiki
(initial draft) |
(Add Bayes training) |
||
Line 1: | Line 1: | ||
+ | {{WIP}} | ||
edit file <code>/etc/spamassassin/local.cf</code> | edit file <code>/etc/spamassassin/local.cf</code> | ||
Line 8: | Line 9: | ||
<syntaxhighlight lang="console"> | <syntaxhighlight lang="console"> | ||
# mkdir /var/lib/spamassassin/bayes_db | # mkdir /var/lib/spamassassin/bayes_db | ||
+ | </syntaxhighlight> | ||
+ | |||
+ | |||
+ | == Training == | ||
+ | |||
+ | To reach a good efficiency, SpamAssassin Bayesian filter need to be trained with both spam and ham messages. You can use your actual mailbox for that but note the following points: | ||
+ | * Be sure that the folders you use for training contain only spam or ham. If a folder contain a mix of them, SpamAssassin will learn wrong info and produce bad quality results | ||
+ | * To be effective you need between 1000 and 5000 messages each of both spam and ham. | ||
+ | * You need to have more ham than spam to train. Otherwise, SpamAssassin might become biased toward spam. | ||
+ | |||
+ | <syntaxhighlight lang="console"> | ||
+ | # sa-learn --spam --showdots --dir /var/maildir/<username>/Maildir/.Spam/cur/ | ||
+ | # sa-learn --ham --showdots --dir /var/maildir/<username>/Maildir/cur/ | ||
</syntaxhighlight> | </syntaxhighlight> |
Revision as of 06:49, 4 March 2016
Warning: | This page is a work in progress and is not completed. Important informations might be missing or wrong. |
edit file /etc/spamassassin/local.cf
bayes_path /var/lib/spamassassin/bayes_db/bayes
bayes_file_mode 0770
# mkdir /var/lib/spamassassin/bayes_db
Training
To reach a good efficiency, SpamAssassin Bayesian filter need to be trained with both spam and ham messages. You can use your actual mailbox for that but note the following points:
- Be sure that the folders you use for training contain only spam or ham. If a folder contain a mix of them, SpamAssassin will learn wrong info and produce bad quality results
- To be effective you need between 1000 and 5000 messages each of both spam and ham.
- You need to have more ham than spam to train. Otherwise, SpamAssassin might become biased toward spam.
# sa-learn --spam --showdots --dir /var/maildir/<username>/Maildir/.Spam/cur/
# sa-learn --ham --showdots --dir /var/maildir/<username>/Maildir/cur/