I would like to block some emails which contains certain Japanese words, but spamassassin fails to detect such words when the email is HTML encoded, for example:
This is a multi-part message in MIME format.
--------------050206070005060005050706
Content-Type: text/plain; charset=ISO-2022-JP; format=flowed
Content-Transfer-Encoding: quoted-printable
こんにちは!残念な&=
#12364;ら凶報がございま&#=
12377;。数ヶ月前、あな...
--------------050206070005060005050706
Content-Type: text/html; charset="ISO-2022-JP"
Content-Transfer-Encoding: quoted-printable
<html>
<head>
<meta http-equiv=3D"content-type" content=3D"text/html; =
charset=3DISO-2022-JP">
</head>
<body bgcolor=3D"#FFFFFF" text=3D"#000000">
こんにちは!</br>
</br>
残念ながら凶報がご&=
#12374;います。</br>
...
</body>
</html>
--------------050206070005060005050706--
Example rule in spamassassin:
body JAP_BAD_1 /残念ながら凶報がございます/
score JAP_BAD_1 5.0
However, when I run the test:
spamassassin -D textcat -t spam.test
It doesn't show up the match. What do I have to do?
0 Answers