control characters and Util::clean_text()

David Miller justdave at bugzilla.org
Thu Dec 22 04:39:01 UTC 2005


Mick Weiss wrote on 12/21/05 10:30 PM:
> David Miller wrote:
>> Technically, you're not allowed anything that's not US-ASCII in email 
>> headers, but that's another bug.
> 
> Didn't this recently change? I believe umlauts and such characters are 
> (since very very recently) allowed.

RFC2822 (April 2001) section 2.1 states:

    A message that is conformant with this standard is comprised of
    characters with values in the range 1 through 127 and interpreted as
    US-ASCII characters [ASCII].  For brevity, this document sometimes
    refers to this range of characters as simply "US-ASCII characters".

That RFC is not listed as being obsoleted or updated by any other RFCs yet.

The only legal way around that is to encode the header values as base64 or
quoted-printable encoded-words, which is described in RFC2047 (and RFC2231
for parameter values). RFC2045 defines the two encodings themselves.
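
As a rough illustration of that workaround (not what Bugzilla itself does),
here is a minimal Python sketch using the standard email.header module, which
emits RFC 2047 encoded-words. The helper names encode_header_value and
decode_header_value are made up for this example, and picking UTF-8 as the
charset is an assumption.

```python
from email.header import Header, decode_header


def encode_header_value(text: str) -> str:
    """Return an ASCII-only form of text that is safe to put in a header.

    Pure US-ASCII input is returned unchanged; anything else is wrapped
    in RFC 2047 encoded-words (base64 or quoted-printable, whichever the
    library picks for the charset).
    """
    if text.isascii():
        return text
    # Assumption: UTF-8 is an acceptable charset for the recipient.
    return Header(text, charset="utf-8").encode()


def decode_header_value(value: str) -> str:
    """Reverse encode_header_value, turning encoded-words back into text."""
    parts = []
    for chunk, charset in decode_header(value):
        if isinstance(chunk, bytes):
            # Encoded chunks come back as bytes plus their charset name.
            parts.append(chunk.decode(charset or "ascii"))
        else:
            # Unencoded values may come back as a plain str.
            parts.append(chunk)
    return "".join(parts)


if __name__ == "__main__":
    encoded = encode_header_value("Müller")
    print(encoded)                       # ASCII-only encoded-word
    print(decode_header_value(encoded))  # original text again
```

The value on the wire stays within the 1 through 127 range the RFC asks for,
and a MIME-aware mail client decodes it back for display.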

-- 
Dave Miller                                   http://www.justdave.net/
System Administrator, Mozilla Corporation      http://www.mozilla.com/
Project Leader, Bugzilla Bug Tracking System  http://www.bugzilla.org/


