control characters and Util::clean_text()

Emmanuel Seyman eseyman at linagora.com
Wed Dec 21 16:35:32 UTC 2005


Dennis Melentyev wrote:
>
> ASCII 127 is a *correct* Russian symbol in cp1251 (thanks to M$).
> Also, what to do with UTF-8 input?

127 should be DELETE, no matter what charset you are using (since it's
part of the ASCII charset). This is a non-printable character so it should
 be trimmed.

What does it output in cp1251?

Emmanuel




More information about the developers mailing list