On Tue, Jul 1, 2008 at 1:53 AM, Julian Cable <
julian.cable@bbc.co.uk> wrote:
-On [20080701 09:30], Martin Jerabek (martin.jerabek@isis-papyrus.com) wrote:
>If more non-ASCII characters are going to be included in the tzdata
>files, I would like to propose to define UTF-8 as the official encoding
>of the tzdata files.
In principle, I agree. In practice UTF-8 has at least one little quirk which has caused me problems:
Microsoft operating systems always start UTF-8 encoded files with a Byte Order Mark (BOM) (http://en.wikipedia.org/wiki/Byte_Order_Mark)
*nix-like operating systems never do (at least in my experience) and at least one perl-based xml parser running on Linux chokes on the BOM.