On Mon, 2023-01-23 at 15:28 -0700, Paul Gilmartin via tz wrote:
On 1/23/23 13:48:02, Paul Eggert via tz wrote:
* Makefile (UNUSUAL_OK_LATIN_1): Allow all non-alphabetic, non-ASCII printable characters that are Latin-1. This is primarily for “§” and we might as well allow them all since even XEmacs 21 supports them all.
Ouch! UTF-8 is too pervasive on desktops and WWW for that to be comfortable.
And on a UTF-8 desktop, GNU sed strangles on non-UTF-8 strings: 1250 $ printf 'a\xa7b\n' | sed -E 's/(.)(.)(.)/1 \1 2 \2 3 \3/' sed: RE error: illegal byte sequence 1251 $
I think the intent is to allow non-ASCII characters that are in Latin- 1, even though the file is coded in UTF-8. That is, not all Unicode characters are allowed, only those that appear in Latin-1. John Sauter (John_Sauter@systemeyescomputerstore.com) -- get my PGP public key with gpg --locate-external-keys John_Sauter@systemeyescomputerstore.com