![]() |
|
Message-ID: <20250414110314.GF2724612@port70.net> Date: Mon, 14 Apr 2025 13:03:14 +0200 From: Szabolcs Nagy <nsz@...t70.net> To: Rich Felker <dalias@...c.org> Cc: Nick Wellnhofer <wellnhofer@...um.de>, musl@...ts.openwall.com Subject: Re: iconv EUC-JP support is incomplete * Rich Felker <dalias@...c.org> [2025-01-18 06:34:31 -0500]: > On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote: > > Hello, > > > > I ran into this after getting reports of libxml2 test failures on > > musl. EUC-JP also supports JIS X 0212 as third code set with > > characters encoded as three-byte sequences starting with 0x8F. It > > seems that musl's iconv implementation doesn't support these > > characters. > > > > Here's the work-around in our test suite: > > https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5 > > Thanks. Do you have any references for EUC-JP being expected to > include these extensions? Does the WHATWG definition specify them as > supported? fwiw i see 0x8F specified in https://encoding.spec.whatwg.org/#euc-jp the logic seems return a single code point for a 0x8F, 0xA1-0xFE, 0xA1-0xFE sequence.
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.