Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [day] [month] [year] [list]
Message-ID: <20250414110314.GF2724612@port70.net>
Date: Mon, 14 Apr 2025 13:03:14 +0200
From: Szabolcs Nagy <nsz@...t70.net>
To: Rich Felker <dalias@...c.org>
Cc: Nick Wellnhofer <wellnhofer@...um.de>, musl@...ts.openwall.com
Subject: Re: iconv EUC-JP support is incomplete

* Rich Felker <dalias@...c.org> [2025-01-18 06:34:31 -0500]:

> On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote:
> > Hello,
> > 
> > I ran into this after getting reports of libxml2 test failures on
> > musl. EUC-JP also supports JIS X 0212 as third code set with
> > characters encoded as three-byte sequences starting with 0x8F. It
> > seems that musl's iconv implementation doesn't support these
> > characters.
> > 
> > Here's the work-around in our test suite:
> > https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5
> 
> Thanks. Do you have any references for EUC-JP being expected to
> include these extensions? Does the WHATWG definition specify them as
> supported?

fwiw i see 0x8F specified in

https://encoding.spec.whatwg.org/#euc-jp

the logic seems return a single code point
for a 0x8F, 0xA1-0xFE, 0xA1-0xFE sequence.

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.