|
Message-ID: <CAEeofciCeL09qNyr0=xZVpeZ24RVpLaJugd5rehBRPkak1GOvQ@mail.gmail.com> Date: Fri, 26 Feb 2021 12:45:06 +0000 From: Alexander Richardson <Alexander.Richardson@...cam.ac.uk> To: Szabolcs Nagy <nsz@...t70.net> Cc: musl@...ts.openwall.com Subject: Re: Potentially incorrect musl scalbn results on AArch64? On Fri, 26 Feb 2021 at 11:43, Szabolcs Nagy <nsz@...t70.net> wrote: > > * Alexander Richardson <Alexander.Richardson@...cam.ac.uk> [2021-02-25 10:27:11 +0000]: > > Hello, > > > > I've recently been tracking down testsuite failures on FreeBSD aarch64 > > and as part of this updated the FreeBSD scalbn* implementations to use > > the musl versions. However, two of the scalbn tests are failing on > > non-x86 architectures (https://godbolt.org/z/rax7f6) > > For example, scalbn(1, -1023) returns > > "1.1125369292536006915451e-308"/0x0.8p-1022 on x86, but if I run the > > tests on aarch64 I get 0 instead. > > i added musl list on cc > > i cannot reproduce your issue (i.e. the c code works for me on > all targets as is) > > one issue can be that if freebsd incorrectly sets the fpu on > aarch64 into flush-subnormals-to-zero mode. > > or a clang compiler bug (which we have seen before wrt floating > point optimizations, although not wrong results, only wrong fenv) > Thanks very much for that suggestion! Turns out that as of https://cgit.freebsd.org/src/commit/?id=65618fdda0f272a823e6701966421bdca0efa301 FreeBSD sets the flush-subnormals-to-zero flag on startup so this is a FreeBSD issue and I can confirm that the code works as expected when I clear the flag. I've submitted a possible fix to FreeBSD in https://reviews.freebsd.org/D28938. Alex > > I'm not particularly familiar with floating-point calculations, but it > > appears to me that this could be caused by x86's extended precision > > during calculations? > > If I cast the result to (long double) on aarch64 prior to the > > multiplication, I get the expected result on AArch64 (but that's > > obviously slow and won't work on architectures where long double == > > double). > > I've attached the current workaround, but I'm sure there is a better > > solution to this. Or possibly the test is incorrect and 0 is a > > perfectly valid result? > > > > Kind regards, > > Alex > > > > > > diff --git a/lib/msun/src/s_scalbn.c b/lib/msun/src/s_scalbn.c > > index 219cd8f0c989..0d344840862f 100644 > > --- a/lib/msun/src/s_scalbn.c > > +++ b/lib/msun/src/s_scalbn.c > > @@ -29,6 +29,19 @@ double scalbn(double x, int n) > > } > > u.i = (uint64_t)(0x3ff+n)<<52; > > x = y * u.f; > > +#if !defined(__amd64__) && !defined(__i386__) > > + /* > > + * x86 performs the multiplication with higher precision, but on > > + * non-x86 architectures we might get 0 instead of a tiny value. To work > > + * around this problem perform the multiplication with float128 (slow). > > + * TODO: This doesn't work on e.g. MIPS where long double == double. > > + */ > > + if (x == 0.) { > > + x = (long double)y * u.f; > > + /* fprintf(stderr, "\ttrying again: %a/%a\n", x, > > (double)((long double)y * u.f)); */ > > + return x; > > + } > > +#endif > > return x; > > }
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.