Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190227193613.GG21289@port70.net>
Date: Wed, 27 Feb 2019 20:36:13 +0100
From: Szabolcs Nagy <nsz@...t70.net>
To: musl@...ts.openwall.com
Subject: Re: FE Exception triggered by comparison

* Rich Felker <dalias@...c.org> [2019-02-27 12:26:41 -0500]:
> On Wed, Feb 27, 2019 at 08:14:07PM +0300, Alexander Monakov wrote:
> > On Wed, 27 Feb 2019, Rich Felker wrote:
> > 
> > > Are there reasons we should perhaps use the __builtin versions of
> > > these when __GNUC__ indicates they're available? I like our bit test
> > > versions we have now, and I think they're sufficiently efficient, but
> > > I'm open to changes if there's a good reason.
> > 
> > Well, it really depends on what one considers 'sufficiently efficient'.
> > Instead of comparing a register with itself and testing flags (2 instructions)
> > you get (for 'int f(double x){return isnan(x);}'):
> > 
> > f:
> >         movabsq $9223372036854775807, %rdx
> >         movq    %xmm0, %rax
> >         andq    %rdx, %rax
> >         movabsq $9218868437227405312, %rdx
> >         cmpq    %rdx, %rax
> >         seta    %al
> >         movzbl  %al, %eax
> >         ret
> > 
> > (note that movq %xmm0, %rax is going to be more costly than a normal
> > move as it crosses from fp to integer domain in the cpu)
> > 
> > I think musl bit test can be implemented more efficiently via right-shifting
> > the representation in %rax first, avoiding 64-bit immediates,
> 
> Or left-shifting rather than masking to get rid of the sign bit?
> That's all it's doing. I don't think right-shift is okay since losing
> any low bits would break the comparison.
> 
> > but even then
> > I'd say the "native" version is preferable.
> 
> I suspect this is probably true, though I also worry a bit whether
> there are archs where it does something inefficient or broken.

e.g. isnan is broken with -fsignaling-nan since it should
not signal but the ucomisd gcc generates does as discussed.

(although it's unlikely to matter much: we dont support
snan in all apis)

but gcc used to generate horrible code for fpclassify things,
nowadays it should be mostly fixed, i don't remember if
there were actual correctness bugs or just inefficient code.

> 
> Ideally the compiler would be able to recognize portable (within IEEE)
> patterns for floating point representation examination and optimize
> them if there's a more efficient way to be able to do it for a
> particular machine.
> 
> Rich

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.