Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20190629093347.GQ16415@port70.net>
Date: Sat, 29 Jun 2019 11:33:47 +0200
From: Szabolcs Nagy <nsz@...t70.net>
To: musl@...ts.openwall.com
Subject: Re: Conditional signal safety?

* Markus Wichmann <nullplan@....net> [2019-06-29 07:54:05 +0200]:
> Hi all,
> 
> at work yesterday I had to build an exception handler (a signal handler
> for SIGSEGV, SIGBUS, SIGILL, and SIGFPE). For my purposes, it was really
> convenient to just use dladdr() to find out at least what module and
> function PC and LR were pointing to when the exception happened, so I
> used that function.
> 
> Now, dladdr() is not on the list of signal safe functions, but then,
> dladdr() is a GNU extension. I wondered if it is signal safe and noticed
> that at least musl's implementation is, provided that dlopen() was not
> the function that was pre-empted. That got me thinking: Is there such a
> thing as "conditional signal safety"?
> 
> dladdr() takes a rwlock in read mode. At the moment, this means it can
> only block if the lock is write locked, which only dlopen() will ever
> do. dladdr() does nothing else that would impede signal safety. But of
> course, these are implementation details. What is actually defined about
> the interface?

note that the signals you handle (SIGSEGV, SIGBUS, SIGILL, SIGFPE)
are usually not asynchronous but happen at particular instructions.

dlopen does not hold locks while it runs user code, so you only
have issues if the dlopen code itself faults (which can happen e.g.
when invalid arguments are passed to it) so indeed in practice you
may get away with dladdr in the signal handler (e.g. if you know
dlopen won't fault).

in theory this does not help: the only concept the libc defines and
guarantees is async-signal-safety and dladdr is not as-safe so it
may do arbitrary non-as-safe operations, not just taking a dlopen
lock, and conversely arbitrary non-as-safe libc apis may take the
dlopen lock internally.

(btw this is why unwinding from a signal handler does not work
reliably even if there are async unwind tables in the binary: the
unwinder has to look up those tables for a particular elf module
the pc is in and this mechanism needs to synchronize with dlopen
which is currently not lock free and thus can deadlock.)

> 
> Ciao,
> Markus

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.