Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <20220602142504.GY7074@brightrain.aerifal.cx>
Date: Thu, 2 Jun 2022 10:25:05 -0400
From: Rich Felker <dalias@...c.org>
To: Sascha Braun <sascha.braun.lpz@...glemail.com>
Cc: musl@...ts.openwall.com
Subject: Re: Problably Issue in name_from_dns // __res_msend_rc
 (lookup_name.c)

On Wed, Jun 01, 2022 at 11:52:46PM +0200, Sascha Braun wrote:
> I see - getaddrinfo always returns -2, EAI_NONAME in case of these rare
> errors.
> 
> The errno value (EAGAIN) came from my syscall, recvfrom, which does not
> have data in these cases (all packets indicated by 'poll' were read
> previously by recvfrom calls). (Btw., I did try to return 0 for these cases
> in my recvfrom implementation, did not help either.)

It's normal to see these EAGAINs. They are not errors. Returning 0
would be wrong; a zero-length read means EOF not EAGAIN.

I don't see an immediate cause visible for why you're getting
EAI_NONAME. Note that your debug output does not seem to show the
*length* recvfrom returned, so is it possible you're just showing old
data that was already in the buffer, when recvfrom really
malfunctioned and returned a short read? At the res_msend loop layer
musl would possibly accept this (like you seem to be seeing) but then
find no results later.

Rich



> 
> Thanks
> 
> Am Mi., 1. Juni 2022 um 23:30 Uhr schrieb Rich Felker <dalias@...c.org>:
> 
> > On Wed, Jun 01, 2022 at 10:35:52PM +0200, Sascha Braun wrote:
> > > Hi Rich,
> > >
> > > Thanks for your time. The EADDRINUSE is in reality an EAGAIN, this is a
> > > glitch in the Emscripten/WASI modification for musl (there are different
> > > values for errnos per implementation):
> > > __WASI_ERRNO_ADDRINUSE == -3 == EAI_AGAIN.
> > >
> > > My question currently breaks down to: Is it common for this
> > implementation
> > > of getaddrinfo to return EAGAIN sometimes (I experience 1:100), even if
> > > answers from DNS servers came in?
> > > If not, I need to investigate much further. I really could not find
> > issues
> > > with my sockets implementation, although your thought is of course very
> > > reasonable.
> >
> > OK, looking at your code again, part of your problem is that you're
> > misusing perror. You need to save the return value of getaddrinfo.
> > Unless it's EAI_SYSTEM, errno is meaningless and you need to use
> > gai_strerror to get a string for the error code (the value that was
> > returned) rather than inspecting errno.
> >
> > If getaddrinfo actually returned EAI_AGAIN, this probably means you
> > got an inconclusive result from one of the nameservers, probably
> > ServFail. However if your debug output is packet dumps, I'm not seeing
> > a ServFail there, and I'm not convinced it actually returned EAI_AGAIN
> > since you're not saving the value to check it. The value that happens
> > to be in errno is NOT the error getaddrinfo returned. Can you check
> > what it's actually returning?
> >
> >
> > > Am Mi., 1. Juni 2022 um 16:14 Uhr schrieb Rich Felker <dalias@...c.org>:
> > >
> > > > On Tue, May 31, 2022 at 12:24:19PM +0200, Sascha Braun wrote:
> > > > > Hi everyone,
> > > > >
> > > > > I'm implementing a socket protocol with similar works to Emscripten
> > - ok:
> > > > >
> > > > > Here's what I noticed
> > > > >
> > > > > - resolv.conf using multiple DNS servers
> > > > > options timeout:22 attempts:5
> > > > > nameserver 8.8.4.4
> > > > > nameserver 208.67.222.222
> > > > > nameserver 9.9.9.9
> > > > > nameserver 1.1.1.1
> > > > >
> > > > > - getaddrinfo with no hints, so that IPV6 and IPV4 is resolved
> > > > > void socketstest1_client_dnsonly(void) {
> > > > >
> > > > > struct addrinfo hints, * res;
> > > > > memset(&hints, 0, sizeof(hints));
> > > > > printf("begin lookup1...\n");
> > > > > if (getaddrinfo("www.web.de", "80", &hints, &res) != 0) {
> > > > > perror("getaddrinfo1");
> > > > > }
> > > > > printf("begin lookup2...\n");
> > > > > if (getaddrinfo("www.google.de", "80", &hints, &res) != 0) {
> > > > > perror("getaddrinfo2");
> > > > > }
> > > > > printf("begin lookup3...\n");
> > > > > if (getaddrinfo("www.google.com", "80", &hints, &res) != 0) {
> > > > > perror("getaddrinfo3");
> > > > > }
> > > > >
> > > > > }
> > > > >
> > > > > When repeating socketstest1_client_dnsonly often, it appears that
> > IPV6
> > > > > answers for IPV4 requests to another server [or vice-versa]
> > responses can
> > > > > get mixed up and getaddrinfo reports sometimes only an error where
> > none
> > > > is.
> > > > >
> > > > > Below is a console dump of my test where you see the critical
> > situation
> > > > > (end)
> > > > > 8.8.4.4:53 was queried, 9.9.9.9:53 responded for a (different IP
> > Proto),
> > > > > resulting in error getaddrinfo1: Address in use
> > > > >
> > > > > This happens in about 1/100 tests. When you specify IPV4 or IPV6 in
> > > > hints,
> > > > > this issue does NOT show up.
> > > >
> > > > Can you provide more information on how to interpret the console dump,
> > > > and in particular, which of your syscalls is returning the EADDRINUSE
> > > > error? I'm pretty sure this is just a bug in your socket
> > > > stack/emulation. It looks like it's coming from recvfrom, and
> > > > EADDRINUSE is not a valid error for recvfrom to produce.
> > > >
> > > > Rich
> > > >
> >

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.