|
Message-ID: <0100017c71ef9e36-29a22d3f-22a5-495a-9d9e-0eac76c3ff4e-000000@email.amazonses.com> Date: Tue, 12 Oct 2021 00:38:21 +0000 From: "(GalaxyMaster)" <galaxy@...nwall.com.au> To: musl@...ts.openwall.com Subject: Re: get/set*ent functions and real world applications Enrico, On Mon, Oct 11, 2021 at 02:18:47PM -0300, ??rico Nogueira wrote: > On Mon Oct 11, 2021 at 10:32 AM -03, (GalaxyMaster) wrote: > > > > The whole parssing of password and group entires is quite "dumb", in > > terms that > > it always expects perfectly valid input and produces always correct > > output, but > > I would argue that these functions should be a bit smarter and should > > not make > > assumptions about the validity of the input, i.e treat it as untrusted > > user > > input. > > There's a reason it's recommended that one only make changes to these > files using tools like the ones from the shadow suite. Things in /etc > can, theoretically, only be written to by root or at least trusted > users, so treating as entirely untrusted seems a bit over the top... Well, I prefer to treat anything external to my application as untrusted input, this saves time on troubleshooting weird issues later. In your statement above you put implicit trust into these abstract "trusted users", why guess what these could or could not do if we can handle the input in a safe and deterministic way? There was an argument on this list that libc should not treat/hide developer's mistakes and that's the reason you may see segfaults on a musl-based system more often (it is less forgiving to the poorly written code). Hoowever, I would argue that this particular case requires special treatment from libc since there is no way avoiding these functions if you are working with passwd/group based files. > It seems like both libraries are inconsistent in their own ways. glibc > skips malformed entries when some fields are missing, but fixes a > missing supplemental group entry. musl skips a missing supplemental > group entry, but "fixes" malformed entries with fields missing. > > Maybe striving for consistency by either always skipping or always > fixing entries seems like a more reasonable choice to me, maybe? I am not defending Glibc, but I find their approach to this matter consistent and expected from the common sense point of view. I would rather see musl aligned with it than try to convince everyone of yet another way of doing this. > > With put*() functions the situation a bit better, but still could be > > improved to achieve better compatibility. putpwent() will output > > '(null)' for any NULL pointer passed for the string arguments (due to > > direct call to fprintf() and the UB for that situation), while Glibc > > would output an empty string instead. > > It would seem the function returns EINVAL instead of outputting anything > at all, from my look at the code. I think that's reasonable behavior for > musl to implement, given how badly specified the function is. No, it is not behaving like that, it behaves exactly as I described: === galaxy@...l:~/musl-tests $ cat test-putpwent.c #include <sys/types.h> #include <pwd.h> #include <stdio.h> #include <stdlib.h> #include <errno.h> int main() { struct passwd *pw; FILE *fp; errno = 0; fp = fopen("test-putpwent.output", "w"); if (!fp || errno != 0) return errno; pw = malloc(sizeof(struct passwd)); pw->pw_name = pw->pw_passwd = pw->pw_gecos = pw->pw_dir, pw->pw_shell = NULL; putpwent(pw, fp); fclose(fp); return 0; } galaxy@...l:~/musl-tests $ ls -ld test-putpwent.output ls: cannot access 'test-putpwent.output': No such file or directory galaxy@...l:~/musl-tests $ ./test-putpwent galaxy@...l:~/musl-tests $ cat test-putpwent.output (null):(null):0:0:(null):(null):(null) galaxy@...l:~/musl-tests $ === On a Glibc system, the putpwent() call with pw->pw_name being NULL will fail to produce a record. If the pw->pw_name field is not NULL, then it will produce a record with the name followed by empty string fields, like "user::0:0:::", which is more expected in my opinion. > > Moreover, putpwent() is inconsistent with putgrent() -- the latter > > locks the file before writing and unlocks afterwards, while the former > > is just going ahead with fprintf(). I know these funnctions are thread > > unsafe, but this lack of locking makes putpwent() plainly dangerous on > > a multiuser system. > > I'm pretty sure these functions are always dangerous on multiuser > systems; flockfile(3) is FILE level locking, it just protects from other > threads touching a given FILE object. If you want to protect yourself > from multiple programs handling the file simultaneously, you need file > locking, such as done with fcntl(2). A fair point, I was not familiar with that function and somehow thought it was at the file level. I think that it is not libc's job to do file lockingi here, so I think we are fine in this regard. Thank you for explaining why there is an inconsistency, between putpwent() and putgrent() -- it makes sense. -- (GM)
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.