|
Message-ID: <20200816025425.GA22555@openwall.com> Date: Sun, 16 Aug 2020 04:54:26 +0200 From: Solar Designer <solar@...nwall.com> To: john-users@...ts.openwall.com Subject: Re: Performance John in the cloud On Sat, Aug 15, 2020 at 11:06:13PM +0200, Solar Designer wrote: > on a c5a.24xlarge instance (96 vCPUs, AMD EPYC 7R32) BTW, here are some other benchmarks on that CPU, 96 threads: Benchmarking: descrypt, traditional crypt(3) [DES 256/256 AVX2]... (96xOMP) DONE Many salts: 407961K c/s real, 4254K c/s virtual Only one salt: 62797K c/s real, 654782 c/s virtual Benchmarking: md5crypt, crypt(3) $1$ (and variants) [MD5 256/256 AVX2 8x3]... (96xOMP) DONE Many salts: 4608K c/s real, 48002 c/s virtual Only one salt: 3801K c/s real, 39523 c/s virtual Benchmarking: bcrypt ("$2a$05", 32 iterations) [Blowfish 32/64 X3]... (96xOMP) DONE Speed for cost 1 (iteration count) of 32 Raw: 86832 c/s real, 902 c/s virtual Benchmarking: sha512crypt, crypt(3) $6$ (rounds=5000) [SHA512 256/256 AVX2 4x]... (96xOMP) DONE Speed for cost 1 (iteration count) of 5000 Raw: 64060 c/s real, 669 c/s virtual 48 threads works slightly better for descrypt: $ OMP_NUM_THREADS=48 john -test -form=descrypt Will run 48 OpenMP threads Benchmarking: descrypt, traditional crypt(3) [DES 256/256 AVX2]... (48xOMP) DONE Many salts: 418480K c/s real, 8718K c/s virtual Only one salt: 79034K c/s real, 1651K c/s virtual Not bad for one CPU chip. Just a few years ago these speeds at descrypt and md5crypt and sha512crypt were only achieved on GPU. Of course, modern high-end GPUs are a few times faster at these three hash types... but not at bcrypt. That speed at bcrypt is the highest I see so far for any one chip - we reach higher speeds on ZTEX boards, but those have four FPGA chips each, and NVIDIA Tesla V100 GPU doesn't reach the above speed (but gets very close). I guess an AMD EPYC with 128 threads (64 cores) will show even better speed; I just haven't had access to one yet. Of course, this isn't as energy-efficient as the FPGAs are, but it is a higher speed per chip. We'll need to support larger FPGAs to beat that. > c5a.24xlarge is currently priced at $1.56+/hour spot, $3.696 on-demand. > Our Bundle (beyond the free trial) costs $0.64/hour on this instance. Alexander
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.