Message-ID: <CAKGDhHVt0HhZ5OdMKxtk74janc47oVx4Y7y4XcoFZpSsJ9C3XQ@mail.gmail.com>
Date: Mon, 27 Jul 2015 09:47:52 +0200
From: Agnieszka Bielec <bielecagnieszka8@...il.com>
To: john-dev@...ts.openwall.com
Subject: Re: PHC: Lyra2 vs yescrypt benchmarks 2

Lyra2
CPU on well - 3792
GeForce GTX 960M - 629
AMD Tahiti - 2844
GeForce GTX TITAN - 1638
memory: 1.5 MB

yescrypt
CPU on well - 4736
GeForce GTX 960M - 419
AMD Tahiti - 914
GeForce GTX TITAN - 1050
memory: 1.5 MB

output (not everything):

Lyra2 CPU

a@...l:~/m/run$ ./john --test --format=lyra2
Will run 8 OpenMP threads
Benchmarking: Lyra2 [Blake2 AVX]... (8xOMP)
memory per hash : 1.50 MB
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 3792 c/s real, 474 c/s virtual

Lyra2 AMD Tahiti

[a@...er run]$ ./john --test --format=lyra2-opencl --v=4
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 431 c/s 431 rounds/s 592.868ms per crypt_all()!
gws: 512 834 c/s 834 rounds/s 613.804ms per crypt_all()+
gws: 1024 1484 c/s 1484 rounds/s 690.006ms per crypt_all()+
Local worksize (LWS) 64, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 1473 c/s real, 204800 c/s virtual

[a@...er run]$ LWS=32 ./john --test --format=lyra2-opencl --v=4
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 758 c/s 758 rounds/s 337.492ms per crypt_all()!
gws: 512 1471 c/s 1471 rounds/s 348.015ms per crypt_all()+
gws: 1024 2312 c/s 2312 rounds/s 442.737ms per crypt_all()+
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 2292 c/s real, 307200 c/s virtual

[a@...er run]$ LWS=16 ./john --test --format=lyra2-opencl --v=4
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 1011 c/s 1011 rounds/s 253.080ms per crypt_all()!
gws: 512 1999 c/s 1999 rounds/s 256.024ms per crypt_all()+
gws: 1024 2708 c/s 2708 rounds/s 378.131ms per crypt_all()+
Local worksize (LWS) 16, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 2648 c/s real, 153600 c/s virtual

[a@...er run]$ LWS=8 ./john --test --format=lyra2-opencl --v=4
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 1121 c/s 1121 rounds/s 228.188ms per crypt_all()!
gws: 512 2162 c/s 2162 rounds/s 236.790ms per crypt_all()+
gws: 1024 2913 c/s 2913 rounds/s 351.487ms per crypt_all()+
Local worksize (LWS) 8, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 2844 c/s real, 307200 c/s virtual

Lyra2 TITAN

[a@...er run]$ ./john --test --format=lyra2-opencl --v=4 --dev=5
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 275 c/s 275 rounds/s 928.547ms per crypt_all()!
gws: 512 550 c/s 550 rounds/s 929.863ms per crypt_all()+
Local worksize (LWS) 64, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 553 c/s real, 553 c/s virtual

[a@...er run]$ LWS=32 ./john --test --format=lyra2-opencl --v=4 --dev=5
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 275 c/s 275 rounds/s 930.093ms per crypt_all()!
gws: 512 561 c/s 561 rounds/s 911.119ms per crypt_all()!
Local worksize (LWS) 32, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 562 c/s real, 565 c/s virtual

[a@...er run]$ LWS=16 ./john --test --format=lyra2-opencl --v=4 --dev=5
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 476 c/s 476 rounds/s 537.400ms per crypt_all()!
gws: 512 948 c/s 948 rounds/s 539.760ms per crypt_all()+
Local worksize (LWS) 16, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 948 c/s real, 948 c/s virtual

[a@...er run]$ LWS=8 ./john --test --format=lyra2-opencl --v=4 --dev=5
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 722 c/s 722 rounds/s 354.485ms per crypt_all()!
gws: 512 1232 c/s 1232 rounds/s 415.283ms per crypt_all()+
Local worksize (LWS) 8, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 1228 c/s real, 1228 c/s virtual

[a@...er run]$ LWS=4 ./john --test --format=lyra2-opencl --v=4 --dev=5
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 894 c/s 894 rounds/s 286.124ms per crypt_all()!
gws: 512 1629 c/s 1629 rounds/s 314.276ms per crypt_all()+
Local worksize (LWS) 4, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 1638 c/s real, 1625 c/s virtual

Lyra2 960m

none@...e ~/Desktop/rrr/run $ LWS=8 ./john --test --format=lyra2-opencl --v=4
Benchmarking: Lyra2-opencl [Lyra2 OpenCL]...
memory per hash : 1.50 MB
Device 0: GeForce GTX 960M
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=131090 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=256 -DSALT_SIZE=64
Calculating best global worksize (GWS); max. 1s single kernel invocation.
gws: 256 609 c/s 609 rounds/s 420.031ms per crypt_all()!
Local worksize (LWS) 8, global worksize (GWS) 256
using different password for benchmarking
DONE
Speed for cost 1 (t) of 1, cost 2 (m) of 64, cost 3 (c) of 256, cost 4 (p) of 1
Raw: 629 c/s real, 629 c/s virtual

yescrypt CPU

a@...l:~/m/run$ ./john --test --format=yescrypt
Will run 8 OpenMP threads
Benchmarking: yescrypt [pwxform Salsa20/8 AVX]... (8xOMP)
memory per hash : 1.51 MB
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 4736 c/s real, 592 c/s virtual

yescrypt AMD Tahiti

[a@...er run]$ LWS=32 GWS=1024 ./john --test --format=yescrypt-opencl --v=4
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 32, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 882 c/s real, 102400 c/s virtual

[a@...er run]$ LWS=16 GWS=1024 ./john --test --format=yescrypt-opencl --v=4
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 16, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 914 c/s real, 102400 c/s virtual

[a@...er run]$ LWS=8 GWS=1024 ./john --test --format=yescrypt-opencl --v=4
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 0: Tahiti [AMD Radeon HD 7900 Series]
Local worksize (LWS) 8, global worksize (GWS) 1024
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 914 c/s real, 102400 c/s virtual

yescrypt TITAN

[a@...er run]$ ./john --test --format=yescrypt-opencl --v=4 --dev=5
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=32 -DSALT_SIZE=64 -DPLAINTEXT_LENGTH=125 -DHASH_SIZE=44 -DKEY_SIZE=125
Calculating best global worksize (GWS); max. 100s total for crypt_all()
gws: 256 231 c/s 231 rounds/s 1.104s per crypt_all()!
gws: 512 470 c/s 470 rounds/s 1.088s per crypt_all()!
gws: 1024 827 c/s 827 rounds/s 1.237s per crypt_all()+
gws: 2048 1016 c/s 1016 rounds/s 2.014s per crypt_all()+
Local worksize (LWS) 64, global worksize (GWS) 2048
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 1013 c/s real, 1013 c/s virtual

[a@...er run]$ LWS=32 ./john --test --format=yescrypt-opencl --v=4 --dev=5
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 5: GeForce GTX TITAN
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=65554 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=32 -DSALT_SIZE=64 -DPLAINTEXT_LENGTH=125 -DHASH_SIZE=44 -DKEY_SIZE=125
Calculating best global worksize (GWS); max. 100s total for crypt_all()
gws: 256 231 c/s 231 rounds/s 1.105s per crypt_all()!
gws: 512 470 c/s 470 rounds/s 1.088s per crypt_all()!
gws: 1024 866 c/s 866 rounds/s 1.181s per crypt_all()+
gws: 2048 1045 c/s 1045 rounds/s 1.958s per crypt_all()+
Local worksize (LWS) 32, global worksize (GWS) 2048
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 1050 c/s real, 1044 c/s virtual

yescrypt 960m

none@...e ~/Desktop/rrr/run $ LWS=16 ./john --test --format=yescrypt-opencl --v=4
Benchmarking: yescrypt-opencl [pwxform Salsa20/8 OpenCL]...
memory per hash : 1.51 MB
Device 0: GeForce GTX 960M
Options used: -I ./kernels -cl-mad-enable -cl-nv-verbose -D__GPU__ -DDEVICE_INFO=131090 -DDEV_VER_MAJOR=352 -DDEV_VER_MINOR=21 -D_OPENCL_COMPILER -DBINARY_SIZE=32 -DSALT_SIZE=64 -DPLAINTEXT_LENGTH=125 -DHASH_SIZE=44 -DKEY_SIZE=125
Calculating best global worksize (GWS); max. 100s total for crypt_all()
gws: 256 353 c/s 353 rounds/s 723.560ms per crypt_all()!
gws: 512 420 c/s 420 rounds/s 1.218s per crypt_all()+
gws: 1024 414 c/s 414 rounds/s 2.469s per crypt_all()
gws: 2048 412 c/s 412 rounds/s 4.965s per crypt_all()
Local worksize (LWS) 16, global worksize (GWS) 512
using different password for benchmarking
DONE
Speed for cost 1 (N) of 2048, cost 2 (r) of 6, cost 3 (p) of 1, cost 4 (t) of 0, cost 5 (g) of 0
Raw: 419 c/s real, 419 c/s virtual
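
For anyone who wants to compare the numbers quickly, here is a rough Python sketch (not part of john). It takes the summary c/s figures from this message, prints each GPU's speed as a fraction of the CPU speed, and cross-checks two auto-tune lines under the assumption that the reported c/s is simply GWS divided by the time of one crypt_all() call. The helper names gpu_vs_cpu and throughput are made up for this illustration.

```python
# Summary figures (c/s) copied from the message above.
lyra2 = {
    "CPU": 3792,
    "GeForce GTX 960M": 629,
    "AMD Tahiti": 2844,
    "GeForce GTX TITAN": 1638,
}
yescrypt = {
    "CPU": 4736,
    "GeForce GTX 960M": 419,
    "AMD Tahiti": 914,
    "GeForce GTX TITAN": 1050,
}


def gpu_vs_cpu(speeds):
    """Return each GPU's speed as a fraction of the CPU speed."""
    cpu = speeds["CPU"]
    return {dev: s / cpu for dev, s in speeds.items() if dev != "CPU"}


def throughput(gws, seconds_per_crypt_all):
    """Candidates per second implied by one crypt_all() call over gws
    work-items (assumption: c/s = GWS / time per call)."""
    return gws / seconds_per_crypt_all


for name, speeds in (("Lyra2", lyra2), ("yescrypt", yescrypt)):
    for dev, frac in gpu_vs_cpu(speeds).items():
        print(f"{name:9s} {dev:18s} {frac:.2f}x CPU")

# Cross-check against two auto-tune lines from the Tahiti Lyra2 log (LWS 64):
print(int(throughput(256, 0.592868)))   # the log reports 431 c/s
print(int(throughput(1024, 0.690006)))  # the log reports 1484 c/s
```

With these figures, the fastest GPU result for Lyra2 (Tahiti) is three quarters of the CPU speed, while for yescrypt every GPU stays below a quarter of the CPU speed.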