Follow @Openwall on Twitter for new release announcements and other news
[<prev] [next>] [<thread-prev] [thread-next>] [day] [month] [year] [list]
Message-ID: <BLU159-W1057F88597BD2F68C8ABF9A4840@phx.gbl>
Date: Sat, 21 Jan 2012 13:58:12 +0000
From: Alex Sicamiotis <alekshs@...mail.com>
To: <john-users@...ts.openwall.com>
Subject: RE: DES with OpenMP




> Date: Sat, 21 Jan 2012 15:15:50 +0400
> From: solar@...nwall.com
> To: john-users@...ts.openwall.com
> Subject: Re: [john-users] DES with OpenMP
> 
> On Sat, Jan 21, 2012 at 12:22:18AM +0000, Alex Sicamiotis wrote:
> > ... assuming (erroneously) that the code is identical... Apparently it has significant differences that speed up the OMP version and the nonOMP version is somewhat suboptimal (?).
> 
> In your case, yes.  In most other cases, it's the other way around.
> 
> Things should be more reasonable with GCC 4.3.
> 
> Alexander


Results are in (@4GHz / no x.org / 1.7.9 plain - no jumbo):

1. No-omp / ASM=0

GCC 4.3.4 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     4444K c/s real, 4444K c/s virtual
Only one salt:  4262K c/s real, 4262K c/s virtual

GCC 4.6.2 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     3370K c/s real, 3370K c/s virtual
Only one salt:  3265K c/s real, 3265K c/s virtual

ICC 12.1 (-fast -march=core2)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     4487K c/s real, 4487K c/s virtual
Only one salt:  4316K c/s real, 4316K c/s virtual

...so GCC 4.3.4 does great here but ICC slightly outperforms it.


2. OMP with NUM_THREADS=1

GCC 4.3.4 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     4131K c/s real, 4131K c/s virtual
Only one salt:  3780K c/s real, 3782K c/s virtual

GCC 4.6.2 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     4168K c/s real, 4168K c/s virtual
Only one salt:  3813K c/s real, 3813K c/s virtual

ICC 12.1 (-fast -march=core2)
Many salts:     4625K c/s real, 4625K c/s virtual
Only one salt:  4187K c/s real, 4187K c/s virtual

Almost identical performance for 4.3.4 / 4.6.2...


3. OMP with NUM_THREADS=2 

GCC 4.3.4 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     7932K c/s real, 3969K c/s virtual
Only one salt:  6268K c/s real, 3134K c/s virtual

GCC 4.6.2 (-O2 -march=nocona)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     7967K c/s real, 3983K c/s virtual
Only one salt:  6316K c/s real, 3158K c/s virtual

ICC 12.1 (-fast -march=core2)

Benchmarking: Traditional DES [128/128 BS SSE2-16]... DONE
Many salts:     8900K c/s real, 4450K c/s virtual 
Only one salt:  7153K c/s real, 3575K c/s virtual

Again, almost identical performance for 4.3.4 / 4.6.2... the biggest difference of 4.3.4 vs 4.6.2 in my case seems to be the non-OMP build which reaches almost ICC levels.

 		 	   		  

Powered by blists - more mailing lists

Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.