|
Message-ID: <a93725a9e9523e5d06912a9f64709dad@smtp.hushmail.com> Date: Sun, 01 Apr 2012 20:17:55 +0200 From: magnum <john.magnum@...hmail.com> To: john-dev@...ts.openwall.com Subject: Re: fast hashes on GPU On 04/01/2012 07:46 PM, Solar Designer wrote: > On Sun, Apr 01, 2012 at 06:21:14PM +0200, magnum wrote: >> On 03/31/2012 02:19 PM, Solar Designer wrote: >>> ...Oh, I just got it to: >>> >>> Many salts: 38062K c/s real, 38062K c/s virtual >>> Only one salt: 26270K c/s real, 26270K c/s virtual >>> >>> by simply adding "#pragma unroll 64" before the last loop in >>> sha512_block(). >> >> Did you add just that very pragma line, > > Yes. There were similar lines for nearby loops, but somehow not for > that one yet. > >> or did you also add something >> like "#pragma OPENCL EXTENSION cl_nv_pragma_unroll : enable" somewhere >> as well? > > No. This was CUDA code, not OpenCL. Ah, yes. Lukas, or anyone, could you explain how to use pragma unroll in OpenCL? I don't seem to get any impact from it (whereas manual unrolling provides a significant speedup). magnum
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.