|
Message-ID: <20150820210313.GA29340@openwall.com> Date: Fri, 21 Aug 2015 00:03:13 +0300 From: Solar Designer <solar@...nwall.com> To: john-dev@...ts.openwall.com Subject: Re: PHC: Argon2 on GPU On Thu, Aug 20, 2015 at 10:40:16PM +0200, Agnieszka Bielec wrote: > 2015-08-20 22:34 GMT+02:00 Solar Designer <solar@...nwall.com>: > > On Thu, Aug 20, 2015 at 08:04:20PM +0200, Agnieszka Bielec wrote: > >> 2015-08-19 18:39 GMT+02:00 Solar Designer <solar@...nwall.com>: > >> > I think you may try working on ulong16 or ulong8 instead. I expect > >> > ulong8 to match the current GPU hardware best, but OTOH ulong16 makes > >> > more parallelism apparent to the OpenCL compiler and allocates it to one > >> > work-item. So please try both and see which works best. > >> > >> I created something using ulong8, it's almost not noticeable better > >> speed in my laptop but worse on super both cards, no idea if this is > >> what you wanted ( I think that not ), you can take a look on branch > >> vector8 > > > You should also > > use the wider vector type for the global memory references and in the > > kernel parameter list. > > was even more slower (on super, both cards) Where is the code? Slower now doesn't necessarily mean we're doing anything wrong - it might also mean we're not doing enough of it yet. And how much slower was it? Did you try re-tuning LWS and GWS? Alexander
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.