|
Message-ID: <20150713173144.GB12357@openwall.com> Date: Mon, 13 Jul 2015 20:31:44 +0300 From: Solar Designer <solar@...nwall.com> To: john-dev@...ts.openwall.com Subject: Re: PHC: Lyra2 on GPU Agnieszka, On Mon, Jul 13, 2015 at 04:33:24PM +0200, Agnieszka Bielec wrote: > 2015-07-12 16:55 GMT+02:00 Solar Designer <solar@...nwall.com>: > > Can you also try: > > > > t = 1 > > m = 80 > > c = 256 > > p = 1 > > > > This should be almost 2 MB. > > these tests are for 960m this time Of course, I meant this for a bigger card. > and I discovered now that the best number of lws also differ for > various costs but it isn't autotuned Why isn't it auto-tuned? > >> I said that I'm using local memory but I wanted to say __private , > >> sorry if caused confusion > > > > OK. I guess you're putting the current row (24 KB) in there? And when > > you were using global memory before, you had the current row fetched > > from and sent to global memory each time? > > it's not 24KB. I wrote that there are very small chunks an when I > tried 2x, 3x, 5x bigger - speed decreased. > but I'm sceptic about so huge cache in local memory because we have > e.g. 32KB for all lws number of threads and speed will decrease after > only if I change lws from 64 to 1 You're right indeed. So how much private memory are you using per Lyra2 instance? Alexander
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.