|
Message-ID: <CANJ2NMPpUavfAWrJL6m=z3pbXFqV3yT7ZouKHGD_6ER0xbK87Q@mail.gmail.com> Date: Thu, 28 Jun 2012 00:33:07 +0800 From: myrice <qqlddg@...il.com> To: john-dev@...ts.openwall.com Subject: Re: async key transfers to GPU Here are the result with 2 streams. Stream #1 memcpyH2D first half of keys in set_key(). Stream #2 memcpyH2D second half of keys in beginning of crypt_all(). Then I split xsha512 kernel into 2 with 2 streams. Here is the profiler data: http://myrice.me/gsoc/async.png There is only 4.2% overlap. memcpyH2D is 2 time slow on bull. And two xsha512 kernels execution time are slight slow than one. So avoid data copy and generate password on GPU would be very helpful. It is time for me working it out quickly. Thanks myrice
Powered by blists - more mailing lists
Confused about mailing lists and their use? Read about mailing lists on Wikipedia and check out these guidelines on proper formatting of your messages.