Message-ID: <CANJ2NMNcMW4_nBPgMj53iK0tBbtC7jq6sAwUxJyofNyA4au0YA@mail.gmail.com>
Date: Fri, 6 Apr 2012 19:24:01 +0800
From: myrice <qqlddg@...il.com>
To: john-dev@...ts.openwall.com
Subject: Re: fast hashes on GPU
On Tue, Apr 3, 2012 at 8:08 PM, Lukas Odzioba <lukas.odzioba@...il.com> wrote:
>
> You can try splitting that copy and overlapping it with kernel execution.
> This is possible on Fermi and newer cards.
>
Thanks for this. I changed each cudaMemcpy to cudaMemcpyAsync. Is that right?
However, I see no performance gain. Could you provide an example of how to do this?
Thanks!
Dongdong Li
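
For reference, a minimal sketch of the split-and-overlap idea suggested in the quoted text. Swapping cudaMemcpy for cudaMemcpyAsync alone usually changes nothing: the copies only overlap with kernel execution when the host buffers are page-locked (cudaMallocHost), the work is split into chunks, and each chunk is issued in its own non-default stream. Everything named here is an assumption made for illustration, not code from john: toy_kernel stands in for the real hashing kernel, and run_overlapped, the chunking scheme and the buffer layout are hypothetical.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Return -1 from the enclosing function on any CUDA error (sketch only:
// buffers are not released on the error path).
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t e_ = (call);                                      \
        if (e_ != cudaSuccess) {                                      \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(e_));                          \
            return -1;                                                \
        }                                                             \
    } while (0)

// Hypothetical stand-in for the real hashing kernel.
__global__ void toy_kernel(const unsigned int *in, unsigned int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = in[i] * 2654435761u + 1u;
}

// Processes n elements in nStreams chunks, one stream per chunk.
// Returns the number of wrong results, or -1 on a CUDA error.
int run_overlapped(int n, int nStreams)
{
    const int threads = 256;
    const int chunk = (n + nStreams - 1) / nStreams;
    const size_t bytes = (size_t)n * sizeof(unsigned int);
    unsigned int *h_in, *h_out, *d_in, *d_out;

    // Page-locked host memory: required for truly asynchronous copies.
    CHECK(cudaMallocHost((void **)&h_in, bytes));
    CHECK(cudaMallocHost((void **)&h_out, bytes));
    CHECK(cudaMalloc((void **)&d_in, bytes));
    CHECK(cudaMalloc((void **)&d_out, bytes));
    for (int i = 0; i < n; i++)
        h_in[i] = (unsigned int)i;

    std::vector<cudaStream_t> streams(nStreams);
    for (int s = 0; s < nStreams; s++)
        CHECK(cudaStreamCreate(&streams[s]));

    // Copy-in, kernel and copy-out of one chunk are ordered within its
    // stream; different streams are free to overlap with each other.
    for (int s = 0; s < nStreams; s++) {
        int off = s * chunk;
        int cnt = (n - off < chunk) ? n - off : chunk;
        if (cnt <= 0)
            break;
        size_t cbytes = (size_t)cnt * sizeof(unsigned int);
        CHECK(cudaMemcpyAsync(d_in + off, h_in + off, cbytes,
                              cudaMemcpyHostToDevice, streams[s]));
        toy_kernel<<<(cnt + threads - 1) / threads, threads, 0, streams[s]>>>(
            d_in + off, d_out + off, cnt);
        CHECK(cudaGetLastError());
        CHECK(cudaMemcpyAsync(h_out + off, d_out + off, cbytes,
                              cudaMemcpyDeviceToHost, streams[s]));
    }
    CHECK(cudaDeviceSynchronize());  // wait for every stream to finish

    int bad = 0;
    for (int i = 0; i < n; i++)
        if (h_out[i] != h_in[i] * 2654435761u + 1u)
            bad++;

    for (int s = 0; s < nStreams; s++)
        CHECK(cudaStreamDestroy(streams[s]));
    CHECK(cudaFree(d_in));
    CHECK(cudaFree(d_out));
    CHECK(cudaFreeHost(h_in));
    CHECK(cudaFreeHost(h_out));
    return bad;
}

int main(void)
{
    int bad = run_overlapped(1 << 22, 4);
    printf("mismatches: %d\n", bad);
    return bad != 0;
}
```

Whether this helps depends on the ratio of transfer time to kernel time and on how many copy engines the card has, so it is worth timing several chunk counts. With pageable (malloc'ed) host memory the async copies generally do not overlap with kernel execution, which may explain seeing no gain.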