Message-ID: <7174a14ccd6fe3099723d792e418c22a@smtp.hushmail.com>
Date: Sun, 02 Nov 2014 22:19:03 +0100
From: magnum <john.magnum@...hmail.com>
To: john-dev@...ts.openwall.com
Subject: Re: descrypt speed

On 2014-11-02 18:59, Royce Williams wrote:
> On Thu, Oct 30, 2014 at 9:33 PM, magnum <john.magnum@...hmail.com> wrote:
>> On 2014-10-31 06:02, Royce Williams wrote:
>>> On a GTX970, shouldn't this be sm_52?
>>
>> You can force this by editing NVCC_FLAGS in Makefile. Add something like
>> "-arch sm_50" (or 52). But I doubt it will make much difference and it will
>> only affect CUDA formats.
>
> In my system with both an sm_20 and an sm_50 card, when running solely
> descrypt-opencl (not CUDA), the ptxas info shows that sm_50 is involved in
> some way. Is this cosmetic?

OpenCL compiles a suitable (different) kernel for each device and you do
not have to configure anything.

You can configure CUDA to compile for several archs at once, see
"nvcc --help". It is something like
"-gencode arch=compute_20,code=sm_20 -gencode arch=compute_50,code=sm_50"
(added to NVCC_FLAGS instead of just -arch sm_xx). The most suitable one
will be picked at runtime.

magnum
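
For illustration only, here is a rough shell sketch of how the -gencode list mentioned above could be assembled for a set of archs. The `archs` variable and the loop are assumptions made up for this example and are not part of the John the Ripper Makefile; only the "-gencode arch=compute_NN,code=sm_NN" syntax comes from the message. Check "nvcc --help" for what your toolkit version accepts.

```shell
# Hypothetical list of compute capabilities to build for (sm_20 and sm_50,
# matching the two cards discussed in this thread).
archs="20 50"

# Build one -gencode pair per arch.
flags=""
for a in $archs; do
  flags="$flags -gencode arch=compute_$a,code=sm_$a"
done

# Print a line that could be pasted into the Makefile in place of "-arch sm_xx".
echo "NVCC_FLAGS +=$flags"
```

The printed line is meant to be copied into the Makefile by hand. Whether the Makefile appends to or assigns NVCC_FLAGS is not stated in the thread, so adjust the "+=" to match what is already there.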