RCUDA: R Bindings for the CUDA Library for GPU Computing

library(RCUDA)
m = loadModule("inst/sampleKernels/set.ptx")
k = m$setValue_kernel

N = 1e7L
i = integer(N)
ci = copyToDevice(i)

# To launch more than N threads, we use 512 threads per block (the maximum
# assumed here) and a 256 x 128 grid: 256 * 128 * 512 = 16,777,216 threads.
# Would a different breakdown of the grid or the block be faster?
system.time(replicate(100, .cuda(k, ci, N, gridDim = c(256L, 128L), blockDim = c(512L))))

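# Same total thread count with a one-dimensional grid of 32768 blocks.
system.time(replicate(100, .cuda(k, ci, N, gridDim = c(32768L), blockDim = c(512L))))

# One-dimensional grid with a two-dimensional 32 x 16 block (still 512 threads per block).
system.time(replicate(100, .cuda(k, ci, N, gridDim = c(32768L), blockDim = c(32L, 16L))))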

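# Copy the contents of the device array back into R.
i = ci[]
head(i)
# Keep only the non-zero values.
done = i[i != 0]
length(done) + 1L
# Tabulate the differences between successive non-zero values.
table(diff(done))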
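
The thread-count reasoning in the comments above can be checked with plain R arithmetic, without a GPU. The following is a minimal sketch: `totalThreads`, `configs`, and `minBlocks` are hypothetical names introduced here for illustration and are not part of RCUDA. It assumes 512 threads per block, as in the script, and only confirms that each configuration launches at least N threads; it says nothing about which one runs faster.

```r
# Hypothetical helper (not part of RCUDA): number of threads launched
# for a given grid/block configuration.
totalThreads = function(gridDim, blockDim)
    prod(as.numeric(gridDim)) * prod(as.numeric(blockDim))

N = 1e7L

# The three configurations timed in the script.
configs = list(
    grid2d_block1d = list(gridDim = c(256L, 128L), blockDim = 512L),
    grid1d_block1d = list(gridDim = 32768L,        blockDim = 512L),
    grid1d_block2d = list(gridDim = 32768L,        blockDim = c(32L, 16L))
)

threads = sapply(configs, function(cfg) totalThreads(cfg$gridDim, cfg$blockDim))

# Does each configuration launch at least N threads?
covers = threads >= N

# Smallest number of 512-thread blocks that still covers N elements.
minBlocks = ceiling(N / 512)

print(threads)
print(covers)
print(minBlocks)
```

All three configurations launch the same number of threads, so any timing difference between them would come from how the grid and block are shaped, not from how much work is launched. The `minBlocks` value suggests a smaller grid could also cover N, provided the kernel guards against indices beyond N.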

duncantl/RCUDA documentation built on May 15, 2019, 5:26 p.m.

explorations/grid.R
In duncantl/RCUDA: R Bindings for the CUDA Library for GPU Computing
