I wrote a small CUDA kernel with Numba, as follows:

from numba import cuda

e = 8

@cuda.jit
def my_kernel(e):
    # Absolute 2D thread index within the grid
    i, j = cuda.grid(2)

    if i <= e and j <= e:
        i**2 + j**2

# Host code

threadsperblock = 16, 16
blockspergrid = 32, 32
my_kernel[blockspergrid, threadsperblock](e)
print('ok')

When I convert it to a 3D version, it raises an error:

from numba import cuda

e = 8

@cuda.jit
def my_kernel(e):
    # Absolute 3D thread index within the grid
    i, j, k = cuda.grid(3)

    if i <= e and j <= e and k <= e:
        i**2 + j**2 + k**2

# Host code

threadsperblock = 16, 16, 16
blockspergrid = 32, 32, 32
my_kernel[blockspergrid, threadsperblock](e)
print('ok')

Error:
[1] Call to cuLaunchKernel results in CUDA_ERROR_INVALID_VALUE

How can I solve this? Thanks very much.
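
One thing worth checking here is the block size. A 16 x 16 block has 256 threads, while a 16 x 16 x 16 block has 4096, and CUDA devices cap the total number of threads per block (1024 on current GPUs, which is the value assumed below). A launch that exceeds the cap can fail with an invalid-value error like the one above. The following is a minimal sketch in plain Python, not part of the original code: `block_fits` and `MAX_THREADS_PER_BLOCK` are hypothetical names, and the real limit should be read from the device rather than hard-coded.

```python
from math import prod

# Assumption: 1024 threads per block, the usual cap on current CUDA GPUs.
# The actual value depends on the device and should be queried from it.
MAX_THREADS_PER_BLOCK = 1024

def block_fits(threadsperblock, limit=MAX_THREADS_PER_BLOCK):
    """Return True if the total thread count of the block is within the limit."""
    return prod(threadsperblock) <= limit

print(block_fits((16, 16)))      # 2D block from the question: 256 threads
print(block_fits((16, 16, 16)))  # 3D block from the question: 4096 threads
print(block_fits((8, 8, 8)))     # a smaller 3D block: 512 threads
```

If the block size turns out to be the problem, a smaller 3D block such as 8, 8, 8 would stay under the assumed cap, with blockspergrid increased if the same index range has to be covered.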

Robert Crovella
ZHANG Juenjie
