nvidia
CUDA Matrix multiplication breaks for large matrices
I have the following matrix multiplication code, implemented using CUDA 3.2 and VS 2008. I am running on Windows Server 2008 R2 Enterprise with an Nvidia GTX 480. The following code works fine[more]
2023-01-22 23:07 Category: Q&A
CL_OUT_OF_RESOURCES for 2 millions floats with 1GB VRAM?
It seems like 2 million floats should be no big deal, only 8 MB of 1 GB of GPU RAM. I am able to allocate that much at times, and sometimes more than that, with no trouble. I get CL_OUT_OF_RESOURCES when[more]
2023-01-21 21:42 Category: Q&A
Example for rendering with Cg to a offscreen frame buffer object
I would like to see an example of rendering with nVidia Cg to an offscreen frame buffer object. The computers I have access to have graphics cards but no monitors (or X server). So I wa[more]
2023-01-21 04:54 Category: Q&A
CUDA kernels throw different results on 2 different GPUs(GeForce 8600M GT vs Quadro FX 770M)
I've been working on an AES CUDA application and I have a kernel which performs ECB encryption on the GPU. In order to ensure the logic of the algorithm is not modified when running in parallel I sen[more]
2023-01-21 02:49 Category: Q&A
How to mitigate host + device memory tranfer bottlenecks in OpenCL/CUDA
If my algorithm is bottlenecked by host to device and device to host memory transfers, is the only solution a different or revised algorithm? There are a couple things you can try t[more]
2023-01-21 00:30 Category: Q&A
Bitmap conversion using GPU
I don't know whether this is the right forum. Anyway, here is the question. In one of our applications we display medical images and, on top of them, some algorithm-generated bitmap. The real bitmap is a[more]
2023-01-20 12:39 Category: Q&A
Size of statically allocated shared memory per block with Compute Prof (Cuda/OpenCL)
In Nvidia's compute prof there is a column called "static private mem per work group" and its tooltip says "Size of statically allocated shared memory per block". My application shows that[more]
2023-01-19 18:08 Category: Q&A
Rationalizing what is going on in my simple OpenCL kernel in regards to global memory
const char programSource[] = "__kernel void vecAdd(__global int *a, __global int *b, __global int *c)"[more]
2023-01-19 03:34 Category: Q&A
Why aren't there bank conflicts in global memory for Cuda/OpenCL?
One thing I haven't figured out, and Google isn't helping me, is why it is possible to have bank conflicts with shared memory but not in global memory. Can there be bank conflicts with registers?[more]
2023-01-18 22:18 Category: Q&A
GPGPU, OpenCL, CUDA, ATI Stream
Please tell me which GPGPU technologies already exist and which hardware vendors implement GPGPU. I've been reading articles on various sites since morning and I've become confuse[more]
2023-01-17 11:36 Category: Q&A