opencl
OpenCL, TBB, OpenMP
I have implemented few normal looping applications in OpenMP, TBB and OpenCL. In all these applications, OpeCL gives far better performance than others too when I am only running it on CPU with no spe[详细]
2023-03-29 15:20 分类:问答OpenCL - Initializing program-scope variables from the host?
I have some general parameters declared as a global (__constant) struct, like so: typedef struct { int a;[详细]
2023-03-29 11:50 分类:问答OpenCL - How to I query for a device's SIMD width?
In CUDA, there is a concept of a warp, which is defined as 开发者_JAVA技巧the maximum number of threads that can execute the same instruction simultaneously within a single processing element.For NVID[详细]
2023-03-29 05:18 分类:问答Sparse Cholesky factorization algorithm for GPU [closed]
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.[详细]
2023-03-29 01:56 分类:问答How does the opencl command queue work, and what can I ask of it
I\'m working on an algorithm that does prettymuch the same operation a bunch of times. Since the operation consists of some linear algebra(BLAS), I thourght I would try using the GPU for this.[详细]
2023-03-27 03:24 分类:问答Is there a way to independently task and use heterogenous multi gpus in a windows 7 system?
Can I have two mixed chipset/generation AMD gpus in my desktop; a 6950 and 4870, and dedicate one gpu (4870) for opencl/gpgpu purposes only, eliminating the device from video output or display driving[详细]
2023-03-26 07:19 分类:问答How to implement a shared buffer?
I\'ve got one program which creates 3 worker programs. The preferable method of communication in my situation would be through a memory buffer which all four programs may access.[详细]
2023-03-25 22:19 分类:问答how much time does it take to make a call to opencl?
I\'m currently implementing an algorithm that does allot of linear algebra on small matrices and vectors. the code is fast but I\'m wondering if it would make sense to implement it on a gpgpu instead[详细]
2023-03-25 21:51 分类:问答Barriers in OpenCL
In OpenCL, my understanding is that you can use the barrier() function to synchronize threads in a work group.I do (generally) understand what they are for and when to use them.I\'m also aware that al[详细]
2023-03-25 10:06 分类:问答local mem on opencl hardware
I\'ve been wondering, is there a way to estimate the mount of shared mem on the different GPGPU\'s without going out and buying the cards?[详细]
2023-03-25 02:12 分类:问答