The algorithm avoids out-of-bounds accesses by reading through a CUDA texture bound to linear memory, and it uses CUDA shared memory to reduce the number of global memory accesses.