新一代面向密集计算的高性能处理器普遍采用分布式寄存器文件来支撑ALU阵列,并通过VLIW开发指令级并行。
Newly emerging high-performance processors for compute-intensive workloads generally use distributed register files to support their ALU arrays and exploit instruction-level parallelism (ILP) through VLIW.
CUDA使GPU流处理器阵列的性能得到充分发挥,极大地提高了并行计算程序的效率。
CUDA lets the GPU's stream processor array reach its full performance and greatly improves the efficiency of parallel computing programs.
首先介绍了LS SIMD阵列微处理器的三种并行性:数据并行、流水线并行和指令的并行执行。
This paper first introduces the three kinds of parallelism in the LS SIMD array microprocessor: data parallelism, pipeline parallelism, and parallel instruction execution.