问题描述
我想对开普勒的体系结构有所了解,但是对我来说这没有意义.
I wanted to get a feel for Kepler's architecture, but it doesn't make sense to me.
如果warp是32个线程,并且其中4个得到调度/执行,则意味着128个内核正在使用,而64个处于空闲状态.在白皮书中,它谈到了有关独立指令的内容,那么为这些指令保留了64个内核吗?
If a warp is 32 threads, and 4 of them get scheduled/executed, that would mean 128 cores are in use and 64 are left idle. In the whitepaper it said something about independent instructions, so are the 64 cores reserved for those instructions?
如果是这样,有人可以举一个例子说明何时需要独立的指导吗?
If so, can someone give me an example of when an independent instruction would be needed?
推荐答案
开普勒的每个SM都具有192(SP)内核和4个warp调度程序.每个Warp调度程序都具有双重发行的能力,这意味着在某些情况下,它实际上可以在单个发行槽中从给定线程块(实际上是针对特定的Warp)发出2条指令.
Each SM in Kepler has 192 (SP) cores, and 4 warp schedulers. Each warp scheduler is capable of dual-issue which means that it can actually issue 2 instructions from a given threadblock (actually for a particular warp) in a single issue slot, under some circumstances.
其中一种情况是指令应该是独立的,这粗略地说意味着没有指令取决于另一条指令的输出.
One of these circumstances is that the instructions should be independent, which roughly speaking means that niether instruction depends on the output of the other instruction.
通过4个warp调度程序,每个调度程序都可能执行两次发出操作,理论上可以启动多达8个warp指令的工作.从理论上讲,这至少足以使192(SP)内核保持繁忙.
With 4 warp schedulers, each capable of possibly dual-issue, it's theoretically possible to launch work for up to 8 warp instructions. This is at least theoretically enough to keep 192 (SP) cores busy.
除了SP单元(通常称为核心")之外,SM还具有执行单元,因此实际的指令混合将确定在任何给定的发布时段中安排了哪些执行单元.
An SM has execution units besides the SP units that are commonly referred to as "cores", so the actual instruction mix will determine which execution units are scheduled in any given issue slot.
您可以在 GK110白皮书.
这篇关于为什么GK110有192个核心和4个扭曲?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!