cuda - Grid of thread blocks and Multiprocessor -

June 15, 2014

the cuda architecture built around scalable array of multithreaded streaming multiprocessors (sms). when cuda program on host cpu invokes kernel grid, blocks of grid enumerated , distributed multiprocessors available execution capacity. threads of thread block execute concurrently on 1 multiprocessor, , multiple thread blocks can execute concurrently on 1 multiprocessor. thread blocks terminate, new blocks launched on vacated multiprocessors.

does mean if have video card of 2 multiprocessor x n-cuda cores , if launch kernel like

mykernel<<<1,n>>>(sth);

one of multiprocessors idle, since i'm launching single block of n threads?

you correct.

in currect cuda architectures, block ever scheduled , run on single multiprocessor. if run 1 block on device more 1 multiprocessor, 1 of multiprocessors idle.

Search This Blog

New Mian

cuda - Grid of thread blocks and Multiprocessor -

Comments

Post a Comment

Popular posts from this blog

android - java.net.UnknownHostException(Unable to resolve host “URL”: No address associated with hostname) -

jquery - How can I dynamically add a browser tab? -

keyboard - C++ GetAsyncKeyState alternative -