i'm studying ptx , don't understand how cta (compute thread array) different cuda block.
are same thing? seems me (i'm @ beginning of ptx document) they're same
yes, ptx compute thread array conceptually , functionally same block in cuda or workgroup in opencl.
Comments
Post a Comment