Hi, Arya! I think it should be `int stride = blockDim.x/2`. https://github.com/aryagxr/cuda/blob/18ee1d1f6db9e095f3590c8bf7039fbb271938a5/layernorm/kernels/smem-layernorm.cu#L52
Hi, Arya!
I think it should be
int stride = blockDim.x/2.cuda/layernorm/kernels/smem-layernorm.cu
Line 52 in 18ee1d1