-
由 Pascal Lamblin 提交于
- use KERNEL macro - do not use `%(fail)s` on GPU to avoid returning prematurely from kernel - have special block for y == 0 (and reorder other ones) - keep calling // 0 or % 0 on GPU, even though cuda will not fail
d543d5d2
- use KERNEL macro - do not use `%(fail)s` on GPU to avoid returning prematurely from kernel - have special block for y == 0 (and reorder other ones) - keep calling // 0 or % 0 on GPU, even though cuda will not fail