提交 a4f7d118 authored 作者: Arnaud Bergeron's avatar Arnaud Bergeron

Fix kernel compilation under stupid cuda overloading rules.

上级 7b71d4c4
......@@ -683,7 +683,7 @@ class GpuEye(GpuKernelBase, Op):
def c_kernel_code(self):
return """
KERNEL void k(GLOBAL_MEM %(ctype)s *a, ga_size n, ga_size m) {
ga_size nb = min(n, m);
ga_size nb = (ga_size)min((ga_ulong)n, (ga_ulong)m);
for (ga_size i = LID_0; i < nb; i += LDIM_0) {
a[i*m + i] = 1;
}
......
Markdown 格式
0%
您添加了 0 到此讨论。请谨慎行事。
请先完成此评论的编辑!
注册 或者 后发表评论