Commit 20e7ec9d, authored by f0k

make local_gpu_alloc_memset_0() keep the original broadcast pattern

Parent 0c3ab9a5
@@ -1699,6 +1699,10 @@ def local_gpualloc_memset_0(node):
             inp.data.size == 1 and
             (numpy.asarray(inp.data) == 0).all()):
             new_out = GpuAlloc(memset_0=True)(*node.inputs)
+            if new_out.type.broadcastable != node.outputs[0].type.broadcastable:
+                # force old broadcasting pattern; we must not change it here
+                new_out = tensor.patternbroadcast(new_out,
+                                                  node.outputs[0].broadcastable)
             return [new_out]
......
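
The guard added in this hunk can be illustrated without Theano. The following is a minimal sketch, not the library's implementation: `Var`, `pattern_broadcast`, and `replace_keeping_pattern` are hypothetical stand-ins that model a variable only by its name and broadcast pattern. It assumes that a rewrite must hand back an output whose broadcast pattern matches the output it replaces.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Var:
    """Hypothetical stand-in for a symbolic tensor variable."""
    name: str
    broadcastable: Tuple[bool, ...]


def pattern_broadcast(var: Var, pattern: Tuple[bool, ...]) -> Var:
    """Hypothetical analogue of tensor.patternbroadcast: force a pattern."""
    if len(pattern) != len(var.broadcastable):
        raise ValueError("pattern must have one entry per dimension")
    return Var("patternbroadcast(%s)" % var.name, tuple(pattern))


def replace_keeping_pattern(old_out: Var, new_out: Var) -> List[Var]:
    """Mirror the guard in the diff: rewrap only when the patterns differ."""
    if new_out.broadcastable != old_out.broadcastable:
        # Force the old broadcasting pattern onto the replacement.
        new_out = pattern_broadcast(new_out, old_out.broadcastable)
    return [new_out]


# The original output is broadcastable along its first dimension,
# while the rebuilt output is not.
old = Var("alloc", (True, False))
new = Var("alloc_memset_0", (False, False))

result = replace_keeping_pattern(old, new)[0]
print(result.broadcastable)
```

When the two patterns already agree, the replacement is returned untouched, so the extra wrapper appears only in the mismatching case.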