提交 f2c202db authored 作者: Pascal Lamblin's avatar Pascal Lamblin

Use return_internal_type=True in compute_test_value of shared variables.

That way, we use the CudaNdarray, instead of transferring back and forth to the GPU, and the Gpu ops have CudaNdarrays as inputs, not numpy ndarrays or scalars.
上级 253dde5c
...@@ -335,7 +335,7 @@ class PureOp(object): ...@@ -335,7 +335,7 @@ class PureOp(object):
if isinstance(ins, graph.Constant): if isinstance(ins, graph.Constant):
input_vals.append(ins.value) input_vals.append(ins.value)
elif isinstance(ins,SharedVariable): elif isinstance(ins,SharedVariable):
input_vals.append(ins.get_value(borrow=True)) input_vals.append(ins.get_value(borrow=True, return_internal_type=True))
elif isinstance(ins,graph.Variable) and hasattr(ins.tag, 'test_value'): elif isinstance(ins,graph.Variable) and hasattr(ins.tag, 'test_value'):
# ensure that the test value is correct # ensure that the test value is correct
input_vals.append(ins.type.filter(ins.tag.test_value)) input_vals.append(ins.type.filter(ins.tag.test_value))
......
Markdown 格式
0%
您添加了 0 到此讨论。请谨慎行事。
请先完成此评论的编辑!
注册 或者 后发表评论