Use return_internal_type=True in compute_test_value of shared variables.
That way, we use the CudaNdarray, instead of transferring back and forth
to the GPU, and the Gpu ops have CudaNdarrays as inputs, not numpy
ndarrays or scalars.
正在显示
请
注册
或者
登录
后发表评论