GpuConv - new refcounting rules, and rewrite of valid version=13 aka conv_patch_stack_reduce
- fixes errors in conv_patch_stack_reduce when the entire kernel doesn't fit
into shared memory
- lowers the shared memory requirement of conv_patch_stack_reduce
正在显示
请
注册
或者
登录
后发表评论