Committed by Matthew Willson
Improve performance of GpuCrossentropySoftmaxArgmax1HotWithBias significantly by implementing a TODO: launch more threads per row and do parallel sum and max reductions
e9d5e0ac
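The idea in the commit message (several threads per row, with parallel max and sum reductions) can be illustrated with a small sequential simulation. This is only a sketch and not the kernel from this commit: the names `row_reduce` and `softmax_xent_row`, the `n_threads` parameter, and the strided-partials-then-tree layout are assumptions chosen for illustration. It also assumes, as the op name suggests, that the per-row outputs are the negative log-likelihood, the softmax, and the argmax.

```python
import math


def row_reduce(row, op, n_threads=4):
    """Simulate n_threads cooperating on one non-empty row (hypothetical helper)."""
    # Step 1: "thread" t folds elements t, t + n_threads, ... into one partial,
    # mirroring a strided loop over the row on a GPU.
    active = min(n_threads, len(row))
    partial = []
    for t in range(active):
        acc = row[t]
        for i in range(t + n_threads, len(row), n_threads):
            acc = op(acc, row[i])
        partial.append(acc)
    # Step 2: pairwise tree combine of the partials, as would be done in
    # shared memory with a barrier between steps.
    count = len(partial)
    while count > 1:
        half = (count + 1) // 2
        for t in range(count - half):
            partial[t] = op(partial[t], partial[t + half])
        count = half
    return partial[0]


def softmax_xent_row(x, b, label, n_threads=4):
    """Cross-entropy, softmax and argmax for one row of logits x with bias b."""
    z = [xi + bi for xi, bi in zip(x, b)]
    # Max reduction first, so the exponentials below are numerically stable.
    m = row_reduce(z, max, n_threads)
    exps = [math.exp(zi - m) for zi in z]
    # Sum reduction over the shifted exponentials.
    s = row_reduce(exps, lambda a, c: a + c, n_threads)
    nll = -(z[label] - m - math.log(s))
    softmax = [e / s for e in exps]
    argmax = z.index(m)
    return nll, softmax, argmax
```

With one thread per row, both reductions take a number of steps proportional to the row length. With several threads per row, each thread handles only a slice of the row and the partial results are combined in a logarithmic number of steps, which is presumably where the reported speedup comes from.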