New version of the C-code generator for Elemwise, enabling loop reordering.
- Code reviewed by Olivier B., thanks!
- Lots of comments added
- Unused code removed
- Looping in Elemwise is now done so the contiguous dimension of the output is
looped over in the inner-most loop. It does not change anything in the
non-inplace case (the output was C-contiguous), but can lead to really
better performances (at least 7x) in the inplace case.
正在显示
请
注册
或者
登录
后发表评论