Try to fix how Null gradients are propagated in Scan
- propagate NullType when summing different gradient contributions
- identify which outer outputs of the scan node implementing the grad
should be Null
- replace them, and the corresponding inner variables, with zeros
of the appropriate size, so the scan node can be compiled and run.
- introduce a difference in undefined gradient because going through
a shared variable with updates, and because a NullType was returned
by the internal graph.
正在显示
请
注册
或者
登录
后发表评论