-
由 Rami Al-Rfou 提交于
New sparse gradient is returned, ADDSD is optimized in place, we have to write a flag to turn on optimization in case of updates
8551830d
New sparse gradient is returned, ADDSD is optimized in place, we have to write a flag to turn on optimization in case of updates