1. 27 Jun, 2017 1 commit
  2. 22 Jun, 2017 1 commit
  3. 21 Jun, 2017 2 commits
  4. 20 Jun, 2017 2 commits
  5. 16 Jun, 2017 2 commits
  6. 15 Jun, 2017 1 commit
  7. 14 Jun, 2017 5 commits
  8. 13 Jun, 2017 13 commits
  9. 12 Jun, 2017 1 commit
      cuda fix · fc36eefb
      Committed by xiaoqie
      All tests in test_nnet.py pass with CUDA.
      With OpenCL, only the fp32 tests in test_nnet.py pass; GpuFromHost doesn't work with fp16 or fp64.
      A larger work-item size doesn't improve performance.
      Added two local_barrier() calls. Strangely, the AMD card doesn't need them, but they are necessary for NVIDIA cards.
  10. 11 Jun, 2017 3 commits
  11. 10 Jun, 2017 9 commits