1. 13 Jun, 2017 4 commits
  2. 12 Jun, 2017 1 commit
    • cuda fix · fc36eefb
      xiaoqie committed
      All tests in test_nnet.py pass with CUDA.
      With OpenCL, only the fp32 tests in test_nnet.py pass. GpuFromHost doesn't work with fp16 or fp64.
      A larger work-item size doesn't improve performance.
      Add two local_barrier() calls. Strangely, the AMD card doesn't need them, but they are necessary on NVIDIA cards.
  3. 10 Jun, 2017 2 commits
  4. 09 Jun, 2017 2 commits
  5. 08 Jun, 2017 3 commits
  6. 07 Jun, 2017 3 commits
  7. 06 Jun, 2017 10 commits
  8. 05 Jun, 2017 3 commits
  9. 03 Jun, 2017 4 commits
  10. 02 Jun, 2017 2 commits
  11. 01 Jun, 2017 6 commits