1. 14 Jul, 2017 5 commits
  2. 16 Jun, 2017 2 commits
  3. 15 Jun, 2017 1 commit
  4. 14 Jun, 2017 5 commits
  5. 13 Jun, 2017 13 commits
  6. 12 Jun, 2017 1 commit
    • cuda fix · fc36eefb
      Committed by xiaoqie
      All tests in test_nnet.py pass with CUDA.
      Only the fp32 tests in test_nnet.py pass with OpenCL; GpuFromHost doesn't work with fp16 or fp64.
      A larger work-item size doesn't improve performance.
      Add two local_barrier() calls. Strangely, the AMD card doesn't need them, but they are necessary on NVIDIA cards.
  7. 11 Jun, 2017 3 commits
  8. 10 Jun, 2017 10 commits