Commits · 2d41daaf645cfb9d1dda4829bf9f0ec82769cd43 · testgroup / pytensor

27 Jun, 2017 3 commits
- We get error just with batch size > 2**16. · 2d41daaf
  Committed by Seton Steven Bocco on Jun 26, 2017
```
No need of big input.
```
  2d41daaf
- Update tests to make them more precise. · 0fa0aa36
  Committed by Seton Steven Bocco on Jun 20, 2017
  
  0fa0aa36
- Add tests to reproduce error reported in issue #5985. · 9123a835
  Committed by notoraptor on Jun 14, 2017
  
  9123a835
26 Jun, 2017 2 commits
- Merge pull request #6076 from lamblin/fix_no_cudnn · 9db9d791
  Committed by Frédéric Bastien on Jun 26, 2017
```
Allow fallback on GpuCorrMM if cuDNN is not there
```
  9db9d791
- Allow fallback on GpuCorrMM if cuDNN is not there · 0719742c
  Committed by Pascal Lamblin on Jun 26, 2017
  
  0719742c
23 Jun, 2017 1 commit
- Merge pull request #6062 from lamblin/faster_tests · e679cd14
  Committed by Frédéric Bastien on Jun 22, 2017
```
Use FAST_RUN for reference version.
```
  e679cd14
21 Jun, 2017 4 commits
- Merge pull request #6041 from xiaoqie/pool-fix2 · 0cff557b
  Committed by abergeron on Jun 21, 2017
```
Fix parameter types in ave_pool kernels, remove static_cast, add GLOBAL_MEM
```
  0cff557b
- Use FAST_RUN for reference version. · c5c417c9
  Committed by Pascal Lamblin on Jun 20, 2017
```
This should speed up the FAST_COMPILE buildbot
```
  c5c417c9
- Merge pull request #6057 from lamblin/fix_nomem_segfault · 9b044407
  Committed by Pascal Lamblin on Jun 20, 2017
```
Fail if output memory not allocated
```
  9b044407
- Merge pull request #6055 from gvtulder/f-elemwise_cgen_error-msg · 9ed454d3
  Committed by Pascal Lamblin on Jun 20, 2017
```
Fix elemwise ValueError message (printf formatting)
```
  9ed454d3
20 Jun, 2017 4 commits
- Only redefine atomicAdd on doubles for arch < 6 · fde1fdf1
  Committed by Pascal Lamblin on Jun 20, 2017
```
This fixes a compilation issue on Pascal GPUs.
```
  fde1fdf1
- Fail if output memory not allocated · 14a89b67
  Committed by Pascal Lamblin on Jun 19, 2017
```
instead of segfaulting like a barbarian.
```
  14a89b67
- Merge pull request #6056 from notoraptor/fix-dnn-config-flags · d5123351
  Committed by Pascal Lamblin on Jun 20, 2017
```
Update documentation and config flags about supported cuDNN algorithms.
```
  d5123351
- Update documentation and config flags about supported cuDNN algorithms. · f6dd8251
  Committed by notoraptor on Jun 19, 2017
  
  f6dd8251
19 Jun, 2017 1 commit
- Fix elemwise ValueError message (printf formatting). · 27b70e22
  Committed by Gijs van Tulder on Jun 19, 2017
  
  27b70e22
16 Jun, 2017 2 commits
- Merge pull request #6043 from nouiz/warn_float16 · c5cd87fa
  Committed by abergeron on Jun 15, 2017
```
Don't print float16 warning for ops that don't have c code
```
  c5cd87fa
- Merge pull request #6032 from slefrancois/jenkins_32_docker · de68703f
  Committed by Frédéric Bastien on Jun 15, 2017
```
jenkins buildbot with docker
```
  de68703f
15 Jun, 2017 2 commits
- Don't print float16 warning for ops that don't have c code · 0dd6ce24
  Committed by Frederic Bastien on Jun 15, 2017
  
  0dd6ce24
- version bump for cache · cb0c0883
  Committed by xiaoqie on Jun 15, 2017
  
  cb0c0883
14 Jun, 2017 6 commits
- Merge pull request #6030 from lamblin/fix_5036 · e34c0424
  Committed by Frédéric Bastien on Jun 14, 2017
```
Add lifter for CrossentropyCategorical1Hot and grad
```
  e34c0424
- Fix parameter types in ave_pool kernels, remove static_cast, add GLOBAL_MEM · 413f1b91
  Committed by xiaoqie on Jun 14, 2017
  
  413f1b91
- Merge pull request #5942 from botev/master · 7509fa75
  Committed by Pascal Lamblin on Jun 13, 2017
```
Added mode 'half' to Images2Neibs. Tests pass.
```
  7509fa75
- Merge pull request #6025 from nouiz/nanguardmode_int · 88b49770
  Committed by Pascal Lamblin on Jun 13, 2017
```
[ENH] Speed up nanguardmode by not checking *int* dtype
```
  88b49770
- Remove debugprint line from test · 45e6855f
  Committed by Pascal Lamblin on Jun 13, 2017
  
  45e6855f
- Merge pull request #6038 from notoraptor/update-config-doc-for-dnn-paths · 4e8eac00
  Committed by Frédéric Bastien on Jun 13, 2017
```
(small fix) Add doc for `dnn.include_path` and `dnn.library_path`.
```
  4e8eac00
13 Jun, 2017 13 commits
- flake8 · 0509589f
  Committed by Frederic Bastien on Jun 13, 2017
  
  0509589f
- Merge pull request #6034 from abergeron/split_long · 5d2dd362
  Committed by Frédéric Bastien on Jun 13, 2017
```
Split long-running test so that it helps travis not to give up
```
  5d2dd362
- Default messages more explicit. · 52b9c165
  Committed by notoraptor on Jun 13, 2017
  
  52b9c165
- Merge pull request #6012 from abergeron/fix_offset · e5ba1b08
  Committed by Frédéric Bastien on Jun 13, 2017
```
Fix offset problems in the new backend.
```
  e5ba1b08
- Update description for default values. · 1656a4aa
  Committed by notoraptor on Jun 13, 2017
  
  1656a4aa
- Merge pull request #6029 from lamblin/fix_useless_bitwise · 8dcc5fc6
  Committed by Frédéric Bastien on Jun 13, 2017
```
Fix issue in optimizations with bitwise operations
```
  8dcc5fc6
- Add doc for `dnn.include_path` and `dnn.library_path`. · b9819d30
  Committed by notoraptor on Jun 13, 2017
  
  b9819d30
- Add comments explaining the difference between offset_im and data_im_offset. · a762b617
  Committed by Arnaud Bergeron on Jun 12, 2017
  
  a762b617
- Merge pull request #6015 from xiaoqie/port-softmax · c0b7c96d
  Committed by abergeron on Jun 12, 2017
```
Port Softmax kernel to OpenCL
```
  c0b7c96d
- Change docstrings to comments. · 4182a1ad
  Committed by Arnaud Bergeron on Jun 12, 2017
  
  4182a1ad
- Split long test. · 4475869c
  Committed by Arnaud Bergeron on Jun 12, 2017
  
  4475869c
- Add c code to abs for uint* and bool · 88c9397d
  Committed by Frederic Bastien on Jun 12, 2017
  
  88c9397d
- Also skip bool and have a real gpuarray test. · 14c79a2a
  Committed by Frederic Bastien on Jun 12, 2017
  
  14c79a2a
12 Jun, 2017 1 commit
- cuda fix · fc36eefb
  Committed on Jun 12, 2017

  All tests in test_nnet.py pass with CUDA.
  Only fp32 tests in test_nnet.py pass with OpenCL. GpuFromHost doesn't work with fp16 or fp64.
  Larger work item size doesn't improve performance.
  Add 2 local_barrier(), it's strange that AMD card doesn't need these local_barrier(), but they are necessary for NVIDIA cards.

  fc36eefb
11 Jun, 2017 1 commit
- Merge remote-tracking branch 'origin/master' · 3805415a
  Committed by botev on Jun 11, 2017
  
  3805415a