提交 · 89f584bc14bbc7df625fb571173e41d0baa0d460 · testgroup / pytensor

04 9月, 2015 3 次提交

Use the CUDA driver API for CUDA gpuarray operations. · 89f584bc

由提交于 1月 21, 2015

Instead of mixing the CUDA driver API and the runtime API in the generated code,
use only the CUDA driver API.
GPU programs for CUDA gpuarray operations (except conv operations) are now
generated as a string that is passed to the python interface of libgpuarray.
libgpuarray then generates a cubin bytearray, which is embedded in the
generated code.  The generated code then uses the CUDA driver
API via the C++ interface of libgpuarray to load and launch the GPU program.

This has at least two benefits:

(1) This approach does not use the nvcc offline compiler to compile the
    generated code into the shared library.  It uses the host compiler
    directly, which is likely to be faster.  Note that, for cubin generation,
    libgpuarray still uses the nvcc offline compiler, but an improvement is
    being made to use NVRTC and ptxas instead of nvcc, which should be, again,
    faster.
(2) Mixing the CUDA driver API and the runtime API is typically discouraged.

89f584bc

Merge pull request #3357 from Thrandis/gpu_reshape · 7852531c
由 Xavier Bouthillier 提交于 9月 03, 2015
```
Gpu reshape opt.
```
7852531c
Merge pull request #3358 from nouiz/tests · c42a18c6
由 abergeron 提交于 9月 03, 2015
```
Fix test and better error message
```
c42a18c6

03 9月, 2015 4 次提交
- Better error message · 2a2ae620
  由 Frederic 提交于 9月 02, 2015
  
  2a2ae620
- More info in tests error · 2fc09a03
  由 Frederic 提交于 9月 02, 2015
  
  2fc09a03
- make lib.cnmem=1 work more frequently · b9967384
  由 Frederic 提交于 9月 02, 2015
  
  b9967384
- Gpu reshape opt. · b8008818
  由 Cesar Laurent 提交于 9月 02, 2015
  
  b8008818
02 9月, 2015 8 次提交
- Fix test in FAST_COMPILE · a4bc6622
  由 Frederic 提交于 9月 02, 2015
  
  a4bc6622
- Merge pull request #3339 from nouiz/davikrehalt-master · dc13bfca
  由 Frédéric Bastien 提交于 9月 02, 2015
```
[CRASH] fix crash on Raspberry Pi 1
```
  dc13bfca
- Merge pull request #3348 from nouiz/CoulombeC-master · 46680247
  由 abergeron 提交于 9月 01, 2015
```
Small visual changes (blue color) (bis)
```
  46680247
- Merge pull request #3337 from nouiz/ignore_border · f9e65e0e
  由 abergeron 提交于 9月 01, 2015
```
Warn about pending ignore_border default value change.
```
  f9e65e0e
- Update cudnn v3 config flag · 616d7860
  由 Frederic 提交于 9月 01, 2015
  
  616d7860
- flake8 · 709403a2
  由 Frederic 提交于 9月 01, 2015
  
  709403a2
- use str as we use everywhere else · 81b563b1
  由 Frederic 提交于 9月 01, 2015
  
  81b563b1
- Fix crash in pydotprint · a9c44a89
  由 Frederic 提交于 9月 01, 2015
  
  a9c44a89
01 9月, 2015 7 次提交
- flake8 · 4c39c3f1
  由 Frederic 提交于 9月 01, 2015
  
  4c39c3f1
- Make the check for arm more generic · 9dc5d70b
  由 Frederic 提交于 8月 28, 2015
  
  9dc5d70b
- made compatible with Raspberry Pi 1 · b3fa118b
  由 Andy Jiang 提交于 8月 07, 2015
  
  b3fa118b
- pep8 · 5afe6f9c
  由 Frederic Bastien 提交于 9月 01, 2015
  
  5afe6f9c
- Update docstring and warning message following review · 79a353d1
  由 Frederic Bastien 提交于 9月 01, 2015
  
  79a353d1
- Merge pull request #3344 from nouiz/cycle · 87f8c5a1
  由 abergeron 提交于 8月 31, 2015
```
Deactivate merge of assert as it cause cycle in the graph
```
  87f8c5a1
- Merge pull request #3290 from mohammadpz/prod_dimshuffle_opt · ca8d85de
  由 carriepl 提交于 8月 31, 2015
```
Prod dimshuffle opt
```
  ca8d85de
31 8月, 2015 7 次提交
- Deactivate merge of assert as it cause cycle in the graph · d537addb
  由 Frederic 提交于 8月 31, 2015
  
  d537addb
- FusionOptimizer added for solve FAST_COMPILE issue · 85a7535d
  由 Mohammad Pezeshki 提交于 8月 31, 2015
  
  85a7535d
- more tests + comments · 59b8455c
  由 Mohammad Pezeshki 提交于 8月 27, 2015
  
  59b8455c
- Logical tests added · 0686893f
  由 Mohammad Pezeshki 提交于 8月 27, 2015
  
  0686893f
- compatible dimes are now fixed · 303cc80d
  由 Mohammad Pezeshki 提交于 8月 27, 2015
  
  303cc80d
- numerical tests added · ae53db81
  由 Mohammad Pezeshki 提交于 8月 17, 2015
  
  ae53db81
- optimization for prod added · 6aa9d88e
  由 Mohammad Pezeshki 提交于 8月 17, 2015
  
  6aa9d88e
29 8月, 2015 4 次提交
- Merge pull request #3267 from koningrobot/tensordot-as-dot · 856aa0b6
  由 Frédéric Bastien 提交于 8月 29, 2015
```
Implement batched_tensordot in terms of batched_dot
```
  856aa0b6
- Warn about pending ignore_border default value change. · 2fa005b3
  由 Frederic Bastien 提交于 8月 28, 2015
  
  2fa005b3
- Merge pull request #3334 from abergeron/delete_old_crap · 77f6b2be
  由 abergeron 提交于 8月 28, 2015
```
Delete old stuff
```
  77f6b2be
- Merge pull request #3288 from abergeron/nouiz_mixed · 7320e1b1
  由 abergeron 提交于 8月 28, 2015
```
Nouiz mixed
```
  7320e1b1
28 8月, 2015 7 次提交
- Fix some typos and phrasings. · 7f43e9f4
  由 Arnaud Bergeron 提交于 8月 28, 2015
  
  7f43e9f4
- Remove remnants of theano modules that were deleted in 0.7. · bcfe70c7
  由 Arnaud Bergeron 提交于 8月 28, 2015
  
  bcfe70c7
- Delete the old unmaintained copy of scan in sandbox. · 43d58c19
  由 Arnaud Bergeron 提交于 8月 28, 2015
  
  43d58c19
- Flake8 fix. · c79e0cfd
  由 Arnaud Bergeron 提交于 8月 17, 2015
  
  c79e0cfd
- hash tuple instead of xor them · 0b5ee2f1
  由 Frederic 提交于 8月 11, 2015
  
  0b5ee2f1
- Remove deprecated comment · 1bc1c0fa
  由 Frederic 提交于 8月 11, 2015
  
  1bc1c0fa
- If a reduce upcast the input, don't move it to the GPU. · 233e7813
  由 Frederic 提交于 8月 11, 2015
  
  233e7813