- 09 Nov, 2016 6 commits
-
-
Committed by notoraptor
-
Committed by notoraptor
CPU op MaxAndArgmax rewritten so that it now takes the axes as an op parameter. Same update for GPU op GpuMaxAndArgmax.

max_and_argmax wrapper rewritten to fully check the axis before passing it to MaxAndArgmax.

Some other files have also been updated to ensure that the optimization which replaces the CPU op with the GPU op works well and that all tests involving MaxAndArgmax still pass after the updates.

GPU op rewritten to handle the latest libgpuarray update.

test_reduction rewritten. It now also tests 0-d arrays (scalars).

I have run the following tests successfully. The new libgpuarray update was downloaded, compiled and fully installed before running these tests.

nosetests -v theano/gpuarray/tests/test_opt.py
# There is 1 failure here, but it is not related to MaxAndArgmax:
# ERROR: theano.gpuarray.tests.test_opt.test_local_lift_abstractconv_gpu_shape
# RuntimeError: cuDNN is required for convolution and pooling
nosetests -v theano/tensor/nnet/tests/test_nnet.py
nosetests -v theano/tensor/tests/test_opt_uncanonicalize.py
nosetests -v theano/tensor/tests/test_basic.py
THEANO_FLAGS=floatX=float32,profile=True,profiling.n_ops=1000,profiling.n_apply=1000,profiling.ignore_first_call=True,profiling.destination=profiling.log nosetests --nocapture --verbose theano/gpuarray/tests/test_reduction.py

Prevent Flake8!
-
Committed by notoraptor
-
Committed by notoraptor
Tests have been rewritten so that a CPU computation and a GPU computation are always performed (separately) on the same input. This allows the tests to be run with Theano profiling flags, so that the execution times of MaxAndArgmax (CPU) and GpuMaxAndArgmax (GPU) can be compared. Some code related to MaxAndArgmax has also been modified in theano/tensor/basic.py, to make the API more uniform and to put most of the axis checking in the max_and_argmax wrapper instead of in the make_node functions of (Gpu)MaxAndArgmax.
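As a rough illustration of what centralizing the axis checking in a wrapper can look like, here is a minimal NumPy-only sketch. It is not the code from this commit: `normalize_axes` and `max_and_argmax_reference` are hypothetical names, and the approach of transposing the reduced axes to the end and flattening them is an assumption chosen for clarity.

```python
import numpy as np


def normalize_axes(axis, ndim):
    """Hypothetical helper: return a sorted tuple of unique, non-negative axes."""
    if axis is None:
        # Reduce over every axis (an empty tuple for 0-d input).
        return tuple(range(ndim))
    if isinstance(axis, (int, np.integer)):
        axis = [axis]
    axes = []
    for a in axis:
        a = int(a)
        if not -ndim <= a < ndim:
            raise ValueError("axis %d out of bounds for %d-d input" % (a, ndim))
        axes.append(a % ndim)  # map negative axes to their positive index
    if len(set(axes)) != len(axes):
        raise ValueError("duplicate axis in %r" % (axis,))
    return tuple(sorted(axes))


def max_and_argmax_reference(x, axis=None):
    """Hypothetical reference: max and flat argmax over the checked axes."""
    x = np.asarray(x)
    axes = normalize_axes(axis, x.ndim)
    kept = [i for i in range(x.ndim) if i not in axes]
    # Move the reduced axes to the end, then flatten them into one axis.
    transposed = x.transpose(kept + list(axes)) if x.ndim else x
    flat = transposed.reshape(transposed.shape[:len(kept)] + (-1,))
    return flat.max(axis=-1), flat.argmax(axis=-1)
```

With the checks done once up front, the op itself only ever sees a clean tuple of axes, which is the kind of separation the commit message describes.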
-
Committed by notoraptor
-
Committed by notoraptor
GpuMaxAndArgmax op is implemented in a separate file in theano/gpuarray. A test script has also been written. To integrate this op in Theano, an optimization has been added to theano/gpuarray/opt.py.
-
- 08 Nov, 2016 19 commits
-
-
Committed by Frédéric Bastien
Added checks for taps values.
-
Committed by Frédéric Bastien
Fix remaining tests in debugmode
-
Committed by Frédéric Bastien
Update to fix Python 3 error
-
Committed by Frédéric Bastien
Option to evaluate part of the max-and-argmax on cpu
-
Committed by Frédéric Bastien
Numpy imports
-
Committed by Frédéric Bastien
Fix h_softmax example data types
-
Committed by Pascal Lamblin
Fix grad of elemwise with boolean inputs.
-
Committed by Cesar Laurent
-
Committed by Pascal Lamblin
-
Committed by Pascal Lamblin
-
Committed by Pascal Lamblin
-
Committed by notoraptor
Commented code removed, and assertion added to check that a call to sys.getrefcount does not change the refcount.
-
Committed by notoraptor
Update test function test_pycuda_memory_to_theano() to fix Python 3 error. The refcount of a variable seems to differ between Python 2 and Python 3. So it seems better to store an initial refcount in a variable and then do the checks against that variable instead of comparing directly to constant values. Tests passed with Python 3 (Anaconda):
```
$ nosetests --verbose --nocapture theano/misc/tests/test_pycuda_theano_simple.py
Run pycuda only example to test that pycuda works. ... ok
Simple example with pycuda function and Theano CudaNdarray object. ... ok
theano.misc.tests.test_pycuda_theano_simple.test_pycuda_memory_to_theano ... 4
gpuarray ref count before creating a CudaNdarray 4
strides (20, 5, 1)
gpuarray ref count after creating a CudaNdarray 5
gpuarray ref count after deleting the CudaNdarray 4
ok

----------------------------------------------------------------------
Ran 3 tests in 0.297s

OK
```
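The baseline-refcount idea described above can be sketched without PyCUDA or a GPU. This is only an illustration under assumptions: `check_refcount_roundtrip` is a hypothetical name, a plain list stands in for the CudaNdarray that holds a reference to the buffer, and the behaviour relies on CPython's reference counting.

```python
import sys


def check_refcount_roundtrip():
    """Hypothetical check: compare refcounts to a recorded baseline, not constants."""
    payload = object()  # stands in for the gpuarray buffer
    # Record the starting refcount; its absolute value varies between
    # Python versions, so nothing below depends on it.
    baseline = sys.getrefcount(payload)
    # Calling sys.getrefcount again must not change the refcount.
    assert sys.getrefcount(payload) == baseline
    holder = [payload]  # stands in for an object keeping the buffer alive
    assert sys.getrefcount(payload) == baseline + 1
    del holder  # releasing the holder must restore the baseline
    assert sys.getrefcount(payload) == baseline
    return baseline
```

Only differences relative to `baseline` are asserted, which is what makes such a check portable across interpreter versions.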
-
Committed by kvmanohar22
-
Committed by kvmanohar22
-
Committed by kvmanohar22
-
Committed by Arnaud Bergeron
-
Committed by Arnaud Bergeron
-
Committed by Frédéric Bastien
#2801:subtensor-incsubtensor
-
- 07 Nov, 2016 2 commits
-
-
Committed by Jakub Sygnowski
-
Committed by Frédéric Bastien
Import numpy
-
- 05 Nov, 2016 13 commits
-
-
Committed by Pascal Lamblin
-
Committed by Pascal Lamblin
-
Committed by abergeron
[REG, CRASH] fix crash related to float16 introduced in gh-5077
-
Committed by Pascal Lamblin
-
Committed by Pascal Lamblin
DebugMode is too strict here. It happens rarely, so I decided to have a workaround for these specific tests. Moving the definition of values_eq_approx outside of TensorType made it possible.
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-
Committed by Saizheng Zhang
-