1. 20 10月, 2016 8 次提交
    • notoraptor's avatar
      Added another optimization for [sd]gemm_, · 61980334
      notoraptor 提交于
      just by skipping alpha*matrix multiplication when alpha == 1.0.
      
      All tests succeed (with blas.ldflags empty) for:
      * test_abstract_conv.py in theano/tensor/nnet/tests/
      * test_blas.py and test_blas_scipy.py in theano/tensor/tests/
      
      I have modified theano/tensor/tests/test_blas_c.py
      to skip all tests that involves either gemv or ger functions.
      * Before the modifications, this file executed 44 tests and 34 were skipped.
      * After  the modifications, this file executes 44 tests and 29 are skipped.
      
      # $ theano-cache purge && THEANO_FLAGS=blas.ldflags= nosetests --verbose theano/tensor/tests/test_blas_c.py
      
      PS: I also tried to execute test_corr.py in
      theano/tensor/nnet/tests/test_corr.py after removing ldflags checking,
      but I get many errors in many of the tests (theano outputs not matches ref outputs).
      So for the moment I have let this file and I will continue investigations tomorrow.
      61980334
    • notoraptor's avatar
      Required corrections and modifications are done. · 9ca9474b
      notoraptor 提交于
      Recall: code is tested with:
      
      $ theano-cache purge && THEANO_FLAGS=blas.ldflags= nosetests theano/tensor/nnet/tests/test_abstract_conv.py:TestCorrConv2d
      
      NB:
      1) dgemm_ is never called in these tests. Only sgemm_ is called.
      2) All LDA,LDB,LDC are always in the set {M;N;K} during these tests.
      9ca9474b
    • notoraptor's avatar
      flake8 errors fixed · 6b3afd89
      notoraptor 提交于
      6b3afd89
    • notoraptor's avatar
      Added some simplifications. · 39a2b2d2
      notoraptor 提交于
      39a2b2d2
    • notoraptor's avatar
      I added an implementation of C-functions "sgemm_" and "dgemm_" that call Numpy… · 0cd7aa7b
      notoraptor 提交于
      I added an implementation of C-functions "sgemm_" and "dgemm_" that call Numpy C-API functions to perform matrix product when BLAS is explicitely disabled (with theano flag "blas.ldflags" set to empty).
      
      This can be tested with:
      THEANO_FLAGS=blas.ldflags= nosetests theano/tensor/nnet/tests/test_abstract_conv.py:TestCorrConv2d
      0cd7aa7b
    • Frédéric Bastien's avatar
      Merge pull request #5122 from gvtulder/f-faster-conv3d-tests · 96be471e
      Frédéric Bastien 提交于
      Somewhat smaller/faster 3d convolution tests
      96be471e
    • Gijs van Tulder's avatar
      cb0e84e0
    • Gijs van Tulder's avatar
      Make test_corr3d tests smaller/faster. · 10caddc2
      Gijs van Tulder 提交于
      10caddc2
  2. 19 10月, 2016 2 次提交
  3. 18 10月, 2016 6 次提交
  4. 17 10月, 2016 5 次提交
  5. 16 10月, 2016 1 次提交
  6. 15 10月, 2016 18 次提交