    notoraptor committed · 61980334
    Added another optimization for [sd]gemm_,
    just by skipping the alpha*matrix multiplication when alpha == 1.0.
    
    All tests succeed (with blas.ldflags empty) for:
    * test_abstract_conv.py in theano/tensor/nnet/tests/
    * test_blas.py and test_blas_scipy.py in theano/tensor/tests/
    
    I have modified theano/tensor/tests/test_blas_c.py
    to skip all tests that involve either the gemv or the ger functions.
    * Before the modifications, this file executed 44 tests and 34 were skipped.
    * After  the modifications, this file executes 44 tests and 29 are skipped.
    
    # $ theano-cache purge && THEANO_FLAGS=blas.ldflags= nosetests --verbose theano/tensor/tests/test_blas_c.py
    
    PS: I also tried to run test_corr.py
    (theano/tensor/nnet/tests/test_corr.py) after removing the ldflags check,
    but many of the tests fail (Theano outputs do not match the reference outputs).
    So for the moment I have left this file as it is, and I will continue investigating tomorrow.
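
The shortcut described at the top of this message can be sketched as follows. This is a minimal illustration in pure Python, not the code from this commit: the function name `gemm_reference` and the nested-list matrix representation are assumptions made for the example. It computes `alpha * (A x B) + beta * C` and skips the scaling multiply when `alpha` is exactly 1.0.

```python
def gemm_reference(alpha, a, b, beta, c):
    """Return alpha * (a x b) + beta * c for matrices stored as nested lists.

    Hypothetical helper for illustration only; it is not part of Theano.
    """
    m, k, n = len(a), len(b), len(b[0])
    out = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            # Plain dot product of row i of a with column j of b.
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            # The optimization: multiplying by 1.0 changes nothing,
            # so the scaling step is skipped in that case.
            if alpha != 1.0:
                acc *= alpha
            out[i][j] = acc + beta * c[i][j]
    return out


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    zeros = [[0, 0], [0, 0]]
    print(gemm_reference(1.0, a, b, 0.0, zeros))
```

Skipping the multiply leaves the result unchanged and saves one multiplication per output element whenever the caller passes `alpha == 1.0`.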
Name
Last commit
Last update
..
nnet
signal
tests
__init__.py
alt_dgemm.c
alt_sgemm.c
basic.py
blas.py
blas_c.py
blas_headers.py
blas_scipy.py
elemwise.py
elemwise_cgen.py
extra_ops.py
fft.py
fourier.py
inplace.py
io.py
nlinalg.py
opt.py
opt_uncanonicalize.py
raw_random.py
shared_randomstreams.py
sharedvar.py
slinalg.py
sort.py
subtensor.py
type.py
type_other.py
utils.py
var.py
xlogx.py