• notoraptor's avatar
    New update! I have integrated the required changes and · 24f96fa8
    notoraptor 提交于
    re-run the tests. All tests are passed as before, and now
    some tests are faster. The biggest gain on my computer is
    for theano/tensor/nnet/tests/test_corr3d.py, which goes from
    687 seconds before to 259 seconds now. For other tests, it's
    between 3 and 20 seconds.
    
    Now there is not copy nor memory allocation
    (apart from NumPy wrapping structures) when BETA == 0.
    
    I rewrote the OP(matrix) function so that it does not return
    new allocated data anymore. Instead it just creates a
    PyArrayObject wrapper around the matrix pointer with the right
    format: F-contiguous (nrow * ncol) by default, or
    C-contiguous (ncol * nrow) if matrix need to be transposed.
    
    I also rewrote the matrix sum function so that it requires
    scalars to multiply each passed matrix before addition.
    Now the function do: B = alpha*A + beta*B
    with alpha and beta as the scalars (both set to 1 if we just want
    B = A + B). Thus, there is now only one iteration over A and B,
    in which A and B are each read once, and B modified once.
    24f96fa8
名称
最后提交
最后更新
.jenkins 正在载入提交数据...
benchmark 正在载入提交数据...
bin 正在载入提交数据...
doc 正在载入提交数据...
theano 正在载入提交数据...
.gitignore 正在载入提交数据...
.mailmap 正在载入提交数据...
.travis.yml 正在载入提交数据...
CONTRIBUTING.md 正在载入提交数据...
DESCRIPTION.txt 正在载入提交数据...
EMAIL.txt 正在载入提交数据...
HISTORY.txt 正在载入提交数据...
ISSUE_TEMPLATE.md 正在载入提交数据...
LICENSE.txt 正在载入提交数据...
MANIFEST.in 正在载入提交数据...
NEWS.txt 正在载入提交数据...
NEWS_DEV.txt 正在载入提交数据...
README.txt 正在载入提交数据...
Theano.pyproj 正在载入提交数据...
Theano.sln 正在载入提交数据...
requirement-rtd.txt 正在载入提交数据...
setup.cfg 正在载入提交数据...
setup.py 正在载入提交数据...