• Xavier Bouthillier's avatar
    Merge pull request #5852 from Faruk-Ahmed/split_elemwise_addmul · 89aac420
    Xavier Bouthillier 提交于
    Adapt local_gpu_elemwise optimization of new gpuarray back-end to avoid number of inputs overflow with Elemwise<add,mul>. The current optimization was already splitting the input, but it was not using the method split_huge_add_or_mul because of new gpuarray lifter signature (see comment 
    https://github.com/Theano/Theano/pull/5852#discussion_r114145523)
    
    The unit test for large number of inputs was invalid because it was testing the old back-end (theano.sandbox.cuda). It is now adapted to gpuarray lifter optimization function. The number of settings tested is reduced to lower the computation time while still making sure we test at least one case with no number of inputs overflow and at least one case with number of inputs overflow.
    
    split_huge_add_or_mul() is made more general so it can be used if any case like Elemwise<add,mul> occurs elsewhere.
    89aac420
名称
最后提交
最后更新
..
GpuArray.pkl 正在载入提交数据...
__init__.py 正在载入提交数据...
config.py 正在载入提交数据...
rnn_support.py 正在载入提交数据...
test_abstractconv.py 正在载入提交数据...
test_basic_ops.py 正在载入提交数据...
test_blas.py 正在载入提交数据...
test_blocksparse.py 正在载入提交数据...
test_cgpukernelbase.py 正在载入提交数据...
test_dnn.py 正在载入提交数据...
test_elemwise.py 正在载入提交数据...
test_extra_ops.py 正在载入提交数据...
test_fft.py 正在载入提交数据...
test_gemmcorr.py 正在载入提交数据...
test_gemmcorr3d.py 正在载入提交数据...
test_gpuarray_multinomial_wo_replacement.pkl 正在载入提交数据...
test_linalg.py 正在载入提交数据...
test_multinomial.py 正在载入提交数据...
test_neighbours.py 正在载入提交数据...
test_nerv.py 正在载入提交数据...
test_nnet.py 正在载入提交数据...
test_opt.py 正在载入提交数据...
test_others.py 正在载入提交数据...
test_pickle.py 正在载入提交数据...
test_pool.py 正在载入提交数据...
test_reduction.py 正在载入提交数据...
test_rng_mrg.py 正在载入提交数据...
test_scan.py 正在载入提交数据...
test_subtensor.py 正在载入提交数据...
test_type.py 正在载入提交数据...
tstgpueye.c 正在载入提交数据...