提交 0de7e570 authored 作者: Frederic's avatar Frederic

Move where we use the nvcc.fastmath flags from the CudaNdarrayType to the NVCC_compiler.

It seam more logical to associate it with the compiler then the Type. This also make cuda_ndarray.cu compiled with it.
上级 d912ebb9
......@@ -15,6 +15,7 @@ Since 0.5rc2
This mean that now we recompile all modules for each value of "nvcc.flags".
This do use change the default, but if you used this flags, it was ignored
for module already compiled.
* The Theano flag "nvcc.fastmath" is now also used for the cuda_ndarray.cu file.
=============
Release Notes
......
......@@ -81,6 +81,8 @@ class NVCC_compiler():
They will also be included in the "hard" part of the key module.
"""
flags = [flag for flag in config.nvcc.flags.split(' ') if flag]
if config.nvcc.fastmath:
flags.append('-use_fast_math')
return flags
# cuda_ndarray_cuh_hash = hash_from_file(
# os.path.join(os.path.split(__file__)[0], 'cuda_ndarray.cuh'))
......
......@@ -373,10 +373,7 @@ class CudaNdarrayType(Type):
return NVCC_compiler
def c_compile_args(self):
ret = []
if config.nvcc.fastmath:
ret.append('-use_fast_math')
return ret
return []
# Register CudaNdarrayType to the OutputGuard list of known types
......
Markdown 格式
0%
您添加了 0 到此讨论。请谨慎行事。
请先完成此评论的编辑!
注册 或者 后发表评论