Skip to content
项目
群组
代码片段
帮助
当前项目
正在载入...
登录 / 注册
切换导航面板
P
pytensor
项目
项目
详情
活动
周期分析
仓库
仓库
文件
提交
分支
标签
贡献者
图表
比较
统计图
议题
0
议题
0
列表
看板
标记
里程碑
合并请求
0
合并请求
0
CI / CD
CI / CD
流水线
作业
日程
统计图
Wiki
Wiki
代码片段
代码片段
成员
成员
折叠边栏
关闭边栏
活动
图像
聊天
创建新问题
作业
提交
问题看板
Open sidebar
testgroup
pytensor
Commits
4c8d04ff
提交
4c8d04ff
authored
8月 08, 2014
作者:
Frederic
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
small doc fix.
上级
76811aa7
隐藏空白字符变更
内嵌
并排
正在显示
4 个修改的文件
包含
19 行增加
和
15 行删除
+19
-15
conv.txt
doc/library/tensor/nnet/conv.txt
+7
-8
blas.py
theano/sandbox/cuda/blas.py
+9
-5
test_conv_cuda_ndarray.py
theano/sandbox/cuda/tests/test_conv_cuda_ndarray.py
+0
-1
Conv3D.py
theano/tensor/nnet/Conv3D.py
+3
-1
没有找到文件。
doc/library/tensor/nnet/conv.txt
浏览文件 @
4c8d04ff
...
@@ -53,19 +53,18 @@ TODO: Give examples for how to use these things! They are pretty complicated.
...
@@ -53,19 +53,18 @@ TODO: Give examples for how to use these things! They are pretty complicated.
Also, there is restrictions on which shape are supported.
Also, there is restrictions on which shape are supported.
- :func:`GpuCorrMM <theano.sandbox.cuda.blas.GpuCorrMM>`
- :func:`GpuCorrMM <theano.sandbox.cuda.blas.GpuCorrMM>`
This is a GPU-only version of a correlation that computes correlations
This is a GPU-only version of a correlation that computes correlations
as `caffe
`(https://github.com/BVLC/caffe/blob/master/src/caffe/layers/conv_layer.cu).
as `caffe
<https://github.com/BVLC/caffe/blob/master/src/caffe/layers/conv_layer.cu>`_.
For each element in a batch, it first creates a
For each element in a batch, it first creates a
Toeplitz(http://en.wikipedia.org/wiki/Toeplitz_matrix) matrix in a cuda kernel.
`Toeplitz <http://en.wikipedia.org/wiki/Toeplitz_matrix>`_ matrix in a cuda kernel.
Then, it performs a `
gemm` call to multiply this Toeplitz matrix and the kernel.
Then, it performs a `
`gemm`` call to multiply this Toeplitz matrix and the kernel.
It need extra memory equal to the size of the Toeplitz matrix. Precisely,
It need extra memory equal to the size of the Toeplitz matrix. Precisely,
the dimensions of this 2D Toeplitz matrix is equal to
=
the dimensions of this 2D Toeplitz matrix is equal to
(no of channels * filter width * filter height, output width * output height)
.
``(no of channels * filter width * filter height, output width * output height)``
.
You can enable it for call to conv2d 2d by setting
'THEANO_FLAGS=optimizer_including=conv_gemm'
You can enable it for call to conv2d 2d by setting
``THEANO_FLAGS=optimizer_including=conv_gemm``
in your environment. This is not enabled by default because it
in your environment. This is not enabled by default because it
uses some extra memory.
uses some extra memory.
MM mean matrix multiply.
.. autofunction:: theano.tensor.nnet.conv.conv2d
.. autofunction:: theano.tensor.nnet.conv.conv2d
.. autofunction:: theano.tensor.nnet.Conv3D.conv3D
.. autofunction:: theano.tensor.nnet.Conv3D.conv3D
.. autofunction:: theano.tensor.nnet.conv3d2d.conv3d
.. autofunction:: theano.tensor.nnet.conv3d2d.conv3d
.. autofunction:: theano.sandbox.cuda.fftconv.conv2d_fft
.. autofunction:: theano.sandbox.cuda.fftconv.conv2d_fft
.. autofunction:: theano.sandbox.cuda.blas.GpuCorrMM
theano/sandbox/cuda/blas.py
浏览文件 @
4c8d04ff
...
@@ -499,16 +499,21 @@ gpu_ger_inplace = GpuGer(inplace=True)
...
@@ -499,16 +499,21 @@ gpu_ger_inplace = GpuGer(inplace=True)
class
GpuCorrMM
(
GpuOp
):
class
GpuCorrMM
(
GpuOp
):
"""
"""GPU correlation implementation using Matrix Multiply.
Author: Arjun Jain
Implement the caffe convolution
:note: It don't implement the grad. So you should use it by
enabling the Theano flag ``optimizer_including=conv_gemm`` and
use :func:`conv2d <theano.tensor.nnet.conv.conv2d>`.
"""
"""
def
__init__
(
self
,
border_mode
,
def
__init__
(
self
,
border_mode
,
subsample
=
(
1
,
1
),
subsample
=
(
1
,
1
),
pad
=
0
):
pad
=
0
):
"""
"""
:param border_mode: "valid" or "full"
:param border_mode: "valid" or "full"
:param subsample: not yet supported
:param subsample: the subsample operation applied on each output image.
Should be a tuple with 2 elements.
(sv, sh) is equivalent to GpuCorrMM(...)(...)[:,:,::sv, ::sh]
:param pad: not yet supported
:param pad: not yet supported
"""
"""
self
.
border_mode
=
border_mode
self
.
border_mode
=
border_mode
...
@@ -552,7 +557,6 @@ class GpuCorrMM(GpuOp):
...
@@ -552,7 +557,6 @@ class GpuCorrMM(GpuOp):
return
Apply
(
self
,
[
img
,
kern
],
[
CudaNdarrayType
(
broadcastable
)()])
return
Apply
(
self
,
[
img
,
kern
],
[
CudaNdarrayType
(
broadcastable
)()])
def
flops
(
self
,
inputs
,
outputs
):
def
flops
(
self
,
inputs
,
outputs
):
""" Useful with the hack in profilemode to print the MFlops"""
images
,
kerns
=
inputs
images
,
kerns
=
inputs
out
,
=
outputs
out
,
=
outputs
assert
images
[
1
]
==
kerns
[
1
]
assert
images
[
1
]
==
kerns
[
1
]
...
...
theano/sandbox/cuda/tests/test_conv_cuda_ndarray.py
浏览文件 @
4c8d04ff
...
@@ -640,7 +640,6 @@ def test_valid():
...
@@ -640,7 +640,6 @@ def test_valid():
mode
=
theano_mode
.
including
(
"conv_gemm"
)
mode
=
theano_mode
.
including
(
"conv_gemm"
)
version
=
[
-
1
]
version
=
[
-
1
]
# Remove case not supported
# Add tests with strided inputs by still square images and filters.
# Add tests with strided inputs by still square images and filters.
shapes
+=
get_shapes2
(
scales_img
=
(
2
,
2
),
img_stride
=
(
2
,
2
))
shapes
+=
get_shapes2
(
scales_img
=
(
2
,
2
),
img_stride
=
(
2
,
2
))
shapes
+=
get_shapes2
(
scales_kern
=
(
2
,
2
),
kern_stride
=
(
2
,
2
))
shapes
+=
get_shapes2
(
scales_kern
=
(
2
,
2
),
kern_stride
=
(
2
,
2
))
...
...
theano/tensor/nnet/Conv3D.py
浏览文件 @
4c8d04ff
...
@@ -40,7 +40,9 @@ from theano.gradient import grad_undefined
...
@@ -40,7 +40,9 @@ from theano.gradient import grad_undefined
#the output function is only defined when dr, dc, dt are natural numbers.
#the output function is only defined when dr, dc, dt are natural numbers.
class
Conv3D
(
theano
.
Op
):
class
Conv3D
(
theano
.
Op
):
""" 3D "convolution" of multiple filters on a minibatch (does not flip the kernel, moves kernel with a user specified stride) """
""" 3D `convolution` of multiple filters on a minibatch
:note: does not flip the kernel, moves kernel with a user specified stride
"""
def
__eq__
(
self
,
other
):
def
__eq__
(
self
,
other
):
return
type
(
self
)
==
type
(
other
)
return
type
(
self
)
==
type
(
other
)
...
...
编写
预览
Markdown
格式
0%
重试
或
添加新文件
添加附件
取消
您添加了
0
人
到此讨论。请谨慎行事。
请先完成此评论的编辑!
取消
请
注册
或者
登录
后发表评论