- 26 7月, 2011 21 次提交
-
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
I'm not sure why, and I don't have sufficient python knowledge to figure out. If anyone wants to give it a try please go ahead.
-
由 Razvan Pascanu 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
The just point to the existing implementation.
-
由 Frederic Bastien 提交于
-
由 James Bergstra 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
I'll try to add a test for it soon
-
由 Frederic Bastien 提交于
In DebugMode, preemptive modif in case some op check if they have been applied in the Compute_map when their thunk is executed.
-
由 Razvan Pascanu 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
-
由 Frederic Bastien 提交于
-
- 25 7月, 2011 19 次提交
-
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
-
由 Razvan Pascanu 提交于
This optimization has been moved in sandbox/cuda
-
由 Razvan Pascanu 提交于
I've added another version of the optimization for gpu scans.
-
由 Razvan Pascanu 提交于
This import used to be on top of the file, but that creates a cycling import.
-
由 Razvan Pascanu 提交于
preparing for the cython code, I used to grab the wrong inputs ( it should be the node inputs not self.inputs)
-
由 Razvan Pascanu 提交于
reconstruct_graph always replaces CudaNdarrays with TensorTypes, so in case of a gpu scan op, we can not use that logic to generate a hash ( I compute it locally when I am generating the gpu scan op).
-
由 Razvan Pascanu 提交于
The way it works, __init__ gets a lambda function that construct a Tensor Type. By default constructs Tensor Types, but move to gpu will replace it with a CudaNdarray constructor.
-
由 Razvan Pascanu 提交于
Notes on this change: 1) I needed to replicate the inplace optimization, to make a version that can handle ops moved on the gpu. I've also added a replica of the reconstruct_graph function from scan_utils that again does not replace CudaNdarray with TensorTypes ..
-
由 Olivier Delalleau 提交于
-
由 Frederic Bastien 提交于
-
由 Olivier Delalleau 提交于
-
由 Razvan Pascanu 提交于
Two changes. First one is to make profiling work with scan (scan was looking for a subclass of the ProfileStat object). Second, I made a more fixed length printing of timings which I believe personally is much better.
-
由 Razvan Pascanu 提交于
While we agreed that there might be a more principial way of solving this, this solution was fast to add and it is pretty efficient for now.
-
由 Razvan Pascanu 提交于
Is the shape of the data that should match.
-
由 Razvan Pascanu 提交于
In order for scan to run correctly inplace, it needs that none of the initial states are the same memory buffer.
-