1. 25 7月, 2011 14 次提交
    • Razvan Pascanu's avatar
      import cuda in perform for allocating zeros on cuda · 875d0364
      Razvan Pascanu 提交于
      This import used to be on top of the file, but that creates a cycling
      import.
      875d0364
    • Razvan Pascanu's avatar
      bug in make_thunk · 21fac54d
      Razvan Pascanu 提交于
      preparing for the cython code, I used to grab the wrong inputs ( it should
      be the node inputs not self.inputs)
      21fac54d
    • Razvan Pascanu's avatar
      Fix bug in computing hash · 32638671
      Razvan Pascanu 提交于
      reconstruct_graph always replaces CudaNdarrays with TensorTypes, so in case
      of a gpu scan op, we can not use that logic to generate a hash ( I compute
      it locally when I am generating the gpu scan op).
      32638671
    • Razvan Pascanu's avatar
      Added new mechanism to construct output types that avoid importing Cuda. · c28d13a0
      Razvan Pascanu 提交于
      The way it works, __init__ gets a lambda function that construct a Tensor
      Type. By default constructs Tensor Types, but move to gpu will replace it
      with a CudaNdarray constructor.
      c28d13a0
    • Razvan Pascanu's avatar
      Moved the scan_to_gpu optimization in sandbox/cuda · 54e1d21e
      Razvan Pascanu 提交于
      Notes on this change: 1) I needed to replicate the inplace optimization, to
      make a version that can handle ops moved on the gpu. I've also added a
      replica of the reconstruct_graph function from scan_utils that again does
      not replace CudaNdarray with TensorTypes ..
      54e1d21e
    • Razvan Pascanu's avatar
      Enhanced profiling for scan · b785a858
      Razvan Pascanu 提交于
      Two changes. First one is to make profiling work with scan (scan was looking
      for a subclass of the ProfileStat object). Second, I made a more fixed
      length printing of timings which I believe personally is much better.
      b785a858
    • Razvan Pascanu's avatar
      Adding the cython version of scan · caef102d
      Razvan Pascanu 提交于
      While we agreed that there might be a more principial way of solving this,
      this solution was fast to add and it is pretty efficient for now.
      caef102d
    • Razvan Pascanu's avatar
      fix bug in equal computation · 95e010f5
      Razvan Pascanu 提交于
      Is the shape of the data that should match.
      95e010f5
    • Razvan Pascanu's avatar
      fixed inplace optimization · 17bc0839
      Razvan Pascanu 提交于
      In order for scan to run correctly inplace, it needs that none of the
      initial states are the same memory buffer.
      17bc0839
    • Razvan Pascanu's avatar
      removed comment lines · 667104c4
      Razvan Pascanu 提交于
      667104c4
    • Razvan Pascanu's avatar
      rewrote implementation that merges scan ops · f1670422
      Razvan Pascanu 提交于
      The old implementation used to result in stochastic order error in
      debugmode. After many attempts to solve it, I decided that it would be
      better and faster just to rewrite it. This new implementation does not
      suffer from any bug (i.e. all tests pass in debug mode).
      f1670422
    • Razvan Pascanu's avatar
      rename perform to execute · 9c6da45e
      Razvan Pascanu 提交于
      as Pascal suggested we should make it clear that scan doesn't have a normal
      perform anymore, and that one needs to use make_thunk.
      9c6da45e
    • Razvan Pascanu's avatar
      functions that return the different kind of arguments of scan · 917ce99d
      Razvan Pascanu 提交于
      After talking to Pascal we decided that having this kind of funcion can be
      quite useful, such that not every optimization does this splitting over and
      over again ( it is really easy to have bugs by either messing up the order
      or the count).
      917ce99d
    • Razvan Pascanu's avatar
      Add the id even on short names · 35dabf14
      Razvan Pascanu 提交于
      I think this commit got reverted ( I'm not sure when and why). Without it
      graphs produced with pydotprint might be extremely misleading.
      35dabf14
  2. 24 7月, 2011 6 次提交
  3. 23 7月, 2011 20 次提交