testgroup / pytensor / Commits
Commit 7409025b

authored Apr 01, 2009 by Pascal Lamblin

more doc formatting

Parent: 5c231a77

Showing 8 changed files with 163 additions and 151 deletions
doc/advanced/features.txt                  +4   -0
doc/advanced_tutorial/cop.txt              +12  -12
doc/advanced_tutorial/ctype.txt            +59  -52
doc/advanced_tutorial/graphstructures.txt  +1   -1
doc/advanced_tutorial/inplace.txt          +13  -13
doc/advanced_tutorial/op.txt               +28  -28
doc/advanced_tutorial/optimization.txt     +22  -21
doc/advanced_tutorial/type.txt             +24  -24
doc/advanced/features.txt

@@ -5,6 +5,8 @@
 List of Env Features
 ====================

+See :api:`gof.env.Env`.
+
 WRITEME

 .. _nodefinder:
@@ -12,4 +14,6 @@ WRITEME
 NodeFinder
 ==========

+See :api:`gof.toolbox.NodeFinder`.
+
 WRITEME
doc/advanced_tutorial/cop.txt

@@ -35,7 +35,7 @@ There are less methods to define for an Op than for a Type:
    This must return C code that cleans up whatever c_code allocated and
    that we must free.

-   *Default* The default behavior is to do nothing.
+   *Default:* The default behavior is to do nothing.

 .. function:: c_compile_args()
               c_headers()
@@ -118,14 +118,14 @@ version that it produces in the code I gave above.
 .. code-block:: python

     from theano import gof

     class BinaryDoubleOp(gof.Op):

         def __init__(self, name, fn, ccode):
             self.name = name
             self.fn = fn
             self.ccode = ccode

         def make_node(self, x, y):
             if isinstance(x, (int, float)):
                 x = gof.Constant(double, x)
@@ -134,29 +134,29 @@ version that it produces in the code I gave above.
             if x.type != double or y.type != double:
                 raise TypeError('%s only works on doubles' % self.name)
             return gof.Apply(self, [x, y], [double()])

         def perform(self, node, (x, y), (z, )):
             z[0] = self.fn(x, y)

         def __str__(self):
             return self.name

         def c_code(self, node, name, (x, y), (z, ), sub):
             return self.ccode % locals()

     add = BinaryDoubleOp(name = 'add',
                          fn = lambda x, y: x + y,
                          ccode = "%(z)s = %(x)s + %(y)s;")

     sub = BinaryDoubleOp(name = 'sub',
                          fn = lambda x, y: x - y,
                          ccode = "%(z)s = %(x)s - %(y)s;")

     mul = BinaryDoubleOp(name = 'mul',
                          fn = lambda x, y: x * y,
                          ccode = "%(z)s = %(x)s * %(y)s;")

     div = BinaryDoubleOp(name = 'div',
                          fn = lambda x, y: x / y,
                          ccode = "%(z)s = %(x)s / %(y)s;")
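The storage-cell convention used by ``perform`` above can be tried without Theano at all. The class below is a stripped-down, hypothetical stand-in (plain Python, no ``gof.Op`` base, no C hooks) that only mimics how a result is written into a one-element output cell:

```python
# Hypothetical stand-in for the Op above: no Theano, no gof.Op base.
# It only demonstrates the perform/storage-cell convention.
class BinaryDoubleOp:
    def __init__(self, name, fn):
        self.name = name
        self.fn = fn

    def perform(self, inputs, output_storage):
        # inputs: concrete Python floats.
        # output_storage: one one-element list per output of the Op.
        x, y = inputs
        output_storage[0][0] = self.fn(x, y)

add = BinaryDoubleOp('add', lambda x, y: x + y)
storage = [[None]]            # a single storage cell for the single output
add.perform((3.0, 4.0), storage)
print(storage[0][0])          # -> 7.0
```

The real Op also receives the Apply ``node`` argument and defines the C-code methods; both are omitted here.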
doc/advanced_tutorial/ctype.txt
@@ -42,7 +42,7 @@ What needs to be defined
 In order to be C-compatible, a Type must define several additional
 methods, which all start with the ``c_`` prefix. The complete list can
-be found in the documentation for :ref:`type`. Here, we'll focus on
+be found in the documentation for :api:`gof.Type`. Here, we'll focus on
 the most important ones:
@@ -55,13 +55,13 @@ the most important ones:
 .. function:: c_init(name, sub)

    This must return C code which initializes the variables declared in
-   c_declare. Either this or c_extract will be called.
+   ``c_declare``. Either this or ``c_extract`` will be called.

 .. function:: c_extract(name, sub)

    This must return C code which takes a reference to a Python object
-   and initializes the variables declared in c_declare to match the
-   Python object's data. Either this or c_init will be called.
+   and initializes the variables declared in ``c_declare`` to match the
+   Python object's data. Either this or ``c_init`` will be called.

 .. function:: c_sync(name, sub)
@@ -101,7 +101,7 @@ out:
 .. warning::

    If some error condition occurs and you want to fail and/or raise an
    Exception, you must use the ``fail`` code contained in
-   ``sub['fail']`` (there is an example in the definition of c_extract
+   ``sub['fail']`` (there is an example in the definition of ``c_extract``
    below). You must *NOT* use the ``return`` statement anywhere, ever,
    nor ``break`` outside of your own loops or ``goto`` to strange
    places or anything like that. Failure to comply with this
@@ -109,8 +109,8 @@ out:
    leaks because Theano defines its own cleanup system and assumes
    that you are not meddling with it. Furthermore, advanced operations
    or types might do code transformations on your code such as
-   inserting it in a loop - in that case they can call your code
-   generating methods with custom failure code that takes into account
+   inserting it in a loop -- in that case they can call your
+   code-generating methods with custom failure code that takes into account
    what they are doing!
@@ -129,7 +129,7 @@ Defining the methods
 Very straightforward. All we need to do is write C code to declare a
 double. That double will be named whatever is passed to our function
-in the "name" argument. That will usually be some mangled name like
+in the ``name`` argument. That will usually be some mangled name like
 "V0", "V2" or "V92" depending on how many nodes there are in the
 computation graph and what rank the current node has. This function
 will be called for all Variables whose type is ``double``.
@@ -161,15 +161,15 @@ This function has to initialize the
 double we declared previously to a suitable value. This is useful if
 we want to avoid dealing with garbage values, especially if our data
 type is a pointer. This is not going to be called for all Variables with
-the ``double`` type. Indeed, if a Variable is an input which we pass
-from Python we will want to extract that input from a Python object,
-therefore it is the c_extract method that will be called instead of
-c_init. You can therefore not assume, when writing c_extract, that the
+the ``double`` type. Indeed, if a Variable is an input that we pass
+from Python, we will want to extract that input from a Python object,
+therefore it is the ``c_extract`` method that will be called instead of
+``c_init``. You can therefore not assume, when writing ``c_extract``, that the
 initialization has been done (in fact you can assume that it *hasn't*
 been done).

 ``c_init`` will typically be called on output Variables, but in general
-you should only assume that either c_init or c_extract has been
+you should only assume that either ``c_init`` or ``c_extract`` has been
 called, without knowing for sure which of the two.
@@ -205,7 +205,7 @@ then we insert code for failure. The code for failure is in
 ``sub["fail"]`` and it basically does a ``goto`` to cleanup code.

 If the check passes then we convert the Python float into a double
-using the PyFloat_AsDouble function (yet again provided by CPython's C
+using the ``PyFloat_AsDouble`` function (yet again provided by CPython's C
 API) and we put it in our double variable that we declared previously.
@@ -250,13 +250,14 @@ to a Python ``float``. This returns a new reference which we assign to
 will happily see a Python ``float`` come out of his computations.

 The rest of the code is not absolutely necessary and it is basically
-"good practice". PyFloat_FromDouble can return NULL on failure. NULL
-is a pretty bad reference to have and neither Python nor Theano like
-it. If this happens we change the NULL pointer (which will cause us
-problems) to a pointer to None (which is *not* a NULL pointer). Since
-None is an object like the others we need to increase its reference
-count before we can set a new pointer to it. This situation is
-unlikely to ever happen, but if it ever does, better safe than sorry.
+"good practice". ``PyFloat_FromDouble`` can return ``NULL`` on failure.
+``NULL`` is a pretty bad reference to have and neither Python nor Theano
+like it. If this happens, we change the ``NULL`` pointer (which will
+cause us problems) to a pointer to ``None`` (which is *not* a ``NULL``
+pointer). Since ``None`` is an object like the others, we need to
+increase its reference count before we can set a new pointer to it. This
+situation is unlikely to ever happen, but if it ever does, better safe
+than sorry.

 .. warning::

    I said this already but it really needs to be emphasized that if
@@ -277,28 +278,29 @@ unlikely to ever happen, but if it ever does, better safe than sorry.
 We actually have nothing to do here. We declared a double on the stack
 so the C language will reclaim it for us when its scope ends. We
-didn't malloc() anything so there's nothing to free(). Furthermore,
+didn't ``malloc()`` anything so there's nothing to ``free()``. Furthermore,
 the ``py_%(name)s`` pointer hasn't changed so we don't need to do
 anything with it. Therefore, we have nothing to cleanup. Sweet!

 There are however two important things to keep in mind:

 First, note that ``c_sync`` and ``c_cleanup`` might be called in
-sequence, so they need to play nice together. In particular, let's say
-that you allocate memory in ``c_init`` or ``c_extract`` for some
-reason. You might want to either embed what you allocated to some
-Python object in ``c_sync`` or to free it in ``c_cleanup``. If you do
-the former, you don't want to free the allocated storage so you should
-set the pointer to it to NULL to avoid that ``c_cleanup`` mistakenly
-frees it. Another option is to declare a variable in c_declare that
-you set to true in c_sync to notify c_cleanup that c_sync was called.
+sequence, so they need to play nice together. In particular, let's
+say that you allocate memory in ``c_init`` or ``c_extract`` for some
+reason. You might want to either embed what you allocated to some Python
+object in ``c_sync`` or to free it in ``c_cleanup``. If you do the
+former, you don't want to free the allocated storage so you should set
+the pointer to it to ``NULL`` to avoid that ``c_cleanup`` mistakenly
+frees it. Another option is to declare a variable in ``c_declare`` that
+you set to true in ``c_sync`` to notify ``c_cleanup`` that ``c_sync``
+was called.

-Second, whenever you use %(fail)s in c_extract or in the code of an
-:ref:`operation <op>` you can count on c_cleanup being called right
-after that. Therefore, it's important to make sure that c_cleanup
-doesn't depend on any code prior to a reference to
-%(fail)s. Furthermore, because of the way Theano blocks code together,
-only the variables declared in c_declare will be visible in c_cleanup!
+Second, whenever you use ``%(fail)s`` in ``c_extract`` or in the code of an
+:ref:`operation <op>`, you can count on ``c_cleanup`` being called right
+after that. Therefore, it's important to make sure that ``c_cleanup``
+doesn't depend on any code placed after a reference to
+``%(fail)s``. Furthermore, because of the way Theano blocks code together,
+only the variables declared in ``c_declare`` will be visible in ``c_cleanup``!
What the generated C will look like
@@ -307,7 +309,7 @@ What the generated C will look like
 ``c_init`` and ``c_extract`` will only be called if there is a Python
 object on which we want to apply computations using C
 code. Conversely, ``c_sync`` will only be called if we want to
-communicate the values we have computed to Python and ``c_cleanup``
+communicate the values we have computed to Python, and ``c_cleanup``
 will only be called when we don't need to process the data with C
 anymore. In other words, the use of these functions for a given Variable
 depends on the relationship between Python and C with respect to
@@ -378,13 +380,13 @@ like this:
         //c_cleanup for x
     }

-It's not pretty, but it gives you an idea of how things
-work (note that the variable names won't be x, y, z, etc. - they will
-get a unique mangled name). The ``fail`` code runs a goto to the
+It's not pretty, but it gives you an idea of how things work (note that
+the variable names won't be ``x``, ``y``, ``z``, etc. - they will
+get a unique mangled name). The ``fail`` code runs a ``goto`` to the
 appropriate label in order to run all cleanup that needs to be
-done. Note which variables get extracted (the three inputs x, y and
-z), which ones only get initialized (the temporary variable a and the
-output b) and which one is synced (the final output b).
+done. Note which variables get extracted (the three inputs ``x``, ``y``
+and ``z``), which ones only get initialized (the temporary variable ``a``
+and the output ``b``) and which one is synced (the final output ``b``).

 The C code above is a single C block for the whole graph. Depending on
 which :ref:`linker` is used to process the computation graph, it is
@@ -399,26 +401,31 @@ Final version
 .. code-block:: python

     from theano import gof

     class Double(gof.Type):

         def filter(self, x, strict=False):
             if strict and not isinstance(x, float):
                 raise TypeError('Expected a float!')
             return float(x)

         def values_eq_approx(self, x, y, tolerance=1e-4):
             return abs(x - y) / (x + y) < tolerance

         def __str__(self):
             return "double"

         def c_declare(self, name, sub):
             return """
             double %(name)s;
             """ % dict(name = name)

         def c_init(self, name, sub):
             return """
             %(name)s = 0.0;
             """ % dict(name = name)

         def c_extract(self, name, sub):
             return """
             if (!PyFloat_Check(py_%(name)s)) {
@@ -427,7 +434,7 @@ Final version
             }
             %(name)s = PyFloat_AsDouble(py_%(name)s);
             """ % dict(sub, name = name)

         def c_sync(self, name, sub):
             return """
             Py_XDECREF(py_%(name)s);
@@ -438,8 +445,8 @@ Final version
                 py_%(name)s = Py_None;
             }
             """ % dict(name = name)

         def c_cleanup(self, name, sub):
             return ""

     double = Double()
doc/advanced_tutorial/graphstructures.txt
@@ -151,7 +151,7 @@ Op instance is written so that:
 .. code-block:: python

-    e = scalar('x') + 1
+    e = dscalar('x') + 1

 builds the following graph:
doc/advanced_tutorial/inplace.txt
@@ -7,8 +7,8 @@ Views and inplace operations
 Theano allows the definition of Ops which return a :term:`view` on one
 of their inputs or operates :term:`inplace` on one or several
-inputs. This allows more efficient operations on numpy's ndarray data
-type than would be possible otherwise.
+inputs. This allows more efficient operations on numpy's ``ndarray``
+data type than would be possible otherwise.
 However, in order to work correctly, these Ops need to
 implement an additional interface.
@@ -29,7 +29,7 @@ Views
 A "view" on an object ``x`` is an object ``y`` which shares memory
 with ``x`` in some way. In other words, changing ``x`` might also
-change ``y`` and vice versa. For example, imagine a "vector" structure
+change ``y`` and vice versa. For example, imagine a ``vector`` structure
 which contains two fields: an integer length and a pointer to a memory
 buffer. Suppose we have:
@@ -51,7 +51,7 @@ range ``0xDEADBEFF - 0xDEADBFDF`` and z the range ``0xCAFEBABE -
 considered to be a view of ``x`` and vice versa.

 Suppose you had an Op which took ``x`` as input and returned
-``y``. You would need to tell Theano that y is a view of x. For this
+``y``. You would need to tell Theano that ``y`` is a view of ``x``. For this
 purpose, you would set the ``view_map`` field as follows:
@@ -103,7 +103,7 @@ operation on ``x``.
 .. code-block:: python

-    x, y = dscalars('xy')
+    x, y = dscalars('x', 'y')
     r1 = log(x)
     # r2 is x AFTER the add_inplace - x still represents the value before adding y
@@ -119,7 +119,7 @@ operation on ``x``.
    Needless to say, this goes for user-defined inplace operations as
    well: the modified input must figure in the list of outputs you
-   give to Apply in the definition of make_node.
+   give to ``Apply`` in the definition of ``make_node``.

    Also, for technical reasons but also because they are slightly
    confusing to use as evidenced by the previous code, Theano does not
@@ -132,7 +132,7 @@ operation on ``x``.
    introduces inconsistencies.

-Take the previous definitions of x, y and z and suppose an Op which
+Take the previous definitions of ``x``, ``y`` and ``z`` and suppose an Op which
 adds one to every byte of its input. If we give ``x`` as an input to
 that Op, it can either allocate a new buffer of the same size as ``x``
 (that could be ``z``) and set that new buffer's bytes to the variable of
@@ -141,7 +141,7 @@ it could add one to each byte *in* the buffer ``x``, therefore
 changing it. That would be an inplace Op.

 Theano needs to be notified of this fact. The syntax is similar to
-that of view_map:
+that of ``view_map``:

 .. code-block:: python
@@ -160,10 +160,10 @@ first input (position 0).
     myop.destroy_map = {1: [0]} # second output operates inplace on first input

     myop.destroy_map = {0: [0], # first output operates inplace on first input
                         1: [1]} # *AND* second output operates inplace on second input

     myop.destroy_map = {0: [0], # first output operates inplace on first input
                         1: [0]} # *AND* second output *ALSO* operates inplace on first input

     myop.destroy_map = {0: [0, 1]} # first output operates inplace on both the first and second input
                                    # unlike for views, the previous line is legal and supported
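As a plain-Python illustration of the convention above (the helper name is ours, not Theano's): both maps send an output index to a list of input indices, but a ``view_map`` entry may list exactly one input, while a ``destroy_map`` entry may list several.

```python
# Illustrative check of the view_map convention described above.
# check_view_map is an invented helper, not part of the Theano API.
def check_view_map(view_map):
    for out_idx, in_idxs in view_map.items():
        if len(in_idxs) != 1:
            raise ValueError('output %d: a view must alias exactly one input'
                             % out_idx)

check_view_map({0: [1]})           # fine: first output views second input
try:
    check_view_map({0: [0, 1]})    # illegal for views (fine for destroy_map)
except ValueError as e:
    print('rejected:', e)
```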
@@ -194,7 +194,7 @@ input(s)'s memory). From there, go to the previous section.
    the value of ``x`` it might invert the order and that will
    certainly lead to erroneous computations.

-You can often identify an incorrect view_map or destroy_map by using
-:ref:`DebugMode`. *Be sure to use DebugMode when developing a new Op that
-uses view_map and/or destroy_map.*
+You can often identify an incorrect ``view_map`` or ``destroy_map``
+by using :ref:`DebugMode`. *Be sure to use DebugMode when developing
+a new Op that uses ``view_map`` and/or ``destroy_map``.*
doc/advanced_tutorial/op.txt
@@ -12,16 +12,17 @@ computations. We'll start by defining multiplication.
 Op's contract
 =============

-An Op (:api:`gof.op.Op`) is any object which defines the following methods:
+An Op (:api:`gof.op.Op`) is any object which defines the
+following methods:

 .. function:: make_node(*inputs)

-   This method is responsible for creating output Variables of a
-   suitable Type to serve as the outputs of this Op's application.
-   This method should put these outputs into an Apply instance, and
-   return the Apply instance.
+   This method creates an Apply node representing the application of
+   the Op on the inputs provided. If the Op cannot be applied on
+   these inputs, it must raise an appropriate exception.
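A minimal sketch of that contract, using hypothetical stand-ins rather than the real ``gof`` classes: ``make_node`` validates the inputs, then wraps them in an Apply node with one fresh output slot.

```python
# Minimal stand-ins for Apply/make_node. Names are illustrative only;
# they are not the real gof.Apply or a real Theano Op.
class Apply:
    def __init__(self, op, inputs, outputs):
        self.op, self.inputs, self.outputs = op, inputs, outputs

class MulLike:
    def make_node(self, x, y):
        # Reject inputs the Op cannot be applied to, as the contract requires.
        if not all(isinstance(i, float) for i in (x, y)):
            raise TypeError('mul only works on doubles')
        return Apply(self, [x, y], [None])  # one fresh output slot

node = MulLike().make_node(2.0, 3.0)
print(len(node.outputs))   # -> 1
```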
@@ -30,13 +31,13 @@ An Op (:api:`gof.op.Op`) is any object which defines the following methods:
    ordered correctly: a subsequent ``self.make_node(*apply.inputs)``
    must produce something equivalent to the first ``apply``.

-``default_output``
+.. attribute:: default_output

-   *Default*: None
+   *Default:* None

    If this member variable is an integer, then the default
    implementation of ``__call__`` will return
-   `node.outputs[self.default_output]``, where `node` was returned
+   ``node.outputs[self.default_output]``, where ``node`` was returned
    by ``make_node``. Otherwise, the entire list of outputs will be
    returned.
@@ -45,7 +46,7 @@ An Op (:api:`gof.op.Op`) is any object which defines the following methods:
    Syntactic shortcut to make_node which returns the output
    Variables of the Op.

-   *Default*: this is done for you by Op.
+   *Default:* this is done for you by Op.

 .. function:: perform(node, inputs, output_storage)
@@ -64,26 +65,26 @@ An Op (:api:`gof.op.Op`) is any object which defines the following methods:
    - ``output_storage``: This is a list of storage cells.
      A storage cell is a one-element list. It is forbidden to change
-     the length of the list(s) contained in output_storage. There is
+     the length of the list(s) contained in ``output_storage``. There is
      one storage cell for each output of the Op.

      The data you put in ``output_storage`` must match the type of the
      symbolic output. This is a situation where the ``node`` argument
      can come in handy.

-     A function Mode may allow output_storage elements to persist
-     between evaluations, or it may reset output_storage cells to hold
-     a value of None. This feature can allow perform to reuse memory
-     between calls, for example.
+     A function Mode may allow ``output_storage`` elements to persist
+     between evaluations, or it may reset ``output_storage`` cells to
+     hold a value of ``None``. This feature can allow ``perform`` to
+     reuse memory between calls, for example.

    This method must be determined by the inputs. That is to say, if
    it is evaluated once on inputs A and returned B, then if ever
    inputs C, equal to A, are presented again, then outputs equal to
    B must be returned again.

    You must be careful about aliasing outputs to inputs, and making
-   modifications to any of the inputs. See `Views and inplace
-   operations <views_and_inplace>`_ before writing a ``perform``
+   modifications to any of the inputs. See :ref:`Views and inplace
+   operations <views_and_inplace>` before writing a ``perform``
    implementation that does either of these things.

 .. function:: __eq__(other)
@@ -95,20 +96,21 @@ An Op (:api:`gof.op.Op`) is any object which defines the following methods:
    (from perform) as this one, given identical inputs. This means it
    will produce the same output values, it will destroy the same
    inputs (same destroy_map), and will alias outputs to the same
-   inputs (same view_map).
+   inputs (same view_map). For more details, see
+   :ref:`views_and_inplace`.

 .. function:: __hash__()

    If two Op instances compare equal, then they **must** return the
    same hash value.

    Equally important, this hash value must not change during the
    lifetime of self. Op instances should be immutable in this
    sense.

 .. function:: __ne__(other)

-   Default: ``(not (self==other))``
+   *Default:* ``(not (self==other))``

 .. function:: grad(inputs, output_gradients)
@@ -116,30 +118,28 @@ An Op (:api:`gof.op.Op`) is any object which defines the following methods:
    If the Op you are defining is differentiable, you can define its
    gradient symbolically in this method.

    Both the ``inputs`` and ``output_gradients`` will be
    Variables. This method must return a list containing one Variable
-   (or None) for each input. Each returned Variable represents the
+   (or ``None``) for each input. Each returned Variable represents the
    gradient with respect to that input given the symbolic gradients
    with respect to each output.

    If the output is not differentiable with respect to any inputs,
-   then this method should be defined to return [None for i in
-   inputs].
+   then this method should be defined to return
+   ``[None for i in inputs]``.

-   If this method is not defined, then theano assumes it has been
+   If this method is not defined, then Theano assumes it has been
    forgotten. Symbolic differentiation will fail on a graph that
    includes this Op.

    For more information on the use of this method, see ``grad``.

 For each method, the *default* is what :api:`theano.gof.op.Op` defines
 for you. At a bare minimum, a new Op must define ``make_node`` and
 ``perform``, which have no defaults.

 For more details, including the interface for providing a C
-implementation of perform(), refer to the documentation for :ref:`op`.
+implementation of ``perform()``, refer to the documentation for :ref:`op`.
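The ``__eq__``/``__hash__`` part of the contract can be illustrated standalone; ``DoubleOpLike`` below is an invented name, not a Theano class. Two instances configured identically compare equal and hash equal, which is what lets graph machinery treat them as interchangeable:

```python
# Sketch of the __eq__/__hash__ contract for Ops: equal configuration
# implies equal comparison and equal, stable hash values.
class DoubleOpLike:
    def __init__(self, name):
        self.name = name

    def __eq__(self, other):
        return type(other) is type(self) and other.name == self.name

    def __ne__(self, other):
        return not (self == other)   # the documented default

    def __hash__(self):
        # Hashes the same terms involved in __eq__, and never changes
        # over the lifetime of self.
        return hash((type(self), self.name))

print(DoubleOpLike('add') == DoubleOpLike('add'))       # -> True
print(len({DoubleOpLike('add'), DoubleOpLike('add')}))  # -> 1
```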
Defining an Op: ``mul``
@@ -252,7 +252,7 @@ AttributeError: 'int' object has no attribute 'type'
 Automatic Constant Wrapping
 ---------------------------

-Well, ok. We'd like our Op to be a bit more flexible. This can be done
+Well, OK. We'd like our Op to be a bit more flexible. This can be done
 by modifying ``make_node`` to accept Python ``int`` or ``float`` as
 ``x`` and/or ``y``:
doc/advanced_tutorial/optimization.txt
@@ -18,7 +18,7 @@ Env is a wrapper around a whole computation graph, you can see its
 :ref:`documentation <env>` for more details) and navigates through it
 in a suitable way, replacing some Variables by others in the process. A
 local optimization, on the other hand, is defined as a function on a
-*single* :ref:`apply` node and must return either False (to mean that
+*single* :ref:`apply` node and must return either ``False`` (to mean that
 nothing is to be done) or a list of new Variables that we would like to
 replace the node's outputs with. A :ref:`navigator` is a special kind
 of global optimization which navigates the computation graph in some
@@ -49,7 +49,7 @@ methods:
    This method takes an Env object and adds :ref:`features
    <envfeature>` to it. These features are "plugins" that are needed
-   for the apply method to do its job properly.
+   for the ``apply`` method to do its job properly.

 .. function:: optimize(env)
@@ -69,7 +69,7 @@ A local optimization is an object which defines the following methods:
 .. function:: transform(node)

-   This method takes an :ref:`apply` node and returns either False to
+   This method takes an :ref:`apply` node and returns either ``False`` to
    signify that no changes are to be done or a list of Variables which
    matches the length of the node's ``outputs`` list. When the
    LocalOptimizer is applied by a Navigator, the outputs of the node
@@ -99,9 +99,9 @@ Here is the code for a global optimization implementing the
 simplification described above:

 .. code-block:: python

     from theano.gof import toolbox

     class Simplify(gof.Optimizer):

         def add_requirements(self, env):
             env.extend(toolbox.ReplaceValidate())
@@ -116,38 +116,39 @@ simplification described above:
             env.replace_validate(z, b)
         elif y == b:
             env.replace_validate(z, a)

 simplify = Simplify()

 Here's how it works: first, in ``add_requirements``, we add the
 ``ReplaceValidate`` :ref:`envfeature` located in
-``theano.gof.toolbox``. This feature adds the ``replace_validate``
-method to the env, which is an enhanced version of ``replace`` that
+:api:`theano.gof.toolbox`. This feature adds the ``replace_validate``
+method to ``env``, which is an enhanced version of ``replace`` that
 does additional checks to ensure that we are not messing up the
-computation graph (note: if ReplaceValidate was already added by
+computation graph (note: if ``ReplaceValidate`` was already added by
 another optimizer, ``extend`` will do nothing). In a nutshell,
-``toolbox.ReplaceValidate`` grants access to ``env.replace_validate``
+``toolbox.ReplaceValidate`` grants access to ``env.replace_validate``,
 and ``env.replace_validate`` allows us to replace a Variable with
 another while respecting certain validation constraints. You can
 browse the list of :ref:`features <envfeaturelist>` and see if some of
 them might be useful to write optimizations with. For example, as an
-exercise, try to rewrite Simplify using :ref:`nodefinder` (hint: you
-want to use the method it publishes in place of the call to toposort!)
+exercise, try to rewrite Simplify using :ref:`nodefinder`. (Hint: you
+want to use the method it publishes instead of the call to toposort!)

 Then, in ``apply`` we do the actual job of simplification. We start by
 iterating through the graph in topological order. For each node
 encountered, we check if it's a ``div`` node. If not, we have nothing
-to do here. If so, we put in x, y and z the numerator, denominator and
-quotient (output) of the division. The simplification only occurs when
-the numerator is a multiplication, so we check for that. If the
-numerator is a multiplication we put the two operands in a and b, so
+to do here. If so, we put in ``x``, ``y`` and ``z`` the numerator,
+denominator and quotient (output) of the division.
+
+The simplification only occurs when the numerator is a multiplication,
+so we check for that. If the numerator is a multiplication we put the
+two operands in ``a`` and ``b``, so
 we can now say that ``z == (a*b)/y``. If ``y==a`` then ``z==b`` and if
-``y==b`` then ``z==a``. When either case happens then we can replace z
-by either a or b using ``env.replace_validate`` - else we do
+``y==b`` then ``z==a``. When either case happens then we can replace
+``z`` by either ``a`` or ``b`` using ``env.replace_validate`` - else we do
 nothing. You might want to check the documentation about :ref:`variable`
 and :ref:`apply` to get a better understanding of the
 pointer-following game you need to get ahold of the nodes of interest
-for the simplification (x, y, z, a, b, etc.)
+for the simplification (``x``, ``y``, ``z``, ``a``, ``b``, etc.).
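The pointer-following logic of ``apply`` can be sketched on a toy expression representation, using tuples instead of real Apply nodes (this is not the Theano API, just the same idea):

```python
# Toy stand-in for the Simplify logic above: an expression is either a
# variable name (string) or an (op, inputs) tuple.
def simplify_div(node):
    """Return the replacement for z = (a*b)/y, or None if no change applies."""
    op, (num, den) = node
    if op != 'div':
        return None
    if isinstance(num, tuple) and num[0] == 'mul':
        a, b = num[1]
        if den == a:
            return b          # (a*b)/a -> b
        if den == b:
            return a          # (a*b)/b -> a
    return None

z = ('div', (('mul', ('x', 'y')), 'y'))   # represents (x*y)/y
print(simplify_div(z))                    # -> 'x'
```

The real optimizer does the same comparisons on Variable owners (``z.owner.op``, ``z.owner.inputs``) and performs the replacement through ``env.replace_validate``.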
Test time:
@@ -217,7 +218,7 @@ The local version of the above code would be the following:
 .. code-block:: python

     class LocalSimplify(gof.LocalOptimizer):

         def transform(self, node):
             if node.op == div:
@@ -234,7 +235,7 @@ The local version of the above code would be the following:
                     # but it isn't now
                     # TODO: do this and explain it
             return [] # that's not what you should do

     local_simplify = LocalSimplify()

 The definition of transform is the inner loop of the global optimizer,
doc/advanced_tutorial/type.txt
@@ -39,30 +39,30 @@ default values.
    ``filter(value, strict = True)`` does not raise an exception, the
    value is compatible with the Type.

-   *Default*: True iff ``filter(value, strict = True)`` does not raise
+   *Default:* True iff ``filter(value, strict = True)`` does not raise
    an exception.

 .. function:: values_eq(a, b)

    Returns True iff ``a`` and ``b`` are equal.

-   *Default*: ``a == b``
+   *Default:* ``a == b``

 .. function:: values_eq_approx(a, b)

    Returns True iff ``a`` and ``b`` are approximately equal, for a
    definition of "approximately" which varies from Type to Type.

-   *Default*: ``values_eq(a, b)``
+   *Default:* ``values_eq(a, b)``

 .. function:: make_variable(name=None)

    Makes a :term:`Variable` of this Type with the specified name, if
-   ``name is not None``. If ``name is ``None``, then the Variable does
+   ``name`` is not ``None``. If ``name`` is ``None``, then the Variable does
    not have a name. The Variable will have its ``type`` field set to
    the Type object.

-   *Default*: there is a generic definition of this in Type. The
+   *Default:* there is a generic definition of this in Type. The
    Variable's ``type`` will be the object that defines this method (in
    other words, ``self``).
@@ -70,21 +70,21 @@ default values.
    Syntactic shortcut to ``make_variable``.

-   *Default*: ``make_variable``
+   *Default:* ``make_variable``

 .. function:: __eq__(other)

    Used to compare Type instances themselves

-   *Default*: ``object.__eq__``
+   *Default:* ``object.__eq__``

 .. function:: __hash__()

-   Types should not be mutable, so it should be Ok to define a hash
+   Types should not be mutable, so it should be OK to define a hash
    function. Typically this function should hash all of the terms
    involved in ``__eq__``.

-   *Default*: ``id(self)``
+   *Default:* ``id(self)``

 For each method, the *default* is what ``Type`` defines
 for you. So, if you create an instance of ``Type`` or an
@@ -99,7 +99,7 @@ For more details you can go see the documentation for :ref:`type`.
 Defining double
 ===============

-We are going to base Type ``double`` on Python's ``float``. We are
+We are going to base Type ``double`` on Python's ``float``. We
 must define ``filter`` and shall override ``values_eq_approx``.
@@ -139,17 +139,17 @@ graph in such a way that it produces slightly different variables, for
 example because of numerical instability like rounding errors at the
 end of the mantissa. For instance, ``a + a + a + a + a + a`` might not
 actually produce the exact same output as ``6 * a`` (try with a=0.1),
-but with ``values_eq_approx`` we with don't necessarily mind.
+but with ``values_eq_approx`` we don't necessarily mind.

 We added an extra ``tolerance`` argument here. Since this argument is
-not part of the API, it must have a default value which we
+not part of the API, it must have a default value, which we
 chose to be 1e-4.

 .. note::

    ``values_eq`` is never actually used by Theano, but it might be used
-   internally in the future. Equality testing in DebugMode is done
-   using ``values_eq_approx``.
+   internally in the future. Equality testing in
+   :ref:`DebugMode <debugmode>` is done using ``values_eq_approx``.
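The relative-difference test described above is plain Python and can be tried standalone, including the ``a + a + a + a + a + a`` versus ``6 * a`` example:

```python
# Standalone version of the values_eq_approx check described above.
def values_eq_approx(x, y, tolerance=1e-4):
    # Relative difference: rounding noise passes, real differences fail.
    return abs(x - y) / (abs(x) + abs(y)) < tolerance

a = 0.1
s = a + a + a + a + a + a    # may differ from 6 * a in the last bits
print(values_eq_approx(s, 6 * a))      # -> True
print(values_eq_approx(1.0, 2.0))      # -> False
```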
**Putting them together**
@@ -160,7 +160,7 @@ the Type is to instantiate a plain Type and set the needed fields:
 .. code-block:: python

-    from Theano import gof
+    from theano import gof

     double = gof.Type()
     double.filter = filter
@@ -175,19 +175,19 @@ and define ``filter`` and ``values_eq_approx`` in the subclass:
     from theano import gof

     class Double(gof.Type):

         def filter(self, x, strict=False):
             if strict and not isinstance(x, float):
                 raise TypeError('Expected a float!')
             return float(x)

         def values_eq_approx(self, x, y, tolerance=1e-4):
             return abs(x - y) / (abs(x) + abs(y)) < tolerance

     double = Double()

-``double`` is then an instance of Type ``Double``, which in turn is a
-sublcass of ``Type``.
+``double`` is then an instance of Type ``Double``, which in turn is a
+subclass of ``Type``.
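Dropping the ``gof.Type`` base class, the ``filter`` contract shown above can be exercised directly (an illustrative sketch only, not the full Type interface):

```python
# Standalone Double with only the filter method, to show the
# strict/non-strict contract: non-strict casts, strict rejects.
class Double:
    def filter(self, x, strict=False):
        if strict and not isinstance(x, float):
            raise TypeError('Expected a float!')
        return float(x)

d = Double()
print(d.filter(3))             # -> 3.0 (non-strict mode casts the int)
try:
    d.filter(3, strict=True)   # strict mode rejects anything but a float
except TypeError as e:
    print(e)                   # -> Expected a float!
```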
There is a small issue with defining ``double`` this way. All
instances of ``Double`` are technically the same Type. However, different
@@ -199,7 +199,7 @@ instances of ``Double`` are technically the same Type. However, different
     False

 Theano compares Types using ``==`` to see if they are the same.
-This happens in DebugMode. Also, ops can (and should) ensure that their inputs
+This happens in DebugMode. Also, Ops can (and should) ensure that their inputs
 have the expected Type by checking something like ``if x.type == lvector``.

 There are several ways to make sure that equality testing works properly:
@@ -243,7 +243,7 @@ attempt to clear up the confusion:
   that Type instance. If you were to parse the C expression ``c = a +
   b;``, ``a``, ``b`` and ``c`` would all be Variable instances.

 * A **subclass of Type** is a way of implementing
   a set of Type instances that share
   structural similarities. In the ``double`` example that we are doing,
   there is actually only one Type in that set, therefore the subclass
@@ -265,18 +265,18 @@ Final version
     from theano import gof

     class Double(gof.Type):

         def filter(self, x, strict=False):
             if strict and not isinstance(x, float):
                 raise TypeError('Expected a float!')
             return float(x)

         def values_eq_approx(self, x, y, tolerance=1e-4):
             return abs(x - y) / (abs(x) + abs(y)) < tolerance

         def __str__(self):
             return "double"

     double = Double()