merge

5d37b0f4 · James Bergstra · ce67f1fe · 9715a651 · 5d37b0f4 · 5d37b0f4
--- a/doc/library/tensor/basic.txt
+++ b/doc/library/tensor/basic.txt
@@ -290,6 +290,23 @@ Comparisons
    Theano has no boolean dtype.  Instead, all boolean tensors are represented
    in ``'int8'``.

+.. note::
+
+    The six usual equality and inequality operators share the same interface.
+      :Parameter:  *a* - symbolic Tensor (or compatible)
+      :Parameter:  *b* - symbolic Tensor (or compatible)
+      :Return type: symbolic Tensor
+      :Returns: a symbolic tensor representing the application of the logical 
+      elementwise operator.
+
+    Here is an example with the less-than operator.
+
+    .. code-block:: python 
+
+        import theano.tensor as T
+        x,y = T.dmatrices('x','y')
+        z = T.le(x,y)
+
 .. function:: lt(a, b)

    Returns a symbolic ``'int8'`` tensor representing the result of logical less-than (a<b).
@@ -304,69 +321,86 @@ Comparisons

 .. function:: le(a, b)

-    Returns a variable representing the result of logical less than or
-    equal (a<=b).
-      :Parameter:  *a* - symbolic Tensor (or compatible)
-      :Parameter:  *b* - symbolic Tensor (or compatible)
-      :Return type: symbolic Tensor
-      :Returns: a symbolic tensor representing the application of logical 
-      elementwise less than or equal.
-
-    .. code-block:: python 
+    Returns a variable representing the result of logical less than or equal (a<=b).

-        import theano.tensor as T
-        x,y = T.dmatrices('x','y')
-        z = T.le(x,y)
+    Also available using syntax ``a <= b``

 .. function:: ge(a, b)

    Returns a variable representing the result of logical greater or equal than (a>=b).
-      :Parameter:  *a* - symbolic Tensor (or compatible)
-      :Parameter:  *b* - symbolic Tensor (or compatible)
-      :Return type: symbolic Tensor
-      :Returns: a symbolic tensor representing the application of logical 
-      elementwise greater than or equal.

-    .. code-block:: python 
-
-        import theano.tensor as T
-        x,y = T.dmatrices('x','y')
-        z = T.ge(x,y)
+    Also available using syntax ``a >= b``

 .. function:: eq(a, b)

    Returns a variable representing the result of logical equality (a==b).
-      :Parameter:  *a* - symbolic Tensor (or compatible)
-      :Parameter:  *b* - symbolic Tensor (or compatible)
-      :Return type: symbolic Tensor
-      :Returns: a symbolic tensor representing the application of logical 
-      elementwise equality.

-    .. code-block:: python 
+.. function:: neq(a, b)

-        import theano.tensor as T
-        x,y = T.dmatrices('x','y')
-        z = T.eq(x,y)
+    Returns a variable representing the result of logical inequality (a!=b).

-.. function:: neq(a, b)

-    Returns a variable representing the result of logical inequality
-    (a!=b).
-      :Parameter:  *a* - symbolic Tensor (or compatible)
-      :Parameter:  *b* - symbolic Tensor (or compatible)
+Condition
+---------
+
+.. function:: switch(cond, ift, iff)
+    Returns a variable representing a switch between ift (iftrue) and iff (iffalse)
+     based on the condition cond.
+      :Parameter:  *cond* - symbolic Tensor (or compatible)
+      :Parameter:  *ift* - symbolic Tensor (or compatible)
+      :Parameter:  *iff* - symbolic Tensor (or compatible)
      :Return type: symbolic Tensor
-      :Returns: a symbolic tensor representing the application of logical 
-      elementwise inequality.

    .. code-block:: python 
+      import theano.tensor as T
+      a,b = T.dmatrices('a','b')
+      x,y = T.dmatrices('x','y')
+      z = T.switch(T.lt(a,b), x, y)
+
+Bit-wise
+--------
+
+.. note:: The bitwise operators must have an integer type as input.
+
+The bitwise operators possess this interface: 
+    :Parameter:  *a* - symbolic Tensor of integer type.
+    :Parameter:  *b* - symbolic Tensor of integer type.
+    .. note:: The bit-wise not (invert) does not have this second parameter.
+
+    :Return type: symbolic Tensor
+
+.. function:: and_(a, b)
+
+    Returns a variable representing the result of the bitwise and.
+
+.. function:: or_(a, b)
+
+    Returns a variable representing the result of the bitwise or.
+
+.. function:: xor(a, b)
+
+    Returns a variable representing the result of the bitwise xor.
+
+.. function:: invert(a)
+
+    Returns a variable representing the result of the bitwise not.
+
+Here is an example using the bit-wise and_:
+
+.. code-block:: python 
+
+    import theano.tensor as T
+    x,y = T.imatrices('x','y')
+    z = T.and_(x,y)

-        import theano.tensor as T
-        x,y = T.dmatrices('x','y')
-        z = T.neq(x,y)

 Mathematical
 ------------

+
+
+
+
 .. _libdoc_tensor_broadcastable:

 Broadcasting in Theano vs. Numpy

--- a/doc/tutorial/apply.png
+++ b/doc/tutorial/apply.png
--- a/doc/tutorial/apply.svg
+++ b/doc/tutorial/apply.svg
--- a/doc/tutorial/symbolic_graphs.txt
+++ b/doc/tutorial/symbolic_graphs.txt
+
+.. _symbolic_graph:
+
+================
+Graph Structures
+================
+
+In order to be able to take advantage of Theano, you need to understand
+how Theano works. Theano represents mathematical computations as graphs
+( for a detailed rendering see :ref:`graphstructures` - parts of this
+are directly taken from there). Graphs are composed of itnerconnected 
+:ref:`apply` and :ref:`variable` nodes. They are associated to *function
+application* and *data*, respectively. An operation is represented by
+an :ref:`op` and data types are represented by :ref:`type` instances. 
+Here is a piece of code and a diagram showing the structure built by
+that piece of code. This should help you understand how these pieces fit
+together:
+
+-----------------------
+
+**Code**
+
+.. code-block:: python
+
+    x = dmatrix('x')
+    y = dmatrix('y')
+    z = x + y
+
+**Diagram**
+
+.. image:: apply.png 
+
+-----------------------
+
+Arrows represent references to the Python objects pointed at. The blue
+box is an :ref:`apply` node. Red boxes are :ref:`variable` nodes. Green
+circles are :ref:`Ops <op>`. Purple boxes are :ref:`Types <type>`.
+
+When we create :ref:`Variables <variable>` and then :ref:`apply`
+:ref:`Ops <op>` to them to make more Variables, we build a
+bi-partite, directed, acyclic graph. Variables point to the Apply nodes
+representing the function application producing them via their
+``owner`` field. These Apply nodes point in turn to their input and
+output Variables via their ``inputs`` and ``outputs`` fields.
+(Apply instances also contain a list of references to their ``outputs``, but
+those pointers don't count in this graph.)
+
+The ``owner`` field of both ``x`` and ``y`` point to ``None`` because
+they are not the result of another computation. If one of them was the
+result of another computation, it's ``owner`` field would point to another
+blue box like ``z`` does, and so on.
+
+Note that the ``Apply`` instance's outputs points to
+``z``, and ``z.owner`` points back to the ``Apply`` instance.
+
+
+The graph structure is needed for *Optimizations* and *Automatic
+Differentiation*.
+
+Automatic Differentiation
+=========================
+
+Having the graph structure, computing automatic differentiation is
+simple. The only thing :func:`tensor.grad` has to do is to traverse the
+graph from the outputs back towards the inputs through all :ref:`apply`
+nodes ( :ref:`apply` nodes are those who define what computations the
+graph does). For each such :ref:`apply` node, its  :ref:`op` defines 
+how to compute the gradient of the node's outputs with respect to its
+inputs. Note that if an :ref:`op` does not define how to compute the
+gradient, then any expression containing this :ref:`op` is not
+differentiable. Using the `chain rule <http://en.wikipedia.org/wiki/Chain_rile>`_ 
+these gradients can be composed in order to obtain the expression of the 
+gradient of the graph's output with respect to the graph's inputs .
+
+
+Optimizations
+=============
+
+When compiling a Theano function, what you give to the
+:ref:`theano.function <libdoc_compile_function>` is actually a graph
+(starting from the outputs variables you can traverse the graph up to
+the input variables). While this graph structure shows how to compute
+the output from the input, it also offers the posibility to improve the  
+the way this computation is carried out. The way optimizations work in 
+Theano is by indentifying and replacing certain patterns in the graph 
+with other specialized patterns that produce the same results but are either 
+faster or more stable. Optimizations can also detect 
+identical subgraphs and ensure that the same values are not computed
+twice or reformulate parts of the graph to a GPU specific version.
+
+For example, one (simple) optimization that Theano uses is to replace 
+the pattern :math:`\frac{xy}{y}` by :math:`x`.