Commit 77236f47 authored by Frédéric Bastien

Merge pull request #4419 from slefrancois/function_copy

Function copy
@@ -16,7 +16,7 @@ The Theano release plan is detailed below. Comments and/or suggestions are
welcome on the mailing list.
1) We aim to update Theano several times a year. These releases will be made
   as new features are implemented.
2) Urgent releases will only be made when a bug generating incorrect
   output is discovered and fixed.
...
@@ -185,7 +185,7 @@ internal state, and returns the old state value.
This code introduces a few new concepts. The ``shared`` function constructs
so-called :ref:`shared variables<libdoc_compile_shared>`.
These are hybrid symbolic and non-symbolic variables whose value may be shared
between multiple functions. Shared variables can be used in symbolic expressions just like
the objects returned by ``dmatrices(...)`` but they also have an internal
value that defines the value taken by this symbolic variable in *all* the
@@ -274,9 +274,6 @@ In practice, a good way of thinking about the ``givens`` is as a mechanism
that allows you to replace any part of your formula with a different
expression that evaluates to a tensor of same shape and dtype.
.. note::
    The broadcast pattern of a Theano shared variable defaults to False for each
@@ -285,6 +282,57 @@ expression that evaluates to a tensor of same shape and dtype.
    different pattern, just pass it as a parameter
    ``theano.shared(..., broadcastable=(True, False))``
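As a point of comparison (an aside, not part of the Theano docs), NumPy decides broadcastability from the actual shape at call time. A shared variable's size can change between calls, which is why Theano cannot infer the pattern the same way and defaults to non-broadcastable:

```python
import numpy as np

# NumPy infers broadcastability from the runtime shape: a (1, 3) row
# broadcasts against a (4, 3) matrix because its first dimension is 1.
row = np.zeros((1, 3))
mat = np.ones((4, 3))
print((row + mat).shape)  # (4, 3)

# A Theano shared variable's shape may change between calls, so each
# dimension is assumed non-broadcastable unless declared otherwise,
# e.g. broadcastable=(True, False).
```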
Copying functions
=================
Theano functions can be copied, which can be useful for creating similar
functions but with different shared variables or updates. This is done using
the :func:`copy()<theano.compile.function_module.Function.copy>` method of ``function`` objects. The optimized graph of the original function is copied,
so compilation only needs to be performed once.
Let's start from the accumulator defined above:
>>> import theano
>>> import theano.tensor as T
>>> state = theano.shared(0)
>>> inc = T.iscalar('inc')
>>> accumulator = theano.function([inc], state, updates=[(state, state+inc)])
We can use it to increment the state as usual:
>>> accumulator(10)
array(0)
>>> print(state.get_value())
10
We can use ``copy()`` to create a similar accumulator but with its own internal state
using the ``swap`` parameter, which is a dictionary of shared variables to exchange:
>>> new_state = theano.shared(0)
>>> new_accumulator = accumulator.copy(swap={state:new_state})
>>> new_accumulator(100)
[array(0)]
>>> print(new_state.get_value())
100
The state of the first function is left untouched:
>>> print(state.get_value())
10
We now create a copy with updates removed using the ``delete_updates``
parameter, which is set to ``False`` by default:
>>> null_accumulator = accumulator.copy(delete_updates=True)
As expected, the shared state is no longer updated:
>>> null_accumulator(9000)
[array(10)]
>>> print(state.get_value())
10
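To make the two parameters concrete, here is a hypothetical plain-Python analogue (an illustration, not Theano's implementation): a compiled function modelled as a callable holding a state container, where ``copy()`` reuses the callable but can rebind the state through a ``swap`` dictionary or drop the update rule via ``delete_updates``. The class and attribute names below are invented for the sketch.

```python
import copy as copy_module

class Shared:
    """Hypothetical stand-in for a Theano shared variable."""
    def __init__(self, value):
        self.value = value

class SimpleAccumulator:
    """Hypothetical stand-in for the compiled accumulator above:
    returns the old state and applies state <- state + inc."""
    def __init__(self, state):
        self.state = state
        self.apply_updates = True

    def __call__(self, inc):
        old = self.state.value
        if self.apply_updates:
            self.state.value = old + inc
        return old

    def copy(self, swap=None, delete_updates=False):
        cpy = copy_module.copy(self)       # shallow copy: the "graph" is reused
        if swap is not None and self.state in swap:
            cpy.state = swap[self.state]   # rebind the shared variable
        if delete_updates:
            cpy.apply_updates = False      # drop the update rule
        return cpy

state = Shared(0)
acc = SimpleAccumulator(state)
acc(10)
print(state.value)                         # 10

new_state = Shared(0)
new_acc = acc.copy(swap={state: new_state})
new_acc(100)
print(new_state.value, state.value)        # 100 10

null_acc = acc.copy(delete_updates=True)
null_acc(9000)
print(state.value)                         # 10
```

The sketch mirrors the doctest session above: the swapped copy mutates only its own state, and the ``delete_updates`` copy still returns the current state without advancing it.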
.. _using_random_numbers:
Using Random Numbers
====================
@@ -299,8 +347,8 @@ RandomStream object (a random number generator) for each such
variable, and draw from it as necessary. We will call this sort of
sequence of random numbers a *random stream*. *Random streams* are at
their core shared variables, so the observations on shared variables
hold here as well. Theano's random objects are defined and implemented in
:ref:`RandomStreams<libdoc_tensor_shared_randomstreams>` and, at a lower level,
in :ref:`RandomStreamsBase<libdoc_tensor_raw_random>`.
Brief Example
@@ -453,7 +501,7 @@ array([ 0.59044123])
Other Random Distributions
--------------------------
There are :ref:`other distributions implemented <libdoc_tensor_raw_random>`.
.. _example_other_random:
@@ -488,14 +536,14 @@ It will be used repeatedly.
import numpy
import theano
import theano.tensor as T
rng = numpy.random
N = 400                                   # training sample size
feats = 784                               # number of input variables
# generate a dataset: D = (input_values, target_class)
D = (rng.randn(N, feats), rng.randint(size=N, low=0, high=2))
training_steps = 10000
# Declare Theano symbolic variables
x = T.matrix("x")
y = T.vector("y")
...
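The classifier this example goes on to build is based on the logistic function. As a quick aside (not part of the diff), its behaviour can be sketched in plain NumPy; the symbolic Theano version expresses the same formula with ``T.exp``:

```python
import numpy as np

def sigmoid(x):
    # The logistic function: maps any real input into (0, 1),
    # monotonically increasing, with sigmoid(0) == 0.5.
    return 1.0 / (1.0 + np.exp(-x))

print(sigmoid(0.0))                          # 0.5
print(sigmoid(np.array([-4.0, 0.0, 4.0])))   # increasing values in (0, 1)
```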
@@ -714,7 +714,13 @@ class Function(object):
        f_cpy = maker.__class__(inputs=ins, outputs=outs, fgraph=fg_cpy,
                                mode=maker.mode, profile=profile,
                                # When removing updates containing variables
                                # not used in the output function, copy
                                # generates an unused implicit input.
                                # We ignore the resulting errors,
                                # but could change it to 'warn' if this might
                                # cause problems.
                                on_unused_input='ignore',
                                function_builder=maker.function_builder,
                                # As this is an optimized graph, it
                                # can contain inplace. DebugMode check
...
@@ -366,6 +366,7 @@ class T_function(unittest.TestCase):
        assert in1.value is in2.value

    def test_copy_delete_updates(self):
        w = T.iscalar('w')
        x = T.fscalar('x')
        # SharedVariable for tests, one of them has update
        y = theano.shared(value=1, name='y')
@@ -383,6 +384,15 @@ class T_function(unittest.TestCase):
        assert cpy(1)[0] == 4
        assert cpy(1)[0] == 4

        # Test if unused implicit and explicit inputs from delete_updates
        # are ignored as intended.
        for mode in ["FAST_RUN", "FAST_COMPILE"]:
            ori = theano.function([x], x, mode=mode, updates={z: z * 2})
            cpy = ori.copy(delete_updates=True)
            ori = theano.function([x, w], x, mode=mode, updates={z: z + w})
            cpy = ori.copy(delete_updates=True)

    def test_shared_state0(self):
        a = T.scalar()  # the a is for 'anonymous' (un-named).
        x, s = T.scalars('xs')
...