I use the op "assign" to change the value of variables instead of "=", but I found the gradient I got is quite different. Could anyone tell me the difference and why? thanks! Like change w = w1 to op1 = tf.assign(w, w1) sess.run(op1)
(Tensorflow)Does the op assign change the gradient computation?
1.4k views Asked by Andy At
1
=
andtf.assign
are different operations.=
is a python operation, in which you assign a python value to a python variabletf.assign
is a Tensorflow operation that assigns the value to the variable ref and returns the assign operation.=
is executed in python and doesn't affect the computation graph.tf.assign
is a node in the computational graph.To understand, let's run this simple script
print(x.name, y.name)
outputsVariable_1:0 Variable_1:0
because
=
is executed in python and you've overwritten the variablex
.print(a.name, b.name)
outputsAssign:0 Variable_3:0
because you defined an assign op in the computational graph, nowa
is an assign op.When you run the defined graph, you get:
But these values are computed differently: one is a computation in the graph, the others no.
If you forgot to assign
a
to the assign op created withtf.assign
(thus you change the linea = a.assign(b)
toa.assign(b)
), then when you evaluate the graph, you'll get: