I use the op "assign" to change the value of variables instead of "=", but I found the gradient I got is quite different. Could anyone tell me the difference and why? thanks! Like change w = w1 to op1 = tf.assign(w, w1) sess.run(op1)
(Tensorflow)Does the op assign change the gradient computation?
1.5k views Asked by Andy At
1
=andtf.assignare different operations.=is a python operation, in which you assign a python value to a python variabletf.assignis a Tensorflow operation that assigns the value to the variable ref and returns the assign operation.=is executed in python and doesn't affect the computation graph.tf.assignis a node in the computational graph.To understand, let's run this simple script
print(x.name, y.name)outputsVariable_1:0 Variable_1:0because
=is executed in python and you've overwritten the variablex.print(a.name, b.name)outputsAssign:0 Variable_3:0because you defined an assign op in the computational graph, nowais an assign op.When you run the defined graph, you get:
But these values are computed differently: one is a computation in the graph, the others no.
If you forgot to assign
ato the assign op created withtf.assign(thus you change the linea = a.assign(b)toa.assign(b)), then when you evaluate the graph, you'll get: