A convenient method for accumulating gradients which increase the score of "good" things and decrease the score of "bad" things.
A convenient method for accumulating gradients which increase the score of "good" things and decrease the score of "bad" things.
The accumulator on which to put the gradient
The model
The variable whose score we want to increase
The variable whose score we want to decrease