forward: a single tick; advances the hidden state and adds functions to G
Graph: does backprop; sets the .dw fields of matrices
Solver: updates models (step cache doesn't matter)

input -> forward -> calc loss, set deriv in last layer -> backward -> solver
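The flow in these notes (forward tick, loss, backward, solver) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the original code: the class and function names (`Mat`, `Graph`, `Solver`, `forward_tick`, `train_step`), the elementwise tanh cell, the squared-error loss, and the RMSProp-style use of the step cache are all hypothetical choices made for the example. Only the overall shape comes from the notes: the forward pass registers functions on the graph, the graph fills the `.dw` fields during backprop, and the solver updates the model.

```python
import math

class Mat:
    """A flat vector of weights (w) with matching gradients (dw)."""
    def __init__(self, values):
        self.w = list(values)
        self.dw = [0.0] * len(self.w)

class Graph:
    """Records one backward closure per op applied during the forward pass."""
    def __init__(self):
        self.backprop = []

    def mul(self, a, b):  # elementwise product
        out = Mat(x * y for x, y in zip(a.w, b.w))
        def backward():
            for i in range(len(out.w)):
                a.dw[i] += b.w[i] * out.dw[i]
                b.dw[i] += a.w[i] * out.dw[i]
        self.backprop.append(backward)
        return out

    def add(self, a, b):
        out = Mat(x + y for x, y in zip(a.w, b.w))
        def backward():
            for i in range(len(out.w)):
                a.dw[i] += out.dw[i]
                b.dw[i] += out.dw[i]
        self.backprop.append(backward)
        return out

    def tanh(self, a):
        out = Mat(math.tanh(x) for x in a.w)
        def backward():
            for i in range(len(out.w)):
                a.dw[i] += (1.0 - out.w[i] ** 2) * out.dw[i]
        self.backprop.append(backward)
        return out

    def backward(self):
        # Replay the recorded ops in reverse; this is what sets the .dw fields.
        for fn in reversed(self.backprop):
            fn()

class Solver:
    """Assumed RMSProp-style update; step_cache holds running squared gradients."""
    def __init__(self, decay=0.9, eps=1e-8):
        self.decay, self.eps = decay, eps
        self.step_cache = {}

    def step(self, model, lr):
        for name, m in model.items():
            cache = self.step_cache.setdefault(name, [0.0] * len(m.w))
            for i, g in enumerate(m.dw):
                cache[i] = self.decay * cache[i] + (1.0 - self.decay) * g * g
                m.w[i] -= lr * g / math.sqrt(cache[i] + self.eps)
                m.dw[i] = 0.0  # reset the gradient for the next iteration

def forward_tick(G, model, x, h_prev):
    """Single tick: advance the hidden state, adding functions to G as we go."""
    return G.tanh(G.add(G.mul(model["Wx"], x), G.mul(model["Wh"], h_prev)))

def train_step(model, solver, x, h_prev, target, lr=0.01):
    G = Graph()
    h = forward_tick(G, model, x, h_prev)                 # input -> forward
    loss = sum((o - t) ** 2 for o, t in zip(h.w, target)) # calc loss
    h.dw = [2.0 * (o - t) for o, t in zip(h.w, target)]   # set deriv in last layer
    G.backward()                                          # backward
    solver.step(model, lr)                                # solver
    return loss, h
```

A call such as `train_step(model, Solver(), Mat([1.0, 2.0]), Mat([0.0, 0.0]), [0.5, -0.5])`, with `model = {"Wx": Mat([0.5, -0.3]), "Wh": Mat([0.1, 0.2])}`, runs one full iteration. A fresh `Graph` is built per step because it only holds the closures recorded during that forward pass.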