Скачать книгу

z slash partial-differential upper Y right-parenthesis upper F Superscript upper T Baseline right-bracket Subscript left-parenthesis p comma q right-parenthesis Baseline period"/>

      In the right half of Figure 3.25, the 6 × 4 matrix is ∂z/∂Y)FT. In order to compute the partial derivative of z with respect to one element in the input X, we need to find which elements in ∂z/∂Y)FT are involved and add them. In the left half of Figure 3.25, we see that the input element 5 (shown in larger font) is involved in four convolution operations, shown by the gray, light gray, dotted gray and black boxes, respectively. These four convolution operations correspond to p = 1, 2, 3, 4. For example, when p = 2 (the light gray box), 5 is the third element in the convolution, and hence q = 3 when p = 2, and we put a light gray circle in the (2, 3)‐th element of the (∂z/∂Y)FTmatrix. After all four circles are put in the matrix (∂z/∂Y)FT,the partial derivative is the sum of ellements in these four locations of (∂z/∂Y)FT. The set m−1(il, jl, dl) contains at most HWDl elements. Hence, Eq. (3.102) requires at most HWDl summations to compute one element of ∂z/∂X.

Schematic illustration of computing ∂z/∂X. Schematic illustration of pooling layer operation.

      Formally this can be represented as

      (3.103)StartLayout 1st Row max colon y Subscript i Sub Superscript l plus 1 Subscript comma j Sub Superscript l plus 1 Subscript comma d Baseline equals MAX left-parenthesis x Subscript i Sub Superscript l plus 1 Subscript times upper H plus i comma j Sub Superscript l plus 1 Subscript times upper W plus j comma d Superscript l Baseline right-parenthesis 2nd Row average colon y Subscript i Sub Superscript l plus 1 Subscript comma j Sub Superscript l plus 1 Subscript comma d Baseline equals StartFraction 1 Over italic upper H upper W EndFraction sigma-summation Underscript 0 less-than-or-equal-to i less-than upper H comma 0 less-than-or-equal-to j less-than upper W Endscripts left-parenthesis x Subscript i Sub Superscript l plus 1 Subscript times upper H plus i comma j Sub Superscript l plus 1 Subscript times upper W plus j comma d Superscript l Baseline right-parenthesis comma EndLayout

      where 0 ≤ il + 1 < Hl + 1, 0 ≤ jl + 1 < Wl + 1, and 0 ≤ d < Dl + 1 = Dl.

      Pooling is a local operator, and its forward computation is straightforward. When focusing on the backpropagation, only max pooling will be discussed and we can resort to the indicator matrix again. All we need to encode in this indicator matrix is: for every element in y, where does it come from in xl?

      We need a triplet (il, jl, dl) to locate one element in the input xl, and another triplet (il + 1, jl + 1, dl + 1) to locate one element in y. The pooling output y Subscript i Sub Superscript l plus 1 Subscript comma j Sub Superscript l plus 1 Subscript comma d Sub Superscript l plus 1 comes from x Subscript i Sub Superscript l Subscript comma j Sub Superscript l Subscript comma d Sub Superscript l Subscript Superscript l, if and only if the following conditions are met: (i) they are in the same channel; (ii) the (il, jl)‐th spatial entry belongs to the (il + 1, jl + 1 )‐th subregion; and (iii) the (il, jl)‐th spatial entry is the largest one in that subregion. This can be represented as

d Superscript l plus 1 Baseline equals d Superscript l Baseline comma left floor i Superscript l Baseline slash upper H right floor equals i Superscript l plus 1 Baseline comma left floor j Superscript l Baseline slash upper W right floor equals j Superscript i plus 1 Baseline comma x Subscript i Sub Superscript l Subscript comma j Sub Superscript l Subscript comma d Sub Superscript l Subscript Superscript l Baseline greater-than-or-equal-to y Subscript i plus i Sub Superscript l plus 1 Subscript times upper H comma j plus j Sub Superscript l plus 1 Subscript times upper W comma d Sub Superscript l Subscript Baseline comma for-all 0 less-than-or-equal-to i less-than upper H comma 0 less-than-or-equal-to j less-than upper W comma

      where ⌊·⌋ is the floor function. If the stride is not H(W) in the vertical (horizontal) direction, the equation must be changed accordingly. Given a (il + 1, jl + 1, dl + 1) triplet, there is only one (il, jl, dl) triplet that satisfies all these conditions. So, we define an indicator matrix left-parenthesis x Superscript l Baseline right-parenthesis element-of 
				<p style= Скачать книгу