1장-신경망 복습

1.2 신경망 추론

Untitled

Fully Connected Layer : 입력층 * 가중치 내적 ⇒ 편향 더하기 ⇒ 활성화 함수 적용 ⇒ 다음 뉴런의 입력층

$$ h_1 = x_1w_{11}+x_2w_{21}+b_1 $$

**** 요약 ****

역전파는 Loss function(L)을 ouptut(y)로 미분한 값에서부터 각 구간의 미분 값을 곱해줍니다. 이때 output과 label의 오차를 앞 계층에 전해줍니다. 미분(기울기)를 이용해서 신경망의 매개변수를 갱신합니다.

[계층으로 클래스화 및 순전파 구현]

구현 규칙1 : 모든 계층은 forward()와 backward()메서드를 가집니다.
구현 규칙2 : 모든 계층은 인스턴스 변수인 params와 grads를 가집니다.
- 모든 계층에는 학습해야 하는 매개변수가 반드시 인스턴스 변수인 params에 존재합니다.

<aside> 💡 TwoLayerNet 처리: x → Affine → Sigmoid → Affine → S

</aside>

Sigmoid 계층, 완전연결 Affine 계층 구현

$$ y = \frac{1}{1 + exp(-x)} $$

class Sigmoid:
    def __init__(self):
        self.params = [] # 매개변수 따로 X
    
    def forward(self,x):
        return 1 / (1 + np.exp(-x))

class Affine:
    def __init__(self, W, b): 
        # 초기화될 때 가중치, 편향 받는다
        self.params = [W,b]

    def forward(self,x):
        # 순전파 처리 구현
        W,b = self.params
        out = np.matmul(x,W) + b
        return out

TwoLayerNet 클래스로 신경망 추상화

class TwoLayerNet:
    def __init__(self, input_size, hidden_size, output_size):
        I, H, O = input_size, hidden_size, output_size

        # 가중치와 편향 초기화
        W1 = np.random.randn(I,H)
        b1 = np.random.random(H)
        W2 = np.random.randn(H,O)
        b2 = np.random.random(O)

        # 계층 생성
        self.layers = [
            Affine(W1, b1),
            Sigmoid(),
            Affine(W2, b2)
        ]

        # 모든 가중치를 리스트에 모은다.
        self.params = []
        for layer in self.layers:
            self.params += layer.params

    # 주 추론 처리
    def predict(self,x):
        for layer in self.layers:
            x = layer.forward(x) # 각 레이어에 대해서 순전파 적용
        return x

신경망 추론 수행

x = np.random.randn(10,2)
model = TwoLayerNet(2,4,3)
s = model.predict(x)