Stochastic Gradient Descent, Backpropagation




CS256

Chris Pollett

Oct 20, 2021

Outline

div class="slide">

Introduction

Stochastic Gradient Descent

Training Terminology

Visiting All the Data with SGD

In-Class Exercise

Defining our Symbols

Using the Chain Rule to Compute an Update

Backpropagation

Computing Partial Derivatives

TensorFlow and Keras

What TensorFlow Lets Us Do