Backpropagation, Tensorflow




CS256

Chris Pollett

Oct 23, 2017

Outline

Introduction

Defining our Symbols

Using the Chain Rule to Compute an Update

Backpropagation

Quiz

Which of the following is true?

  1. Mean squared error is an example of a cost function that might be used to train a neural net.
  2. We called a function `f:RR^m->RR` convex if its Hessian was negative definite.
  3. All gradient descent algorithms make use of mini-batches.

Computing Partial Derivatives

TensorFlow

What TensorFlow Lets Us Do

Computation Graph Nodes

Example Building and Running Computation Graph

Using Reduce Operations

TensorFlow Optimizers

Example Training a Computation Graph TensorFlow