Lecture 7 moves from fully-connected to convolutional networks by introducing new computational primitives that respect the spatial structure of 2D image data. We discuss convolution layers, which slide a learnable filter over the input data. We discuss pooling layers, which spatially downsample their input data. We then look at normalization layers including batch, layer, and instance normalization, which normalize their input data along different axes and improve training speed.
Slides: http://myumi.ch/K43Zy
_________________________________________________________________________________________________
Computer Vision has become ubiquitous in our society, with applications in search, image understanding, apps, mapping, medicine, drones, and self-driving cars. Core to many of these applications are visual recognition tasks such as image classification and object detection. Recent developments in neural network approaches have greatly advanced the performance of these state-of-the-art visual recognition systems. This course is a deep dive into details of neural-network based deep learning methods for computer vision. During this course, students will learn to implement, train and debug their own neural networks and gain a detailed understanding of cutting-edge research in computer vision. We will cover learning algorithms, neural network architectures, and practical engineering tricks for training and fine-tuning networks for visual recognition tasks.
Course Website: http://myumi.ch/Bo9Ng
Instructor: Justin Johnson http://myumi.ch/QA8Pg