Understanding Representations and Reducing their Redundancy in Deep Networks

Cogswell, Michael Andrew

Understanding Representations and Reducing their Redundancy in Deep Networks

Files

etd-03032016-170427_Cogswell_M_T_2016.pdf (2.79 MB)

Downloads: 538

Date

2016-02-18

Authors

Cogswell, Michael Andrew

Publisher

Virginia Tech

Abstract

Neural networks in their modern deep learning incarnation have achieved state of the art performance on a wide variety of tasks and domains. A core intuition behind these methods is that they learn layers of features which interpolate between two domains in a series of related parts.

The first part of this thesis introduces the building blocks of neural networks for computer vision. It starts with linear models then proceeds to deep multilayer perceptrons and convolutional neural networks, presenting the core details of each. However, the introduction also focuses on intuition by visualizing concrete examples of the parts of a modern network.

The second part of this thesis investigates regularization of neural networks. Methods like dropout and others have been proposed to favor certain (empirically better) solutions over others. However, big deep neural networks still overfit very easily. This section proposes a new regularizer called DeCov, which leads to significantly reduced overfitting (difference between train and val performance) and greater generalization, sometimes better than dropout and other times not. The regularizer is based on the cross-covariance of hidden representations and takes advantage of the intuition that different features should try to represent different things, an intuition others have explored with similar losses. Experiments across a range of datasets and network architectures demonstrate reduced overfitting due to DeCov while almost always maintaining or increasing generalization performance and often improving performance over dropout.

Keywords

Object Recognition, Overfitting, Computer Vision, Machine learning

Persistent link

http://hdl.handle.net/10919/78167

Collections

Masters Theses

Full item page

Understanding Representations and Reducing their Redundancy in Deep Networks

Files

TR Number

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

Persistent link

Collections