A Deterministic Approach to Partitioning Neural Network Training Data for the Classification Problem

Smith, Gregory Edward

A Deterministic Approach to Partitioning Neural Network Training Data for the Classification Problem

dc.contributor.author	Smith, Gregory Edward	en
dc.contributor.committeechair	Ragsdale, Cliff T.	en
dc.contributor.committeemember	Zobel, Christopher W.	en
dc.contributor.committeemember	Rees, Loren P.	en
dc.contributor.committeemember	Cook, Deborah F.	en
dc.contributor.committeemember	Brown, Evelyn C.	en
dc.contributor.department	Business (Management Science)	en
dc.date.accessioned	2014-03-14T20:15:23Z	en
dc.date.adate	2006-09-28	en
dc.date.available	2014-03-14T20:15:23Z	en
dc.date.issued	2006-08-07	en
dc.date.rdate	2006-09-28	en
dc.date.sdate	2006-08-18	en
dc.description.abstract	The classification problem in discriminant analysis involves identifying a function that accurately classifies observations as originating from one of two or more mutually exclusive groups. Because no single classification technique works best for all problems, many different techniques have been developed. For business applications, neural networks have become the most commonly used classification technique and though they often outperform traditional statistical classification methods, their performance may be hindered because of failings in the use of training data. This problem can be exacerbated because of small data set size. In this dissertation, we identify and discuss a number of potential problems with typical random partitioning of neural network training data for the classification problem and introduce deterministic methods to partitioning that overcome these obstacles and improve classification accuracy on new validation data. A traditional statistical distance measure enables this deterministic partitioning. Heuristics for both the two-group classification problem and k-group classification problem are presented. We show that these heuristics result in generalizable neural network models that produce more accurate classification results, on average, than several commonly used classification techniques. In addition, we compare several two-group simulated and real-world data sets with respect to the interior and boundary positions of observations within their groups' convex polyhedrons. We show by example that projecting the interior points of simulated data to the boundary of their group polyhedrons generates convex shapes similar to real-world data group convex polyhedrons. Our two-group deterministic partitioning heuristic is then applied to the repositioned simulated data, producing results superior to several commonly used classification techniques.	en
dc.description.degree	Ph. D.	en
dc.identifier.other	etd-08182006-164833	en
dc.identifier.sourceurl	http://scholar.lib.vt.edu/theses/available/etd-08182006-164833/	en
dc.identifier.uri	http://hdl.handle.net/10919/28710	en
dc.publisher	Virginia Tech	en
dc.relation.haspart	GregorySmithDissertation.pdf	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	convex sets	en
dc.subject	discriminant analysis	en
dc.subject	Neural networks	en
dc.subject	data partitioning	en
dc.title	A Deterministic Approach to Partitioning Neural Network Training Data for the Classification Problem	en
dc.type	Dissertation	en
thesis.degree.discipline	Business (Management Science)	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	doctoral	en
thesis.degree.name	Ph. D.	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: GregorySmithDissertation.pdf
Size:: 533.5 KB
Format:: Adobe Portable Document Format

Download

Collections

Doctoral Dissertations