A Deterministic Approach to Partitioning Neural Network Training Data for the Classification Problem

dc.contributor.authorSmith, Gregory Edwarden
dc.contributor.committeechairRagsdale, Cliff T.en
dc.contributor.committeememberZobel, Christopher W.en
dc.contributor.committeememberRees, Loren P.en
dc.contributor.committeememberCook, Deborah F.en
dc.contributor.committeememberBrown, Evelyn C.en
dc.contributor.departmentBusiness (Management Science)en
dc.date.accessioned2014-03-14T20:15:23Zen
dc.date.adate2006-09-28en
dc.date.available2014-03-14T20:15:23Zen
dc.date.issued2006-08-07en
dc.date.rdate2006-09-28en
dc.date.sdate2006-08-18en
dc.description.abstractThe classification problem in discriminant analysis involves identifying a function that accurately classifies observations as originating from one of two or more mutually exclusive groups. Because no single classification technique works best for all problems, many different techniques have been developed. For business applications, neural networks have become the most commonly used classification technique and though they often outperform traditional statistical classification methods, their performance may be hindered because of failings in the use of training data. This problem can be exacerbated because of small data set size. In this dissertation, we identify and discuss a number of potential problems with typical random partitioning of neural network training data for the classification problem and introduce deterministic methods to partitioning that overcome these obstacles and improve classification accuracy on new validation data. A traditional statistical distance measure enables this deterministic partitioning. Heuristics for both the two-group classification problem and k-group classification problem are presented. We show that these heuristics result in generalizable neural network models that produce more accurate classification results, on average, than several commonly used classification techniques. In addition, we compare several two-group simulated and real-world data sets with respect to the interior and boundary positions of observations within their groups' convex polyhedrons. We show by example that projecting the interior points of simulated data to the boundary of their group polyhedrons generates convex shapes similar to real-world data group convex polyhedrons. Our two-group deterministic partitioning heuristic is then applied to the repositioned simulated data, producing results superior to several commonly used classification techniques.en
dc.description.degreePh. D.en
dc.identifier.otheretd-08182006-164833en
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-08182006-164833/en
dc.identifier.urihttp://hdl.handle.net/10919/28710en
dc.publisherVirginia Techen
dc.relation.haspartGregorySmithDissertation.pdfen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectconvex setsen
dc.subjectdiscriminant analysisen
dc.subjectNeural networksen
dc.subjectdata partitioningen
dc.titleA Deterministic Approach to Partitioning Neural Network Training Data for the Classification Problemen
dc.typeDissertationen
thesis.degree.disciplineBusiness (Management Science)en
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GregorySmithDissertation.pdf
Size:
533.5 KB
Format:
Adobe Portable Document Format