Ensemble Learning Techniques for Structured and Unstructured Data

dc.contributor.author: King, Michael Allen [en]
dc.contributor.committeechair: Abrahams, Alan Samuel [en]
dc.contributor.committeechair: Ragsdale, Cliff T. [en]
dc.contributor.committeemember: Wang, Gang Alan [en]
dc.contributor.committeemember: Zobel, Christopher W. [en]
dc.contributor.committeemember: Matheson, Lance A. [en]
dc.contributor.department: Business Information Technology [en]
dc.date.accessioned: 2015-04-02T08:00:07Z [en]
dc.date.available: 2015-04-02T08:00:07Z [en]
dc.date.issued: 2015-04-01 [en]
dc.description.abstract: This research provides an integrated approach to applying innovative ensemble learning techniques that has the potential to increase the overall accuracy of classification models. Actual structured and unstructured data sets from industry are used throughout the research process, analysis, and subsequent model evaluations. The first research section addresses the consumer demand forecasting and daily capacity management requirements of a nationally recognized alpine ski resort in the state of Utah, in the United States of America. A basic econometric model is developed, and its effectiveness is evaluated using three classic predictive models. These predictive models are subsequently used as input for four ensemble modeling techniques. Ensemble learning techniques are shown to be effective. The second research section discusses the opportunities and challenges faced by a leading firm providing sponsored search marketing services. The goal of sponsored search marketing is to create advertising campaigns that better attract and motivate a target market to purchase. This research develops a method for classifying profitable campaigns and maximizing overall campaign portfolio profits. Four traditional classifiers are used, along with four ensemble learning techniques, to build classifier models that identify profitable pay-per-click campaigns. A MetaCost ensemble configuration, which can integrate unequal classification costs, produced the highest campaign portfolio profit. The third research section addresses the management challenges that online consumer reviews pose for service industries and examines how these textual reviews can be used for service improvements. A service improvement framework is introduced that integrates traditional text mining techniques and second-order feature derivation with ensemble learning techniques. The concept of GLOW and SMOKE words is introduced and is shown to be an objective text-analytic source of service defects or service accolades. [en]
dc.description.degree: Ph. D. [en]
dc.format.medium: ETD [en]
dc.identifier.other: vt_gsexam:4594 [en]
dc.identifier.uri: http://hdl.handle.net/10919/51667 [en]
dc.publisher: Virginia Tech [en]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.subject: ensemble methods [en]
dc.subject: data mining [en]
dc.subject: Machine learning [en]
dc.subject: classification [en]
dc.subject: structured data [en]
dc.subject: unstructured data [en]
dc.title: Ensemble Learning Techniques for Structured and Unstructured Data [en]
dc.type: Dissertation [en]
thesis.degree.discipline: Business, Business Information Technology [en]
thesis.degree.grantor: Virginia Polytechnic Institute and State University [en]
thesis.degree.level: doctoral [en]
thesis.degree.name: Ph. D. [en]

Files

Original bundle
Name: King_MA_D_2015.pdf
Size: 2.34 MB
Format: Adobe Portable Document Format