Evalutating Biological Data Using Rank Correlation Methods

dc.contributor.authorSlotta, Douglas J.en
dc.contributor.committeechairHeath, Lenwood S.en
dc.contributor.committeememberRamakrishnan, Narenen
dc.contributor.committeememberMurali, T. M.en
dc.contributor.committeememberVergara, John Paul C.en
dc.contributor.committeememberPotts, Malcolmen
dc.contributor.committeememberHelm, Richard F.en
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2014-03-14T20:11:46Zen
dc.date.adate2005-05-24en
dc.date.available2014-03-14T20:11:46Zen
dc.date.issued2005-05-05en
dc.date.rdate2005-05-24en
dc.date.sdate2005-05-09en
dc.description.abstractAnalyses based upon rank correlation methods, such as Spearman's Rho and Kendall's Tau, can provide quick insights into large biological data sets. Comparing expression levels between different technologies and models is problematic due to the different units of measure. Here again, rank correlation provides an effective means of comparison between the two techniques. Massively Parallel Signature Sequencing (MPSS) transcript abundance levels to microarray signal intensities for Arabidopsis thaliana are compared. Rank correlations can be applied to subsets as well as the entire set. Results of subset comparisons can be used to improve the capabilities of predictive models, such as Predicted Highly Expressed (PHX). This is done for Escherichia coli. Methods are given to combine predictive models based upon feedback from experimental data. The problem of feature selection in supervised learning situations is also considered, where all features are drawn from a common domain and are best interpreted via ordinal comparisons with other features, rather than as numerical values. This is done for synthetic data as well as for microarray experiments examining the life cycle of Drosophila melanogaster and human leukemia cells. Two novel methods are presented based upon Rho and Tau, and their efficacy is tested with synthetic and real world data. The method based upon Spearman's Rho is shown to be more effective.en
dc.description.degreePh. D.en
dc.identifier.otheretd-05092005-105158en
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-05092005-105158/en
dc.identifier.urihttp://hdl.handle.net/10919/27613en
dc.publisherVirginia Techen
dc.relation.haspartdis.pdfen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectMPSSen
dc.subjectbioinformaticsen
dc.subjectmicroarraysen
dc.subjectspoiler counten
dc.subjectrank-orderen
dc.subjectfeature selectionen
dc.titleEvalutating Biological Data Using Rank Correlation Methodsen
dc.typeDissertationen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
dis.pdf
Size:
1.47 MB
Format:
Adobe Portable Document Format