Visualizing Categorical Time Series Data with Applications to Computer and Communications Network Traces

dc.contributor.authorRibler, Randy L.en
dc.contributor.committeechairAbrams, Marcen
dc.contributor.committeememberKriz, Ronald D.en
dc.contributor.committeememberEhrich, Roger W.en
dc.contributor.committeememberFoutz, Roberten
dc.contributor.committeememberRibbens, Calvin J.en
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2014-03-14T20:21:22Zen
dc.date.adate1997-04-04en
dc.date.available2014-03-14T20:21:22Zen
dc.date.issued1997-04-04en
dc.date.rdate1997-04-04en
dc.date.sdate1998-07-21en
dc.description.abstractVisualization tools allow scientists to comprehend very large data sets and to discover relationships which are otherwise difficult to detect. Unfortunately, not all types of data can be visualized easily using existing tools. In particular, long sequences of nonnumeric data cannot be visualized adequately. Examples of this type of data include trace files of computer performance information, the nucleotides in a genetic sequence, a record of stocks traded over a period of years, and the sequence of words in this document. The term categorical time series is defined and used to describe this family of data. When visualizations designed for numerical time series are applied to categorical time series, the distortions which result from the arbitrary conversion of unordered categorical values to totally ordered numerical values can be profound. Examples of this phenomenon are presented and explained. Several new, general purpose techniques for visualizing categorical time series data have been developed as part of this work and have been incorporated into the Chitra perfor- mance analysis and visualization system. All of these new visualizations can be produced in O(n) time. The new visualizations for categorical time series provide general purpose techniques for visualizing aspects of categorical data which are commonly of interest. These include periodicity, stationarity, cross-correlation, autocorrelation, and the detection of recurring patterns. The effective use of these visualizations is demonstrated in a number of application domains, including performance analysis, World Wide Web traffic analysis, network routing simulations, document comparison, pattern detection, and the analysis of the performance of genetic algorithms.en
dc.description.degreePh. D.en
dc.identifier.otheretd-1711111139751001en
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-1711111139751001/en
dc.identifier.urihttp://hdl.handle.net/10919/30314en
dc.publisherVirginia Techen
dc.relation.haspartetd.pdfen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectvisualizationen
dc.subjectcategorical dataen
dc.subjecttime seriesen
dc.subjectdata miningen
dc.subjectperformance analysisen
dc.subjectinformation visualizationen
dc.titleVisualizing Categorical Time Series Data with Applications to Computer and Communications Network Tracesen
dc.typeDissertationen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
etd.pdf
Size:
19.95 MB
Format:
Adobe Portable Document Format