Show simple item record

dc.contributor.authorAbdulla, Ghaleb M.S.en_US
dc.date.accessioned2014-03-14T20:21:49Z
dc.date.available2014-03-14T20:21:49Z
dc.date.issued1998-04-27en_US
dc.identifier.otheretd-33098-142912en_US
dc.identifier.urihttp://hdl.handle.net/10919/30470
dc.description.abstractThis dissertation deals with monitoring, collecting, analyzing, and modeling of World Wide Web (WWW) traffic and client interactions. The rapid growth of WWW usage has not been accompanied by an overall understanding of models of information resources and their deployment strategies. Consequently, the current Web architecture often faces performance and reliability problems. Scalability, latency, bandwidth, and disconnected operations are some of the important issues that should be considered when attempting to adjust for the growth in Web usage. The WWW Consortium launched an effort to design a new protocol that will be able to support future demands. Before doing that, however, we need to characterize current users' interactions with the WWW and understand how it is being used. We focus on proxies since they provide a good medium for caching, filtering information, payment methods, and copyright management. We collected proxy data from our environment over a period of more than two years. We also collected data from other sources such as schools, information service providers, and commercial sites. Sampling times range from days to years. We analyzed the collected data looking for important characteristics that can help in designing a better HTTP protocol. We developed a modeling approach that considers Web traffic characteristics such as self-similarity and long-range dependency. We developed an algorithm to characterize users' sessions. Finally we developed a high-level Web traffic model suitable for sensitivity analysis. As a result of this work we develop statistical models of parameters such as arrival times, file sizes, file types, and locality of reference. We describe an approach to model long-range and dependent Web traffic and we characterize activities of users accessing a digital library courseware server or Web search tools. Temporal and spatial locality of reference within examined user communities is high, so caching can be an effective tool to help reduce network traffic and to help solve the scalability problem. We recommend utilizing our findings to promote a smart distribution or push model to cache documents when there is likelihood of repeat accesses.en_US
dc.publisherVirginia Techen_US
dc.relation.haspartthesis.pdfen_US
dc.rightsI hereby grant to Virginia Tech or its agents the right to archive and to make available my thesis or dissertation in whole or in part in the University Libraries in all forms of media, now or hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation.en_US
dc.subjectTime Seriesen_US
dc.subjectModelingen_US
dc.subjectScalabilityen_US
dc.subjectWorld Wide Weben_US
dc.subjectLog analysisen_US
dc.subjectCachingen_US
dc.subjectProxyen_US
dc.titleAnalysis and Modeling of World Wide Web Trafficen_US
dc.typeDissertationen_US
dc.contributor.departmentComputer Scienceen_US
dc.description.degreePh. D.en_US
thesis.degree.namePh. D.en_US
thesis.degree.leveldoctoralen_US
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen_US
thesis.degree.disciplineComputer Scienceen_US
dc.contributor.committeechairFox, Edward Alanen_US
dc.contributor.committeememberKafura, Dennis G.en_US
dc.contributor.committeememberBalci, Osmanen_US
dc.contributor.committeememberAbrams, Marcen_US
dc.contributor.committeememberNayfeh, Ali H.en_US
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-33098-142912/en_US
dc.date.sdate1998-04-27en_US
dc.date.rdate1999-04-30
dc.date.adate1998-04-30en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record