Show simple item record

dc.contributor.authorAmento, Brianen_US
dc.date.accessioned2014-03-14T20:19:37Z
dc.date.available2014-03-14T20:19:37Z
dc.date.issued2001-09-26en_US
dc.identifier.otheretd-12022003-105949en_US
dc.identifier.urihttp://hdl.handle.net/10919/29871
dc.description.abstractTopic management is the task of gathering, evaluating, organizing, and sharing a set of web sites for a specific topic. Current web tools do not provide adequate support for this task. We created and continue to develop the TopicShop system to address this need. TopicShop includes (1) a web crawler/analyzer that discovers relevant web sites and builds site profiles, and (2) user interfaces for information workspaces. We conducted an empirical pilot study comparing user performance with TopicShop vs. Yahooï . Results from this study were used to improve the design of TopicShop. A number of key design changes were incorporated into a second version of TopicShop based on results and user comments of the pilot study including (1) the tasks of evaluation and organization are treated as integral instead of separable, (2) spatial organization is important to users and must be well supported in the interface, and (3) distinct user and global datasets help users deal with the large quantity of information available on the web. A full empirical study using the second iteration of TopicShop covered more areas of the World Wide Web and validated results from the pilot study. Across the two studies, TopicShop subjects found over 80% more high-quality sites (where quality was determined by independent expert judgements) while browsing only 81% as many sites and completing their task in 89% of the time. The site profile data that TopicShop provide -- in particular, the number of pages on a site and the number of other sites that link to it -- were the key to these results, as users exploited them to identify the most promising sites quickly and easily. We also evaluated a number of link- and content-based algorithms using a dataset of web documents rated for quality by human topic experts. Link-based metrics did a good job of picking out high-quality items. Precision at 5 (the common information retrieval metric indicating the percentage of high quality items selected that are actually high quality) is about 0.75, and precision at 10 is about 0.55; this is in a dataset where 32% of all documents were of high quality. Surprisingly, a simple content-based metric, which ranked documents by the total number of pages on their containing site, performed nearly as well. These studies give insight into users' needs for the task of topic management, and provide empirical evidence of the effectiveness of task-specific interfaces (such as TopicShop) for managing topical collections.en_US
dc.publisherVirginia Techen_US
dc.relation.haspartamento-dissertation-final.pdfen_US
dc.rightsI hereby certify that, if appropriate, I have obtained and attached hereto a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to Virginia Tech or its agents the non-exclusive license to archive and make accessible, under the conditions specified below, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.en_US
dc.subjectsocial filteringen_US
dc.subjectCocitation analysisen_US
dc.subjectinformation visualizationen_US
dc.subjectcollaborative filteringen_US
dc.subjectsocial network analysisen_US
dc.subjectcomputer-supported cooperative worken_US
dc.titleUser Interfaces for Topic Management of Web Sitesen_US
dc.typeDissertationen_US
dc.contributor.departmentComputer Scienceen_US
dc.description.degreePh. D.en_US
thesis.degree.namePh. D.en_US
thesis.degree.leveldoctoralen_US
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen_US
thesis.degree.disciplineComputer Scienceen_US
dc.contributor.committeechairHix, Deborah S.en_US
dc.contributor.committeememberSchulman, Robert S.en_US
dc.contributor.committeememberHartson, H. Rexen_US
dc.contributor.committeememberTerveen, Lorenen_US
dc.contributor.committeememberEhrich, Roger W.en_US
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-12022003-105949/en_US
dc.date.sdate2003-12-02en_US
dc.date.rdate2003-12-15
dc.date.adate2003-12-15en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record