Stepping Stones and Pathways:Improving Retrieval by Chains of Relationships between Documents

dc.contributor.authorDas Neves, Fernando Adrianen
dc.contributor.committeechairFox, Edward A.en
dc.contributor.committeememberRamakrishnan, Narenen
dc.contributor.committeememberKafura, Dennis G.en
dc.contributor.committeememberKriz, Ronald D.en
dc.contributor.committeememberNorth, Christopher L.en
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2014-03-14T20:17:50Zen
dc.date.adate2004-12-08en
dc.date.available2014-03-14T20:17:50Zen
dc.date.issued2004-09-16en
dc.date.rdate2004-12-08en
dc.date.sdate2004-11-01en
dc.description.abstractThe information retrieval (IR) field has been successful in developing techniques to address many types of information needs. However, there are cases in which traditional approaches to IR are not able to produce adequate results. Examples include: when a small set of (2-3) documents is needed as an answer rather than a single document, or when "query splitting" is required to satisfactorily explore the document space. We explore an alternative model of building and presenting retrieval results for such cases. In particular, we research effective methods for handling information needs that may: 1. Include multiple topics: A typical query is interpreted by current IR systems as a request to retrieve documents that each discusses all topics included in that query. We propose an alternative interpretation based on query splitting. It allows queries to be interpreted as requests to retrieve sets of documents rather than individual documents, with meaningful relationships among the members of each such set. 2. Be interpreted as parts in a chain of relationships: Suppose a query concerns topics t1 and tm. Is there a relation between topics t1 and tm that involves t2 and possibly other topics as in {t1, t2, â ¦ tm}? Thus, we propose an alternative interpretation of user queries and presentation of the results. Our interpretation has the potential to improve retrieval results whenever there is a mismatch between the user's understanding of the collection and the actual collection content. We define and refine a retrieval scheme that enhances retrieval through a framework that combines multiple sources of evidence. Query results in our interpretation are networks of document groups representing topics, each group relating to and connecting to other groups in the network that partially answer the user's information need. We devise new and more effective representations and techniques to visualize results, and incorporate the user as part of the retrieval process. We also evaluate the improvement of the query results based on multiple measures. In particular, we verify the validity of our approach through a study involving a collection of Operating Systems research papers that was specially built for this dissertation.en
dc.description.degreePh. D.en
dc.identifier.otheretd-11012004-003013en
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-11012004-003013/en
dc.identifier.urihttp://hdl.handle.net/10919/29419en
dc.publisherVirginia Techen
dc.relation.haspartdissertation.PDFen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectInformation retrievalen
dc.subjectLiterature-based discoveryen
dc.subjectCombination of sources of evidenceen
dc.subjectIndexing of scientific literatureen
dc.titleStepping Stones and Pathways:Improving Retrieval by Chains of Relationships between Documentsen
dc.typeDissertationen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.leveldoctoralen
thesis.degree.namePh. D.en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
dissertation.PDF
Size:
1.38 MB
Format:
Adobe Portable Document Format