Indexing Large Collections of Small Text Records for Ranked Retrieval

dc.contributorVirginia Tech. Department of Computer Science. Digital Library Research Laboratoryen
dc.contributor.authorFrance, Robert K.en
dc.contributor.authorFox, Edward A.en
dc.contributor.departmentDigital Library Research Laboratoryen
dc.contributor.departmentComputer Scienceen
dc.date.accessed2014-09-04en
dc.date.accessioned2015-05-29T20:36:38Zen
dc.date.available2015-05-29T20:36:38Zen
dc.date.issued1993en
dc.description.abstractThe MARIAN online public access catalog system at Virginia Tech has been developed to apply advanced information retrieval methods and object-oriented technology to the needs of library patrons. We give a description of our data model, design, processing, data representations, and retrieval operation. By identifying objects of interest during the indexing process, storing them according to our "information graph" model, and applying weighting schemes that seem appropriate for this large collection of small text records, we hope to better serve user needs. Since every text word is important in this domain, we employ opportunistic matching algorithms and a mix of data structures to support searching, that will give good performance for a large campus community, even though MARIAN runs on a distributed collection of small workstations. An initial small experiment indicates that our new ad hoc weighting scheme is more effective than a more standard approach.en
dc.format.extent37 pagesen
dc.format.mimetypeapplication/pdfen
dc.identifier.citationFrance, Robert K. and Edward A. Fox. "Indexing Large Collections of Small Text Records for Ranked Retrieval." Internal Report, Virginia Tech, 1993.en
dc.identifier.urihttp://hdl.handle.net/10919/52849en
dc.identifier.urlhttp://www.dlib.vt.edu/reports/LargeCollsSmTexts.pdfen
dc.language.isoen_USen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectIndexingen
dc.subjectCollection managementen
dc.subjectRanked retrievalen
dc.subjectSmall text recordsen
dc.titleIndexing Large Collections of Small Text Records for Ranked Retrievalen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1993_Indexing_large_collections.pdf
Size:
569.37 KB
Format:
Adobe Portable Document Format