Using Concept Maps as a Tool for Cross-Language Relevance Determination

dc.contributor.author: Richardson, W. Ryan [en]
dc.contributor.committeechair: Fox, Edward A. [en]
dc.contributor.committeemember: Ramakrishnan, Naren [en]
dc.contributor.committeemember: Tegarden, David P. [en]
dc.contributor.committeemember: Cline, Ben E. [en]
dc.contributor.committeemember: Pérez-Quiñones, Manuel A. [en]
dc.contributor.department: Computer Science [en]
dc.date.accessioned: 2014-03-14T20:13:41Z [en]
dc.date.adate: 2007-08-02 [en]
dc.date.available: 2014-03-14T20:13:41Z [en]
dc.date.issued: 2007-06-06 [en]
dc.date.rdate: 2007-08-02 [en]
dc.date.sdate: 2007-07-02 [en]
dc.description.abstract: Concept maps, introduced by Novak, aid learners' understanding. I hypothesize that concept maps can also function as a summary of large documents, e.g., electronic theses and dissertations (ETDs). I have built a system that automatically generates concept maps from English-language ETDs in the computing field. The system will also provide Spanish translations of these concept maps for native Spanish speakers. Using machine translation techniques, my approach leads to concept maps that could allow researchers to discover pertinent dissertations in languages they cannot read, helping them to decide if they want a potentially relevant dissertation translated. I am using a state-of-the-art natural language processing system, called Relex, to extract noun phrases and noun-verb-noun relations from ETDs, and then produce concept maps automatically. I have also incorporated information from the table of contents of ETDs to create novel styles of concept maps. I have conducted five user studies to evaluate user perceptions of these different map styles. I am using several methods to translate node and link text in concept maps from English to Spanish. Nodes labeled with single words from a given technical area can be translated using wordlists, but phrases in specific technical fields can be difficult to translate. Thus, I have amassed a collection of about 580 Spanish-language ETDs from Scirus and two Mexican universities, and I am using this corpus to mine phrase translations that I could not find otherwise. The usefulness of the automatically generated and translated concept maps has been assessed in an experiment at Universidad de las Americas (UDLA) in Puebla, Mexico. This experiment demonstrated that concept maps can augment abstracts (translated using a standard machine translation package) in helping Spanish-speaking users find ETDs of interest. [en]
dc.description.degree: Ph. D. [en]
dc.identifier.other: etd-07022007-184525 [en]
dc.identifier.sourceurl: http://scholar.lib.vt.edu/theses/available/etd-07022007-184525/ [en]
dc.identifier.uri: http://hdl.handle.net/10919/28191 [en]
dc.publisher: Virginia Tech [en]
dc.relation.haspart: Richardson_Appendix_VI_VII.pdf [en]
dc.relation.haspart: Richardson_Appendix_III.pdf [en]
dc.relation.haspart: Richardson_Appendix_IV.pdf [en]
dc.relation.haspart: Richardson_etd.pdf [en]
dc.relation.haspart: Richardson_Appendix_V.pdf [en]
dc.relation.haspart: Richardson_Appendix_I.pdf [en]
dc.relation.haspart: Richardson_Appendix_II.pdf [en]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.subject: named entity extraction [en]
dc.subject: computing ontologies [en]
dc.subject: concept mapping [en]
dc.subject: cross-language information retrieval [en]
dc.title: Using Concept Maps as a Tool for Cross-Language Relevance Determination [en]
dc.type: Dissertation [en]
thesis.degree.discipline: Computer Science [en]
thesis.degree.grantor: Virginia Polytechnic Institute and State University [en]
thesis.degree.level: doctoral [en]
thesis.degree.name: Ph. D. [en]

Files

Original bundle
Name:
Richardson_etd.pdf
Size:
4.31 MB
Format:
Adobe Portable Document Format
Name:
Richardson_Appendix_I.pdf
Size:
764.06 KB
Format:
Adobe Portable Document Format
Name:
Richardson_Appendix_II.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
Name:
Richardson_Appendix_III.pdf
Size:
880.67 KB
Format:
Adobe Portable Document Format
Name:
Richardson_Appendix_IV.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format