A Case Study of Using Domain Analysis for the Conflation Algorithms Domain

dc.contributor.authorYilmaz, Okanen
dc.contributor.authorFrakes, William B.en
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:37:11Zen
dc.date.available2013-06-19T14:37:11Zen
dc.date.issued2007en
dc.description.abstractThis paper documents the domain engineering process for much of the conflation algorithms domain. Empirical data on the process and products of domain engineering were collected. Six conflation algorithms of four different types: three affix removal, one successor variety, one table lookup, and one n-gram were analyzed. Products of the analysis include a generic architecture, reusable components, a little language and an application generator that extends the scope of the domain analysis beyond previous generators. The application generator produces source code for not only affix removal type but also successor variety, table lookup, and n-gram stemmers. The performance of the stemmers generated automatically was compared with the stemmers developed manually in terms of stem similarity, source and executable sizes, and development and execution times. All five stemmers generated by the application generator produced more than 99.9% identical stems with the manually developed stemmers. Some of the generated stemmers were as efficient as their manual equivalents and some were not.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00000993/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00000993/01/Yilmaz-tse07.pdfen
dc.identifier.trnumberTR-07-32en
dc.identifier.urihttp://hdl.handle.net/10919/19833en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectAlgorithmsen
dc.subjectData structuresen
dc.titleA Case Study of Using Domain Analysis for the Conflation Algorithms Domainen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yilmaz-tse07.pdf
Size:
477.57 KB
Format:
Adobe Portable Document Format