Phrasal Document Analysis for Modeling

dc.contributor.author: Sojitra, Ritesh D. [en]
dc.contributor.committeechair: Cyre, Walling R. [en]
dc.contributor.committeemember: Gray, Festus Gail [en]
dc.contributor.committeemember: Armstrong, James R. [en]
dc.contributor.department: Electrical and Computer Engineering [en]
dc.date.accessioned: 2014-03-14T20:52:26Z [en]
dc.date.adate: 1998-09-24 [en]
dc.date.available: 2014-03-14T20:52:26Z [en]
dc.date.issued: 1998-09-11 [en]
dc.date.rdate: 1998-09-24 [en]
dc.date.sdate: 1998-09-11 [en]
dc.description.abstract: Specifications of digital hardware systems are typically written in a natural language. The objective of this research is automatic information extraction from specifications to aid model generation for system-level design automation. This is done by automatically extracting the noun phrases and the verbs from the natural language specification statements. First, the natural language sentences are parsed using a chart parser. Then, a noun phrase and verb extractor scans these charts to obtain the noun phrases with their frequencies of occurrence. The noun phrases are then classified by semantic type. The verbs are also automatically assigned their respective roots and classified. Next, each sentence is summarized as a sequence of "chunks": noun phrases, verbs, and prepositions. Vectors are generated from these chunks and imported into MS Excel to plot occurrence graphs of noun phrases and verbs with respect to the sentences in which they occur. Finally, inter-term dependencies between noun phrases, and between noun phrases and verbs, were studied. Together, the frequencies of occurrence, the classification of chunks, the occurrence graphs, and the inter-term dependencies give useful information about the subject, the hardware components, and the behavior of a system described by a natural language specification document. [en]
dc.description.degree: Master of Science [en]
dc.identifier.other: etd-82398-164327 [en]
dc.identifier.sourceurl: http://scholar.lib.vt.edu/theses/available/etd-82398-164327/ [en]
dc.identifier.uri: http://hdl.handle.net/10919/36993 [en]
dc.publisher: Virginia Tech [en]
dc.relation.haspart: Thesis.pdf [en]
dc.rights: In Copyright [en]
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/ [en]
dc.subject: Chunk [en]
dc.subject: Information Extraction [en]
dc.subject: Modeling [en]
dc.subject: ModelMaker [en]
dc.subject: Noun Phrase [en]
dc.subject: Parser [en]
dc.title: Phrasal Document Analysis for Modeling [en]
dc.type: Thesis [en]
thesis.degree.discipline: Electrical and Computer Engineering [en]
thesis.degree.grantor: Virginia Polytechnic Institute and State University [en]
thesis.degree.level: masters [en]
thesis.degree.name: Master of Science [en]
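
The abstract describes counting noun-phrase frequencies and building per-sentence occurrence vectors from chunked sentences. The following is a minimal Python sketch of that one step only, assuming the sentences have already been parsed and chunked. The function names, the (chunk_type, text) tuple layout, and the sample sentences are hypothetical illustrations and are not taken from the thesis, which used a chart parser and MS Excel.

```python
from collections import Counter

# Hypothetical, hand-chunked sentences (an assumption for illustration).
# Each sentence is a list of (chunk_type, text) pairs, where chunk_type is
# "NP" (noun phrase), "V" (verb root), or "P" (preposition).
sentences = [
    [("NP", "the controller"), ("V", "send"), ("NP", "a request"),
     ("P", "to"), ("NP", "the memory")],
    [("NP", "the memory"), ("V", "return"), ("NP", "the data")],
    [("NP", "the controller"), ("V", "receive"), ("NP", "the data")],
]


def term_frequencies(sentences, chunk_type):
    """Count how often each chunk of the given type occurs in the document."""
    return Counter(
        text
        for sentence in sentences
        for kind, text in sentence
        if kind == chunk_type
    )


def occurrence_vectors(sentences, chunk_type):
    """Map each chunk to a 0/1 vector with one entry per sentence."""
    vectors = {}
    for index, sentence in enumerate(sentences):
        for kind, text in sentence:
            if kind == chunk_type:
                # Create an all-zero vector on first sight, then mark this sentence.
                vectors.setdefault(text, [0] * len(sentences))[index] = 1
    return vectors


np_freq = term_frequencies(sentences, "NP")
np_vectors = occurrence_vectors(sentences, "NP")
verb_vectors = occurrence_vectors(sentences, "V")
```

Each vector could be written out as one row of a CSV file and plotted against sentence number, which is one plausible way to obtain occurrence graphs like those the abstract mentions.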

Files

Original bundle
Name: Thesis.pdf
Size: 441.92 KB
Format: Adobe Portable Document Format
