Conversation Facts

dc.contributor.authorKang, Christianen
dc.contributor.authorMiller, Nathanen
dc.contributor.authorMarks, Jonathonen
dc.date.accessioned2019-05-20T02:54:10Zen
dc.date.available2019-05-20T02:54:10Zen
dc.date.issued2019-05-08en
dc.description.abstractThe Conversation Facts project is a part of Dr. Fox's CS 4624: Multimedia, Hypertext, and Information Access class; it was proposed by Saurabh Chakravarty as a way to help his research in natural language processing. The goal of the Conversation Facts project is to be able to take a summary of a conversation and link it back to where it occurs in the conversation dialogue. We used the Argumentative Dialogue Summary Corpus: Version 1 from Natural Language and Dialog Systems as our dataset for this project. This project was created in Python due to its natural language processing libraries which include spaCy and the Natural Language Toolkit (NLTK) libraries. These two contained the methods and techniques used in the project to parse the data and process it into the parts of speech for us to work with. Our general method of approach for this project was to create knowledge graphs of the summaries and the conversation dialogues. This way, we could connect the two based on the entity-relation-entity (ERE) triples. We can then compare the summary triple which would point us back to a corresponding conversation triple. This will link back to the section in the dialogue text that the summary is referencing. Upon completion of the project, we have found that our methods outperform naïve implementations of simply running our data through industry standard software, but there are still many things that could be improved to get better results. Our program focuses on utilizing natural language processing techniques, but we believe that machine learning could be applied to the data set in order to increase accuracy. The report explains the requirements set for the team to accomplish, the overall design of the project, the implementation of said design, and evaluation of results. It also includes a User’s Manual and Developer’s Manual to help illustrate how to either run the source code or continue development on the project. Finally, we describe the lessons learned throughout completing the project and list the resources used.en
dc.description.notesVT_CS4624_ConversationFacts.zip - A zipped version of the project code ConversationFactsFinalPresentation.pdf - The finalized version of the last presentation ConversationFactsFinalPresentation.pptx - An editable version of the last presentation ConversationFactsFinalReport.pdf - The finalized version of the report explaining the project ConversationFactsFinalReport.docx - An editable version of the report explaining the projecten
dc.identifier.urihttp://hdl.handle.net/10919/89567en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectNatural Language Processingen
dc.subjectPythonen
dc.subjectKnowledge Graphen
dc.subjectConversationen
dc.subjectSummaryen
dc.titleConversation Factsen
dc.typePresentationen
dc.typeReporten
dc.typeSoftwareen

Files

Original bundle
Now showing 1 - 5 of 5
Name:
VT_CS4624_ConversationFacts.zip
Size:
568.6 KB
Format:
Loading...
Thumbnail Image
Name:
ConversationFactsFinalPresentation.pdf
Size:
350.26 KB
Format:
Adobe Portable Document Format
Name:
ConversationFactsFinalPresentation.pptx
Size:
793 KB
Format:
Microsoft Powerpoint XML
Loading...
Thumbnail Image
Name:
ConversationFactsFinalReport.pdf
Size:
1.49 MB
Format:
Adobe Portable Document Format
Name:
ConversationFactsFinalReport.docx
Size:
1.24 MB
Format:
Microsoft Word XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: