Classifying ETDs

dc.contributor.authorShah, Vedanten
dc.contributor.authorRamesh, Vaishalien
dc.contributor.authorDaniel, Reemaen
dc.contributor.authorGathani, Mihir D.en
dc.date.accessioned2023-07-05T17:43:20Zen
dc.date.available2023-07-05T17:43:20Zen
dc.date.issued2023-05-17en
dc.description.abstractElectronic Theses and Dissertations (ETDs) are academic documents that provide an in-depth insight into an account of the research work of a graduate student and are designed to be stored in machine archives and retrieved globally. These documents contain abundant information that may be utilized by various machine learning tasks such as classification, summarization, and question-answering. However, these documents often have incomplete, incorrect, or inconsistent metadata which makes it challenging to accurately categorize these documents without manual intervention since there is no one uniform format to develop the metadata. Therefore, through the Classifying ETDs capstone project, we aim to create a gold standard classification dataset, leverage machine learning and deep learning algorithms to automatically classify ETDs with missing metadata, and develop a website to allow a user to classify an ETD with missing metadata and view already classified ETDs. The expected impact of this project is to advance information availability from long documents and eventually aid in improving long document information accessibility through regular search engines.en
dc.description.notesClassifyingETDsWebsiteDemo.mp4 - It is the demo of the website for the ClassifyingETDs project in .mp4 format. ClassifyingETDsPresentation.pptx - It is the final project presentation for the ClassifyingETDs project in .pptx form. ClassifyingETDsPresentation.pdf - It is the final project presentation for the ClassifyingETDs project in .pdf form. ClassifyingETDsReport.pdf - It is the final project report for the ClassifyingETDs project in .pdf form. ClassifyingETDsReport.docx - It is the final project report for the ClassifyingETDs project in .docx form.en
dc.identifier.urihttp://hdl.handle.net/10919/115647en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.subjectGold Standard ETD Classification Dataseten
dc.subjectDeep Learningen
dc.subjectText Classification Modelsen
dc.subjectInteractive User Interfaceen
dc.subjectData Cleaningen
dc.titleClassifying ETDsen
dc.typePresentationen
dc.typeReporten
dc.typeVideoen

Files

Original bundle
Now showing 1 - 5 of 5
Name:
ClassifyingETDsWebsiteDemo.mp4
Size:
4.26 MB
Format:
MP4 Container format for video files
Name:
ClassifyingETDsPresentation.pptx
Size:
8.25 MB
Format:
Microsoft Powerpoint XML
Loading...
Thumbnail Image
Name:
ClassifyingETDsPresentation.pdf
Size:
2.87 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
ClassifyingETDsReport.pdf
Size:
10.75 MB
Format:
Adobe Portable Document Format
Name:
ClassifyingETDsReport.docx
Size:
13.19 MB
Format:
Microsoft Word XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: