Object Detection and Document Accessibility

dc.contributor.authorDevera, Alanen
dc.contributor.authorNader, Michaelen
dc.contributor.authorZhang, Zehuaen
dc.contributor.authorKeegan, Elizabethen
dc.contributor.authorGunn, Theodoreen
dc.contributor.authorNguyen, Gabrielleen
dc.contributor.authorWevley, Lukeen
dc.date.accessioned2023-07-05T17:42:55Zen
dc.date.available2023-07-05T17:42:55Zen
dc.date.issued2023-05-10en
dc.description.abstractElectronic Theses and Dissertations (ETDs) are the primary way that students and professors write down and report their degree research. They allow new minds to understand where that field of study was left off, and how to continue the work that has been left. However, since many of the ETDs uploaded onto the internet are presented via PDF, it's difficult for users to view these ETDs in an effective manner, especially when you consider potential students with disabilities such as visual impairments. The goal of this project was to extend upon the previous work that has been done to make a Flask-based web application so that we can transform these long documents into something much more readable, user-friendly, and accessible via HTML rather than PDF. Also, our goal was to apply an algorithm to the returned bounding boxes that come from the object detection model to make sure that separate paragraphs and references are placed into their own box for correct XML generation on the website. To make the application's UI usable, we have applied a few changes to improve the experience. We have created the option for users to download the paper via PDF or XML, have a side-bar on the left of the website that contains a dynamic table of contents to jump to whatever part of the paper you select, and have a side-bar view on the right of the website that contains the original PDF so that any errors in our application don't ruin the user's understanding. We plan for future contributors to add a dark mode and dyslexic-friendly font. Lots of accessibility features will be added via HTML/CSS/React through improving the UI, but what's also included is the option to use an on-screen reader. Our project focuses on using NVDA, a popular screen reader, to allow for users with potential visual impairments to be able to listen along to the ETD instead. This was studied thoroughly throughout the course of this project. Finally, for the algorithms side of the project, the focus has been to improve upon the returned bounding boxes from the object detection models to separate paragraph and reference bounding boxes to only include one paragraph or one reference per box. The object detection models do the best they can for the amount of training they've received, but errors are still possible. This side of the project focused on fixing those errors from the model to make sure that the XML generation works well and the text is readable on our final application. The algorithms team was able to get a good post-processing algorithm to work for around 90% of the paragraphs in the ETDs that were tested, but were unable to get to the references part of the deliverable. This is left for future collaborators.en
dc.identifier.urihttp://hdl.handle.net/10919/115644en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsAttribution-NonCommercial 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/en
dc.subjectETDen
dc.subjectElectronic Theses and Dissertationsen
dc.subjectObject Detectionen
dc.subjectDeep Learningen
dc.subjectAccessibilityen
dc.titleObject Detection and Document Accessibilityen
dc.typePresentationen
dc.typeReporten

Files

Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
ObjectDetectionDocAccessibilityReport.pdf
Size:
3.66 MB
Format:
Adobe Portable Document Format
Name:
ObjectDetectionDocAccessibilityReport.zip
Size:
4.14 MB
Format:
Loading...
Thumbnail Image
Name:
ObjectDetectionDocAccessibilityPresentation.pdf
Size:
4.19 MB
Format:
Adobe Portable Document Format
Name:
ObjectDetectionDocAccessibilityPresentation.pptx
Size:
6.22 MB
Format:
Microsoft Powerpoint XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: