CS4624 SP2025 Search ETD Elements

Abstract

ETDs (Electronic Theses and Dissertations) are documents that are written by hard-working individuals, containing information about significant discoveries and achievements regarding their work. For our project, we continued building the system off of previous teams’ (mainly 2022 and 2023) objectives. This includes scaling the application from 200k ETDs to 500k ETDs, increasing indexing speed and search/retrieval speed, and adding additional web components to support figure and table captions. A critical component of this project is Elasticsearch, a search engine built on Apache Lucene, which is used to store and search these ETDs, figures, and captions.

Description

Repository Links: Full-Stack Repository: https://code.vt.edu/cs4624-sp2025/search-etd-elements-full-stack Back-End Repository: https://code.vt.edu/cs4624-sp2025/search-etd-elements Front-End Repository: https://code.vt.edu/cs4624-sp2025/search-etd-elements-front-end

Keywords

Citation