Integrated Digital Library System for Long Documents and their Elements
dc.contributor.author | Chekuri, Satvik | en |
dc.contributor.author | Chandrasekar, Prashant | en |
dc.contributor.author | Banerjee, Bipasha | en |
dc.contributor.author | Park, Sung Hee | en |
dc.contributor.author | Masrourisaadat, Nila | en |
dc.contributor.author | Ahuja, Aman | en |
dc.contributor.author | Ingram, William A. | en |
dc.contributor.author | Fox, Edward A. | en |
dc.date.accessioned | 2024-01-22T13:04:54Z | en |
dc.date.available | 2024-01-22T13:04:54Z | en |
dc.date.issued | 2023 | en |
dc.description.abstract | We describe a next-generation integrated Digital Library (DL) system that addresses the numerous goals associated with long documents such as Electronic Theses and Dissertations (ETDs). Our extensible workflow-centric design supports a variety of users/personas (e.g., researchers, curators, and experimenters) who can benefit from improved access to ETDs and the content buried therein. Our approach leverages natural language processing, deep learning, information retrieval, and software engineering methods. The services cover ingesting, storing, curating, analyzing, detecting, extracting, classifying, summarizing, topic modeling, browsing, searching, retrieving, recommending, visualizing/reporting, and interacting with ETDs and derivative text/image-based elements/objects. Workflows connect the services and their APIs, along with UI-based access. We believe our approach can guide others to combine tailored user support, research, and education by way of extensible DLs. | en |
dc.description.version | Accepted version | en |
dc.format.extent | Pages 13-24 | en |
dc.format.extent | 12 page(s) | en |
dc.format.mimetype | application/pdf | en |
dc.identifier.doi | https://doi.org/10.1109/JCDL57899.2023.00012 | en |
dc.identifier.eissn | 2575-8152 | en |
dc.identifier.isbn | 9798350399318 | en |
dc.identifier.issn | 2575-7865 | en |
dc.identifier.orcid | Ingram, William [0000-0002-8307-8844] | en |
dc.identifier.orcid | Fox, Edward [0000-0003-1447-6870] | en |
dc.identifier.uri | https://hdl.handle.net/10919/117429 | en |
dc.identifier.volume | 2023-June | en |
dc.language.iso | en | en |
dc.publisher | ACM | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | Digital Library | en |
dc.subject | Information System | en |
dc.subject | Information Retrieval | en |
dc.subject | Deep Learning | en |
dc.subject | NLP | en |
dc.title | Integrated Digital Library System for Long Documents and their Elements | en |
dc.title.serial | 2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL | en |
dc.type | Conference proceeding | en |
dc.type.dcmitype | Text | en |
dc.type.other | Proceedings Paper | en |
dc.type.other | Book in series | en |
pubs.finish-date | 2023-06-30 | en |
pubs.organisational-group | /Virginia Tech | en |
pubs.organisational-group | /Virginia Tech/Engineering | en |
pubs.organisational-group | /Virginia Tech/Engineering/Computer Science | en |
pubs.organisational-group | /Virginia Tech/Library | en |
pubs.organisational-group | /Virginia Tech/All T&R Faculty | en |
pubs.organisational-group | /Virginia Tech/Engineering/COE T&R Faculty | en |
pubs.organisational-group | /Virginia Tech/Library/Library assessment administrators | en |
pubs.organisational-group | /Virginia Tech/Library/Dean's office | en |
pubs.organisational-group | /Virginia Tech/Library/Information Technology | en |
pubs.organisational-group | /Virginia Tech/Graduate students | en |
pubs.organisational-group | /Virginia Tech/Graduate students/Doctoral students | en |
pubs.start-date | 2023-06-26 | en |