Integrated Digital Library System for Long Documents and their Elements

Field | Value | Language
dc.contributor.author | Chekuri, Satvik | en
dc.contributor.author | Chandrasekar, Prashant | en
dc.contributor.author | Banerjee, Bipasha | en
dc.contributor.author | Park, Sung Hee | en
dc.contributor.author | Masrourisaadat, Nila | en
dc.contributor.author | Ahuja, Aman | en
dc.contributor.author | Ingram, William A. | en
dc.contributor.author | Fox, Edward A. | en
dc.date.accessioned | 2024-01-22T13:04:54Z | en
dc.date.available | 2024-01-22T13:04:54Z | en
dc.date.issued | 2023 | en
dc.description.abstract | We describe a next-generation integrated Digital Library (DL) system that addresses the numerous goals associated with long documents such as Electronic Theses and Dissertations (ETDs). Our extensible workflow-centric design supports a variety of users/personas (e.g., researchers, curators, and experimenters) who can benefit from improved access to ETDs and the content buried therein. Our approach leverages natural language processing, deep learning, information retrieval, and software engineering methods. The services cover ingesting, storing, curating, analyzing, detecting, extracting, classifying, summarizing, topic modeling, browsing, searching, retrieving, recommending, visualizing/reporting, and interacting with ETDs and derivative text/image-based elements/objects. Workflows connect the services and their APIs, along with UI-based access. We believe our approach can guide others to combine tailored user support, research, and education by way of extensible DLs. | en
dc.description.version | Accepted version | en
dc.format.extent | Pages 13-24 | en
dc.format.extent | 12 page(s) | en
dc.format.mimetype | application/pdf | en
dc.identifier.doi | https://doi.org/10.1109/JCDL57899.2023.00012 | en
dc.identifier.eissn | 2575-8152 | en
dc.identifier.isbn | 9798350399318 | en
dc.identifier.issn | 2575-7865 | en
dc.identifier.orcid | Ingram, William [0000-0002-8307-8844] | en
dc.identifier.orcid | Fox, Edward [0000-0003-1447-6870] | en
dc.identifier.uri | https://hdl.handle.net/10919/117429 | en
dc.identifier.volume | 2023-June | en
dc.language.iso | en | en
dc.publisher | ACM | en
dc.rights | In Copyright | en
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en
dc.subject | Digital Library | en
dc.subject | Information System | en
dc.subject | Information Retrieval | en
dc.subject | Deep Learning | en
dc.subject | NLP | en
dc.title | Integrated Digital Library System for Long Documents and their Elements | en
dc.title.serial | 2023 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, JCDL | en
dc.type | Conference proceeding | en
dc.type.dcmitype | Text | en
dc.type.other | Proceedings Paper | en
dc.type.other | Book in series | en
pubs.finish-date | 2023-06-30 | en
pubs.organisational-group | /Virginia Tech | en
pubs.organisational-group | /Virginia Tech/Engineering | en
pubs.organisational-group | /Virginia Tech/Engineering/Computer Science | en
pubs.organisational-group | /Virginia Tech/Library | en
pubs.organisational-group | /Virginia Tech/All T&R Faculty | en
pubs.organisational-group | /Virginia Tech/Engineering/COE T&R Faculty | en
pubs.organisational-group | /Virginia Tech/Library/Library assessment administrators | en
pubs.organisational-group | /Virginia Tech/Library/Dean's office | en
pubs.organisational-group | /Virginia Tech/Library/Information Technology | en
pubs.organisational-group | /Virginia Tech/Graduate students | en
pubs.organisational-group | /Virginia Tech/Graduate students/Doctoral students | en
pubs.start-date | 2023-06-26 | en

Files

Original bundle
Name: JCDL2023ETDSatvik.pdf
Size: 1.83 MB
Format: Adobe Portable Document Format
Description: Accepted version
License bundle
Name: license.txt
Size: 1.5 KB
Format: Plain Text
Description: