ETDseer Concept Paper

dc.contributor.authorMa, Yufengen
dc.contributor.authorJiang, Tingtingen
dc.contributor.authorShrestha, Chandanien
dc.date.accessioned2017-05-27T15:04:26Zen
dc.date.available2017-05-27T15:04:26Zen
dc.date.issued2017-05-03en
dc.description.abstractETDSeer (electronic thesis and dissertation digital library connected with SeerSuite) will build on 15 years of collaboration between teams at Virginia Tech (VT) and Penn State University (PSU), since both have been leaders in the worldwide digital library (DL) community. VT helped launch the national and international efforts for ETDs more than 20 years ago, which have been led by the Networked Digital Library of Theses and Dissertations (NDLTD, directed by PI Fox); its Union Catalog has increased to 5 million records. PSU hosts CiteSeerX, which co-PI Giles launched almost 20 years ago, and which is connected with a wide variety of research results under the SeerSuite family. ETDs, typically in PDF, are a largely untapped international resource. Digital libraries with advanced services can effectively address the broad needs to discover and utilize ETDs of interest. Our research will leverage SeerSuite methods that have been applied mostly to short documents, plus a variety of exploratory studies at VT, and will yield a “web of graduate research”, rich knowledge bases, and a digital library with effective interfaces. References will be analyzed and converted to canonical forms, figures and tables will be recognized and re-represented for flexible searching, small sections (acknowledgments, biographical sketches) will be mined, and aids for researchers will be built especially from literature reviews and discussions of future work. Entity recognition and disambiguation will facilitate flexible use of a large graph of linked open data.en
dc.description.notesETDseerReport.pdf -- Final report in PDF format ETDseerPresentation.pptx -- Presentation slides in pptx format ETDseerPresentation.pdf -- Presentation slides in PDF format ETDseerLaTeX.zip -- LaTeX source files for final reporten
dc.identifier.urihttp://hdl.handle.net/10919/77868en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/en
dc.subjectdeep learningen
dc.subjectdomain independent digital libraryen
dc.subjectinformation extraction (IE)en
dc.subjectinformation retrievalen
dc.subjectnatural language processing (NLP)en
dc.subjectNDLTDen
dc.subjectCiteSeerXen
dc.titleETDseer Concept Paperen
dc.typePresentationen
dc.typeReporten
Files
Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
ETDseerReport.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format
Name:
ETDseerPresentation.pptx
Size:
4.52 MB
Format:
Microsoft Powerpoint XML
Loading...
Thumbnail Image
Name:
ETDseerPresentation.pdf
Size:
2.84 MB
Format:
Adobe Portable Document Format
Name:
ETDseerLaTeX.zip
Size:
3.11 MB
Format:
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: