CS6604: Digital Libraries
Browse by
Recent Submissions
-
Classification and extraction of information from ETD documents
(Virginia Tech, 2020-01-30)In recent years, advances in natural language processing, machine learning, and neural networks have led to powerful tools for digital libraries, allowing library collections to be discovered, used, and reused in exciting ... -
Otrouha: Automatic Classification of Arabic ETDs
(Virginia Tech, 2020-01-23)ETDs are becoming a new genre of documents that is highly precious and worth preserving. This has resulted in a sustainable need to build an effective tool to facilitate retrieving ETD collections. While Arabic ETDs have ... -
Toward an Intelligent Crawling Scheduler for Archiving News Websites Using Reinforcement Learning
(Virginia Tech, 2019-12-03)Web crawling is one of the fundamental activities for many kinds of web technology organizations and companies such as Internet Archive and Google. While companies like Google often focus on content delivery for users, ... -
Tweet Analysis and Classification: Diabetes and Heartbleed Internet Virus as Use Cases
(Virginia Tech, 2019-12-24)The proliferation of data on social media has driven the need for researchers to develop algorithms to filter and process this data into meaningful information. In this project, we consider the task of classifying tweets ... -
Cross-Platform Data Collection and Analysis for Online Hate Groups
(Virginia Tech, 2019-12-26)Hate groups are using online social media increasingly over the last decade. An online audience of hate groups is exposed to the material with hateful agenda and underlying propaganda. The presence of hate across multiple ... -
ACM Venue Recommendation System
(Virginia Tech, 2019-12-23)A frequent goal of a researcher is to publish his/her work in appropriate conferences and journals. With a large number of options for venues in the microdomains of every research discipline, the issue of selecting suitable ... -
Generating Synthetic Healthcare Records Using Convolutional Generative Adversarial Networks
(Virginia Tech, 2019-12-20)Deep learning models have demonstrated high-quality performance in several areas such as image classification and speech processing. However, creating a deep learning model using electronic health record (EHR) data requires ... -
Social Communities Knowledge Discovery: Approaches applied to clinical study
(Virginia Tech, 2017-05)In recent efforts being conducted by the Social Interactome team, to validate hypotheses of the study, we have worked to make sense of the data that has been collected during two 16-week experiments and three Amazon ... -
Sentiment and Topic Analysis
(Virginia Tech, 2017-05-03)The IDEAL (Integrated Digital Event Archiving and Library) and Global Event and Trend Archive Research (GETAR) projects have collected over 1.5 billion tweets, and webpages from social media and the World Wide Web and ... -
ETDseer Concept Paper
(Virginia Tech, 2017-05-03)ETDSeer (electronic thesis and dissertation digital library connected with SeerSuite) will build on 15 years of collaboration between teams at Virginia Tech (VT) and Penn State University (PSU), since both have been leaders ... -
CS6604 Spring 2017 Global Events Team Project
(Virginia Tech, 2017-05-03)This submission describes the work the Global Events team completed in Spring 2017. It includes the final report and presentation, as well as key relevant materials (source code). Based on the previous reports and different ... -
Unsupervised Event Extraction from News and Twitter
(2014-05-11)Living in the age of big data, we are facing massive information every day, especially that from the mainstream news and the social networks. Due to its gigantic volume, one may get frustrated when trying to identify the ... -
Epidemiology Network
(2014-05-11)This project aims at developing an RDF graph building service for Cyber Infrastructure for Network Science (CINET). The purpose of this service is to do web crawling and find digital contents related to user requests. More ... -
IDEAL Pages
(2014-05-10)The main goal of this project is to provide a convenient Web enabled interface to a large collection of event-related webpages supporting the two main services of browsing and searching. We first studied the events and ... -
Twitter Metadata
(2014-05-10)A number of projects and research efforts work with collections of tweets. Of particular interest is the collection of tweets related to world events. Many organizations have their own individual tweet collections regarding ... -
CINET Registry
(2014-05-09)Cyber-infrastructure for Network Science (CINET) is a computational and analytic framework for the network science researcher and education. The cyber-infrastructure (CI) part of CINET is responsible for coordinating the ... -
Qatar content classification
(2014-05-09)This reports on a term project for the CS660 Digital libraries course (Spring 2014). The project has been conducted under the supervision of Prof. Edward Fox and Mr. Tarek Kanan. The goal is to develop an Arabic newspaper ... -
Ensemble Classification Project
(2014-05-08)Transfer learning unlike traditional machine learning is a technique that allows domains, tasks and distributions used in training and testing to be different. Knowledge gained from one domain can be utilized to learn a ... -
Knowledge Building and Sharing: A Metamodel for Guided Research, Learning, and Application
(2014-05-07)Specific field methodology and models cannot be an afterthought when designing, developing, or administering any kind of technology or system. However, the mass amount of techniques and options can be both overwhelming ...