Now showing items 1-10 of 149
Sentiment and Topic Analysis
(Virginia Tech, 2017-05-03)
The IDEAL (Integrated Digital Event Archiving and Library) and Global Event and Trend Archive Research (GETAR) projects have collected over 1.5 billion tweets, and webpages from social media and the World Wide Web and ...
FFMPEG on the IBM Cloud
This module aims to introduce FFMPEG to students in a linux environment (IBM Cloud)
Collection Management Webpages
(Virginia Polytechnic Institute and State University, 2017-12-25)
The Collection Management Webpages team is responsible for collecting, processing, and storing webpages from different sources. Our team worked on familiarizing ourselves with the necessary tools and data required to produce ...
Analyzing Microblog Feeds to Trade Stocks
(Virginia Tech, 2017-05-10)
The goal of this project is to leverage microblogging data about the stock market to predict price trends and execute trades based on these predictions. Predicting the price trends of stocks with microblogging data involves ...
CS5604: Information and Storage Retrieval Fall 2016 - CMT (Collection Management Tweets)
(Virginia Tech, 2016-12-08)
As the Collection Management Tweets team in the Fall 2016 CS5604 class, we were responsible for processing >1.2 billion tweets, including data transfer, noise reduction, tweet augmentation, and storage via several technologies. ...
CS 5604 INFORMATION STORAGE AND RETRIEVAL Front-End Team Fall 2016 Final Report
(Virginia Tech, 2016-12-08)
Information Retrieval systems are a common tool for building research and disseminating knowledge. For this to be possible, these systems must be able to effectively show varying amounts of relevant information to the ...
CS5604: Information and Storage Retrieval Fall 2017 - FE (Front-End Team)
(Virginia Tech, 2017-12-24)
Social media and Web data are becoming important sources of information for researchers to monitor and study global events. GETAR, led by Dr. Edward Fox, is a project aiming to collect, organize, browse, visualize, ...
News Event Website
This report gives a detailed overview of the archiving services developed by the News Event Detection Website Group in the Virginia Tech Multimedia, Hypertext, and Information Access capstone. Our group developed a framework ...
Leveraging eXist-db for Efficient TEI Document Management
Professor David Radcliffe has created Lord Byron and his Times (LBT), a large digital archive of works surrounding Lord Byron and his contemporaries. The original website was unusable slow due to the expensive XSLT ...
The following problem was addressed by our project: How can we easily visualize the content of a body of text without manually analyzing its content? The initial goal was to be able to visualize captioned college lectures, ...