FFMPEG on the IBM Cloud
This module introduces FFMPEG to students in a Linux environment (IBM Cloud).
News Event Website
This report gives a detailed overview of the archiving services developed by the News Event Detection Website Group in the Virginia Tech Multimedia, Hypertext, and Information Access capstone. Our group developed a framework ...
Our project addressed the following problem: how can we easily visualize the content of a body of text without analyzing it manually? The initial goal was to visualize captioned college lectures, ...
Document Clustering for IDEAL
Document clustering is the unsupervised classification of text documents into groups (clusters). Documents with similar properties are grouped together into one cluster. Documents that have dissimilar patterns are ...
Classification Project in CS5604, Spring 2016
Within a large information retrieval project, our team performed text classification on both tweet collections and their associated webpages. To accomplish this task, we ...
PIAT - Poison Ivy Appalachian Trail Mega-Transect Data Collection Application
This report provides details and specifications for the Poison Ivy Appalachian Trail Mega-Transect Data Collection Application, referred to as PIAT. The report describes the software requirements, design, implementation, ...
OutbreakSum: Automatic Summarization of Texts Relating to Disease Outbreaks
The goal of the fall 2014 Disease Outbreak Project (OutbreakSum) was to develop software for automatically analyzing and summarizing large collections of texts pertaining to disease outbreaks. Although our code was tested ...
Collection Management for IDEAL
The collection management portion of the information retrieval system has three major tasks. The first task is to perform an incremental update of the new data flow from the tweet MySQL database to HDFS and then to HBase. ...
CS5604: Clustering and Social Networks for IDEAL
The Integrated Digital Event Archiving and Library (IDEAL) project of Virginia Tech provides services for searching, browsing, analysis, and visualization of over 1 billion tweets and over 65 million webpages. The project ...
Computational Linguistics Hurricane Group
The problem/project-based learning described in our presentation and report addresses automatic summarization of web content using natural language processing. Initially, we used simple techniques such as word frequencies ...