News Event Website
(2015-05-14)
This report gives a detailed overview of the archiving services developed by the News Event Detection Website Group in the Virginia Tech Multimedia, Hypertext, and Information Access capstone. Our group developed a framework ...
Document Clustering for IDEAL
(2015-05-13)
Document clustering is an unsupervised classification of text documents into groups (clusters). The documents with similar properties are grouped together into one cluster. Documents which have dissimilar patterns are ...
Classification Project in CS5604, Spring 2016
(2016-05-04)
As part of a large Information Retrieval project, our team performed text classification on both tweet collections and their associated webpages. To accomplish this task, we ...
PIAT - Poison Ivy Appalachian Trail Mega-Transect Data Collection Application
(2016-05-08)
Provided are details and specifications for the Poison Ivy Appalachian Trail Mega-Transect Data Collection Application, referred to as PIAT. Described in the report are the software requirements, design, implementation, ...
OutbreakSum: Automatic Summarization of Texts Relating to Disease Outbreaks
(2014-12)
The goal of the fall 2014 Disease Outbreak Project (OutbreakSum) was to develop software for automatically analyzing and summarizing large collections of texts pertaining to disease outbreaks. Although our code was tested ...
Collection Management for IDEAL
(2016-05-04)
The collection management portion of the information retrieval system has three major tasks. The first task is to incrementally update HDFS, and then HBase, with new data flowing from the tweet MySQL database. ...
CS5604: Clustering and Social Networks for IDEAL
(2016-05-03)
The Integrated Digital Event Archiving and Library (IDEAL) project of Virginia Tech provides services for searching, browsing, analysis, and visualization of over 1 billion tweets and over 65 million webpages. The project ...
Computational Linguistics Hurricane Group
(2014-12)
The problem-project based learning described in our presentation and report addresses automatic summarization of web content using natural language processing. Initially, we used simple techniques such as word frequencies ...
IDRgeneralization: Music Appreciation
(2013-05-18)
When instructors teach courses, they break up the material into components. The students need to fully understand these components in order to understand the entire course. This concept led Uma Murthy to create the SuperIDR. ...
ProjOpenDSA - OpenDSA Log Support
(2012-12-11)
The OpenDSA project is an online eTextbook project that includes not only literature but other dynamic content to be used in Data Structures and Algorithms courses. OpenDSA contains exercises of various types to go along ...