Now showing items 1-6 of 6
VT Web Archive Project
VTWebArchive is a project to archive, organize, and make available to the public, historical back-versions of content hosted on vt.edu domains. This system incorporates several open source software packages to design a ...
Developing an improved focused crawler for the IDEAL project
The IDEAL (Integrated Digital Event Archive and Library) project currently has a general purpose web crawler to find articles relevant to a set of URLs the user can provide. The resulting articles are return based on ...
The Digital Library Research Laboratory is a group focused on researching and implementing a full stack Hadoop cluster for data storage and analysis. The DLRL Cluster project is focused on learning and teaching the ...
NRV Tweets and RSS feeds
The goal of this project was to associate existing data in the Virtual Town Square database from the New River Valley area with topical metadata. We took a database of approximately 360,000 tweets and 15,000 RSS news stories ...
CS4624 IDEAL Spreadsheet
The IDEAL proposal encompasses an incredibly vast infrastructure of technology intended to be used by people of varying backgrounds. The analysts and researchers who will be familiar with the data presented through many ...
Xpantrac Connection with IDEAL
Title: Integrating Xpantrac into the IDEAL software suite, and applying it to identify topics for IDEAL webpages Identifying topics is useful because it allows us to easily understand what a document is about. If we ...