Developing an improved focused crawler for the IDEAL project
The IDEAL (Integrated Digital Event Archive and Library) project currently has a general-purpose web crawler to find articles relevant to a set of URLs the user can provide. The resulting articles are returned based on ...
Computational Linguistic Analysis of Earthquake Collections
CS4984 is a newly offered class at Virginia Tech with a unit-based, project/problem-based learning curriculum. This class style is based on NSF-funded work on curriculum for the field of digital libraries and related topics, ...
NRV Tweets and RSS feeds
The goal of this project was to associate existing data in the Virtual Town Square database from the New River Valley area with topical metadata. We took a database of approximately 360,000 tweets and 15,000 RSS news stories ...
Generating an Intelligent Human-Readable Summary of a Shooting Event from a Large Collection of Webpages
We describe our approach to generating summaries of a shooting event from a large collection of webpages. We work with two separate events: a shooting at a school in Newtown, Connecticut and another at a mall in Tucson, ...
Computational Linguistics Hurricane Group
The problem/project-based learning described in our presentation and report addresses automatic summarization of web content using natural language processing. Initially, we used simple techniques such as word frequencies ...
OutbreakSum: Automatic Summarization of Texts Relating to Disease Outbreaks
The goal of the fall 2014 Disease Outbreak Project (OutbreakSum) was to develop software for automatically analyzing and summarizing large collections of texts pertaining to disease outbreaks. Although our code was tested ...