CS5604: Clustering and Social Networks for IDEAL
(2016-05-03)
The Integrated Digital Event Archiving and Library (IDEAL) project of Virginia Tech provides services for searching, browsing, analysis, and visualization of over 1 billion tweets and over 65 million webpages. The project ...
CS5604 Fall 2017 Classification Team Submission
(Virginia Tech, 2018-01-03)
This project submission includes the work of the 'Classification' team of the CS5604 'Information Storage and Retrieval' course of Fall 2017 towards the GETAR project. Classification of the GETAR data would allow users to ...
Topic Analysis project in CS5604, Spring 2016: Extracting Topics from Tweets and Webpages for IDEAL
(2016-05-04)
The IDEAL (Integrated Digital Event Archiving and Library) project aims to ingest tweets and web-based content from social media and the web and index it for retrieval. One of the required milestones for a graduate-level ...
Solr Project with IDEAL, in CS5604 (Information Storage and Retrieval)
(2016-05-04)
This submission describes the work of the Solr team as part of the IDEAL project with the main goal of designing and developing a distributed search infrastructure. It includes the project reports, final presentations, as ...
Collaborative Filtering for IDEAL
(2016-05-04)
The students of CS5604 (Information Storage and Retrieval) have been building an Information Retrieval System based on the tweet and webpage collections of the Digital Library Research Laboratory (DLRL). The students have ...
CS5604 Front-End User Interface Team
(2016-05-03)
This project is part of a wider research project whose focus is developing an information retrieval and analysis system in support of the IDEAL (Integrated Digital Event Archiving and Library) project. The search engine ...
Collection Management for IDEAL
(2016-05-04)
The collection management portion of the information retrieval system has three major tasks. The first task is to perform incremental updates of new data flowing from the tweet MySQL database to HDFS and then to HBase. ...
Classification Project in CS5604, Spring 2016
(2016-05-04)
Within a large Information Retrieval project, our team's role was to perform text classification on both tweet collections and their associated webpages. In order to accomplish this task, we ...
CS5604 Information Storage and Retrieval Fall 2017 Solr Report
(Virginia Tech, 2018-01-15)
The Digital Library Research Laboratory (DLRL) has collected over 1.5 billion tweets and millions of webpages for the Integrated Digital Event Archiving and Library (IDEAL) and Global Event and Trend Archive Research (GETAR) ...
Clustering and Topic Analysis in CS 5604 Information Retrieval Fall 2016
(Virginia Tech, 2016-12-08)
The IDEAL (Integrated Digital Event Archiving and Library) and Global Event and Trend Archive Research (GETAR) projects aim to build a robust Information Retrieval (IR) system by retrieving tweets and webpages from social ...