
    • Collection Management for IDEAL 

      Ma, Yufeng; Nan, Dong (2016-05-04)
      The collection management portion of the information retrieval system has three major tasks. The first task is to perform incremental update of the new data flow from the tweet MySQL database to HDFS and then to HBase. ...
    • CS5604 Front-End User Interface Team 

      Masiane, Moeti; Warren, Lawrence (2016-05-03)
      This project is part of a wider research project whose focus is developing an information retrieval and analysis system in support of the IDEAL (Integrated Digital Event Archiving and Library) project. The search engine ...
    • Effective Search in Online Knowledge Communities: A Genetic Algorithm Approach 

      Zhang, Xiaoyu (Virginia Tech, 2009-09-11)
      Online Knowledge Communities, also known as online forums, are popular web-based tools that allow members to seek and share knowledge. Documents to answer varieties of questions are associated with the process of knowledge ...
    • Focused Crawling 

      Farag, Mohamed Magdy Gharib; Khan, Mohammed Saquib Akmal; Mishra, Gaurav; Ganesh, Prasad Krishnamurthi; Collins, Wil; Dickerson, Will (Virginia Tech, 2012-12-11)
      Finding information on the WWW is a difficult and challenging task because of the extremely large volume of content in the WWW. Search engines can be used to facilitate this task, but it is still difficult to cover all the ...
    • Forecasting Protests by Detecting Future Time Mentions in News and Social Media 

      Muthiah, Sathappan (Virginia Tech, 2014-07-11)
      Civil unrest (protests, strikes, and "occupy" events) is a common occurrence in both democracies and authoritarian regimes. The study of civil unrest is a key topic for political scientists as it helps capture an important ...
    • Iterative Computing over a Unified Relationship Matrix for Information Integration 

      Xi, Wensi (Virginia Tech, 2006-06-20)
      In this dissertation I use a Unified Relationship Matrix (URM) to represent a set of heterogeneous data objects and their inter-relationships. I argue that integrated and iterative computations over the Unified ...
    • Named Entity Recognition for IDEAL 

      Du, Qianzhou; Zhang, Xuan (2015-05-10)
      The term “Named Entity”, which was first introduced by Grishman and Sundheim, is widely used in Natural Language Processing (NLP). The researchers were focusing on the information extraction task, that is, extracting ...
    • Program Transformations for Information Personalization 

      Perugini, Saverio (Virginia Tech, 2004-05-28)
      Personalization constitutes the mechanisms and technologies necessary to customize information access to the end-user. It can be defined as the automatic adjustment of information content, structure, and presentation. The ...
    • Program Transformations for Information Personalization 

      Perugini, Saverio (Virginia Tech, 2004-05-14)
      Personalization constitutes the mechanisms and technologies necessary to customize information access to the end-user. It can be defined as the automatic adjustment of information content, structure, and presentation. ...
    • Solr Team Project Report 

      Gruss, Richard; Choudhury, Ananya; Komawar, Nikhil (2015-05-13)
      The Integrated Digital Event Archive and Library (IDEAL) is a Digital Library project that aims to collect, index, archive and provide access to digital contents related to important events, including disasters, man-made ...
    • Topic Analysis project in CS5604, Spring 2016: Extracting Topics from Tweets and Webpages for IDEAL 

      Mehta, Sneha; Vinayagam, Radha Krishnan (2016-05-04)
      The IDEAL (Integrated Digital Event Archiving and Library) project aims to ingest tweets and web-based content from social media and the web and index it for retrieval. One of the required milestones for a graduate-level ...