    • Document Clustering for IDEAL 

      Thumma, Sujit Reddy; Kalidas, Rubasri; Torkey, Hanaa (2015-05-13)
      Document clustering is an unsupervised classification of text documents into groups (clusters). The documents with similar properties are grouped together into one cluster. Documents which have dissimilar patterns are ...
    • Focused Crawling 

      Farag, Mohamed Magdy Gharib; Khan, Mohammed Saquib Akmal; Mishra, Gaurav; Ganesh, Prasad Krishnamurthi (2012-12-11)
      Finding information on the WWW is a difficult and challenging task because of the extremely large volume of the WWW. Search engines can be used to facilitate this task, but it is still difficult to cover all the webpages on the ...
    • Hadoop Project for IDEAL in CS5604 

      Cadena, Jose; Chen, Mengsu; Wen, Chengyuan (Virginia Tech, 2015-05-11)
      The Integrated Digital Event Archive and Library (IDEAL) system addresses the need for combining the best of digital library and archive technologies in support of stakeholders who are remembering and/or studying important ...
    • Large Scale Network Visualization with Gephi 

      Alam, Maksudul; Arifuzzaman, S M; Bhuiyan, Md Hasanuzzaman (2012-12-11)
      The notion of graphs or networks is sufficiently pervasive since it can be used to model various types of data sources. Social, biological, and other networks capture the underlying structural and relational properties. ...
    • LDA Team Project in CS5604, Spring 2015: Extracting Topics from Tweets and Webpages for IDEAL 

      Pumma, Sarunya; Liu, Xiaoyang (2015-05-10)
      IDEAL or Integrated Digital Event Archiving and Library is a project of Virginia Tech to implement a state-of-the-art event-based information retrieval system. A practice project of CS 5604 Information Retrieval is a part ...
    • Leveraging eXist-db for Efficient TEI Document Management 

      Schutt, Kyle; Morgan, Kyle (2012-12-10)
      Professor David Radcliffe has created Lord Byron and his Times (LBT), a large digital archive of works surrounding Lord Byron and his contemporaries. The original website was unusably slow due to the expensive XSLT ...
    • Named Entity Recognition for IDEAL 

      Du, Qianzhou; Zhang, Xuan (2015-05-10)
      The term “Named Entity”, which was first introduced by Grishman and Sundheim, is widely used in Natural Language Processing (NLP). The researchers were focusing on the information extraction task, that is extracting ...
    • ProjOpenDSA - OpenDSA Log Support 

      Wei, Shiyi; Suwardiman, Victoria; Swaminathan, Anand (2012-12-11)
      The OpenDSA project is an online eTextbook project that includes not only literature but other dynamic content to be used in Data Structures and Algorithms courses. OpenDSA contains exercises of various types to go along ...
    • Reducing Noise for IDEAL 

      Wang, Xiangwen; Chandrasekar, Prashant (2015-05-12)
      The corpora for which we are building an information retrieval system consist of tweets and web pages (extracted from URL links that might be included in the tweets) that have been selected based on rudimentary string ...
    • Social Network Project for IDEAL in CS5604 

      Harb, Islam; Jin, Yilong; Cedeno, Vanessa; Mallampati, Sai Ravi Kiran; Bulusu, Bhaskara Srinivasa Bharadwaj (2015-05-11)
      The IDEAL (Integrated Digital Event Archiving and Library) project involves VT faculty, staff, and students, along with collaborators around the world, in archiving important events and integrating the digital library, ...
    • Solr Project with IDEAL, in CS5604 (Information Storage and Retrieval) 

      Xia, Long; Jiang, Tingting; Galad, Andrej; Maharshi, Shivam (2016-05-04)
      This submission describes the work of the Solr team as part of the IDEAL project with the main goal of designing and developing a distributed search infrastructure. It includes the project reports, final presentations, as ...
    • Solr Team Project Report 

      Gruss, Richard; Choudhury, Ananya; Komawar, Nikhil (2015-05-13)
      The Integrated Digital Event Archive and Library (IDEAL) is a Digital Library project that aims to collect, index, archive and provide access to digital contents related to important events, including disasters, man-made ...
    • Topic Analysis project in CS5604, Spring 2016: Extracting Topics from Tweets and Webpages for IDEAL 

      Mehta, Sneha; Vinayagam, Radha Krishnan (2016-05-04)
      The IDEAL (Integrated Digital Event Archiving and Library) project aims to ingest tweets and web-based content from social media and the web and index it for retrieval. One of the required milestones for a graduate-level ...