  • SEDNA XML Database 

    Vijay, Sony; El Meligy Abdelhamid, Sherif; Malayattil, Sarosh (2010-12-09)
    The module introduces the use of the SEDNA XML database for XML retrieval. The primary focus of the module is to describe the architecture of the SEDNA database and how standard XML queries can be used to retrieve data from it.
  • Set Orthogonality 

    Suleman, Hussein; Zubair, Mohammad (2001-10-19)
    There is no way to determine all the sets that an identifier belongs to. This is typically referred to as set orthogonality because the protocol allows a harvester to find out which identifiers belong to a particular set ...
  • Social Media for Cities, Counties, and Communities 

    Kavanaugh, Andrea L.; Natsev, Apostol; Fox, Edward A.; Sheetz, Steven D.; Shoemaker, Donald J.; Xie, Lexing (2011-03-11)
    Social media (i.e., Twitter, Facebook, Flickr, YouTube) and other tools and services with user generated content have made a staggering amount of information (and misinformation) available. Some government officials seek ...
  • Social Media Use by Government: From the Routine to the Critical 

    Kavanaugh, Andrea L.; Fox, Edward A.; Sheetz, Steven D.; Yang, Seungwon; Li, Lin T.; Whalen, Travis; Shoemaker, Donald J.; Natsev, Paul; Xie, Lexing (2011-06)
    Social media (i.e., Twitter, Facebook, Flickr, YouTube) and other services with user-generated content have made a staggering amount of information (and misinformation) available. Government officials seek to leverage these ...
  • Supporting Document Triage via Annotation-Based Multi-Application Visualizations 

    Bae, Soonil; Kim, DoHyoung; Meintanis, Konstantinos; Moore, Michael; Zacchi, Anna; Shipman, Frank M., III; Hsieh, Haowei; Marshall, Cathy (2010)
    For open-ended information tasks, users must sift through many potentially relevant documents, a practice we refer to as document triage. Normally, people perform triage using multiple applications in concert: a search ...
  • Text Classification Using Mahout 

    Alam, Maksudul; Arifuzzaman, S. M.; Bhuiyan, Md Hasanuzzaman (2012-11-06)
    This module focuses on classification of text using Apache Mahout. After successful completion of this module, students will be able to explain and apply methods of classification, correctly classify a set of documents ...
  • Text Clustering Using LucidWorks and Apache Mahout 

    Chen, Liangzhe; Lin, Xiao; Wood, Andrew (2012-11-17)
    This module introduces algorithms and evaluation metrics for flat clustering. We focus on the usage of LucidWorks big data analysis software and Apache Mahout, an open source machine learning library in clustering of ...
  • Twitter Use During an Emergency Event: The Case of UT Austin Shooting 

    Li, Lin T.; Yang, Seungwon; Kavanaugh, Andrea L.; Fox, Edward A.; Sheetz, Steven D.; Shoemaker, Donald J. (2011-06)
    This poster presents one of our efforts developed in the context of the Crisis, Tragedy, and Recovery Network (CTRnet) project. One of our derived works from this project is the use of social media by government to respond to ...
  • Two Approaches to Enhance the Education for ETDs: Developing Educational Modules and Migrating the ETD Guide into a Community Wiki 

    Yang, Seungwon; Levy, Jean; Miller, Kevin; Pomerantz, Jeffrey P.; Oh, Sanghee; Wildemuth, Barbara M.; Fox, Edward A. (2008-05-19)
    Two efforts have been made by the Digital Library (DL) Curriculum Development Project Group (http://curric.dlib.vt.edu) to help the ETD community. Our first activity is the preparation of multiple educational modules, which ...
  • Use and Usability in a Digital Library Search System 

    France, Robert K.; Nowell, Lucy T.; Fox, Edward A.; Saad, Rani A.; Zhao, Jianxin (1999)
    Digital libraries must reach out to users from all walks of life, serving information needs at all levels. To do this, they must attain high standards of usability over an extremely broad audience. This paper details the ...
  • Use and Usability in a Digital Library Search System 

    France, Robert K.; Nowell, Lucy Terry; Fox, Edward A.; Saad, Rani A.; Zhao, Jianxin (Virginia Tech Digital Library Research Laboratory, 1999)
    Digital libraries must reach out to users from all walks of life, serving information needs at all levels. To do this, they must attain high standards of usability over an extremely broad audience. This paper details the ...
  • Using the Repository Explorer to Achieve OAI Protocol Compliance

    Suleman, Hussein (2001-06-24)
    The Open Archives Initiative (OAI) is dedicated to solving problems of digital library interoperability by defining simple protocols, most recently the Open Archives Initiative Protocol for Metadata Harvesting [2], which ...
  • The Variety of Ways in Which Instructors Implement a Modular Digital Library Curriculum 

    Wildemuth, Barbara M.; Pomerantz, Jeffrey P.; Oh, Sanghee; Yang, Seungwon; Fox, Edward A. (2009-05-14)
    This poster illustrates how information professionals can implement a modular digital library curriculum. It discusses instructors' perspectives on digital library syllabi, assignments, readings, and overall course content. ...
  • Web Archiving 

    Lee, Spencer; Kan'an, Tarek; Jiao, Jian (2009-10-09)
    This module covers the ideas, approaches, problems, and needs of web archiving to build a static, long-term collection consisting of web pages.
  • Web Publishing 

    Karia, Pratik (2009-09-08)
    This module covers the general principles of web publishing and the various paradigms that can be used for storing and retrieving content within digital libraries. This module introduces various techniques to publish ...
  • Weights and Measures: An Axiomatic Model for Similarity Computations 

    France, Robert K. (1994)
    This paper proposes a formal model for similarity functions, first over arbitrary objects, then over sets and the sorts of weighted sets that are found in text retrieval systems. Using a handful of axioms and constraints, ...
  • Weka 

    Peddi, Bhanu; Xiong, Huijun; ElSherbiny, Noha (2010-12-10)
    This module stresses the methods of text classification used in information retrieval. We focus on the usage of Weka, a data mining toolkit, in data processing with three classification algorithms: Naive Bayes [1], k Nearest ...
  • What is a Successful Digital Library? 

    Shen, Rao; Vemuri, Naga S.; Fan, Weiguo; Fox, Edward A. (2006-09-18)
    We synthesize diverse research in the area of digital library (DL) quality models, information systems (IS) success and adoption models, and information-seeking behavior models, to present a more integrated view of the ...
  • When Stopping Rules Don't Stop 

    France, Robert K. (1995)
    Performing ranked retrieval on large document collections can be slow. The method of stopping rules has been proposed to make it more efficient. Stopping rules, which terminate search when the highest ranked documents have ...
  • Why Students Use Social Networking Sites After Crisis Situations 

    Sheetz, Steven D.; Fox, Edward A.; Fitzgerald, Andrew; Palmer, Sean; Shoemaker, Donald J.; Kavanaugh, Andrea L. (2011)
    Communities respond to tragedy by making virtuous use of social networking sites for a variety of purposes. We asked students to describe why they used a social networking site after the tragic shootings at Virginia Tech, ...