    • Text Classification Using Mahout 

      Alam, Maksudul; Arifuzzaman, S. M.; Bhuiyan, Md Hasanuzzaman (2012-11-06)
      This module focuses on classification of text using Apache Mahout. After successful completion of this module, students will be able to explain and apply methods of classification, correctly classify a set of documents ...
    • Text Clustering Using LucidWorks and Apache Mahout 

      Chen, Liangzhe; Lin, Xiao; Wood, Andrew (2012-11-17)
      This module introduces algorithms and evaluation metrics for flat clustering. We focus on the usage of LucidWorks big data analysis software and Apache Mahout, an open source machine learning library in clustering of ...
    • Twitter Use During an Emergency Event: The Case of UT Austin Shooting 

      Li, Lin T.; Yang, Seungwon; Kavanaugh, Andrea L.; Fox, Edward A.; Sheetz, Steven D.; Shoemaker, Donald J. (2011-06)
      This poster presents one of our efforts developed in the context of the Crisis, Tragedy, and Recovery Network (CTRnet) project. One of our derived works from this project is the use of social media by government to respond to ...
    • Two Approaches to Enhance the Education for ETDs: Developing Educational Modules and Migrating the ETD Guide into a Community Wiki 

      Yang, Seungwon; Levy, Jean; Miller, Kevin; Pomerantz, Jeffrey P.; Oh, Sanghee; Wildemuth, Barbara M.; Fox, Edward A. (2008-05-19)
      Two efforts have been made by the Digital Library (DL) Curriculum Development Project Group (http://curric.dlib.vt.edu) to help the ETD community. Our first activity is the preparation of multiple educational modules, which ...
    • Use and Usability in a Digital Library Search System 

      France, Robert K.; Nowell, Lucy Terry; Fox, Edward A.; Saad, Rani A.; Zhao, Jianxin (Virginia Tech Digital Library Research Laboratory, 1999)
      Digital libraries must reach out to users from all walks of life, serving information needs at all levels. To do this, they must attain high standards of usability over an extremely broad audience. This paper details the ...
    • Use and Usability in a Digital Library Search System 

      France, Robert K.; Nowell, Lucy T.; Fox, Edward A.; Saad, Rani A.; Zhao, Jianxin (1999)
      Digital libraries must reach out to users from all walks of life, serving information needs at all levels. To do this, they must attain high standards of usability over an extremely broad audience. This paper details the ...
    • Using the Repository Explorer to Achieve OAI Protocol Compliance

      Suleman, Hussein (2001-06-24)
      The Open Archives Initiative (OAI) is dedicated to solving problems of digital library interoperability by defining simple protocols, most recently the Open Archives Initiative Protocol for Metadata Harvesting [2], which ...
    • The Variety of Ways in Which Instructors Implement a Modular Digital Library Curriculum 

      Wildemuth, Barbara M.; Pomerantz, Jeffrey P.; Oh, Sanghee; Yang, Seungwon; Fox, Edward A. (2009-05-14)
      This poster illustrates how information professionals can implement a modular digital library curriculum. It discusses instructors' perspectives on digital library syllabi, assignments, readings, and overall course content. ...
    • Web Archiving 

      Lee, Spencer; Kan'an, Tarek; Jiao, Jian (2009-10-09)
      This module covers the ideas, approaches, problems, and needs of web archiving to build a static, long-term collection of web pages.
    • Web Publishing 

      Karia, Pratik (2009-09-08)
      This module covers the general principles of web publishing and the various paradigms that can be used for storing and retrieving content within digital libraries. This module introduces various techniques to publish ...
    • Weights and Measures: An Axiomatic Model for Similarity Computations 

      France, Robert K. (1994)
      This paper proposes a formal model for similarity functions, first over arbitrary objects, then over sets and the sorts of weighted sets that are found in text retrieval systems. Using a handful of axioms and constraints, ...
    • Weka 

      Peddi, Bhanu; Xiong, Huijun; ElSherbiny, Noha (2010-12-10)
      This module stresses the methods of text classification used in information retrieval. We focus on the usage of Weka, a data mining toolkit, in data processing with three classification algorithms: Naive Bayes [1], k Nearest ...
    • What is a Successful Digital Library? 

      Shen, Rao; Vemuri, Naga S.; Fan, Weiguo; Fox, Edward A. (2006-09-18)
      We synthesize diverse research in the area of digital library (DL) quality models, information systems (IS) success and adoption models, and information-seeking behavior models, to present a more integrated view of the ...
    • When Stopping Rules Don't Stop 

      France, Robert K. (1995)
      Performing ranked retrieval on large document collections can be slow. The method of stopping rules has been proposed to make it more efficient. Stopping rules, which terminate search when the highest ranked documents have ...
    • Why Students Use Social Networking Sites After Crisis Situations 

      Sheetz, Steven D.; Fox, Edward A.; Fitzgerald, Andrew; Palmer, Sean; Shoemaker, Donald J.; Kavanaugh, Andrea L. (2011)
      Communities respond to tragedy by making virtuous use of social networking sites for a variety of purposes. We asked students to describe why they used a social networking site after the tragic shootings at Virginia Tech, ...
    • WordNet 

      Fouh, Eric; Poirel, Christopher (2010-10-25)
      This module covers the use of a thesaurus in several information retrieval (IR) techniques: index construction (e.g., tokenization, stemming, and lemmatization), robustness to query typographical errors (e.g., the use of ...
    • The World According to MARIAN: How the Document Universe is Represented and Searched in the MARIAN/Academy Digital Library Search System

      France, Robert K. (1999-09-27)
      This presentation focuses on MARIAN (Multiple Access Retrieval of library Information with ANnotations), an online library catalog information system. MARIAN is intended for library end-users rather than catalogers, provides ...
    • An XML Log Standard and Tool for Digital Library Logging Analysis 

      Goncalves, Marcos A.; Luo, Ming; Shen, Rao; Ali, Mir Farooq; Fox, Edward A. (2002-09)
      Log analysis can be a primary source of knowledge about how digital library patrons actually use DL systems and services and how systems behave while trying to support user information seeking activities. Log recording and ...