OAI/ODL Component Composition Exercise
This exercise is a hands-on introduction to building digital libraries from components. It is a good introduction for individuals with modest Unix skills, and it reviews the process of using DLbox to create a working digital ...
This module addresses the use of metadata, specific metadata standards that may be used to describe digital objects, and the creation of metadata records.
This module covers the principles and application of the digitization process for digital libraries. Students will be able to explain the digitization process, understand the critical issues and challenges of digitization ...
Conceptual Frameworks, Models, Theories, and Definitions
This module introduces several conceptual models characterizing the digital library domain. Students will be provided with a high-level yet comprehensive knowledge of several conceptual frameworks and models, a unifying ...
Hadoop MapReduce is a software framework for writing applications that process large amounts of data in parallel on commodity hardware.
Apache Solr: Indexing and Searching
This module addresses the basic concepts of the open source Apache Solr platform that is specifically designed for indexing documents and executing searches.
Evaluation in Information Retrieval
This module addresses the methods used to evaluate an Information Retrieval system. We focus on evaluating a system using relevance and apply the knowledge by using TREC_EVAL.
File Formats, Transformation, and Migration
This module covers the principles and applications of the transformation and migration processes for the preservation of digital content, as well as key issues surrounding digital preservation strategies.
Text Clustering Using LucidWorks and Apache Mahout
This module introduces algorithms and evaluation metrics for flat clustering. We focus on the usage of LucidWorks big data analysis software and Apache Mahout, an open source machine learning library in clustering of ...
Pure Data Module
This is the manual for the Pure Data (Pd) Module. Within this directory you will find the source for Pd and pd-l2ork (source and precompiled binary), several Pd tutorials and the patches that accompany them, as well as ...