Recent Submissions

  • Virginia Tech Data Landscape and Environmental Assessment: Technical Briefing on Data Preservation and Repository System 

    Shen, Yi (2015)
    A faculty-wide data environmental scan and landscape study at Virginia Tech was conducted in 2015 and concluded with 652 responses received from Teaching & Research Faculty and Research Faculty in 8 different colleges. The ...
  • Virginia Tech Data Landscape and Environmental Assessment: Technical Briefing on Data Curation 

    Shen, Yi (2015)
    A Virginia Tech Research Data Assessment and Landscape Study was conducted in 2015 to take stock of the data assets being created and held within the institution and to examine data sharing practices and expectations of ...
  • How we went from worst practices to good practices, and became happier in the process 

    French, Amanda; Kayiwa, Francis; Lawrence, Anne; Gilbertson, Keith; Lohrey, Melissa (2016-04-25)
    Our application team was struggling. We had good people and the desire to create good software, but the library as an organization did not yet have experience with software development processes. Work halted. Team members ...
  • Button Gwinnett Signatures: A Census 

    Speer, Ryan (Manuscript Society, 2008)
  • Nearline Web Archiving 

    Xie, Zhiwu; Nayyar, Krati; Fox, Edward A. (2016-06-23)
    In this paper, we propose a modified approach to real­time transactional web archiving. It leverages the web caching infrastructure that is already prevalent on web servers. Instead of archiving web content at HTTP transaction ...
  • Evaluating Cost of Cloud Execution in a Data Repository 

    Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler (ACM, 2016-06)
    In this paper, we utilize a set of controlled experiments to benchmark the cost associated with the cloud execution of typical repository functions such as ingestion, fixity checking, and heavy data processing. We focus ...
  • On-Demand Big Data Analysis in Digital Repositories 

    Xie, Zhiwu; Chen, Yinlin; Jiang, Tingting; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A.; Kasarda, Mary (Springer International Publishing, 2015-12-18)
    We describe a use and reuse driven digital repository integrated with lightweight data analysis capabilities provided by the Docker framework. Using building sensor data collected from the Virginia Tech Goodwin Hall Living ...
  • A UWS Case for 200-Style Memento Negotiations 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (IEEE Technical Committee on Digital Libraries, 2015-10)
    Uninterruptible web service (UWS) is a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. ...
  • Web Archiving Inconsistency: A Research Agenda 

    Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (IEEE Technical Committee on Digital Libraries, 2015-10)
    Scaling web applications usually boils down to a tradeoff between consistency and latency. Very large web operations typically favor low latency, hence purposefully sacrifice strict consistency in the sense of serializability. ...
  • Web Archiving and Digital Libraries 2015 (WADL 2015) Overview 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (IEEE Technical Committee on Digital Libraries, 2015-10)
    Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important ...
  • Web Archiving and Digital Libraries (WADL) 

    Fox, Edward A.; Xie, Zhiwu (ACM, 2015-06-25)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • WADL 2016: Third International Workshop on Web Archiving and Digital Libraries 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (2016-06)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • VTechData: An Institutional Data Repository 

    Xie, Zhiwu; Speer, Julie; Chen, Yinlin; Jiang, Tingting; Brittle, Collin; Mather, Paul (2016-06-14)
    We introduce VTechData, a Sufia/Fedora based institutional repository specifically implemented to meet the needs of research data management at Virginia Tech. Despite the rapid maturity of Hydra and Fedora code bases, the ...
  • Are Repositories Impeding Big Data Reuse? 

    Xie, Zhiwu; Galad, Andrej; Chen, Yinlin; Fox, Edward A. (Virginia Tech, 2016-06-14)
    In this intentionally provocative presentation, we question the scalability of popular digital repositories and whether they are suitable for big data reuse. Are the layers of API these repositories have painted over file ...
  • Clustering 

    Xie, Zhiwu (2015-06-11)
    This presentation is part of a panel presentation at Open Repository 2015, Fedora Technical Working Group - Assessment of Fedora 4.
  • Big Data Processing in the Cloud: a Hydra/Sufia Experience 

    Brittle, Collin; Xie, Zhiwu (2014-06-10)
    Presentation video available at https://connectpro.helsinki.fi/p1txjdy74ts/ This presentation addresses the challenge of processing big data in a cloud-based data repository. Using the Hydra Project’s Hydra and Sufia ...
  • Using Transactional Web Archives To Handle Server Errors 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (2015-06)
    We describe a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. It leverages the transactional ...
  • FishTraits version 2: integrating ecological, biogeographic and bibliographic information 

    Xie, Zhiwu; Frimpong, Emmanual A.; Lee, Sunshin (ACM, 2013-07-22)
    In this paper we describe the new development of FishTraits. Originating from an ecological database that documents and consolidates more than 100 traits for 809 fish species, the new version focuses on the integration of ...
  • The Insitutional Repository's Role in Preserving Research Data 

    Xie, Zhiwu; McMillan, Gail; Walter, Tyler (Virginia Tech, 2012-07-25)
    In recent years, many funding agencies have started to require long-term preservation and open access to research data. While most research universities have already run their own institutional repositories (IR), it's not ...

View more