Now showing items 5-22 of 22

    • DLA: Who We Are and What We Do 

      McMillan, Gail; Gilbertson, Keith; Hall, Nathan; Lawrence, Anne; Weeks, Kimberli; Wills, Jane; Xie, Zhiwu (2012-05-24)
    • Evaluating Cost of Cloud Execution in a Data Repository 

      Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler (ACM, 2016-06)
      In this paper, we utilize a set of controlled experiments to benchmark the cost associated with the cloud execution of typical repository functions such as ingestion, fixity checking, and heavy data processing. We focus ...
    • Facilitate Cross-Repository Big Data Discovery and Reuse 

      Xie, Zhiwu (Virginia Tech, 2013-03-13)
      Researchers have accumulated large amount of observational, experimental, and simulation data. Much effort has been made to collect, curate, preserve, and provide open access to them, but putting the data online is only ...
    • FishTraits version 2: integrating ecological, biogeographic and bibliographic information 

      Xie, Zhiwu; Frimpong, Emmanual A.; Lee, Sunshin (ACM, 2013-07-22)
      In this paper we describe the new development of FishTraits. Originating from an ecological database that documents and consolidates more than 100 traits for 809 fish species, the new version focuses on the integration of ...
    • Improving scalability by self-archiving 

      Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (ACM, 2011-06-13)
      The newer generation of web browsers supports the client-side database, making it possible to run the full web application stacks entirely in the web clients. Still, the server side database is indispensable as the central ...
    • The Insitutional Repository's Role in Preserving Research Data 

      Xie, Zhiwu; McMillan, Gail; Walter, Tyler (Virginia Tech, 2012-07-25)
      In recent years, many funding agencies have started to require long-term preservation and open access to research data. While most research universities have already run their own institutional repositories (IR), it's not ...
    • Nearline Web Archiving 

      Xie, Zhiwu; Nayyar, Krati; Fox, Edward A. (2016-06-23)
      In this paper, we propose a modified approach to real­time transactional web archiving. It leverages the web caching infrastructure that is already prevalent on web servers. Instead of archiving web content at HTTP transaction ...
    • Newman Library Pecha Kucha: Digital Library and Archives 

      Hall, Nathan; Lawrence, Anne; Xie, Zhiwu (2012-05-24)
    • On-Demand Big Data Analysis in Digital Repositories 

      Xie, Zhiwu; Chen, Yinlin; Jiang, Tingting; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A.; Kasarda, Mary (Springer International Publishing, 2015-12-18)
      We describe a use and reuse driven digital repository integrated with lightweight data analysis capabilities provided by the Docker framework. Using building sensor data collected from the Virginia Tech Goodwin Hall Living ...
    • Poor Man's Social Network: Consistently Trade Freshness For Scalability 

      Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (USENIX Association, 2012-06)
      Typical social networking functionalities such as feed following are known to be hard to scale. Different from the popular approach that sacrifices consistency for scalability, in this paper we describe, implement, and ...
    • Towards Use And Reuse Driven Big Data Management 

      Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A; Kasarda, Mary (2015-06-03)
      We propose a use and reuse driven big data management approach that fuses the data repository and data processing capabilities in a co-located, public cloud. It answers to the urgent data management needs from the growing ...
    • Using Transactional Web Archives To Handle Server Errors 

      Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (2015-06)
      We describe a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. It leverages the transactional ...
    • A UWS Case for 200-Style Memento Negotiations 

      Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (IEEE Technical Committee on Digital Libraries, 2015-10)
      Uninterruptible web service (UWS) is a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. ...
    • VTechData: An Institutional Data Repository 

      Xie, Zhiwu; Speer, Julie; Chen, Yinlin; Jiang, Tingting; Brittle, Collin; Mather, Paul (2016-06-14)
      We introduce VTechData, a Sufia/Fedora based institutional repository specifically implemented to meet the needs of research data management at Virginia Tech. Despite the rapid maturity of Hydra and Fedora code bases, the ...
    • WADL 2016: Third International Workshop on Web Archiving and Digital Libraries 

      Fox, Edward A.; Xie, Zhiwu; Klein, Martin (2016-06)
      This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
    • Web Archiving and Digital Libraries (WADL) 

      Fox, Edward A.; Xie, Zhiwu (ACM, 2015-06-25)
      This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
    • Web Archiving and Digital Libraries 2015 (WADL 2015) Overview 

      Fox, Edward A.; Xie, Zhiwu; Klein, Martin (IEEE Technical Committee on Digital Libraries, 2015-10)
      Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important ...
    • Web Archiving Inconsistency: A Research Agenda 

      Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (IEEE Technical Committee on Digital Libraries, 2015-10)
      Scaling web applications usually boils down to a tradeoff between consistency and latency. Very large web operations typically favor low latency, hence purposefully sacrifice strict consistency in the sense of serializability. ...