Zhiwu Xie

Associate Professor and Technology Development Librarian
Center for Digital Research and Scholarship
University Libraries
Virginia Tech
Blacksburg, VA 24061

Office: 5092 Newman Library
Phone: 540-231-4453
Email: zhiwuxie@vt.edu
http://scholar.lib.vt.edu/staff/zxie/

Recent Submissions

  • WADL 2016: Third International Workshop on Web Archiving and Digital Libraries 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (2016-06)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • VTechData: An Institutional Data Repository 

    Xie, Zhiwu; Speer, Julie; Chen, Yinlin; Jiang, Tingting; Brittle, Collin; Mather, Paul (2016-06-14)
    We introduce VTechData, a Sufia/Fedora based institutional repository specifically implemented to meet the needs of research data management at Virginia Tech. Despite the rapid maturity of Hydra and Fedora code bases, the ...
  • Are Repositories Impeding Big Data Reuse? 

    Xie, Zhiwu; Galad, Andrej; Chen, Yinlin; Fox, Edward A. (Virginia Tech, 2016-06-14)
    In this intentionally provocative presentation, we question the scalability of popular digital repositories and whether they are suitable for big data reuse. Are the layers of API these repositories have painted over file ...
  • On-Demand Big Data Analysis in Digital Repositories 

    Xie, Zhiwu; Chen, Yinlin; Jiang, Tingting; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A.; Kasarda, Mary (Springer International Publishing, 2015-12-18)
    We describe a use and reuse driven digital repository integrated with lightweight data analysis capabilities provided by the Docker framework. Using building sensor data collected from the Virginia Tech Goodwin Hall Living ...
  • A UWS Case for 200-Style Memento Negotiations 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (IEEE Technical Committee on Digital Libraries, 2015-10)
    Uninterruptible web service (UWS) is a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. ...
  • Web Archiving Inconsistency: A Research Agenda 

    Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (IEEE Technical Committee on Digital Libraries, 2015-10)
    Scaling web applications usually boils down to a tradeoff between consistency and latency. Very large web operations typically favor low latency, hence purposefully sacrifice strict consistency in the sense of serializability. ...
  • Web Archiving and Digital Libraries 2015 (WADL 2015) Overview 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (IEEE Technical Committee on Digital Libraries, 2015-10)
    Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important ...
  • Web Archiving and Digital Libraries (WADL) 

    Fox, Edward A.; Xie, Zhiwu (ACM, 2015-06-25)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • Clustering 

    Xie, Zhiwu (2015-06-11)
    This presentation is part of a panel presentation at Open Repository 2015, Fedora Technical Working Group - Assessment of Fedora 4.
  • Big Data Processing in the Cloud: a Hydra/Sufia Experience 

    Brittle, Collin; Xie, Zhiwu (2014-06-10)
    Presentation video available at https://connectpro.helsinki.fi/p1txjdy74ts/ This presentation addresses the challenge of processing big data in a cloud-based data repository. Using the Hydra Project’s Hydra and Sufia ...
  • Using Transactional Web Archives To Handle Server Errors 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (2015-06)
    We describe a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. It leverages the transactional ...
  • FishTraits version 2: integrating ecological, biogeographic and bibliographic information 

    Xie, Zhiwu; Frimpong, Emmanual A.; Lee, Sunshin (ACM, 2013-07-22)
    In this paper we describe the new development of FishTraits. Originating from an ecological database that documents and consolidates more than 100 traits for 809 fish species, the new version focuses on the integration of ...
  • The Insitutional Repository's Role in Preserving Research Data 

    Xie, Zhiwu; McMillan, Gail; Walter, Tyler (Virginia Tech, 2012-07-25)
    In recent years, many funding agencies have started to require long-term preservation and open access to research data. While most research universities have already run their own institutional repositories (IR), it's not ...
  • Facilitate Cross-Repository Big Data Discovery and Reuse 

    Xie, Zhiwu (Virginia Tech, 2013-03-13)
    Researchers have accumulated large amount of observational, experimental, and simulation data. Much effort has been made to collect, curate, preserve, and provide open access to them, but putting the data online is only ...
  • Improving scalability by self-archiving 

    Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (ACM, 2011-06-13)
    The newer generation of web browsers supports the client-side database, making it possible to run the full web application stacks entirely in the web clients. Still, the server side database is indispensable as the central ...
  • Towards Use And Reuse Driven Big Data Management 

    Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A; Kasarda, Mary (2015-06-03)
    We propose a use and reuse driven big data management approach that fuses the data repository and data processing capabilities in a co-located, public cloud. It answers to the urgent data management needs from the growing ...
  • Archiving the Relaxed Consistency Web 

    Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (ACM, 2013)
    The historical, cultural, and intellectual importance of archiving the web has been widely recognized. Today, all countries with high Internet penetration rate have established high-profile archiving initiatives to crawl ...
  • Newman Library Pecha Kucha: Digital Library and Archives 

    Hall, Nathan; Lawrence, Anne; Xie, Zhiwu (2012-05-24)
  • DLA: Who We Are and What We Do 

    McMillan, Gail; Gilbertson, Keith; Hall, Nathan; Lawrence, Anne; Weeks, Kimberli; Wills, Jane; Xie, Zhiwu (2012-05-24)
  • Poor Man's Social Network: Consistently Trade Freshness For Scalability 

    Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (USENIX Association, 2012-06)
    Typical social networking functionalities such as feed following are known to be hard to scale. Different from the popular approach that sacrifices consistency for scalability, in this paper we describe, implement, and ...