Now showing items 11-22 of 22

    • Nearline Web Archiving 

      Xie, Zhiwu; Nayyar, Krati; Fox, Edward A. (2016-06-23)
      In this paper, we propose a modified approach to real­time transactional web archiving. It leverages the web caching infrastructure that is already prevalent on web servers. Instead of archiving web content at HTTP transaction ...
    • Newman Library Pecha Kucha: Digital Library and Archives 

      Hall, Nathan; Lawrence, Anne; Xie, Zhiwu (2012-05-24)
    • On-Demand Big Data Analysis in Digital Repositories 

      Xie, Zhiwu; Chen, Yinlin; Jiang, Tingting; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A.; Kasarda, Mary (Springer International Publishing, 2015-12-18)
      We describe a use and reuse driven digital repository integrated with lightweight data analysis capabilities provided by the Docker framework. Using building sensor data collected from the Virginia Tech Goodwin Hall Living ...
    • Poor Man's Social Network: Consistently Trade Freshness For Scalability 

      Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (USENIX Association, 2012-06)
      Typical social networking functionalities such as feed following are known to be hard to scale. Different from the popular approach that sacrifices consistency for scalability, in this paper we describe, implement, and ...
    • Towards Use And Reuse Driven Big Data Management 

      Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A; Kasarda, Mary (2015-06-03)
      We propose a use and reuse driven big data management approach that fuses the data repository and data processing capabilities in a co-located, public cloud. It answers to the urgent data management needs from the growing ...
    • Using Transactional Web Archives To Handle Server Errors 

      Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (2015-06)
      We describe a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. It leverages the transactional ...
    • A UWS Case for 200-Style Memento Negotiations 

      Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (IEEE Technical Committee on Digital Libraries, 2015-10)
      Uninterruptible web service (UWS) is a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. ...
    • VTechData: An Institutional Data Repository 

      Xie, Zhiwu; Speer, Julie; Chen, Yinlin; Jiang, Tingting; Brittle, Collin; Mather, Paul (2016-06-14)
      We introduce VTechData, a Sufia/Fedora based institutional repository specifically implemented to meet the needs of research data management at Virginia Tech. Despite the rapid maturity of Hydra and Fedora code bases, the ...
    • WADL 2016: Third International Workshop on Web Archiving and Digital Libraries 

      Fox, Edward A.; Xie, Zhiwu; Klein, Martin (2016-06)
      This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
    • Web Archiving and Digital Libraries (WADL) 

      Fox, Edward A.; Xie, Zhiwu (ACM, 2015-06-25)
      This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
    • Web Archiving and Digital Libraries 2015 (WADL 2015) Overview 

      Fox, Edward A.; Xie, Zhiwu; Klein, Martin (IEEE Technical Committee on Digital Libraries, 2015-10)
      Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important ...
    • Web Archiving Inconsistency: A Research Agenda 

      Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (IEEE Technical Committee on Digital Libraries, 2015-10)
      Scaling web applications usually boils down to a tradeoff between consistency and latency. Very large web operations typically favor low latency, hence purposefully sacrifice strict consistency in the sense of serializability. ...