This community has collections of works organized by individual Virginia Tech faculty member names. Works in these collections are often also cross-listed in departmental collections.

Collections in this community

Recent Submissions

  • Nearline Web Archiving 

    Xie, Zhiwu; Nayyar, Krati; Fox, Edward A. (2016-06-23)
    In this paper, we propose a modified approach to real­time transactional web archiving. It leverages the web caching infrastructure that is already prevalent on web servers. Instead of archiving web content at HTTP transaction ...
  • Evaluating Cost of Cloud Execution in a Data Repository 

    Xie, Zhiwu; Chen, Yinlin; Speer, Julie; Walters, Tyler (ACM, 2016-06)
    In this paper, we utilize a set of controlled experiments to benchmark the cost associated with the cloud execution of typical repository functions such as ingestion, fixity checking, and heavy data processing. We focus ...
  • On-Demand Big Data Analysis in Digital Repositories 

    Xie, Zhiwu; Chen, Yinlin; Jiang, Tingting; Speer, Julie; Walters, Tyler; Tarazaga, Pablo A.; Kasarda, Mary (Springer International Publishing, 2015-12-18)
    We describe a use and reuse driven digital repository integrated with lightweight data analysis capabilities provided by the Docker framework. Using building sensor data collected from the Virginia Tech Goodwin Hall Living ...
  • A UWS Case for 200-Style Memento Negotiations 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (IEEE Technical Committee on Digital Libraries, 2015-10)
    Uninterruptible web service (UWS) is a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. ...
  • Web Archiving Inconsistency: A Research Agenda 

    Xie, Zhiwu; Van de Sompel, Herbert; Liu, Jinyang; van Reenen, Johann; Jordan, Ramiro (IEEE Technical Committee on Digital Libraries, 2015-10)
    Scaling web applications usually boils down to a tradeoff between consistency and latency. Very large web operations typically favor low latency, hence purposefully sacrifice strict consistency in the sense of serializability. ...
  • Web Archiving and Digital Libraries 2015 (WADL 2015) Overview 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (IEEE Technical Committee on Digital Libraries, 2015-10)
    Our understanding of the past will, to a large extent, depend on our success with Web archiving. WADL 2015 brought together international leaders from industry, government, and academia, who are tackling this important ...
  • Web Archiving and Digital Libraries (WADL) 

    Fox, Edward A.; Xie, Zhiwu (ACM, 2015-06-25)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • WADL 2016: Third International Workshop on Web Archiving and Digital Libraries 

    Fox, Edward A.; Xie, Zhiwu; Klein, Martin (2016-06)
    This workshop will explore integration of Web archiving and digital libraries, so the complete life cycle involved is covered: creation/authoring, uploading/publishing in the Web (2.0), (focused) crawling, indexing, ...
  • VTechData: An Institutional Data Repository 

    Xie, Zhiwu; Speer, Julie; Chen, Yinlin; Jiang, Tingting; Brittle, Collin; Mather, Paul (2016-06-14)
    We introduce VTechData, a Sufia/Fedora based institutional repository specifically implemented to meet the needs of research data management at Virginia Tech. Despite the rapid maturity of Hydra and Fedora code bases, the ...
  • Are Repositories Impeding Big Data Reuse? 

    Xie, Zhiwu; Galad, Andrej; Chen, Yinlin; Fox, Edward A. (Virginia Tech, 2016-06-14)
    In this intentionally provocative presentation, we question the scalability of popular digital repositories and whether they are suitable for big data reuse. Are the layers of API these repositories have painted over file ...
  • Clustering 

    Xie, Zhiwu (2015-06-11)
    This presentation is part of a panel presentation at Open Repository 2015, Fedora Technical Working Group - Assessment of Fedora 4.
  • Big Data Processing in the Cloud: a Hydra/Sufia Experience 

    Brittle, Collin; Xie, Zhiwu (2014-06-10)
    Presentation video available at https://connectpro.helsinki.fi/p1txjdy74ts/ This presentation addresses the challenge of processing big data in a cloud-based data repository. Using the Hydra Project’s Hydra and Sufia ...
  • Using Transactional Web Archives To Handle Server Errors 

    Xie, Zhiwu; Chandrasekar, Prashant; Fox, Edward A. (2015-06)
    We describe a web archiving application that handles server errors using the most recently archived representation of the requested web resource. The application is developed as an Apache module. It leverages the transactional ...
  • FishTraits version 2: integrating ecological, biogeographic and bibliographic information 

    Xie, Zhiwu; Frimpong, Emmanual A.; Lee, Sunshin (ACM, 2013-07-22)
    In this paper we describe the new development of FishTraits. Originating from an ecological database that documents and consolidates more than 100 traits for 809 fish species, the new version focuses on the integration of ...
  • The Insitutional Repository's Role in Preserving Research Data 

    Xie, Zhiwu; McMillan, Gail; Walter, Tyler (Virginia Tech, 2012-07-25)
    In recent years, many funding agencies have started to require long-term preservation and open access to research data. While most research universities have already run their own institutional repositories (IR), it's not ...
  • Facilitate Cross-Repository Big Data Discovery and Reuse 

    Xie, Zhiwu (Virginia Tech, 2013-03-13)
    Researchers have accumulated large amount of observational, experimental, and simulation data. Much effort has been made to collect, curate, preserve, and provide open access to them, but putting the data online is only ...
  • Improving scalability by self-archiving 

    Xie, Zhiwu; Liu, Jinyang; Van de Sompel, Herbert; van Reenen, Johann; Jordan, Ramiro (ACM, 2011-06-13)
    The newer generation of web browsers supports the client-side database, making it possible to run the full web application stacks entirely in the web clients. Still, the server side database is indispensable as the central ...
  • Mi-1.2, an R gene for aphid resistance in tomato, has direct negative effects on a zoophytophagous biocontrol agent, Orius insidiosus. 

    Pallipparambil, GR; Sayler, RJ; Shapiro, JP; Thomas, JM; Kring, TJ; Goggin, FL (2015-02)
    Mi-1.2 is a single dominant gene in tomato that confers race-specific resistance against certain phloem-feeding herbivores including aphids, whiteflies, psyllids, and root-knot nematodes. Few prior studies have considered ...
  • If I could turn back time: Looking back at 2+ years of DMP consulting at Virginia Tech. 

    Ogier, A (2016-05-05)
    Presented as part of the DMPs and Public Access: Agency and Data Service Experiences at RDAP 2016.
  • VIVO + SHARE: An Institutional Perspective 

    Ogier, AL (2016-03-11)
    A presentation given on March 11, 2016 as part of the DuraSpace Hot Topics Series.

View more