VTechWorks staff will be away for the Memorial Day holiday on Monday, May 27, and will not be replying to requests at that time. Thank you for your patience.

Show simple item record

dc.contributor.authorXu, Lunaen_US
dc.date.accessioned2019-02-06T09:00:43Z
dc.date.available2019-02-06T09:00:43Z
dc.date.issued2019-02-05
dc.identifier.othervt_gsexam:18051en_US
dc.identifier.urihttp://hdl.handle.net/10919/87469
dc.description.abstractThe big data era has driven the needs for data analysis in every aspect of our daily lives. With the rapid growth of data size and complexity of data analysis models, modern big data analytic applications face the challenge to provide timely results often with limited resources. Such demand drives the growth of new hardware resources including GPUs and FPGAs, as well as storage devices such as SSDs and NVMs. It is challenging to manage the resources available in a cost restricted environment to best serve the applications with different characteristics. Extant approaches are agnostic to such heterogeneity in both underlying resources and workloads and require user knowledge and manual configuration for best performance. In this dissertation, we design, and implement a series of novel techniques, algorithms, and frameworks, to realize workload-aware resource management and scheduling. We demonstrate our techniques for efficient resource management across memory resource for in-memory data analytic platforms, processing resources for compute-intensive machine learning applications, and finally we design and develop a workload and heterogeneity-aware scheduler for general big data platforms. The dissertation demonstrates that designing an effective resource manager requires efforts from both application and system side. The presented approach makes and joins the efforts on both sides to provide a holistic heterogeneity-aware resource manage and scheduling system. We are able to avoid task failure due to resource unavailability by workload-aware resource management, and improve the performance of data processing frameworks by carefully scheduling tasks according to the task characteristics and utilization and availability of the resources.en_US
dc.format.mediumETDen_US
dc.publisherVirginia Techen_US
dc.rightsThis item is protected by copyright and/or related rights. Some uses of this item may be deemed fair and permitted by law even without permission from the rights holder(s), or the rights holder(s) may have licensed the work for use under certain conditions. For other uses you need to obtain permission from the rights holder(s).en_US
dc.subjectresource managementen_US
dc.subjectbig dataen_US
dc.subjectschedulingen_US
dc.subjectheterogeneityen_US
dc.titleA Workload-aware Resource Management and Scheduling System for Big Data Analysisen_US
dc.typeDissertationen_US
dc.contributor.departmentComputer Scienceen_US
dc.description.degreePHDen_US
thesis.degree.namePHDen_US
thesis.degree.leveldoctoralen_US
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen_US
thesis.degree.disciplineComputer Science and Applicationsen_US
dc.contributor.committeechairButt, Alien_US
dc.contributor.committeememberRibbens, Calvin Jen_US
dc.contributor.committeememberCameron, Kirk Wen_US
dc.contributor.committeememberLee, Dongyoonen_US
dc.contributor.committeememberLi, Minen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record