Show simple item record

dc.contributor.authorYeom, Jae-seungen_US
dc.date.accessioned2015-11-22T07:00:46Z
dc.date.available2015-11-22T07:00:46Z
dc.date.issued2014-05-30en_US
dc.identifier.othervt_gsexam:2796en_US
dc.identifier.urihttp://hdl.handle.net/10919/64180
dc.description.abstractData-intensive scientific applications often process an enormous amount of data. The scalability of such applications depends critically on how to manage the locality of data. Our study explores two common types of applications that are vastly different in terms of memory access pattern and workload variation. One includes those with multi-stride accesses in regular nested parallel loops. The other is for processing large-scale irregular social network graphs. In the former case, the memory location or the data item accessed in a loop is predictable and the load on processing a unit work (an array element) is relatively uniform with no significant variation. On the other hand, in the latter case, the data access per unit work (a vertex) is highly irregular in terms of the number of accesses and the locations being accessed. This property is further tied to the load and presents significant challenges in the scalability of the application performance. Designing platforms to support extreme performance scaling requires understanding of how application specific information can be used to control the locality and improve the performance. Such insights are necessary to determine which control and which abstraction to provide for interfacing an underlying system and an application as well as for designing a new system. Our goal is to expose common requirements of data-intensive scientific applications for scalability. For the former type of applications, those with regular accesses and uniform workload, we contribute new methods to improve the temporal locality of software-managed local memories, and optimize the critical path of scheduling data transfers for multi-dimensional arrays in nested loops. In particular, we provide a runtime framework allowing transparent optimization by source-to-source compilers or automatic fine tuning by programmers. Finally, we demonstrate the effectiveness of the approach by comparing against a state-of-the-art language-based framework. For the latter type, those with irregular accesses and non-uniform workload, we analyze how the heavy-tailed property of input graphs limits the scalability of the application. Then, we introduce an application-specific workload model as well as a decomposition method that allows us to optimize locality with the custom load balancing constraints of the application. Finally, we demonstrate unprecedented strong scaling of a contagion simulation on two state-of-the-art high performance computing platforms.en_US
dc.format.mediumETDen_US
dc.publisherVirginia Techen_US
dc.rightsThis Item is protected by copyright and/or related rights. Some uses of this Item may be deemed fair and permitted by law even without permission from the rights holder(s), or the rights holder(s) may have licensed the work for use under certain conditions. For other uses you need to obtain permission from the rights holder(s).en_US
dc.subjectParallel systemsen_US
dc.subjectSoftware-managed memoriesen_US
dc.subjectDistributed memoriesen_US
dc.subjectData localityen_US
dc.subjectScalabilityen_US
dc.subjectParallel discrete event simulationen_US
dc.subjectSocial networksen_US
dc.subjectContagionen_US
dc.titleOptimizing Data Accesses for Scaling Data-intensive Scientific Applicationsen_US
dc.typeDissertationen_US
dc.contributor.departmentComputer Scienceen_US
dc.description.degreePh. D.en_US
thesis.degree.namePh. D.en_US
thesis.degree.leveldoctoralen_US
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen_US
thesis.degree.disciplineComputer Science and Applicationsen_US
dc.contributor.committeechairMarathe, Madhav Vishnuen_US
dc.contributor.committeechairNikolopoulos, Dimitrios S.en_US
dc.contributor.committeememberRibbens, Calvin J.en_US
dc.contributor.committeememberSchulz, Martinen_US
dc.contributor.committeememberBisset, Keith R.en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record