User-based I/O Profiling for Leadership Scale HPC Workloads

dc.contributor.authorYazdani, Ahmad Hosseinen
dc.contributor.authorPaul, Arnaben
dc.contributor.authorKarimi, Ahmaden
dc.contributor.authorWang, Feiyien
dc.contributor.authorButt, Alien
dc.date.accessioned2025-02-06T18:36:05Zen
dc.date.available2025-02-06T18:36:05Zen
dc.date.issued2025-01-04en
dc.date.updated2025-02-01T09:06:56Zen
dc.description.abstractI/O constitutes a significant portion of most of the application runtime. Spawning many such applications concurrently on an HPC system leads to severe I/O contention. Thus, understanding and subsequently reducing I/O contention induced by such multi-tenancy is critical for the efficient and reliable performance of the HPC system. In this study, we demonstrate that an application’s performance is influenced by the command line arguments passed to the job submission. We model an application’s I/O behavior based on two factors: past I/O behavior within a time window and userconfigured I/O settings via command-line arguments. We conclude that I/O patterns for well-known HPC applications like E3SM and LAMMP are predictable, with an average uncertainty below 0.25 (A probability of 80%) and near zero (A probability of 100%) within a day. However, I/O pattern variance increases as the study time window lengthens. Additionally, we show that for 38 users and at least 50 applications constituting approximately 93000 job submissions, there is a high correlation between a submitted command line and the past command lines made within 1 to 10 days submitted by the user. We claim the length of this time window is unique per user.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1145/3700838.3700865en
dc.identifier.urihttps://hdl.handle.net/10919/124520en
dc.language.isoenen
dc.publisherACMen
dc.rightsPublic Domain (U.S.)en
dc.rights.holderThe author(s)en
dc.rights.urihttp://creativecommons.org/publicdomain/mark/1.0/en
dc.titleUser-based I/O Profiling for Leadership Scale HPC Workloadsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3700838.3700865.pdf
Size:
877.78 KB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: