Characterization and Optimization of the Fitting of Quantum Correlation Functions
| dc.contributor.author | Chuang, Pi-Yueh | en |
| dc.contributor.author | Shah, Niteya | en |
| dc.contributor.author | Barry, Patrick | en |
| dc.contributor.author | Cloet, Ian | en |
| dc.contributor.author | Constantinescu, Emil M. | en |
| dc.contributor.author | Sato, Nobuo | en |
| dc.contributor.author | Qiu, Jian-Wei | en |
| dc.contributor.author | Feng, Wu-chun | en |
| dc.date.accessioned | 2026-03-13T14:47:28Z | en |
| dc.date.available | 2026-03-13T14:47:28Z | en |
| dc.date.issued | 2024-09 | en |
| dc.description.abstract | This case study presents a characterization and optimization of an application code for extracting parton distribution functions from high energy electron-proton scattering data. Profiling this application code reveals that the phase-space density computation accounts for 93% of the overall execution time for a single iteration on a single core. When executing multiple iterations in parallel on a multicore system, the application spends 78% of its overall execution time idling due to load imbalance. We address these issues by first transforming the application code from Python to C++ and then tackling the application load imbalance via a hybrid scheduling strategy that combines dynamic and static scheduling. These techniques result in a 62% reduction in CPU idle time and a 2.46x speedup in overall execution time per node. In addition, the typically enabled power-management mechanisms in supercomputers (e.g., AMD Turbo Core, Intel Turbo Boost, and RAPL) can significantly impact intra-node scalability when more than 50% of the CPU cores are used. This finding underscores the importance of understanding system interactions with power management, as they can adversely impact application performance, and highlights the necessity of intra-node scaling tests to identify performance degradation that inter-node scaling tests might otherwise overlook. | en |
| dc.description.version | Submitted version | en |
| dc.format.extent | 8 page(s) | en |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.doi | https://doi.org/10.1109/HPEC62836.2024.10938443 | en |
| dc.identifier.isbn | 979-8-3503-8714-8 | en |
| dc.identifier.issn | 2377-6943 | en |
| dc.identifier.orcid | Feng, Wu-Chun [0000-0002-6015-0727] | en |
| dc.identifier.uri | https://hdl.handle.net/10919/142240 | en |
| dc.language.iso | en | en |
| dc.publisher | IEEE | en |
| dc.rights | In Copyright | en |
| dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
| dc.subject | C++ | en |
| dc.subject | Python | en |
| dc.subject | parallelization | en |
| dc.subject | profiling | en |
| dc.subject | characterization | en |
| dc.subject | optimization | en |
| dc.subject | performance | en |
| dc.subject | power management | en |
| dc.subject | scalability | en |
| dc.subject | systems | en |
| dc.subject | deep inelastic scattering | en |
| dc.subject | quantum physics | en |
| dc.title | Characterization and Optimization of the Fitting of Quantum Correlation Functions | en |
| dc.title.serial | 2024 IEEE High Performance Extreme Computing Conference (HPEC) | en |
| dc.type | Conference proceeding | en |
| dc.type.dcmitype | Text | en |
| dc.type.other | Proceedings Paper | en |
| dc.type.other | Book in series | en |
| pubs.finish-date | 2024-09-27 | en |
| pubs.organisational-group | Virginia Tech | en |
| pubs.organisational-group | Virginia Tech/Engineering | en |
| pubs.organisational-group | Virginia Tech/Engineering/Computer Science | en |
| pubs.organisational-group | Virginia Tech/Faculty of Health Sciences | en |
| pubs.organisational-group | Virginia Tech/All T&R Faculty | en |
| pubs.organisational-group | Virginia Tech/Engineering/COE T&R Faculty | en |
| pubs.start-date | 2024-09-23 | en |