kth.sePublications
Change search
Link to record
Permanent link

Direct link
Perez Martinez, AdalbertoORCID iD iconorcid.org/0000-0001-5204-8549
Publications (7 of 7) Show all publications
Ju, Y., Huber, D., Perez Martinez, A., Ulbl, P., Markidis, S., Schlatter, P., . . . Laure, E. (2025). Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions. In: Blaas-Schenner, C Niethammer, C Haas, T (Ed.), Recent advances in the message passing interface, EUROMPI 2024: . Paper presented at 31st European MPI Users' Group Meeting (EuroMPI), September 25-27, 2024, Pawsey Supercomput Res Centre, Perth, Australia (pp. 105-120). Springer Nature
Open this publication in new window or tab >>Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions
Show others...
2025 (English)In: Recent advances in the message passing interface, EUROMPI 2024 / [ed] Blaas-Schenner, C Niethammer, C Haas, T, Springer Nature , 2025, p. 105-120Conference paper, Published paper (Refereed)
Abstract [en]

The computational power of High-Performance Computing (HPC) systems increases continuously and rapidly. Data-intensive applications are designed to leverage the high computational capacity of HPC resources and typically generate a large amount of data for traditional post-processing data analytics. However, the HPC systems' in-/output (IO) subsystem develops relatively slowly, and the storage capacity is limited. This could lead to limited actual performance and scientific discovery. In-situ techniques are a partial remedy to these problems by reducing or avoiding the data flow through the IO subsystem to/from the storage. However, in current practice, asynchronous in-situ techniques with static resource management often allocate separate computing resources for executing in-situ task(s), which remain idle if no in-situ work is at hand. In the present work, we target improving the efficiency of computing resource usage by launching and releasing necessary additional computing resources for in-situ task(s). Our approach is based on extensions for MPI Sessions that enable the required dynamic resource management. In this paper, we propose a basic and an advanced in-situ techniques with dynamic resource management enabled by MPI Sessions, their implementations on two real-world use cases, and a critical analysis of the experimental results.

Place, publisher, year, edition, pages
Springer Nature, 2025
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 15267
Keywords
In-situ, HPC, Dynamic resource management, MPI Session
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-357272 (URN)10.1007/978-3-031-73370-3_7 (DOI)001329986700007 ()2-s2.0-85206070581 (Scopus ID)
Conference
31st European MPI Users' Group Meeting (EuroMPI), September 25-27, 2024, Pawsey Supercomput Res Centre, Perth, Australia
Note

Part of ISBN 978-3-031-73369-7, 978-3-031-73370-3

QC 20241206

Available from: 2024-12-06 Created: 2024-12-06 Last updated: 2024-12-06Bibliographically approved
Stanly, R., Du, S., Xavier, D., Perez Martinez, A., Mukha, T., Markidis, S., . . . Schlatter, P. (2024). Generating synthetic turbulence with vector autoregression of proper orthogonal decomposition time coefficients. Journal of Fluid Mechanics, 1000, Article ID A83.
Open this publication in new window or tab >>Generating synthetic turbulence with vector autoregression of proper orthogonal decomposition time coefficients
Show others...
2024 (English)In: Journal of Fluid Mechanics, ISSN 0022-1120, E-ISSN 1469-7645, Vol. 1000, article id A83Article in journal (Refereed) Published
Abstract [en]

This study introduces vector autoregression (VAR) as a linear procedure that can be used for synthesizing turbulence time series over an entire plane, allowing them to be imposed as an efficient turbulent inflow condition in simulations requiring stationary and cross-correlated turbulence time series. VAR is a statistical tool for modelling and prediction of multivariate time series through capturing linear correlations between multiple time series. A Fourier-based proper orthogonal decomposition (POD) is performed on the two-dimensional (2-D) velocity slices from a precursor simulation of a turbulent boundary layer at a momentum thickness-based Reynolds number, Re-theta=790. A subset of the most energetic structures in space are then extracted, followed by applying a VAR model to their complex time coefficients. It is observed that VAR models constructed using time coefficients of 5 and 30 most energetic POD modes per wavenumber (corresponding to 66% and 97% of turbulent kinetic energy, respectively) are able to make accurate predictions of the evolution of the velocity field at Re-theta=790 for infinite time. Moreover, the 2-D velocity fields from the POD-VAR when used as a turbulent inflow condition, gave a short development distance when compared with other common inflow methods. Since the VAR model can produce an infinite number of velocity planes in time, this enables reaching statistical stationarity without having to run an extremely long precursor simulation or applying ad hoc methods such as periodic time series.

Place, publisher, year, edition, pages
Cambridge University Press (CUP), 2024
Keywords
turbulent boundary layers
National Category
Probability Theory and Statistics Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-357749 (URN)10.1017/jfm.2024.1034 (DOI)001368616600001 ()2-s2.0-85205947695 (Scopus ID)
Note

Not duplicate with DiVA 1833117

QC 20241216

Available from: 2024-12-16 Created: 2024-12-16 Last updated: 2025-02-05Bibliographically approved
Jansson, N., Karp, M., Perez, A., Mukha, T., Ju, Y., Liu, J., . . . Markidis, S. (2023). Exploring the Ultimate Regime of Turbulent Rayleigh–Bénard Convection Through Unprecedented Spectral-Element Simulations. In: SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis: . Paper presented at SC: The International Conference for High Performance Computing, Networking, Storage, and Analysis, NOV 12–17 DENVER, CO, USA (pp. 1-9). Association for Computing Machinery (ACM), Article ID 5.
Open this publication in new window or tab >>Exploring the Ultimate Regime of Turbulent Rayleigh–Bénard Convection Through Unprecedented Spectral-Element Simulations
Show others...
2023 (English)In: SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, Association for Computing Machinery (ACM) , 2023, p. 1-9, article id 5Conference paper, Published paper (Refereed)
Abstract [en]

We detail our developments in the high-fidelity spectral-element code Neko that are essential for unprecedented large-scale direct numerical simulations of fully developed turbulence. Major inno- vations are modular multi-backend design enabling performance portability across a wide range of GPUs and CPUs, a GPU-optimized preconditioner with task overlapping for the pressure-Poisson equation and in-situ data compression. We carry out initial runs of Rayleigh–Bénard Convection (RBC) at extreme scale on the LUMI and Leonardo supercomputers. We show how Neko is able to strongly scale to 16,384 GPUs and obtain results that are not pos- sible without careful consideration and optimization of the entire simulation workflow. These developments in Neko will help resolv- ing the long-standing question regarding the ultimate regime in RBC. 

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
National Category
Computer Sciences Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-340333 (URN)10.1145/3581784.3627039 (DOI)2-s2.0-85179549233 (Scopus ID)
Conference
SC: The International Conference for High Performance Computing, Networking, Storage, and Analysis, NOV 12–17 DENVER, CO, USA
Funder
Swedish Research Council, 2019-04723Swedish e‐Science Research CenterEU, Horizon 2020, 101093393, 101092621, 956748
Note

Part of ISBN 9798400701092

QC 20231204

Available from: 2023-12-04 Created: 2023-12-04 Last updated: 2025-02-09Bibliographically approved
Ju, Y., Li, M., Perez, A., Bellentani, L., Jansson, N., Markidis, S., . . . Laure, E. (2023). In-Situ Techniques on GPU-Accelerated Data-Intensive Applications. In: Proceedings 2023 IEEE 19th International Conference on e-Science, e-Science 2023: . Paper presented at 19th IEEE International Conference on e-Science, e-Science 2023, Limassol, Cyprus, Oct 9 2023 - Oct 14 2023. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>In-Situ Techniques on GPU-Accelerated Data-Intensive Applications
Show others...
2023 (English)In: Proceedings 2023 IEEE 19th International Conference on e-Science, e-Science 2023, Institute of Electrical and Electronics Engineers (IEEE) , 2023Conference paper, Published paper (Refereed)
Abstract [en]

The computational power of High-Performance Computing (HPC) systems is constantly increasing, however, their input/output (IO) performance grows relatively slowly, and their storage capacity is also limited. This unbalance presents significant challenges for applications such as Molecular Dynamics (MD) and Computational Fluid Dynamics (CFD), which generate massive amounts of data for further visualization or analysis. At the same time, checkpointing is crucial for long runs on HPC clusters, due to limited walltimes and/or failures of system components, and typically requires the storage of large amount of data. Thus, restricted IO performance and storage capacity can lead to bottlenecks for the performance of full application workflows (as compared to computational kernels without IO). In-situ techniques, where data is further processed while still in memory rather to write it out over the I/O subsystem, can help to tackle these problems. In contrast to traditional post-processing methods, in-situ techniques can reduce or avoid the need to write or read data via the IO subsystem. They offer a promising approach for applications aiming to leverage the full power of large scale HPC systems. In-situ techniques can also be applied to hybrid computational nodes on HPC systems consisting of graphics processing units (GPUs) and central processing units (CPUs). On one node, the GPUs would have significant performance advantages over the CPUs. Therefore, current approaches for GPU-accelerated applications often focus on maximizing GPU usage, leaving CPUs underutilized. In-situ tasks using CPUs to perform data analysis or preprocess data concurrently to the running simulation, offer a possibility to improve this underutilization.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023
Keywords
CPU, GPU, HPC, in-situ
National Category
Computer Sciences Computer Systems
Identifiers
urn:nbn:se:kth:diva-338984 (URN)10.1109/e-Science58273.2023.10254865 (DOI)2-s2.0-85174292669 (Scopus ID)
Conference
19th IEEE International Conference on e-Science, e-Science 2023, Limassol, Cyprus, Oct 9 2023 - Oct 14 2023
Note

Part of ISBN 9798350322231

QC 20231101

Available from: 2023-11-01 Created: 2023-11-01 Last updated: 2023-12-04Bibliographically approved
Perez Martinez, A., Örlü, R., Talamelli, A. & Schlatter, P. (2022). Appraisal of cavity hot-wire probes for wall-shear-stress measurements. Experiments in Fluids, 63(9), Article ID 151.
Open this publication in new window or tab >>Appraisal of cavity hot-wire probes for wall-shear-stress measurements
2022 (English)In: Experiments in Fluids, ISSN 0723-4864, E-ISSN 1432-1114, Vol. 63, no 9, article id 151Article in journal (Refereed) Published
Abstract [en]

Flush-mounted cavity hot-wire probes have emerged as an alternative to classical hot-wire probes mounted several diameters above the surface for wall-shear stress measurements. They aim at increasing the frequency response and accuracy by circumventing the well-known issue of heat transfer to the substrate that hot-wire and hot-film probes possess. Their use, however, depends on the assumption that the cavity does not influence the flow field. In this study, we show that this assumption does not hold, and that turbulence statistics are modified by the presence of the cavity with sizes that are practically in use. The mean velocity and fluctuations increase near the cavity while the shear stress decreases in its surroundings, all seemingly stemming from the fact that the no-slip condition is not present anymore and that flow reversal occurs. Overall, the energy spectra and the probability density function of the wall shear stress fluctuations indicate a change of nature of turbulence by the presence of the cavity.

Place, publisher, year, edition, pages
Springer Nature, 2022
National Category
Other Engineering and Technologies
Identifiers
urn:nbn:se:kth:diva-319078 (URN)10.1007/s00348-022-03498-3 (DOI)000853851500001 ()2-s2.0-85139264668 (Scopus ID)
Note

QC 20220926

Available from: 2022-09-26 Created: 2022-09-26 Last updated: 2023-05-29Bibliographically approved
Ju, Y., Perez Martinez, A., Markidis, S., Schlatter, P. & Laure, E. (2022). Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications. In: 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022): . Paper presented at IEEE 18th International Conference on E-Science (E-Science), OCT 10-14, 2022, Salt Lake City, UT (pp. 295-305). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications
Show others...
2022 (English)In: 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022), Institute of Electrical and Electronics Engineers (IEEE) , 2022, p. 295-305Conference paper, Published paper (Refereed)
Abstract [en]

High-Performance Computing (HPC) systems provide input/output (IO) performance growing relatively slowly compared to peak computational performance and have limited storage capacity. Computational Fluid Dynamics (CFD) applications aiming to leverage the full power of Exascale HPC systems, such as the solver Nek5000, will generate massive data for further processing. These data need to be efficiently stored via the IO subsystem. However, limited IO performance and storage capacity may result in performance, and thus scientific discovery, bottlenecks. In comparison to traditional post-processing methods, in-situ techniques can reduce or avoid writing and reading the data through the IO subsystem, promising to be a solution to these problems. In this paper, we study the performance and resource usage of three in-situ use cases: data compression, image generation, and uncertainty quantification. We furthermore analyze three approaches when these in-situ tasks and the simulation are executed synchronously, asynchronously, or in a hybrid manner. In-situ compression can be used to reduce the IO time and storage requirements while maintaining data accuracy. Furthermore, in-situ visualization and analysis can save Terabytes of data from being routed through the IO subsystem to storage. However, the overall efficiency is crucially dependent on the characteristics of both, the in-situ task and the simulation. In some cases, the overhead introduced by the in-situ tasks can be substantial. Therefore, it is essential to choose the proper in-situ approach, synchronous, asynchronous, or hybrid, to minimize overhead and maximize the benefits of concurrent execution.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2022
Series
Proceeding IEEE International Conference on e-Science (e-Science), ISSN 2325-372X
Keywords
CFD, in-situ, HPC
National Category
Fluid Mechanics Computer Sciences
Identifiers
urn:nbn:se:kth:diva-324634 (URN)10.1109/eScience55777.2022.00043 (DOI)000927625900029 ()2-s2.0-85145434110 (Scopus ID)
Conference
IEEE 18th International Conference on E-Science (E-Science), OCT 10-14, 2022, Salt Lake City, UT
Note

Part of proceedings: ISBN 978-1-6654-6124-5

QC 20230309

Available from: 2023-03-09 Created: 2023-03-09 Last updated: 2025-02-09Bibliographically approved
Stanly, R., Du, S., Xavier, D., Perez Martinez, A., Mukha, T., Markidis, S., . . . Schlatter, P.Generating synthetic turbulence with vector autoregression of proper orthogonal decomposition time coefficients.
Open this publication in new window or tab >>Generating synthetic turbulence with vector autoregression of proper orthogonal decomposition time coefficients
Show others...
(English)Manuscript (preprint) (Other academic)
Abstract [en]

This study introduces vector autoregression (VAR) as a linear procedure that can be used for synthetizing turbulence time series over an entire plane, allowing them to be imposed as efficient turbulent inflow conditions in simulations requiring stationary and cross-correlated turbulence time series. A VAR model is applied to the complex time coefficients derived from a Fourier-based proper orthogonal decomposition (POD) of the velocity fields of the precursor simulation of a turbulent boundary layer at a momentum thickness based Reynolds number, Re_theta=790. VAR is a statistical tool for modelling and prediction of multivariate time series through capturing linear correlations between multiple time series. By performing POD, firstly a subset of the most energetic structures in space are extracted, and then a VAR model is fitted to their time coefficients. It is observed that VAR models constructed using time coefficients of 5 and 30 most energetic POD modes per wave number (corresponding to >40% and >90% of turbulent kinetic energy across all wave numbers, respectively), are able to make accurate predictions of the evolution of the velocity field at Re_theta=790 for infinite time. Moreover, the two-dimensional velocity fields from the low-order POD-VAR are used as a turbulent inflow condition and compared against other common inflow methods. Since the VAR model can produce an infinite number of velocity planes in time, this enables reaching statistical stationarity without having to run an extremely long precursor simulation or applying ad-hoc methods such as periodic time series. 

Keywords
vector autoregression, turbulent boundary layer, proper orthogonal decomposition, crosscorrelation, ordinary least squares, power spectrum, simulations
National Category
Fluid Mechanics
Research subject
Engineering Mechanics
Identifiers
urn:nbn:se:kth:diva-342784 (URN)
Note

QC 20240201

Available from: 2024-01-31 Created: 2024-01-31 Last updated: 2025-02-09Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-5204-8549

Search in DiVA

Show all publications