kth.sePublications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 463) Show all publications
Jansson, N., Karp, M., Wahlgren, J., Markidis, S. & Schlatter, P. (2025). Design of Neko—A Scalable High‐Fidelity Simulation Framework With Extensive Accelerator Support. Concurrency and Computation, 37(2), Article ID e8340.
Open this publication in new window or tab >>Design of Neko—A Scalable High‐Fidelity Simulation Framework With Extensive Accelerator Support
Show others...
2025 (English)In: Concurrency and Computation, ISSN 1532-0626, E-ISSN 1532-0634, Vol. 37, no 2, article id e8340Article in journal (Refereed) Published
Abstract [en]

Recent trends and advancements in including more diverse and heterogeneous hardware in High-Performance Computing (HPC) are challenging scientific software developers in their pursuit of efficient numerical methods with sustained performance across a diverse set of platforms. As a result, researchers are today forced to re-factor their codes to leverage these powerful new heterogeneous systems. We present our design considerations of Neko—a portable framework for high-fidelity spectral element flow simulations. Unlike prior works, Neko adopts a modern object-oriented Fortran 2008 approach, allowing multi-tier abstractions of the solver stack and facilitating various hardware backends ranging from general-purpose processors, accelerators down to exotic vector processors and Field-Programmable Gate Arrays (FPGAs). Focusing on the performance and portability of Neko, we describe the framework's device abstraction layer managing device memory, data transfer and kernel launches from Fortran, allowing for a solver written in a hardware-neutral yet performant way. Accelerator-specific optimizations are also discussed, with auto-tuning of key kernels and various communication strategies using device-aware MPI. Finally, we present performance measurements on a wide range of computing platforms, including the EuroHPC pre-exascale system LUMI, where Neko achieves excellent parallel efficiency for a large direct numerical simulation (DNS) of turbulent fluid flow using up to 80% of the entire LUMI supercomputer.

Place, publisher, year, edition, pages
Wiley, 2025
National Category
Computational Mathematics Computer Sciences
Identifiers
urn:nbn:se:kth:diva-358042 (URN)10.1002/cpe.8340 (DOI)001387473600001 ()2-s2.0-85213688601 (Scopus ID)
Funder
Swedish Research Council, 2019‐04723Swedish e‐Science Research Center, SESSIEU, Horizon Europe, 101093393
Note

QC 20250122

Available from: 2025-01-03 Created: 2025-01-03 Last updated: 2025-01-22Bibliographically approved
Ju, Y., Huber, D., Perez Martinez, A., Ulbl, P., Markidis, S., Schlatter, P., . . . Laure, E. (2025). Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions. In: Blaas-Schenner, C Niethammer, C Haas, T (Ed.), Recent advances in the message passing interface, EUROMPI 2024: . Paper presented at 31st European MPI Users' Group Meeting (EuroMPI), September 25-27, 2024, Pawsey Supercomput Res Centre, Perth, Australia (pp. 105-120). Springer Nature
Open this publication in new window or tab >>Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions
Show others...
2025 (English)In: Recent advances in the message passing interface, EUROMPI 2024 / [ed] Blaas-Schenner, C Niethammer, C Haas, T, Springer Nature , 2025, p. 105-120Conference paper, Published paper (Refereed)
Abstract [en]

The computational power of High-Performance Computing (HPC) systems increases continuously and rapidly. Data-intensive applications are designed to leverage the high computational capacity of HPC resources and typically generate a large amount of data for traditional post-processing data analytics. However, the HPC systems' in-/output (IO) subsystem develops relatively slowly, and the storage capacity is limited. This could lead to limited actual performance and scientific discovery. In-situ techniques are a partial remedy to these problems by reducing or avoiding the data flow through the IO subsystem to/from the storage. However, in current practice, asynchronous in-situ techniques with static resource management often allocate separate computing resources for executing in-situ task(s), which remain idle if no in-situ work is at hand. In the present work, we target improving the efficiency of computing resource usage by launching and releasing necessary additional computing resources for in-situ task(s). Our approach is based on extensions for MPI Sessions that enable the required dynamic resource management. In this paper, we propose a basic and an advanced in-situ techniques with dynamic resource management enabled by MPI Sessions, their implementations on two real-world use cases, and a critical analysis of the experimental results.

Place, publisher, year, edition, pages
Springer Nature, 2025
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 15267
Keywords
In-situ, HPC, Dynamic resource management, MPI Session
National Category
Computer Sciences
Identifiers
urn:nbn:se:kth:diva-357272 (URN)10.1007/978-3-031-73370-3_7 (DOI)001329986700007 ()2-s2.0-85206070581 (Scopus ID)
Conference
31st European MPI Users' Group Meeting (EuroMPI), September 25-27, 2024, Pawsey Supercomput Res Centre, Perth, Australia
Note

Part of ISBN 978-3-031-73369-7, 978-3-031-73370-3

QC 20241206

Available from: 2024-12-06 Created: 2024-12-06 Last updated: 2024-12-06Bibliographically approved
Mallor, F., Sanmiguel Vila, C., Hajipour, M., Vinuesa, R., Schlatter, P. & Örlü, R. (2025). Experimental characterization of turbulent boundary layers around a NACA 4412 wing profile. Experimental Thermal and Fluid Science, 160, Article ID 111327.
Open this publication in new window or tab >>Experimental characterization of turbulent boundary layers around a NACA 4412 wing profile
Show others...
2025 (English)In: Experimental Thermal and Fluid Science, ISSN 0894-1777, E-ISSN 1879-2286, Vol. 160, article id 111327Article in journal (Refereed) Published
Abstract [en]

An experimental characterization of the turbulent boundary layers developing around a NACA 4412 wing profile is carried out in the Minimum Turbulence Level (MTL) wind tunnel located at KTH Royal Institute of Technology. The campaign included collecting wall-pressure data via built-in pressure taps, capturing velocity signals in the turbulent boundary layers (TBLs) using hot-wire anemometry (HWA), and conducting direct skin-friction measurements with oil-film interferometry (OFI). The research spanned two chord-based Reynolds numbers (Rec=4×105 and 106) and four angles of attack (5°, 8°, 11° and 14°), encompassing a broad spectrum of flow conditions, from mild to strong adverse-pressure gradients (APGs), including scenarios where the TBL detaches from the wing surface. This dataset offers crucial insights into TBL behavior under varied flow conditions, particularly in the context of APGs. Key features include the quasi-independence of the pressure coefficient distributions from Reynolds number, which aids in distinguishing Reynolds-number effects from those due to APG strengths. The study also reveals changes in TBL dynamics as separation approaches, with energy shifting from the inner to the outer region and the eventual transition to a free-shear flow state post-separation. Additionally, the diagnostic scaling in the outer region under spatial-resolution effects is considered, showing further evidence for its applicability for small L+, however with inconsistent results for larger L+. The findings and database resulting from this campaign may be of special relevance for the development and validation of turbulence models, especially in the context of aeronautical applications.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
Adverse-pressure gradient, Hot-wire anemometry, Turbulence scaling, Turbulent boundary layer, Wind-tunnel experiment
National Category
Fluid Mechanics Vehicle and Aerospace Engineering
Identifiers
urn:nbn:se:kth:diva-354903 (URN)10.1016/j.expthermflusci.2024.111327 (DOI)001333952600001 ()2-s2.0-85205566751 (Scopus ID)
Note

QC 20241030

Available from: 2024-10-16 Created: 2024-10-16 Last updated: 2025-02-14Bibliographically approved
Guastoni, L., Geetha Balasubramanian, A., Foroozan, F., Güemes, A., Ianiro, A., Discetti, S., . . . Vinuesa, R. (2025). Fully convolutional networks for velocity-field predictions based on the wall heat flux in turbulent boundary layers. Theoretical and Computational Fluid Dynamics, 39(1), Article ID 13.
Open this publication in new window or tab >>Fully convolutional networks for velocity-field predictions based on the wall heat flux in turbulent boundary layers
Show others...
2025 (English)In: Theoretical and Computational Fluid Dynamics, ISSN 0935-4964, E-ISSN 1432-2250, Vol. 39, no 1, article id 13Article in journal (Refereed) Published
Abstract [en]

Fully-convolutional neural networks (FCN) were proven to be effective for predicting the instantaneous state of a fully-developed turbulent flow at different wall-normal locations using quantities measured at the wall. In Guastoni et al. (J Fluid Mech 928:A27, 2021. https://doi.org/10.1017/jfm.2021.812), we focused on wall-shear-stress distributions as input, which are difficult to measure in experiments. In order to overcome this limitation, we introduce a model that can take as input the heat-flux field at the wall from a passive scalar. Four different Prandtl numbers Pr=ν/α=(1,2,4,6) are considered (where ν is the kinematic viscosity and α is the thermal diffusivity of the scalar quantity). A turbulent boundary layer is simulated since accurate heat-flux measurements can be performed in experimental settings: first we train the network on aptly-modified DNS data and then we fine-tune it on the experimental data. Finally, we test our network on experimental data sampled in a water tunnel. These predictions represent the first application of transfer learning on experimental data of neural networks trained on simulations. This paves the way for the implementation of a non-intrusive sensing approach for the flow in practical applications.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Machine learning, Turbulence simulation, Turbulent boundary layers
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-358176 (URN)10.1007/s00162-024-00732-y (DOI)001378464000001 ()2-s2.0-85212435435 (Scopus ID)
Note

Not duplicate with DiVA 1756843

QC 20250114

Available from: 2025-01-07 Created: 2025-01-07 Last updated: 2025-02-09Bibliographically approved
Rezaeiravesh, S., Gscheidle, C., Peplinski, A., Garcke, J. & Schlatter, P. (2025). In-situ estimation of time-averaging uncertainties in turbulent flow simulations. Computer Methods in Applied Mechanics and Engineering, 433, Article ID 117511.
Open this publication in new window or tab >>In-situ estimation of time-averaging uncertainties in turbulent flow simulations
Show others...
2025 (English)In: Computer Methods in Applied Mechanics and Engineering, ISSN 0045-7825, E-ISSN 1879-2138, Vol. 433, article id 117511Article in journal (Refereed) Published
Abstract [en]

The statistics obtained from turbulent flow simulations are generally uncertain due to finite time averaging. Most techniques available in the literature to accurately estimate these uncertainties typically only work in an offline mode, that is, they require access to all available samples of a time series at once. In addition to the impossibility of online monitoring of uncertainties during the course of simulations, such an offline approach can lead to input/output (I/O) deficiencies and large storage/memory requirements, which can be problematic for large-scale simulations of turbulent flows. Here, we designed, implemented and tested a framework for estimating time-averaging uncertainties in turbulence statistics in an in-situ (online/streaming/updating) manner. The proposed algorithm relies on a novel low-memory update formula for computing the sample-estimated autocorrelation functions (ACFs). Based on this, smooth modeled ACFs of turbulence quantities can be generated to accurately estimate the time-averaging uncertainties in the corresponding sample mean estimators. The resulting uncertainty estimates are highly robust, accurate, and quantitatively the same as those obtained by standard offline estimators. Moreover, the computational overhead added by the in-situ algorithm is found to be negligible allowing for online estimation of uncertainties for multiple points and quantities. The framework is general and can be used with any flow solver and also integrated into the simulations over conformal and complex meshes created by adopting adaptive mesh refinement techniques. The results of the study are encouraging for the further development of the in-situ framework for other uncertainty quantification and data-driven analyses relevant not only to large-scale turbulent flow simulations, but also to the simulation of other dynamical systems leading to time-varying quantities with autocorrelated samples.

Place, publisher, year, edition, pages
Elsevier BV, 2025
Keywords
Autocorrelation, In-situ estimation, Time-averaging uncertainty, Turbulent flows, Uncertainty quantification
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-356691 (URN)10.1016/j.cma.2024.117511 (DOI)001356362600001 ()2-s2.0-85208533004 (Scopus ID)
Note

QC 20241122

Available from: 2024-11-20 Created: 2024-11-20 Last updated: 2025-02-09Bibliographically approved
Massaro, D., Peplinski, A., Stanly, R., Mirzareza, S., Lupi, V., Xiang, Y. & Schlatter, P. (2024). A comprehensive framework to enhance numerical simulations in the spectral-element code Nek5000. Computer Physics Communications, 302, Article ID 109249.
Open this publication in new window or tab >>A comprehensive framework to enhance numerical simulations in the spectral-element code Nek5000
Show others...
2024 (English)In: Computer Physics Communications, ISSN 0010-4655, E-ISSN 1879-2944, Vol. 302, article id 109249Article in journal (Refereed) Published
Abstract [en]

A framework is presented for the spectral-element code Nek5000, which has been, and still is, widely used in the computational fluid dynamics (CFD) community to perform high-fidelity numerical simulations of transitional and high Reynolds number flows. Despite the widespread usage, there is a deficiency in having a comprehensive set of tools specifically designed for conducting simulations using Nek5000. To address this issue, we have created a unique framework that allows, inter alia, to perform stability analysis and compute statistics of a turbulent flow. The framework encapsulates modules that provide tools, run-time parameters and memory structures, defining interfaces and performing different tasks. First, the framework architecture is described, showing its non-intrusive approach. Then, the modules are presented, explaining the main tools that have been implemented and describing some of the test cases. The code is open-source and available online, with proper documentation, to-run instructions and related examples.

Place, publisher, year, edition, pages
Elsevier BV, 2024
Keywords
Computational fluid dynamics, Numerical toolbox, Stability analysis, Statistical analysis
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-347059 (URN)10.1016/j.cpc.2024.109249 (DOI)001244454300001 ()2-s2.0-85193603654 (Scopus ID)
Note

QC 20240702

Available from: 2024-05-30 Created: 2024-05-30 Last updated: 2025-02-09Bibliographically approved
Xavier, D., Rezaeiravesh, S. & Schlatter, P. (2024). Autoregressive models for quantification of time-averaging uncertainties in turbulent flows. Physics of fluids, 36(10), Article ID 105122.
Open this publication in new window or tab >>Autoregressive models for quantification of time-averaging uncertainties in turbulent flows
2024 (English)In: Physics of fluids, ISSN 1070-6631, E-ISSN 1089-7666, Vol. 36, no 10, article id 105122Article in journal (Refereed) Published
Abstract [en]

Autoregressive models (ARMs) can be powerful tools for quantifying uncertainty in the time averages of turbulent flow quantities. This is because ARMs are efficient estimators of the autocorrelation function (ACF) of statistically stationary turbulence processes. In this study, we demonstrate a method for order selection of ARMs that uses the integral timescale of turbulence. A crucial insight into the operating principles of the ARM in terms of the time span covered by the product of model order and spacing between samples is provided, which enables us to develop computationally efficient implementations of ARM-based uncertainty estimators. This approach facilitates the quantification of uncertainty in downsampled time series and on a series of autocorrelated batch means with minimal loss of accuracy. Furthermore, a method for estimating uncertainties in second-order moments using first-order uncertainties is discussed. These techniques are applied to the time series data of turbulent flow a) through a plane channel and b) over periodic hills. Additionally, we illustrate the potential of ARMs in generating synthetic turbulence time series. Our study presents autoregressive models as intuitive and powerful tools for turbulent flows, paving the way for further applications in the field.

Place, publisher, year, edition, pages
AIP Publishing, 2024
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-355158 (URN)10.1063/5.0211541 (DOI)001328568900031 ()2-s2.0-85205964233 (Scopus ID)
Note

QC 20241024

Available from: 2024-10-24 Created: 2024-10-24 Last updated: 2025-02-09Bibliographically approved
Mallor, F., Semprini-Cesari, G., Mukha, T., Rezaeiravesh, S. & Schlatter, P. (2024). Bayesian Optimization of Wall-Normal Blowing and Suction-Based Flow Control of a NACA 4412 Wing Profile. Flow Turbulence and Combustion, 113(1), 93-118
Open this publication in new window or tab >>Bayesian Optimization of Wall-Normal Blowing and Suction-Based Flow Control of a NACA 4412 Wing Profile
Show others...
2024 (English)In: Flow Turbulence and Combustion, ISSN 1386-6184, E-ISSN 1573-1987, Vol. 113, no 1, p. 93-118Article in journal (Refereed) Published
Abstract [en]

Active flow-control techniques have shown promise for achieving high levels of drag reduction. However, these techniques are often complex and involve multiple tunable parameters, making it challenging to optimize their efficiency. Here, we present a Bayesian optimization (BO) approach based on Gaussian process regression to optimize a wall-normal blowing and suction control scheme for a NACA 4412 wing profile at two angles of attack: 5 and 11∘, corresponding to cruise and high-lift scenarios, respectively. An automated framework is developed by linking the BO code to the CFD solver OpenFOAM. RANS simulations (validated against high-fidelity LES and experimental data) are used in order to evaluate the different flow cases. BO is shown to provide rapid convergence towards a global maximum, even when the complexity of the response function is increased by introducing a model for the cost of the flow control actuation. The importance of considering the actuation cost is highlighted: while some cases yield a net drag reduction (NDR), they may result in an overall power increase. Furthermore, optimizing for NDR or net power reduction (NPR) can lead to significantly different actuation strategies. Finally, by considering losses and efficiencies representative of real-world applications, still a significant NPR is achieved in the 11∘ case, while net power reduction is only marginally positive in the 5∘ case.

Place, publisher, year, edition, pages
Springer Nature, 2024
Keywords
Flow control, Bayesian optimization, Gaussian process regression, drag reduction, turbulence
National Category
Fluid Mechanics
Research subject
Aerospace Engineering; Engineering Mechanics
Identifiers
urn:nbn:se:kth:diva-342442 (URN)10.1007/s10494-023-00475-6 (DOI)001058987600003 ()2-s2.0-85169823248 (Scopus ID)
Note

QC 20240123

Available from: 2024-01-19 Created: 2024-01-19 Last updated: 2025-03-24Bibliographically approved
Mukha, T., Brethouwer, G. & Schlatter, P. (2024). Boundary Conditions for Wall-Modelled Large-Eddy Simulation Using Spectral Element Discretization. In: ERCOFTAC Series: (pp. 215-220). Springer Science and Business Media B.V., 31
Open this publication in new window or tab >>Boundary Conditions for Wall-Modelled Large-Eddy Simulation Using Spectral Element Discretization
2024 (English)In: ERCOFTAC Series, Springer Science and Business Media B.V. , 2024, Vol. 31, p. 215-220Chapter in book (Other academic)
Abstract [en]

Complementing large-eddy simulation (LES) with wall-modelling is, perhaps, the most straight-forward way to enable high-fidelity simulations at high Reynolds numbers. At the same time, high-order methods offer the benefits of high computational efficiency and potentially faster convergence with respect to mesh refinement even outside the asymptotic regime.

Place, publisher, year, edition, pages
Springer Science and Business Media B.V., 2024
National Category
Computational Mathematics Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-340784 (URN)10.1007/978-3-031-47028-8_33 (DOI)2-s2.0-85178120212 (Scopus ID)
Note

QC 20231214

Available from: 2023-12-14 Created: 2023-12-14 Last updated: 2025-02-09Bibliographically approved
Massaro, D., Karp, M., Jansson, N., Markidis, S. & Schlatter, P. (2024). Direct numerical simulation of the turbulent flow around a Flettner rotor. Scientific Reports, 14(1), Article ID 3004.
Open this publication in new window or tab >>Direct numerical simulation of the turbulent flow around a Flettner rotor
Show others...
2024 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 14, no 1, article id 3004Article in journal (Refereed) Published
Abstract [en]

The three-dimensional turbulent flow around a Flettner rotor, i.e. an engine-driven rotating cylinder in an atmospheric boundary layer, is studied via direct numerical simulations (DNS) for three different rotation speeds (α). This technology offers a sustainable alternative mainly for marine propulsion, underscoring the critical importance of comprehending the characteristics of such flow. In this study, we evaluate the aerodynamic loads produced by the rotor of height h, with a specific focus on the changes in lift and drag force along the vertical axis of the cylinder. Correspondingly, we observe that vortex shedding is inhibited at the highest α values investigated. However, in the case of intermediate α, vortices continue to be shed in the upper section of the cylinder (y/h>0.3). As the cylinder begins to rotate, a large-scale motion becomes apparent on the high-pressure side, close to the bottom wall. We offer both a qualitative and quantitative description of this motion, outlining its impact on the wake deflection. This finding is significant as it influences the rotor wake to an extent of approximately one hundred diameters downstream. In practical applications, this phenomenon could influence the performance of subsequent boats and have an impact on the cylinder drag, affecting its fuel consumption. This fundamental study, which investigates a limited yet significant (for DNS) Reynolds number and explores various spinning ratios, provides valuable insights into the complex flow around a Flettner rotor. The simulations were performed using a modern GPU-based spectral element method, leveraging the power of modern supercomputers towards fundamental engineering problems.

Place, publisher, year, edition, pages
Springer Nature, 2024
National Category
Fluid Mechanics
Identifiers
urn:nbn:se:kth:diva-344051 (URN)10.1038/s41598-024-53194-x (DOI)38321050 (PubMedID)2-s2.0-85184207516 (Scopus ID)
Funder
KTH Royal Institute of TechnologyKTH Royal Institute of Technology
Note

QC 20240301

Available from: 2024-02-29 Created: 2024-02-29 Last updated: 2025-02-09Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-9627-5903

Search in DiVA

Show all publications