Understanding the I/O Impact on the Performance of High-Throughput Molecular Docking
2021 (English)In: Proceedings of PDSW 2021: IEEE/ACM 6th International Parallel Data Systems Workshop, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis, Institute of Electrical and Electronics Engineers (IEEE) , 2021, p. 9-14Conference paper, Published paper (Refereed)
Abstract [en]
High-throughput molecular docking is a data-driven simulation methodology to estimate millions of molecules' position and interaction strength (ligands) when interacting with a given protein site. Because of its data-driven nature, the high-throughput molecular docking performance depends on how fast we can ingest data into the processing pipeline and how efficiently we can write molecular docking results to a shared file. This work characterizes the I/O performance of a high-performance, high-throughput molecular docking application, called Docker-HT, running on a supercomputer up to 512 computing nodes with two different parallel I/O configurations. We show that a tuned I/O configuration can improve the overall parallel efficiency from 71% to 90% on 512 nodes and identify and solve a performance degradation observed when running on 16 and 32 nodes.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2021. p. 9-14
Keywords [en]
High-Throughput Molecular Docking, Parallel I/O, Darshan, I/O Profiling
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-310260DOI: 10.1109/PDSW54622.2021.00007ISI: 000764093100002Scopus ID: 2-s2.0-85121020138OAI: oai:DiVA.org:kth-310260DiVA, id: diva2:1647601
Conference
IEEE/ACM 6th International Parallel Data Systems Workshop (PDSW), NOV 14-19, 2021, St Louis, MO
Note
QC 20220328
Part of proceedings: ISBN 978-1-6654-1837-9
2022-03-282022-03-282022-06-25Bibliographically approved