kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Deep learning-based approach for COVID-19 spread prediction
KTH, School of Architecture and the Built Environment (ABE), Urban Planning and Environment, Geoinformatics. Division of Geographic Information Sciences, Department of Mathematics and Informatics, Eduardo Mondlane University, Julius Nyerere street, 3453, Maputo, Mozambique.ORCID iD: 0000-0001-7218-9082
KTH, School of Architecture and the Built Environment (ABE), Urban Planning and Environment, Geoinformatics.ORCID iD: 0000-0003-1164-8403
2024 (English)In: International Journal of Data Science and Analytics, ISSN 2364-415X, p. 1-17Article in journal (Refereed) Epub ahead of print
Abstract [en]

Spread prediction models are vital tools to help health authorities and governments fight against infectious diseases such as COVID-19. The availability of historical daily COVID-19 cases, in conjunction with other datasets such as temperature and humidity (which are believed to play a key role in the spread of the disease), has opened a window for researchers to investigate the potential of different techniques to model and thereby expand our understanding of the factors (e.g., interaction or exposure resulting from mobility) that govern the underlying dynamics of the spread. Traditionally, infectious diseases are modeled using compartmental models such as the SIR model. However, this model shortcoming is that it does not account for mobility, and the resulting mixing or interactions, which we conjecture are a key factor in the dynamics of the spread. Statistical analysis and deep learning-based approaches such as autoregressive integrated moving average (ARIMA), gated recurrent units, variational autoencoder, long short-term memory (LSTM), convolution LSTM, stacked LSTM, and bidirectional LSTM have been tested with COVID-19 historical data to predict the disease spread mainly in medium- and high-income countries with good COVID-19 testing capabilities. However, few studies have focused on low-income countries with low access to COVID-19 testing and, hence, highly biased historical datasets. In addition to this, the arguable best model (BiLSTM) has not been tested with an arguably good set of features (people mobility data, temperature, and relative humidity). Therefore, in thisstudy, the multi-layer BiLSTM model is tested with mobility trend data from Google, temperature, and relative humidity to predict daily COVID-19 cases in low-income countries. The performance of the proposed multi-layer BiLSTM is evaluated by comparing its RMSE with the one from multi-layer LSTM (with the same settings as BiLSTM) in four developing countries namely Mozambique, Rwanda, Nepal, and Myanmar. The proposed multi-layer BiLSTM outperformed the multilayer LSTM in all four countries. The proposed multi-layer BiLSTM was also evaluated by comparing its root mean-squared error (RMSE) with multi-layer LSTM models, ARIMA- and stacked LSTM-based models in eight countries, namely Italy, Turkey, Australia, Brazil, Canada, Egypt, Japan, and the UK. Finally, the proposed multi-layer BiLSTM model was evaluated at the city level by comparing its average relative error with the other four models, namely the LSTM-based model considering multi-layer architecture, Google Cloud Forecasting, the LSTM-based model with mobility data only, and the LSTM-based model with mobility, temperature, and relative humidity data for 7 periods (of 28 days each) in six highly populated regions in Japan, namely Tokyo, Aichi, Osaka, Hyogo, Kyoto, and Fukuoka. The proposed multi-layer BiLSTM model outperformed the multi-layer LSTM model and other previous models by up to 1.6 and 0.6 times in terms of RMSE and ARE, respectively.Therefore, the proposed model enables more accurate forecasting of COVID-19 cases and can support governments and health authorities in their decisions, mainly in developing countries with limited resources.

Place, publisher, year, edition, pages
Springer Nature , 2024. p. 1-17
National Category
Computer Sciences
Research subject
Geodesy and Geoinformatics, Geoinformatics
Identifiers
URN: urn:nbn:se:kth:diva-355376DOI: 10.1007/s41060-024-00558-1ISI: 001242732100002Scopus ID: 2-s2.0-85195551094OAI: oai:DiVA.org:kth-355376DiVA, id: diva2:1908969
Note

QC 20241030

Available from: 2024-10-29 Created: 2024-10-29 Last updated: 2024-11-07Bibliographically approved

Open Access in DiVA

fulltext(1791 kB)42 downloads
File information
File name FULLTEXT01.pdfFile size 1791 kBChecksum SHA-512
002a57d57cd9b046bb6847725759ddbc756fbe74d3290b1ba4c948c1c95025c298867ddf9079c79360980ed5b021ccb4d8f5ab1c8ee7a683f9e95879905a733a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Cumbane, Silvino PedroGidofalvi, Gyözö

Search in DiVA

By author/editor
Cumbane, Silvino PedroGidofalvi, Gyözö
By organisation
Geoinformatics
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 42 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 287 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf