kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
STMA-GCN_PedCross: Skeleton Based Spatial-Temporal Graph Convolution Networks with Multiple Attentions for Fast Pedestrian Crossing Intention Prediction
School of Civil and Transportation Engineering, South China University of Technology, China, 510000.
KTH, School of Architecture and the Built Environment (ABE), Civil and Architectural Engineering, Transport planning.ORCID iD: 0000-0001-9990-4269
School of Civil and Transportation Engineering, South China University of Technology, China, 510000.
KTH, School of Architecture and the Built Environment (ABE), Civil and Architectural Engineering, Transport planning.ORCID iD: 0000-0002-2141-0389
2023 (English)In: 2023 IEEE 26th International Conference on Intelligent Transportation Systems, ITSC 2023, Institute of Electrical and Electronics Engineers Inc. , 2023, p. 500-506Conference paper, Published paper (Refereed)
Abstract [en]

Pedestrian crossing intention prediction is an important task for intelligent vehicles to enhance safety and reduce the risk of accidents. The high prediction accuracy and fast execution speed are essential requirements for this task. Existing studies on pedestrian crossing intention prediction have achieved good performance by using complex models and multiple modalities of input data. However, these approaches are limited in practical implications due to their high computational complexity and resource requirements. To address these, the paper propose the Spatial-Temporal Graph Convolution Network (GCN) with multiple attentions for fast and robust Pedestrian Crossing Intention Prediction (STMAGCN_PedCross). It utilizes easily accessible yet robust keypoints as input for predicting pedestrian crossing intention, making it both accurate and practical. To evaluate the effectiveness of the proposed model, we compared it with state-of-the-art models on a large-scale public Joint Attention in Autonomous Driving (JAAD) dataset. The results demonstrate that the STMA-GCN_PedCross model achieves comparable accuracy perfmance to the state-of-the-art models while having a higher robustness (recall rate) and requiring significantly less execution time. The ablation analysis further confirms the effectiveness of attention mechanisms in capturing spatial-temporal features from the skeleton sequence data by giving different attentions. Moreover, the analysis also reveals the significance of different keypoints in the prediction results.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc. , 2023. p. 500-506
Series
IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC, ISSN 2153-0009
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-344366DOI: 10.1109/ITSC57777.2023.10421893ISI: 001178996700074Scopus ID: 2-s2.0-85186505882OAI: oai:DiVA.org:kth-344366DiVA, id: diva2:1844370
Conference
26th IEEE International Conference on Intelligent Transportation Systems, ITSC 2023, Bilbao, Spain, Sep 24 2023 - Sep 28 2023
Note

QC 20240314

Part of ISBN 979-835039946-2

Available from: 2024-03-13 Created: 2024-03-13 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Zhang, QiMa, Zhenliang

Search in DiVA

By author/editor
Zhang, QiMa, Zhenliang
By organisation
Transport planning
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 99 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf