kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
PedAST-GCN: Fast Pedestrian Crossing Intention Prediction Using Spatial-Temporal Attention Graph Convolution Networks
South China University of Technology, School of Civil Engineering Transportation, Guangzhou, China.ORCID iD: 0000-0003-4289-2388
KTH, School of Architecture and the Built Environment (ABE), Civil and Architectural Engineering, Transport planning.ORCID iD: 0000-0002-2141-0389
KTH, School of Architecture and the Built Environment (ABE), Civil and Architectural Engineering, Transport planning.ORCID iD: 0000-0001-9990-4269
South China University of Technology, School of Civil Engineering Transportation, Guangzhou, China.ORCID iD: 0000-0002-4500-8941
Show others and affiliations
2024 (English)In: IEEE Transactions on Intelligent Transportation Systems, ISSN 1524-9050, E-ISSN 1558-0016, Vol. 25, no 10, p. 13277-13290Article in journal (Refereed) Published
Abstract [en]

Accurately and timely predicting pedestrian crossing intentions in real-time is critical for operating intelligent vehicles on roads. Although existing models achieve promising accuracy using complex models and video image data, they are constrained for real-time practical use given the high model complexity, time-consuming data preprocessing, and low-quality image data in the wild. To address these, the paper proposes a Spatial-Temporal Attention Graph Convolution Network model for fast pedestrian crossing intention prediction (PedAST-GCN). It uses a lightweight GCN model as the backbone network with simple but robust graph representations of pedestrian crossing intention modality features, including pedestrian pose, bounding box, and vehicle speeds. The model is validated by comparing it with state-of-the-art models on two large-scale public datasets (JAAD and PIE). The results highlight the better performance of the PedAST-GCN model for pedestrian crossing intention prediction in terms of accuracy and computation times. The ablation analysis confirms the value of the backbone layer and graph design, the designed modality features, the effectiveness of attention mechanisms in capturing long-term dependencies (spatial-temporal attention) and fusing heterogeneous features (modality attention), and the robust performance across various observation lengths and in the presence of noisy data.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. Vol. 25, no 10, p. 13277-13290
Keywords [en]
graph convolution networks, modality features, Pedestrian crossing intention prediction, video image data
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-367407DOI: 10.1109/TITS.2024.3398252ISI: 001230785100001Scopus ID: 2-s2.0-85194066812OAI: oai:DiVA.org:kth-367407DiVA, id: diva2:1984789
Note

QC 20250717

Available from: 2025-07-17 Created: 2025-07-17 Last updated: 2025-08-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ma, ZhenliangZhang, Qi

Search in DiVA

By author/editor
Ling, YanchengMa, ZhenliangZhang, QiXie, Bangquan
By organisation
Transport planning
In the same journal
IEEE Transactions on Intelligent Transportation Systems
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 64 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf