kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Conditional Driving from Natural Language Instructions
University of Washington.
NVIDIA.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL. University of Washington.ORCID iD: 0000-0002-1396-0102
Allen Institute for AI.
Show others and affiliations
2019 (English)In: Proceedings of the Conference on Robot Learning, CoRL 2019, ML Research Press , 2019, p. 540-551Conference paper, Published paper (Refereed)
Abstract [en]

Widespread adoption of self-driving cars will depend not only on their safety but largely on their ability to interact with human users. Just like human drivers, self-driving cars will be expected to understand and safely follow natural-language directions that suddenly alter the pre-planned route according to user’s preference or in presence of ambiguities, particularly in locations with poor or outdated map coverage. To this end, we propose a language-grounded driving agent implementing a hierarchical policy using recurrent layers and gated attention. The hierarchical approach enables us to reason both in terms of high-level language instructions describing long time horizons and low-level, complex, continuous state/action spaces required for real-time control of a self-driving car. We train our policy with conditional imitation learning from realistic language data collected from human drivers and navigators. Through quantitative and interactive experiments within the CARLA framework, we show that our model can successfully interpret language instructions and follow them safely, even when generalizing to previously unseen environments. Code and video are available at: https://sites.google.com/view/language-grounded-driving.

Place, publisher, year, edition, pages
ML Research Press , 2019. p. 540-551
Keywords [en]
autonomous vehicles, imitation learning, Language to control
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:kth:diva-350333Scopus ID: 2-s2.0-85103362097OAI: oai:DiVA.org:kth-350333DiVA, id: diva2:1883761
Conference
3rd Conference on Robot Learning, CoRL 2019, Osaka, Japan, Oct 30 2019 - Nov 1 2019
Note

QC 20240711

Available from: 2024-07-11 Created: 2024-07-11 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Pronobis, Andrzej

Search in DiVA

By author/editor
Pronobis, Andrzej
By organisation
Robotics, Perception and Learning, RPL
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 43 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf