kth.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent systems, Robotics, Perception and Learning, RPL.ORCID iD: 0000-0002-5750-9655
2020 (English)In: 2020 ieee winter conference on applications of computer vision (wacv), IEEE COMPUTER SOC , 2020, p. 604-613Conference paper, Published paper (Refereed)
Abstract [en]

We present a method for weakly-supervised action localization based on graph convolutions. In order to find and classify video time segments that correspond to relevant action classes, a system must be able to both identify discriminative time segments in each video, and identify the full extent of each action. Achieving this with weak video level labels requires the system to use similarity and dissimilarity between moments across videos in the training data to understand both how an action appears, as well as the subactions that comprise the action's full extent. However, current methods do not make explicit use of similarity between video moments to inform the localization and classification predictions. We present a novel method that uses graph convolutions to explicitly model similarity between video moments. Our method utilizes similarity graphs that encode appearance and motion, and pushes the state of the art on THUMOS'14, ActivityNet 1.2, and Charades for weakly-supervised action localization.

Place, publisher, year, edition, pages
IEEE COMPUTER SOC , 2020. p. 604-613
Series
IEEE Winter Conference on Applications of Computer Vision, ISSN 2472-6737
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:kth:diva-286168DOI: 10.1109/WACV45572.2020.9093404ISI: 000578444800063Scopus ID: 2-s2.0-85085498656OAI: oai:DiVA.org:kth-286168DiVA, id: diva2:1525155
Conference
IEEE Winter Conference on Applications of Computer Vision (WACV), MAR 01-05, 2020, Snowmass, CO
Note

QC 20210203

Available from: 2021-02-03 Created: 2021-02-03 Last updated: 2023-03-30Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Kjellström, Hedvig

Search in DiVA

By author/editor
Kjellström, Hedvig
By organisation
Robotics, Perception and Learning, RPL
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 27 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf