kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
SmartTBD: Smart Tracking for Resource-constrained Object Detection
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system.
Ericsson AB, Ericsson Res, Stockholm, Sweden.
Ericsson AB, Ericsson Res, Stockholm, Sweden.
KTH, Skolan för elektroteknik och datavetenskap (EECS), Intelligenta system, Robotik, perception och lärande, RPL.ORCID-id: 0000-0002-7189-1336
Visa övriga samt affilieringar
2025 (Engelska)Ingår i: ACM Transactions on Embedded Computing Systems, ISSN 1539-9087, E-ISSN 1558-3465, Vol. 24, nr 2, artikel-id 24Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

With the growing demand for video analysis on mobile devices, object tracking has demonstrated to be a suitable assistance to object detection under the Tracking-By-Detection (TBD) paradigm for reducing computational overhead and power demands. However, performing TBD with fixed hyper-parameters leads to computational inefficiency and ignores perceptual dynamics, as fixed setups tend to run suboptimally, given the variability of scenarios. In this article, we propose SmartTBD, a scheduling strategy for TBD based on multi-objective optimization of accuracy-latency metrics. SmartTBD is a novel deep reinforcement learning based scheduling architecture that computes appropriate TBD configurations in video sequences to improve the speed and detection accuracy. This involves a challenging optimization problem due to the intrinsic relation between the video characteristics and the TBD performance. Therefore, we leverage video characteristics, frame information, and the past TBD results to drive the optimization problem. Our approach surpasses baselines with fixed TBD configurations and recent research, achieving accuracy comparable to pure detection while significantly reducing latency. Moreover, it enables performance analysis of tracking and detection in diverse scenarios. The method is proven to be generalizable and highly practical in common video analytics datasets on resource-constrained devices.

Ort, förlag, år, upplaga, sidor
Association for Computing Machinery (ACM) , 2025. Vol. 24, nr 2, artikel-id 24
Nyckelord [en]
Mobile vision, tracking-by-detection, scheduling
Nationell ämneskategori
Telekommunikation
Identifikatorer
URN: urn:nbn:se:kth:diva-362957DOI: 10.1145/3703912ISI: 001454951000008Scopus ID: 2-s2.0-105003605284OAI: oai:DiVA.org:kth-362957DiVA, id: diva2:1956144
Anmärkning

QC 20250505

Tillgänglig från: 2025-05-05 Skapad: 2025-05-05 Senast uppdaterad: 2025-05-27Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Zhou, ShihangYin, WenjieBjörkman, Mårten

Sök vidare i DiVA

Av författaren/redaktören
Zhou, ShihangYin, WenjieBjörkman, Mårten
Av organisationen
Intelligenta systemRobotik, perception och lärande, RPL
I samma tidskrift
ACM Transactions on Embedded Computing Systems
Telekommunikation

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 80 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf