SmartTBD: Smart Tracking for Resource-constrained Object Detection
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent Systems.
Ericsson AB, Ericsson Res, Stockholm, Sweden.
Ericsson AB, Ericsson Res, Stockholm, Sweden.
KTH, School of Electrical Engineering and Computer Science (EECS), Intelligent Systems, Robotics, Perception and Learning, RPL. ORCID iD: 0000-0002-7189-1336
2025 (English). In: ACM Transactions on Embedded Computing Systems, ISSN 1539-9087, E-ISSN 1558-3465, Vol. 24, no. 2, article id 24. Article in journal (Refereed). Published.
Abstract [en]

With the growing demand for video analysis on mobile devices, object tracking has proven to be a suitable aid to object detection under the Tracking-By-Detection (TBD) paradigm, reducing computational overhead and power demands. However, performing TBD with fixed hyper-parameters leads to computational inefficiency and ignores perceptual dynamics, as fixed setups tend to run suboptimally given the variability of scenarios. In this article, we propose SmartTBD, a scheduling strategy for TBD based on multi-objective optimization of accuracy-latency metrics. SmartTBD is a novel deep-reinforcement-learning-based scheduling architecture that computes appropriate TBD configurations in video sequences to improve speed and detection accuracy. This involves a challenging optimization problem due to the intrinsic relation between the video characteristics and the TBD performance. Therefore, we leverage video characteristics, frame information, and past TBD results to drive the optimization. Our approach surpasses baselines with fixed TBD configurations and recent research, achieving accuracy comparable to pure detection while significantly reducing latency. Moreover, it enables performance analysis of tracking and detection in diverse scenarios. The method is shown to be generalizable and highly practical on common video analytics datasets on resource-constrained devices.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2025. Vol. 24, no. 2, article id 24
Keywords [en]
Mobile vision, tracking-by-detection, scheduling
Identifiers
URN: urn:nbn:se:kth:diva-362957
DOI: 10.1145/3703912
ISI: 001454951000008
Scopus ID: 2-s2.0-105003605284
OAI: oai:DiVA.org:kth-362957
DiVA, id: diva2:1956144
Note

QC 20250505

Available from: 2025-05-05. Created: 2025-05-05. Last updated: 2025-05-27. Bibliographically approved.

Open Access in DiVA

No full text in DiVA


Authority records

Zhou, Shihang; Yin, Wenjie; Björkman, Mårten
