One Shot Object Detection: For Tracking Purposes
2017 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
One of the things augmented reality depends on is object tracking, which is a problem classically found in cinematography and security. However, the algorithms designed for the classical application are often too expensive computationally or too complex to run on simpler mobile hardware. One of the methods to do object tracking is with a trained neural network, this has already led to great results but is unfortunately still running into some of the same problems as the classical algorithms. For this reason a neural network designed specifically for object tracking on mobile hardware needs to be developed. This thesis will propose two di erent neural networks designed for object tracking on mobile hardware. Both are based on a siamese network structure and methods to improve their accuracy using filtering are also introduced. The first network is a modified version of “CNN architecture for geometric matching” that utilizes an a ne regression to perform object tracking. This network was shown to underperform in the MOT benchmark as-well as the VOT benchmark and therefore not further developed. The second network is an object detector based on “SqueezeDet” in a siamese network structure utilizing the performance optimized layers of “MobileNets”. The accuracy of the object detector network is shown to be competitive in the VOT benchmark, placing at the 16th place compared to trackers from the 2016 challenge. It was also shown to run in real-time on mobile hardware. Thus the one shot object detection network used for a tracking application can improve the experience of augmented reality applications on mobile hardware.
Place, publisher, year, edition, pages
2017. , p. 66
Series
TRITA-ICT-EX ; 2017:141
Keywords [en]
Object tracking, Deep learning, Siamese neural network, Affine regression network, One shot learning, Object detector, PID controller
National Category
Computer Sciences Embedded Systems
Identifiers
URN: urn:nbn:se:kth:diva-217252OAI: oai:DiVA.org:kth-217252DiVA, id: diva2:1161376
External cooperation
Viorama GmBh.
Subject / course
Computer Science
Educational program
Master of Science - Embedded Systems
Presentation
2017-09-06, Alonzo, Isafjordsgatan 22, Kista, 10:00 (English)
Supervisors
Examiners
2017-12-102017-11-292022-06-26Bibliographically approved