In this thesis we addressed the problem of visually tracking an object in a sequence of images. We described the discriminative correlation filter, which is a building block for our tracker. We also described the CSR-DCF method, that improves correlation filters by using a binary mask and channel weights. The mask determines which pixels are important for filter learning, and is calculated based on appearance likehood and spatial likehood, whereas channel weights determine the discriminative power of each channel. The tracker is one of the best on the VOT challenge, but its main drawback is the inefficient implementation in Matlab. We wanted to make the tracker widely accessible and have it run more efficiently. That is why we implemented it using C++ programming language and added it to OpenCV library. We used VOT framework to test the effects of different parameters on the speed and accuracy of this tracker, and to compare the original implementation in Matlab to ours. We concluded that our implementation produces similar results while running at a higher speed. The tracker is currently the best in OpenCV library, and was ranked first among realtime trackers on VOT2017.
|