A practical solution to the problem of detecting peoples and vehicles from video frames

Vitaly Fralenko, Mikhail Khachumov
15m
The research is dedicated to solving the problem of people and vehicle localization in video frames. Video frames of areas with forest and roads are used as test data. The algorithm from the modified "deep_sort_realtime" package is used for object tracking. In addition, the capability to use Yolo 8 for object detection has been added, as well as the ability to extract informative features using Mobilenet v3. For the input images is used letterbox preprocessing, and various optimizations affecting the quality and speed of results have been added. For license plate recognition, the "tflite_avto_num_recognition" software package is used (which employs Canny and Hough transformations, as well as the CNN-LSTM-CTC neural network). The obtained solutions work in real time and rely on open-source libraries.