Computer Vision Application in Object Detection and Tracking for Aerial Surveillance

Archana Nandibewoor; L K Prateek; Manish Sakaray; Abul Hassan; Akshay Ravankar; Abhilash Hegde

doi:10.17485/IJST/v16i31.645

Article

Computer Vision Application in Object Detection and Tracking for Aerial Surveillance

VIEWS 579
PDF 151

Indian Journal of Science and Technology

DOI: 10.17485/IJST/v16i31.645

Year: 2023, Volume: 16, Issue: 31, Pages: 2374-2379

Original Article

Computer Vision Application in Object Detection and Tracking for Aerial Surveillance

Archana Nandibewoor^1*, L K Prateek², Manish Sakaray², Abul Hassan², Akshay Ravankar², Abhilash Hegde³

¹Assistant Professor, Research and Development Center, SDMCET, Affiliated to VTU, Dharwad, Karnataka, India
²Student, Computer Science and Engineering, SDMCET, Dharwad, India
³Junior Research Fellow, Research Scholar (Ph.D.), SDMCET, Dharwad, India

*Corresponding Author
Email: [email protected]

Received Date:20 March 2023, Accepted Date:09 July 2023, Published Date:14 August 2023

This work is licensed under a Creative Commons Attribution 4.0 International License.

Abstract

Objectives: Computer vision duties like object detection, tracking, and counting are significant for surveillance. Factors like altitude, camera angle, occlusion, and motion blur make it a more challenging task. To present a method to overcome all these factors and implement surveillance quickly and accurately for smaller and larger object aspect ratios. Methods: Horizontal Bounding Boxes and Oriented Bounding Boxes (HBB and OBB) are evaluated on two ground truths respectively. PASCAL VOC 07 metric is adopted to calculate the mean average precision. Constructed on the score, the original implementation of Mask R-CNN includes the application of a mask head to the highest-scoring 100 HBBs. Subsequently, the mask head was extended to all HBBs remaining after the process of Non-Maximum Suppression. This modification allowed the evaluation of Mask R-CNN, Cascade Mask RCNN, and Hybrid Task Cascade methods on a wider range of bounding boxes. Findings: In summary, this research explores and compares different approaches and techniques in the field of object detection, particularly focusing on oriented object detection and the challenges posed by geometric variations. Furthermore, it addresses the impact of different models, such as Mask R-CNN, Faster R-CNN OBB + RoI Transformer, and Faster R-CNN OBB + Dpool, on performance. Additionally, it highlights the importance of handling numerical instability caused by extremely small instances. The research findings are visually presented in Figure 2, providing a clear representation of the performance of various networks. Novelty: The study summarizes the findings of existing research papers and identifies research gaps. The performance parameters of the various algorithms and analysis for various networks show the evolution of various methods over the years. With changes in the network, like mask transferring and dataset, the accuracy for smaller, bigger objects and speed of execution are affected, are explained in results and discussions as well as the conclusions.

Keywords: RCNN; Deep learning; Object detection; Computer Vision; Drones

References

Cazzato D, Cimarelli C, Sanchez-Lopez JL, Voos H, Leo M. A Survey of Computer Vision Methods for 2D Object Detection from Unmanned Aerial Vehicles. Journal of Imaging. 2020;6(8):78. Available from: https://doi.org/10.3390/jimaging6080078
Al-Kaff A, Martín D, García FT, Escalera ADL, Armingol JM. Survey of computer vision algorithms and applications for unmanned aerial vehicles. Expert Systems with Applications. 2018;92:447–463. Available from: https://doi.org/10.1016/j.eswa.2017.09.033
Pradhan PK, Baruah U. Object Detection Under Occlusion in Aerial Images: A Review. In: D, N., eds. Lecture Notes in Networks and Systems. (Vol. 281, pp. 215-227) Springer Singapore. 2022.
Jadhav R, Patil R, Diwan A, Rathod SM, Inamdar M. Aerial Object Detection and Tracking using YOLOv4 and DeepSORT. In: 2022 International Conference on Industry 4.0 Technology (I4Tech). (pp. 1-6) IEEE. 2022.
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, et al. An image is worth 16x16 words: Transformers for image recognition at scale. ICLR. 2021. Available from: https://doi.org/10.48550/arXiv.2010.11929
Rajjak SS, Kureshi AK. Recent Advances in Object Detection and Tracking for High Resolution Video: Overview and State-of-the-Art. In: 5th International Conference On Computing, Communication, Control And Automation (ICCUBEA). (Vol. 2, pp. 215-228) 2019.
Brian KS, Isaac-Medina D, Organisciak TP, Breckon M, Poyser CG, Willcocks, et al. Unmanned Aerial Vehicle Visual Detection and Tracking using Deep Neural Networks: A Performance Benchmark. 2021. Available from: https://openaccess.thecvf.com/content/ICCV2021W/AntiUAV/papers/Isaac-Medina_Unmanned_Aerial_Vehicle_Visual_Detection_and_Tracking_Using_Deep_Neural_ICCVW_2021_paper.pdf
Fan GPDP, Ji X, Qin MM, Cheng. Cognitive vision inspired object segmentation metric and loss function. Available from: https://doi.org/10.1360/SSI-2020-0370

Copyright

© 2023 Nandibewoor et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)