P-ISSN 0974-6846, E-ISSN 0974-5645

Indian Journal of Science and Technology



Year: 2023, Volume: 16, Issue: 31, Pages: 2374-2379

Original Article

Computer Vision Application in Object Detection and Tracking for Aerial Surveillance

Received Date: 20 March 2023, Accepted Date: 09 July 2023, Published Date: 14 August 2023


Objectives: Computer vision tasks such as object detection, tracking, and counting are important for surveillance. Factors such as altitude, camera angle, occlusion, and motion blur make these tasks more challenging. This work aims to present a method that overcomes these factors and performs surveillance quickly and accurately for both smaller and larger object aspect ratios.

Methods: Horizontal Bounding Boxes (HBB) and Oriented Bounding Boxes (OBB) are each evaluated against their respective ground truth. The PASCAL VOC 07 metric is adopted to calculate the mean average precision. The original implementation of Mask R-CNN applies the mask head only to the 100 highest-scoring HBBs. Here, the mask head was extended to all HBBs remaining after Non-Maximum Suppression. This modification allowed the Mask R-CNN, Cascade Mask R-CNN, and Hybrid Task Cascade methods to be evaluated on a wider range of bounding boxes.

Findings: This research explores and compares approaches and techniques for object detection, focusing on oriented object detection and the challenges posed by geometric variations. It also addresses how different models, such as Mask R-CNN, Faster R-CNN OBB + RoI Transformer, and Faster R-CNN OBB + Dpool, affect performance, and it highlights the importance of handling the numerical instability caused by extremely small instances. The findings are presented visually in Figure 2, which shows the performance of the various networks.

Novelty: The study summarizes the findings of existing research papers and identifies research gaps. The performance parameters of the algorithms and the analysis of the networks show how the methods have evolved over the years. The results, discussion, and conclusions explain how changes such as mask transferring and the choice of dataset affect the accuracy for smaller and bigger objects and the speed of execution.
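
The Methods mention the PASCAL VOC 07 metric for mean average precision. As a rough illustration only, and not the authors' evaluation code, the 11-point interpolated average precision used by that metric can be sketched as follows. The function names and the input format (a list of score and true-positive pairs per class) are assumptions made for this example; matching detections to HBB or OBB ground truth by overlap is assumed to have been done beforehand.

```python
def precision_recall(scored_matches, num_ground_truth):
    """Cumulative precision and recall from (score, is_true_positive) pairs.

    Assumes detections were already matched to ground truth (HBB or OBB)
    and that num_ground_truth is greater than zero.
    """
    ranked = sorted(scored_matches, key=lambda m: m[0], reverse=True)
    recalls, precisions = [], []
    true_positives = 0
    for rank, (_, is_tp) in enumerate(ranked, start=1):
        if is_tp:
            true_positives += 1
        recalls.append(true_positives / num_ground_truth)
        precisions.append(true_positives / rank)
    return recalls, precisions


def voc07_average_precision(recalls, precisions):
    """11-point interpolated AP: average, over recall thresholds
    0.0, 0.1, ..., 1.0, of the best precision at recall >= threshold."""
    total = 0.0
    for i in range(11):
        threshold = i / 10
        reachable = [p for r, p in zip(recalls, precisions) if r >= threshold]
        total += max(reachable) if reachable else 0.0
    return total / 11


def mean_average_precision(per_class):
    """per_class maps a class name to (scored_matches, num_ground_truth)."""
    aps = [
        voc07_average_precision(*precision_recall(matches, count))
        for matches, count in per_class.values()
    ]
    return sum(aps) / len(aps)
```

With this sketch, a class in which every ground-truth object is found without false positives scores 1.0. A class in which only half of the objects are ever recalled scores at most 6/11, because the five thresholds above 0.5 contribute zero.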

Keywords: RCNN; Deep learning; Object detection; Computer Vision; Drones




© 2023 Nandibewoor et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Published By Indian Society for Education and Environment (iSee)

