Deep Learning based enhanced aerial object detection
DOI:
https://doi.org/10.58825/jog.2025.19.1.197Keywords:
Object detection, UAV, Super resolutionAbstract
In congested urban environments, accurate detection and counting of humans and vehicles provide valuable insights for optimizing traffic flow, identifying congestion hotspots, and designing efficient transportation systems. By leveraging computer vision algorithms, such as deep learning based object detection models, real-time monitoring of pedestrian and vehicular traffic can be achieved with high accuracy and granularity. The ability to precisely quantify pedestrian and vehicle movements enables urban planners and policymakers to make data-driven decisions regarding infrastructure development, road maintenance, and public transit planning. In this work, we enhanced the existing deep learning based network architecture for object detection using UAV images. The enhanced network architecture can detect and give a count of the number of objects for any particular area in the image.
References
Aswini N. and U. S V (2021) "Drone Object Detection Using Deep Learning Algorithms," 2021 Third International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, India, pp. 1187-1192, doi: 10.1109/ICIRCA51532.2021.9544983.
Albaba B. M. and S. Ozer (2021) "SyNet: An Ensemble Network for Object Detection in UAV Images," 25th International Conference on Pattern Recognition (ICPR), Milan, Italy, pp. 10227-10234, doi: 10.1109/ICPR48806.2021.9412847.
Anwar S. and N. Barnes (2022) "Densely Residual Laplacian Super-Resolution," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 3, pp. 1192-1204, doi: 10.1109/TPAMI.2020.3021088.
Bazi Y. and F. Melgani (2018), "Convolutional SVM Networks for Object Detection in UAV Imagery," in IEEE Transactions on Geoscience and Remote Sensing, vol. 56, no. 6, pp. 3107-3118, doi: 10.1109/TGRS.2018.2790926
Bejiga M. B. A. Zeggada and F. Melgani (2016) "Convolutional neural networks for near real-time object detection from UAV imagery in avalanche search and rescue operations," IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, pp. 693-696, doi: 10.1109/IGARSS.2016.7729174.
Hosseinpoor H. and F. Samadzadegan (2020) "Convolutional Neural Network for Building Extraction from High-Resolution Remote Sensing Images," International Conference on Machine Vision and Image Processing (MVIP), Iran, pp. 1-5, doi: 10.1109/MVIP49855.2020.9187483.
Li C., B. Zhang, D. Hong, J. Yao and J. Chanussot, (2023) "LRR-Net: An Interpretable Deep Unfolding Network for Hyperspectral Anomaly Detection," in IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-12, Art no. 5513412, doi: 10.1109/TGRS.2023.3279834
Lim B., S. Son, H. Kim, S. Nah, and K. Mu Lee (2017) “Enhanced deep residual networks for single image super-resolution,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. Workshops (CVPRW), pp. 136–144.
Liu Y., F. Yang and P. Hu (2020) "Small-Object Detection in UAV-Captured Images via Multi-Branch Parallel Feature Pyramid Networks," in IEEE Access, vol. 8, pp. 145740-145750, doi: 10.1109/ACCESS.2020.3014910.
Maharjan N., H. Miyazaki, B. M. Pati, M. N. Dailey, S. Shrestha and T. Nakamura (2022) "Detection of River Plastic Using UAV Sensor Data and Deep Learning" Remote Sensing 14, no. 13: 3049. https://doi.org/10.3390/rs14133049
Meng L., Z. Peng, J. Zhou, J. Zhang, Z. Lu, A. Baumann and Y. Du (2020) "Real-Time Detection of Ground Objects Based on Unmanned Aerial Vehicle Remote Sensing with Deep Learning: Application in Excavator Detection for Pipeline Safety" Remote Sensing 12, no. 1: 182. https://doi.org/10.3390/rs12010182
Micheal A. A., K. Vani (2022) Deep Learning-Based Multi-class Multiple Object Tracking in UAV Video. J Indian Soc Remote Sens 50, 2543–2552. https://doi.org/10.1007/s12524-022-01615-7
Mittal P., R. Singh and A. Sharma (2020) "Deep learning-based object detection in low-altitude UAV datasets: A survey." Image and Vision computing 104, 104046.
Papageorgiou C. P., M. Oren and T. Poggio (1998) "A general framework for object detection," Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), Bombay, Indiam, pp. 555-562, doi: 10.1109/ICCV.1998.710772.
Pathak A. R., M. Pandey and S. Rautaray (2018) “Application of Deep Learning for Object Detection”, Procedia Computer Science, Volume 132, Pages 1706-1717, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2018.05.144
Razakarivony S. and F. Jurie (2016) “Vehicle detection in aerial imagery: A small target detection benchmark,” J. Vis. Commun. Image Represent., vol. 34, pp. 187–203, 2016
Saetchnikov I. V., E. A. Tcherniavskaia and V. V. Skakun (2021) "Object Detection for Unmanned Aerial Vehicle Camera via Convolutional Neural Networks," in IEEE Journal on Miniaturization for Air and Space Systems, vol. 2, no. 2, pp. 98-103, doi: 10.1109/JMASS.2020.3040976
Sharma M. et al., (2021) “YOLOrs: Object detection in multimodal remote sensing imagery,” IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 14, pp. 1497–1508
Sun W., L. Dai, X. Zhang et al. (2022) RSOD: Real-time small object detection algorithm in UAV-based traffic monitoring. Appl Intell 52, 8448–8463. https://doi.org/10.1007/s10489-021-02893-3
Wang J., X. Teng, Z. Li, Q. Yu, Y. Bian and J. Wei (2022) "VSAI: A Multi-View Dataset for Vehicle Detection in Complex Scenarios Using Aerial Images" Drones 6, no. 7: 161. https://doi.org/10.3390/drones6070161
Wu X., D. Hong and J. Chanussot (2022) "Convolutional Neural Networks for Multimodal Remote Sensing Data Classification," in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-10, Art no. 5517010, doi: 10.1109/TGRS.2021.3124913
Wu X., D. Hong and J. Chanussot (2023) "UIU-Net: U-Net in U-Net for Infrared Small Object Detection," in IEEE Transactions on Image Processing, vol. 32, pp. 364-376, doi: 10.1109/TIP.2022.3228497
Wu X., W. Li, D. Hong, R. Tao and Q. Du (2021). Deep Learning for UAV-based Object Detection and Tracking: A Survey. ArXiv. https://doi.org/10.1109/MGRS.2021.3115137
Xiao J., S. Zhang, Y. Dai, Z. Jiang, B. Yi and C. Xu (2020) "Multiclass Object Detection in UAV Images Based on Rotation Region Network," in IEEE Journal on Miniaturization for Air and Space Systems, vol. 1, no. 3, pp. 188-196, doi: 10.1109/JMASS.2020.3025970.
Zeng Y., T. Zhang, W. He and Z. Zhang (2023) "YOLOv7-UAV: An Unmanned Aerial Vehicle Image Object Detection Algorithm Based on Improved YOLOv7" Electronics 12, no. 14: 3141. https://doi.org/10.3390/electronics12143141
Zhang J., J. Lei, W. Xie, Z. Fang, Y. Li and Q. Du (2023) "SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery," in IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-15, Art no. 5605415, doi: 10.1109/TGRS.2023.3258666.
Zhang X., X. Zhou, M. Lin and J. Sun (2018) “ShuffleNet: An extremely efficient convolutional neural network for mobile devices,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit, pp. 6848–6856.
Zhang R., H. Li, K. Duan, S. You, K. Liu, F. Wang and Y. Hu. (2020). "Automatic Detection of Earthquake-Damaged Buildings by Integrating UAV Oblique Photography and Infrared Thermal Imaging" Remote Sensing 12, no. 16: 2621. https://doi.org/10.3390/rs12162621
Zhou W., X. Min, R. Hu, Y. Long and H. Luo (2022) FasterX: Real-Time Object Detection Based on Edge GPUs for UAV Applications. ArXiv. /abs/2209.03157
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Journal of Geomatics

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.