Implementation of YOLOv7 Model for Human Detection in Difficult Conditions

Arijal B, Andi Sunyoto, M. Hanafi
DOI: https://doi.org/10.33650/jeecom.v7i1.10662



Abstract

The rapid development of artificial intelligence technology in recent decades has led to the development of highly efficient object detection algorithms, including human detection under difficult conditions. Human detection is one of the major challenges in computer vision as it involves various complex factors such as obstructed human objects, pose variations, small low-resolution human objects, as well as the presence of fake human objects such as statues or images. This research uses the SLR (Systematic Literature Review) method to determine the algorithm used, namely YOLOv7. The three YOLOv7 models tested in this study are YOLOv7x.pt, YOLOv7-w6-person.pt, and YOLOv7-w6-pose.pt. These models were selected based on their excellence in detecting human objects and their relevance for complex scenarios. Tests were conducted using 100 images obtained from the internet and divided into four categories of human objects under difficult conditions, which represent various challenges in human detection. Analysis was performed using convusion matrix to evaluate performance metrics such as accuracy, precision, recall, and F1-score. Based on the test results, the YOLOv7-w6-person.pt model showed the best overall performance, especially in detecting humans in obstructed conditions and complex lighting with a precision of 90.4%, Recall 88.7%, and F1-Score 89.5%. This model has higher accuracy, precision, and F1-score than the other models, making it a reliable choice for human detection in difficult scenarios. These findings not only demonstrate the relevance of YOLOv7 as a reliable human detection algorithm, but also provide a basis for further optimization of YOLOv7-based human detection systems, both through improving the model architecture and adapting to more specific datasets. This research makes an important contribution to the development of human detection technologies for real-world applications, such as surveillance, crowd analysis, and automated safety systems.


Full Text:

PDF

References

Khairunnas, K., Yuniarno, E. M., & Zaini, A. (2021). Making Human Object Detection Module Using YOLO Method for Mobile Robot. ITS Engineering Journal, 10(1), A50-A55.

Shrestha, A., Yeophantong, T., & Nongpong, K. (2019, April). An Approach for Handling Partially Visible Human Shapes in People Detection Systems. In 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT) (pp. 1350-1354). IEEE.

Gajjar, V., Khandhediya, Y., Gurnani, A., Mavani, V., & Raval, M. S. (2018). In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (pp. 1908-1916).

Du, S., Zhang, B., Zhang, P., Xiang, P., & Xue, H. (2021). FA-YOLO: An improved YOLO model for infrared occlusion object detection under confusing background. Wireless Communications and Mobile Computing, 2021, 1-10.

Chen, Y., Wang, X., & Ouyang, Y. (2023). DCPDN: High-Performance Pedestrian Detection Networks for Difficult Conditions. IEEE Access.

Yu, X., Gong, Y., Jiang, N., Ye, Q., & Han, Z. (2020). Scale match for tiny person detection. In Proceedings of the IEEE/CVF winter conference on applications of computer vision (pp. 1257-1265).

Kataoka, H., Ohki, S., Iwata, K., & Satoh, Y. (2018, August). Occlusion Handling Human Detection with Refocused Images. In 2018 24th International Conference on Pattern Recognition (ICPR) (pp. 1701-1706). IEEE.

Terven, J., & Cordova-Esparza, D. (2023). A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS. arXiv preprint arXiv:2304.00501.

Anyim, A. C. (2023). A Comparative Study of YOLOv5 and YOLOv7 Modifications for Face Detection on a Custom Dataset. Center for Postgraduate Studies, University of the Balearic Islands.

Elharrouss, O., Akbari, Y., Almaadeed, N., & Al-Maadeed, S. (2022). Backbones-review: Feature extraction networks for deep learning and deep reinforcement learning approaches. arXiv preprint arXiv:2206.08016.

Wang, C. Y., Bochkovskiy, A., & Liao, H. Y. M. (2023). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 7464-7475).

Diponegoro, M. H., Kusumawardani, S. S., & Hidayah, I. (2021). Systematic literature review: implementation of deep learning methods in student performance prediction. National Journal of Electrical Engineering and Information Technology, 10(2), 131-138. [23] S. Leonelli and N. Tempini, Data Journeys in the Sciences. 2020.

Feng, G., Yang, Q., Tang, C., Liu, Y., Wu, X., & Wu, W. (2024). Mask-Wearing Detection in Complex Environments Based on Improved YOLOv7. Applied Sciences, 14(9), 3606. [18] C. Shang and F. You, "Data Analytics and Machine Learning for Smart Process Manufacturing: Recent Advances and Perspectives in the Big Data Era," Engineering, vol. 5, no. 6, pp. 1010-1016, 2019, doi: 10.1016/j.eng.2019.01.019.

Mulyo, H., & Kusumodestoni, H. (2022). Object Detection on CCTV for Smart City Kendal Regency. AMRI National Journal of Information Engineering Method Analysis, 1(2), 121-124.

Aljabri, M., AlAmir, M., AlGhamdi, M., Abdel-Mottaleb, M., & Collado-Mesa, F. (2022). Towards a better understanding of annotation tools for medical imaging: a survey. Multimedia tools and applications, 81(18), 25877-25911.

Shrestha, A., Yeophantong, T., & Nongpong, K. (2019, April). An Approach for Handling Partially Visible Human Shapes in People Detection Systems. In 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT) (pp. 1350-1354). IEEE.

Pradana, R. L., Khumaidi, A., & Andiana, R. (2022). Identification of Defect Causes in Welding Results with Image Processing Using the YOLO Method. J. Tech. Electro and Comput. Triac, 9(3), 1-5.

Zhou, J., Zhang, Y., & Wang, J. (2023). RDE-YOLOv7: an improved model based on YOLOv7 for better performance in detecting dragon fruits. Agronomy, 13(4), 1042.

Wahyuni, S., & Sulaeman, M. (2022). Application of deep learning algorithm for face detection attendance system at PT Karya Komponen Presisi. SIMANTIK Informatics Journal, 7(1), 12-21.

Zhang, Z., Lu, X., Cao, G., Yang, Y., Jiao, L., & Liu, F. (2021). ViT-YOLO: Transformer-based YOLO for object detection. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 2799-2808).

Zahari, I., Pratama, Z., Mahmud, W., & Wibowo, D. A. (2022). Computer Answer Sheet Checking System with Optical Mark Recognition (OMR) Based on Open Computer Vision Python. In National Seminar on Innovation and Development of Applied Technology (SENOVTEK) (Vol. 1, pp. 9-16).

Hero, M. R., Setyanto, A., & Arief, M. R. (2024). A Comprehensive Review of Clasifier used with Imbalanced Data in Machine Learning. Journal of Electrical Engineering and Computer (JEECOM), 6(1), 177-185. [25] Q. Song, H. Ge, J. Caverlee, and X. Hu, "Tensor completion algorithms in big data analytics," arXiv, vol. 13, no. 1, 2017.

Cholissodin, I., Sutrisno, S., Soebroto, A. A., Hasanah, U., & Febiola, Y. I. (2020). AI, Machine Learning & Deep Learning. Faculty of Computer Science, Brawijaya University, Malan

Ertel, W. (2018). Introduction to artificial intelligence. Springer.

} Sulistiyanto, S., 2022. Perancangan Software The Smart Tourism Bali Dengan Android. Joutica, 7(1), pp.551-555.


Dimensions, PlumX, and Google Scholar Metrics

10.33650/jeecom.v7i1.10662


Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Arijal B

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Creative Commons License
 
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Journal of Electrical Engineering and Computer (JEECOM)
Published by LP3M Nurul Jadid University, Indonesia, Probolinggo, East Java, Indonesia.