Multimodal Image Analysis Based Pedestrian Detection Using Optimization with Classification by Hybrid Machine Learning Model

Authors: Johnson Kolluri, Ranjita Das

Journal: International Journal of Image, Graphics and Signal Processing (IJIGSP)

Published in issue: No. 1, Vol. 17, 2025.

Free access

Pedestrians commonly display substantial intra-class variability in both appearance and position, which makes pedestrian recognition difficult. Current computer vision tasks such as object identification and object classification have drawn considerable attention to deep learning (DL) models, and these applications rely on supervised learning, which requires labels. Multimodal imaging enables examining more than one molecule at a time, so that cellular events can be examined simultaneously or their progression followed in real time. The purpose of this study is to propose and construct a hybrid machine learning (ML) pedestrian identification model based on multimodal datasets. For pedestrian detection, the input is gathered as multimodal images, which are then processed for noise reduction, smoothing, and normalization. The enhanced image is then categorized using metaheuristic salp cross-modal swarm optimization and optimized using naive spatio kernelized extreme convolutional transfer learning. We evaluated the proposed approach on three publicly available benchmark datasets for multimodal pedestrian identification. Experimental analysis is reported in terms of average precision, log-average miss rate, accuracy, F1 score, and equal error rate. The results show that our method achieves state-of-the-art detection on open datasets. The proposed technique attained an average precision of 95%, a log-average miss rate of 81%, an accuracy of 61%, an F1 score of 51%, and an equal error rate of 59%.
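The preprocessing stage named in the abstract (noise reduction, smoothing, normalization) can be illustrated with a minimal sketch. The abstract does not specify which filters or modalities are used, so everything below is an assumption made for illustration: a 3x3 median filter stands in for noise reduction, a 3x3 mean filter for smoothing, min-max scaling for normalization, and the visible/thermal pairing and all function names are hypothetical. It is not the authors' implementation.

```python
import numpy as np


def median_denoise(img: np.ndarray, size: int = 3) -> np.ndarray:
    """Noise reduction (assumed): median of each size x size neighbourhood."""
    pad = size // 2
    padded = np.pad(img, pad, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (size, size))
    return np.median(windows, axis=(-2, -1))


def box_smooth(img: np.ndarray, size: int = 3) -> np.ndarray:
    """Smoothing (assumed): mean of each size x size neighbourhood."""
    pad = size // 2
    padded = np.pad(img, pad, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (size, size))
    return windows.mean(axis=(-2, -1))


def min_max_normalize(img: np.ndarray) -> np.ndarray:
    """Normalization (assumed): rescale to [0, 1]; a constant image maps to zeros."""
    lo, hi = img.min(), img.max()
    if hi == lo:
        return np.zeros_like(img, dtype=np.float64)
    return (img - lo) / (hi - lo)


def preprocess(img: np.ndarray) -> np.ndarray:
    """Apply the three steps in the order the abstract lists them."""
    img = img.astype(np.float64)
    return min_max_normalize(box_smooth(median_denoise(img)))


def preprocess_pair(visible: np.ndarray, thermal: np.ndarray) -> np.ndarray:
    """Preprocess two single-channel modalities independently and stack them.

    The visible/thermal pairing is a hypothetical choice of modalities.
    """
    return np.stack([preprocess(visible), preprocess(thermal)], axis=-1)
```

Calling `preprocess_pair(visible, thermal)` on two equally sized grayscale arrays yields an array of shape `(H, W, 2)` with values in `[0, 1]`, which a downstream classifier could take as input. Processing each modality separately keeps the intensity ranges of the two sensors from influencing each other during normalization.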


Pedestrian Detection, Multimodal Image Analysis, Classification, Optimization, Hybrid Machine Learning, Salp Cross Modality

Short URL: https://sciup.org/15019649

IDR: 15019649 | DOI: 10.5815/ijigsp.2025.01.03



Research article