A novel YOLO LSTM approach for enhanced human action recognition in video sequences

Elnady, Mahmoud; Hossam El-Din Hassan;

Abstract


Human Action Recognition (HAR) is a critical task in computer vision with applications in surveillance, healthcare, and human-computer interaction. This paper introduces a novel approach combining the strengths of You Only Look Once (YOLO) for feature extraction and Long Short-Term Memory (LSTM) networks for temporal modeling to achieve robust and accurate action recognition in video sequences. The YOLO model efficiently identifies key features from individual frames, enabling real-time processing, while the LSTM network captures temporal dependencies to understand sequential dynamics in human movements. The proposed YOLO-LSTM framework is evaluated on multiple publicly available HAR datasets, achieving an accuracy of 96%, precision of 96%, recall of 97%, and F1-score of 96% on the UCF101 dataset; 99% across all metrics on the KTH dataset; 100% on the WEIZMANN dataset; and 98% on the IXMAS dataset. These results demonstrate the superior performance of our approach compared to existing methods in terms of both accuracy and processing speed. Additionally, this approach effectively handles challenges such as occlusions, varying illumination, and complex backgrounds, making it suitable for real-world applications. The results highlight the potential of combining object detection and recurrent architectures for advancing state-of-the-art HAR systems.


Other data

Title A novel YOLO LSTM approach for enhanced human action recognition in video sequences
Authors Elnady, Mahmoud; Hossam El-Din Hassan 
Keywords Human action recognition (HAR);Long short-term memory (LSTM);Temporal modeling;Video sequences
Issue Date 16-May-2025
Publisher Springer
Journal Scientific reports 
DOI 10.1038/s41598-025-01898-z
PubMed ID 40379779

Attached Files

File Description SizeFormat Existing users please Login
s41598-025-01898-z.pdf3.52 MBAdobe PDF    Request a copy
Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check



Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.