Human Activity Recognition in Video Sequences
Haiam Adel Abdul-Azim Ahmed;
Abstract
Recognizing human actions in video sequences has been a challenge problem in the last few years due to its real-world applications. A lot of action representation approaches have been proposed to improve the action recognition performance. Despite the popularity of local features-based approaches together with “Bag-of-Words” model for action representation, it fails to capture adequate spatial or temporal relationships. In an attempt to overcome this problem, a trajectory-based local representation approaches have been proposed to capture the temporal information.
This thesis introduces an improvement of trajectory-based human action recognition approaches to capture discriminative temporal relationships. In our approach, we extract trajectories by tracking the detected spatio-temporal interest points named “cuboid features” with matching its SIFT descriptors over the consecutive frames. We, also, proposed a linking and exploring method to obtain efficient trajectories for motion representation in realistic conditions. Then the volumes aligned the trajectories are described to represent human actions based on the Bag-of-Words (BOW) model. Finally, a support vector machine is used to classify human actions. The effectiveness of the proposed approach was evaluated on three popular datasets (KTH, Weizmann and UCF sports). Experimental results showed that the proposed approach yields performance improvement over the state-of-the-art approaches.
Chapter 1
This thesis introduces an improvement of trajectory-based human action recognition approaches to capture discriminative temporal relationships. In our approach, we extract trajectories by tracking the detected spatio-temporal interest points named “cuboid features” with matching its SIFT descriptors over the consecutive frames. We, also, proposed a linking and exploring method to obtain efficient trajectories for motion representation in realistic conditions. Then the volumes aligned the trajectories are described to represent human actions based on the Bag-of-Words (BOW) model. Finally, a support vector machine is used to classify human actions. The effectiveness of the proposed approach was evaluated on three popular datasets (KTH, Weizmann and UCF sports). Experimental results showed that the proposed approach yields performance improvement over the state-of-the-art approaches.
Chapter 1
Other data
| Title | Human Activity Recognition in Video Sequences | Other Titles | التعرف على نشاط الإنسان فى الفيديو | Authors | Haiam Adel Abdul-Azim Ahmed | Issue Date | 2015 |
Recommend this item
Similar Items from Core Recommender Database
Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.