Please use this identifier to cite or link to this item:
Scopus Web of Science® Altmetric
Type: Journal article
Title: Human action segmentation and recognition using discriminative semi-Markov models
Author: Shi, Q.
Li, C.
Wang, L.
Smola, A.
Citation: International Journal of Computer Vision, 2011; 93(1):22-32
Publisher: Kluwer Academic Publ
Issue Date: 2011
ISSN: 0920-5691
Statement of
Qinfeng Shi, Li Cheng, Li Wang, Alex Smola
Abstract: A challenging problem in human action understanding is to jointly segment and recognize human actions from an unseen video sequence, where one person performs a sequence of continuous actions. In this paper, we propose a discriminative semi-Markov model approach, and define a set of features over boundary frames, segments, as well as neighboring segments. This enable us to conveniently capture a combination of local and global features that best represent each specific action type. To efficiently solve the inference problem of simultaneous segmentation and recognition, a Viterbi-like dynamic programming algorithm is utilized, which in practice is able to process 20 frames per second. Moreover, the model is discriminatively learned from large margin principle, and is formulated as an optimization problem with exponentially many constraints. To solve it efficiently, we present two different optimization algorithms, namely cutting plane method and bundle method, and demonstrate that each can be alternatively deployed in a "plug and play" fashion. From its theoretical aspect, we also analyze the generalization error of the proposed approach and provide a PAC-Bayes bound. The proposed approach is evaluated on a variety of datasets, and is shown to perform competitively to the state-of-the-art methods. For example, on KTH dataset, it achieves 95.0% recognition accuracy, where the best known result on this dataset is 93.4% (Reddy and Shah in ICCV, 2009). © 2010 Springer Science+Business Media, LLC.
Keywords: Action segmentation and recognition
Large-margin method
Semi-Markov model
Rights: © Springer Science+Business Media, LLC 2010
DOI: 10.1007/s11263-010-0384-0
Published version:
Appears in Collections:Aurora harvest
Computer Science publications

Files in This Item:
File Description SizeFormat 
  Restricted Access
Restricted Access693.81 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.