Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/121689
Citations
Scopus Web of Science® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.authorWang, P.-
dc.contributor.authorLiu, L.-
dc.contributor.authorShen, C.-
dc.contributor.authorShen, H.-
dc.date.issued2019-
dc.identifier.citationPattern Recognition, 2019; 91:357-365-
dc.identifier.issn0031-3203-
dc.identifier.issn1873-5142-
dc.identifier.urihttp://hdl.handle.net/2440/121689-
dc.description.abstractMost video based action recognition approaches create the video-level representation by temporally pooling the features extracted at every frame. The pooling methods they adopt, however, usually completely or partially ignore the dynamic information contained in the temporal domain, which may undermine the discriminative power of the resulting video representation since the video sequence order could unveil the evolution of a specific event or action. To overcome this drawback and explore the importance of incorporating the temporal order information, in this paper we propose a novel temporal pooling approach to aggregate the frame-level features. Inspired by the capacity of Convolutional Neural Networks (CNN) in making use of the internal structure of images for information abstraction, we propose to apply the temporal convolution operation to the frame-level representations to extract the dynamic information. However, directly implementing this idea on the original high-dimensional feature will result in parameter explosion. To handle this issue, we propose to treat the temporal evolution of the feature value at each feature dimension as a 1D signal and learn a unique convolutional filter bank for each 1D signal. By conducting experiments on three challenging video-based action recognition datasets, HMDB51, UCF101, and Hollywood2, we demonstrate that the proposed method is superior to the conventional pooling methods.-
dc.description.statementofresponsibilityPeng Wang, Lingqiao Liu, Chunhua Shen, Heng Tao Shen-
dc.language.isoen-
dc.publisherElsevier-
dc.rights© 2019 Elsevier Ltd. All rights reserved.-
dc.subjectAction recognition; convolutional neural network; temporal pooling-
dc.titleOrder-aware convolutional pooling for video based action recognition-
dc.typeJournal article-
dc.identifier.doi10.1016/j.patcog.2019.03.002-
pubs.publication-statusPublished-
dc.identifier.orcidShen, C. [0000-0002-8648-8718]-
Appears in Collections:Aurora harvest 8
Computer Science publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.