Abstract
Learning and capturing both appearance and dynamic representations are pivotal for crowd video understanding. Convolutional Neural Networks (CNNs) have shown remarkable potential in learning appearance representations from images. However, how to learn dynamic representations, and how to combine them effectively with appearance features for video analysis, remains an open problem. In this study, we propose a novel spatio-temporal CNN, named Slicing CNN (S-CNN), based on the decomposition of 3D feature maps into 2D spatial- and 2D temporal-slice representations. The decomposition brings unique advantages: (1) the model is capable of capturing the dynamics of different semantic units such as groups and objects, (2) it learns separate appearance and dynamic representations while keeping proper interactions between them, and (3) it exploits the selectiveness of spatial filters to discard irrelevant background clutter for crowd understanding. We demonstrate the effectiveness of the proposed S-CNN model on the WWW crowd video dataset for attribute recognition and observe significant performance improvements over the state-of-the-art methods (62.55%, up from 51.84% [21]).
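To make the slicing idea concrete, the following is a minimal NumPy sketch of how a 3D feature volume could be viewed as stacks of 2D spatial and temporal slices. It is an illustration only, not the S-CNN implementation: the function name `slice_feature_volume`, the `(T, H, W)` axis order, and the choice of xt and yt planes for the temporal slices are all assumptions, since the abstract does not specify them.

```python
import numpy as np


def slice_feature_volume(volume):
    """Return three stacks of 2D slices from a (T, H, W) feature volume.

    Hypothetical helper: the axis order and the xt/yt slicing planes are
    assumptions made for illustration.
    """
    if volume.ndim != 3:
        raise ValueError("expected a 3D array of shape (T, H, W)")
    xy = volume                     # T spatial slices, each of shape (H, W)
    xt = volume.transpose(1, 0, 2)  # H temporal slices, each of shape (T, W)
    yt = volume.transpose(2, 0, 1)  # W temporal slices, each of shape (T, H)
    return xy, xt, yt


# Toy volume: 4 time steps, height 3, width 5.
volume = np.arange(4 * 3 * 5, dtype=np.float32).reshape(4, 3, 5)
xy, xt, yt = slice_feature_volume(volume)
print(xy.shape, xt.shape, yt.shape)
```

Each stack could then be passed to ordinary 2D filters, so that the spatial slices carry appearance and the temporal slices carry motion along one image axis. The transposes return views rather than copies, so the three stacks share the same underlying data.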
Original language | English |
---|---|
Title of host publication | Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 |
Publisher | IEEE Computer Society |
Pages | 5620-5628 |
Number of pages | 9 |
ISBN (Electronic) | 9781467388504 |
Publication status | Published - Dec 9 2016 |
Externally published | Yes |
Event | 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 - Las Vegas, United States. Duration: Jun 26 2016 → Jul 1 2016 |
Publication series
Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
---|---|
Volume | 2016-December |
ISSN (Print) | 1063-6919 |
Conference
Conference | 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 |
---|---|
Country/Territory | United States |
City | Las Vegas |
Period | 6/26/16 → 7/1/16 |
Bibliographical note
Publisher Copyright: © 2016 IEEE.
ASJC Scopus Subject Areas
- Software
- Computer Vision and Pattern Recognition