Crowd counting via weighted VLAD on a dense attribute feature map

Sheng, B.; Shen, C.; Lin, G.; Li, J.; Yang, W.; Sun, C.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/117266

Full metadata record

DC Field	Value	Language
dc.contributor.author	Sheng, B.	-
dc.contributor.author	Shen, C.	-
dc.contributor.author	Lin, G.	-
dc.contributor.author	Li, J.	-
dc.contributor.author	Yang, W.	-
dc.contributor.author	Sun, C.	-
dc.date.issued	2018	-
dc.identifier.citation	IEEE Transactions on Circuits and Systems for Video Technology, 2018; 28(8):1788-1797	-
dc.identifier.issn	1051-8215	-
dc.identifier.issn	1558-2205	-
dc.identifier.uri	http://hdl.handle.net/2440/117266	-
dc.description.abstract	Crowd counting is an important task in computer vision, which has many applications in video surveillance. Although the regression-based framework has achieved great improvements for crowd counting, how to improve the discriminative power of image representation is still an open problem. Conventional holistic features used in crowd counting often fail to capture semantic attributes and spatial cues of the image. In this paper, we propose integrating semantic information into learning locality-aware feature (LAF) sets for accurate crowd counting. First, with the help of a convolutional neural network, the original pixel space is mapped onto a dense attribute feature map, where each dimension of the pixelwise feature indicates the probabilistic strength of a certain semantic class. Then, LAF built on the idea of spatial pyramids on neighboring patches is proposed to explore more spatial context and local information. Finally, the traditional vector of locally aggregated descriptors (VLAD) encoding method is extended to a more generalized form, weighted-VLAD (W-VLAD), in which diverse coefficient weights are taken into consideration. Experimental results validate the effectiveness of our presented method.	-
dc.description.statementofresponsibility	Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun	-
dc.language.iso	en	-
dc.publisher	IEEE	-
dc.rights	© 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.	-
dc.source.uri	http://dx.doi.org/10.1109/tcsvt.2016.2637379	-
dc.subject	Semantics; feature extraction; image representation; encoding; roads; neural networks; image segmentation	-
dc.title	Crowd counting via weighted VLAD on a dense attribute feature map	-
dc.type	Journal article	-
dc.identifier.doi	10.1109/TCSVT.2016.2637379	-
pubs.publication-status	Published	-
dc.identifier.orcid	Shen, C. [0000-0002-8648-8718]	-
Appears in Collections:	Aurora harvest 8 Electrical and Electronic Engineering publications

Files in This Item:

There are no files associated with this item.

Adelaide Research & Scholarship