Indices matter: Learning to index for deep image matting

Lu, H.; Dai, Y.; Shen, C.; Xu, S.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/126124

Scopus	Web of Science®	Altmetric
Citations
?	?

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lu, H.	-
dc.contributor.author	Dai, Y.	-
dc.contributor.author	Shen, C.	-
dc.contributor.author	Xu, S.	-
dc.date.issued	2019	-
dc.identifier.citation	Proceedings / IEEE International Conference on Computer Vision. IEEE International Conference on Computer Vision, 2019, vol.2019-October, pp.3265-3274	-
dc.identifier.isbn	9781728148038	-
dc.identifier.issn	1550-5499	-
dc.identifier.issn	2380-7504	-
dc.identifier.uri	http://hdl.handle.net/2440/126124	-
dc.description.abstract	We show that existing upsampling operators can be unified using the notion of the index function. This notion is inspired by an observation in the decoding process of deep image matting where indices-guided unpooling can often recover boundary details considerably better than other upsampling operators such as bilinear interpolation. By viewing the indices as a function of the feature map, we introduce the concept of ‘learning to index’, and present a novel index-guided encoder-decoder framework where indices are self-learned adaptively from data and are used to guide the pooling and upsampling operators, without extra training supervision. At the core of this framework is a flexible network module, termed IndexNet, which dynamically generates indices conditioned on the feature map. Due to its flexibility, IndexNet can be used as a plug-in applying to almost all off-the-shelf convolutional networks that have coupled downsampling and upsampling stages. We demonstrate the effectiveness of IndexNet on the task of natural image matting where the quality of learned indices can be visually observed from predicted alpha mattes. Results on the Composition-1k matting dataset show that our model built on MobileNetv2 exhibits at least 16:1% improvement over the seminal VGG-16 based deep matting baseline, with less training data and lower model capacity. Code and models have been made available at: https://tinyurl.com/IndexNetV1.	-
dc.description.statementofresponsibility	Hao Lu, Yutong Dai, Chunhua Shen, Songcen Xu	-
dc.language.iso	en	-
dc.publisher	IEEE	-
dc.relation.ispartofseries	IEEE International Conference on Computer Vision	-
dc.rights	© 2019 IEEE	-
dc.source.uri	http://dx.doi.org/10.1109/iccv.2019.00336	-
dc.subject	Indexes; Interpolation; Task analysis; Decoding; Semantics; Image resolution; Deconvolution	-
dc.title	Indices matter: Learning to index for deep image matting	-
dc.type	Conference paper	-
dc.contributor.conference	IEEE/CVF International Conference on Computer Vision (ICCV) (27 Oct 2019 - 2 Nov 2019 : Seoul, South Korea)	-
dc.identifier.doi	10.1109/ICCV.2019.00336	-
pubs.publication-status	Published	-
dc.identifier.orcid	Lu, H. [0000-0003-3854-8664]	-
dc.identifier.orcid	Dai, Y. [0000-0001-8019-4228]	-
Appears in Collections:	Aurora harvest 8 Computer Science publications

Files in This Item:

File	Description	Size	Format
hdl_126124.pdf	Accepted version	3.16 MB	Adobe PDF	View/Open

Show simple item record

Adelaide Research & Scholarship