Bootstrapping the performance of webly supervised semantic segmentation

Shen, T.; Lin, G.; Shen, C.; Reid, I.

doi:10.1109/CVPR.2018.00148

Bootstrapping the performance of webly supervised semantic segmentation

Date

2018

Authors

Shen, T.

Lin, G.

Shen, C.

Reid, I.

Type:

Conference paper

Citation

Proceedings / CVPR, IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2018, pp.1363-1371

Statement of Responsibility

Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid

Conference Name

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (18 Jun 2018 - 23 Jun 2018 : Salt Lake City, USA)

DOI

10.1109/CVPR.2018.00148

Abstract

Fully supervised methods for semantic segmentation require pixel-level class masks to train, the creation of which is expensive in terms of manual labour and time. In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth. Our method is formulated as a twostage approach in which we first aim to create accurate pixel-level masks for the training images via a bootstrapping process, and then use these now-accurately segmented images as a proxy ground-truth in a more standard supervised setting. The key driver for our work is that in the target dataset we typically have reliable ground-truth image-level labels, while data crawled from the web may have unreliable labels, but can be filtered to comprise only easy images to segment, therefore having reliable boundaries. These two forms of information are complementary and we use this observation to build a novel bi-directional transfer learning framework. This framework transfers knowledge between two domains, target domain and web domain, bootstrapping the performance of weakly supervised semantic segmentation. Conducting experiments on the popular benchmark dataset PASCAL VOC 2012 based on both a VGG16 network and on ResNet50, we reach state-of-the-art performance with scores of 60.2% IoU and 63.9% IoU respectively¹.

Rights

Grant ID

http://purl.org/au-research/grants/arc/CE140100016
http://purl.org/au-research/grants/arc/FT120100969

Published Version

https://ieeexplore.ieee.org/xpl/conhome/8576498/proceeding

Persistent link to this record

http://hdl.handle.net/2440/124478

Full item page

Bootstrapping the performance of webly supervised semantic segmentation

Date

Authors

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Citation

Statement of Responsibility

Conference Name

DOI

Abstract

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

License

Grant ID

Published Version

Call number

Persistent link to this record