Deep Object Segmentation and Beyond

Shen, ChunhuaWang, Xinlong2022-10-042022-10-042022https://hdl.handle.net/2440/136542Object segmentation is a fundamental computer vision problem that aims to recognize the object of interest and group the corresponding pixels in an image. With wide applications in self-driving cars, medical imaging, augmented reality, etc., object segmentation has attracted a lot of research attention. In this thesis, we propose a series of methods to solve the challenging problem with deep neural networks. We further generalize the proposed methods to solve extensive tasks such as image matting and study the interactions between object segmentation and unsupervised learning. First, we propose segmenting objects by locations (SOLO), a new, embarrassingly simple approach to segment all the object instances in an image and recognize their categories. Unlike previous methods that rely on either bounding box detection or grouping post-processing, SOLO directly maps a raw input image to the desired object categories and instance masks with a fully convolutional network. We demonstrate a much simpler and flexible instance segmentation framework with strong performance. Second, we present SOLOv2, a dynamic and fast instance segmentation solution that follows the principle of SOLO but improves it in terms of both speed and accuracy. SOLOv2 achieves state-of-the-art results with high efficiency, making it suitable for both mobile and cloud applications. We further demonstrate the generality of our method by extending it to perform panoptic segmentation and image matting. Third, we propose dense contrastive learning (DenseCL) to learn better representation from large-scale unlabeled images for dense prediction tasks such as segmentation. The proposed DenseCL performs dense pairwise contrastive learning at the level of pixels. Our method largely closes the gap between self-supervised pre-training and downstream dense prediction tasks. Finally, we propose a fully unsupervised learning method that learns to segment objects without any annotations. We present FreeSOLO, a self-supervised instance segmentation framework built on top of our simple-yet-effective methods SOLO(v2) for segmentation, and DenesCL for unsupervised learning. For the first time, we demonstrate unsupervised instance segmentation successfully. The code and models are publicly available at https://github.com/WXinlong.ennstance segmentation, object detection, panoptic segmentation, image matting, self-supervised learningDeep Object Segmentation and BeyondThesis