Please use this identifier to cite or link to this item:
Scopus Web of Science® Altmetric
Type: Conference paper
Title: Weakly-supervised structured output learning with flexible and latent graphs using high-order loss functions
Author: Carneiro, G.
Peng, T.
Bayer, C.
Navab, N.
Citation: Proceedings / IEEE International Conference on Computer Vision. IEEE International Conference on Computer Vision, 2015, vol.2015 International Conference on Computer Vision, ICCV 2015, pp.648-656
Publisher: IEEE
Issue Date: 2015
Series/Report no.: IEEE International Conference on Computer Vision
ISBN: 9781467383912
ISSN: 1550-5499
Conference Name: 2015 IEEE International Conference on Computer Vision (ICCV 2015) (11 Dec 2015 - 18 Dec 2015 : Santiago, CHILE)
Statement of
Gustavo Carneiro, Tingying Peng, Christine Bayer, Nassir Navab
Abstract: We introduce two new structured output models that use a latent graph, which is flexible in terms of the number of nodes and structure, where the training process minimises a high-order loss function using a weakly annotated training set. These models are developed in the context of microscopy imaging of malignant tumours, where the estimation of the number and proportion of classes of microcirculatory supply units (MCSU) is important in the assessment of the efficacy of common cancer treatments (an MCSU is a region of the tumour tissue supplied by a microvessel). The proposed methodologies take as input multimodal microscopy images of a tumour, and estimate the number and proportion of MCSU classes. This estimation is facilitated by the use of an underlying latent graph (not present in the manual annotations), where each MCSU is represented by a node in this graph, labelled with the MCSU class and image location. The training process uses the manual weak annotations available, consisting of the number of MCSU classes per training image, where the training objective is the minimisation of a high-order loss function based on the norm of the error between the manual and estimated annotations. One of the models proposed is based on a new flexible latent structure support vector machine (FLSSVM) and the other is based on a deep convolutional neural network (DCNN) model. Using a dataset of 89 weakly annotated pairs of multimodal images from eight tumours, we show that the quantitative results from DCNN are superior, but the qualitative results from FLSSVM are better and both display high correlation values regarding the number and proportion of MCSU classes compared to the manual annotations.
Keywords: Tumors, training, manuals, cancer, support vector machines, microscopy, computer vision
Rights: © 2015 IEEE
DOI: 10.1109/ICCV.2015.81
Grant ID:
Published version:
Appears in Collections:Aurora harvest 3
Computer Science publications

Files in This Item:
File Description SizeFormat 
  Restricted Access
Restricted Access1.94 MBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.