Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/136846
Citations
Scopus Web of Science® Altmetric
?
?
Type: Conference paper
Title: Contrastive Transformer-Based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection
Author: Tian, Y.
Pang, G.
Liu, F.
Liu, Y.
Wang, C.
Chen, Y.
Verjans, J.
Carneiro, G.
Citation: Lecture Notes in Artificial Intelligence, 2022 / Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (ed./s), vol.13433, pp.88-98
Publisher: Springer Link
Publisher Place: Online
Issue Date: 2022
Series/Report no.: Lecture Notes in Computer Science; 13433
ISBN: 978-3-031-16437-8
ISSN: 0302-9743
1611-3349
Conference Name: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) (18 Sep 2022 - 22 Sep 2022 : Singapore)
Editor: Wang, L.
Dou, Q.
Fletcher, P.T.
Speidel, S.
Li, S.
Statement of
Responsibility: 
Yu Tian, Guansong Pang, Fengbei Liu, Yuyuan Liu, Chong Wang, Yuanhong Chen, Johan Verjans, Gustavo Carneiro
Abstract: Current polyp detection methods from colonoscopy videos use exclusively normal (i.e., healthy) training images, which i) ignore the importance of temporal information in consecutive video frames, and ii) lack knowledge about the polyps. Consequently, they often have high detection errors, especially on challenging polyp cases (e.g., small, flat, or partially visible polyps). In this work, we formulate polyp detection as a weakly-supervised anomaly detection task that uses video-level labelled training data to detect frame-level polyps. In particular, we propose a novel convolutional transformer-based multiple instance learning method designed to identify abnormal frames (i.e., frames with polyps) from anomalous videos (i.e., videos containing at least one frame with polyp). In our method, local and global temporal dependencies are seamlessly captured while we simultaneously optimise video and snippet-level anomaly scores. A contrastive snippet mining method is also proposed to enable an effective modelling of the challenging polyp cases. The resulting method achieves a detection accuracy that is substantially better than current state-of-the-art approaches on a new large-scale colonoscopy video dataset introduced in this work. Our code and dataset are available at https://github.com/tianyu0207/weakly-polyp.
Keywords: Polyp detection; Colonoscopy; Weakly-supervised learning; Video anomaly detection; Vision transformer
Rights: © 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
DOI: 10.1007/978-3-031-16437-8_9
Grant ID: http://purl.org/au-research/grants/arc/DP180103232
http://purl.org/au-research/grants/arc/FT190100525
Published version: https://link.springer.com/book/10.1007/978-3-031-16437-8
Appears in Collections:Computer Science publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.