Performance of somatic structural variant calling in lung cancer using Oxford Nanopore sequencing technology

Files

hdl_148043.pdf (2.77 MB)
  (Published version)

Date

2024

Authors

Liu, L.
Zhang, J.
Wood, S.
Newell, F.
Leonard, C.
Koufariotis, L.T.
Nones, K.
Dalley, A.J.
Chittoory, H.
Bashirzadeh, F.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Journal article

Citation

BMC Genomics, 2024; 25(1):898-1-898-13

Statement of Responsibility

Lingchen Liu, Jia Zhang, Scott Wood, Felicity Newell, Conrad Leonard, Lambros T. Koufariotis, Katia Nones, Andrew J. Dalley, Haarika Chittoory, Farzad Bashirzadeh, Jung Hwa Son, Daniel Steinfort, Jonathan P. Williamson, Michael Bint, Carl Pahoff, Phan T. Nguyen, Scott Twaddell, David Arnold, Christopher Grainge, Peter T. Simpson, David Fielding, Nicola Waddell, and John V. Pearson

Conference Name

Abstract

Background: Lung cancer is a heterogeneous disease and the primary cause of cancer-related mortality worldwide. Somatic mutations, including large structural variants, are important biomarkers in lung cancer for selecting targeted therapy. Genomic studies in lung cancer have been conducted using short-read sequencing. Emerging long-read sequencing technologies are a promising alternative to study somatic structural variants, however there is no current consensus on how to process data and call somatic events. In this study, we preformed whole genome sequencing of lung cancer and matched non-tumour samples using long and short read sequencing to comprehensively benchmark three sequence aligners and seven structural variant callers comprised of generic callers (SVIM, Sniffles2, DELLY in generic mode and cuteSV) and somatic callers (Severus, SAVANA, nanomonsv and DELLY in somatic modes). Results: Different combinations of aligners and variant callers influenced somatic structural variant detection. The choice of caller had a significant influence on somatic structural variant detection in terms of variant type, size, sensitivity, and accuracy. The performance of each variant caller was assessed by comparing to somatic structural variants identified by short-read sequencing. When compared to somatic structural variants detected with short-read sequencing, more events were detected with long-read sequencing. The mean recall of somatic variant events identified by long-read sequencing was higher for the somatic callers (72%) than generic callers (53%). Among the somatic callers when using the minimap2 aligner, SAVANA and Severus achieved the highest recall at 79.5% and 79.25% respectively, followed by nanomonsv with a recall of 72.5%. Conclusion: Long-read sequencing can identify somatic structural variants in clincal samples. The longer reads have the potential to improve our understanding of cancer development and inform personalized cancer treatment.

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

© The Author(s) 2024. Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

License

Call number

Persistent link to this record