Positive-unlabeled learning in bioinformatics and computational biology: A brief review

Li, F.; Dong, S.; Leier, A.; Han, M.; Guo, X.; Xu, J.; Wang, X.; Pan, S.; Jia, C.; Zhang, Y.; Webb, G.I.; Coin, L.J.M.; Li, C.; Song, J.

doi:10.1093/bib/bbab461

Positive-unlabeled learning in bioinformatics and computational biology: A brief review

dc.contributor.author	Li, F.
dc.contributor.author	Dong, S.
dc.contributor.author	Leier, A.
dc.contributor.author	Han, M.
dc.contributor.author	Guo, X.
dc.contributor.author	Xu, J.
dc.contributor.author	Wang, X.
dc.contributor.author	Pan, S.
dc.contributor.author	Jia, C.
dc.contributor.author	Zhang, Y.
dc.contributor.author	Webb, G.I.
dc.contributor.author	Coin, L.J.M.
dc.contributor.author	Li, C.
dc.contributor.author	Song, J.
dc.date.issued	2022
dc.description.abstract	Conventional supervised binary classification algorithms have been widely applied to address significant research questions using biological and biomedical data. This classification scheme requires two fully labeled classes of data (e.g. positive and negative samples) to train a classification model. However, in many bioinformatics applications, labeling data is laborious, and the negative samples might be potentially mislabeled due to the limited sensitivity of the experimental equipment. The positive unlabeled (PU) learning scheme was therefore proposed to enable the classifier to learn directly from limited positive samples and a large number of unlabeled samples (i.e. a mixture of positive or negative samples). To date, several PU learning algorithms have been developed to address various biological questions, such as sequence identification, functional site characterization and interaction prediction. In this paper, we revisit a collection of 29 state-of-the-art PU learning bioinformatic applications to address various biological questions. Various important aspects are extensively discussed, including PU learning methodology, biological application, classifier design and evaluation strategy. We also comment on the existing issues of PU learning and offer our perspectives for the future development of PU learning applications. We anticipate that our work serves as an instrumental guideline for a better understanding of the PU learning framework in bioinformatics and further developing next-generation PU learning frameworks for critical biological applications.
dc.description.statementofresponsibility	Fuyi Li, Shuangyu Dong, André Leier, Meiya Han, Xudong Guo, Jing Xu, Xiaoyu Wang, Shirui Pan, Cangzhi Jia, Yang Zhang, Geoffrey I.Webb, Lachlan J.M. Coin, Chen Li and Jiangning Song
dc.identifier.citation	Briefings in Bioinformatics, 2022; 23(1):bbab461-1-bbab461-13
dc.identifier.doi	10.1093/bib/bbab461
dc.identifier.issn	1467-5463
dc.identifier.issn	1477-4054
dc.identifier.orcid	Li, F. [0000-0001-5216-3213]
dc.identifier.uri	https://hdl.handle.net/2440/139737
dc.language.iso	en
dc.publisher	Oxford University Press (OUP)
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1127948
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1144652
dc.relation.grant	http://purl.org/au-research/grants/arc/LP110200333
dc.relation.grant	http://purl.org/au-research/grants/arc/DP120104460
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1143366
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/1103384
dc.relation.grant	http://purl.org/au-research/grants/nhmrc/GNT1195743
dc.rights	© The Author(s) 2021. Published by Oxford University Press. All rights reserved.
dc.source.uri	https://doi.org/10.1093/bib/bbab461
dc.subject	positive unlabeled learning; semi-supervised learning; machine learning; bioinformatics; pattern recognition
dc.subject.mesh	Computational Biology
dc.subject.mesh	Algorithms
dc.subject.mesh	Supervised Machine Learning
dc.title	Positive-unlabeled learning in bioinformatics and computational biology: A brief review
dc.type	Journal article
pubs.publication-status	Published

Collections

Medicine publications

Positive-unlabeled learning in bioinformatics and computational biology: A brief review

Files

Collections