Statistical Models for Missing Data in Proteomic Studies of Gastric Cancer

Kon, Daniel Dean

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/120609

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Glonek, Gary	-
dc.contributor.advisor	Stanford, Tyman	-
dc.contributor.author	Kon, Daniel Dean	-
dc.date.issued	2019	-
dc.identifier.uri	http://hdl.handle.net/2440/120609	-
dc.description.abstract	Disease diagnosis is often performed using a blood test for protein biomarkers which exhibit differential expression in diseased subjects as compared to healthy subjects. Discovery of new biomarkers enables cheaper and less invasive diagnosis. One method of biomarker discovery is the statistical analysis of proteomic mass spectrometry data to determine differences in protein concentration between groups of organisms. However, outcome-dependent missingness in proteomic mass spectrometry data hinders the extraction of useful information from the data and results in biased inference about these differences in protein expression. Existing methods of accounting for missing data, used for other, similar datasets such as those from RNA microarray experiments, assume missingness that is less severe and less outcome-dependent than that which affects proteomic mass spectrometry data. These methods do not suffice to remove the bias, and new methods of statistical analysis are sought for biomarker discovery. We develop a joint statistical model for missing and observed data and apply it to a dataset from a gastric cancer experiment that has a large degree of outcome-dependent missingness in order to discover novel candidate biomarkers. A set of candidates is produced using the joint model. This set differs from the set of biomarker candidates produced in earlier work that modelled the data without accounting for the outcome-dependent missingness.	en
dc.language.iso	en	en
dc.subject	Missing data	en
dc.subject	mixed effects models	en
dc.subject	selection model	en
dc.subject	joint model	en
dc.subject	proteomics	en
dc.subject	MALDI	en
dc.subject	mass spectrometry	en
dc.title	Statistical Models for Missing Data in Proteomic Studies of Gastric Cancer	en
dc.type	Thesis	en
dc.contributor.school	School of Mathematical Sciences	en
dc.provenance	This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals	en
dc.description.dissertation	Thesis (MPhil) -- University of Adelaide, School of Mathematical Sciences, 2019	en
Appears in Collections:	Research Theses
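
The abstract's claim that outcome-dependent missingness biases estimated differences in protein expression can be illustrated with a small simulation. This is only a sketch under stated assumptions and is not the joint model developed in the thesis: the normal intensities, the logistic missingness mechanism, and every name and parameter value below are hypothetical choices made for illustration.

```python
import math
import random
import statistics


def simulate(n=20000, true_diff=1.0, seed=0):
    """Return (full-data, complete-case) differences in group mean intensity.

    Assumption: intensities are normal with unit variance, and the chance
    that a value is recorded rises with the value itself, so missingness
    depends on the outcome.
    """
    rng = random.Random(seed)

    def p_observed(y):
        # Hypothetical mechanism: low intensities are more often missing.
        return 1.0 / (1.0 + math.exp(-2.0 * y))

    healthy = [rng.gauss(0.0, 1.0) for _ in range(n)]
    diseased = [rng.gauss(true_diff, 1.0) for _ in range(n)]

    # Keep each value with a probability that depends on the value.
    healthy_obs = [y for y in healthy if rng.random() < p_observed(y)]
    diseased_obs = [y for y in diseased if rng.random() < p_observed(y)]

    full = statistics.fmean(diseased) - statistics.fmean(healthy)
    complete_case = statistics.fmean(diseased_obs) - statistics.fmean(healthy_obs)
    return full, complete_case


if __name__ == "__main__":
    full, complete_case = simulate()
    print(f"difference using all values:      {full:.3f}")
    print(f"difference using observed values: {complete_case:.3f}")
```

Under these assumptions the complete-case difference is attenuated relative to the full-data difference, because the lower-intensity group loses more of its low values. A joint model of the observed values and the missingness indicators is one way to account for this.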

Files in This Item:

File	Description	Size	Format
Kon2019_Mphil.pdf		7.47 MB	Adobe PDF

Adelaide Research & Scholarship