Please use this identifier to cite or link to this item:
https://hdl.handle.net/2440/120609
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Glonek, Gary | - |
dc.contributor.advisor | Stanford, Tyman | - |
dc.contributor.author | Kon, Daniel Dean | - |
dc.date.issued | 2019 | - |
dc.identifier.uri | http://hdl.handle.net/2440/120609 | - |
dc.description.abstract | Disease diagnosis is often performed using a blood test for protein biomarkers which exhibit differential expression in diseased subjects as compared to healthy subjects. Discovery of new biomarkers enables cheaper and less invasive diagnosis. A method of biomarker discovery is the statistical analysis of proteomic mass spectrometry data to determine differences in protein concentration between groups of organisms. However, outcome-dependent missingness in proteomic mass spectrometry data hinders the extraction of useful information from the data and results in biased inference about these differences in protein expression. Existing methods of accounting for missing data, used for other, similar datasets such as those from RNA microarray experiments, assume missingness that is less severe and outcome-dependent than that which affects proteomic mass spectrometry data. These methods do not suffice to undo the bias, and new methods of statistical analysis are sought for biomarker discovery. We develop a joint statistical model for missing and observed data and apply it to a dataset from a gastric cancer experiment that has a large degree of outcome-dependent missingness in order to discover novel candidate biomarkers. A set of candidates is produced using the joint model. This set differs from the set of biomarker candidates produced in earlier work modelling the data without accounting for the outcome-dependent missingness. | en |
dc.language.iso | en | en |
dc.subject | Missing data | en |
dc.subject | mixed effects models | en |
dc.subject | selection model | en |
dc.subject | joint model | en |
dc.subject | proteomics | en |
dc.subject | MALDI | en |
dc.subject | mass spectrometry | en |
dc.title | Statistical Models for Missing Data in Proteomic Studies of Gastric Cancer | en |
dc.type | Thesis | en |
dc.contributor.school | School of Mathematical Sciences | en |
dc.provenance | This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals | en |
dc.description.dissertation | Thesis (MPhil) -- University of Adelaide, School of Mathematical Sciences, 2019 | en |
Appears in Collections: | Research Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Kon2019_Mphil.pdf | 7.47 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.