Multivariate visual clustering of single nucleotide polymorphisms and clinical predictors using Chernoff faces

Date

2012

Authors

Lee, S.
Lee, S.
Dekker, G.
Roberts, C.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Conference paper

Citation

Proceedings of the 5th Annual Applied Statistics Education and Research Collaboration (ASEARC) Conference, 2012:56-59

Statement of Responsibility

Shalem Lee, Sharon Lee, Gus Dekker, Clair Roberts

Conference Name

Annual Applied Statistics Education and Research Collaboration (5th : 2012 : Wollongong, New South Wales)

Abstract

With advanced technology, collection of health-related data in undertaken on a large scale, producing large and high-dimensional data. Visualization of such data is important and useful for further statistical analyses such as classification and clustering. However, visualizing large multivariate datasets is challenging, especially for high dimensional data, as they are often complex and confounded. Currently, visualization for Single Nucleotide Polymorphisms (SNPs) and clinical predictors of disease are assessed separately. As there is increasing evidence of genetic-environmental interactions for pregnancy complications, prediction models based solely on either clinical measurements or genetic risk factors may be inadequate. Hence, we present an example of multivariate visualization on combinations of clinical measurements and SNPs through Chernoff faces, and perform visual clustering for prediction of Preterm births (PTB). A random sample containing 100 patients (Uncomplicated pregnancy = 92, PTB=8) with 11 clinical and 4 genetic predictors are visualized into faces with various style of eyes, ears, nose and hair, showing two groups with similar face characteristics amongst Uncomplicated pregnancies and Preterm births. The faces identified as PTB appear to have either a tall hair style or no ears, which correspond to whether the mother was herself born preterm, and SNPs in TGFβ and IL1β genes.

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

Copyright status unknown

License

Grant ID

Call number

Persistent link to this record