Enhancing Socioeconomic Status Prediction for Cavities: A Hybrid Method

Files

hdl_147896.pdf (1.02 MB)
  (Published version)

Date

2025

Authors

Dao, A.T.M.
Do, L.G.
Stormon, N.
Nguyen, H.V.
Ha, D.H.

Type

Journal article

Citation

Journal of Dental Research, 2025; 104(9):947-954

Statement of Responsibility

A.T.M. Dao, L.G. Do, N. Stormon, H.V. Nguyen, and D.H. Ha

Abstract

Socioeconomic status (SES) measures one’s access to social resources across various dimensions. Studies on SES commonly use principal component analysis (PCA), a data-driven method, to condense these dimensions into components, typically selecting the first component to represent SES. However, PCA may lack specificity for particular outcomes. Decision tree analysis (DTA), a knowledge-driven approach that identifies outcome-specific dimensions, may address PCA’s weaknesses but might not comprehensively capture SES. This study hypothesized that combining DTA and PCA to create SES predictors could improve predictive accuracy beyond that of PCA alone. It also explored whether the DTA-PCA combination, incorporating only significant loading indicators (SLIs) of the first component, could simplify SES predictors without compromising predictive accuracy. The study analyzed 12 SES indicators from the Study of Mothers’ and Infants’ Life Events Affecting Oral Health (SMILE) birth cohort study, involving 2,182 children. Five SES composites were created: 1 built solely from DTA-identified indicators and 2 pairs built from either the entire first PCA component or its SLIs, each with and without DTA. These composites served as predictors of dental caries in 5 predictive models. Model accuracy was evaluated using root mean squared error with 5-fold cross-validation. SES composites derived from the DTA-PCA combination demonstrated superior predictive accuracy compared with those from the PCA-only approach. By incorporating only SLIs, this hybrid method generated SES predictors that not only outperformed those using the entire first component but also demonstrated noninferiority relative to the DTA-only method. This approach offers a promising framework for developing SES composites to predict dental caries, potentially improving the precision of predictive models.
In addition, this method offers a practical framework for creating composite predictors from multi-item measurements across various outcomes. For future research using this method, a 3-step process is recommended: (1) identify relevant items using DTA, (2) determine their weights through PCA, and (3) generate a composite using the SLIs.
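
The 3-step process above can be sketched in Python roughly as follows. This is a minimal illustration on synthetic data, not the authors' code: the simulated indicators and outcome, the regression tree settings, the 0.3 loading cutoff used to define SLIs, the choice to run PCA only on the DTA-selected indicators, and the linear model used for evaluation are all assumptions made for the example. It relies on NumPy and scikit-learn.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_val_score
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in data (NOT the SMILE cohort): 500 children, 6 indicators.
rng = np.random.default_rng(0)
n = 500
latent = rng.normal(size=n)  # hypothetical underlying SES
X = np.column_stack([latent + rng.normal(scale=s, size=n)
                     for s in (0.5, 0.7, 1.0, 1.5, 3.0, 3.0)])
y = -1.0 * latent + rng.normal(scale=1.0, size=n)  # hypothetical caries score

Z = StandardScaler().fit_transform(X)  # standardize indicators before PCA

# Step 1 (DTA): keep the indicators that the fitted tree actually splits on.
tree = DecisionTreeRegressor(max_depth=3, min_samples_leaf=20, random_state=0)
tree.fit(Z, y)
dta_idx = np.flatnonzero(tree.feature_importances_ > 0)

# Step 2 (PCA): first-component loadings of the selected indicators act as weights.
pca = PCA(n_components=1).fit(Z[:, dta_idx])
loadings = pca.components_[0]

# Step 3 (SLIs): keep loadings above an assumed cutoff and form the composite.
SLI_CUTOFF = 0.3  # assumption for illustration only
sli_mask = np.abs(loadings) >= SLI_CUTOFF
composite = Z[:, dta_idx][:, sli_mask] @ loadings[sli_mask]

def cv_rmse(predictor, outcome):
    """Mean root mean squared error over 5-fold cross-validation."""
    cv = KFold(n_splits=5, shuffle=True, random_state=0)
    scores = cross_val_score(LinearRegression(), predictor.reshape(-1, 1), outcome,
                             cv=cv, scoring="neg_root_mean_squared_error")
    return -scores.mean()

rmse = cv_rmse(composite, y)
print("DTA-selected indicators:", dta_idx.tolist())
print("Indicators kept as SLIs:", dta_idx[sli_mask].tolist())
print("5-fold CV RMSE:", round(float(rmse), 3))
```

For brevity, the indicator selection and PCA weights here are fitted on the full sample before cross-validation. A stricter evaluation would refit steps 1 to 3 inside each training fold so that the held-out fold does not influence the composite.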

Rights

© The Author(s) 2025. This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
