Please use this identifier to cite or link to this item:
https://www.arca.fiocruz.br/handle/icict/67546
PREDICTIVE MODELING OF GESTATIONAL WEIGHT GAIN: A MACHINE LEARNING MULTICLASS CLASSIFICATION STUDY
Aprendizado de máquina
Modelos preditivos
Saúde materna
Saúde fetal
Coorte de Araraquara
Machine learning
Prediction models
Maternal health
Fetal health
Araraquara cohort
Author
Affilliation
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
Federal University of Paraná. Department of Statistics. Curitiba, PR, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
Federal University of Paraná. Department of Statistics. Curitiba, PR, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
University of São Paulo. School of Public Health. São Paulo, SP, Brasil.
Abstract
Background Gestational weight gain (GWG) is a critical factor influencing maternal and fetal health. Excessive or insufficient GWG can lead to various complications, including gestational diabetes, hypertension, cesarean delivery, low birth weight, and preterm birth. This study aims to develop and evaluate machine learning models to predict GWG categories: below, within, or above recommended guidelines.
Methods We analyzed data from the Araraquara Cohort, Brazil, which comprised 1557 pregnant women with a gestational age of 19 weeks or less. Predictors included socioeconomic, demographic, lifestyle, morbidity, and anthropometric factors. Five machine learning algorithms (Random Forest, LightGBM, AdaBoost, CatBoost, and XGBoost) were employed for model development. The models were trained and evaluated using a multiclass classification approach. Model performance was assessed using metrics such as area under the ROC curve (AUC-ROC), F1 score and Matthew’s correlation coefficient (MCC).
Results The outcomes were categorized as follows: GWG within recommendations (28.7%), GWG below (32.5%), and GWG above recommendations (38.7%). The XGBoost presented the best overall model, achieving an AUC-ROC of 0.79 for GWG within, 0.76 for GWG below, and 0.65 for GWG above. The LightGBM also performed well with an AUC-ROC of 0.79 for predicting GWG within recommendations, 0.76 for GWG below, and 0.624 for GWG above. The most important predictors of GWG were pre-gestational BMI, maternal age, glycemic profile, hemoglobin levels, and arm circumference.
Conclusion Machine learning models can effectively predict GWG categories, offering a valuable tool for early identification of at-risk pregnancies. This approach can enhance personalized prenatal care and interventions to promote optimal pregnancy outcomes.
Keywords in Portuguese
Ganho de peso gestacionalAprendizado de máquina
Modelos preditivos
Saúde materna
Saúde fetal
Coorte de Araraquara
Keywords
Gestational weight gainMachine learning
Prediction models
Maternal health
Fetal health
Araraquara cohort
Share