Optimization of Non-Invasive Prenatal Testing Timing and Quantification of Influencing Factors Based on Gradient Boosting Trees and Survival Analysisa
DOI:
https://doi.org/10.62051/w3g7cj94Keywords:
Gradient Boosting Trees; K-means Clustering; Survival Analysis.Abstract
To enhance the accuracy of non-invasive prenatal testing and safeguard maternal health, this study mathematically models and investigates the relationship between fetal Y chromosome concentration and maternal characteristics, aiming to provide optimal testing timing recommendations. First, exploratory data analysis was conducted to examine the correlation between fetal Y chromosome concentration and maternal gestational age, BMI, and age. The Shapiro-Wilk normality test confirmed non-normal distribution, prompting the use of Spearman's correlation coefficient. Results revealed a weak positive correlation between Y chromosome concentration and gestational week at testing, and weak negative correlations with maternal BMI and age. Subsequently, a gradient-boosted decision tree was employed to construct a relational model. The F-test confirmed that gestational week, maternal BMI, and age all exerted statistically significant effects on Y chromosome concentration. Model evaluation revealed comparable mean squared errors across training and testing datasets, indicating no significant overfitting. To determine the optimal NIPT timing based primarily on maternal BMI, K-means clustering divided pregnant women carrying male fetuses into three BMI clusters: Cluster 0, Cluster 1 , and Cluster 2. Kaplan-Meier survival analysis and log-rank tests confirmed that gestational age significantly influenced Y-chromosome concentration Cluster 1, and Cluster 2. Kaplan-Meier survival analysis and log-rank tests confirmed significant differences in the distribution of Y chromosome detection timepoints across clusters. Ultimately, by constructing a risk-cost function and optimizing its solution, the optimal NIPT testing time points for each cluster were determined: 12.8 weeks for Cluster 0, 13.0 weeks for Cluster 1, and 16.2 weeks for Cluster 2. Sensitivity analysis demonstrated the model's robustness to detection errors, providing scientific evidence for clinical NIPT timing decisions.
Downloads
References
[1] Veuskens J R B, Rossum V M, Cattenstart E, et al. Common haplotypes within the chromosome 1q31.3 region determine systemic concentrations of the entire complement factor H protein family[J]. Journal of innate immunity, 2025, 21-26.
[2] Kumar A, Kumar S. The Role of X and Y Chromosomes in Semen Morphology and Concentration: A Study in Saran, Bihar, India[J]. Journal of Advances in Biology & Biotechnology, 2025, 28(3): 515-523.
[3] Xiao W, Akao S, Okamoto R, et al. The formation of aggregated chromatin/chromosomes in mouse oocytes treated with high concentration of IBMX as a model for a chromosome transfer in human[J]. Systems biology in reproductive medicine, 2024, 70(1): 195-203.
[4] Soberanis C F, Simpson L E, Beckett J A, et al. Near millimolar concentration of nucleosomes in mitotic chromosomes from late prometaphase into anaphase[J]. The Journal of cell biology, 2024, 223(11).
[5] Cift A, Benlioglu C, Yucel O M, et al. A New Sperm Concentration Threshold for Y Chromosome Microdeletion Analysis in Infertile Men: Could It Be Azoopermia?[J]. Urology research & practice, 2024, 50(3): 181-186.
[6] Delinassios G J, Hoffman M R, Koumakis G, et al. Sub-toxic cisplatin concentrations induce extensive chromosomal, nuclear and nucleolar abnormalities associated with high malignancy before acquired resistance develops: Implications for clinical caution[J]. PloS one, 2024, 19(12): e0311976.
[7] Dwi R, Sofiati P, Agesti V S, et al. Preliminary study of chromosome aberrations using Giemsa, two-colour fish, and micronucleus assays in lymphocytes of individuals living in elevated radon concentration areas[J]. Radiation protection dosimetry, 2023, 199(14): 1508-1515.
[8] Feng Y, C F E S, Hao L, et al. Antifungal Tolerance and Resistance Emerge at Distinct Drug Concentrations and Rely upon Different Aneuploid Chromosomes[J]. mBio, 2023, 14(2): e0022723-e0022723.
[9] Jung S L, Akhil K, Z K Y, et al. Concentration of non-myocyte proteins in arterial media of cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy[J]. PloS one, 2023, 18(2): e0281094-e0281094.
[10] Son T T, Ngoc N N, Hien L T T, et al. Screening Y Chromosome Microdeletion in 1121 Men with Low Sperm Concentration and the Outcomes of Microdissection Testicular Sperm Extraction(mTESE) for Sperm Retrieval from Azoospermic Patients[J]. The application of clinical genetics, 2023, 16: 155-164.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Transactions on Computer Science and Intelligent Systems Research

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.








