Optimization of Non-Invasive Prenatal Testing Timing and Quantification of Influencing Factors Based on Gradient Boosting Trees and Survival Analysisa

Authors

  • Yuxuan Li Jinan University-University of Birmingham Joint Institute, Jinan University, Guangzhou, China, 511400
  • Yijia Liu Jinan University-University of Birmingham Joint Institute, Jinan University, Guangzhou, China, 511400
  • Xiyuan Wang Jinan University-University of Birmingham Joint Institute, Jinan University, Guangzhou, China, 511400

DOI:

https://doi.org/10.62051/w3g7cj94

Keywords:

Gradient Boosting Trees; K-means Clustering; Survival Analysis.

Abstract

To enhance the accuracy of non-invasive prenatal testing and safeguard maternal health, this study mathematically models and investigates the relationship between fetal Y chromosome concentration and maternal characteristics, aiming to provide optimal testing timing recommendations. First, exploratory data analysis was conducted to examine the correlation between fetal Y chromosome concentration and maternal gestational age, BMI, and age. The Shapiro-Wilk normality test confirmed non-normal distribution, prompting the use of Spearman's correlation coefficient. Results revealed a weak positive correlation between Y chromosome concentration and gestational week at testing, and weak negative correlations with maternal BMI and age. Subsequently, a gradient-boosted decision tree was employed to construct a relational model. The F-test confirmed that gestational week, maternal BMI, and age all exerted statistically significant effects on Y chromosome concentration. Model evaluation revealed comparable mean squared errors across training and testing datasets, indicating no significant overfitting. To determine the optimal NIPT timing based primarily on maternal BMI, K-means clustering divided pregnant women carrying male fetuses into three BMI clusters: Cluster 0, Cluster 1 , and Cluster 2. Kaplan-Meier survival analysis and log-rank tests confirmed that gestational age significantly influenced Y-chromosome concentration Cluster 1, and Cluster 2. Kaplan-Meier survival analysis and log-rank tests confirmed significant differences in the distribution of Y chromosome detection timepoints across clusters. Ultimately, by constructing a risk-cost function and optimizing its solution, the optimal NIPT testing time points for each cluster were determined: 12.8 weeks for Cluster 0, 13.0 weeks for Cluster 1, and 16.2 weeks for Cluster 2. Sensitivity analysis demonstrated the model's robustness to detection errors, providing scientific evidence for clinical NIPT timing decisions.

Downloads

Download data is not yet available.

References

[1] Veuskens J R B, Rossum V M, Cattenstart E, et al. Common haplotypes within the chromosome 1q31.3 region determine systemic concentrations of the entire complement factor H protein family[J]. Journal of innate immunity, 2025, 21-26.

[2] Kumar A, Kumar S. The Role of X and Y Chromosomes in Semen Morphology and Concentration: A Study in Saran, Bihar, India[J]. Journal of Advances in Biology & Biotechnology, 2025, 28(3): 515-523.

[3] Xiao W, Akao S, Okamoto R, et al. The formation of aggregated chromatin/chromosomes in mouse oocytes treated with high concentration of IBMX as a model for a chromosome transfer in human[J]. Systems biology in reproductive medicine, 2024, 70(1): 195-203.

[4] Soberanis C F, Simpson L E, Beckett J A, et al. Near millimolar concentration of nucleosomes in mitotic chromosomes from late prometaphase into anaphase[J]. The Journal of cell biology, 2024, 223(11).

[5] Cift A, Benlioglu C, Yucel O M, et al. A New Sperm Concentration Threshold for Y Chromosome Microdeletion Analysis in Infertile Men: Could It Be Azoopermia?[J]. Urology research & practice, 2024, 50(3): 181-186.

[6] Delinassios G J, Hoffman M R, Koumakis G, et al. Sub-toxic cisplatin concentrations induce extensive chromosomal, nuclear and nucleolar abnormalities associated with high malignancy before acquired resistance develops: Implications for clinical caution[J]. PloS one, 2024, 19(12): e0311976.

[7] Dwi R, Sofiati P, Agesti V S, et al. Preliminary study of chromosome aberrations using Giemsa, two-colour fish, and micronucleus assays in lymphocytes of individuals living in elevated radon concentration areas[J]. Radiation protection dosimetry, 2023, 199(14): 1508-1515.

[8] Feng Y, C F E S, Hao L, et al. Antifungal Tolerance and Resistance Emerge at Distinct Drug Concentrations and Rely upon Different Aneuploid Chromosomes[J]. mBio, 2023, 14(2): e0022723-e0022723.

[9] Jung S L, Akhil K, Z K Y, et al. Concentration of non-myocyte proteins in arterial media of cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy[J]. PloS one, 2023, 18(2): e0281094-e0281094.

[10] Son T T, Ngoc N N, Hien L T T, et al. Screening Y Chromosome Microdeletion in 1121 Men with Low Sperm Concentration and the Outcomes of Microdissection Testicular Sperm Extraction(mTESE) for Sperm Retrieval from Azoospermic Patients[J]. The application of clinical genetics, 2023, 16: 155-164.

Downloads

Published

09-04-2026

How to Cite

Li, Y., Liu, Y., & Wang, X. (2026). Optimization of Non-Invasive Prenatal Testing Timing and Quantification of Influencing Factors Based on Gradient Boosting Trees and Survival Analysisa. Transactions on Computer Science and Intelligent Systems Research, 12, 175-183. https://doi.org/10.62051/w3g7cj94