bims-aukdir Biomed News
on Automated knowledge discovery in diabetes research
Issue of 2025–12–21
nineteen papers selected by
Mott Given



  1. Health Inf Sci Syst. 2026 Dec;14(1): 13
      Diabetic Retinopathy (DR) is a leading cause of vision loss among working-age individuals. Early detection can reduce the risk of vision loss by up to 95%, yet a shortage of retinologists and logistical challenges often delay the DR detection. Artificial Intelligence (AI) systems using Retinal Fundus Photographs (RFPs) present a promising solution. However, their clinical adoption is often hindered by issues such as low-quality data, model biases, learning of spurious features or lack of external validation. To address these challenges, we developed RAIS-DR, a Responsible AI System for DR screening that incorporates ethical principles across the AI lifecycle. RAIS-DR integrates efficient convolutional models for preprocessing, quality assessment, and three specialized DR classification models. We evaluated RAIS-DR against the FDA-approved EyeArt system on a local dataset of 1046 patients, unseen by both systems. Results are reported for two clinically relevant referral criteria: Referable DR (RDR) and All-Cause Referable (ACR), the latter including low-quality or ungradable images. Evaluations were conducted both per patient and per image. RAIS-DR demonstrated performance improvements in patient-level referral: for RDR, F1-score, accuracy, and specificity increased by 12, 19, and 20%, respectively; for ACR, the corresponding increases were 5, 6, and 10%. RAIS-DR demonstrated equitable performance across demographic subgroups, with Disparate Impact (DI) values between 0.984 and 1.031 and Equal Opportunity Difference (EOD) values near zero. Model explainability helped identify a clinical limitation: false positives were linked to patients with a history of LASER treatment. These findings position RAIS-DR as a robust, reproducible, responsible, and clinically viable solution for DR screening.
    Keywords:  Diabetic retinopathy; Fairness; Responsible AI system
    DOI:  https://doi.org/10.1007/s13755-025-00405-y
  2. Front Clin Diabetes Healthc. 2025 ;6 1697769
       Introduction: Prediabetes is a highly prevalent metabolic condition that significantly increases the risk of developing type 2 diabetes and cardiovascular disease. Despite its clinical importance, over 80% of individuals with prediabetes remain undiagnosed. Voice analysis has emerged as a non-invasive, accessible method for disease screening, with prior work showing promising results in detecting hypertension and type 2 diabetes from acoustic features. This study investigates whether voice-based machine learning models can identify individuals with prediabetes and evaluates the generalizability of these models across populations.
    Methods: Participants were recruited from clinical sites in India and a community college in Canada. All participants recorded the same spoken phrase multiple times daily via a mobile app, and glycemic status was assessed using HbA1c levels. Voice recordings were preprocessed to remove silence and trimmed to exclude potentially uninformative sections. A total of 167 acoustic features were extracted from each sample using Librosa, scipy, and parselmouth. Features were averaged per participant. Sex-specific models were developed under six experimental configurations varying by dataset balance (age/BMI-matched vs. unbalanced) and BMI inclusion. Feature selection was conducted using L1-regularized logistic regression (LASSO), and SMOTE was applied during training to address class imbalance. Twelve machine learning classifiers were evaluated using leave-one-subject-out cross-validation (LOSO-CV) on the India dataset. Final models were tested on a holdout India subset and the independent Canada dataset.
    Results: In cross-validation, the best female model (XGBoost, balanced, no BMI) achieved a balanced accuracy of 0.78, and the best male model (Random Forest, balanced, no BMI) achieved 0.68. However, holdout set testing identified different optimal configurations for generalization: the male XGBoost model trained on an unbalanced dataset outperformed the cross-validated model. In the Canada dataset, models failed to generalize effectively, with several configurations unable to correctly identify prediabetic participants.
    Discussion: Voice-based prediction models show potential for prediabetes screening in controlled populations, but their performance declines when applied across geographic or demographic boundaries. These findings highlight the need for more diverse training data and population-specific model tuning to support real-world applicability.
    Keywords:  prediabetes; type 2 diabetes; vocal biomarker; voice; voice signal analysis
    DOI:  https://doi.org/10.3389/fcdhc.2025.1697769
  3. NPJ Digit Med. 2025 Dec 19.
      To clarify the real-world performance of regulator-approved deep-learning (DL) systems for autonomous diabetic retinopathy (DR) screening, we systematically searched PubMed, Embase, and ClinicalTrials.gov to 3 April 2025, identifying 82 studies (887,244 examinations) covering 25 devices in 28 countries. Hierarchical bivariate meta-analysis yielded pooled sensitivity/specificity of 0.93/0.90 on a per-patient basis and 0.92/0.93 per eye, closely paralleling expert grading. Meta-regression showed that DR severity threshold, national-income level, image gradability, pupil dilation, reference standard, and diagnostic criteria collectively explained most between-study heterogeneity; any-DR screening, low-income settings, or ungradable images increased false-positive rates, whereas dilated pupils, portable cameras, and adjudicated references improved specificity. Publication bias was minimal. Overall, regulator-approved DL algorithms provide accurate, scalable DR detection, but programs must tailor deployment and reimbursement to disease threshold, image quality, and local resources, and post-market audits with standardized gradability metrics are needed to ensure safe, equitable global adoption.
    DOI:  https://doi.org/10.1038/s41746-025-02223-8
  4. Cureus. 2025 Nov;17(11): e96554
      Diabetic retinopathy (DR) is a leading cause of preventable blindness worldwide, with a disproportionate impact in low- and middle-income countries (LMICs). Artificial intelligence (AI) offers a potential means to address workforce and infrastructure gaps that limit access to DR screening in these settings, but evidence on its performance and feasibility remains scattered. A systematic review of studies published between January 2015 and June 2025 was conducted using six databases. Eligible studies evaluated AI, machine learning, or deep learning applied to retinal imaging for DR detection and reported quantitative diagnostic or implementation outcomes, while studies limited to high-income countries or non-original research were excluded. Only a small number of eligible studies were identified. Across these, AI-based tools generally showed high diagnostic accuracy and were feasible to implement in resource-limited environments. Early evidence suggested potential benefits, such as reduced screening costs, decreased clinician workload, and improved patient follow-up, though reporting on infrastructure needs, regulatory considerations, and long-term sustainability was limited. Overall, AI-based tools show promise for scaling DR screening in LMICs, with encouraging indications of good accuracy and operational efficiency, but further large-scale and implementation-focused research is required to guide their integration into health systems.
    Keywords:  artificial intelligence; cost-effectiveness; diabetic retinopathy; health systems; implementation feasibility; low- and middle-income countries (lmics); screening
    DOI:  https://doi.org/10.7759/cureus.96554
  5. Sci Rep. 2025 Dec 14.
      Diabetic Retinopathy (DR) is a leading cause of vision-threatening conditions worldwide. In clinical settings, the automated identification of DR using Multicolor Imaging (MCI) is critical for assisting ophthalmologists in the timely diagnosis and management of the condition. Deep Learning (DL) methods have been developed to automatically grade DR, enabling ophthalmologists to design personalized treatment plans for patients. However, a significant research gap remains in the application of DL for MCI analysis, particularly in leveraging its multimodal feature representations. This study proposes a novel approach, the Multimodal Network Incorporating Information Bottleneck (MNIIB), specifically designed for DR classification using MCI. Unlike previous approaches that primarily apply Information Bottleneck (IB) theory to model input-label relationships, our MNIIB framework explicitly employs IB principles to analyze and optimize the interactions between the different imaging modalities within MCI. It integrates features extracted from multiple modalities and incorporates an IB mechanism to identify and refine the shared information across modalities, thereby compressing redundant data and enhancing the extraction of diagnostically relevant features. The effectiveness of the proposed network was validated through extensive testing on a retinal image dataset, achieving an accuracy of 95.9%. These results highlight the potential of the MNIIB as a reliable diagnostic tool for the early and accurate detection of DR.
    Keywords:  Deep learning; Diabetic Retinopathy grading; Information bottleneck; Multicolor image; Multimodal classification
    DOI:  https://doi.org/10.1038/s41598-025-31526-9
  6. Sci Rep. 2025 Dec 19.
      Diabetes is one of the major health challenges in today's world, since chronic elevation of blood sugar can cause serious and sometimes irreparable damage to organs such as the heart, kidneys, and nervous system. Early detection of this disease plays a vital role in reducing its complications. However, machine learning and deep learning models often face distrust in medical settings due to their opaque, "black-box" nature. The aim of this study was to combine three machine learning algorithms using stacking and voting methods to propose a model for type 2 diabetes detection, and to increase transparency by using the explainability techniques LIME and SHAP to identify important features. This study used medical data from 768 Pima Indians Diabetes samples, including 8 features such as age, BMI, glucose, insulin, blood pressure, skin thickness, pregnancies, and family history. Data preprocessing included mean imputation for missing or zero values, Min-Max normalization, and classification into "Normal", "Prediabetes", and "Diabetes" based on fasting glucose thresholds. Feature selection was performed using Spearman correlation to retain the most relevant variables. A hybrid machine learning model was developed using three base models Neural Network (NN), k-Nearest Neighbors (KNN), and Random Forest (RF) with automated hyperparameter tuning. The outputs of these models were combined via stacking using a logistic regression (LR) meta-model and in parallel using a soft voting method. Nested cross-validation (5 outer and 5 inner folds) was applied to prevent data leakage and ensure robust evaluation. Model interpretability was assessed using LIME for local explanations and SHAP for global feature importance. Decision thresholds and influential feature regions were identified, and model calibration and decision curves evaluated clinical reliability. Models performance was evaluated using accuracy, precision, recall, specificity, F1-score, AUROC, Brier Score (1-B), and Expected Calibration Error (1-E). Statistical reliability was assessed using bootstrap resampling to compute 95% confidence intervals, as well as paired tests to compare the hybrid model with the base models and voting ensemble. Based on the evaluation metrics, the stacking ensemble achieved perfect performance for Class 0, with 100% accuracy, precision, recall, specificity, F1 score, and AUROC, alongside the highest calibration metrics (Brier Score: 99.9, ECE: 98.7). The Random Forest model also excelled, achieving 100% accuracy, precision, recall, specificity, and F1 score for Class 0 and Class 2. In contrast, the KNN model consistently underperformed, particularly for Class 0 (F1: 83.3, Precision: 83.3, Recall: 83.3). The Neural Network demonstrated strong recall for Class 0 (100%), while the voting ensemble showed balanced results but was slightly outperformed by the top ensemble methods. Explainable AI analyses using LIME and SHAP revealed that glucose was the most influential predictor for identifying the Pre-diabetes state. Both methods consistently identified a decision band between 0.35 and 0.47 (corresponding to 100-125 mg/dL) as the transition zone between "Normal" and "Prediabetes", confirming the model's alignment with WHO/ADA diagnostic criteria. The stacking model achieved perfect performance and superior calibration, outperforming all other models in type 2 diabetes prediction and classification. Explainability techniques (LIME and SHAP) identified glucose level, body mass index, and blood pressure as key predictive factors. This approach provides an accurate and interpretable tool for clinical decision support in healthcare systems.
    Keywords:  Artificial intelligence; Prediction; Transparent ensemble learning; Type 2 diabetes
    DOI:  https://doi.org/10.1038/s41598-025-31562-5
  7. Commun Med (Lond). 2025 Dec 20.
       BACKGROUND: Labelled data scarcity and class imbalance are common deep learning system (DLS) development challenges. We investigated if synthetic retinal images from a conditional cascaded diffusion model (CCDM) improves prognostic DLS (pDLS) performance for 2-year incident referable diabetic retinopathy or maculopathy (rDR/rM) prediction.
    METHODS: Macula images from 72,559 eyes (September 2013 to December 2019) from the UK South-East London Diabetic Eye Screening Programme (SEL-DESP) formed the development dataset, whilst 9,071 eyes were used for internal testing. Images from 2,842 eyes from Birmingham DESP were used for external testing. Prognostic DLS were augmented with ×1, ×2, and ×4 additional synthetic positive cases (pDLS-G) and compared to unaugmented (pDLS-N) and ×1 positive-case resampled pDLS (pDLS-R) using the Area-Under-the Receiver Operating Characteristic curve (AUROC).
    RESULTS: Here we show that CCDM generate realistic synthetic retinal images that are comparable to real images and demonstrate the utility of synthetic retinal images in augmenting the development of a pDLS. The internal and external test AUROC for the pDLS are 0.827 (95% CI: 0.794-0.861) and 0.756 (0.680-0.831), respectively. Augmentation with ×2 additional synthetic positive cases (pDLS-G ×2) significantly improves the internal test AUROC to 0.845 (95% CI: 0.812-0.877, p = 0.044) but does not improve the external test AUROC 0.717 (0.633-0.828, p = 0.243). Resampling positive real cases alone does not improve pDLS-R performance.
    CONCLUSIONS: Augmenting pDLS with synthetic retinal images significantly improves pDLS performance on internal testing but not external testing suggesting further research is required to enhance the generalisability of synthetic retinal image augmentation.
    DOI:  https://doi.org/10.1038/s43856-025-01316-5
  8. Indian J Tuberc. 2025 Dec;pii: S0019-5707(25)00246-X. [Epub ahead of print]72 Suppl 3 S106-S114
      Having both diabetes and tuberculosis (TB) at the same time is a big public health problem because they affect each other in two ways: they make people more likely to get the diseases and they make the diseases worse. Active tuberculosis is more likely to happen if you have diabetes, and diabetes can make it harder to control your blood sugar. This is a complicated relationship that isn't always taken into account in population-level risk estimates. Diabetes and tuberculosis are usually looked at as two separate diseases in traditional risk prediction methods, which don't take into account the complex relationships between the two that cause them to happen together. To fill this gap, this study suggests using machine learning (ML) models to figure out the risk of diabetes and tuberculosis together in different groups of people. To get a full picture of risk factors, many types of data are used, such as clinical, demographic, test, imaging, and socioeconomic data. For better prediction, the method uses a variety of machine learning models, such as logistic regression, random forest, XGBoost, and deep learning structures, along with ensemble and mixed learning strategies. A multitask learning structure is added so that risk factors for both diabetes and tuberculosis can be modelled at the same time. To compare how well a model works, evaluation measures like recall, F1-score, accuracy, and AUC-ROC are used. Cross-validation makes sure that the results are robustly generalised. The study also talks about problems, such as different types of data, a lack of population-level information, and the need for AI-driven healthcare estimates to be able to be explained. Pilot case studies in both rural and urban settings show that adding machine learning (ML) models to electronic health record (EHR) systems could help with early diagnosis and preventative screening.
    Keywords:  Comorbidity prediction; Diabetes; Ensemble models; Machine learning; Multi-task learning; Tuberculosis
    DOI:  https://doi.org/10.1016/j.ijtb.2025.11.014
  9. NPJ Metab Health Dis. 2025 Dec 17. 3(1): 48
      Prediabetes can progress to type 2 diabetes (T2D), but individual risk varies widely. Few studies have rigorously characterized subgroups at the point of prediabetes (PD) onset. Using electronic health records (EHRs), we developed a machine learning approach to stratify PD and analyze T2D progression risk. We defined PD onset based on strict HbA1c criteria and excluded patients with missing follow-ups or atypical clinical events, yielding a high-fidelity cohort of 14,436 patients from an initial pool of 74,054 (2017-2023, MedStar Health). An XGBoost model using routine features, including HbA1c, BMI, blood pressure, lipids, ALT, medication history, and lifestyle factors, was trained on 2018-2020 data and tested on 2021-2022 patients, achieving an AUC of 81.6%. Risk scores enabled subtyping into high-, medium-, and low-risk groups with distinct progression trajectories. Stratification patterns remained consistent in future cohorts. This approach supports earlier, personalized intervention and diabetes risk prediction using real-world EHR data.
    DOI:  https://doi.org/10.1038/s44324-025-00091-0
  10. Sci Rep. 2025 Dec 13.
      Diabetic foot ulcers (DFU) are a highly dangerous and even fatal sequela of diabetes mellitus, very often becoming a lower limb amputation and imposing significant morbidity and mortality all over the world. Deep learning and machine learning have recently made breakthroughs, creating potential for being applied in medical image analysis towards the detection of DFU, but the field still faces the challenge of optimizing the performance of the models in both terms of accuracy and interpretability. The present paper solves this issue by introducing a more optimized deep learning model to detect DFUs that has been proposed by integrating a proprietary Convolutional Neural Network (CNN) structure and a Genetic Algorithm (GA) based on ensembles. Seven different optimizers were distinctly trained by the custom CNN. The genetic algorithm (GA) was then applied to pool a final ensemble model out of individual models with the most remarkable performance. This GA based ensemble had an accuracy of 97%, a precision of 95%, a recall of 99%, and an F1 score of 97%, which showed better and stronger performance than the individual single models that made up it. In addition, the study implemented Grad-CAM (Gradient-weighted Class Activation Mapping) to increase interpretation and transparency. The results obtained demonstrated that Grad-CAM was able to accurately describe the features associated with ulcers and, therefore, offers invaluable information on how the model makes its decisions to healthcare professionals.
    DOI:  https://doi.org/10.1038/s41598-025-30532-1
  11. Diabetes Obes Metab. 2025 Dec 15.
       INTRODUCTION: Diabetic kidney disease (DKD) and diabetic nephropathy (DN) affect around 40% of diabetic patients but lack accurate risk prediction tools that include social determinants and demographic complexity. We developed and validated an ensemble machine learning model for three-year DKD/DN risk prediction with deployment readiness.
    METHODS: We analysed 18 742 eligible adult type 2 diabetic patients from Prince Sultan Military Medical City (PSMMC) registry between 2019 and 2024 in Riyadh, Saudi Arabia. Using temporal patient-level splitting, we developed a stacked ensemble model (LightGBM + CoxBoost) with several features including multiple literature-informed imputed variables including family history, non-steroidal anti-inflammatory drug (NSAID) use, socioeconomic deprivation, diabetic retinopathy severity, and antihypertensive medications, imputed via Bayesian multiple imputation by chained equations (MICE) with external study priors. Primary outcome was incident/progressive DKD/DN within 3 years' timeframe. We assessed discrimination, calibration, model utilisation, and algorithmic fairness.
    RESULTS: The final model achieved excellent discrimination (receiver operating characteristic [AUROC] of 0.852, 95% CI 0.847-0.857) and near-perfect calibration (slope 0.98, intercept -0.012) on multi-trial validation. Decision curve evaluation demonstrated superior net benefit (+22 events prevented per 1000 patients at 10% threshold) compared to treat-all strategies. Bootstrap validation showed minimal optimism in discrimination (C-statistic optimism = 0.005). No algorithmic bias was detected across demographic subgroups (maximum |Δ-AUROC| = 0.010). Prior sensitivity analysis confirmed validity and significance (AUROC variation ≤0.008). The model was engineered and deployed as an interactive web-based application (https://nephrarisk.streamlit.app/).
    CONCLUSIONS: Our developed and demonstrated model provided accurate and well-fair DKD/DN risk prediction with excellent calibration, allowing for better decision making with deployment as a web-based research tool and framework for future prospective clinical validation. Further validation and testing are warranted from different centres and healthcare systems to increase confidence and dissemination of our model findings for better utilisation purposes in the future.
    Keywords:  diabetes; diabetic kidney disease; diabetic nephropathy; glycaemic control; renal functions
    DOI:  https://doi.org/10.1111/dom.70385
  12. Front Endocrinol (Lausanne). 2025 ;16 1651493
       Objective: To construct and validate a clinical model to predict painful diabetic peripheral neuropathy (PDPN) risk in type 2 diabetes mellitus (T2DM) patients for early identification and intervention in primary care.
    Methods: A total of 1,984 patients with T2DM were included in the analysis. After data preprocessing and application of the Synthetic Minority Oversampling Technique (SMOTE) with a 200% oversampling ratio, feature selection was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression with 10-fold cross-validation. Six predictive models: multivariable logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), artificial neural network (ANN), and support vector machine (SVM)-were developed and tuned using repeated 5-fold cross-validation. Model performance was evaluated on the independent test cohort using comprehensive discrimination and calibration metrics. To enhance clinical interpretability, a nomogram and SHapley Additive exPlanations (SHAP) analysis were implemented to visualize predictor contributions.
    Results: Ten variables were selected as predictors. Among 1,984 patients, 81 (4.08%) had PDPN. LR model demonstrated the most favorable trade-off for screening purposes, with an area under the receiver operating characteristic curve (AUC-ROC) of 0.894 (95% CI: 0.814-0.964), area under the precision-recall curve (PR-AUC) of 0.470 (95% CI: 0.258-0.665), and balanced accuracy of 0.826 (95% CI: 0.667-0.932). SHAP analysis identified musculoskeletal disorders and HbA1c as the most influential predictors. A user-friendly dynamic web-based nomogram was constructed to support clinical implementation.
    Conclusion: We established and validated a clinically interpretable model for PDPN risk prediction in patients with T2DM. The dynamic nomogram enables individualized risk estimation and may assist timely intervention in routine practice.
    Keywords:  SHAP analysis; machine learning; multicenter retrospective study; multivariable logistic regression; web-based nomogram
    DOI:  https://doi.org/10.3389/fendo.2025.1651493
  13. J Imaging Inform Med. 2025 Dec 15.
      Diabetic retinopathy (DR) is the leading cause of blindness among the global working population. It is crucial to early screening and timely diagnosis in mitigating vision deterioration and preventing permanent blindness. The varying sizes and complex structures of different types of DR lesions present challenges for accurate grading. However, existing models suffer from long training time and insufficient extraction of small lesion features, which makes it difficult to learn the subtle differences between adjacent grades of DR images. This paper proposes the AttMamba framework for automatic DR grading, which integrates an attention mechanism with the state space model to improve training efficiency and the ability to extract small lesion features. The efficient 2D scanning module is integrated with VMamba to scan the whole image with fixed step sizes to accelerate the computation speed. Moreover, the squeeze-and-excitation module and global context module are proposed to locate regions of interest and extract channel attention and global attention features. It can learn the fine-grained differences between different grades of DR images for precise grading. Experiments are conducted on three public and one personal datasets. The accuracy of the proposed method achieved 0.832/0.789/0.756/0.889 on APTOS 2019, DDR, FGADR, and RUDR datasets, which improved by 7.4%/9.6%/17.2%/8.5% in accuracy compared to VMamba. The proposed model provides an effective solution for early diagnosis and timely treatment of diabetic retinopathy.
    Keywords:  Attention mechanism; Diabetic retinopathy; Disease classification; State space model; VMamba
    DOI:  https://doi.org/10.1007/s10278-025-01774-2
  14. Neural Netw. 2025 Dec 05. pii: S0893-6080(25)01300-0. [Epub ahead of print]196 108419
      Diabetic retinopathy, which is a retinal disease that results from diabetes, has become the leading cause of blindness. Early diagnose of diabetic retinopathy is crucial for preserving vision and saving lives. To date, several pioneering methods have been proposed for diagnosing diabetic retinopathy and have achieved preliminary results. However, several limitations persist, including the inability to detect diabetic retinopathy at various scales and low accuracy in large-scale real scenarios. Therefore, in this paper, we propose second-order optimization differential architecture search (SODAS) for predicting diabetic retinopathy. Firstly, we utilize neural architecture search to grade diabetic retinopathy to overcome the limitations of neglecting retinopathy at various scales caused by manually designing networks. Then, we integrate Gumbel-Softmax sampling into neural architecture search to encompass all the operation information during normalization, with the aims of reducing gradient information loss and improving accuracy. Additionally, we design second-order optimization to mitigate slow convergence associated with classical neural architecture search, which has been proven to converge quickly by detailed analysis and extensive results. Experimental results on benchmark datasets show that our SODAS has achieved average improvements of 12.1 %, 21.6 %, 28.8 %, 24.1 %, and 22.5 % in terms of accuracy, Cohen's kappa, AUC, IBA, and F1-score, respectively.
    Keywords:  Diabetic retinopathy prediction; High convergence; Neural architecture search; Second-order optimization
    DOI:  https://doi.org/10.1016/j.neunet.2025.108419
  15. J Transl Med. 2025 Dec 16. 23(1): 1396
       BACKGROUND: While ultra-widefield fluorescein angiography (UWF-FA) is essential for evaluating retinal vascular pathology in diabetic retinopathy (DR), its invasive nature limits its clinical application. This study aimed to develop and evaluate UWFDR-GAN, a generative adversarial network (GAN) framework for translating ultra-widefield color fundus photography (UWF-CFP) into UWF-FA specifically for DR patients.
    METHODS: A total of 270 paired UWF-CFP and UWF-FA images were collected from patients with DR, comprising 73 pairs of mild non-proliferative diabetic retinopathy (NPDR), 47 pairs of moderate NPDR, 82 pairs of severe NPDR, and 68 pairs of proliferative diabetic retinopathy (PDR). We first employed a self-supervised keypoint detection framework for precise cross-modal image registration. The generation network incorporated discrete wavelet transform/inverse transform (DWT/IDWT) to preserve high-frequency details and a Swin Transformer-based multi-scale discriminator to enhance structural realism. We quantitatively compared the performance of our model against several state-of-the-art methods, including pix2pix, pix2pixHD, and UWAFA-GAN, using objective evaluation metrics: the Multi-Scale Structural Similarity Index Measure (MS-SSIM), Peak Signal-to-Noise Ratio (PSNR), Fréchet Inception Distance (FID), and Inception Score (IS).
    RESULTS: UWFDR-GAN achieved the best quantitative performance (MS-SSIM: 0.7214; PSNR: 20.00; FID: 77.48; IS: 1.0123), outperforming all comparison models. Qualitatively, it preserved global vascular architecture and demonstrated superior reconstruction of DR-specific lesions, particularly neovascularization and non-perfusion areas.
    CONCLUSIONS: UWFDR-GAN provided a non-invasive ultra-widefield vascular assessment solution for clinical DR management, demonstrating potential to reduce reliance on invasive fluorescein imaging.
    Keywords:  Diabetic retinopathy; Generative adversarial networks; Retinal imaging; Ultra-widefield
    DOI:  https://doi.org/10.1186/s12967-025-07439-6
  16. Sci Rep. 2025 Dec 15. 15(1): 43796
      Non-ST-elevation myocardial infarction (NSTEMI) in elderly diabetic patients presents unique challenges in risk assessment and prognosis prediction. This study aimed to develop and validate a machine learning-based mortality risk prediction model for this specific population using the MIMIC-IV database. We conducted a retrospective cohort study including 5,272 NSTEMI patients aged ≥ 55 years with diabetes from the MIMIC-IV database. Multiple machine learning models were developed using clinical data collected within 24 h of admission. The primary outcome was 28-day all-cause mortality. Model performance was evaluated using ROC curves, calibration plots, and decision curve analysis. SHAP analysis was employed to interpret model predictions. The XGBoost model demonstrated superior performance (AUC = 0.86) compared to other algorithms and traditional scoring systems. SHAP analysis identified PaO2, Charlson Comorbidity Index, and APSIII score as the top three prognostic factors. Lactate levels showed the broadest influence range (SHAP values - 0.5 to 1.5), while platelet count exhibited distinct bidirectional effects on prognosis. Decision curve analysis confirmed the model's superior clinical utility across all risk threshold intervals. Our machine learning-based prediction model achieved robust performance in predicting 28-day mortality risk for elderly diabetic NSTEMI patients. The model's interpretability analysis revealed complex nonlinear relationships between clinical variables and outcomes, providing valuable insights for risk assessment and clinical decision-making.
    Keywords:  Cardiovascular risk; MIMIC-IV; Machine learning; Mortality risk prediction; NSTEMI; SHAP
    DOI:  https://doi.org/10.1038/s41598-025-27788-y
  17. Diabetes Res Clin Pract. 2025 Dec 16. pii: S0168-8227(25)01063-0. [Epub ahead of print] 113048
       AIMS: Early intervention can reduce the risk of gestational diabetes mellitus (GDM) and related adverse pregnancy outcomes. Accordingly, this study aims to develop prediction models for GDM in early pregnant women.
    METHODS: Pregnant women enrolled at Beijing Obstetrics and Gynecology Hospital were included in the derivation and temporal datasets. Women with electronic data capture records from other subcenters were included in the external validation dataset. Logistic regression and eXtreme Gradient Boosting (XGBoost) models were used to predict the risk of GDM.
    RESULTS: A total of 20,435 pregnant women were included in the derivation dataset, with 1,997 pregnant women in the temporal validation dataset. Furthermore, 100 pregnant women were included in the external validation dataset. The logistic regression and XGBoost models demonstrated AUCs of 0.738 (95% CI: 0.707, 0.771) and 0.737 (95% CI: 0.706, 0.767) in temporal validation. For the external validation dataset, the AUCs were 0.674 (95% CI: 0.440, 0.879) and 0.737 (95% CI: 0.510, 0.929) for the logistic regression and XGBoost models, respectively.
    CONCLUSION: Both the logistic regression and XGBoost models demonstrated satisfactory performance in the internal and temporal datasets. However, the XGBoost model showed more robust performance in the external validation dataset compared to the logistic regression model.
    Keywords:  Gestational diabetes mellitus; Logistic regression; Machine learning; Prediction model
    DOI:  https://doi.org/10.1016/j.diabres.2025.113048