bims-dinmec Biomed News
on DNA methylation in cancer
Issue of 2026–05–31
three papers selected by
Lorena Ancona, Humanitas Research



  1. Int J Mol Sci. 2026 May 14. pii: 4377. [Epub ahead of print]27(10):
      Methylomics has emerged as a central framework for understanding gene regulation in development and disease, yet the rapid expansion of profiling technologies, computational integration methods, and clinical applications has outpaced comprehensive synthesis. This review addresses that gap by systematically examining current advances across the full methylomics pipeline, from data generation to clinical translation. We draw on evidence from large-scale consortium datasets and benchmarking studies of multi-omics integration methods including MOFA, DIABLO, and deep learning architectures, single-cell and spatial methylomic technologies, long-read sequencing platforms (Oxford Nanopore, PacBio HiFi), and cell-free DNA (cfDNA) liquid biopsy approaches. The review further surveys methylation dysregulation across major disease domains, including cancer, cardiovascular disease, neurological disorders, and autoimmune conditions. Integrating methylomic data with transcriptomic and chromatin accessibility layers, particularly in spatial and single-cell contexts, substantially improves the resolution of disease-associated regulatory mechanisms. cfDNA methylation profiling emerges as a cross-disease, non-invasive monitoring platform with broad diagnostic potential, supported by machine learning-based deconvolution. We conclude that while technological barriers are diminishing, standardization of analytical workflows, population diversity in reference datasets, and regulatory alignment remain the principal challenges for translating methylomics advances into broadly accessible precision medicine.
    Keywords:  DNA methylation; cancer epigenetics; cell-free DNA (cfDNA); epigenetic biomarkers; epigenomics; long-read sequencing; spatial methylomics
    DOI:  https://doi.org/10.3390/ijms27104377
  2. bioRxiv. 2026 May 14. pii: 2026.05.11.723515. [Epub ahead of print]
       Summary: DNA methylation datasets from public repositories such as NCBI Gene Expression Omnibus are central to the development and evaluation of epigenetic aging clocks, yet existing resources and tools do not fully resolve the bottlenecks of dataset retrieval and metadata harmonization. Current benchmarking frameworks often rely on static curated collections, support only a subset of available Gene Expression Omnibus studies, focus on specific tissues, or require substantial manual intervention when metadata fields and supplementary files are inconsistently structured across studies. We developed MethylCurate, an agentic AI framework that addresses these limitations by automating the retrieval of DNA methylation datasets from the Gene Expression Omnibus, harmonizing heterogeneous metadata, mapping datasets to a unified format, and enabling scalable evaluation of epigenetic aging clocks through an integrated, dialogue-driven workflow.
    Availability and Implementation: MethylCurate is implemented in Python and combines deterministic modules for Gene Expression Omnibus dataset retrieval, quality control, and clock evaluation with large language model-assisted agents for metadata extraction, metadata harmonization, and DNA methylation data parsing. Source code, documentation, and example workflows are available at: https://github.com/Travyse/methylcurate.
    Contact: travyse.edwards@pennmedicine.upenn.edu.
    Supplementary Information: Supplementary data are available at Bioinformatics online.
    Key Messages: Automated curation of DNA methylation datasets from the Gene Expression Omnibus.Standardized preprocessing and metadata harmonization.Integrated benchmarking of epigenetic aging clocks.
    DOI:  https://doi.org/10.64898/2026.05.11.723515
  3. NPJ Precis Oncol. 2026 May 25.
      Circulating cell-free DNA (cfDNA) methylation profiling enables minimally invasive cancer detection and monitoring. Among available methods, cfMeDIP-seq is a sensitive, scalable, bisulfite-free approach suitable for low-input cfDNA. This review summarizes its principles, comparative technologies, and clinical applications, including early detection, tumor classification, and minimal residual disease monitoring. Despite promising performance, challenges remain in standardization and validation, while emerging multi-omic integrations may further enhance its role in precision oncology.
    DOI:  https://doi.org/10.1038/s41698-026-01507-w