bims-crepig Biomed News
on Chromatin regulation and epigenetics in cell fate and cancer
Issue of 2021–10–03
23 papers selected by
Connor Rogerson, University of Cambridge, MRC Cancer Unit



  1. Genome Biol. 2021 Sep 27. 22(1): 279
       BACKGROUND: Mammalian development is associated with extensive changes in gene expression, chromatin accessibility, and nuclear structure. Here, we follow such changes associated with mouse embryonic stem cell differentiation and X inactivation by integrating, for the first time, allele-specific data from these three modalities obtained by high-throughput single-cell RNA-seq, ATAC-seq, and Hi-C.
    RESULTS: Allele-specific contact decay profiles obtained by single-cell Hi-C clearly show that the inactive X chromosome has a unique profile in differentiated cells that have undergone X inactivation. Loss of this inactive X-specific structure at mitosis is followed by its reappearance during the cell cycle, suggesting a "bookmark" mechanism. Differentiation of embryonic stem cells to follow the onset of X inactivation is associated with changes in contact decay profiles that occur in parallel on both the X chromosomes and autosomes. Single-cell RNA-seq and ATAC-seq show evidence of a delay in female versus male cells, due to the presence of two active X chromosomes at early stages of differentiation. The onset of the inactive X-specific structure in single cells occurs later than gene silencing, consistent with the idea that chromatin compaction is a late event of X inactivation. Single-cell Hi-C highlights evidence of discrete changes in nuclear structure characterized by the acquisition of very long-range contacts throughout the nucleus. Novel computational approaches allow for the effective alignment of single-cell gene expression, chromatin accessibility, and 3D chromosome structure.
    CONCLUSIONS: Based on trajectory analyses, three distinct nuclear structure states are detected reflecting discrete and profound simultaneous changes not only to the structure of the X chromosomes, but also to that of autosomes during differentiation. Our study reveals that long-range structural changes to chromosomes appear as discrete events, unlike progressive changes in gene expression and chromatin accessibility.
    DOI:  https://doi.org/10.1186/s13059-021-02432-w
  2. Nucleic Acids Res. 2021 Sep 27. pii: gkab841. [Epub ahead of print]
      Long-range regulation by distal enhancers plays critical roles in cell-type specific transcriptional programs. Computational predictions of genome-wide enhancer-promoter interactions are still challenging due to limited accuracy and the lack of knowledge on the molecular mechanisms. Based on recent biological investigations, the protein-protein interactions (PPIs) between transcription factors (TFs) have been found to participate in the regulation of chromatin loops. Therefore, we developed a novel predictive model for cell-type specific enhancer-promoter interactions by leveraging the information of TF PPI signatures. Evaluated by a series of rigorous performance comparisons, the new model achieves superior performance over other methods. The model also identifies specific TF PPIs that may mediate long-range regulatory interactions, revealing new mechanistic understandings of enhancer regulation. The prioritized TF PPIs are associated with genes in distinct biological pathways, and the predicted enhancer-promoter interactions are strongly enriched with cis-eQTLs. Most interestingly, the model discovers enhancer-mediated trans-regulatory links between TFs and genes, which are significantly enriched with trans-eQTLs. The new predictive model, along with the genome-wide analyses, provides a platform to systematically delineate the complex interplay among TFs, enhancers and genes in long-range regulation. The novel predictions also lead to mechanistic interpretations of eQTLs to decode the genetic associations with gene expression.
    DOI:  https://doi.org/10.1093/nar/gkab841
  3. Nat Commun. 2021 Sep 30. 12(1): 5732
      Although alterations in chromatin structure are known to exist in tumors, how these alterations relate to molecular phenotypes in cancer remains to be demonstrated. Multi-omics profiling of human tumors can provide insight into how alterations in chromatin structure are propagated through the pathway of gene expression to result in malignant protein expression. We applied multi-omics profiling of chromatin accessibility, RNA abundance, and protein abundance to 36 human thyroid cancer primary tumors, metastases, and patient-match normal tissue. Through quantification of chromatin accessibility associated with active transcription units and global protein expression, we identify a local chromatin structure that is highly correlated with coordinated RNA and protein expression. In particular, we identify enhancers located within gene-bodies as predictive of correlated RNA and protein expression, that is independent of overall transcriptional activity. To demonstrate the generalizability of these findings we also identify similar results in an independent cohort of human breast cancers. Taken together, these analyses suggest that local enhancers, rather than distal enhancers, are likely most predictive of cancer gene expression phenotypes. This allows for identification of potential targets for cancer therapeutic approaches and reinforces the utility of multi-omics profiling as a methodology to understand human disease.
    DOI:  https://doi.org/10.1038/s41467-021-25872-1
  4. Nat Commun. 2021 Oct 01. 12(1): 5771
      Germline specification in mammals occurs through an inductive process whereby competent cells in the post-implantation epiblast differentiate into primordial germ cells (PGC). The intrinsic factors that endow epiblast cells with the competence to respond to germline inductive signals remain unknown. Single-cell RNA sequencing across multiple stages of an in vitro PGC-like cells (PGCLC) differentiation system shows that PGCLC genes initially expressed in the naïve pluripotent stage become homogeneously dismantled in germline competent epiblast like-cells (EpiLC). In contrast, the decommissioning of enhancers associated with these germline genes is incomplete. Namely, a subset of these enhancers partly retain H3K4me1, accumulate less heterochromatic marks and remain accessible and responsive to transcriptional activators. Subsequently, as in vitro germline competence is lost, these enhancers get further decommissioned and lose their responsiveness to transcriptional activators. Importantly, using H3K4me1-deficient cells, we show that the loss of this histone modification reduces the germline competence of EpiLC and decreases PGCLC differentiation efficiency. Our work suggests that, although H3K4me1 might not be essential for enhancer function, it can facilitate the (re)activation of enhancers and the establishment of gene expression programs during specific developmental transitions.
    DOI:  https://doi.org/10.1038/s41467-021-26065-6
  5. Bioinformatics. 2021 Sep 29. pii: btab680. [Epub ahead of print]
       MOTIVATION: Mapping of chromatin accessibility landscapes in single-cells and the integration with gene expression enables a better understanding of gene regulatory mechanisms defining cell identities and cell-fate determination in development and disease. Generally, raw data generated from single-cell Assay for Transposase-Accessible Chromatin sequencing (scATAC-seq) are deposited in repositories that are generally inaccessible due to lack of in-depth knowledge of computational programming.
    RESULTS: We have developed ShinyArchR.UiO, an R-based shiny app, that facilitates scATAC-seq data accessibility and visualisation in a user-friendly, interactive, and open-source web interface. ShinyArchR.UiO is an application that can streamline collaborative efforts for interpretation of massive chromatin accessibility datasets and allow for open access data sharing for wider audiences.
    AVAILABILITY: https://Github.com/EskelandLab/ShinyArchRUiO and a demo server with a haematopoietic tutorial dataset https://cancell.medisin.uio.no/ShinyArchR.UiO.
    SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
    Keywords:  Chromatin accessibility; Shiny R package; scATAC-seq; scRNA-seq; single-cell biology
    DOI:  https://doi.org/10.1093/bioinformatics/btab680
  6. Blood Adv. 2021 Oct 01. pii: bloodadvances.2021004469. [Epub ahead of print]
      Acute myeloid leukemia (AML) with MLL-rearrangement (MLL-r) comprises approximately 10% of all AML cases and portends poor outcomes. Much remains uncovered on how MLL-r AML drives leukemia development while preventing cells from normal myeloid differentiation. Here, we identified that transcription factor MEF2D is a super-enhancer-associated, highly expressed gene in MLL-r AML. Knockout of MEF2D profoundly impaired leukemia growth, induced myeloid differentiation, and delayed oncogenic progression in vivo. Mechanistically, MEF2D loss led to robust activation of a CEBPE-centered myeloid differentiation program in AML cells. Chromatin profiling revealed that MEF2D binds to and suppresses the chromatin accessibility of CEBPE cis-regulatory regions. In human acute leukemia samples, MEF2D expression showed a strong negative correlation with the expression of CEBPE. Depletion of CEBPE partially rescued the cell growth defect and myeloid cell differentiation induced by the loss of MEF2D. Lastly, we show that MEF2D is positively regulated by HOXA9, and downregulation of MEF2D is an important mechanism for DOT1L inhibitor-induced anti-leukemia effects. Collectively, our findings suggest that MEF2D plays a critical role in human MLL-r AML and uncover the MEF2D-CEBPE axis as a crucial transcriptional mechanism regulating leukemia cell self-renewal and differentiation block.
    DOI:  https://doi.org/10.1182/bloodadvances.2021004469
  7. Curr Issues Mol Biol. 2021 Sep 17. 43(2): 1156-1170
      HiChIP and PLAC-Seq are emerging technologies for studying genome-wide long-range chromatin interactions mediated by the protein of interest, enabling more sensitive and cost-efficient interrogation of protein-centric chromatin conformation. However, due to the unbalanced read distribution introduced by protein immunoprecipitation, existing reproducibility measures developed for Hi-C data are not appropriate for the analysis of HiChIP and PLAC-Seq data. Here, we present HPRep, a stratified and weighted correlation metric derived from normalized contact counts, to quantify reproducibility in HiChIP and PLAC-Seq data. We applied HPRep to multiple real datasets and demonstrate that HPRep outperforms existing reproducibility measures developed for Hi-C data. Specifically, we applied HPRep to H3K4me3 PLAC-Seq data from mouse embryonic stem cells and mouse brain tissues as well as H3K27ac HiChIP data from human lymphoblastoid cell line GM12878 and leukemia cell line K562, showing that HPRep can more clearly separate among pseudo-replicates, real replicates, and non-replicates. Furthermore, in an H3K4me3 PLAC-Seq dataset consisting of 11 samples from four human brain cell types, HPRep demonstrated the expected clustering of data that could not be achieved by existing methods developed for Hi-C data, highlighting the need for a reproducibility metric tailored to HiChIP and PLAC-Seq data.
    Keywords:  HiChIP; PLAC-Seq; chromatin spatial organization; reproducibility
    DOI:  https://doi.org/10.3390/cimb43020082
  8. Cell Rep. 2021 Sep 28. pii: S2211-1247(21)01209-2. [Epub ahead of print]36(13): 109755
      Spt4 is a transcription elongation factor with homologs in organisms with nucleosomes. Structural and in vitro studies implicate Spt4 in transcription through nucleosomes, and yet the in vivo function of Spt4 is unclear. Here, we assess the precise position of Spt4 during transcription and the consequences of the loss of Spt4 on RNA polymerase II (RNAPII) dynamics and nucleosome positioning in Saccharomyces cerevisiae. In the absence of Spt4, the spacing between gene-body nucleosomes increases and RNAPII accumulates upstream of the nucleosomal dyad, most dramatically at nucleosome +2. Spt4 associates with elongating RNAPII early in transcription, and its association dynamically changes depending on nucleosome positions. Together, our data show that Spt4 regulates early elongation dynamics, participates in co-transcriptional nucleosome positioning, and promotes RNAPII movement through the gene-body nucleosomes, especially the +2 nucleosome.
    Keywords:  MNase-seq; NET-seq; Nucleosome; RNAPII; Saccharomyces cerevisiae; TEF-seq; spt4; spt5; transcription elongation
    DOI:  https://doi.org/10.1016/j.celrep.2021.109755
  9. Gastroenterology. 2021 Sep 24. pii: S0016-5085(21)03547-2. [Epub ahead of print]
       BACKGROUND AND AIMS: Genomic alterations that encourage stem cell activity and hinder proper maturation are central to the development of colorectal cancer (CRC). Key molecular mediators that promote these malignant properties require further elucidation to galvanize translational advances. We therefore aimed to characterize a key factor that blocks intestinal differentiation, define its transcriptional and epigenetic program, and provide preclinical evidence for therapeutic targeting in CRC.
    METHODS: Intestinal tissue from transgenic mice and patients were analyzed by histopathology and immunostaining. Human CRC cells and neoplastic murine organoids were genetically manipulated for functional studies. Gene expression profiling was obtained through RNA sequencing. Histone modifications and transcription factor binding was determined by ChIP sequencing.
    RESULTS: We demonstrate that SRY-box transcription factor 9 (SOX9) promotes CRC by activating a stem cell-like program that hinders intestinal differentiation. Intestinal adenomas and colorectal adenocarcinomas from mouse models and patients demonstrate ectopic and elevated expression of SOX9. Functional experiments indicate a requirement for SOX9 in human CRC cell lines and engineered neoplastic organoids. Disrupting SOX9 activity impairs primary CRC tumor growth by inducing intestinal differentiation. By binding to genome wide enhancers, SOX9 directly activates genes associated with Paneth and stem cell activity, including PROM1. SOX9 upregulates PROM1 via a WNT-responsive intronic enhancer. A pentaspan transmembrane protein, PROM1 utilizes its first intracellular domain to support stem cell signaling, at least in part through SOX9, reinforcing a PROM1-SOX9 positive feedback loop.
    CONCLUSIONS: These studies establish SOX9 as a central regulator of an enhancer-driven stem cell-like program and carry important implications for developing therapeutics directed at overcoming differentiation defects in CRC.
    Keywords:  PROM1; SOX9; colorectal cancer; differentiation block
    DOI:  https://doi.org/10.1053/j.gastro.2021.09.044
  10. PLoS Genet. 2021 Sep 29. 17(9): e1009828
      Transcription-related proteins are frequently identified as targets of sumoylation, including multiple subunits of the RNA polymerase II (RNAPII) general transcription factors (GTFs). However, it is not known how sumoylation affects GTFs or whether they are sumoylated when they assemble at promoters to facilitate RNAPII recruitment and transcription initiation. To explore how sumoylation can regulate transcription genome-wide, we performed SUMO ChIP-seq in yeast and found, in agreement with others, that most chromatin-associated sumoylated proteins are detected at genes encoding tRNAs and ribosomal proteins (RPGs). However, we also detected 147 robust SUMO peaks at promoters of non-ribosomal protein-coding genes (non-RPGs), indicating that sumoylation also regulates this gene class. Importantly, SUMO peaks at non-RPGs align specifically with binding sites of GTFs, but not other promoter-associated proteins, indicating that it is GTFs specifically that are sumoylated there. Predominantly, non-RPGs with SUMO peaks are among the most highly transcribed, have high levels of TFIIF, and show reduced RNAPII levels when cellular sumoylation is impaired, linking sumoylation with elevated transcription. However, detection of promoter-associated SUMO by ChIP might be limited to sites with high levels of substrate GTFs, and promoter-associated sumoylation at non-RPGs may actually be far more widespread than we detected. Among GTFs, we found that TFIIF is a major target of sumoylation, specifically at lysines 60/61 of its Tfg1 subunit, and elevating Tfg1 sumoylation resulted in decreased interaction of TFIIF with RNAPII. Interestingly, both reducing promoter-associated sumoylation, in a sumoylation-deficient Tfg1-K60/61R mutant strain, and elevating promoter-associated SUMO levels, by constitutively tethering SUMO to Tfg1, resulted in reduced RNAPII occupancy at non-RPGs. This implies that dynamic GTF sumoylation at non-RPG promoters, not simply the presence or absence of SUMO, is important for maintaining elevated transcription. Together, our findings reveal a novel mechanism of regulating the basal transcription machinery through sumoylation of promoter-bound GTFs.
    DOI:  https://doi.org/10.1371/journal.pgen.1009828
  11. Nucleic Acids Res. 2021 Sep 27. pii: gkab844. [Epub ahead of print]
      Histone H3 lysine 27 methylation catalyzed by polycomb repressive complex 2 (PRC2) is conserved from fungi to humans and represses gene transcription. However, the mechanism for recognition of methylated H3K27 remains unclear, especially in fungi. Here, we found that the bromo-adjacent homology (BAH)-plant homeodomain (PHD) domain containing protein BAH-PHD protein 1 (BP1) is a reader of H3K27 methylation in the cereal fungal pathogen Fusarium graminearum. BP1 interacts with the core PRC2 component Suz12 and directly binds methylated H3K27. BP1 is distributed in a subset of genomic regions marked by H3K27me3 and co-represses gene transcription. The BP1 deletion mutant shows identical phenotypes on mycelial growth and virulence, as well as similar expression profiles of secondary metabolite genes to the strain lacking the H3K27 methyltransferase Kmt6. More importantly, BP1 can directly bind DNA through its PHD finger, which might increase nucleosome residence and subsequently reinforce transcriptional repression in H3K27me3-marked target regions. A phylogenetic analysis showed that BP1 orthologs are mainly conserved in fungi. Overall, our findings provide novel insights into the mechanism by which PRC2 mediates gene repression in fungi, which is distinct from the PRC1-PRC2 system in plants and mammals.
    DOI:  https://doi.org/10.1093/nar/gkab844
  12. EMBO J. 2021 Sep 27. e108439
      Upon replication stress, budding yeast checkpoint kinase Mec1ATR triggers the downregulation of transcription, thereby reducing the level of RNA polymerase (RNAP) on chromatin to facilitate replication fork progression. Here, we identify a hydroxyurea-induced phosphorylation site on Mec1, Mec1-S1991, that contributes to the eviction of RNAPII and RNAPIII during replication stress. The expression of the non-phosphorylatable mec1-S1991A mutant reduces replication fork progression genome-wide and compromises survival on hydroxyurea. This defect can be suppressed by destabilizing chromatin-bound RNAPII through a TAP fusion to its Rpb3 subunit, suggesting that lethality in mec1-S1991A mutants arises from replication-transcription conflicts. Coincident with a failure to repress gene expression on hydroxyurea in mec1-S1991A cells, highly transcribed genes such as GAL1 remain bound at nuclear pores. Consistently, we find that nuclear pore proteins and factors controlling RNAPII and RNAPIII are phosphorylated in a Mec1-dependent manner on hydroxyurea. Moreover, we show that Mec1 kinase also contributes to reduced RNAPII occupancy on chromatin during an unperturbed S phase by promoting degradation of the Rpb1 subunit.
    Keywords:  Mec1; nuclear pore; replication checkpoint; replication interference; replication stress; transcription
    DOI:  https://doi.org/10.15252/embj.2021108439
  13. Nat Commun. 2021 Oct 01. 12(1): 5775
      Neuroendocrine carcinomas (NEC) are tumors expressing markers of neuronal differentiation that can arise at different anatomic sites but have strong histological and clinical similarities. Here we report the chromatin landscapes of a range of human NECs and show convergence to the activation of a common epigenetic program. With a particular focus on treatment emergent neuroendocrine prostate cancer (NEPC), we analyze cell lines, patient-derived xenograft (PDX) models and human clinical samples to show the existence of two distinct NEPC subtypes based on the expression of the neuronal transcription factors ASCL1 and NEUROD1. While in cell lines and PDX models these subtypes are mutually exclusive, single-cell analysis of human clinical samples exhibits a more complex tumor structure with subtypes coexisting as separate sub-populations within the same tumor. These tumor sub-populations differ genetically and epigenetically contributing to intra- and inter-tumoral heterogeneity in human metastases. Overall, our results provide a deeper understanding of the shared clinicopathological characteristics shown by NECs. Furthermore, the intratumoral heterogeneity of human NEPCs suggests the requirement of simultaneous targeting of coexisting tumor populations as a therapeutic strategy.
    DOI:  https://doi.org/10.1038/s41467-021-26042-z
  14. Cell Rep. 2021 Sep 28. pii: S2211-1247(21)01211-0. [Epub ahead of print]36(13): 109757
      Breast cancer stem cells (BCSCs) play essential roles in tumor formation, drug resistance, relapse, and metastasis. NANOG is a protein required for stem cell self-renewal, but the mechanisms by which it performs this function are poorly understood. Here, we show that hypoxia-inducible factor 1α (HIF-1α) is required for NANOG-mediated BCSC enrichment. Mechanistically, NANOG is recruited by HIF-1 to cooperatively activate transcription of the TERT gene encoding the telomerase reverse transcriptase that maintains telomere length, which is required for stem cell self-renewal. NANOG stimulates HIF-1 transcriptional activity by recruitment of the deubiquitinase USP9X, which inhibits HIF-1α protein degradation, and by stabilizing HIF-1α interaction with the coactivator p300, which mediates histone acetylation. Our results delineate a cooperative transcriptional mechanism by which HIF-1 and NANOG mediate BCSC self-renewal.
    Keywords:  aldehyde dehydrogenase; hypoxia; mammosphere; pluripotency factor; proteasome; telomere; tumor-initiating cells; ubiquitination
    DOI:  https://doi.org/10.1016/j.celrep.2021.109757
  15. iScience. 2021 Sep 24. 24(9): 103034
      Transgenic reporters allow the measurement of regulatory DNA activity in vivo and consequently have long been useful tools for studying enhancers. Despite their utility, few studies have investigated the effects these reporters may have on the expression of other genes. Understanding these effects is required to accurately interpret reporter data and characterize gene regulatory mechanisms. By measuring the expression of Kruppel (Kr) enhancer reporters in live Drosophila embryos, we find reporters inhibit one another's expression and that of a nearby endogenous gene. Using synthetic transcription factor (TF) binding site arrays, we present evidence that competition for TFs is partially responsible for the observed transcriptional inhibition. We develop a simple thermodynamic model that predicts competition of the measured magnitude specifically when TF binding is restricted to distinct nuclear subregions. Our findings underline an unexpected role of the non-homogenous nature of the nucleus in regulating gene expression.
    Keywords:  Biological sciences; Cell biology; Developmental biology; Mathematical biosciences; Molecular biology; Molecular interaction
    DOI:  https://doi.org/10.1016/j.isci.2021.103034
  16. Nucleic Acids Res. 2021 Sep 27. pii: gkab820. [Epub ahead of print]
      Few genetically dominant mutations involved in human disease have been fully explained at the molecular level. In cases where the mutant gene encodes a transcription factor, the dominant-negative mode of action of the mutant protein is particularly poorly understood. Here, we studied the genome-wide mechanism underlying a dominant-negative form of the SOX18 transcription factor (SOX18RaOp) responsible for both the classical mouse mutant Ragged Opossum and the human genetic disorder Hypotrichosis-lymphedema-telangiectasia-renal defect syndrome. Combining three single-molecule imaging assays in living cells together with genomics and proteomics analysis, we found that SOX18RaOp disrupts the system through an accumulation of molecular interferences which impair several functional properties of the wild-type SOX18 protein, including its target gene selection process. The dominant-negative effect is further amplified by poisoning the interactome of its wild-type counterpart, which perturbs regulatory nodes such as SOX7 and MEF2C. Our findings explain in unprecedented detail the multi-layered process that underpins the molecular aetiology of dominant-negative transcription factor function.
    DOI:  https://doi.org/10.1093/nar/gkab820
  17. Mol Syst Biol. 2021 Sep;17(9): e10272
      It is essential for cells to control which genes are transcribed into RNA. In eukaryotes, two major control points are recruitment of RNA polymerase II (Pol II) into a paused state, and subsequent pause release toward transcription. Pol II recruitment and pause release occur in association with macromolecular clusters, which were proposed to be formed by a liquid-liquid phase separation mechanism. How such a phase separation mechanism relates to the interaction of Pol II with DNA during recruitment and transcription, however, remains poorly understood. Here, we use live and super-resolution microscopy in zebrafish embryos to reveal Pol II clusters with a large variety of shapes, which can be explained by a theoretical model in which regulatory chromatin regions provide surfaces for liquid-phase condensation at concentrations that are too low for canonical liquid-liquid phase separation. Model simulations and chemical perturbation experiments indicate that recruited Pol II contributes to the formation of these surface-associated condensates, whereas elongating Pol II is excluded from these condensates and thereby drives their unfolding.
    Keywords:  active matter; gene regulation; nuclear organization; super-resolution microscopy; transcriptional condensates
    DOI:  https://doi.org/10.15252/msb.202110272
  18. Dev Cell. 2021 Sep 21. pii: S1534-5807(21)00682-1. [Epub ahead of print]
      Epigenetic mechanisms contribute to the regulation of cell differentiation and function. Vascular smooth muscle cells (SMCs) are specialized contractile cells that retain phenotypic plasticity even after differentiation. Here, by performing selective demethylation of histone H3 lysine 4 di-methylation (H3K4me2) at SMC-specific genes, we uncovered that H3K4me2 governs SMC lineage identity. Removal of H3K4me2 via selective editing in cultured vascular SMCs and in murine arterial vasculature led to loss of differentiation and reduced contractility due to impaired recruitment of the DNA methylcytosine dioxygenase TET2. H3K4me2 editing altered SMC adaptative capacities during vascular remodeling due to loss of miR-145 expression. Finally, H3K4me2 editing induced a profound alteration of SMC lineage identity by redistributing H3K4me2 toward genes associated with stemness and developmental programs, thus exacerbating plasticity. Our studies identify the H3K4me2-TET2-miR145 axis as a central epigenetic memory mechanism controlling cell identity and function, whose alteration could contribute to various pathophysiological processes.
    Keywords:  DNA methylation; cell differentiation; epigenetics; gene regulation; histone modifications; microRNA; vascular disease; vascular injury
    DOI:  https://doi.org/10.1016/j.devcel.2021.09.001
  19. Nat Commun. 2021 Sep 28. 12(1): 5679
      Chromosomal rearrangements are a frequent cause of oncogene deregulation in human malignancies. Overexpression of EVI1 is found in a subgroup of acute myeloid leukemia (AML) with 3q26 chromosomal rearrangements, which is often therapy resistant. In AMLs harboring a t(3;8)(q26;q24), we observed the translocation of a MYC super-enhancer (MYC SE) to the EVI1 locus. We generated an in vitro model mimicking a patient-based t(3;8)(q26;q24) using CRISPR-Cas9 technology and demonstrated hyperactivation of EVI1 by the hijacked MYC SE. This MYC SE contains multiple enhancer modules, of which only one recruits transcription factors active in early hematopoiesis. This enhancer module is critical for EVI1 overexpression as well as enhancer-promoter interaction. Multiple CTCF binding regions in the MYC SE facilitate this enhancer-promoter interaction, which also involves a CTCF binding site upstream of the EVI1 promoter. We hypothesize that this CTCF site acts as an enhancer-docking site in t(3;8) AML. Genomic analyses of other 3q26-rearranged AML patient cells point to a common mechanism by which EVI1 uses this docking site to hijack enhancers active in early hematopoiesis.
    DOI:  https://doi.org/10.1038/s41467-021-25862-3
  20. Genome Biol. 2021 Sep 27. 22(1): 280
       BACKGROUND: Deep learning has proven to be a powerful technique for transcription factor (TF) binding prediction but requires large training datasets. Transfer learning can reduce the amount of data required for deep learning, while improving overall model performance, compared to training a separate model for each new task.
    RESULTS: We assess a transfer learning strategy for TF binding prediction consisting of a pre-training step, wherein we train a multi-task model with multiple TFs, and a fine-tuning step, wherein we initialize single-task models for individual TFs with the weights learned by the multi-task model, after which the single-task models are trained at a lower learning rate. We corroborate that transfer learning improves model performance, especially if in the pre-training step the multi-task model is trained with biologically relevant TFs. We show the effectiveness of transfer learning for TFs with ~ 500 ChIP-seq peak regions. Using model interpretation techniques, we demonstrate that the features learned in the pre-training step are refined in the fine-tuning step to resemble the binding motif of the target TF (i.e., the recipient of transfer learning in the fine-tuning step). Moreover, pre-training with biologically relevant TFs allows single-task models in the fine-tuning step to learn useful features other than the motif of the target TF.
    CONCLUSIONS: Our results confirm that transfer learning is a powerful technique for TF binding prediction.
    Keywords:  Deep learning; Model interpretation; Transcription factor binding prediction; Transfer learning
    DOI:  https://doi.org/10.1186/s13059-021-02499-5
  21. Oncogene. 2021 Sep 28.
      In breast cancer the transcription factor SOX4 has been shown to be associated with poor survival, increased tumor size and metastasis formation. This has mostly been attributed to the ability of SOX4 to regulate Epithelial-to-Mesenchymal-Transition (EMT). However, SOX4 regulates target gene transcription in a context-dependent manner that is determined by the cellular and epigenetic state. In this study we have investigated the loss of SOX4 in mammary tumor development utilizing organoids derived from a PyMT genetic mouse model of breast cancer. Using CRISPR/Cas9 to abrogate SOX4 expression, we found that SOX4 is required for inhibiting differentiation by regulating a subset of genes that are highly activated in fetal mammary stem cells (fMaSC). In this way, SOX4 re-activates an oncogenic transcriptional program that is regulated in many progenitor cell-types during embryonic development. SOX4-knockout organoids are characterized by the presence of more differentiated cells that exhibit luminal or basal gene expression patterns, but lower expression of cell cycle genes. In agreement, primary tumor growth and metastatic outgrowth in the lungs are impaired in SOX4KO tumors. Finally, SOX4KO tumors show a severe loss in competitive capacity to grow out compared to SOX4-proficient cells in primary tumors. Our study identifies a novel role for SOX4 in maintaining mammary tumors in an undifferentiated and proliferative state. Therapeutic manipulation of SOX4 function could provide a novel strategy for cancer differentiation therapy, which would promote differentiation and inhibit cycling of tumor cells.
    DOI:  https://doi.org/10.1038/s41388-021-02004-z
  22. EMBO Rep. 2021 Sep 27. e52348
      Obesity mainly results from a chronic energy imbalance. Promoting browning of white adipocytes is a promising strategy to enhance energy expenditure and combat obesity. N6-methyladenosine (m6A), the most abundant mRNA modification in eukaryotes, plays an important role in regulating adipogenesis. However, whether m6A regulates white adipocyte browning was unknown. Here, we report that adipose tissue-specific deletion of Fto, an m6A demethylase, predisposes mice to prevent high-fat diet (HFD)-induced obesity by enhancing energy expenditure. Additionally, deletion of FTO in vitro promotes thermogenesis and white-to-beige adipocyte transition. Mechanistically, FTO deficiency increases the m6A level of Hif1a mRNA, which is recognized by m6A-binding protein YTHDC2, facilitating mRNA translation and increasing HIF1A protein abundance. HIF1A activates the transcription of thermogenic genes, including Ppaggc1a, Prdm16, and Pparg, thereby promoting Ucp1 expression and the browning process. Collectively, these results unveil an epigenetic mechanism by which m6A-facilitated HIF1A expression controls browning of white adipocytes and thermogenesis, providing a potential target to counteract obesity and metabolic disease.
    Keywords:   Hif1a ; FTO; m6A; thermogenesis; translation
    DOI:  https://doi.org/10.15252/embr.202052348
  23. Nat Commun. 2021 Sep 28. 12(1): 5618
    BIOS Consortium
      Monozygotic (MZ) twins and higher-order multiples arise when a zygote splits during pre-implantation stages of development. The mechanisms underpinning this event have remained a mystery. Because MZ twinning rarely runs in families, the leading hypothesis is that it occurs at random. Here, we show that MZ twinning is strongly associated with a stable DNA methylation signature in adult somatic tissues. This signature spans regions near telomeres and centromeres, Polycomb-repressed regions and heterochromatin, genes involved in cell-adhesion, WNT signaling, cell fate, and putative human metastable epialleles. Our study also demonstrates a never-anticipated corollary: because identical twins keep a lifelong molecular signature, we can retrospectively diagnose if a person was conceived as monozygotic twin.
    DOI:  https://doi.org/10.1038/s41467-021-25583-7