U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

LOC111099028 ERG, ETS transcription factor breakpoint cluster recombination region [ Homo sapiens (human) ]

Gene ID: 111099028, updated on 10-Oct-2023

Summary

Gene symbol
LOC111099028
Gene description
ERG, ETS transcription factor breakpoint cluster recombination region
Gene type
biological region
Feature type(s)
misc_recomb: mitotic
protein_bind
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Summary
This region is known to undergo mitotic DNA recombination with another region, the transmembrane protease serine 2 breakpoint cluster recombination region, located on the q arm of chromosome 21, about 3 Mb centromere-distal to this region. Recombination between these two regions can produce gene fusions involving the transmembrane protease, serine 2 (TMPRSS2) gene and the ERG, ETS transcription factor (ERG) gene, and deletions of the intervening sequence. Complex rearrangements have also been observed. This gene fusion is commonly found in prostate cancer cells. The double-strand breaks that lead to this rearrangement are mediated by the DNA topoisomerase II beta (TOP2B) protein, and TOP2B cleavage sites have been found adjacent to individual fusion breakpoints. Studies indicate that androgen signaling can induce proximity of the TMPRSS2 and ERG genomic loci, possibly influencing break location. Chromosomal architecture, specifically chromosome loop anchors, also plays a role in determining break location, as these regions have been shown to be more susceptible to cleavage by TOP2B. Molecular analysis shows that many breakpoint junctions share microhomology, or, contain small insertions at the junctions, while others are the result of blunt-end fusions. This record shows the range over which different rearrangements have been found, as well as the location of some specific breakpoints and regions enriched for dihydrotestosterone-dependent DNA topoisomerase II beta binding. [provided by RefSeq, Sep 2017]
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

Location:
21q22.2
Annotation release Status Assembly Chr Location
RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 21 NC_000021.9 (38454791..38522956)
RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 21 NC_060945.1 (36838230..36906723)
105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 21 NC_000021.8 (39826714..39894880)

Chromosome 21 - NC_000021.9Genomic Context describing neighboring genes Neighboring gene CDK7 strongly-dependent group 2 enhancer GRCh37_chr21:39704626-39705825 Neighboring gene Neanderthal introgressed variant-containing enhancer experimental_62068 Neighboring gene long intergenic non-protein coding RNA 1423 Neighboring gene uncharacterized LOC107985513 Neighboring gene ETS transcription factor ERG Neighboring gene H3K4me1 hESC enhancer GRCh37_chr21:39810939-39811438 Neighboring gene OCT4-NANOG hESC enhancer GRCh37_chr21:39820334-39821010 Neighboring gene Sharpr-MPRA regulatory region 14119 Neighboring gene uncharacterized LOC105372802 Neighboring gene NANOG-H3K4me1 hESC enhancer GRCh37_chr21:39879001-39879501 Neighboring gene small nuclear ribonucleoprotein polypeptide G pseudogene 13 Neighboring gene P300/CBP strongly-dependent group 1 enhancer GRCh37_chr21:40051342-40052541 Neighboring gene BRD4-independent group 4 enhancer GRCh37_chr21:40055463-40056662 Neighboring gene long intergenic non-protein coding RNA 114 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 18458 Neighboring gene uncharacterized LOC107985480 Neighboring gene BRD4-independent group 4 enhancer GRCh37_chr21:40159061-40160260

Genomic regions, transcripts, and products

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

Genomic

  1. NG_055542.1 

    Range
    101..68266
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000021.9 Reference GRCh38.p14 Primary Assembly

    Range
    38454791..38522956
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060945.1 Alternate T2T-CHM13v2.0

    Range
    36838230..36906723
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)