U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    Knop1 lysine rich nucleolar protein 1 [ Mus musculus (house mouse) ]

    Gene ID: 66356, updated on 28-Oct-2024

    Summary

    Official Symbol
    Knop1provided by MGI
    Official Full Name
    lysine rich nucleolar protein 1provided by MGI
    Primary source
    MGI:MGI:1913606
    See related
    Ensembl:ENSMUSG00000030980 AllianceGenome:MGI:1913606
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    Tsg118; 2310008H09Rik
    Summary
    Predicted to be located in nucleolus. Is expressed in collecting duct; submandibular gland primordium; and testis. Orthologous to human KNOP1 (lysine rich nucleolar protein 1). [provided by Alliance of Genome Resources, Oct 2024]
    Expression
    Ubiquitous expression in CNS E11.5 (RPKM 15.7), testis adult (RPKM 11.3) and 26 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Knop1 in Genome Data Viewer
    Location:
    7 F2; 7 63.66 cM
    Exon count:
    5
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 7 NC_000073.7 (118441440..118454907, complement)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 7 NC_000073.6 (118842217..118855998, complement)

    Chromosome 7 - NC_000073.7Genomic Context describing neighboring genes Neighboring gene STARR-positive B cell enhancer ABC_E4970 Neighboring gene centriolar coiled coil protein 110 Neighboring gene STARR-positive B cell enhancer ABC_E4971 Neighboring gene STARR-positive B cell enhancer ABC_E1359 Neighboring gene VPS35 endosomal protein sorting factor like Neighboring gene STARR-positive B cell enhancer mm9_chr7:125909648-125909949 Neighboring gene STARR-seq mESC enhancer starr_20020 Neighboring gene predicted gene, 26147 Neighboring gene STARR-seq mESC enhancer starr_20027 Neighboring gene IQ motif containing K Neighboring gene STARR-seq mESC enhancer starr_20029 Neighboring gene G protein-coupled receptor, family C, group 5, member B Neighboring gene predicted gene, 39076

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Targeted (2) 

    General protein information

    Preferred Names
    lysine-rich nucleolar protein 1
    Names
    protein C16orf88 homolog
    testis-specific gene 118 protein

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001168218.2NP_001161690.2  lysine-rich nucleolar protein 1 isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) uses an alternate in-frame splice site in the middle portion of the coding region, compared to variant 1. This results in a shorter protein (isoform 2), compared to isoform 1.
      Source sequence(s)
      AC138717, AC165086
      Consensus CDS
      CCDS52377.1
      Related
      ENSMUSP00000068142.7, ENSMUST00000063607.12
    2. NM_001168219.2NP_001161691.2  lysine-rich nucleolar protein 1 isoform 3

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) uses an alternate in-frame splice site and lacks an exon in the middle portion of the coding region, compared to variant 1. This results in a shorter protein (isoform 3), compared to isoform 1.
      Source sequence(s)
      AC138717, AC165086
      Consensus CDS
      CCDS52376.1
      Related
      ENSMUSP00000114727.4, ENSMUST00000126792.9
    3. NM_001168220.2NP_001161692.1  lysine-rich nucleolar protein 1 isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (4) differs in the 5' UTR, lacks a portion of the 5' coding region, and initiates translation at a downstream start codon, compared to variant 1. This variant (4) also lacks an exon in the middle portion of the coding region compared to variant 1. The encoded isoform (4) is shorter than isoform 1.
      Source sequence(s)
      AC138717, AC165086
      Consensus CDS
      CCDS52375.1
      UniProtKB/TrEMBL
      K4DI66
      Related
      ENSMUSP00000102159.2, ENSMUST00000106549.9
      Conserved Domains (1) summary
      pfam15477
      Location:222295
      SMAP; Small acidic protein family
    4. NM_001415717.1NP_001402646.1  lysine-rich nucleolar protein 1 isoform 4

      Status: VALIDATED

      Source sequence(s)
      AC138717, AC165086
    5. NM_023197.4NP_075686.3  lysine-rich nucleolar protein 1 isoform 1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).
      Source sequence(s)
      AC138717, AC165086
      Consensus CDS
      CCDS40104.1
      UniProtKB/TrEMBL
      H7BX94
      Related
      ENSMUSP00000102160.6, ENSMUST00000106550.12

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000073.7 Reference GRCm39 C57BL/6J

      Range
      118441440..118454907 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)