U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Ctsw cathepsin W [ Mus musculus (house mouse) ]

Gene ID: 13041, updated on 2-Nov-2024

Summary

Official Symbol
Ctswprovided by MGI
Official Full Name
cathepsin Wprovided by MGI
Primary source
MGI:MGI:1338045
See related
Ensembl:ENSMUSG00000024910 AllianceGenome:MGI:1338045
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
lymphopain
Summary
This gene encodes a member of the peptidase C1 (papain) family of cysteine proteases. The encoded preproprotein is proteolytically processed to generate a mature protein product. Expression of the encoded protein is upregulated following lymphocyte activation. Data from a human cell line suggests that the encoded enzyme may be important for viral entry into host cells. [provided by RefSeq, Aug 2015]
Expression
Biased expression in thymus adult (RPKM 6.4), spleen adult (RPKM 5.8) and 14 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Ctsw in Genome Data Viewer
Location:
19 A; 19 4.33 cM
Exon count:
10
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 19 NC_000085.7 (5515071..5518558, complement)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 19 NC_000085.6 (5465043..5468628, complement)

Chromosome 19 - NC_000085.7Genomic Context describing neighboring genes Neighboring gene predicted gene, 31070 Neighboring gene fos-like antigen 1 Neighboring gene coiled-coil domain containing 85B Neighboring gene fibroblast growth factor (acidic) intracellular binding protein Neighboring gene epidermal growth factor-containing fibulin-like extracellular matrix protein 2 Neighboring gene MUS81 structure-specific endonuclease subunit Neighboring gene CapStarr-seq enhancer MGSCv37_chr19:5488369-5488552

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Bibliography

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics  (MGI)
  • Endonuclease-mediated (2) 
  • Targeted (1)  1 citation

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Gene Ontology Provided by MGI

Function Evidence Code Pubs
enables cysteine-type endopeptidase activity IBA
Inferred from Biological aspect of Ancestor
more info
 
enables cysteine-type peptidase activity IEA
Inferred from Electronic Annotation
more info
 
Process Evidence Code Pubs
involved_in proteolysis IEA
Inferred from Electronic Annotation
more info
 
involved_in proteolysis involved in protein catabolic process IBA
Inferred from Biological aspect of Ancestor
more info
 
Component Evidence Code Pubs
located_in endoplasmic reticulum IEA
Inferred from Electronic Annotation
more info
 
is_active_in extracellular space IBA
Inferred from Biological aspect of Ancestor
more info
 
is_active_in lysosome IBA
Inferred from Biological aspect of Ancestor
more info
 

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_009985.5NP_034115.2  cathepsin W preproprotein

    See identical proteins and their annotated locations for NP_034115.2

    Status: REVIEWED

    Source sequence(s)
    AC122861, BB850740, BE447240
    Consensus CDS
    CCDS29465.1
    UniProtKB/Swiss-Prot
    P56203
    UniProtKB/TrEMBL
    A0A494BA69, Q8C2M0
    Related
    ENSMUSP00000025844.5, ENSMUST00000025844.6
    Conserved Domains (2) summary
    smart00848
    Location:4096
    Inhibitor_I29; Cathepsin propeptide inhibitor domain (I29)
    pfam00112
    Location:126356
    Peptidase_C1; Papain family cysteine protease

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000085.7 Reference GRCm39 C57BL/6J

    Range
    5515071..5518558 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_006531661.5XP_006531724.1  cathepsin W isoform X1

    Conserved Domains (2) summary
    PTZ00203
    Location:39204
    PTZ00203; cathepsin L protease; Provisional
    cl23744
    Location:126204
    Peptidase_C1; C1 Peptidase family (MEROPS database nomenclature), also referred to as the papain family; composed of two subfamilies of cysteine peptidases (CPs), C1A (papain) and C1B (bleomycin hydrolase). Papain-like enzymes are mostly endopeptidases with some ...

RNA

  1. XR_879413.4 RNA Sequence