NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907125969|ref|XP_036016668|]
View 

tubby-related protein 4 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tub super family cl08308
Tub family;
1459-1529 1.79e-23

Tub family;


The actual alignment was detected with superfamily member pfam01167:

Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907125969 1459 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1529
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
PHA03247 super family cl33720
large tegument protein UL36; Provisional
995-1326 3.18e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 3.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  995 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1074
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1075 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1154
Cdd:PHA03247  2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1155 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1229
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1230 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1299
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907125969 1300 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1326
Cdd:PHA03247  2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
SOCS super family cl02533
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region ...
319-354 4.15e-04

SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region of CIS/SOCS family proteins (in combination with a SH2 domain), ASBs (ankyrin repeat-containing proteins with a SOCS box), SSBs (SPRY domain-containing proteins with a SOCS box), and WSBs (WD40 repeat-containing proteins with a SOCS box), as well as, other miscellaneous proteins. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.


The actual alignment was detected with superfamily member cd03717:

Pssm-ID: 470605  Cd Length: 39  Bit Score: 39.12  E-value: 4.15e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907125969  319 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 354
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
WD40 super family cl43672
WD40 repeat [General function prediction only];
45-163 9.03e-04

WD40 repeat [General function prediction only];


The actual alignment was detected with superfamily member COG2319:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 43.36  E-value: 9.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVW------------IQY 112
Cdd:COG2319    218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWdlatgellrtltGHS 288
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907125969  113 EGRWSVELVNDrGAQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 163
Cdd:COG2319    289 GGVNSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAF 338
 
Name Accession Description Interval E-value
Tub pfam01167
Tub family;
1459-1529 1.79e-23

Tub family;


Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907125969 1459 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1529
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
PHA03247 PHA03247
large tegument protein UL36; Provisional
995-1326 3.18e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 3.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  995 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1074
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1075 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1154
Cdd:PHA03247  2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1155 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1229
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1230 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1299
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907125969 1300 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1326
Cdd:PHA03247  2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1039-1295 1.36e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.68  E-value: 1.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1039 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1107
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1108 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1180
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1181 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1260
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907125969 1261 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1295
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
319-354 4.15e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.12  E-value: 4.15e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907125969  319 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 354
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ...
321-355 6.94e-04

The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.94e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1907125969   321 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 355
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
WD40 COG2319
WD40 repeat [General function prediction only];
45-163 9.03e-04

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 43.36  E-value: 9.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVW------------IQY 112
Cdd:COG2319    218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWdlatgellrtltGHS 288
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907125969  113 EGRWSVELVNDrGAQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 163
Cdd:COG2319    289 GGVNSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAF 338
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ...
320-354 2.24e-03

SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.14  E-value: 2.24e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907125969  320 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 354
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
78-164 3.25e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.17  E-value: 3.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW--------IQYEGR----WSVELVNDrGAQVLFGTADGQVIVMDCHGR 145
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWdletgellRTLKGHtgpvRDVAASAD-GTYLASGSSDKTIRLWDLETG 83
                           90
                   ....*....|....*....
gi 1907125969  146 MLAHVLLHESDGILSMSWN 164
Cdd:cd00200     84 ECVRTLTGHTSYVSSVAFS 102
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.37e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.37e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1907125969    78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
Name Accession Description Interval E-value
Tub pfam01167
Tub family;
1459-1529 1.79e-23

Tub family;


Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.79e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907125969 1459 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1529
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
PHA03247 PHA03247
large tegument protein UL36; Provisional
995-1326 3.18e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 3.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  995 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1074
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1075 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1154
Cdd:PHA03247  2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1155 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1229
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1230 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1299
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907125969 1300 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1326
Cdd:PHA03247  2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
PHA03247 PHA03247
large tegument protein UL36; Provisional
938-1290 3.58e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  938 RLTVPRYSIPTGDPPPYP------EIASQLAQGRSAA----QRLDNSLIHATLRRNNREVALKMAQLADSS-RAPLQPLA 1006
Cdd:PHA03247  2609 RGPAPPSPLPPDTHAPDPpppspsPAANEPDPHPPPTvpppERPRDDPAPGRVSRPRRARRLGRAAQASSPpQRPRRRAA 2688
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1007 KPKGGAAGAVAQLPARPP---PALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAfte 1083
Cdd:PHA03247  2689 RPTVGSLTSLADPPPPPPtpePAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG--- 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1084 dealsqhcqlekplrhPPLPEAAVTMKRPPPYQWDPMLGEDVWVpqERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP 1163
Cdd:PHA03247  2766 ----------------PPAPAPPAAPAAGPPRRLTRPAVASLSE--SRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1164 PKSPSSPTATFPtgygmgmPYPGSYNNPSLP--GVQAPCSPkdaLSQAQFAQQESAVVLQPAYPPSLSyctLPPTYPGSS 1241
Cdd:PHA03247  2828 LPPPTSAQPTAP-------PPPPGPPPPSLPlgGSVAPGGD---VRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRS 2894
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1907125969 1242 TCSSVQLPPIALHPWNSYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPA 1290
Cdd:PHA03247  2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
PHA03378 PHA03378
EBNA-3B; Provisional
890-1292 3.54e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.52  E-value: 3.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  890 VEEVCRPRTRMLCSQNTYTLPGPGSSATLRLTATEKkVPQPCTSatlnrLTVPrYSIPTGDPPPYPEIASQLAQGRSAAQ 969
Cdd:PHA03378   426 IEEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQP-LEGPTGP-----LSVQ-APLEPWQPLPHPQVTPVILHQPPAQG 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  970 RLDNSLIHATLRRNNREVALK-MAQLADssRAPLQPLAKPKG-----GAAGAVAQLPARPPPALYTCSQCSGAGP---SS 1040
Cdd:PHA03378   499 VQAHGSMLDLLEKDDEDMEQRvMATLLP--PSPPQPRAGRRApcvytEDLDIESDEPASTEPVHDQLLPAPGLGPlqiQP 576
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1041 QSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTdyVNSAFTEDEALSQHCQLEKPLRHPPLPEAAVTMK---RPPPY-- 1115
Cdd:PHA03378   577 LTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPT--TQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNvlvFPTPHqp 654
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1116 -QWDPMLGEDVWV-PQERTAQPTVPNPLKLSPLMLGQGQHLDVARVP--FVPPKSPSSPtATFPTGYGMGMPYPGSYNNP 1191
Cdd:PHA03378   655 pQVEITPYKPTWTqIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPtpMRPPAAPPGR-AQRPAAATGRARPPAAAPGR 733
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1192 SLPGVQAPcSPKDALSQAQFAQQESAVVLQPAYPPSLSYCTLPPTYPGsstcssvQLPPIalhpwnsystcpPMQNTQGT 1271
Cdd:PHA03378   734 ARPPAAAP-GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP-------QAPPA------------PQQRPRGA 793
                          410       420
                   ....*....|....*....|.
gi 1907125969 1272 LPPKPhlvveKPLVSPPPAEL 1292
Cdd:PHA03378   794 PTPQP-----PPQAGPTSMQL 809
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1039-1295 1.36e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.68  E-value: 1.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1039 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1107
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1108 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1180
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1181 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1260
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907125969 1261 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1295
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
935-1183 3.65e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 3.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969  935 TLNRLTVPRYSIPTGDPPPYPEIASQLAQGRSAAqrldnslihatlrrnnreVALKMAQLADSSRAPLQPLAKPKGGAAG 1014
Cdd:PRK12323   356 TLLRMLAFRPGQSGGGAGPATAAAAPVAQPAPAA------------------AAPAAAAPAPAAPPAAPAAAPAAAAAAR 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1015 AVAQLPARPPP---ALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSrdrtdyVNSAFTEDEALSQHC 1091
Cdd:PRK12323   418 AVAAAPARRSPapeALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA------AAAPARAAPAAAPAP 491
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969 1092 QLEKPlrhPPLPEAAVTMKRPPPYQWDPMLGEDVW--VPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVPPKSPSS 1169
Cdd:PRK12323   492 ADDDP---PPWEELPPEFASPAPAQPDAAPAGWVAesIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA 568
                          250
                   ....*....|....
gi 1907125969 1170 PTATFPTGYGMGMP 1183
Cdd:PRK12323   569 SASGLPDMFDGDWP 582
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
319-354 4.15e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.12  E-value: 4.15e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907125969  319 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 354
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate- binding domains and more generic ...
321-355 6.94e-04

The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.94e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1907125969   321 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 355
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
WD40 COG2319
WD40 repeat [General function prediction only];
45-163 9.03e-04

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 43.36  E-value: 9.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVW------------IQY 112
Cdd:COG2319    218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWdlatgellrtltGHS 288
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907125969  113 EGRWSVELVNDrGAQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 163
Cdd:COG2319    289 GGVNSVAFSPD-GKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAF 338
WD40 COG2319
WD40 repeat [General function prediction only];
45-163 1.61e-03

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 42.59  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVW--------IQYEGR- 115
Cdd:COG2319    176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWdlatgkllRTLTGHs 246
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907125969  116 ---WSVELVNDrGAQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 163
Cdd:COG2319    247 gsvRSVAFSPD-GRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAF 296
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more ...
320-354 2.24e-03

SOCS box; The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.14  E-value: 2.24e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907125969  320 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 354
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
78-164 3.25e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.17  E-value: 3.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907125969   78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW--------IQYEGR----WSVELVNDrGAQVLFGTADGQVIVMDCHGR 145
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWdletgellRTLKGHtgpvRDVAASAD-GTYLASGSSDKTIRLWDLETG 83
                           90
                   ....*....|....*....
gi 1907125969  146 MLAHVLLHESDGILSMSWN 164
Cdd:cd00200     84 ECVRTLTGHTSYVSSVAFS 102
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.37e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.37e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1907125969    78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH