Conserved domains on  [gi|1720391008|ref|XP_030105892|]

tubby-related protein 4 isoform X2 [Mus musculus]

Protein Classification

WD40 and Tub domain-containing protein (domain architecture ID 11455620)

protein containing domains WD40, SOCS, PHA03247, and Tub

List of domain hits

Name Accession Description Interval E-value
Tub super family cl08308
Tub family;
1470-1540 1.46e-23

Tub family;


The actual alignment was detected with superfamily member pfam01167:

Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.46e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391008 1470 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1540
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
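
Each hit carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) followed by alignment rows whose model coordinates show which part of the domain model was matched. The following is a minimal sketch, not an NCBI tool: it parses such a summary line and estimates what fraction of the model the alignment spans. The names `parse_summary` and `model_coverage` are hypothetical helpers introduced only for illustration, and the regular expression assumes the exact field layout shown on this page.

```python
import re

# Assumed layout, copied from this page:
# "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 6.79e-11"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict of typed fields (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the alignment (inclusive coordinates)."""
    return (cd_to - cd_from + 1) / cd_length


# Tub hit above: model positions 174-247 of a 251-residue model.
hit = parse_summary("Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.46e-23")
coverage = model_coverage(174, 247, hit["cd_length"])
print(hit, f"coverage={coverage:.3f}")
```

For the Tub hit, the alignment spans only the C-terminal portion of the pfam01167 model, which is worth keeping in mind when reading the low E-value.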
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 6.79e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 6.79e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319    176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319    246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
                          170
                   ....*....|...
gi 1720391008  205 LLHESDGILSMSW 217
Cdd:COG2319    326 LTGHTGAVRSVAF 338
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1006-1337 2.85e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1006 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1085
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1086 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1165
Cdd:PHA03247  2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1166 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1240
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1241 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1310
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1720391008 1311 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1337
Cdd:PHA03247  2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
SOCS super family cl02533
SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region ...
373-408 3.64e-04

SOCS (suppressors of cytokine signaling) box. The SOCS box is found in the C-terminal region of CIS/SOCS family proteins (in combination with a SH2 domain), ASBs (ankyrin repeat-containing proteins with a SOCS box), SSBs (SPRY domain-containing proteins with a SOCS box), and WSBs (WD40 repeat-containing proteins with a SOCS box), as well as other miscellaneous proteins. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.


The actual alignment was detected with superfamily member cd03717:

Pssm-ID: 470605  Cd Length: 39  Bit Score: 39.50  E-value: 3.64e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720391008  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
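
The two rows of an alignment block can be compared column by column to see how closely the query matches the domain model. The sketch below is an illustration, not part of the CD-Search output: it counts identical residues over the gap-free columns of the SOCS box alignment above. The function name `alignment_identity` is hypothetical, and the code assumes that `-` marks a gap and that lowercase letters mark query residues left unaligned to the model.

```python
def alignment_identity(query_row, model_row):
    """Return (identical, aligned) over columns where neither row has a gap."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # skip gap columns (assumed to include lowercase insertions)
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return identical, aligned


# Rows copied from the cd03717 alignment above (query residues 373-408).
query = "RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL"
model = "SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL"

identical, aligned = alignment_identity(query, model)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Percent identity computed this way is only a rough guide. The bit score and E-value reported by CD-Search come from position-specific scoring, which this sketch does not reproduce.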
 
Name Accession Description Interval E-value
Tub pfam01167
Tub family;
1470-1540 1.46e-23

Tub family;


Pssm-ID: 460094  Cd Length: 251  Bit Score: 101.50  E-value: 1.46e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391008 1470 VMANKQPLWNEATQVYQLDFGGRVTQESAKNFQI---ELEGRQVMQFGRIDGNAYILDFQYPFSAVQAFAVALA 1540
Cdd:pfam01167  174 VLKNKPPRWNEQLQCYCLNFHGRVTVASVKNFQLvapEDQDKVILQFGKVGKDMFTMDYRYPLSAFQAFAICLS 247
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 6.79e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 6.79e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   45 WLATGNGRGVVGVtftsshcrRDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319    176 LLASGSDDGTVRL--------WDLATGKLL-RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLW-DLATGKLLRTLTGH 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319    246 SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
                          170
                   ....*....|...
gi 1720391008  205 LLHESDGILSMSW 217
Cdd:COG2319    326 LTGHTGAVRSVAF 338
PHA03247 PHA03247
large tegument protein UL36; Provisional
1006-1337 2.85e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1006 ADSSRAPLQPLAKPKGGAAGAVAQLPARPPpalytcSQCSGAGPSSQSGAALAHAISTSPlasqssynllsPPDTSRDRT 1085
Cdd:PHA03247  2564 PDRSVPPPRPAPRPSEPAVTSRARRPDAPP------QSARPRAPVDDRGDPRGPAPPSPL-----------PPDTHAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1086 DYVNSAFTEDEALSQHcQLEKPLRHPPLPEAAVTMKRPPPYQWDPMLGEDVWVPQERTAQPTVPNPLklsplmlgqGQHL 1165
Cdd:PHA03247  2627 PPPSPSPAANEPDPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV---------GSLT 2696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1166 DVARVPFVPPKSPSSPTATFPTGYGMGMPYPGSYNNPSLPGVQAPCSPK-----DALSQAQFAQQESAVVLQPAyPPSLS 1240
Cdd:PHA03247  2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPA-PPAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1241 YCTLPPTYPGSSTCSSVQLPPIALHPWN----------SYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPAELQSHMGTE 1310
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDpadppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1720391008 1311 VM----------------VETADNFQEVLSLTESPVPQRTEKF 1337
Cdd:PHA03247  2856 VApggdvrrrppsrspaaKPAAPARPPVRRLARPAVSRSTESF 2898
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
78-218 7.76e-07

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 52.72  E-value: 7.76e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ 157
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG 83
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391008  158 R-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV-----------MDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:cd00200     84 EcvrtltgH-------TSYVSSVAFSPDGRILSSSSRDKTIKVwdvetgkclttLRGHTDWVNSVAFSPDGTFVASSSQ 155
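
The cd00200 description says a WD40 repeat typically has a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus. As a rough illustration only, the sketch below scans an ungapped sequence for GH followed by WD within a spacing window. The 10-30 residue window and the name `find_gh_wd` are assumptions made for this example and are not taken from CDD. Real WD40 detection relies on profile models, because many genuine repeats lack one or both dipeptides.

```python
import re

# Assumed spacing: 10-30 residues between the GH and WD dipeptides.
GH_WD_RE = re.compile(r"GH.{10,30}?WD")


def find_gh_wd(alignment_row):
    """Return (start, end) spans of GH...WD matches in the ungapped, uppercased row."""
    cleaned = alignment_row.replace("-", "").upper()
    return [(m.start(), m.end()) for m in GH_WD_RE.finditer(cleaned)]


# First rows of the cd00200 alignment above: model consensus and query.
model_row = "LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLETG"
query_row = "LRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVNDRGAqVSDFTWSHDGTQALISYRDGFVLVGSVSGQ"

model_hits = find_gh_wd(model_row)
query_hits = find_gh_wd(query_row)
print("model:", model_hits)
print("query:", query_hits)
```

The model consensus contains the canonical dipeptide pair, while the query segment has GH but no WD in this stretch. That fits the modest E-values of the WD40 hits and shows why a simple motif scan is no substitute for the profile search.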
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1050-1306 1.04e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1050 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1118
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1119 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1191
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1192 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1271
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1720391008 1272 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1306
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
146-219 2.97e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 2.97e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720391008  146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894   16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
373-408 3.64e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.50  E-value: 3.64e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720391008  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate-binding domains and more generic ...
375-409 6.15e-04

The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.15e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1720391008   375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more ...
374-408 1.91e-03

SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.53  E-value: 1.91e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720391008  374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.43e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.43e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720391008    78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
WD40 COG2319
WD40 repeat [General function prediction only];
45-217 1.87e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 64.93  E-value: 1.87e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   45 WLATGNGRGVVGVTftsshcrrDRSTPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319    218 LLASGSADGTVRLW--------DLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLW-DLATGELLRTLTGH 287
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHV 204
Cdd:COG2319    288 SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
                          170
                   ....*....|...
gi 1720391008  205 LLHESDGILSMSW 217
Cdd:COG2319    368 LTGHTGAVTSVAF 380
WD40 COG2319
WD40 repeat [General function prediction only];
45-193 2.73e-07

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 54.92  E-value: 2.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   45 WLATGNGRGVVGVtftsshcrRDRSTPQRINFnLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDR 124
Cdd:COG2319    260 LLASGSADGTVRL--------WDLATGELLRT-LTGHSGGVNSVAFSPDGKLLASGSDDGTVRLW-DLATGKLLRTLTGH 329
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720391008  125 GAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:COG2319    330 TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
PHA03247 PHA03247
large tegument protein UL36; Provisional
949-1301 3.40e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 3.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  949 RLTVPRYSIPTGDPPPYP------EIASQLAQGRSAA----QRLDNSLIHATLRRNNREVALKMAQLADSS-RAPLQPLA 1017
Cdd:PHA03247  2609 RGPAPPSPLPPDTHAPDPpppspsPAANEPDPHPPPTvpppERPRDDPAPGRVSRPRRARRLGRAAQASSPpQRPRRRAA 2688
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1018 KPKGGAAGAVAQLPARPP---PALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAfte 1094
Cdd:PHA03247  2689 RPTVGSLTSLADPPPPPPtpePAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAG--- 2765
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1095 dealsqhcqlekplrhPPLPEAAVTMKRPPPYQWDPMLGEDVWVpqERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP 1174
Cdd:PHA03247  2766 ----------------PPAPAPPAAPAAGPPRRLTRPAVASLSE--SRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1175 PKSPSSPTATFPtgygmgmPYPGSYNNPSLP--GVQAPCSPkdaLSQAQFAQQESAVVLQPAYPPSLSyctLPPTYPGSS 1252
Cdd:PHA03247  2828 LPPPTSAQPTAP-------PPPPGPPPPSLPlgGSVAPGGD---VRRRPPSRSPAAKPAAPARPPVRR---LARPAVSRS 2894
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1720391008 1253 TCSSVQLPPIALHPWNSYSTCPPMQNTQGTLPPKPHLVVEKPLVSPPPA 1301
Cdd:PHA03247  2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL 2943
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
67-218 1.72e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 48.49  E-value: 1.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   67 DRSTPQRINfNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiqyEGRWSVELVNDRG--AQVSDFTWSHDGTQALISY 144
Cdd:cd00200     79 DLETGECVR-TLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW---DVETGKCLTTLRGhtDWVNSVAFSPDGTFVASSS 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  145 RDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:cd00200    155 QDGTIKLWDLRTGKcvatltgH-------TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAF 227

                   .
gi 1720391008  218 N 218
Cdd:cd00200    228 S 228
PHA03378 PHA03378
EBNA-3B; Provisional
901-1303 2.96e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.91  E-value: 2.96e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  901 VEEVCRPRTRMLCSQNTYTLPGPGSSATLRLTATEKkVPQPCTSatlnrLTVPrYSIPTGDPPPYPEIASQLAQGRSAAQ 980
Cdd:PHA03378   426 IEEEHRKKKAARTEQPRATPHSQAPTVVLHRPPTQP-LEGPTGP-----LSVQ-APLEPWQPLPHPQVTPVILHQPPAQG 498
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  981 RLDNSLIHATLRRNNREVALK-MAQLADssRAPLQPLAKPKG-----GAAGAVAQLPARPPPALYTCSQCSGAGP---SS 1051
Cdd:PHA03378   499 VQAHGSMLDLLEKDDEDMEQRvMATLLP--PSPPQPRAGRRApcvytEDLDIESDEPASTEPVHDQLLPAPGLGPlqiQP 576
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1052 QSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTdyVNSAFTEDEALSQHCQLEKPLRHPPLPEAAVTMK---RPPPY-- 1126
Cdd:PHA03378   577 LTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPT--TQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNvlvFPTPHqp 654
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1127 -QWDPMLGEDVWV-PQERTAQPTVPNPLKLSPLMLGQGQHLDVARVP--FVPPKSPSSPtATFPTGYGMGMPYPGSYNNP 1202
Cdd:PHA03378   655 pQVEITPYKPTWTqIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPtpMRPPAAPPGR-AQRPAAATGRARPPAAAPGR 733
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1203 SLPGVQAPcSPKDALSQAQFAQQESAVVLQPAYPPSLSYCTLPPTYPGsstcssvQLPPIalhpwnsystcpPMQNTQGT 1282
Cdd:PHA03378   734 ARPPAAAP-GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP-------QAPPA------------PQQRPRGA 793
                          410       420
                   ....*....|....*....|.
gi 1720391008 1283 LPPKPhlvveKPLVSPPPAEL 1303
Cdd:PHA03378   794 PTPQP-----PPQAGPTSMQL 809
WD40 COG2319
WD40 repeat [General function prediction only];
65-218 3.35e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.98  E-value: 3.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWiQYEGRWSVELVNDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319     61 LLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLW-DLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGS 139
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720391008  145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWN 218
Cdd:COG2319    140 ADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS 213
WD40 COG2319
WD40 repeat [General function prediction only];
65-217 4.62e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.60  E-value: 4.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   65 RRDRSTPQRINFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRWSVELVnDRGAQVSDFTWSHDGTQALISY 144
Cdd:COG2319     19 ALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASAS 97
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720391008  145 RDGFVLVGSVSGQRHWSSEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSW 217
Cdd:COG2319     98 ADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1050-1306 1.04e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1050 SSQSGAALAHAISTSPLASQSSYNLLSPPDTSRDRTDYVNSAFTEDEALSQHCQLEKPLRHPP-----------LPEAAV 1118
Cdd:pfam03154  156 SDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPnqtqstaaphtLIQQTP 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1119 TMKRP------PPYQWDPMLGEDVWVPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVP-PKSPSSPTATFPTGYGM 1191
Cdd:pfam03154  236 TLHPQrlpsphPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSP 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1192 GMPYPgSYNNPSLPGVQapcspkdalSQAQFAQQESAVVLQPAyPPSLSYCTLPPTYPGSstcssvQLPPIALHPWNSYS 1271
Cdd:pfam03154  316 AAPGQ-SQQRIHTPPSQ---------SQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIP------QLPNPQSHKHPPHL 378
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1720391008 1272 TCPPMQNTQGTLPPKPHLvveKPLVS----------PPPAELQSH 1306
Cdd:pfam03154  379 SGPSPFQMNSNLPPPPAL---KPLSSlsthhppsahPPPLQLMPQ 420
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
146-219 2.97e-04

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 41.11  E-value: 2.97e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720391008  146 DGFVLVGSVSGQRHWS-SEINLESQITCGIWTPDDQQVLFGTADGQVIVMDCHGRMLAHVLLHESDGILSMSWNY 219
Cdd:pfam12894   16 DGELLLHRLNWQRVWTlSPDKEDLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGSDLITCLGWGE 90
SOCS_SOCS_like cd03717
SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of ...
373-408 3.64e-04

SOCS (suppressors of cytokine signaling) box of SOCS-like proteins. The CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. These intracellular proteins regulate the responses of immune cells to cytokines. Identified as negative regulators of the cytokine-JAK-STAT pathway, they seem to play a role in many immunological and pathological processes. The function of the SOCS box is the recruitment of the ubiquitin-transferase system. Related SOCS boxes are also present in Rab40-like proteins and insect proteins of unknown function that also contain a NEUZ (domain in neuralized proteins) domain.


Pssm-ID: 239687  Cd Length: 39  Bit Score: 39.50  E-value: 3.64e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720391008  373 RVSSLQLLCQQAIASTLREDKdVNKLTLPPRLCSYL 408
Cdd:cd03717      2 SVRSLQHLCRFVIRQCTRRDL-IDQLPLPRRLKDYL 36
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
946-1194 3.71e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 3.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  946 TLNRLTVPRYSIPTGDPPPYPEIASQLAQGRSAAqrldnslihatlrrnnreVALKMAQLADSSRAPLQPLAKPKGGAAG 1025
Cdd:PRK12323   356 TLLRMLAFRPGQSGGGAGPATAAAAPVAQPAPAA------------------AAPAAAAPAPAAPPAAPAAAPAAAAAAR 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1026 AVAQLPARPPP---ALYTCSQCSGAGPSSQSGAALAHAISTSPLASQSSYNLLSPPDTSrdrtdyVNSAFTEDEALSQHC 1102
Cdd:PRK12323   418 AVAAAPARRSPapeALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAA------AAAPARAAPAAAPAP 491
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008 1103 QLEKPlrhPPLPEAAVTMKRPPPYQWDPMLGEDVW--VPQERTAQPTVPNPLKLSPLMLGQGQHLDVARVPFVPPKSPSS 1180
Cdd:PRK12323   492 ADDDP---PPWEELPPEFASPAPAQPDAAPAGWVAesIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRA 568
                          250
                   ....*....|....
gi 1720391008 1181 PTATFPTGYGMGMP 1194
Cdd:PRK12323   569 SASGLPDMFDGDWP 582
SOCS_box smart00969
The SOCS box acts as a bridge between specific substrate-binding domains and more generic ...
375-409 6.15e-04

The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases;


Pssm-ID: 198037  Cd Length: 34  Bit Score: 38.54  E-value: 6.15e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1720391008   375 SSLQLLCQQAIASTLredKDVNKLTLPPRLCSYLS 409
Cdd:smart00969    1 RSLQHLCRLAIRRSL---GGIDKLPLPPRLKDYLL 32
SOCS_box pfam07525
SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more ...
374-408 1.91e-03

SOCS box; The SOCS box acts as a bridge between specific substrate-binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.


Pssm-ID: 462192  Cd Length: 39  Bit Score: 37.53  E-value: 1.91e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720391008  374 VSSLQLLCQQAIASTL--REDKDVNKLTLPPRLCSYL 408
Cdd:pfam07525    2 PRSLQHLCRLAIRRALgkRRLGAIDKLPLPPLLKDYL 38
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
53-217 2.68e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.55  E-value: 2.68e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008   53 GVVGVTFTSSH-----CRRDRS-------TPQRInFNLRGHNSEVVLVRWNEPYQKLATCDADGGIFVWIQYEGRwSVEL 120
Cdd:cd00200     95 YVSSVAFSPDGrilssSSRDKTikvwdveTGKCL-TTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK-CVAT 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720391008  121 VNDRGAQVSDFTWSHDGTQALISYRDGFVLVGSVSGQR-------HwsseinlESQITCGIWTPDDQQVLFGTADGQVIV 193
Cdd:cd00200    173 LTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKclgtlrgH-------ENGVNSVAFSPDGYLLASGSEDGTIRV 245
                          170       180
                   ....*....|....*....|....*
gi 1720391008  194 MD-CHGRMLAHVLLHESdGILSMSW 217
Cdd:cd00200    246 WDlRTGECVQTLSGHTN-SVTSLAW 269
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
78-109 8.43e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.43e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1720391008    78 LRGHNSEVVLVRWNEPYQKLATCDADGGIFVW 109
Cdd:smart00320    8 LKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
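
The per-hit statistics lines in this report (Pssm-ID, Cd Length, Bit Score, E-value) follow a regular layout, so they can be checked against the E-value threshold listed in the search parameters. The following is a minimal Python sketch, not an NCBI tool: the names `parse_stats` and `passes_threshold` are hypothetical, the regular expression is inferred only from the lines shown in this report, and treating the threshold as inclusive is an assumption.

```python
import re

# Layout inferred from the statistics lines in this report, e.g.
# "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.98  E-value: 3.35e-05"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 35.75  E-value: 8.43e-03"
    hit = parse_stats(line)
    print(hit, passes_threshold(hit))
```

The weakest hit listed above (smart00320, E-value 8.43e-03) still falls under the 0.01 threshold, which is consistent with it appearing in the list.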

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.