|
Name |
Accession |
Description |
Interval |
E-value |
| Rav1p_C super family |
cl13644 |
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between ... |
1171-1902 |
1.36e-75 |
|
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose-dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. The actual alignment was detected with superfamily member pfam12234:
Pssm-ID: 432413 Cd Length: 637 Bit Score: 266.36 E-value: 1.36e-75
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1171 LDWVSKEDGSHILTVGVGANIFMYGRLSgivTEQTNSkdgvavitLPlggsikqgvksRWVLLRSIDLVssvDGTPSLPV 1250
Cdd:pfam12234 76 LDWTSTPDSQSILAVGFPHHVLLLTQLR---YDYTNK--------GP-----------SWAPIRKIDIR---DLTPHPIG 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1251 SLSWVRDGILVVGMDCEMHVYAQWkhavkfgdteadssnAEEAAMQDHSTfksnmlaRKSVVEGTAISDDvfcsptviqd 1330
Cdd:pfam12234 131 DSIWLDDGTLVVAAGNQLFIYDKW---------------LDLRLPDDPFT-------LRSIGSRKILSND---------- 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1331 ggLFEAAHVLSPTLPQYHPTQLLELMDLGKVRRAKAILSHLVKCIagevaivrdpdagegtkrhlsrtisvsgstaketv 1410
Cdd:pfam12234 179 --LFHLVSVLNGPLPVYHPQFLIQCLLAGKLELVKEILLRLFKEL----------------------------------- 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1411 tvgKDGTRDYTEIDSIPPLPLYALLAaDQDTSYRISEESTkipqsyedqtvSQPEDQYSELFQiqdiptddidlepekre 1490
Cdd:pfam12234 222 ---KFYSEDLEDLDSFLGIDLEKFLK-DDDKAYSKNKAFT-----------SSSDDDDPDPYE----------------- 269
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1491 nkskvinlsqygpaYFGQEHARVLSSHLMHSSLPGLTRLEQMFLVALADTVATTSTEldesrdkscsgRDTLDECGLRYL 1570
Cdd:pfam12234 270 --------------TFNEEVASSLNEKLTKISLPQLTRHEQITLINVIEAVGEVEKH-----------RRSLDENGARFL 324
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1571 LAMRLHTclltslppLYRVQLLHQGVSTCHFAWAFHSEAEEELINMIPAIQRGDPQWSELRAMGIGWWVRNINTLRRCIE 1650
Cdd:pfam12234 325 LGFKLHL--------LHKKRTSQSSLSWRDISWALHSDNQEILLDLVSRHYGNKLLWEAARESGIFMWLKDIEALRAQFE 396
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1651 KVAKASFQRNN--DALDAALFYLSMKKKAVVWGLFR--SQHDE--KMTTFFSHNFNEDRWRKAALKNAFSLLGKQRFEQS 1724
Cdd:pfam12234 397 VIARNEYTKSDerDPVDCSLFYLALKKKQVLQGLWRmaSWHPEqaKTLKFLSNDFSEPRWRTAALKNAFALLSKHRYEYA 476
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1725 AAFFLLAGSLKDAIEVCLEKMEDIQLAMVIARLYESefETSSTYISILNQKIL-GCQKDGsgfsckrlhpDPFLRSLAYW 1803
Cdd:pfam12234 477 AAFFLLADSLKDAVNVLLRQLKDLQLAIAVARVYEG--DDGPVLRELLEERVLpLAIKEG----------DRWLASWAFW 544
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1804 VMK---DYTRAL----DTLLEQTPKEDDEHQVII-KSC---NPVAFSFYNYLRTHPL-LIRRNLASPEgtlatlglKTEK 1871
Cdd:pfam12234 545 MLKrrdLAVRALvtppYDLLENTDLKKSDPASPVsKSFltdDPALVLLYQQLRKKTLqTLKGALKVTP--------KEEY 616
|
730 740 750
....*....|....*....|....*....|.
gi 291621657 1872 NFVdkinlierklfFTTANAHFKVGCPVLAL 1902
Cdd:pfam12234 617 DFV-----------LRVARIYDRMGCDLLAL 636
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2767-3019 |
1.01e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 1.01e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2767 VKRMTSHPVHQYYLTGAQDGSVRMFEWTRPQQLVCFRQAGNArVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPY 2846
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLE----TGECV 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2847 MSWQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVC 2926
Cdd:cd00200 87 RTLTGHTSYVSSVAFSPDGRILSSSS---RDKTIKVWDV---ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2927 IFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFKSeHakqsifrniGAGVMQIDIIQG 3006
Cdd:cd00200 161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRG-H---------ENGVNSVAFSPD 230
|
250
....*....|....
gi 291621657 3007 NRLF-SCGADGTLK 3019
Cdd:cd00200 231 GYLLaSGSEDGTIR 244
|
|
| WD40 super family |
cl43672 |
WD40 repeat [General function prediction only]; |
45-268 |
3.98e-03 |
|
WD40 repeat [General function prediction only]; The actual alignment was detected with superfamily member COG2319:
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 42.59 E-value: 3.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 45 ECVQIIPGakHGNiQVSCVECSNQQGRIA-ASYGNAVCIFEPLGinshkrncqlkCQWLKTGQFFLSSVtYNLAWDPQDN 123
Cdd:COG2319 195 KLLRTLTG--HTG-AVRSVAFSPDGKLLAsGSADGTVRLWDLAT-----------GKLLRTLTGHSGSV-RSVAFSPDGR 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 124 RLLTATD--SIQLWAPPGDDILEEEEEIDNTVppvlndwkcvwqckTSVSvhlmeWSPDGEYFATAGkDDCLLKVWYPMT 201
Cdd:COG2319 260 LLASGSAdgTVRLWDLATGELLRTLTGHSGGV--------------NSVA-----FSPDGKLLASGS-DDGTVRLWDLAT 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 291621657 202 GWKSSIIPqdhhevkrrqsstqfsfvylAHPRAVTGFSWRktskymPRGsvcNVLLTSCHDGVCRLW 268
Cdd:COG2319 320 GKLLRTLT--------------------GHTGAVRSVAFS------PDG---KTLASGSDDGTVRLW 357
|
|
|
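Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, assuming the summary lines always follow that layout, they could be pulled into a dictionary as below. The function name `parse_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool.

```python
import re

# Pattern modelled on the summary lines shown in the hit tables.
# The "[Multi-domain]" flag is optional: it is absent on the Rav1p_C hit.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s+"
    r"(?:\[(?P<flag>[^\]]+)\]\s+)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    print(parse_summary(
        "Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 1.01e-21"
    ))
```

Returning None for non-matching lines lets the same function be run over every line of the page, keeping only the summaries.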
|
Name |
Accession |
Description |
Interval |
E-value |
| Rav1p_C |
pfam12234 |
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between ... |
1171-1902 |
1.36e-75 |
|
RAVE protein 1 C terminal; This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose-dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Pssm-ID: 432413 Cd Length: 637 Bit Score: 266.36 E-value: 1.36e-75
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1171 LDWVSKEDGSHILTVGVGANIFMYGRLSgivTEQTNSkdgvavitLPlggsikqgvksRWVLLRSIDLVssvDGTPSLPV 1250
Cdd:pfam12234 76 LDWTSTPDSQSILAVGFPHHVLLLTQLR---YDYTNK--------GP-----------SWAPIRKIDIR---DLTPHPIG 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1251 SLSWVRDGILVVGMDCEMHVYAQWkhavkfgdteadssnAEEAAMQDHSTfksnmlaRKSVVEGTAISDDvfcsptviqd 1330
Cdd:pfam12234 131 DSIWLDDGTLVVAAGNQLFIYDKW---------------LDLRLPDDPFT-------LRSIGSRKILSND---------- 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1331 ggLFEAAHVLSPTLPQYHPTQLLELMDLGKVRRAKAILSHLVKCIagevaivrdpdagegtkrhlsrtisvsgstaketv 1410
Cdd:pfam12234 179 --LFHLVSVLNGPLPVYHPQFLIQCLLAGKLELVKEILLRLFKEL----------------------------------- 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1411 tvgKDGTRDYTEIDSIPPLPLYALLAaDQDTSYRISEESTkipqsyedqtvSQPEDQYSELFQiqdiptddidlepekre 1490
Cdd:pfam12234 222 ---KFYSEDLEDLDSFLGIDLEKFLK-DDDKAYSKNKAFT-----------SSSDDDDPDPYE----------------- 269
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1491 nkskvinlsqygpaYFGQEHARVLSSHLMHSSLPGLTRLEQMFLVALADTVATTSTEldesrdkscsgRDTLDECGLRYL 1570
Cdd:pfam12234 270 --------------TFNEEVASSLNEKLTKISLPQLTRHEQITLINVIEAVGEVEKH-----------RRSLDENGARFL 324
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1571 LAMRLHTclltslppLYRVQLLHQGVSTCHFAWAFHSEAEEELINMIPAIQRGDPQWSELRAMGIGWWVRNINTLRRCIE 1650
Cdd:pfam12234 325 LGFKLHL--------LHKKRTSQSSLSWRDISWALHSDNQEILLDLVSRHYGNKLLWEAARESGIFMWLKDIEALRAQFE 396
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1651 KVAKASFQRNN--DALDAALFYLSMKKKAVVWGLFR--SQHDE--KMTTFFSHNFNEDRWRKAALKNAFSLLGKQRFEQS 1724
Cdd:pfam12234 397 VIARNEYTKSDerDPVDCSLFYLALKKKQVLQGLWRmaSWHPEqaKTLKFLSNDFSEPRWRTAALKNAFALLSKHRYEYA 476
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1725 AAFFLLAGSLKDAIEVCLEKMEDIQLAMVIARLYESefETSSTYISILNQKIL-GCQKDGsgfsckrlhpDPFLRSLAYW 1803
Cdd:pfam12234 477 AAFFLLADSLKDAVNVLLRQLKDLQLAIAVARVYEG--DDGPVLRELLEERVLpLAIKEG----------DRWLASWAFW 544
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 1804 VMK---DYTRAL----DTLLEQTPKEDDEHQVII-KSC---NPVAFSFYNYLRTHPL-LIRRNLASPEgtlatlglKTEK 1871
Cdd:pfam12234 545 MLKrrdLAVRALvtppYDLLENTDLKKSDPASPVsKSFltdDPALVLLYQQLRKKTLqTLKGALKVTP--------KEEY 616
|
730 740 750
....*....|....*....|....*....|.
gi 291621657 1872 NFVdkinlierklfFTTANAHFKVGCPVLAL 1902
Cdd:pfam12234 617 DFV-----------LRVARIYDRMGCDLLAL 636
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2767-3019 |
1.01e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 98.18 E-value: 1.01e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2767 VKRMTSHPVHQYYLTGAQDGSVRMFEWTRPQQLVCFRQAGNArVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPY 2846
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGP-VRDVAASADGTYLASGSSDKTIRLWDLE----TGECV 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2847 MSWQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVC 2926
Cdd:cd00200 87 RTLTGHTSYVSSVAFSPDGRILSSSS---RDKTIKVWDV---ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2927 IFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFKSeHakqsifrniGAGVMQIDIIQG 3006
Cdd:cd00200 161 LWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRG-H---------ENGVNSVAFSPD 230
|
250
....*....|....
gi 291621657 3007 NRLF-SCGADGTLK 3019
Cdd:cd00200 231 GYLLaSGSEDGTIR 244
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2767-3019 |
1.79e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 99.99 E-value: 1.79e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2767 VKRMTSHPVHQYYLTGAQDGSVRMFEWTRPQQLVCFrQAGNARVTRLYFNSQGNKCGVADGEGFLSIWQVnqtaSNPKPY 2846
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL-TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDL----ATGKLL 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2847 MSWQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTLispGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVC 2926
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGS---DDGTVRLWDLA---TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2927 IFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFKSEhakqsifrniGAGVMQIDII-Q 3005
Cdd:COG2319 230 LWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGH----------SGGVNSVAFSpD 299
|
250
....*....|....
gi 291621657 3006 GNRLFSCGADGTLK 3019
Cdd:COG2319 300 GKLLASGSDDGTVR 313
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
2932-2970 |
4.43e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.68 E-value: 4.43e-05
10 20 30
....*....|....*....|....*....|....*....
gi 291621657 2932 QRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW 2970
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
2933-2970 |
1.19e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.56 E-value: 1.19e-04
10 20 30
....*....|....*....|....*....|....*...
gi 291621657 2933 RQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW 2970
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-268 |
3.98e-03 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 42.59 E-value: 3.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 45 ECVQIIPGakHGNiQVSCVECSNQQGRIA-ASYGNAVCIFEPLGinshkrncqlkCQWLKTGQFFLSSVtYNLAWDPQDN 123
Cdd:COG2319 195 KLLRTLTG--HTG-AVRSVAFSPDGKLLAsGSADGTVRLWDLAT-----------GKLLRTLTGHSGSV-RSVAFSPDGR 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 124 RLLTATD--SIQLWAPPGDDILEEEEEIDNTVppvlndwkcvwqckTSVSvhlmeWSPDGEYFATAGkDDCLLKVWYPMT 201
Cdd:COG2319 260 LLASGSAdgTVRLWDLATGELLRTLTGHSGGV--------------NSVA-----FSPDGKLLASGS-DDGTVRLWDLAT 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 291621657 202 GWKSSIIPqdhhevkrrqsstqfsfvylAHPRAVTGFSWRktskymPRGsvcNVLLTSCHDGVCRLW 268
Cdd:COG2319 320 GKLLRTLT--------------------GHTGAVRSVAFS------PDG---KTLASGSDDGTVRLW 357
|
|
|
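The paired "gi ..." and "Cdd:..." rows above give the query and the domain model residue by residue. A rough sketch of how one such pair could be compared is given below. It assumes each row ends with a start coordinate, the sequence string, and an end coordinate, and it treats lowercase letters and "-" as unaligned positions, which is how this display appears to mark them. The helper names `parse_row` and `identity` are hypothetical.

```python
def parse_row(row):
    """Split an alignment row into (label, start, sequence, end)."""
    # The label may contain spaces ("gi 291621657"), so take the last
    # three whitespace-separated fields as start, sequence and end.
    *label, start, seq, end = row.split()
    return " ".join(label), int(start), seq, int(end)


def identity(query_seq, model_seq):
    """Count identical residues over columns where both rows are aligned.

    Only columns with an uppercase letter in both rows are counted;
    lowercase letters and '-' are skipped as unaligned (an assumption
    about the display convention).
    """
    if len(query_seq) != len(model_seq):
        raise ValueError("alignment rows must have equal length")
    aligned = identical = 0
    for q, m in zip(query_seq, model_seq):
        if q.isupper() and m.isupper():
            aligned += 1
            identical += q == m
    return identical, aligned


if __name__ == "__main__":
    # The smart00320 hit from the table above.
    _, q_start, q_seq, q_end = parse_row(
        "gi 291621657 2932 QRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW 2970")
    _, m_start, m_seq, m_end = parse_row(
        "Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39")
    same, total = identity(q_seq, m_seq)
    print(f"{same}/{total} identical aligned positions")
```

For hits that span several blocks, the per-block counts can be summed before dividing, since each block covers a disjoint stretch of the alignment.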
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2777-3019 |
5.50e-21 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 98.44 E-value: 5.50e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2777 QYYLTGAQDGSVRMFEWTRPQQLVCFRqAGNARVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPYMSWQCHSKAT 2856
Cdd:COG2319 175 KLLASGSDDGTVRLWDLATGKLLRTLT-GHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA----TGKLLRTLTGHSGSV 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2857 SDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIRQRQLI 2936
Cdd:COG2319 250 RSVAFSPDGRLLASGS---ADGTVRLWDL---ATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLL 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2937 HTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFKsEHakqsifrniGAGVMQIDI-IQGNRLFSCGAD 3015
Cdd:COG2319 324 RTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT-GH---------TGAVTSVAFsPDGRTLASGSAD 393
|
....
gi 291621657 3016 GTLK 3019
Cdd:COG2319 394 GTVR 397
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2765-3020 |
5.46e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.09 E-value: 5.46e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2765 HNVKRMTSHPVHQYYLTGAQDGSVRMFEWTRPQQLVCFRQAgNARVTRLYFNSQGN--KCGVADGEgfLSIWQVNqtasN 2842
Cdd:cd00200 52 GPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGH-TSYVSSVAFSPDGRilSSSSRDKT--IKVWDVE----T 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2843 PKPYMSWQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRK 2922
Cdd:cd00200 125 GKCLTTLRGHTDWVNSVAFSPDGTFVASSS---QDGTIKLWDL---RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSD 198
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2923 GHVCIFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFkSEHAKqsifrnigaGVMQID 3002
Cdd:cd00200 199 GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL-SGHTN---------SVTSLA 268
|
250
....*....|....*....
gi 291621657 3003 IIQ-GNRLFSCGADGTLKT 3020
Cdd:cd00200 269 WSPdGKRLASGSADGTIRI 287
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2807-3019 |
1.88e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 88.55 E-value: 1.88e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2807 NARVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPYMSWQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTl 2886
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLE----TGELLRTLKGHTGPVRDVAASADGTYLASGS---SDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2887 isPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGN 2966
Cdd:cd00200 81 --ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 291621657 2967 IKVWRLTGHGLIHSFKSeHAKQsifrnigagVMQIDII-QGNRLFSCGADGTLK 3019
Cdd:cd00200 159 IKLWDLRTGKCVATLTG-HTGE---------VNSVAFSpDGEKLLSSSSDGTIK 202
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2777-2974 |
9.26e-18 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 88.43 E-value: 9.26e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2777 QYYLTGAQDGSVRMFEWTRPQQLVCFRqAGNARVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPYMSWQCHSKAT 2856
Cdd:COG2319 217 KLLASGSADGTVRLWDLATGKLLRTLT-GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLA----TGELLRTLTGHSGGV 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2857 SDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIRQRQLI 2936
Cdd:COG2319 292 NSVAFSPDGKLLASGS---DDGTVRLWDL---ATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELL 365
|
170 180 190
....*....|....*....|....*....|....*...
gi 291621657 2937 HTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTG 2974
Cdd:COG2319 366 RTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2773-2971 |
2.97e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 82.00 E-value: 2.97e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2773 HPVHQYYLTGAQDGSVRMFEWTRPQQLVCFRQAgNARVTRLYFNSQGNKCGVADGEGFLSIWQVNqtasNPKPYMSWQCH 2852
Cdd:cd00200 102 SPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSPDGTFVASSSQDGTIKLWDLR----TGKCVATLTGH 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2853 SKATSDFAFITSSSLVATSGhsnDNRNVCLWDTLISpgnSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIRQ 2932
Cdd:cd00200 177 TGEVNSVAFSPDGEKLLSSS---SDGTIKLWDLSTG---KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
|
170 180 190
....*....|....*....|....*....|....*....
gi 291621657 2933 RQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWR 2971
Cdd:cd00200 251 GECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2849-3019 |
4.77e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 72.37 E-value: 4.77e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2849 WQCHSKATSDFAFITSSSLVATSGhsnDNRNVCLWDTlisPGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIF 2928
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGS---GDGTIKVWDL---ETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2929 DIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTghglihSFKSEHAkqsiFRNIGAGVMQIDIIQGNR 3008
Cdd:cd00200 79 DLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE------TGKCLTT----LRGHTDWVNSVAFSPDGT 148
|
170
....*....|..
gi 291621657 3009 -LFSCGADGTLK 3019
Cdd:cd00200 149 fVASSSQDGTIK 160
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
2893-3019 |
1.75e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 67.75 E-value: 1.75e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2893 LIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIRQRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRL 2972
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 291621657 2973 TGHGLIHSFKSeHAKqsifrnigaGVMQIDIIQGNR-LFSCGADGTLK 3019
Cdd:cd00200 81 ETGECVRTLTG-HTS---------YVSSVAFSPDGRiLSSSSRDKTIK 118
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2852-3019 |
8.26e-08 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 57.61 E-value: 8.26e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2852 HSKATSDFAFITSSSLVATSGHSNDNRnvcLWDTLispGNSLIHGFTCHDHGATVLQYAPKQQLLISGGRKGHVCIFDIR 2931
Cdd:COG2319 35 LAAAVASLAASPDGARLAAGAGDLTLL---LLDAA---AGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLA 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 2932 QRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVWRLTGHGLIHSFKsEHAkqsifrnigAGVMQIDII-QGNRLF 3010
Cdd:COG2319 109 TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLT-GHS---------GAVTSVAFSpDGKLLA 178
|
....*....
gi 291621657 3011 SCGADGTLK 3019
Cdd:COG2319 179 SGSDDGTVR 187
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
2932-2970 |
4.43e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 42.68 E-value: 4.43e-05
10 20 30
....*....|....*....|....*....|....*....
gi 291621657 2932 QRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW 2970
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
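The paired rows of the smart00320 hit above (query `gi 291621657` over 2932-2970, and the `Cdd:smart00320` consensus) can be compared column by column. The following is a minimal Python sketch, not part of the search output: the function name `aligned_identity` is hypothetical, and it assumes that lowercase residues and `-` characters mark unaligned or gapped columns, as they appear to in the longer alignments in this listing.

```python
def aligned_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for one pair of alignment rows.

    Assumption: uppercase letters are aligned residues; lowercase letters
    and '-' mark unaligned or gapped columns and are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # '-' and lowercase characters both fail isupper(), so one test
        # covers gaps and unaligned residues.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the smart00320 hit (residue strings only).
query = "QRQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW"
subject = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW"

identical, aligned = aligned_identity(query, subject)
print(f"{identical} of {aligned} aligned columns are identical")
```

The count is a plain identity tally over aligned columns. It is not the bit score reported for the hit, which comes from the position-specific scoring matrix.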
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
2933-2970 |
1.19e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.56 E-value: 1.19e-04
10 20 30
....*....|....*....|....*....|....*...
gi 291621657 2933 RQLIHTFQAHDSAIKALALDPYEEYFTTGSAEGNIKVW 2970
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
45-268 |
3.98e-03 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 42.59 E-value: 3.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 45 ECVQIIPGakHGNiQVSCVECSNQQGRIA-ASYGNAVCIFEPLGinshkrncqlkCQWLKTGQFFLSSVtYNLAWDPQDN 123
Cdd:COG2319 195 KLLRTLTG--HTG-AVRSVAFSPDGKLLAsGSADGTVRLWDLAT-----------GKLLRTLTGHSGSV-RSVAFSPDGR 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 291621657 124 RLLTATD--SIQLWAPPGDDILEEEEEIDNTVppvlndwkcvwqckTSVSvhlmeWSPDGEYFATAGkDDCLLKVWYPMT 201
Cdd:COG2319 260 LLASGSAdgTVRLWDLATGELLRTLTGHSGGV--------------NSVA-----FSPDGKLLASGS-DDGTVRLWDLAT 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 291621657 202 GWKSSIIPqdhhevkrrqsstqfsfvylAHPRAVTGFSWRktskymPRGsvcNVLLTSCHDGVCRLW 268
Cdd:COG2319 320 GKLLRTLT--------------------GHTGAVRSVAFS------PDG---KTLASGSDDGTVRLW 357
|
|
|
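Every hit in this listing carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` and an interval such as `2773-2971`. The sketch below shows one way these could be pulled into structured values. It is an illustration only: the names `parse_summary` and `interval_length` are hypothetical, and the pattern is written against the lines as they appear in this text dump, not against any official output format.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The bracketed tag (e.g. "[Multi-domain]") is treated as optional,
# since the Rav1p_C hit has no such tag.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "tag": m.group("tag"),  # None when no bracketed tag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def interval_length(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1


hit = parse_summary(
    "Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 82.00 E-value: 2.97e-16"
)
print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
print(interval_length("2773-2971"))
```

Parsed this way, hits could be sorted by E-value or filtered by interval, which is useful here because several WD40 hits overlap the same query region near residues 2773-3019.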