|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1135-1548 |
5.00e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.00 E-value: 5.00e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1135 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1214
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1215 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1293
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1294 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1371
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1372 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1451
Cdd:COG2319 240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1452 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1531
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
|
410
....*....|....*..
gi 2217304991 1532 TDGQTVLSGDKDGLVAV 1548
Cdd:COG2319 382 PDGRTLASGSADGTVRL 398
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
603-995 |
8.10e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 153.53 E-value: 8.10e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 603 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 682
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 683 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 762
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 763 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 841
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 842 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 917
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 918 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVY-----------IGHSEPVQAVAFSPDQQQVL 984
Cdd:COG2319 310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWdlatgellrtlTGHTGAVTSVAFSPDGRTLA 388
|
410
....*....|...
gi 2217304991 985 SAGD--AVFLWDV 995
Cdd:COG2319 389 SGSAdgTVRLWDL 401
|
|
| CFA20_dom |
pfam05018 |
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ... |
13-190 |
2.73e-34 |
|
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.73e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2217304991 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1298-1602 |
7.48e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.51 E-value: 7.48e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1298 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1374
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1375 gsgassVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDG 1454
Cdd:cd00200 88 ------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1455 SVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFSTD 1533
Cdd:cd00200 158 TIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1534 GQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1602
Cdd:cd00200 231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-988 |
2.27e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 2.27e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304991 938 PEPCPSLTLSEDARFLL------------IAAGRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD 988
Cdd:cd00200 219 ENGVNSVAFSPDGYLLAsgsedgtirvwdLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
3.35e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.16 E-value: 3.35e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304991 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
2.41e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 2.41e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217304991 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1424-1742 |
2.69e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 2.69e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1424 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1503
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1504 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1581
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1582 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1661
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1662 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1739
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2217304991 1740 WEV 1742
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1422-1459 |
3.44e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.44e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304991 1422 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1459
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1422-1459 |
6.16e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.64 E-value: 6.16e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304991 1422 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1459
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.11e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.11e-04
10 20 30
....*....|....*....|....*....|..
gi 2217304991 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
958-994 |
3.89e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 3.89e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2217304991 958 GRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 994
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDdgTVKVWD 39
|
|
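Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and an Interval cell such as 1135-1548. The following is a minimal sketch of how those two fields could be read programmatically. The names `SUMMARY_RE`, `parse_summary` and `interval_length` are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the pattern only assumes the line layout seen in this listing.

```python
import re

# Matches summary lines as they appear in this listing, e.g.
# "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.00 E-value: 5.00e-41"
# The bracketed tag is optional (the CFA20_dom hit has none).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # None when no [..] tag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def interval_length(interval):
    """Number of query residues spanned by an inclusive 'start-end' interval."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 "
        "Bit Score: 157.00 E-value: 5.00e-41"
    )
    print(hit)
    print(interval_length("1135-1548"))
```

Collecting the parsed dicts into a list makes it straightforward to sort or filter hits by E-value or bit score.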
|
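The pairwise alignments above place the query row (gi 2217304991) over the domain-model row (Cdd:...), with dashes marking gaps. As a rough illustration, the sketch below counts identical residues over the aligned columns of one such pair. It assumes that lowercase residues and dashes denote unaligned positions, which is how the rows in this listing appear to be laid out; `count_identities` is a hypothetical helper, not an NCBI function.

```python
def count_identities(query_row, model_row):
    """Return (identical, aligned) column counts for two equal-length rows.

    Columns where either row has a gap ('-') or a lowercase (assumed
    unaligned) residue are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue  # gap or unaligned residue: not a scored column
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the smart00320 hit at query positions 1422-1459.
    query = "GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW"
    model = "GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW"
    identical, aligned = count_identities(query, model)
    print(f"{identical}/{aligned} identical aligned columns")
```

For a hit that spans several alignment blocks, the rows of each block would need to be concatenated before calling the function.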
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1168-1601 |
1.53e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 146.98 E-value: 1.53e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1168 QVLASASGRSSTTAHCQIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhDGRTLALWGTATYDLVSSTRLPE 1247
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAG-DLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1248 PVHGVAFNPWDAGELTCVGQGTVTFWLLQQRGADISLQVRREPVpeavgagelTSLCYgAP--PLLYCGTSSGQVCVWDT 1325
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV---------RSVAF-SPdgKTLASGSADGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1326 RAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRckgsgassVFMEHelvlDGAVVSASFddSVD 1403
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLR--------TLTGH----TGAVRSVAF--SPD 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1404 mG---VVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCL 1480
Cdd:COG2319 216 -GkllASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSV 294
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1481 AWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVL 1560
Cdd:COG2319 295 AFS------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTL 368
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 2217304991 1561 SDHQGaPISTICVTckecedlgveGTDLWLA-ASGDQRVSVW 1601
Cdd:COG2319 369 TGHTG-AVTSVAFS----------PDGRTLAsGSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
634-995 |
4.42e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 136.58 E-value: 4.42e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 634 SLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAP 712
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLgLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 713 VLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTC 792
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 793 HRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvm 871
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAV--------RSVAFSPDGKLLA------------ 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 872 gSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 942
Cdd:COG2319 221 -SGSADGTVRL----WDLATGKLlrtltghsGSVRSVAFSPD--GRLLASGSADGTVRLwDLATGELLRTLTG-HSGGVN 292
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217304991 943 SLTLSEDARFLL------------IAAGRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDV 995
Cdd:COG2319 293 SVAFSPDGKLLAsgsddgtvrlwdLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDdgTVRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
386-818 |
3.30e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.11 E-value: 3.30e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 386 LWTPDGAAVVYPCHAVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAQARAPSVMRLWDFQTGRCLCLFRSPMHV 465
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 466 VCSLSFSDSGALLCGVGKDHHgrtmVVAWGTgqvgLGGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRL 543
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGT----VRLWDL----ATGLLLRTLTGHTG-AVRS--VAFSpDGKTLASGSAdGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 544 RGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqRMVVRHARRLLPTRTpggphpqkqt 623
Cdd:COG2319 150 ATGKL----LRTLTGHSGAVTSVAF--SPDG------KLLASGSDDGTV------RLWDLATGKLLRTLT---------- 201
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 624 fssGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSR 700
Cdd:COG2319 202 ---GHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 701 VYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFS 780
Cdd:COG2319 279 ELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD 358
|
410 420 430
....*....|....*....|....*....|....*...
gi 2217304991 781 LEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSL 818
Cdd:COG2319 359 LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-988 |
2.27e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 2.27e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304991 938 PEPCPSLTLSEDARFLL------------IAAGRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD 988
Cdd:cd00200 219 ENGVNSVAFSPDGYLLAsgsedgtirvwdLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
3.35e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.16 E-value: 3.35e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304991 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
304-741 |
4.46e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.78 E-value: 4.46e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 304 VSLSQERSDASNADGPGFHSLEPWAQLEASDIHTAAAGTHVLTHESAEVPVARTGSCEGFLPDPVLRLKGVIGFGGHGTR 383
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 384 QAL-WTPDGAAVVYPCHAVIVVLL-VDTGEQRFFL-GHTDKVSALALDGSSSLLASAQAraPSVMRLWDFQTGRCLCLFR 460
Cdd:COG2319 82 LSVaFSPDGRLLASASADGTVRLWdLATGLLLRTLtGHTGAVRSVAFSPDGKTLASGSA--DGTVRLWDLATGKLLRTLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 461 SPMHVVCSLSFSDSGallcgvgkdhhgRTMVVAWGTGQVGL----GGEVVVLAKAHTDFdvqAFRVTF-FDETRMASCGQ 535
Cdd:COG2319 160 GHSGAVTSVAFSPDG------------KLLASGSDDGTVRLwdlaTGKLLRTLTGHTGA---VRSVAFsPDGKLLASGSA 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 536 -GSVRLWRLRGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTP 614
Cdd:COG2319 225 dGTVRLWDLATGKL----LRTLTGHSGSVRSVAF--SPDG------RLLASGSADGTV--------------RLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 615 ggphpQKQTFSSGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGH 691
Cdd:COG2319 279 -----ELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGT 353
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 2217304991 692 LGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLAT 741
Cdd:COG2319 354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1149-1508 |
2.14e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 111.27 E-value: 2.14e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1149 QHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhdGRT 1228
Cdd:cd00200 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGT------IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1229 LALW----GTATYDLVSSTrlpEPVHGVAFNPwdageltcvgqgtvtfwllqqrgadislqvrrepvpeavgageltslc 1304
Cdd:cd00200 75 IRLWdletGECVRTLTGHT---SYVSSVAFSP------------------------------------------------ 103
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1305 ygAPPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFSGSR--LVSGSSTGRLRLWAVgavSELRCKGsgassVF 1382
Cdd:cd00200 104 --DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGtfVASSSQDGTIKLWDL---RTGKCVA-----TL 173
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1383 MEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALA 1462
Cdd:cd00200 174 TGHT----GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLR 249
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 2217304991 1463 SMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFS 1508
Cdd:cd00200 250 TGECVQTLSGHTNSVTSLAWS------PDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1323-1744 |
9.64e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 111.93 E-value: 9.64e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1323 WDTRAGRCFLSWEADDGGIGLLLFSGSRLVSGSSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSV 1402
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1403 dmgvvgttagTLWFVswAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAW 1482
Cdd:COG2319 103 ----------RLWDL--ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1483 SppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSD 1562
Cdd:COG2319 171 S------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTG 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1563 HQGAPISticvtckecedLGVEGTDLWLA-ASGDQRVSVWasdwlrnhcelvDWLSFPMPATTETQGHLPPSLaAFCPwD 1641
Cdd:COG2319 245 HSGSVRS-----------VAFSPDGRLLAsGSADGTVRLW------------DLATGELLRTLTGHSGGVNSV-AFSP-D 299
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1642 GALLMYVGPGvyKEVIIYNLCQKQVVEKIPLPFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHL 1719
Cdd:COG2319 300 GKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAvRSVAFSPDGKTLASGSDDGTVRLWDLATGELlRTLTGHTGAVTS 377
|
410 420
....*....|....*....|....*.
gi 2217304991 1720 CRFTPSARLLFTAAR-NEILVWEVPG 1744
Cdd:COG2319 378 VAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
626-882 |
1.25e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.96 E-value: 1.25e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 626 SGPGIAISSLSVSPAMcAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHM 704
Cdd:cd00200 51 TGPVRDVAASADGTYL-ASGSSDKTIRLWDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 705 LARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 784
Cdd:cd00200 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 785 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgps 864
Cdd:cd00200 210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIR-------VWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLA----- 277
|
250
....*....|....*...
gi 2217304991 865 rctvtvmgSASLDELLRV 882
Cdd:cd00200 278 --------SGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1355-1744 |
1.44e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 1.44e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1355 SSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSK 1434
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1435 VNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAM 1514
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS------PDGKTLASGSADGTVRLWDLATGKL 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1515 ELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedlgVEGTdlWLA-AS 1593
Cdd:COG2319 155 LRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG-AVRSVAFS--------PDGK--LLAsGS 223
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1594 GDQRVSVWasDWLRNHCelvdwlsfpmpaTTETQGHLPPSLA-AFCPwDGALLmyVGPGVYKEVIIYNLCQKQVVEKIPL 1672
Cdd:COG2319 224 ADGTVRLW--DLATGKL------------LRTLTGHSGSVRSvAFSP-DGRLL--ASGSADGTVRLWDLATGELLRTLTG 286
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304991 1673 PFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMG-TAQDFAGHDNAVHLCRFTPSARLLFTAAR-NEILVWEVPG 1744
Cdd:COG2319 287 HSGGvNSVAFSPDGKLLASGSDDGTVRLWDLATGkLLRTLTGHTGAVRSVAFSPDGKTLASGSDdGTVRLWDLAT 361
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
841-1365 |
1.88e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.07 E-value: 1.88e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 841 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAalGHLLVSTSSNRVV 920
Cdd:COG2319 25 LGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPD--GRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 921 VL-DAVSGRIIRELPGvHPEPCPSLTLSEDARFLL------------IAAGRTIKVYIGHSEPVQAVAFSPDQQQVLSAG 987
Cdd:COG2319 103 RLwDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLAsgsadgtvrlwdLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 988 D--AVFLWDVLAptesdqsfpgappacktgpgagpledaasraselprqqvpkpcqaspprlGVCARPPEGGDGArdtrn 1065
Cdd:COG2319 182 DdgTVRLWDLAT--------------------------------------------------GKLLRTLTGHTGA----- 206
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1066 sgaprttylASCKAFTParvscsphsakgtcpppaSGGWLrlkAVVGYSGNGRanmVWRPDTGFFAYTcgrlvvvedlhs 1145
Cdd:COG2319 207 ---------VRSVAFSP------------------DGKLL---ASGSADGTVR---LWDLATGKLLRT------------ 241
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1146 gaqqhWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTlGDHD 1225
Cdd:COG2319 242 -----LTGHSGSVRSVAFSPDGRLLASGSADGT------VRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS-GSDD 309
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1226 GrTLALWGTATYDLVSS-TRLPEPVHGVAFNPwdageltcVGQgtvtfwllqqrgadislqvrrepvpeavgageltslc 1304
Cdd:COG2319 310 G-TVRLWDLATGKLLRTlTGHTGAVRSVAFSP--------DGK------------------------------------- 343
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304991 1305 ygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAV 1365
Cdd:COG2319 344 -----TLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSpdGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
2.41e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 2.41e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217304991 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1122-1364 |
4.05e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.70 E-value: 4.05e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1122 VWRPDTGFFAyTCG--RLVVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASgrssttAHCQIRVWDVSGGLCQHL 1198
Cdd:cd00200 58 AASADGTYLA-SGSsdKTIRLWDLETGECVRtLTGHTSYVSSVAFSPDGRILSSSS------RDKTIKVWDVETGKCLTT 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1199 IFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALW----GTATYDLVSSTRlpePVHGVAFNPwDAGEL-TCVGQGTVTFW 1273
Cdd:cd00200 131 LRGHTDWVNSVAFSPDGTFVAS-SSQDG-TIKLWdlrtGKCVATLTGHTG---EVNSVAFSP-DGEKLlSSSSDGTIKLW 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1274 LLQQRGADISLQVRREPVpeavgagelTSLCYGAPPLLYCGTSS-GQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSR 1350
Cdd:cd00200 205 DLSTGKCLGTLRGHENGV---------NSVAFSPDGYLLASGSEdGTIRVWDLRTGECVQTLSGHTNSVTSLAWSpdGKR 275
|
250
....*....|....
gi 2217304991 1351 LVSGSSTGRLRLWA 1364
Cdd:cd00200 276 LASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1424-1742 |
2.69e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 2.69e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1424 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1503
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1504 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1581
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1582 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1661
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 1662 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1739
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2217304991 1740 WEV 1742
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
789-997 |
1.12e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 67.75 E-value: 1.12e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 789 EHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADmvcpdapaSPSALAVSRDGRLLAfvgpsrct 867
Cdd:cd00200 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELlRTLKGHTG--------PVRDVAASADGTYLA-------- 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 868 vtvmgSASLDELLRV-DIGTLDLASS---RLDSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 942
Cdd:cd00200 68 -----SGSSDKTIRLwDLETGECVRTltgHTSYVSSVAFSPD--GRILSSSSRDKTIKVwDVETGKCLTTLRG-HTDWVN 139
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217304991 943 SLTLSEDARFLL------------IAAGRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVLA 997
Cdd:cd00200 140 SVAFSPDGTFVAsssqdgtiklwdLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSdgTIKLWDLST 208
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
413-542 |
1.47e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 61.20 E-value: 1.47e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 413 RFFLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDhhgRTMVV 492
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSD--GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED---GTIRV 245
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2217304991 493 aWgtgQVGLGGEVVVLaKAHTDFdVQAFRVTfFDETRMASCGQ-GSVRLWR 542
Cdd:cd00200 246 -W---DLRTGECVQTL-SGHTNS-VTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1422-1459 |
3.44e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.44e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304991 1422 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1459
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1422-1459 |
6.16e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.64 E-value: 6.16e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304991 1422 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1459
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
746-929 |
3.18e-04 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 44.63 E-value: 3.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 746 YDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCAD 825
Cdd:COG4257 10 YPVPAPGSGPRDVAVDPDGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDGNLWFTDNGNNRIGRIDPKT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 826 PQwhVLRVAAdmvcPDAPASPSALAVSRDGRLLafvgpsrctVTVMGSaslDELLRVDIGT----LDLASSRLDSAMAVC 901
Cdd:COG4257 90 GE--ITTFAL----PGGGSNPHGIAFDPDGNLW---------FTDQGG---NRIGRLDPATgevtEFPLPTGGAGPYGIA 151
|
170 180
....*....|....*....|....*....
gi 2217304991 902 FGPAalGHLLV-STSSNRVVVLDAVSGRI 929
Cdd:COG4257 152 VDPD--GNLWVtDFGANAIGRIDPDTGTL 178
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.11e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.11e-04
10 20 30
....*....|....*....|....*....|..
gi 2217304991 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
958-994 |
9.13e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 9.13e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2217304991 958 GRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 994
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDdgTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
841-960 |
1.90e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.99 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 841 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGtldlassrlDSAMAVCFGPAAlGHLLVS-TSSNR- 918
Cdd:COG3391 107 PVGGGPRGLAVDPDGGRLYVADSGNGRVSVIDTATGKVVATIPVG---------AGPHGIAVDPDG-KRLYVAnSGSNTv 176
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 2217304991 919 ---VVVLDAVSGRIIRELP-GVHPEpcpSLTLSEDARFLLIAAGRT 960
Cdd:COG3391 177 sviVSVIDTATGKVVATIPvGGGPV---GVAVSPDGRRLYVANRGS 219
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
667-745 |
3.21e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 41.83 E-value: 3.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304991 667 HEGPVSSVCVSPDGLRVLSATSSGHLGFLD----TLSRVYHMLArshTAPVLALAMEQRRGQLATVSQDRTVRIWDLATL 742
Cdd:cd22857 222 GETPIKAVAEDPDGHTVYVGDTSGDLASIDlrtgKLLGCFKGKC---GGSIRSIARHPELPLIASCGLDRYLRIWDTETR 298
|
...
gi 2217304991 743 QQL 745
Cdd:cd22857 299 QLL 301
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
958-994 |
3.89e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 3.89e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2217304991 958 GRTIKVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 994
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDdgTVKVWD 39
|
|
|