|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
603-1013 |
7.77e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.61 E-value: 7.77e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 603 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 682
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 683 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 762
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 763 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 841
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 842 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 917
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 918 RVV-VLDAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFS 995
Cdd:COG2319 310 GTVrLWDLATGKLLRT-----LTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFS 381
|
410 420
....*....|....*....|
gi 2217305001 996 PDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 382 PDGRTLASGSAdgTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1153-1568 |
1.25e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 155.84 E-value: 1.25e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1153 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1232
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1233 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1311
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1312 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1389
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1390 RC-KGSGARSSSVfmehELVLDGA-VVSASFDDSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHC 1467
Cdd:COG2319 240 RTlTGHSGSVRSV----AFSPDGRlLASGSADGTV----------RLW--DLATGELLRTLTGHSGGVNSVAFSPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1468 ATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTT 1547
Cdd:COG2319 304 ASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTS 377
|
410 420
....*....|....*....|.
gi 2217305001 1548 VAFSTDGQTVLSGDKDGLVAV 1568
Cdd:COG2319 378 VAFSPDGRTLASGSADGTVRL 398
|
|
| CFA20_dom |
pfam05018 |
CFA20 domain; This domain is characteriztic of cilia- and flagella-associated protein 20 ... |
13-190 |
2.77e-34 |
|
CFA20 domain; This domain is characteriztic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.77e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2217305001 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-1006 |
9.16e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 117.82 E-value: 9.16e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRestvfQ 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLG-----T 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 938 LPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1006
Cdd:cd00200 215 LRG-HENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1316-1622 |
1.97e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 1.97e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1316 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1392
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1393 gsgarsssVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSE 1472
Cdd:cd00200 88 --------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1473 DGSVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFS 1551
Cdd:cd00200 156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFS 228
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217305001 1552 TDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1622
Cdd:cd00200 229 PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
5.71e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.39 E-value: 5.71e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217305001 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
3.38e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.38e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217305001 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1442-1479 |
3.67e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.00 E-value: 3.67e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217305001 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1442-1479 |
6.71e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 6.71e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217305001 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.66e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.66e-04
10 20 30
....*....|....*....|....*....|..
gi 2217305001 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
603-1013 |
7.77e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.61 E-value: 7.77e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 603 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 682
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 683 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 762
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 763 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 841
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 842 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 917
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 918 RVV-VLDAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFS 995
Cdd:COG2319 310 GTVrLWDLATGKLLRT-----LTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFS 381
|
410 420
....*....|....*....|
gi 2217305001 996 PDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 382 PDGRTLASGSAdgTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1153-1568 |
1.25e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 155.84 E-value: 1.25e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1153 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1232
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1233 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1311
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1312 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1389
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1390 RC-KGSGARSSSVfmehELVLDGA-VVSASFDDSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHC 1467
Cdd:COG2319 240 RTlTGHSGSVRSV----AFSPDGRlLASGSADGTV----------RLW--DLATGELLRTLTGHSGGVNSVAFSPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1468 ATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTT 1547
Cdd:COG2319 304 ASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTS 377
|
410 420
....*....|....*....|.
gi 2217305001 1548 VAFSTDGQTVLSGDKDGLVAV 1568
Cdd:COG2319 378 VAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1186-1621 |
1.23e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 146.98 E-value: 1.23e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1186 QVLASASGRSSTTAHCQIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhDGRTLALWGTATYDLVSSTRLPE 1265
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAG-DLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1266 PVHGVAFNPWDAGELTCVGQGTVTFWLLQQRGADISLQVRREPVpeavgagelTSLCYgAP--PLLYCGTSSGQVCVWDT 1343
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV---------RSVAF-SPdgKTLASGSADGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1344 RAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELR-CKGSGARSSSVfmehELVLDGA-VVSASFD 1419
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLRtLTGHTGAVRSV----AFSPDGKlLASGSAD 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1420 DSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLC 1499
Cdd:COG2319 226 GTV----------RLW--DLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNS 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1500 LAWSPpccgrpEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRV 1579
Cdd:COG2319 294 VAFSP------DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 2217305001 1580 LSDHQGaPISTICVTckecedlgveGTDLWLA-ASGDQRVSVW 1621
Cdd:COG2319 368 LTGHTG-AVTSVAFS----------PDGRTLAsGSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
634-1013 |
3.92e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 139.66 E-value: 3.92e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 634 SLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAP 712
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLgLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 713 VLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTC 792
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 793 HRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvm 871
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAV--------RSVAFSPDGKLLA------------ 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 872 gSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRestvfqLPGVH 942
Cdd:COG2319 221 -SGSADGTVRL----WDLATGKLlrtltghsGSVRSVAFSPD--GRLLASGSADGTVRLwDLATGELLR------TLTGH 287
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217305001 943 PEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGsDDGTVRLWDLATGKL--LRTLTGHTGAVRSVAFSPDGKTLASGSDdgTVRLWDL 359
|
|
| CFA20_dom |
pfam05018 |
CFA20 domain; This domain is characteriztic of cilia- and flagella-associated protein 20 ... |
13-190 |
2.77e-34 |
|
CFA20 domain; This domain is characteriztic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.77e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2217305001 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1156-1483 |
1.70e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.96 E-value: 1.70e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1156 VVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDR 1234
Cdd:COG2319 102 VRLWDLATGLLLRtLTGHTGAVRSVAFSPDGKTLASGSADGT------VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGK 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1235 LLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwdAGEL--TCVGQGTVTFWllqqrgadiSLQVRREPVPE 1311
Cdd:COG2319 176 LLAS-GSDDG-TVRLWDLATGKLLRTLTGHTgAVRSVAFSP--DGKLlaSGSADGTVRLW---------DLATGKLLRTL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1312 AVGAGELTSLCYgAP--PLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVS 1387
Cdd:COG2319 243 TGHSGSVRSVAF-SPdgRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSpdGKLLASGSDDGTVRLWDLATGK 321
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1388 ELRC-KGSGARSSSVfmehELVLDGA-VVSASFDDSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGES 1465
Cdd:COG2319 322 LLRTlTGHTGAVRSV----AFSPDGKtLASGSDDGTV----------RLW--DLATGELLRTLTGHTGAVTSVAFSPDGR 385
|
330
....*....|....*...
gi 2217305001 1466 HCATCSEDGSVRVWALAS 1483
Cdd:COG2319 386 TLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
598-973 |
1.12e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 129.65 E-value: 1.12e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 598 QRMVVRHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCV 676
Cdd:COG2319 49 ARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAF 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 677 SPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPC 756
Cdd:COG2319 129 SPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 757 AVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQ-WHVLRVAA 835
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHS 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 836 DMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGTLDLA---SSRLDSAMAVCFGPAalGHLL 911
Cdd:COG2319 289 GGV--------NSVAFSPDGKLLA-------------SGSDDGTVRLwDLATGKLLrtlTGHTGAVRSVAFSPD--GKTL 345
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217305001 912 VSTSSNRVVVL-DAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYAT 973
Cdd:COG2319 346 ASGSDDGTVRLwDLATGELLRT-----LTG-HTGAVTSVAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
386-818 |
5.50e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.33 E-value: 5.50e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 386 LWTPDGAAVVYPCHAVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAQARAPSVMRLWDFQTGRCLCLFRSPMHV 465
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 466 VCSLSFSDSGALLCGVGKDHHgrtmVVAWGTgqvgLGGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRL 543
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGT----VRLWDL----ATGLLLRTLTGHTG-AVRS--VAFSpDGKTLASGSAdGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 544 RGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqRMVVRHARRLLPTRTpggphpqkqt 623
Cdd:COG2319 150 ATGKL----LRTLTGHSGAVTSVAF--SPDG------KLLASGSDDGTV------RLWDLATGKLLRTLT---------- 201
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 624 fssGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSR 700
Cdd:COG2319 202 ---GHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 701 VYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFS 780
Cdd:COG2319 279 ELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD 358
|
410 420 430
....*....|....*....|....*....|....*...
gi 2217305001 781 LEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSL 818
Cdd:COG2319 359 LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-1006 |
9.16e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 117.82 E-value: 9.16e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRestvfQ 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLG-----T 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 938 LPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1006
Cdd:cd00200 215 LRG-HENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1316-1622 |
1.97e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 1.97e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1316 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1392
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1393 gsgarsssVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSE 1472
Cdd:cd00200 88 --------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1473 DGSVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFS 1551
Cdd:cd00200 156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFS 228
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217305001 1552 TDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1622
Cdd:cd00200 229 PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
5.71e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.39 E-value: 5.71e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217305001 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
304-741 |
7.82e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.01 E-value: 7.82e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 304 VSLSQERSDASNADGPGFHSLEPWAQLEASDIHTAAAGTHVLTHESAEVPVARTGSCEGFLPDPVLRLKGVIGFGGHGTR 383
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 384 QAL-WTPDGAAVVYPCHAVIVVLL-VDTGEQRFFL-GHTDKVSALALDGSSSLLASAQAraPSVMRLWDFQTGRCLCLFR 460
Cdd:COG2319 82 LSVaFSPDGRLLASASADGTVRLWdLATGLLLRTLtGHTGAVRSVAFSPDGKTLASGSA--DGTVRLWDLATGKLLRTLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 461 SPMHVVCSLSFSDSGallcgvgkdhhgRTMVVAWGTGQVGL----GGEVVVLAKAHTDFdvqAFRVTF-FDETRMASCGQ 535
Cdd:COG2319 160 GHSGAVTSVAFSPDG------------KLLASGSDDGTVRLwdlaTGKLLRTLTGHTGA---VRSVAFsPDGKLLASGSA 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 536 -GSVRLWRLRGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTP 614
Cdd:COG2319 225 dGTVRLWDLATGKL----LRTLTGHSGSVRSVAF--SPDG------RLLASGSADGTV--------------RLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 615 ggphpQKQTFSSGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGH 691
Cdd:COG2319 279 -----ELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGT 353
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 2217305001 692 LGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLAT 741
Cdd:COG2319 354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1167-1528 |
5.62e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 109.73 E-value: 5.62e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1167 QHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhdGRT 1246
Cdd:cd00200 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGT------IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1247 LALWGTATYDLVSS-TRLPEPVHGVAFNPwdageltcvgqgtvtfwllqqrgadislqvrrepvpeavgageltslcygA 1325
Cdd:cd00200 75 IRLWDLETGECVRTlTGHTSYVSSVAFSP--------------------------------------------------D 104
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1326 PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFSGSR--LVSGSSTGRLRLWAVgavSELRCKGsgarsssVFM 1403
Cdd:cd00200 105 GRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGtfVASSSQDGTIKLWDL---RTGKCVA-------TLT 174
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1404 EHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALAS 1483
Cdd:cd00200 175 GHT----GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT 250
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 2217305001 1484 MELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFS 1528
Cdd:cd00200 251 GECVQTLSGHTNSVTSLAWS------PDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
626-882 |
1.67e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 1.67e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 626 SGPGIAISSLSVSPAMcAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHM 704
Cdd:cd00200 51 TGPVRDVAASADGTYL-ASGSSDKTIRLWDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 705 LARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 784
Cdd:cd00200 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 785 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgps 864
Cdd:cd00200 210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIR-------VWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLA----- 277
|
250
....*....|....*...
gi 2217305001 865 rctvtvmgSASLDELLRV 882
Cdd:cd00200 278 --------SGSADGTIRI 287
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
3.38e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.38e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217305001 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1140-1382 |
5.52e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.32 E-value: 5.52e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1140 VWRPDTGFFAyTCG--RLVVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASgrssttAHCQIRVWDVSGGLCQHL 1216
Cdd:cd00200 58 AASADGTYLA-SGSsdKTIRLWDLETGECVRtLTGHTSYVSSVAFSPDGRILSSSS------RDKTIKVWDVETGKCLTT 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1217 IFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALW----GTATYDLVSSTRlpePVHGVAFNPwDAGEL-TCVGQGTVTFW 1291
Cdd:cd00200 131 LRGHTDWVNSVAFSPDGTFVAS-SSQDG-TIKLWdlrtGKCVATLTGHTG---EVNSVAFSP-DGEKLlSSSSDGTIKLW 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1292 LLQQRGADISLQVRREPVpeavgagelTSLCYGAPPLLYCGTSS-GQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSR 1368
Cdd:cd00200 205 DLSTGKCLGTLRGHENGV---------NSVAFSPDGYLLASGSEdGTIRVWDLRTGECVQTLSGHTNSVTSLAWSpdGKR 275
|
250
....*....|....
gi 2217305001 1369 LVSGSSTGRLRLWA 1382
Cdd:cd00200 276 LASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1444-1621 |
1.57e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 78.92 E-value: 1.57e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1444 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1523
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 1524 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1601
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180
....*....|....*....|
gi 2217305001 1602 gvEGTDLWLAASGDQRVSVW 1621
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLW 162
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
789-1015 |
6.80e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 68.13 E-value: 6.80e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 789 EHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADmvcpdapaSPSALAVSRDGRLLAfvgpsrct 867
Cdd:cd00200 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELlRTLKGHTG--------PVRDVAASADGTYLA-------- 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 868 vtvmgSASLDELLRV-DIGTLDLASS---RLDSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIREstvfqLPGvH 942
Cdd:cd00200 68 -----SGSSDKTIRLwDLETGECVRTltgHTSYVSSVAFSPD--GRILSSSSRDKTIKVwDVETGKCLTT-----LRG-H 134
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217305001 943 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATqASPGpQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVLA 1015
Cdd:cd00200 135 TDWVNSVAFSPDGTFVASSSQdGTIKLWDLRT-GKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSdgTIKLWDLST 208
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
413-542 |
1.70e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 60.81 E-value: 1.70e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 413 RFFLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDhhgRTMVV 492
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSD--GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED---GTIRV 245
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2217305001 493 aWgtgQVGLGGEVVVLaKAHTDFdVQAFRVTfFDETRMASCGQ-GSVRLWR 542
Cdd:cd00200 246 -W---DLRTGECVQTL-SGHTNS-VTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1442-1479 |
3.67e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.00 E-value: 3.67e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217305001 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1442-1479 |
6.71e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 6.71e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217305001 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
746-939 |
5.91e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 46.94 E-value: 5.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 746 YDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCAD 825
Cdd:COG4257 10 YPVPAPGSGPRDVAVDPDGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDGNLWFTDNGNNRIGRIDPKT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 826 PQwhVLRVAAdmvcPDAPASPSALAVSRDGRLLafvgpsrctVTVMGSaslDELLRVDIGT----LDLASSRLDSAMAVC 901
Cdd:COG4257 90 GE--ITTFAL----PGGGSNPHGIAFDPDGNLW---------FTDQGG---NRIGRLDPATgevtEFPLPTGGAGPYGIA 151
|
170 180 190
....*....|....*....|....*....|....*....
gi 2217305001 902 FGPAalGHLLV-STSSNRVVVLDAVSGRIiresTVFQLP 939
Cdd:COG4257 152 VDPD--GNLWVtDFGANAIGRIDPDTGTL----TEYALP 184
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.66e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.66e-04
10 20 30
....*....|....*....|....*....|..
gi 2217305001 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
841-973 |
2.53e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.60 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 841 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGtldlassrlDSAMAVCFGPAAlGHLLVS-TSSNR- 918
Cdd:COG3391 107 PVGGGPRGLAVDPDGGRLYVADSGNGRVSVIDTATGKVVATIPVG---------AGPHGIAVDPDG-KRLYVAnSGSNTv 176
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217305001 919 ---VVVLDAVSGRIIRESTVFQLPGvhpepcpSLTLSEDARFLLIA---------AGRTIKVWDYAT 973
Cdd:COG3391 177 sviVSVIDTATGKVVATIPVGGGPV-------GVAVSPDGRRLYVAnrgsntsngGSNTVSVIDLAT 236
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
667-745 |
3.09e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double ring, AAA-ATPase, that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness, by inducing nuclear beta-catenin accumulation and driving downstream Wnt-responsive genes expression. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain, and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 41.83 E-value: 3.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 667 HEGPVSSVCVSPDGLRVLSATSSGHLGFLD----TLSRVYHMLArshTAPVLALAMEQRRGQLATVSQDRTVRIWDLATL 742
Cdd:cd22857 222 GETPIKAVAEDPDGHTVYVGDTSGDLASIDlrtgKLLGCFKGKC---GGSIRSIARHPELPLIASCGLDRYLRIWDTETR 298
|
...
gi 2217305001 743 QQL 745
Cdd:cd22857 299 QLL 301
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
980-1012 |
4.62e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.62e-03
10 20 30
....*....|....*....|....*....|....*
gi 2217305001 980 QVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 1012
Cdd:smart00320 6 KTLKGHTGPVTSVAFSPDGKYLASGSDdgTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
830-1001 |
9.84e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 39.68 E-value: 9.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 830 VLRVAADMVCPDAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAALGH 909
Cdd:COG3391 2 LVASSLLVAVLLAVLALAALAVAVAALGLGGGGPLLAAASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADGRR 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217305001 910 LLVS-TSSNRVVVLDAVSGRIIRESTVfqlpGVHPEpcpSLTLSEDARFLLIAAGR--TIKVWDYATQASPGpQVYIGhS 986
Cdd:COG3391 82 LYVAnSGSGRVSVIDLATGKVVATIPV----GGGPR---GLAVDPDGGRLYVADSGngRVSVIDTATGKVVA-TIPVG-A 152
|
170
....*....|....*
gi 2217305001 987 EPVqAVAFSPDQQQV 1001
Cdd:COG3391 153 GPH-GIAVDPDGKRL 166
|
|
|