|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
603-1013 |
5.21e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.00 E-value: 5.21e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 603 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 682
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 683 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 762
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 763 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 841
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 842 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 917
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 918 RVV-VLDAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFS 995
Cdd:COG2319 310 GTVrLWDLATGKLLRT-----LTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFS 381
|
410 420
....*....|....*....|
gi 2462547867 996 PDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 382 PDGRTLASGSAdgTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1153-1568 |
7.68e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.61 E-value: 7.68e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1153 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1232
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1233 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1311
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1312 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1389
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1390 RC-KGSGARSSSVfmehELVLDGA-VVSASFDDSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHC 1467
Cdd:COG2319 240 RTlTGHSGSVRSV----AFSPDGRlLASGSADGTV----------RLW--DLATGELLRTLTGHSGGVNSVAFSPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1468 ATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTT 1547
Cdd:COG2319 304 ASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTS 377
|
410 420
....*....|....*....|.
gi 2462547867 1548 VAFSTDGQTVLSGDKDGLVAV 1568
Cdd:COG2319 378 VAFSPDGRTLASGSADGTVRL 398
|
|
| CFA20_dom |
pfam05018 |
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ... |
13-190 |
2.77e-34 |
|
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.77e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2462547867 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-1006 |
6.78e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.59 E-value: 6.78e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRestvfQ 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLG-----T 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 938 LPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1006
Cdd:cd00200 215 LRG-HENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1316-1622 |
1.42e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 1.42e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1316 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1392
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1393 gsgarsssVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSE 1472
Cdd:cd00200 88 --------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1473 DGSVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFS 1551
Cdd:cd00200 156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFS 228
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462547867 1552 TDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1622
Cdd:cd00200 229 PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
3.90e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.16 E-value: 3.90e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462547867 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
2.41e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 2.41e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2462547867 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1444-1762 |
2.80e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 2.80e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1444 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1523
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1524 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1601
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1602 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1681
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1682 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1759
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2462547867 1760 WEV 1762
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1442-1479 |
3.48e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.48e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2462547867 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1442-1479 |
6.23e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.64 E-value: 6.23e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2462547867 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.20e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.20e-04
10 20 30
....*....|....*....|....*....|..
gi 2462547867 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|
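The per-hit summary lines and alignment rows in the listing above follow a regular layout, so they can be checked mechanically. The following is a minimal Python sketch, assuming the plain-text layout shown here (a `Pssm-ID: ... E-value: ...` summary line, and alignment rows of the form label, start coordinate, residues, end coordinate). The function names `parse_hit_summary`, `parse_alignment_row` and `row_is_consistent` are hypothetical helpers introduced for illustration; they are not part of any NCBI tool.

```python
import re

# Matches a summary line such as:
# "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.00 E-value: 5.21e-41"
# The bracketed hit type is optional (the CFA20_dom hit has none).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # None when no bracketed type is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def parse_alignment_row(line):
    """Split an alignment row into (start, residues, end).

    Assumes the last three whitespace-separated fields are the start
    coordinate, the residue string, and the end coordinate.
    """
    parts = line.split()
    return int(parts[-3]), parts[-2], int(parts[-1])


def row_is_consistent(line):
    """Check that the non-gap residue count equals end - start + 1.

    Lowercase letters are counted as residues; only '-' is treated as a gap.
    """
    start, residues, end = parse_alignment_row(line)
    n_residues = sum(1 for ch in residues if ch != "-")
    return n_residues == end - start + 1
```

For example, `row_is_consistent("gi 2462547867 996 PDQQQVLSAGD--AVFLWDV 1013")` compares the 18 non-gap residues against the interval 996-1013. A failed check would suggest a row damaged during extraction rather than a problem with the underlying alignment.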
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
603-1013 |
5.21e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.00 E-value: 5.21e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 603 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 682
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 683 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 762
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 763 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 841
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 842 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 917
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 918 RVV-VLDAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFS 995
Cdd:COG2319 310 GTVrLWDLATGKLLRT-----LTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFS 381
|
410 420
....*....|....*....|
gi 2462547867 996 PDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 382 PDGRTLASGSAdgTVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1153-1568 |
7.68e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.61 E-value: 7.68e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1153 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1232
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1233 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1311
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1312 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1389
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1390 RC-KGSGARSSSVfmehELVLDGA-VVSASFDDSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHC 1467
Cdd:COG2319 240 RTlTGHSGSVRSV----AFSPDGRlLASGSADGTV----------RLW--DLATGELLRTLTGHSGGVNSVAFSPDGKLL 303
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1468 ATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTT 1547
Cdd:COG2319 304 ASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTS 377
|
410 420
....*....|....*....|.
gi 2462547867 1548 VAFSTDGQTVLSGDKDGLVAV 1568
Cdd:COG2319 378 VAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1186-1621 |
7.31e-38 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 147.75 E-value: 7.31e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1186 QVLASASGRSSTTAHCQIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhDGRTLALWGTATYDLVSSTRLPE 1265
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAG-DLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1266 PVHGVAFNPWDAGELTCVGQGTVTFWLLQQRGADISLQVRREPVpeavgagelTSLCYgAP--PLLYCGTSSGQVCVWDT 1343
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV---------RSVAF-SPdgKTLASGSADGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1344 RAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELR-CKGSGARSSSVfmehELVLDGA-VVSASFD 1419
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLRtLTGHTGAVRSV----AFSPDGKlLASGSAD 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1420 DSVdmgvvgttagTLWfvSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLC 1499
Cdd:COG2319 226 GTV----------RLW--DLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNS 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1500 LAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRV 1579
Cdd:COG2319 294 VAFS------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRT 367
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 2462547867 1580 LSDHQGaPISTICVTckecedlgveGTDLWLA-ASGDQRVSVW 1621
Cdd:COG2319 368 LTGHTG-AVTSVAFS----------PDGRTLAsGSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
634-1013 |
2.49e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 140.43 E-value: 2.49e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 634 SLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAP 712
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLgLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 713 VLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTC 792
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 793 HRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvm 871
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAV--------RSVAFSPDGKLLA------------ 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 872 gSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRestvfqLPGVH 942
Cdd:COG2319 221 -SGSADGTVRL----WDLATGKLlrtltghsGSVRSVAFSPD--GRLLASGSADGTVRLwDLATGELLR------TLTGH 287
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462547867 943 PEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDV 1013
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGsDDGTVRLWDLATGKL--LRTLTGHTGAVRSVAFSPDGKTLASGSDdgTVRLWDL 359
|
|
| CFA20_dom |
pfam05018 |
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ... |
13-190 |
2.77e-34 |
|
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.77e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2462547867 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
598-973 |
8.72e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.03 E-value: 8.72e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 598 QRMVVRHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCV 676
Cdd:COG2319 49 ARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAF 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 677 SPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPC 756
Cdd:COG2319 129 SPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 757 AVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQ-WHVLRVAA 835
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHS 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 836 DMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGTLDLA---SSRLDSAMAVCFGPAalGHLL 911
Cdd:COG2319 289 GGV--------NSVAFSPDGKLLA-------------SGSDDGTVRLwDLATGKLLrtlTGHTGAVRSVAFSPD--GKTL 345
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462547867 912 VSTSSNRVVVL-DAVSGRIIREstvfqLPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYAT 973
Cdd:COG2319 346 ASGSDDGTVRLwDLATGELLRT-----LTG-HTGAVTSVAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
386-818 |
4.08e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.11 E-value: 4.08e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 386 LWTPDGAAVVYPCHAVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAQARAPSVMRLWDFQTGRCLCLFRSPMHV 465
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 466 VCSLSFSDSGALLCGVGKDHHgrtmVVAWGTgqvgLGGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRL 543
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGT----VRLWDL----ATGLLLRTLTGHTG-AVRS--VAFSpDGKTLASGSAdGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 544 RGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqRMVVRHARRLLPTRTpggphpqkqt 623
Cdd:COG2319 150 ATGKL----LRTLTGHSGAVTSVAF--SPDG------KLLASGSDDGTV------RLWDLATGKLLRTLT---------- 201
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 624 fssGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSR 700
Cdd:COG2319 202 ---GHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 701 VYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFS 780
Cdd:COG2319 279 ELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD 358
|
410 420 430
....*....|....*....|....*....|....*...
gi 2462547867 781 LEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSL 818
Cdd:COG2319 359 LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
707-1006 |
6.78e-29 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 118.59 E-value: 6.78e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 786
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 787 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 866
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 867 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRestvfQ 937
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLG-----T 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 938 LPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1006
Cdd:cd00200 215 LRG-HENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1316-1622 |
1.42e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.74 E-value: 1.42e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1316 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1392
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1393 gsgarsssVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSE 1472
Cdd:cd00200 88 --------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1473 DGSVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFS 1551
Cdd:cd00200 156 DGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFS 228
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462547867 1552 TDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1622
Cdd:cd00200 229 PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
400-777 |
3.90e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.16 E-value: 3.90e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 400 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 479
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 480 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 557
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 558 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 637
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 638 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 714
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462547867 715 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 777
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
304-741 |
5.05e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.78 E-value: 5.05e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 304 VSLSQERSDASNADGPGFHSLEPWAQLEASDIHTAAAGTHVLTHESAEVPVARTGSCEGFLPDPVLRLKGVIGFGGHGTR 383
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 384 QAL-WTPDGAAVVYPCHAVIVVLL-VDTGEQRFFL-GHTDKVSALALDGSSSLLASAQAraPSVMRLWDFQTGRCLCLFR 460
Cdd:COG2319 82 LSVaFSPDGRLLASASADGTVRLWdLATGLLLRTLtGHTGAVRSVAFSPDGKTLASGSA--DGTVRLWDLATGKLLRTLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 461 SPMHVVCSLSFSDSGallcgvgkdhhgRTMVVAWGTGQVGL----GGEVVVLAKAHTDFdvqAFRVTF-FDETRMASCGQ 535
Cdd:COG2319 160 GHSGAVTSVAFSPDG------------KLLASGSDDGTVRLwdlaTGKLLRTLTGHTGA---VRSVAFsPDGKLLASGSA 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 536 -GSVRLWRLRGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTP 614
Cdd:COG2319 225 dGTVRLWDLATGKL----LRTLTGHSGSVRSVAF--SPDG------RLLASGSADGTV--------------RLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 615 ggphpQKQTFSSGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGH 691
Cdd:COG2319 279 -----ELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGT 353
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 2462547867 692 LGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLAT 741
Cdd:COG2319 354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1167-1528 |
3.97e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 110.50 E-value: 3.97e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1167 QHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhdGRT 1246
Cdd:cd00200 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGT------IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1247 LALW----GTATYDLVSSTrlpEPVHGVAFNPwdageltcvgqgtvtfwllqqrgadislqvrrepvpeavgageltslc 1322
Cdd:cd00200 75 IRLWdletGECVRTLTGHT---SYVSSVAFSP------------------------------------------------ 103
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1323 ygAPPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFSGSR--LVSGSSTGRLRLWAVgavSELRCKGsgarsss 1400
Cdd:cd00200 104 --DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGtfVASSSQDGTIKLWDL---RTGKCVA------- 171
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1401 VFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWA 1480
Cdd:cd00200 172 TLTGHT----GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWD 247
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 2462547867 1481 LASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFS 1528
Cdd:cd00200 248 LRTGECVQTLSGHTNSVTSLAWS------PDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
626-882 |
1.31e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.96 E-value: 1.31e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 626 SGPGIAISSLSVSPAMcAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHM 704
Cdd:cd00200 51 TGPVRDVAASADGTYL-ASGSSDKTIRLWDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 705 LARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 784
Cdd:cd00200 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 785 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgps 864
Cdd:cd00200 210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIR-------VWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLA----- 277
|
250
....*....|....*...
gi 2462547867 865 rctvtvmgSASLDELLRV 882
Cdd:cd00200 278 --------SGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1341-1764 |
3.40e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 110.39 E-value: 3.40e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1341 WDTRAGRCFLSWEADDGGIGLLLFSGSRLVSGSSTGRLRLWAVGAVSELRckgsgarsssvfmeHELVLDGAVVSASFDD 1420
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA--------------TLLGHTAAVLSVAFSP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1421 SVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCL 1500
Cdd:COG2319 89 DGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1501 AWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVL 1580
Cdd:COG2319 169 AFS------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTL 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1581 SDHQGAPISticvtckecedLGVEGTDLWLA-ASGDQRVSVWasdwlrnhcelvDWLSFPMPATTETQGHLPPSLaAFCP 1659
Cdd:COG2319 243 TGHSGSVRS-----------VAFSPDGRLLAsGSADGTVRLW------------DLATGELLRTLTGHSGGVNSV-AFSP 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1660 wDGALLMYVGPGvyKEVIIYNLCQKQVVEKIPLPFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAV 1737
Cdd:COG2319 299 -DGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAvRSVAFSPDGKTLASGSDDGTVRLWDLATGELlRTLTGHTGAV 375
|
410 420
....*....|....*....|....*...
gi 2462547867 1738 HLCRFTPSARLLFTAAR-NEILVWEVPG 1764
Cdd:COG2319 376 TSVAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
415-738 |
2.41e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 2.41e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 415 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 495 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 572
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 573 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 650
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 651 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 728
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2462547867 729 SQDRTVRIWD 738
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1140-1382 |
4.06e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.70 E-value: 4.06e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1140 VWRPDTGFFAyTCG--RLVVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASgrssttAHCQIRVWDVSGGLCQHL 1216
Cdd:cd00200 58 AASADGTYLA-SGSsdKTIRLWDLETGECVRtLTGHTSYVSSVAFSPDGRILSSSS------RDKTIKVWDVETGKCLTT 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1217 IFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALW----GTATYDLVSSTRlpePVHGVAFNPwDAGEL-TCVGQGTVTFW 1291
Cdd:cd00200 131 LRGHTDWVNSVAFSPDGTFVAS-SSQDG-TIKLWdlrtGKCVATLTGHTG---EVNSVAFSP-DGEKLlSSSSDGTIKLW 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1292 LLQQRGADISLQVRREPVpeavgagelTSLCYGAPPLLYCGTSS-GQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSR 1368
Cdd:cd00200 205 DLSTGKCLGTLRGHENGV---------NSVAFSPDGYLLASGSEdGTIRVWDLRTGECVQTLSGHTNSVTSLAWSpdGKR 275
|
250
....*....|....
gi 2462547867 1369 LVSGSSTGRLRLWA 1382
Cdd:cd00200 276 LASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1444-1762 |
2.80e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 2.80e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1444 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1523
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1524 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1601
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1602 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1681
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 1682 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1759
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2462547867 1760 WEV 1762
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
789-1015 |
5.58e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 68.52 E-value: 5.58e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 789 EHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADmvcpdapaSPSALAVSRDGRLLAfvgpsrct 867
Cdd:cd00200 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELlRTLKGHTG--------PVRDVAASADGTYLA-------- 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 868 vtvmgSASLDELLRV-DIGTLDLASS---RLDSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIREstvfqLPGvH 942
Cdd:cd00200 68 -----SGSSDKTIRLwDLETGECVRTltgHTSYVSSVAFSPD--GRILSSSSRDKTIKVwDVETGKCLTT-----LRG-H 134
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462547867 943 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATqASPGpQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVLA 1015
Cdd:cd00200 135 TDWVNSVAFSPDGTFVASSSQdGTIKLWDLRT-GKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSdgTIKLWDLST 208
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
413-542 |
1.50e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 61.20 E-value: 1.50e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 413 RFFLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDhhgRTMVV 492
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSD--GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED---GTIRV 245
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2462547867 493 aWgtgQVGLGGEVVVLaKAHTDFdVQAFRVTfFDETRMASCGQ-GSVRLWR 542
Cdd:cd00200 246 -W---DLRTGECVQTL-SGHTNS-VTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1442-1479 |
3.48e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.48e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2462547867 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1442-1479 |
6.23e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.64 E-value: 6.23e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2462547867 1442 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1479
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
746-939 |
5.83e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 46.94 E-value: 5.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 746 YDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCAD 825
Cdd:COG4257 10 YPVPAPGSGPRDVAVDPDGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDGNLWFTDNGNNRIGRIDPKT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 826 PQwhVLRVAAdmvcPDAPASPSALAVSRDGRLLafvgpsrctVTVMGSaslDELLRVDIGT----LDLASSRLDSAMAVC 901
Cdd:COG4257 90 GE--ITTFAL----PGGGSNPHGIAFDPDGNLW---------FTDQGG---NRIGRLDPATgevtEFPLPTGGAGPYGIA 151
|
170 180 190
....*....|....*....|....*....|....*....
gi 2462547867 902 FGPAalGHLLV-STSSNRVVVLDAVSGRIiresTVFQLP 939
Cdd:COG4257 152 VDPD--GNLWVtDFGANAIGRIDPDTGTL----TEYALP 184
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
707-738 |
8.20e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.20e-04
10 20 30
....*....|....*....|....*....|..
gi 2462547867 707 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 738
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
841-973 |
2.59e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.60 E-value: 2.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 841 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGtldlassrlDSAMAVCFGPAAlGHLLVS-TSSNR- 918
Cdd:COG3391 107 PVGGGPRGLAVDPDGGRLYVADSGNGRVSVIDTATGKVVATIPVG---------AGPHGIAVDPDG-KRLYVAnSGSNTv 176
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462547867 919 ---VVVLDAVSGRIIRESTVFQLPGvhpepcpSLTLSEDARFLLIA---------AGRTIKVWDYAT 973
Cdd:COG3391 177 sviVSVIDTATGKVVATIPVGGGPV-------GVAVSPDGRRLYVAnrgsntsngGSNTVSVIDLAT 236
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
667-745 |
3.28e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step in the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance, and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 41.83 E-value: 3.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462547867 667 HEGPVSSVCVSPDGLRVLSATSSGHLGFLD----TLSRVYHMLArshTAPVLALAMEQRRGQLATVSQDRTVRIWDLATL 742
Cdd:cd22857 222 GETPIKAVAEDPDGHTVYVGDTSGDLASIDlrtgKLLGCFKGKC---GGSIRSIARHPELPLIASCGLDRYLRIWDTETR 298
|
...
gi 2462547867 743 QQL 745
Cdd:cd22857 299 QLL 301
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
980-1012 |
4.42e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.42e-03
10 20 30
....*....|....*....|....*....|....*
gi 2462547867 980 QVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 1012
Cdd:smart00320 6 KTLKGHTGPVTSVAFSPDGKYLASGSDdgTIKLWD 40
|
|
|