Conserved domains on  [gi|2217304987|ref|XP_047289707|]

WD repeat-containing protein 90 isoform X31 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
602-1007 6.52e-43

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 6.52e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  602 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 681
Cdd:COG2319     15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  682 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 761
Cdd:COG2319     93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  762 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 840
Cdd:COG2319    173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  841 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 916
Cdd:COG2319    250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  917 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQ 994
Cdd:COG2319    310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFSPDGRT 386
                          410
                   ....*....|....*
gi 2217304987  995 VLSAGD--AVFLWDV 1007
Cdd:COG2319    387 LASGSAdgTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
1137-1550 9.70e-41

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 156.22  E-value: 9.70e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1137 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1216
Cdd:COG2319     58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1217 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1295
Cdd:COG2319    132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1296 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1373
Cdd:COG2319    176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1374 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1453
Cdd:COG2319    240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1454 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1533
Cdd:COG2319    308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
                          410
                   ....*....|....*..
gi 2217304987 1534 TDGQTVLSGDKDGLVAV 1550
Cdd:COG2319    382 PDGRTLASGSADGTVRL 398
CFA20_dom super family cl04888
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ...
13-190 2.76e-34

CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.


The actual alignment was detected with superfamily member pfam05018:

Pssm-ID: 461521  Cd Length: 185  Bit Score: 130.40  E-value: 2.76e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018    5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018   84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
                          170       180
                   ....*....|....*....|....*
gi 2217304987  170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018  145 GtnyvETVRVQIHANCRLRRIYFSD 169
WD40 COG2319
WD40 repeat [General function prediction only];
399-776 5.95e-27

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 115.39  E-value: 5.95e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  399 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 478
Cdd:COG2319     59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  479 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 556
Cdd:COG2319    137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  557 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 636
Cdd:COG2319    202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  637 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 713
Cdd:COG2319    255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987  714 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 776
Cdd:COG2319    335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1426-1744 3.53e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 83.92  E-value: 3.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1426 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1505
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1506 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1583
Cdd:cd00200     75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1584 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1663
Cdd:cd00200    145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1664 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1741
Cdd:cd00200    181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245

                   ...
gi 2217304987 1742 WEV 1744
Cdd:cd00200    246 WDL 248
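Each hit in the list above carries an interval line (for example `602-1007 6.52e-43`) and a `Pssm-ID` summary line with the model length, bit score, and E-value. A minimal sketch of pulling those fields out of the plain-text page follows. The function and class names (`parse_pssm_line`, `parse_interval_line`, `PssmSummary`) are hypothetical helpers written for this illustration and are not part of any NCBI tool; the regular expressions assume the exact layout shown on this page.

```python
import re
from typing import NamedTuple, Tuple


class PssmSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, copied from the lines on this page:
# "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 6.52e-43"
_PSSM_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)

# Assumed layout of the interval line: "602-1007 6.52e-43"
_INTERVAL_RE = re.compile(r"\s*(\d+)-(\d+)\s+(\S+)\s*")


def parse_pssm_line(line: str) -> PssmSummary:
    """Parse a Pssm-ID summary line into typed fields."""
    m = _PSSM_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return PssmSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,  # the tag is absent on some hits
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


def parse_interval_line(line: str) -> Tuple[int, int, float]:
    """Parse 'start-end e_value' into (start, end, e_value)."""
    m = _INTERVAL_RE.fullmatch(line)
    if m is None:
        raise ValueError(f"not an interval line: {line!r}")
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


if __name__ == "__main__":
    print(parse_pssm_line(
        "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
        "Bit Score: 162.77  E-value: 6.52e-43"
    ))
    print(parse_interval_line("602-1007 6.52e-43"))
```

Parsing the interval and summary lines separately keeps each pattern short, and raising `ValueError` on a mismatch makes it obvious when a line does not follow the assumed layout.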
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
602-1007 6.52e-43

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 6.52e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  602 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 681
Cdd:COG2319     15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  682 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 761
Cdd:COG2319     93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  762 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 840
Cdd:COG2319    173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  841 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 916
Cdd:COG2319    250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  917 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQ 994
Cdd:COG2319    310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFSPDGRT 386
                          410
                   ....*....|....*
gi 2217304987  995 VLSAGD--AVFLWDV 1007
Cdd:COG2319    387 LASGSAdgTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
1137-1550 9.70e-41

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 156.22  E-value: 9.70e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1137 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1216
Cdd:COG2319     58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1217 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1295
Cdd:COG2319    132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1296 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1373
Cdd:COG2319    176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1374 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1453
Cdd:COG2319    240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1454 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1533
Cdd:COG2319    308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
                          410
                   ....*....|....*..
gi 2217304987 1534 TDGQTVLSGDKDGLVAV 1550
Cdd:COG2319    382 PDGRTLASGSADGTVRL 398
CFA20_dom pfam05018
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ...
13-190 2.76e-34

CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.


Pssm-ID: 461521  Cd Length: 185  Bit Score: 130.40  E-value: 2.76e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018    5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018   84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
                          170       180
                   ....*....|....*....|....*
gi 2217304987  170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018  145 GtnyvETVRVQIHANCRLRRIYFSD 169
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
706-1000 1.24e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 123.60  E-value: 1.24e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 785
Cdd:cd00200      6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  786 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 865
Cdd:cd00200     86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  866 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 936
Cdd:cd00200    152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987  937 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1000
Cdd:cd00200    219 ENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1300-1604 9.15e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 115.12  E-value: 9.15e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1300 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1376
Cdd:cd00200     10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1377 gsgassVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDG 1456
Cdd:cd00200     88 ------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1457 SVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFSTD 1535
Cdd:cd00200    158 TIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFSPD 230
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1536 GQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1604
Cdd:cd00200    231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
399-776 5.95e-27

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 115.39  E-value: 5.95e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  399 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 478
Cdd:COG2319     59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  479 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 556
Cdd:COG2319    137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  557 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 636
Cdd:COG2319    202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  637 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 713
Cdd:COG2319    255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987  714 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 776
Cdd:COG2319    335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
414-737 3.11e-21

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 95.86  E-value: 3.11e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  414 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 493
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  494 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 571
Cdd:cd00200     79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  572 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 649
Cdd:cd00200    140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  650 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 727
Cdd:cd00200    201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
                          330
                   ....*....|
gi 2217304987  728 SQDRTVRIWD 737
Cdd:cd00200    280 SADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1426-1744 3.53e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 83.92  E-value: 3.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1426 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1505
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1506 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1583
Cdd:cd00200     75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1584 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1663
Cdd:cd00200    145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1664 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1741
Cdd:cd00200    181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245

                   ...
gi 2217304987 1742 WEV 1744
Cdd:cd00200    246 WDL 248
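The cd00200 description above characterizes a WD40 repeat by a GH dipeptide near its N-terminus and a WD dipeptide at its C-terminus. A minimal sketch of a scan for that pattern follows. The function name `find_gh_wd` and the 20-35 residue spacer bound are assumptions made for illustration and are not part of the CDD model; real repeat detection relies on the position-specific scoring matrices reported in the hits, not on a regular expression.

```python
import re

# Hypothetical spacer bounds between the GH and WD dipeptides; these are
# illustrative values, not parameters taken from the CDD model.
GH_WD_RE = re.compile(r"GH(.{20,35}?)WD")


def find_gh_wd(seq):
    """Return (start, end, spacer_length) for each GH...WD candidate.

    Coordinates are 1-based and inclusive, relative to the ungapped input.
    """
    # Drop alignment gaps and fold lowercase residues to uppercase.
    seq = seq.replace("-", "").upper()
    return [
        (m.start() + 1, m.end(), len(m.group(1)))
        for m in GH_WD_RE.finditer(seq)
    ]


# First 60 residues of the cd00200 consensus row shown above (positions 1-60).
consensus = "LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS"
print(find_gh_wd(consensus))
```

Many query repeats in this protein deviate from the consensus dipeptides, so a scan like this will miss repeats that the profile-based hits still detect.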
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1424-1461 3.69e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.38  E-value: 3.69e-06
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 2217304987  1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
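Each alignment block pairs a query row (`gi ...`) with a model consensus row (`Cdd:...`). As a rough sketch, the identity over aligned columns can be counted directly from the two rows. The helper name `aligned_identity` is hypothetical, and the code assumes that dashes mark gaps and that lowercase letters mark unaligned residues, so only columns with an uppercase residue in both rows are counted.

```python
def aligned_identity(query_row, subject_row):
    """Count identical and aligned columns between two alignment rows.

    Assumption: '-' marks a gap and lowercase letters mark unaligned
    residues, so a column counts as aligned only if both characters
    are uppercase letters.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    aligned = identical = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the smart00320 hit at query interval 1424-1461.
query = "GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW"
model = "GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW"
identical, aligned = aligned_identity(query, model)
print(f"{identical}/{aligned} aligned columns identical")
```

This figure is only a descriptive summary of the displayed rows; the bit score and E-value reported for the hit come from the profile scoring, not from percent identity.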
WD40 pfam00400
WD domain, G-beta repeat;
1424-1461 6.87e-06

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 6.87e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
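Every hit carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following sketch turns such a line into a dictionary so hits can be sorted or filtered. The function name `parse_summary` and the dictionary keys are hypothetical, and the regular expression assumes the exact field labels seen in this listing, with the bracketed `[Multi-domain]` tag treated as optional.

```python
import re

# Field labels are taken from the summary lines in this listing; the
# bracketed tag (for example "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit summary line; return None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["tag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


line = "Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 6.87e-06"
print(parse_summary(line))
```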
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
706-737 8.87e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 8.87e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2217304987   706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 737
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
 
WD40 COG2319
WD40 repeat [General function prediction only];
602-1007 6.52e-43

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 162.77  E-value: 6.52e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  602 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 681
Cdd:COG2319     15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  682 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 761
Cdd:COG2319     93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  762 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 840
Cdd:COG2319    173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  841 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 916
Cdd:COG2319    250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  917 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQ 994
Cdd:COG2319    310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFSPDGRT 386
                          410
                   ....*....|....*
gi 2217304987  995 VLSAGD--AVFLWDV 1007
Cdd:COG2319    387 LASGSAdgTVRLWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
1137-1550 9.70e-41

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 156.22  E-value: 9.70e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1137 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1216
Cdd:COG2319     58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1217 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1295
Cdd:COG2319    132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1296 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1373
Cdd:COG2319    176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1374 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1453
Cdd:COG2319    240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1454 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1533
Cdd:COG2319    308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
                          410
                   ....*....|....*..
gi 2217304987 1534 TDGQTVLSGDKDGLVAV 1550
Cdd:COG2319    382 PDGRTLASGSADGTVRL 398
WD40 COG2319
WD40 repeat [General function prediction only];
633-1019 2.17e-37

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 146.59  E-value: 2.17e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  633 SLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAP 711
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLgLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  712 VLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTC 791
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  792 HRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvm 870
Cdd:COG2319    161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAV--------RSVAFSPDGKLLA------------ 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  871 gSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 941
Cdd:COG2319    221 -SGSADGTVRL----WDLATGKLlrtltghsGSVRSVAFSPD--GRLLASGSADGTVRLwDLATGELLRTLTG-HSGGVN 292
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  942 SLTLSEDARFLLIA-AGRTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVlaPTERQVPTL 1018
Cdd:COG2319    293 SVAFSPDGKLLASGsDDGTVRLWDLATGKL--LRTLTGHTGAVRSVAFSPDGKTLASGSDdgTVRLWDL--ATGELLRTL 368

                   .
gi 2217304987 1019 S 1019
Cdd:COG2319    369 T 369
WD40 COG2319
WD40 repeat [General function prediction only];
1170-1603 3.35e-37

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.82  E-value: 3.35e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1170 QVLASASGRSSTTAHCQIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhDGRTLALWGTATYDLVSSTRLPE 1249
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAG-DLTLLLLDAAAGALLATLLGHTA 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1250 PVHGVAFNPWDAGELTCVGQGTVTFWLLQQRGADISLQVRREPVpeavgagelTSLCYgAP--PLLYCGTSSGQVCVWDT 1327
Cdd:COG2319     80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV---------RSVAF-SPdgKTLASGSADGTVRLWDL 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1328 RAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRckgsgassVFMEHelvlDGAVVSASFddSVD 1405
Cdd:COG2319    150 ATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLR--------TLTGH----TGAVRSVAF--SPD 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1406 mG---VVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCL 1482
Cdd:COG2319    216 -GkllASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSV 294
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1483 AWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVL 1562
Cdd:COG2319    295 AFS------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTL 368
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 2217304987 1563 SDHQGaPISTICVTckecedlgveGTDLWLA-ASGDQRVSVW 1603
Cdd:COG2319    369 TGHTG-AVTSVAFS----------PDGRTLAsGSADGTVRLW 399
CFA20_dom pfam05018
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ...
13-190 2.76e-34

CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.


Pssm-ID: 461521  Cd Length: 185  Bit Score: 130.40  E-value: 2.76e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018    5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987   90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018   84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
                          170       180
                   ....*....|....*....|....*
gi 2217304987  170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018  145 GtnyvETVRVQIHANCRLRRIYFSD 169
WD40 COG2319
WD40 repeat [General function prediction only];
597-967 9.92e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 135.81  E-value: 9.92e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  597 QRMVVRHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCV 675
Cdd:COG2319     49 ARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAF 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  676 SPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPC 755
Cdd:COG2319    129 SPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  756 AVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQ-WHVLRVAA 834
Cdd:COG2319    209 SVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHS 288
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  835 DMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGTLDLA---SSRLDSAMAVCFGPAalGHLL 910
Cdd:COG2319    289 GGV--------NSVAFSPDGKLLA-------------SGSDDGTVRLwDLATGKLLrtlTGHTGAVRSVAFSPD--GKTL 345
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2217304987  911 VSTSSNRVVVL-DAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYAT 967
Cdd:COG2319    346 ASGSDDGTVRLwDLATGELLRTLTG-HTGAVTSVAFSPDGRTLASGSAdGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
385-817 5.79e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 127.33  E-value: 5.79e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  385 LWTPDGAAVVYPCHAVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAQARAPSVMRLWDFQTGRCLCLFRSPMHV 464
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  465 VCSLSFSDSGALLCGVGKDHHgrtmVVAWGTgqvgLGGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRL 542
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGT----VRLWDL----ATGLLLRTLTGHTG-AVRS--VAFSpDGKTLASGSAdGTVRLWDL 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  543 RGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqRMVVRHARRLLPTRTpggphpqkqt 622
Cdd:COG2319    150 ATGKL----LRTLTGHSGAVTSVAF--SPDG------KLLASGSDDGTV------RLWDLATGKLLRTLT---------- 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  623 fssGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSR 699
Cdd:COG2319    202 ---GHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  700 VYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFS 779
Cdd:COG2319    279 ELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD 358
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 2217304987  780 LEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSL 817
Cdd:COG2319    359 LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTV 396
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
706-1000 1.24e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 123.60  E-value: 1.24e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 785
Cdd:cd00200      6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  786 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 865
Cdd:cd00200     86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  866 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 936
Cdd:cd00200    152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987  937 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1000
Cdd:cd00200    219 ENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
WD40 COG2319
WD40 repeat [General function prediction only];
840-1367 3.03e-28

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 119.25  E-value: 3.03e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  840 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAalGHLLVSTSSNRVV 919
Cdd:COG2319     25 LGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPD--GRLLASASADGTV 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  920 VL-DAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQVLS 997
Cdd:COG2319    103 RLwDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGsADGTVRLWDLATGKLL--RTLTGHSGAVTSVAFSPDGKLLAS 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  998 AGD--AVFLWDVLapTERQVPTLScvclGPPGPPETLsppspatkaspgppqparqaraqdrwrtqrpgpASSPGSRsps 1075
Cdd:COG2319    180 GSDdgTVRLWDLA--TGKLLRTLT----GHTGAVRSV---------------------------------AFSPDGK--- 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1076 hvrhlhhgwasvpglpkvamgtcpppasggWLrlkAVVGYSGNGRanmVWRPDTGFFAYTcgrlvvvedlhsgaqqhWSG 1155
Cdd:COG2319    218 ------------------------------LL---ASGSADGTVR---LWDLATGKLLRT-----------------LTG 244
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1156 HSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALWG 1235
Cdd:COG2319    245 HSGSVRSVAFSPDGRLLASGSADGT------VRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS-GSDDG-TVRLWD 316
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1236 TATYDLVSS-TRLPEPVHGVAFNPwdageltcVGQgtvtfwllqqrgadislqvrrepvpeavgageltslcygappLLY 1314
Cdd:COG2319    317 LATGKLLRTlTGHTGAVRSVAFSP--------DGK------------------------------------------TLA 346
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 1315 CGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAV 1367
Cdd:COG2319    347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSpdGRTLASGSADGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1300-1604 9.15e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 115.12  E-value: 9.15e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1300 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1376
Cdd:cd00200     10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1377 gsgassVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDG 1456
Cdd:cd00200     88 ------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1457 SVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFSTD 1535
Cdd:cd00200    158 TIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFSPD 230
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1536 GQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1604
Cdd:cd00200    231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
399-776 5.95e-27

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 115.39  E-value: 5.95e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  399 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 478
Cdd:COG2319     59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  479 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 556
Cdd:COG2319    137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  557 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 636
Cdd:COG2319    202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  637 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 713
Cdd:COG2319    255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987  714 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 776
Cdd:COG2319    335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
WD40 COG2319
WD40 repeat [General function prediction only];
303-740 8.23e-27

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 115.01  E-value: 8.23e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  303 VSLSQERSDASNADGPGFHSLEPWAQLEASDIHTAAAGTHVLTHESAEVPVARTGSCEGFLPDPVLRLKGVIGFGGHGTR 382
Cdd:COG2319      2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  383 QAL-WTPDGAAVVYPCHAVIVVLL-VDTGEQRFFL-GHTDKVSALALDGSSSLLASAQAraPSVMRLWDFQTGRCLCLFR 459
Cdd:COG2319     82 LSVaFSPDGRLLASASADGTVRLWdLATGLLLRTLtGHTGAVRSVAFSPDGKTLASGSA--DGTVRLWDLATGKLLRTLT 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  460 SPMHVVCSLSFSDSGallcgvgkdhhgRTMVVAWGTGQVGL----GGEVVVLAKAHTDFdvqAFRVTF-FDETRMASCGQ 534
Cdd:COG2319    160 GHSGAVTSVAFSPDG------------KLLASGSDDGTVRLwdlaTGKLLRTLTGHTGA---VRSVAFsPDGKLLASGSA 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  535 -GSVRLWRLRGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTP 613
Cdd:COG2319    225 dGTVRLWDLATGKL----LRTLTGHSGSVRSVAF--SPDG------RLLASGSADGTV--------------RLWDLATG 278
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  614 ggphpQKQTFSSGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGH 690
Cdd:COG2319    279 -----ELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGT 353
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217304987  691 LGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLAT 740
Cdd:COG2319    354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
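Each hit record carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` ahead of its alignment. As a minimal sketch of how such a line could be turned into typed fields, assuming the layout seen in this listing stays stable (the names `HitSummary` and `parse_hit_summary` are hypothetical and not part of any NCBI tool):

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumption: the fields always appear in this order, separated by whitespace,
# with an optional bracketed tag such as [Multi-domain] after the Pssm-ID.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_hit_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if the layout does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match.group(1)),
        cd_length=int(match.group(2)),
        bit_score=float(match.group(3)),
        e_value=float(match.group(4)),
    )


if __name__ == "__main__":
    example = ("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
               "Bit Score: 115.01  E-value: 8.23e-27")
    print(parse_hit_summary(example))
```

Collecting these tuples for every record would make it straightforward to sort or filter the hits by E-value.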
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1151-1510 2.76e-26

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 110.89  E-value: 2.76e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1151 QHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhdGRT 1230
Cdd:cd00200      3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGT------IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1231 LALW----GTATYDLVSSTrlpEPVHGVAFNPwdageltcvgqgtvtfwllqqrgadislqvrrepvpeavgageltslc 1306
Cdd:cd00200     75 IRLWdletGECVRTLTGHT---SYVSSVAFSP------------------------------------------------ 103
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1307 ygAPPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFSGSR--LVSGSSTGRLRLWAVgavSELRCKGsgassVF 1384
Cdd:cd00200    104 --DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGtfVASSSQDGTIKLWDL---RTGKCVA-----TL 173
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1385 MEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALA 1464
Cdd:cd00200    174 TGHT----GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLR 249
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 2217304987 1465 SMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFS 1510
Cdd:cd00200    250 TGECVQTLSGHTNSVTSLAWS------PDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1325-1746 1.38e-25

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 111.54  E-value: 1.38e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1325 WDTRAGRCFLSWEADDGGIGLLLFSGSRLVSGSSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSV 1404
Cdd:COG2319     23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTV 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1405 dmgvvgttagTLWFVswAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAW 1484
Cdd:COG2319    103 ----------RLWDL--ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1485 SppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSD 1564
Cdd:COG2319    171 S------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTG 244
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1565 HQGAPISticvtckecedLGVEGTDLWLA-ASGDQRVSVWasdwlrnhcelvDWLSFPMPATTETQGHLPPSLaAFCPwD 1643
Cdd:COG2319    245 HSGSVRS-----------VAFSPDGRLLAsGSADGTVRLW------------DLATGELLRTLTGHSGGVNSV-AFSP-D 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1644 GALLMYVGPGvyKEVIIYNLCQKQVVEKIPLPFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHL 1721
Cdd:COG2319    300 GKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAvRSVAFSPDGKTLASGSDDGTVRLWDLATGELlRTLTGHTGAVTS 377
                          410       420
                   ....*....|....*....|....*.
gi 2217304987 1722 CRFTPSARLLFTAAR-NEILVWEVPG 1746
Cdd:COG2319    378 VAFSPDGRTLASGSAdGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
625-881 1.48e-25

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 108.58  E-value: 1.48e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  625 SGPGIAISSLSVSPAMcAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHM 703
Cdd:cd00200     51 TGPVRDVAASADGTYL-ASGSSDKTIRLWDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  704 LARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 783
Cdd:cd00200    130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG 209
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  784 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgps 863
Cdd:cd00200    210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIR-------VWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLA----- 277
                          250
                   ....*....|....*...
gi 2217304987  864 rctvtvmgSASLDELLRV 881
Cdd:cd00200    278 --------SGSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
1357-1746 2.53e-24

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 107.69  E-value: 2.53e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1357 SSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSK 1436
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1437 VNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAM 1516
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS------PDGKTLASGSADGTVRLWDLATGKL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1517 ELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedlgVEGTdlWLA-AS 1595
Cdd:COG2319    155 LRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG-AVRSVAFS--------PDGK--LLAsGS 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1596 GDQRVSVWasDWLRNHCelvdwlsfpmpaTTETQGHLPPSLA-AFCPwDGALLmyVGPGVYKEVIIYNLCQKQVVEKIPL 1674
Cdd:COG2319    224 ADGTVRLW--DLATGKL------------LRTLTGHSGSVRSvAFSP-DGRLL--ASGSADGTVRLWDLATGELLRTLTG 286
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 1675 PFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMG-TAQDFAGHDNAVHLCRFTPSARLLFTAAR-NEILVWEVPG 1746
Cdd:COG2319    287 HSGGvNSVAFSPDGKLLASGSDDGTVRLWDLATGkLLRTLTGHTGAVRSVAFSPDGKTLASGSDdGTVRLWDLAT 361
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
631-964 2.94e-22

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 98.95  E-value: 2.94e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  631 ISSLSVSPA--MCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDT----LSRVYHM 703
Cdd:cd00200     12 VTCVAFSPDgkLLATGSGDGTIKVWDLETGELLRTlKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLetgeCVRTLTG 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  704 larsHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 783
Cdd:cd00200     92 ----HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  784 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQWHvlrvaadMVCPDAPASPSALAVSRDGRLLAfvgps 863
Cdd:cd00200    168 KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL-------GTLRGHENGVNSVAFSPDGYLLA----- 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  864 rctvtvmgSASLDELLRVDigtldlassrldsamavcfgpaalghllvstssnrvvvlDAVSGRIIRELPGvHPEPCPSL 943
Cdd:cd00200    236 --------SGSEDGTIRVW---------------------------------------DLRTGECVQTLSG-HTNSVTSL 267
                          330       340
                   ....*....|....*....|..
gi 2217304987  944 TLSEDARFLLIAAG-RTIKVWD 964
Cdd:cd00200    268 AWSPDGKRLASGSAdGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
414-737 3.11e-21

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 95.86  E-value: 3.11e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  414 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 493
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  494 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 571
Cdd:cd00200     79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  572 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 649
Cdd:cd00200    140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  650 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 727
Cdd:cd00200    201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
                          330
                   ....*....|
gi 2217304987  728 SQDRTVRIWD 737
Cdd:cd00200    280 SADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1124-1366 4.75e-19

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 89.70  E-value: 4.75e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1124 VWRPDTGFFAyTCG--RLVVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASgrssttAHCQIRVWDVSGGLCQHL 1200
Cdd:cd00200     58 AASADGTYLA-SGSsdKTIRLWDLETGECVRtLTGHTSYVSSVAFSPDGRILSSSS------RDKTIKVWDVETGKCLTT 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1201 IFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALW----GTATYDLVSSTRlpePVHGVAFNPwDAGEL-TCVGQGTVTFW 1275
Cdd:cd00200    131 LRGHTDWVNSVAFSPDGTFVAS-SSQDG-TIKLWdlrtGKCVATLTGHTG---EVNSVAFSP-DGEKLlSSSSDGTIKLW 204
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1276 LLQQRGADISLQVRREPVpeavgagelTSLCYGAPPLLYCGTSS-GQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSR 1352
Cdd:cd00200    205 DLSTGKCLGTLRGHENGV---------NSVAFSPDGYLLASGSEdGTIRVWDLRTGECVQTLSGHTNSVTSLAWSpdGKR 275
                          250
                   ....*....|....
gi 2217304987 1353 LVSGSSTGRLRLWA 1366
Cdd:cd00200    276 LASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1426-1744 3.53e-17

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 83.92  E-value: 3.53e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1426 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1505
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1506 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1583
Cdd:cd00200     75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1584 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1663
Cdd:cd00200    145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1664 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1741
Cdd:cd00200    181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245

                   ...
gi 2217304987 1742 WEV 1744
Cdd:cd00200    246 WDL 248
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
788-1009 5.35e-14

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 74.68  E-value: 5.35e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  788 EHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADmvcpdapaSPSALAVSRDGRLLAfvgpsrct 866
Cdd:cd00200      4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELlRTLKGHTG--------PVRDVAASADGTYLA-------- 67
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  867 vtvmgSASLDELLRV-DIGTLDLASS---RLDSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 941
Cdd:cd00200     68 -----SGSSDKTIRLwDLETGECVRTltgHTSYVSSVAFSPD--GRILSSSSRDKTIKVwDVETGKCLTTLRG-HTDWVN 139
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217304987  942 SLTLSEDARFLLIAAG-RTIKVWDYATqASPGpQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVLA 1009
Cdd:cd00200    140 SVAFSPDGTFVASSSQdGTIKLWDLRT-GKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSdgTIKLWDLST 208
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
412-541 1.62e-09

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 61.20  E-value: 1.62e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  412 RFFLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDhhgRTMVV 491
Cdd:cd00200    171 ATLTGHTGEVNSVAFSPDGEKLLSSSSD--GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED---GTIRV 245
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217304987  492 aWgtgQVGLGGEVVVLaKAHTDFdVQAFRVTfFDETRMASCGQ-GSVRLWR 541
Cdd:cd00200    246 -W---DLRTGECVQTL-SGHTNS-VTSLAWS-PDGKRLASGSAdGTIRIWD 289
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1424-1461 3.69e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 45.38  E-value: 3.69e-06
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 2217304987  1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
WD40 pfam00400
WD domain, G-beta repeat;
1424-1461 6.87e-06

Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 44.26  E-value: 6.87e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
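Every alignment block pairs a query row (`gi ...`) with a domain-model row (`Cdd:...`). The sketch below estimates the fraction of identical residues between two such rows. It assumes that `-` marks a gap and that lowercase letters mark unaligned residues, so both are skipped; that reading of the display is an assumption, and the function name `pairwise_identity` is hypothetical.

```python
def pairwise_identity(query_row: str, cdd_row: str) -> float:
    """Fraction of aligned columns whose residues are identical.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are ignored.
    """
    if len(query_row) != len(cdd_row):
        raise ValueError("aligned rows must have equal length")

    aligned = 0
    identical = 0
    for q, c in zip(query_row, cdd_row):
        # Skip gap columns and residues shown in lowercase.
        if q == "-" or c == "-" or q.islower() or c.islower():
            continue
        aligned += 1
        if q == c:
            identical += 1

    # No comparable columns: report zero identity rather than dividing by zero.
    return identical / aligned if aligned else 0.0


if __name__ == "__main__":
    # Rows copied from the pfam00400 hit (query residues 1424-1461).
    query = "GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW"
    model = "GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW"
    print(f"identity: {pairwise_identity(query, model):.2f}")
```

This is only a per-column comparison; it does not reproduce the bit score or E-value reported for the hit, which come from the position-specific scoring matrix.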
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
840-967 1.44e-04

Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 45.46  E-value: 1.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  840 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGtldlassrlDSAMAVCFGPAAlGHLLVS-TSSNR- 917
Cdd:COG3391    107 PVGGGPRGLAVDPDGGRLYVADSGNGRVSVIDTATGKVVATIPVG---------AGPHGIAVDPDG-KRLYVAnSGSNTv 176
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987  918 ---VVVLDAVSGRIIRELP-GVHPEpcpSLTLSEDARFLLIA---------AGRTIKVWDYAT 967
Cdd:COG3391    177 sviVSVIDTATGKVVATIPvGGGPV---GVAVSPDGRRLYVAnrgsntsngGSNTVSVIDLAT 236
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
745-928 3.67e-04

Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 44.24  E-value: 3.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  745 YDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCAD 824
Cdd:COG4257     10 YPVPAPGSGPRDVAVDPDGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDGNLWFTDNGNNRIGRIDPKT 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  825 PQwhVLRVAAdmvcPDAPASPSALAVSRDGRLLafvgpsrctVTVMGSaslDELLRVDIGT----LDLASSRLDSAMAVC 900
Cdd:COG4257     90 GE--ITTFAL----PGGGSNPHGIAFDPDGNLW---------FTDQGG---NRIGRLDPATgevtEFPLPTGGAGPYGIA 151
                          170       180
                   ....*....|....*....|....*....
gi 2217304987  901 FGPAalGHLLV-STSSNRVVVLDAVSGRI 928
Cdd:COG4257    152 VDPD--GNLWVtDFGANAIGRIDPDTGTL 178
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
706-737 8.87e-04

Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 8.87e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 2217304987   706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 737
Cdd:smart00320    9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
829-995 1.18e-03

Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 42.37  E-value: 1.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  829 VLRVAADMVCPDAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAALGH 908
Cdd:COG3391      2 LVASSLLVAVLLAVLALAALAVAVAALGLGGGGPLLAAASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADGRR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  909 LLVS-TSSNRVVVLDAVSGRIIRELP-GVHPEpcpSLTLSEDARFLLIAAGR--TIKVWDYATQASPGpQVYIGhSEPVq 984
Cdd:COG3391     82 LYVAnSGSGRVSVIDLATGKVVATIPvGGGPR---GLAVDPDGGRLYVADSGngRVSVIDTATGKVVA-TIPVG-AGPH- 155
                          170
                   ....*....|.
gi 2217304987  985 AVAFSPDQQQV 995
Cdd:COG3391    156 GIAVDPDGKRL 166
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step in the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance, and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
666-744 3.24e-03


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 41.83  E-value: 3.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  666 HEGPVSSVCVSPDGLRVLSATSSGHLGFLD----TLSRVYHMLArshTAPVLALAMEQRRGQLATVSQDRTVRIWDLATL 741
Cdd:cd22857    222 GETPIKAVAEDPDGHTVYVGDTSGDLASIDlrtgKLLGCFKGKC---GGSIRSIARHPELPLIASCGLDRYLRIWDTETR 298

                   ...
gi 2217304987  742 QQL 744
Cdd:cd22857    299 QLL 301
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway.
785-973 3.42e-03


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 41.42  E-value: 3.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  785 VLVEHTCHRGavTGLTATPDGRLLFSSCSQGSLAQYSCADPQWHVLRvaadmvcpDAPASPSALAVSRDGRLLAfvgpsr 864
Cdd:COG3386      1 KLADAGFRLG--EGPVWDPDGRLYWVDIPGGRIHRYDPDGGAVEVFA--------EPSGRPNGLAFDPDGRLLV------ 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987  865 ctvtvmgSASLDELLRVDIGT------LDLASSRLDSAMAVCFGPAalGHLLVSTSSN-----RVVVLDAvSGRIIRELP 933
Cdd:COG3386     65 -------ADHGRGLVRFDPADgevtvlADEYGKPLNRPNDGVVDPD--GRLYFTDMGEylptgALYRVDP-DGSLRVLAD 134
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2217304987  934 GVHpepCP-SLTLSEDARFLLIA--AGRTIKVWDYATQASPGP 973
Cdd:COG3386    135 GLT---FPnGIAFSPDGRTLYVAdtGAGRIYRFDLDADGTLGN 174
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
974-1006 4.78e-03


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 4.78e-03
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 2217304987   974 QVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 1006
Cdd:smart00320    6 KTLKGHTGPVTSVAFSPDGKYLASGSDdgTIKLWD 40
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
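
The E-value threshold listed above can be checked against the reported hits with a short sketch. The tuples below are copied from the hit list in this section only (earlier hits on the page are omitted); the names `hits`, `E_VALUE_THRESHOLD`, and `hits_within_threshold` are hypothetical helpers for illustration, not part of any NCBI tool.

```python
# (name, accession, query interval, E-value), copied from the hit list above.
hits = [
    ("Vgb", "COG4257", "745-928", 3.67e-04),
    ("WD40", "smart00320", "706-737", 8.87e-04),
    ("YncE", "COG3391", "829-995", 1.18e-03),
    ("WDR74", "cd22857", "666-744", 3.24e-03),
    ("YvrE", "COG3386", "785-973", 3.42e-03),
    ("WD40", "smart00320", "974-1006", 4.78e-03),
]

# Threshold taken from the "Preset Options" line.
E_VALUE_THRESHOLD = 0.01


def hits_within_threshold(hit_list, threshold):
    """Keep hits whose E-value does not exceed the threshold, best first."""
    kept = [hit for hit in hit_list if hit[3] <= threshold]
    return sorted(kept, key=lambda hit: hit[3])


for name, accession, interval, e_value in hits_within_threshold(hits, E_VALUE_THRESHOLD):
    print(f"{name:6s} {accession:11s} {interval:9s} {e_value:.2e}")
```

Every hit listed in this section falls below the threshold, which is consistent with the page reporting only hits that pass the preset filter.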

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.