| Name | Accession | Description | Interval | E-value |
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 602-1007 | 6.52e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 6.52e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 602 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 681
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 682 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 761
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 762 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 840
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 841 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 916
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 917 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQ 994
Cdd:COG2319 310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFSPDGRT 386
|
410
....*....|....*
gi 2217304987 995 VLSAGD--AVFLWDV 1007
Cdd:COG2319 387 LASGSAdgTVRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1137-1550 | 9.70e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.22 E-value: 9.70e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1137 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1216
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1217 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1295
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1296 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1373
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1374 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1453
Cdd:COG2319 240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1454 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1533
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
|
410
....*....|....*..
gi 2217304987 1534 TDGQTVLSGDKDGLVAV 1550
Cdd:COG2319 382 PDGRTLASGSADGTVRL 398
|
|
| CFA20_dom | pfam05018 | CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ... | 13-190 | 2.76e-34 |
|
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.76e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2217304987 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 706-1000 | 1.24e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 123.60 E-value: 1.24e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 785
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 786 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 865
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 866 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 936
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 937 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1000
Cdd:cd00200 219 ENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1300-1604 | 9.15e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.12 E-value: 9.15e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1300 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1376
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1377 gsgassVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDG 1456
Cdd:cd00200 88 ------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1457 SVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFSTD 1535
Cdd:cd00200 158 TIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1536 GQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1604
Cdd:cd00200 231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 399-776 | 5.95e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.39 E-value: 5.95e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 399 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 478
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 479 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 556
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 557 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 636
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 637 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 713
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987 714 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 776
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 414-737 | 3.11e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.11e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 414 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 493
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 494 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 571
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 572 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 649
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 650 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 727
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217304987 728 SQDRTVRIWD 737
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1426-1744 | 3.53e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 3.53e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1426 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1505
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1506 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1583
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1584 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1663
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1664 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1741
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2217304987 1742 WEV 1744
Cdd:cd00200 246 WDL 248
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 1424-1461 | 3.69e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.69e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
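The paired `gi` and `Cdd` rows in the alignment blocks can be compared column by column. The sketch below is illustrative only: the function name `count_identities` is hypothetical, and it assumes that uppercase letters mark aligned residues while lowercase letters and `-` mark unaligned residues and gaps. It is not how the database scores a hit; the reported bit scores come from position-specific scoring, not from simple identity.

```python
def count_identities(query_row, model_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: only columns where both rows carry an uppercase letter
    count as aligned; lowercase letters and '-' are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Skip gap columns and unaligned (lowercase) residues.
        if not (q.isalpha() and q.isupper() and m.isalpha() and m.isupper()):
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the smart00320 hit at interval 1424-1461.
query = "GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW"
model = "GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW"
identical, aligned = count_identities(query, model)
print(f"{identical} identical of {aligned} aligned columns")
```

For longer hits, the rows from successive 80-column blocks would need to be concatenated before calling the function.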
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 1424-1461 | 6.87e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 6.87e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 706-737 | 8.87e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.87e-04
10 20 30
....*....|....*....|....*....|..
gi 2217304987 706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 737
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|
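Each hit record carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A minimal sketch for pulling those fields out of the text, assuming the labels appear in exactly this order and that the optional bracketed flag is `[Multi-domain]` when present; `parse_stat_line` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# Pattern for the per-hit statistics line; the bracketed flag is optional.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Parse one statistics line into a dict, or return None if it does not match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Line copied from the first COG2319 hit.
hit = parse_stat_line(
    "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 6.52e-43"
)
print(hit)
```

Sorting the parsed records by `e_value` gives the hits in order of decreasing significance, which is the order the tables already follow.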
| Name | Accession | Description | Interval | E-value |
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 602-1007 | 6.52e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 6.52e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 602 RHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAmcAVGSEDGFLRLWPLDFSSVLLEAEHEGPVSSVCVSPDGLR 681
Cdd:COG2319 15 DLALALLAAALGALLLLLLGLAAAVASLAASPDGARLA--AGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 682 VLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHP 761
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 762 TRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpd 840
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLlRTLTGHSGSV--- 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 841 apaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGT---LDLASSRLDSAMAVCFGPAalGHLLVSTSSN 916
Cdd:COG2319 250 -----RSVAFSPDGRLLA-------------SGSADGTVRLwDLATgelLRTLTGHSGGVNSVAFSPD--GKLLASGSDD 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 917 RVV-VLDAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQ 994
Cdd:COG2319 310 GTVrLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGsDDGTVRLWDLATGELL--RTLTGHTGAVTSVAFSPDGRT 386
|
410
....*....|....*
gi 2217304987 995 VLSAGD--AVFLWDV 1007
Cdd:COG2319 387 LASGSAdgTVRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1137-1550 | 9.70e-41 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 156.22 E-value: 9.70e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1137 GRLVVVEDLHSGAQQHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPD 1216
Cdd:COG2319 58 LTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGT------VRLWDLATGLLLRTLTGHTGAVRSVAFSPD 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1217 DRLLVTlGDHDGrTLALWGTATYDLVSSTRLPE-PVHGVAFNPwDageltcvGQgtvtfwllqqrgadislqvrrepvpe 1295
Cdd:COG2319 132 GKTLAS-GSADG-TVRLWDLATGKLLRTLTGHSgAVTSVAFSP-D-------GK-------------------------- 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1296 avgageltslcygappLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSEL 1373
Cdd:COG2319 176 ----------------LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLL 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1374 RckgsgassVFMEHelvlDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCS 1453
Cdd:COG2319 240 R--------TLTGH----SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGS 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1454 EDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFS 1533
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS------PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFS 381
|
410
....*....|....*..
gi 2217304987 1534 TDGQTVLSGDKDGLVAV 1550
Cdd:COG2319 382 PDGRTLASGSADGTVRL 398
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 633-1019 | 2.17e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 146.59 E-value: 2.17e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 633 SLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAP 711
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLgLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 712 VLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTC 791
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 792 HRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvm 870
Cdd:COG2319 161 HSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAV--------RSVAFSPDGKLLA------------ 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 871 gSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 941
Cdd:COG2319 221 -SGSADGTVRL----WDLATGKLlrtltghsGSVRSVAFSPD--GRLLASGSADGTVRLwDLATGELLRTLTG-HSGGVN 292
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 942 SLTLSEDARFLLIA-AGRTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVlaPTERQVPTL 1018
Cdd:COG2319 293 SVAFSPDGKLLASGsDDGTVRLWDLATGKL--LRTLTGHTGAVRSVAFSPDGKTLASGSDdgTVRLWDL--ATGELLRTL 368
|
.
gi 2217304987 1019 S 1019
Cdd:COG2319 369 T 369
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1170-1603 | 3.35e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 145.82 E-value: 3.35e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1170 QVLASASGRSSTTAHCQIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhDGRTLALWGTATYDLVSSTRLPE 1249
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAG-DLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1250 PVHGVAFNPWDAGELTCVGQGTVTFWLLQQRGADISLQVRREPVpeavgagelTSLCYgAP--PLLYCGTSSGQVCVWDT 1327
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAV---------RSVAF-SPdgKTLASGSADGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1328 RAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRckgsgassVFMEHelvlDGAVVSASFddSVD 1405
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLR--------TLTGH----TGAVRSVAF--SPD 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1406 mG---VVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCL 1482
Cdd:COG2319 216 -GkllASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSV 294
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1483 AWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVL 1562
Cdd:COG2319 295 AFS------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTL 368
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 2217304987 1563 SDHQGaPISTICVTckecedlgveGTDLWLA-ASGDQRVSVW 1603
Cdd:COG2319 369 TGHTG-AVTSVAFS----------PDGRTLAsGSADGTVRLW 399
|
|
| CFA20_dom | pfam05018 | CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 ... | 13-190 | 2.76e-34 |
|
CFA20 domain; This domain is characteristic of cilia- and flagella-associated protein 20 (CFA20). CFA20 is a cilium- and flagellum-specific protein that plays a role in axonemal structure organization and motility. In Chlamydomonas reinhardtii, it stabilizes outer doublet microtubules (DMTs) of the axoneme and may work as a scaffold for intratubular proteins, such as tektin and PACRG, to produce the beak structures in DMT1. Other proteins contain a domain with homology to CFA20. WDR90/POC16 contains such a domain in its N terminus, followed by a large C-terminal domain with multiple WD40 repeats. This domain is also present in the N terminus of uncharacterized protein C3orf67.
Pssm-ID: 461521 Cd Length: 185 Bit Score: 130.40 E-value: 2.76e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 13 AWQHPFLNVFRHFRV---DEWKRSAKQGDVAVVTDKTLKGAVYRIRGSVSAANYIQLPKSSTQSLGLTGRYLYVLFRPLp 89
Cdd:pfam05018 5 TFQSGFLSIFYSIGSkplQIWSKKVKNGHIKRVTDDDIKSNVLEIVGTNVATTYITCPADPKQSLGIKLPFLVLLVKNL- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 90 SKHFVIHLDVSSKDNQVIRVSFSNLFKEFKSTATWLQFPLVLEartpqrdlvglapsgARWTClqldLQDVLLVYLNRCY 169
Cdd:pfam05018 84 GKYFSFEIQILDDKNVRRRFRFSNFQKVTKVKPFITTMPLRLN---------------EGWNQ----IQFNLADFTRRAY 144
|
170 180
....*....|....*....|....*
gi 2217304987 170 G----HLKSIRLCASLLVRNLYTSD 190
Cdd:pfam05018 145 GtnyvETVRVQIHANCRLRRIYFSD 169
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 597-967 | 9.92e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 135.81 E-value: 9.92e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 597 QRMVVRHARRLLPTRTPGGPHPQKQTFSSGPGIAISSLSVSPAMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCV 675
Cdd:COG2319 49 ARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAF 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 676 SPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPC 755
Cdd:COG2319 129 SPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 756 AVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQ-WHVLRVAA 834
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHS 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 835 DMVcpdapaspSALAVSRDGRLLAfvgpsrctvtvmgSASLDELLRV-DIGTLDLA---SSRLDSAMAVCFGPAalGHLL 910
Cdd:COG2319 289 GGV--------NSVAFSPDGKLLA-------------SGSDDGTVRLwDLATGKLLrtlTGHTGAVRSVAFSPD--GKTL 345
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 2217304987 911 VSTSSNRVVVL-DAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIAAG-RTIKVWDYAT 967
Cdd:COG2319 346 ASGSDDGTVRLwDLATGELLRTLTG-HTGAVTSVAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 385-817 | 5.79e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.33 E-value: 5.79e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 385 LWTPDGAAVVYPCHAVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAQARAPSVMRLWDFQTGRCLCLFRSPMHV 464
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 465 VCSLSFSDSGALLCGVGKDHHgrtmVVAWGTgqvgLGGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRL 542
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGT----VRLWDL----ATGLLLRTLTGHTG-AVRS--VAFSpDGKTLASGSAdGTVRLWDL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 543 RGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqRMVVRHARRLLPTRTpggphpqkqt 622
Cdd:COG2319 150 ATGKL----LRTLTGHSGAVTSVAF--SPDG------KLLASGSDDGTV------RLWDLATGKLLRTLT---------- 201
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 623 fssGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSR 699
Cdd:COG2319 202 ---GHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 700 VYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFS 779
Cdd:COG2319 279 ELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD 358
|
410 420 430
....*....|....*....|....*....|....*...
gi 2217304987 780 LEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSL 817
Cdd:COG2319 359 LATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTV 396
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
706-1000 |
1.24e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 123.60 E-value: 1.24e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEV 785
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEC 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 786 LVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgpsrc 865
Cdd:cd00200 86 VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIK-------VWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVA------- 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 866 tvtvmgSASLDELLRVdigtLDLASSRL--------DSAMAVCFGPAalGHLLVSTSSNRVV-VLDAVSGRIIRELPGvH 936
Cdd:cd00200 152 ------SSSQDGTIKL----WDLRTGKCvatltghtGEVNSVAFSPD--GEKLLSSSSDGTIkLWDLSTGKCLGTLRG-H 218
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 937 PEPCPSLTLSEDARFLLIAAG-RTIKVWDYATQASpgPQVYIGHSEPVQAVAFSPDQQQVLSAGD 1000
Cdd:cd00200 219 ENGVNSVAFSPDGYLLASGSEdGTIRVWDLRTGEC--VQTLSGHTNSVTSLAWSPDGKRLASGSA 281
|
|
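The cd00200 description above characterises a WD40 repeat as roughly 40 residues long, with a GH dipeptide 11-24 residues from its N-terminus and a WD dipeptide at its C-terminus. The sketch below turns that wording into a naive literal scan, purely as an illustration. It is not how the hits in this listing were produced: they are scored against a position-specific scoring matrix (the Pssm-ID shown for each hit), and real repeats often deviate from the literal GH/WD dipeptides. The function name, the fixed 40-residue window, and the reading of "11-24 residues from its N-terminus" as a 0-based offset of the GH dipeptide within the window are all assumptions made for this example.

```python
import re


def find_wd40_like_windows(seq, length=40, gh_min=11, gh_max=24):
    """Return (start, end) half-open, 0-based windows of `length` residues
    that end in WD and contain GH starting gh_min..gh_max residues from
    the window start. Hypothetical helper, for illustration only."""
    seq = seq.upper()
    hits = []
    # Zero-width lookahead so overlapping WD occurrences are all visited.
    for match in re.finditer(r"(?=WD)", seq):
        end = match.start() + 2          # window ends right after the WD
        start = end - length             # fixed-length window (assumption)
        if start < 0:
            continue                     # not enough residues upstream
        window = seq[start:end]
        # Accept the window if GH begins at any allowed offset.
        if any(window[off:off + 2] == "GH" for off in range(gh_min, gh_max + 1)):
            hits.append((start, end))
    return hits


if __name__ == "__main__":
    # Synthetic 40-residue sequence: GH at offset 15, WD at the C-terminus.
    synthetic = "A" * 15 + "GH" + "A" * 21 + "WD"
    print(find_wd40_like_windows(synthetic))
```

A scan like this is only useful as a quick sanity check on a sequence. Many of the aligned segments in this listing lack one or both dipeptides, so a literal scan would miss them.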
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
840-1367 |
3.03e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 119.25 E-value: 3.03e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 840 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAalGHLLVSTSSNRVV 919
Cdd:COG2319 25 LGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPD--GRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 920 VL-DAVSGRIIRELPGvHPEPCPSLTLSEDARFLLIA-AGRTIKVWDYATQASPgpQVYIGHSEPVQAVAFSPDQQQVLS 997
Cdd:COG2319 103 RLwDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGsADGTVRLWDLATGKLL--RTLTGHSGAVTSVAFSPDGKLLAS 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 998 AGD--AVFLWDVLapTERQVPTLScvclGPPGPPETLsppspatkaspgppqparqaraqdrwrtqrpgpASSPGSRsps 1075
Cdd:COG2319 180 GSDdgTVRLWDLA--TGKLLRTLT----GHTGAVRSV---------------------------------AFSPDGK--- 217
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1076 hvrhlhhgwasvpglpkvamgtcpppasggWLrlkAVVGYSGNGRanmVWRPDTGFFAYTcgrlvvvedlhsgaqqhWSG 1155
Cdd:COG2319 218 ------------------------------LL---ASGSADGTVR---LWDLATGKLLRT-----------------LTG 244
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1156 HSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALWG 1235
Cdd:COG2319 245 HSGSVRSVAFSPDGRLLASGSADGT------VRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS-GSDDG-TVRLWD 316
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1236 TATYDLVSS-TRLPEPVHGVAFNPwdageltcVGQgtvtfwllqqrgadislqvrrepvpeavgageltslcygappLLY 1314
Cdd:COG2319 317 LATGKLLRTlTGHTGAVRSVAFSP--------DGK------------------------------------------TLA 346
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 1315 CGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAV 1367
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSpdGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1300-1604 |
9.15e-28 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.12 E-value: 9.15e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1300 GELTSLCYGA-PPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSRLVSGSSTGRLRLWAVGAVSELRck 1376
Cdd:cd00200 10 GGVTCVAFSPdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLETGECVR-- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1377 gsgassVFMEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDG 1456
Cdd:cd00200 88 ------TLTGHT----SYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1457 SVRVWALASMELVIQFQVLNQSCLCLAWSPpccgrpEQQRLAAGYGDGSLRIFSVsRTAMELK-MHPHPVALTTVAFSTD 1535
Cdd:cd00200 158 TIKLWDLRTGKCVATLTGHTGEVNSVAFSP------DGEKLLSSSSDGTIKLWDL-STGKCLGtLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1536 GQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGApisticVTCkecedLGVEGTDLWLA-ASGDQRVSVWA 1604
Cdd:cd00200 231 GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNS------VTS-----LAWSPDGKRLAsGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
399-776 |
5.95e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.39 E-value: 5.95e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 399 AVIVVLLVDTGEQRFFLGHTDKVSALALDGSSSLLASAqARAPSVmRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLC 478
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA-SADGTV-RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLA 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 479 GVGKDHhgrtMVVAWGTGQvglgGEVVVLAKAHTDfDVQAfrVTFF-DETRMASCGQ-GSVRLWRLRGGVLrscpVDLGE 556
Cdd:COG2319 137 SGSADG----TVRLWDLAT----GKLLRTLTGHSG-AVTS--VAFSpDGKLLASGSDdGTVRLWDLATGKL----LRTLT 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 557 HHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTPggphpQKQTFSSGPGIAISSLSV 636
Cdd:COG2319 202 GHTGAVRSVAF--SPDG------KLLASGSADGTV--------------RLWDLATG-----KLLRTLTGHSGSVRSVAF 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 637 SP--AMCAVGSEDGFLRLWPLDFSSVL-LEAEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVL 713
Cdd:COG2319 255 SPdgRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 334
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987 714 ALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVR 776
Cdd:COG2319 335 SVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVR 397
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
303-740 |
8.23e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 115.01 E-value: 8.23e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 303 VSLSQERSDASNADGPGFHSLEPWAQLEASDIHTAAAGTHVLTHESAEVPVARTGSCEGFLPDPVLRLKGVIGFGGHGTR 382
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 383 QAL-WTPDGAAVVYPCHAVIVVLL-VDTGEQRFFL-GHTDKVSALALDGSSSLLASAQAraPSVMRLWDFQTGRCLCLFR 459
Cdd:COG2319 82 LSVaFSPDGRLLASASADGTVRLWdLATGLLLRTLtGHTGAVRSVAFSPDGKTLASGSA--DGTVRLWDLATGKLLRTLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 460 SPMHVVCSLSFSDSGallcgvgkdhhgRTMVVAWGTGQVGL----GGEVVVLAKAHTDFdvqAFRVTF-FDETRMASCGQ 534
Cdd:COG2319 160 GHSGAVTSVAFSPDG------------KLLASGSDDGTVRLwdlaTGKLLRTLTGHTGA---VRSVAFsPDGKLLASGSA 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 535 -GSVRLWRLRGGVLrscpVDLGEHHALQFTDLAFkqARDGcpepsaAMLFVCSRSGHIleidcqrmvvrharRLLPTRTP 613
Cdd:COG2319 225 dGTVRLWDLATGKL----LRTLTGHSGSVRSVAF--SPDG------RLLASGSADGTV--------------RLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 614 ggphpQKQTFSSGPGIAISSLSVSP--AMCAVGSEDGFLRLWPLDFSSVLLEAE-HEGPVSSVCVSPDGLRVLSATSSGH 690
Cdd:COG2319 279 -----ELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGT 353
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|
gi 2217304987 691 LGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLAT 740
Cdd:COG2319 354 VRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1151-1510 |
2.76e-26 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 110.89 E-value: 2.76e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1151 QHWSGHSAEISTLALSHSAQVLASASGRSSttahcqIRVWDVSGGLCQHLIFPHSTTVLALAFSPDDRLLVTLGDhdGRT 1230
Cdd:cd00200 3 RTLKGHTGGVTCVAFSPDGKLLATGSGDGT------IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1231 LALW----GTATYDLVSSTrlpEPVHGVAFNPwdageltcvgqgtvtfwllqqrgadislqvrrepvpeavgageltslc 1306
Cdd:cd00200 75 IRLWdletGECVRTLTGHT---SYVSSVAFSP------------------------------------------------ 103
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1307 ygAPPLLYCGTSSGQVCVWDTRAGRCFLSWEADDGGIGLLLFSGSR--LVSGSSTGRLRLWAVgavSELRCKGsgassVF 1384
Cdd:cd00200 104 --DGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGtfVASSSQDGTIKLWDL---RTGKCVA-----TL 173
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1385 MEHElvldGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALA 1464
Cdd:cd00200 174 TGHT----GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLR 249
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 2217304987 1465 SMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFS 1510
Cdd:cd00200 250 TGECVQTLSGHTNSVTSLAWS------PDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1325-1746 |
1.38e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 111.54 E-value: 1.38e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1325 WDTRAGRCFLSWEADDGGIGLLLFSGSRLVSGSSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSV 1404
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1405 dmgvvgttagTLWFVswAEGTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAW 1484
Cdd:COG2319 103 ----------RLWDL--ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1485 SppccgrPEQQRLAAGYGDGSLRIFSVSRTAMELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSD 1564
Cdd:COG2319 171 S------PDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTG 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1565 HQGAPISticvtckecedLGVEGTDLWLA-ASGDQRVSVWasdwlrnhcelvDWLSFPMPATTETQGHLPPSLaAFCPwD 1643
Cdd:COG2319 245 HSGSVRS-----------VAFSPDGRLLAsGSADGTVRLW------------DLATGELLRTLTGHSGGVNSV-AFSP-D 299
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1644 GALLMYVGPGvyKEVIIYNLCQKQVVEKIPLPFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHL 1721
Cdd:COG2319 300 GKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAvRSVAFSPDGKTLASGSDDGTVRLWDLATGELlRTLTGHTGAVTS 377
|
410 420
....*....|....*....|....*.
gi 2217304987 1722 CRFTPSARLLFTAAR-NEILVWEVPG 1746
Cdd:COG2319 378 VAFSPDGRTLASGSAdGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
625-881 |
1.48e-25 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 108.58 E-value: 1.48e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 625 SGPGIAISSLSVSPAMcAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHM 703
Cdd:cd00200 51 TGPVRDVAASADGTYL-ASGSSDKTIRLWDLETGECVRTlTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLT 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 704 LARSHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 783
Cdd:cd00200 130 TLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG 209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 784 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAqyscadpQWHVLRVAADMVCPDAPASPSALAVSRDGRLLAfvgps 863
Cdd:cd00200 210 KCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIR-------VWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLA----- 277
|
250
....*....|....*...
gi 2217304987 864 rctvtvmgSASLDELLRV 881
Cdd:cd00200 278 --------SGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1357-1746 |
2.53e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 107.69 E-value: 2.53e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1357 SSTGRLRLWAVGAVSELRCKGSGASSVFMEHELVLDGAVVSASFDDSVDMGVVGTTAGTLWFVSWAEGTSTRLISGHRSK 1436
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1437 VNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGSLRIFSVSRTAM 1516
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS------PDGKTLASGSADGTVRLWDLATGKL 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1517 ELKMHPHPVALTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedlgVEGTdlWLA-AS 1595
Cdd:COG2319 155 LRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG-AVRSVAFS--------PDGK--LLAsGS 223
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1596 GDQRVSVWasDWLRNHCelvdwlsfpmpaTTETQGHLPPSLA-AFCPwDGALLmyVGPGVYKEVIIYNLCQKQVVEKIPL 1674
Cdd:COG2319 224 ADGTVRLW--DLATGKL------------LRTLTGHSGSVRSvAFSP-DGRLL--ASGSADGTVRLWDLATGELLRTLTG 286
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217304987 1675 PFFA-MSLSLSPGTHLLAVGFAECMLRLVDCAMG-TAQDFAGHDNAVHLCRFTPSARLLFTAAR-NEILVWEVPG 1746
Cdd:COG2319 287 HSGGvNSVAFSPDGKLLASGSDDGTVRLWDLATGkLLRTLTGHTGAVRSVAFSPDGKTLASGSDdGTVRLWDLAT 361
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
631-964 |
2.94e-22 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 98.95 E-value: 2.94e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 631 ISSLSVSPA--MCAVGSEDGFLRLWPLDFSSVLLE-AEHEGPVSSVCVSPDGLRVLSATSSGHLGFLDT----LSRVYHM 703
Cdd:cd00200 12 VTCVAFSPDgkLLATGSGDGTIKVWDLETGELLRTlKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLetgeCVRTLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 704 larsHTAPVLALAMEQRRGQLATVSQDRTVRIWDLATLQQLYDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAA 783
Cdd:cd00200 92 ----HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 784 EVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQWHvlrvaadMVCPDAPASPSALAVSRDGRLLAfvgps 863
Cdd:cd00200 168 KCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL-------GTLRGHENGVNSVAFSPDGYLLA----- 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 864 rctvtvmgSASLDELLRVDigtldlassrldsamavcfgpaalghllvstssnrvvvlDAVSGRIIRELPGvHPEPCPSL 943
Cdd:cd00200 236 --------SGSEDGTIRVW---------------------------------------DLRTGECVQTLSG-HTNSVTSL 267
|
330 340
....*....|....*....|..
gi 2217304987 944 TLSEDARFLLIAAG-RTIKVWD 964
Cdd:cd00200 268 AWSPDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
414-737 |
3.11e-21 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 3.11e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 414 FLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDHhgrtMVVAW 493
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDK----TIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 494 GTGQvglgGEVVVLAKAHTDfDVQAfrVTFFDETRMASCG--QGSVRLWRLRGGVLRSCpvdLGEHHAlqftdlafkQAR 571
Cdd:cd00200 79 DLET----GECVRTLTGHTS-YVSS--VAFSPDGRILSSSsrDKTIKVWDVETGKCLTT---LRGHTD---------WVN 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 572 DGCPEPSAAMLFVCSRSGHILEIDcqrmvVRHARRLlptrtpggphpqkQTFSsGPGIAISSLSVSP--AMCAVGSEDGF 649
Cdd:cd00200 140 SVAFSPDGTFVASSSQDGTIKLWD-----LRTGKCV-------------ATLT-GHTGEVNSVAFSPdgEKLLSSSSDGT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 650 LRLWPLDFSSVL--LEAeHEGPVSSVCVSPDGLRVLSATSSGHLGFLDTLSRVYHMLARSHTAPVLALAMEQRRGQLATV 727
Cdd:cd00200 201 IKLWDLSTGKCLgtLRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279
|
330
....*....|
gi 2217304987 728 SQDRTVRIWD 737
Cdd:cd00200 280 SADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1124-1366 |
4.75e-19 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 89.70 E-value: 4.75e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1124 VWRPDTGFFAyTCG--RLVVVEDLHSGAQQH-WSGHSAEISTLALSHSAQVLASASgrssttAHCQIRVWDVSGGLCQHL 1200
Cdd:cd00200 58 AASADGTYLA-SGSsdKTIRLWDLETGECVRtLTGHTSYVSSVAFSPDGRILSSSS------RDKTIKVWDVETGKCLTT 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1201 IFPHSTTVLALAFSPDDRLLVTlGDHDGrTLALW----GTATYDLVSSTRlpePVHGVAFNPwDAGEL-TCVGQGTVTFW 1275
Cdd:cd00200 131 LRGHTDWVNSVAFSPDGTFVAS-SSQDG-TIKLWdlrtGKCVATLTGHTG---EVNSVAFSP-DGEKLlSSSSDGTIKLW 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1276 LLQQRGADISLQVRREPVpeavgagelTSLCYGAPPLLYCGTSS-GQVCVWDTRAGRCFLSWEADDGGIGLLLFS--GSR 1352
Cdd:cd00200 205 DLSTGKCLGTLRGHENGV---------NSVAFSPDGYLLASGSEdGTIRVWDLRTGECVQTLSGHTNSVTSLAWSpdGKR 275
|
250
....*....|....
gi 2217304987 1353 LVSGSSTGRLRLWA 1366
Cdd:cd00200 276 LASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1426-1744 |
3.53e-17 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 3.53e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1426 STRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVWALASMELVIQFQVLNQSCLCLAWSppccgrPEQQRLAAGYGDGS 1505
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS------ADGTYLASGSSDKT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1506 LRIFSVS--RTAMELKMHPHPValTTVAFSTDGQTVLSGDKDGLVAVSHPCTGTTFRVLSDHQGaPISTICVTckecedl 1583
Cdd:cd00200 75 IRLWDLEtgECVRTLTGHTSYV--SSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD-WVNSVAFS------- 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1584 gvEGTDLWLAASGDQRVSVWASDWLRnhcelvdwlsfpmpattetqghlppslaafcpwdgalLMYVGPGVYKEViiynl 1663
Cdd:cd00200 145 --PDGTFVASSSQDGTIKLWDLRTGK-------------------------------------CVATLTGHTGEV----- 180
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 1664 cqkqvvekiplpffaMSLSLSPGTHLLAVGFAECMLRLVDCAMGTA-QDFAGHDNAVHLCRFTPSARLLFTAARNE-ILV 1741
Cdd:cd00200 181 ---------------NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKClGTLRGHENGVNSVAFSPDGYLLASGSEDGtIRV 245
|
...
gi 2217304987 1742 WEV 1744
Cdd:cd00200 246 WDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
788-1009 |
5.35e-14 |
|
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 74.68 E-value: 5.35e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 788 EHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCADPQW-HVLRVAADmvcpdapaSPSALAVSRDGRLLAfvgpsrct 866
Cdd:cd00200 4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELlRTLKGHTG--------PVRDVAASADGTYLA-------- 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 867 vtvmgSASLDELLRV-DIGTLDLASS---RLDSAMAVCFGPAalGHLLVSTSSNRVVVL-DAVSGRIIRELPGvHPEPCP 941
Cdd:cd00200 68 -----SGSSDKTIRLwDLETGECVRTltgHTSYVSSVAFSPD--GRILSSSSRDKTIKVwDVETGKCLTTLRG-HTDWVN 139
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217304987 942 SLTLSEDARFLLIAAG-RTIKVWDYATqASPGpQVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWDVLA 1009
Cdd:cd00200 140 SVAFSPDGTFVASSSQdGTIKLWDLRT-GKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSdgTIKLWDLST 208
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
412-541 |
1.62e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 61.20 E-value: 1.62e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 412 RFFLGHTDKVSALALDGSSSLLASAQARapSVMRLWDFQTGRCLCLFRSPMHVVCSLSFSDSGALLCGVGKDhhgRTMVV 491
Cdd:cd00200 171 ATLTGHTGEVNSVAFSPDGEKLLSSSSD--GTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED---GTIRV 245
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 2217304987 492 aWgtgQVGLGGEVVVLaKAHTDFdVQAFRVTfFDETRMASCGQ-GSVRLWR 541
Cdd:cd00200 246 -W---DLRTGECVQTL-SGHTNS-VTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1424-1461 |
3.69e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.38 E-value: 3.69e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1424-1461 |
6.87e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 6.87e-06
10 20 30
....*....|....*....|....*....|....*...
gi 2217304987 1424 GTSTRLISGHRSKVNEVVFSPGESHCATCSEDGSVRVW 1461
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
840-967 |
1.44e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 45.46 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 840 DAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGtldlassrlDSAMAVCFGPAAlGHLLVS-TSSNR- 917
Cdd:COG3391 107 PVGGGPRGLAVDPDGGRLYVADSGNGRVSVIDTATGKVVATIPVG---------AGPHGIAVDPDG-KRLYVAnSGSNTv 176
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217304987 918 ---VVVLDAVSGRIIRELP-GVHPEpcpSLTLSEDARFLLIA---------AGRTIKVWDYAT 967
Cdd:COG3391 177 sviVSVIDTATGKVVATIPvGGGPV---GVAVSPDGRRLYVAnrgsntsngGSNTVSVIDLAT 236
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
745-928 |
3.67e-04 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 44.24 E-value: 3.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 745 YDFTSSEDAPCAVTFHPTRPTFFCGFSSGAVRSFSLEAAEVLVEHTCHRGAVTGLTATPDGRLLFSSCSQGSLAQYSCAD 824
Cdd:COG4257 10 YPVPAPGSGPRDVAVDPDGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDGNLWFTDNGNNRIGRIDPKT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 825 PQwhVLRVAAdmvcPDAPASPSALAVSRDGRLLafvgpsrctVTVMGSaslDELLRVDIGT----LDLASSRLDSAMAVC 900
Cdd:COG4257 90 GE--ITTFAL----PGGGSNPHGIAFDPDGNLW---------FTDQGG---NRIGRLDPATgevtEFPLPTGGAGPYGIA 151
|
170 180
....*....|....*....|....*....
gi 2217304987 901 FGPAalGHLLV-STSSNRVVVLDAVSGRI 928
Cdd:COG4257 152 VDPD--GNLWVtDFGANAIGRIDPDTGTL 178
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
706-737 |
8.87e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.45 E-value: 8.87e-04
10 20 30
....*....|....*....|....*....|..
gi 2217304987 706 RSHTAPVLALAMEQRRGQLATVSQDRTVRIWD 737
Cdd:smart00320 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
829-995 |
1.18e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 42.37 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 829 VLRVAADMVCPDAPASPSALAVSRDGRLLAFVGPSRCTVTVMGSASLDELLRVDIGTLDLASSRLDSAMAVCFGPAALGH 908
Cdd:COG3391 2 LVASSLLVAVLLAVLALAALAVAVAALGLGGGGPLLAAASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADGRR 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 909 LLVS-TSSNRVVVLDAVSGRIIRELP-GVHPEpcpSLTLSEDARFLLIAAGR--TIKVWDYATQASPGpQVYIGhSEPVq 984
Cdd:COG3391 82 LYVAnSGSGRVSVIDLATGKVVATIPvGGGPR---GLAVDPDGGRLYVADSGngRVSVIDTATGKVVA-TIPVG-AGPH- 155
|
170
....*....|.
gi 2217304987 985 AVAFSPDQQQV 995
Cdd:COG3391 156 GIAVDPDGKRL 166
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
666-744 |
3.24e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 41.83 E-value: 3.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 666 HEGPVSSVCVSPDGLRVLSATSSGHLGFLD----TLSRVYHMLArshTAPVLALAMEQRRGQLATVSQDRTVRIWDLATL 741
Cdd:cd22857 222 GETPIKAVAEDPDGHTVYVGDTSGDLASIDlrtgKLLGCFKGKC---GGSIRSIARHPELPLIASCGLDRYLRIWDTETR 298
|
...
gi 2217304987 742 QQL 744
Cdd:cd22857 299 QLL 301
|
|
| YvrE |
COG3386 |
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ... |
785-973 |
3.42e-03 |
|
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway
Pssm-ID: 442613 [Multi-domain] Cd Length: 266 Bit Score: 41.42 E-value: 3.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 785 VLVEHTCHRGavTGLTATPDGRLLFSSCSQGSLAQYSCADPQWHVLRvaadmvcpDAPASPSALAVSRDGRLLAfvgpsr 864
Cdd:COG3386 1 KLADAGFRLG--EGPVWDPDGRLYWVDIPGGRIHRYDPDGGAVEVFA--------EPSGRPNGLAFDPDGRLLV------ 64
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217304987 865 ctvtvmgSASLDELLRVDIGT------LDLASSRLDSAMAVCFGPAalGHLLVSTSSN-----RVVVLDAvSGRIIRELP 933
Cdd:COG3386 65 -------ADHGRGLVRFDPADgevtvlADEYGKPLNRPNDGVVDPD--GRLYFTDMGEylptgALYRVDP-DGSLRVLAD 134
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 2217304987 934 GVHpepCP-SLTLSEDARFLLIA--AGRTIKVWDYATQASPGP 973
Cdd:COG3386 135 GLT---FPnGIAFSPDGRTLYVAdtGAGRIYRFDLDADGTLGN 174
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
974-1006 |
4.78e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 4.78e-03
10 20 30
....*....|....*....|....*....|....*
gi 2217304987 974 QVYIGHSEPVQAVAFSPDQQQVLSAGD--AVFLWD 1006
Cdd:smart00320 6 KTLKGHTGPVTSVAFSPDGKYLASGSDdgTIKLWD 40
|
|
|