Conserved domains on  [gi|19075378|ref|NP_587878|]

U3 snoRNP-associated protein Utp21 [Schizosaccharomyces pombe]

Protein Classification

WD40 repeat and Utp21 domain-containing protein (domain architecture ID 13236737)

WD40 repeat and Utp21 domain-containing protein similar to human WD repeat-containing protein 36 and Schizosaccharomyces pombe U3 small nucleolar RNA-associated protein 21 (Utp21) homolog, which is involved in nucleolar processing of pre-18S ribosomal RNA and ribosome assembly

List of domain hits

Name Accession Description Interval E-value
Utp21 pfam04192
Utp21 specific WD40 associated putative domain; Utp21 is a subunit of U3 snoRNP, which is ...
699-901 2.22e-86

Utp21 specific WD40 associated putative domain; Utp21 is a subunit of U3 snoRNP, which is essential for synthesis of 18S rRNA.


Pssm-ID: 461219  Cd Length: 209  Bit Score: 274.41  E-value: 2.22e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378   699 DQLDPNLQTLSKLPRTQWQTLINLEAIKARNAPKEVPKVPEKAPFFLPSL--KDQSEATVPKQPIATEISKPTAVASIKV 776
Cdd:pfam04192   1 DQLSEDLVTLSLLPRSRWQTLLHLDLIKQRNKPKEAPKKPEKAPFFLPTLggLVGDFASVEAQEEEEEEEEEERSRLLKL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378   777 SG----TEFSTLLHG----NDDDAFFEYLKSLGPAKIDLEIRSLDAYPPYEEFILFINIMTRRLSKRRDFELVQACMSVF 848
Cdd:pfam04192  81 GSlgfeSEFTKLLREgsetGDYTPFLEYLKSLSPSAIDLEIRSLNSGGPLEELVSFIRALTSRLKSNRDFELVQAYMAVF 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 19075378   849 TKSHEDVLLMHDTPEdtvpVFESLKAWESVHKEENQRLLDLVGYCSGILSFMR 901
Cdd:pfam04192 161 LKLHGDVIHSNEEEE----LREALEEWKSVQEEEWERLDELVGYCSGVVGFLR 209
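Each hit record on this page carries an interval line (query start-end and E-value) and a statistics line (Pssm-ID, Cd Length, Bit Score, E-value). The following is a minimal sketch of how those two lines could be read into a structured record, assuming the plain-text layout shown here; the function name `parse_hit` and the regular expressions are illustrative assumptions, not part of any NCBI tool.

```python
import re

# Assumed layout of the statistics line, as it appears in this report.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)
# Assumed layout of the interval line: "start-end E-value".
INTERVAL_RE = re.compile(r"^(?P<start>\d+)-(?P<end>\d+)\s+(?P<e_value>\S+)$")


def parse_hit(interval_line, stats_line):
    """Parse one hit's interval and statistics lines into a dict (hypothetical helper)."""
    iv = INTERVAL_RE.match(interval_line.strip())
    st = STATS_RE.search(stats_line)
    if iv is None or st is None:
        raise ValueError("unrecognised hit record")
    start, end = int(iv["start"]), int(iv["end"])
    return {
        "start": start,
        "end": end,
        "query_span": end - start + 1,  # residues of the query covered by the hit
        "pssm_id": int(st["pssm_id"]),
        "cd_length": int(st["cd_length"]),
        "bit_score": float(st["bit_score"]),
        "e_value": float(st["e_value"]),
    }


hit = parse_hit(
    "699-901 2.22e-86",
    "Pssm-ID: 461219  Cd Length: 209  Bit Score: 274.41  E-value: 2.22e-86",
)
print(hit["query_span"], hit["cd_length"])
```

For the Utp21 hit, the query span (699-901) is slightly shorter than the model length because the alignment places gaps in the query row.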
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
284-600 3.70e-33

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
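The description above places a GH dipeptide near the start of each repeat and a WD dipeptide at its end. A rough sketch of scanning a sequence for such GH...WD cores follows. The spacing of 20 to 35 residues between the two dipeptides is an assumption chosen for illustration, the helper name `find_gh_wd_cores` is hypothetical, and many genuine WD40 repeats deviate from the consensus, so a scan like this is no substitute for the profile-based search that produced this report. The test segment is residues 561-600 of the query as they appear in the alignments on this page.

```python
import re

# Assumed spacing: 20-35 residues between the GH and WD dipeptides.
GH_WD_RE = re.compile(r"GH(.{20,35}?)WD")


def find_gh_wd_cores(sequence, offset=1):
    """Return (start, end, core) tuples for GH...WD cores; coordinates are 1-based
    and shifted so that the first residue of `sequence` has position `offset`."""
    hits = []
    for m in GH_WD_RE.finditer(sequence):
        hits.append((m.start() + offset, m.end() + offset - 1, m.group(0)))
    return hits


# Query residues 561-600 (gi 19075378), taken from the alignments in this report.
segment = "TRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD"
for start, end, core in find_gh_wd_cores(segment, offset=561):
    print(start, end, core)
```

Because `finditer` reports non-overlapping matches, closely packed or degenerate repeats may be missed.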


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 129.76  E-value: 3.70e-33
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 284 TYNAHFGSLPKIQFLNGQPILVTAGPDNSLKEWIFDSMDgaprILRSRNGHYEPPSFVKFYGKSvHFLISAATDRSLRav 363
Cdd:cd00200   4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE----LLRTLKGHTGPVRDVAASADG-TYLASGSSDKTIR-- 76
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 364 sLYqdsqstELSQGSVISkakklnvrpeELK--LPEITALSSSNTREkywdnVLTAHKNDSSARTWNWKS----KTLGQH 437
Cdd:cd00200  77 -LW------DLETGECVR----------TLTghTSYVSSVAFSPDGR-----ILSSSSRDKTIKVWDVETgkclTTLRGH 134
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 438 vlptsdGTSVRSVCVSCCGNFGLIGSSKGVVDVYNMQSGIKRKSFGQSSlsgKPVTAVMLDNVNRILVTASLDGILKFWD 517
Cdd:cd00200 135 ------TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHT---GEVNSVAFSPDGEKLLSSSSDGTIKLWD 205
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 518 FNKGNLIDSLDV-GSSITHAIYQHSSDLVAVACDDFGIRIVDVQTRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTI 596
Cdd:cd00200 206 LSTGKCLGTLRGhENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTI 285

                ....
gi 19075378 597 RTWD 600
Cdd:cd00200 286 RIWD 289
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
77-361 2.71e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 80.46  E-value: 2.71e-16
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  77 KEITCLKSFKDFMLVAAGSkifayKRGKI-IWDIDVEQEHGTVT-HLD--------AFGEWIIACTSSRHVYVWkhasKY 146
Cdd:cd00200  10 GGVTCVAFSPDGKLLATGS-----GDGTIkVWDLETGELLRTLKgHTGpvrdvaasADGTYLASGSSDKTIRLW----DL 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 147 SVPELHTTFLPNTNaDITSL-LHPSTYLnkILLGFSDGALQIWNLRVSKRVHEFQeFFGDGITSLTQAPVLDVLAVGTIS 225
Cdd:cd00200  81 ETGECVRTLTGHTS-YVSSVaFSPDGRI--LSSSSRDKTIKVWDVETGKCLTTLR-GHTDWVNSVAFSPDGTFVASSSQD 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 226 GRIVIFNLKNGSILMEFKQ-DGQVLSCSFRTDGTPILASSNPiGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFLNGQPIL 304
Cdd:cd00200 157 GTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDGEKLLSSSSD-GTIKLWDLSTGKCL-GTLRGHENGVNSVAFSPDGYLL 234
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 19075378 305 VTAGPDNSLKEWifDSMDGapRILRSRNGHYEPPSFVKFYGKSvHFLISAATDRSLR 361
Cdd:cd00200 235 ASGSEDGTIRVW--DLRTG--ECVQTLSGHTNSVTSLAWSPDG-KRLASGSADGTIR 286
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
565-641 9.39e-09

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 57.73  E-value: 9.39e-09
                        10        20        30        40        50        60        70
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 19075378 565 VRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSISTPSVC-TSLTFAPTGDYLATTHVDQVgISLW 641
Cdd:cd00200   2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPvRDVAASADGTYLASGSSDKT-IRLW 78
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
284-600 3.70e-33

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 129.76  E-value: 3.70e-33
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 284 TYNAHFGSLPKIQFLNGQPILVTAGPDNSLKEWIFDSMDgaprILRSRNGHYEPPSFVKFYGKSvHFLISAATDRSLRav 363
Cdd:cd00200   4 TLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE----LLRTLKGHTGPVRDVAASADG-TYLASGSSDKTIR-- 76
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 364 sLYqdsqstELSQGSVISkakklnvrpeELK--LPEITALSSSNTREkywdnVLTAHKNDSSARTWNWKS----KTLGQH 437
Cdd:cd00200  77 -LW------DLETGECVR----------TLTghTSYVSSVAFSPDGR-----ILSSSSRDKTIKVWDVETgkclTTLRGH 134
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 438 vlptsdGTSVRSVCVSCCGNFGLIGSSKGVVDVYNMQSGIKRKSFGQSSlsgKPVTAVMLDNVNRILVTASLDGILKFWD 517
Cdd:cd00200 135 ------TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHT---GEVNSVAFSPDGEKLLSSSSDGTIKLWD 205
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 518 FNKGNLIDSLDV-GSSITHAIYQHSSDLVAVACDDFGIRIVDVQTRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTI 596
Cdd:cd00200 206 LSTGKCLGTLRGhENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTI 285

                ....
gi 19075378 597 RTWD 600
Cdd:cd00200 286 RIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
246-641 2.77e-32

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 130.42  E-value: 2.77e-32
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 246 GQVLSCSFRTDGTPILASSNPIGDLSFWDLSKRRIQnvTYNAHFGSLPKIQFLNGQPILVTAGPDNSLKEWIFDSmdgaP 325
Cdd:COG2319  37 AAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA--TLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLAT----G 110
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 326 RILRSRNGHYEPPSFVKFY--GKsvhFLISAATDRSLRAVSLYQDSQSTELS--QGSVISkakklnvrpeelklpeiTAL 401
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSpdGK---TLASGSADGTVRLWDLATGKLLRTLTghSGAVTS-----------------VAF 170
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 402 SSSNTRekywdnVLTAHkNDSSARTWNWKSKTLgQHVLPTSDGtSVRSVCVSCCGNFGLIGSSKGVVDVYNMQSGIKRKS 481
Cdd:COG2319 171 SPDGKL------LASGS-DDGTVRLWDLATGKL-LRTLTGHTG-AVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRT 241
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 482 FGqssLSGKPVTAVMLDNVNRILVTASLDGILKFWDFNKGNLIDSLDVGSS-ITHAIYQHSSDLVAVACDDFGIRIVDVQ 560
Cdd:COG2319 242 LT---GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGgVNSVAFSPDGKLLASGSDDGTVRLWDLA 318
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 561 TRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSIS--TPSVcTSLTFAPTGDYLATTHVDQVgI 638
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTghTGAV-TSVAFSPDGRTLASGSADGT-V 396

                ...
gi 19075378 639 SLW 641
Cdd:COG2319 397 RLW 399
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
77-361 2.71e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 80.46  E-value: 2.71e-16
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  77 KEITCLKSFKDFMLVAAGSkifayKRGKI-IWDIDVEQEHGTVT-HLD--------AFGEWIIACTSSRHVYVWkhasKY 146
Cdd:cd00200  10 GGVTCVAFSPDGKLLATGS-----GDGTIkVWDLETGELLRTLKgHTGpvrdvaasADGTYLASGSSDKTIRLW----DL 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 147 SVPELHTTFLPNTNaDITSL-LHPSTYLnkILLGFSDGALQIWNLRVSKRVHEFQeFFGDGITSLTQAPVLDVLAVGTIS 225
Cdd:cd00200  81 ETGECVRTLTGHTS-YVSSVaFSPDGRI--LSSSSRDKTIKVWDVETGKCLTTLR-GHTDWVNSVAFSPDGTFVASSSQD 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 226 GRIVIFNLKNGSILMEFKQ-DGQVLSCSFRTDGTPILASSNPiGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFLNGQPIL 304
Cdd:cd00200 157 GTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDGEKLLSSSSD-GTIKLWDLSTGKCL-GTLRGHENGVNSVAFSPDGYLL 234
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 19075378 305 VTAGPDNSLKEWifDSMDGapRILRSRNGHYEPPSFVKFYGKSvHFLISAATDRSLR 361
Cdd:cd00200 235 ASGSEDGTIRVW--DLRTG--ECVQTLSGHTNSVTSLAWSPDG-KRLASGSADGTIR 286
WD40 COG2319
WD40 repeat [General function prediction only];
106-361 9.22e-13

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 71.10  E-value: 9.22e-13
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 106 IWDIDVEQEHGTVTHLD------AF---GEWIIACTSSRHVYVWKHASKysvPELHTtfLPNTNADITSL-LHPStylNK 175
Cdd:COG2319 104 LWDLATGLLLRTLTGHTgavrsvAFspdGKTLASGSADGTVRLWDLATG---KLLRT--LTGHSGAVTSVaFSPD---GK 175
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 176 ILL-GFSDGALQIWNLRVSKRVHEFQEfFGDGITSLTQAPVLDVLAVGTISGRIVIFNLKNGSILMEFK-QDGQVLSCSF 253
Cdd:COG2319 176 LLAsGSDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTgHSGSVRSVAF 254
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 254 RTDGTpILASSNPIGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFL-NGQpILVTAGPDNSLKEWIFDSmdgaPRILRSRN 332
Cdd:COG2319 255 SPDGR-LLASGSADGTVRLWDLATGELL-RTLTGHSGGVNSVAFSpDGK-LLASGSDDGTVRLWDLAT----GKLLRTLT 327
                       250       260       270
                ....*....|....*....|....*....|.
gi 19075378 333 GHYEPPSFVKF--YGKsvhFLISAATDRSLR 361
Cdd:COG2319 328 GHTGAVRSVAFspDGK---TLASGSDDGTVR 355
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
565-641 9.39e-09

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 57.73  E-value: 9.39e-09
                        10        20        30        40        50        60        70
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 19075378 565 VRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSISTPSVC-TSLTFAPTGDYLATTHVDQVgISLW 641
Cdd:cd00200   2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPvRDVAASADGTYLASGSSDKT-IRLW 78
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
561-600 8.78e-08

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.85  E-value: 8.78e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 19075378    561 TRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD 600
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
562-600 3.87e-07

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 46.95  E-value: 3.87e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 19075378   562 RKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD 600
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
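The pairwise rows in these alignment blocks can be compared column by column. Below is a small sketch that counts identical residues between a query row and a model row. It assumes that `-` marks a gap and that letter case can be ignored when comparing residues (lowercase appears in some rows of this report); the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row, subject_row):
    """Return (identical, aligned, percent) for two equal-length alignment rows.

    Columns where either row has a gap ('-') are skipped; comparison is
    case-insensitive. Both conventions are assumptions of this sketch.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    aligned = identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns")
    return identical, aligned, 100.0 * identical / aligned


# Rows of the pfam00400 alignment (query residues 562-600, model positions 1-39).
query = "RKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD"
model = "GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD"
identical, aligned, pct = percent_identity(query, model)
print(f"{identical}/{aligned} identical ({pct:.1f}%)")
```

Identity against a model's representative row is only a coarse indicator; the bit score and E-value reported for each hit come from the position-specific scoring matrix, not from this count.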
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
182-341 1.93e-04

protein SPA1-RELATED; Provisional


Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 45.08  E-value: 1.93e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  182 DGALQIWNLRVSKRVHEFQEFFGD--GITSLTQAPVLdvLAVGTISGRIVIFNLKNGSILMEFKQDGQVLSCSFRTDGTP 259
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRvwSIDYSSADPTL--LASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGR 631
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  260 ILASSNPIGDLSFWDLSKRRIQNVTYNAHFGSLPKIQFLNGQpILVTAGPDNSLKEWIFD-SMDG---APriLRSRNGHY 335
Cdd:PLN00181 632 SLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFVDSS-TLVSSSTDNTLKLWDLSmSISGineTP--LHSFMGHT 708

                 ....*.
gi 19075378  336 EPPSFV 341
Cdd:PLN00181 709 NVKNFV 714
PTZ00421 PTZ00421
coronin; Provisional
587-641 7.33e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 39.88  E-value: 7.33e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 19075378  587 LVTASLDGTIRTWDLPTGHLIDSISTPSV--------CTSLTFAPTG-DYLATTHVDQVgISLW 641
Cdd:PTZ00421  91 LFTASEDGTIMGWGIPEEGLTQNISDPIVhlqghtkkVGIVSFHPSAmNVLASAGADMV-VNVW 153
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
176-603 5.37e-31

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.56  E-value: 5.37e-31
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 176 ILLGFSDGALQIWNLRVSKRVHEFQEFFGDGITSLTQAPVLDVLAVGTISGRIVIFNLKNGSILMEFKQDGQVLSCSFRT 255
Cdd:COG2319   9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 256 DGTpILASSNPIGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFL-NGQpILVTAGPDNSLKEWifDSMDGapRILRSRNGH 334
Cdd:COG2319  89 DGR-LLASASADGTVRLWDLATGLLL-RTLTGHTGAVRSVAFSpDGK-TLASGSADGTVRLW--DLATG--KLLRTLTGH 161
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 335 YEPPSFVKFygkSV--HFLISAATDRSLRAVSLYQDSQSTELS--QGSVISkakklnvrpeelklpeiTALSSSNTReky 410
Cdd:COG2319 162 SGAVTSVAF---SPdgKLLASGSDDGTVRLWDLATGKLLRTLTghTGAVRS-----------------VAFSPDGKL--- 218
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 411 wdnVLTAHkNDSSARTWNWKSKTLGQhvLPTSDGTSVRSVCVSCCGNFGLIGSSKGVVDVYNMQSGIKRKSFGQSSlsgK 490
Cdd:COG2319 219 ---LASGS-ADGTVRLWDLATGKLLR--TLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHS---G 289
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 491 PVTAVMLDNVNRILVTASLDGILKFWDFNKGNLIDSLDVGSSITHAIyQHSSD--LVAVACDDFGIRIVDVQTRKIVREL 568
Cdd:COG2319 290 GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSV-AFSPDgkTLASGSDDGTVRLWDLATGELLRTL 368
                       410       420       430
                ....*....|....*....|....*....|....*
gi 19075378 569 WGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPT 603
Cdd:COG2319 369 TGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
421-652 2.78e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 115.51  E-value: 2.78e-28
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 421 DSSARTWNWKS----KTLGQHVLPtsdgtsVRSVCVSCCGNFGLIGSSKGVVDVYNMQSGIKRKSFGQSSlsgKPVTAVM 496
Cdd:cd00200  30 DGTIKVWDLETgellRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHT---SYVSSVA 100
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 497 LDNVNRILVTASLDGILKFWDFNKGNLIDSL-DVGSSITHAIYQHSSDLVAVACDDFGIRIVDVQTRKIVRELWGHSNRL 575
Cdd:cd00200 101 FSPDGRILSSSSRDKTIKVWDVETGKCLTTLrGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEV 180
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 19075378 576 TSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSISTPSV-CTSLTFAPTGDYLATTHVDQVgISLWtNLSMFKHVST 652
Cdd:cd00200 181 NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENgVNSVAFSPDGYLLASGSEDGT-IRVW-DLRTGECVQT 256
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
490-652 1.03e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 102.41  E-value: 1.03e-23
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 490 KPVTAVMLDNVNRILVTASLDGILKFWDFNKGNLIDSLdVG--SSITHAIYQHSSDLVAVACDDFGIRIVDVQTRKIVRE 567
Cdd:cd00200  10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTL-KGhtGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRT 88
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 568 LWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSIS--TPSVcTSLTFAPTGDYLATTHVDQVgISLWtNLS 645
Cdd:cd00200  89 LTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRghTDWV-NSVAFSPDGTFVASSSQDGT-IKLW-DLR 165

                ....*..
gi 19075378 646 MFKHVST 652
Cdd:cd00200 166 TGKCVAT 172
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
77-361 2.71e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 80.46  E-value: 2.71e-16
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  77 KEITCLKSFKDFMLVAAGSkifayKRGKI-IWDIDVEQEHGTVT-HLD--------AFGEWIIACTSSRHVYVWkhasKY 146
Cdd:cd00200  10 GGVTCVAFSPDGKLLATGS-----GDGTIkVWDLETGELLRTLKgHTGpvrdvaasADGTYLASGSSDKTIRLW----DL 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 147 SVPELHTTFLPNTNaDITSL-LHPSTYLnkILLGFSDGALQIWNLRVSKRVHEFQeFFGDGITSLTQAPVLDVLAVGTIS 225
Cdd:cd00200  81 ETGECVRTLTGHTS-YVSSVaFSPDGRI--LSSSSRDKTIKVWDVETGKCLTTLR-GHTDWVNSVAFSPDGTFVASSSQD 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 226 GRIVIFNLKNGSILMEFKQ-DGQVLSCSFRTDGTPILASSNPiGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFLNGQPIL 304
Cdd:cd00200 157 GTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDGEKLLSSSSD-GTIKLWDLSTGKCL-GTLRGHENGVNSVAFSPDGYLL 234
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 19075378 305 VTAGPDNSLKEWifDSMDGapRILRSRNGHYEPPSFVKFYGKSvHFLISAATDRSLR 361
Cdd:cd00200 235 ASGSEDGTIRVW--DLRTG--ECVQTLSGHTNSVTSLAWSPDG-KRLASGSADGTIR 286
WD40 COG2319
WD40 repeat [General function prediction only];
106-361 9.22e-13

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 71.10  E-value: 9.22e-13
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 106 IWDIDVEQEHGTVTHLD------AF---GEWIIACTSSRHVYVWKHASKysvPELHTtfLPNTNADITSL-LHPStylNK 175
Cdd:COG2319 104 LWDLATGLLLRTLTGHTgavrsvAFspdGKTLASGSADGTVRLWDLATG---KLLRT--LTGHSGAVTSVaFSPD---GK 175
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 176 ILL-GFSDGALQIWNLRVSKRVHEFQEfFGDGITSLTQAPVLDVLAVGTISGRIVIFNLKNGSILMEFK-QDGQVLSCSF 253
Cdd:COG2319 176 LLAsGSDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTgHSGSVRSVAF 254
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 254 RTDGTpILASSNPIGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFL-NGQpILVTAGPDNSLKEWIFDSmdgaPRILRSRN 332
Cdd:COG2319 255 SPDGR-LLASGSADGTVRLWDLATGELL-RTLTGHSGGVNSVAFSpDGK-LLASGSDDGTVRLWDLAT----GKLLRTLT 327
                       250       260       270
                ....*....|....*....|....*....|.
gi 19075378 333 GHYEPPSFVKF--YGKsvhFLISAATDRSLR 361
Cdd:COG2319 328 GHTGAVRSVAFspDGK---TLASGSDDGTVR 355
WD40 COG2319
WD40 repeat [General function prediction only];
504-641 5.93e-11

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 65.32  E-value: 5.93e-11
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 504 LVTASLDGILKFWDFNKGNLIDSLDVGSS-ITHAIYQHSSDLVAVACDDFGIRIVDVQTRKIVRELWGHSNRLTSFDFSD 582
Cdd:COG2319   9 LAAASADLALALLAAALGALLLLLLGLAAaVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                        90       100       110       120       130       140
                ....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 583 TGRWLVTASLDGTIRTWDLPTGHLIDSIS-TPSVCTSLTFAPTGDYLATTHVDQvGISLW 641
Cdd:COG2319  89 DGRLLASASADGTVRLWDLATGLLLRTLTgHTGAVRSVAFSPDGKTLASGSADG-TVRLW 147
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
501-628 1.37e-10

Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 62.40  E-value: 1.37e-10
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 501 NRILVTASLDGILKFWDFNKGNLIDSLDVGSSITHAIYQHSSDLVAVAC-DDFGIRIVDVQTRKIVRElWGHSNRLTSFD 579
Cdd:COG3391  80 RRLYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVADsGNGRVSVIDTATGKVVAT-IPVGAGPHGIA 158
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....
gi 19075378 580 FSDTGRWLVTASLDGT-----IRTWDLPTGHLIDSISTPSVCTSLTFAPTGDYL 628
Cdd:COG3391 159 VDPDGKRLYVANSGSNtvsviVSVIDTATGKVVATIPVGGGPVGVAVSPDGRRL 212
WD40 COG2319
WD40 repeat [General function prediction only];
89-316 7.97e-10

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 61.85  E-value: 7.97e-10
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  89 MLVAAGS----KIFAYKRGKIIWDIDVEQehGTVTHLdAF---GEWIIACTSSRHVYVWKHASKysvPELHTtfLPNTNA 161
Cdd:COG2319 176 LLASGSDdgtvRLWDLATGKLLRTLTGHT--GAVRSV-AFspdGKLLASGSADGTVRLWDLATG---KLLRT--LTGHSG 247
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 162 DITSL-LHPStylNKILL-GFSDGALQIWNLRVSKRVHEFQEFfGDGITSLTQAPVLDVLAVGTISGRIVIFNLKNGSIL 239
Cdd:COG2319 248 SVRSVaFSPD---GRLLAsGSADGTVRLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLL 323
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 19075378 240 MEFK-QDGQVLSCSFRTDGTpILASSNPIGDLSFWDLSKRRIQnVTYNAHFGSLPKIQFLNGQPILVTAGPDNSLKEW 316
Cdd:COG2319 324 RTLTgHTGAVRSVAFSPDGK-TLASGSDDGTVRLWDLATGELL-RTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
565-641 9.39e-09

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 57.73  E-value: 9.39e-09
                        10        20        30        40        50        60        70
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 19075378 565 VRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWDLPTGHLIDSISTPSVC-TSLTFAPTGDYLATTHVDQVgISLW 641
Cdd:cd00200   2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPvRDVAASADGTYLASGSSDKT-IRLW 78
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
561-600 8.78e-08

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 48.85  E-value: 8.78e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 19075378    561 TRKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD 600
Cdd:smart00320   1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 COG2319
WD40 repeat [General function prediction only];
47-276 1.50e-07

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 54.53  E-value: 1.50e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  47 HFLVTTSVGNTFQTYDCEKLNLLFVGKQLDKEITCLK-SFKDFMLVAAGSkifaykRGKI-IWDIDVEQEHGTVTHLD-- 122
Cdd:COG2319 175 KLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAfSPDGKLLASGSA------DGTVrLWDLATGKLLRTLTGHSgs 248
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 123 ----AF---GEWIIACTSSRHVYVWKHASKysvPELHTtfLPNTNADITSL-LHPStylNKILL-GFSDGALQIWNLRVS 193
Cdd:COG2319 249 vrsvAFspdGRLLASGSADGTVRLWDLATG---ELLRT--LTGHSGGVNSVaFSPD---GKLLAsGSDDGTVRLWDLATG 320
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 194 KRVHEFQEFfGDGITSLTQAPVLDVLAVGTISGRIVIFNLKNGSILMEFKQ-DGQVLSCSFRTDGTpILASSNPIGDLSF 272
Cdd:COG2319 321 KLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGR-TLASGSADGTVRL 398

                ....
gi 19075378 273 WDLS 276
Cdd:COG2319 399 WDLA 402
WD40 pfam00400
WD domain, G-beta repeat;
562-600 3.87e-07

Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 46.95  E-value: 3.87e-07
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 19075378   562 RKIVRELWGHSNRLTSFDFSDTGRWLVTASLDGTIRTWD 600
Cdd:pfam00400   1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
491-631 9.39e-05

Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 45.07  E-value: 9.39e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 491 PVTAVMLDNVNRILVTASLDGILKFWDFNKGNLIDSLDVGSSITHAIYQHSSD---LVAVACDDFGIRIVDVQTRKIVRE 567
Cdd:COG3391  26 AALGLGGGGPLLAAASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADgrrLYVANSGSGRVSVIDLATGKVVAT 105
                        90       100       110       120       130       140
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 19075378 568 lWGHSNRLTSFDFS-DTGRWLVTASLDGTIRTWDLPTGHLIDSISTPSVCTSLTFAPTGDYLATT 631
Cdd:COG3391 106 -IPVGGGPRGLAVDpDGGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVA 169
PLN00181 PLN00181
protein SPA1-RELATED; Provisional
182-341 1.93e-04

Pssm-ID: 177776 [Multi-domain]  Cd Length: 793  Bit Score: 45.08  E-value: 1.93e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  182 DGALQIWNLRVSKRVHEFQEFFGD--GITSLTQAPVLdvLAVGTISGRIVIFNLKNGSILMEFKQDGQVLSCSFRTDGTP 259
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRvwSIDYSSADPTL--LASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGR 631
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  260 ILASSNPIGDLSFWDLSKRRIQNVTYNAHFGSLPKIQFLNGQpILVTAGPDNSLKEWIFD-SMDG---APriLRSRNGHY 335
Cdd:PLN00181 632 SLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRFVDSS-TLVSSSTDNTLKLWDLSmSISGineTP--LHSFMGHT 708

                 ....*.
gi 19075378  336 EPPSFV 341
Cdd:PLN00181 709 NVKNFV 714
PTZ00421 PTZ00421
coronin; Provisional
587-641 7.33e-03

Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 39.88  E-value: 7.33e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 19075378  587 LVTASLDGTIRTWDLPTGHLIDSISTPSV--------CTSLTFAPTG-DYLATTHVDQVgISLW 641
Cdd:PTZ00421  91 LFTASEDGTIMGWGIPEEGLTQNISDPIVhlqghtkkVGIVSFHPSAmNVLASAGADMV-VNVW 153
WD40 COG2319
WD40 repeat [General function prediction only];
47-235 8.29e-03

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 39.51  E-value: 8.29e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  47 HFLVTTSVGNTFQTYDCEKLNLLFVGKQLDKEITCLkSF-KDFMLVAAGSKifaykRGKI-IWDIDVEQEHGTVTHLD-- 122
Cdd:COG2319 217 KLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSV-AFsPDGRLLASGSA-----DGTVrLWDLATGELLRTLTGHSgg 290
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 123 ----AF---GEWIIACTSSRHVYVWKHASKysvPELHTtfLPNTNADITSL-LHPStylNKILL-GFSDGALQIWNLRVS 193
Cdd:COG2319 291 vnsvAFspdGKLLASGSDDGTVRLWDLATG---KLLRT--LTGHTGAVRSVaFSPD---GKTLAsGSDDGTVRLWDLATG 362
                       170       180       190       200
                ....*....|....*....|....*....|....*....|..
gi 19075378 194 KRVHEFQEfFGDGITSLTQAPVLDVLAVGTISGRIVIFNLKN 235
Cdd:COG2319 363 ELLRTLTG-HTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
47-232 8.97e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 39.24  E-value: 8.97e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378  47 HFLVTTSVGNTFQTYDCEKLNLLFVGKQLDKEITCLKSFKDFMLVAAGS-----KIFAYKRGKIIWDIDVEQehGTVTHL 121
Cdd:cd00200 106 RILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdgtiKLWDLRTGKCVATLTGHT--GEVNSV 183
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 19075378 122 DAF--GEWIIACTSSRHVYVWKHASKysvpELHTTFLPNTNAdITSLL-HPSTYLnkILLGFSDGALQIWNLRVSKRVHE 198
Cdd:cd00200 184 AFSpdGEKLLSSSSDGTIKLWDLSTG----KCLGTLRGHENG-VNSVAfSPDGYL--LASGSEDGTIRVWDLRTGECVQT 256
                       170       180       190
                ....*....|....*....|....*....|....
gi 19075378 199 FQEFFGdGITSLTQAPVLDVLAVGTISGRIVIFN 232
Cdd:cd00200 257 LSGHTN-SVTSLAWSPDGKRLASGSADGTIRIWD 289
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
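
The per-hit summary lines in this listing (`Pssm-ID: … Cd Length: … Bit Score: … E-value: …`) can be read programmatically and checked against the E-value threshold given in the search parameters. The following is a minimal sketch in Python, not part of the CD-Search output or any NCBI tool. The names `SUMMARY`, `parse_summary`, and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for the summary line printed above each alignment in this listing.
# The "[Multi-domain]" flag is optional (the Utp21 hit, for example, lacks it).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 130.42  E-value: 2.77e-32"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed on this page has an E-value below 0.01, so each would pass this check under the stated assumption.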

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.