| Name | Accession | Description | Interval | E-value |
| TLE_N | pfam03920 | Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... | 15-138 | 9.89e-66 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 216.14 E-value: 9.89e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 15 IKFSVPDACDRIKDELSYLQQQCHTLKVECERLTQEKTELHRIYMVYYELAYGLNVEIHKqvnvffhrslQTEISKRLGA 94
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHK----------QTEIAKRLNA 70
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 674574518 95 IIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNLISQHQLLFA 138
Cdd:pfam03920 71 ICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQA 114
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 694-988 | 1.80e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 121.94 E-value: 1.80e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 694 HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATmdcLQG-GNYIRSIKMGQEGNLLVVGGEASV 771
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA----------TGKLLRT---LTGhSGAVTSVAFSPDGKLLASGSDDGT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 772 VSIWDMggLTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSATCAQLWT 851
Cdd:COG2319 186 VRLWDL--ATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLAS 263
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 852 GGLDKTVRNWDIREmHRQIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELF--APGRMdRYRLRMHESCVLSLKFAHS 928
Cdd:COG2319 264 GSADGTVRLWDLAT-GELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGTVRLWdlATGKL-LRTLTGHTGAVRSVAFSPD 341
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 674574518 929 GAWFASTGKDSWLNAWRPPWGANLFRVKE-SLSVLSCDISADDRHLVTGSGDRRASIYEIN 988
Cdd:COG2319 342 GKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| PPE super family | cl35037 | PPE-repeat protein [Function unknown]; | 81-286 | 2.20e-03 |
|
PPE-repeat protein [Function unknown]; The actual alignment was detected with superfamily member COG5651:
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 2.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 81 HRSLQTEISKRLGAIIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNlisqhQLLFAGANPIGGQLAAAFPPPGIPGLP 160
Cdd:COG5651 180 LLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPG-----NTGFAGTGAAAGAAAAAAAAAAAAGAG 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 161 QPPSSTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGPNVSTTTPNAGAFLGAPTSTSPGFPNSNLAQS 240
Cdd:COG5651 255 ASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAA 334
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 674574518 241 QSAMAAAMAAMAAGLQTNSAAAFGFPNSQTPNSSPAGLAANSLLPN 286
Cdd:COG5651 335 AAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASG 380
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 152-440 | 3.74e-03 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 152 PPPGIPGLPQPPSsTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGpnvSTTTPNAGAflGAPTSTSPG 231
Cdd:PHA03247 2712 PHALVSATPLPPG-PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP---APAPPAAPA--AGPPRRLTR 2785
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 232 FPNSNLAQSQSAMAAAMAAMAAglqtnSAAAFGFPNSQTPNSSPAGLAANsllPNMMSPTAlsalmgdksmdekqraaaa 311
Cdd:PHA03247 2786 PAVASLSESRESLPSPWDPADP-----PAAVLAPAAALPPAASPAGPLPP---PTSAQPTA------------------- 2838
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 312 AVAAAMTAAVAQQQCGGLTGVDPTAMMATianmSQSAPGTGFDLSHtacaaaaAAAAAAGVLPPPPPAPAPPQPPSGLPS 391
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPP----SRSPAAKPAAPAR-------PPVRRLARPAVSRSTESFALPPDQPER 2907
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 674574518 392 APAPLPEVPPTAAAVPPIPPLPSSAEPPRLHSPQALGASSPTQPARASS 440
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
|
|
|
| Name | Accession | Description | Interval | E-value |
| TLE_N | pfam03920 | Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... | 15-138 | 9.89e-66 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 216.14 E-value: 9.89e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 15 IKFSVPDACDRIKDELSYLQQQCHTLKVECERLTQEKTELHRIYMVYYELAYGLNVEIHKqvnvffhrslQTEISKRLGA 94
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHK----------QTEIAKRLNA 70
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 674574518 95 IIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNLISQHQLLFA 138
Cdd:pfam03920 71 ICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQA 114
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 694-988 | 1.80e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 121.94 E-value: 1.80e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 694 HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATmdcLQG-GNYIRSIKMGQEGNLLVVGGEASV 771
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA----------TGKLLRT---LTGhSGAVTSVAFSPDGKLLASGSDDGT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 772 VSIWDMggLTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSATCAQLWT 851
Cdd:COG2319 186 VRLWDL--ATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLAS 263
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 852 GGLDKTVRNWDIREmHRQIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELF--APGRMdRYRLRMHESCVLSLKFAHS 928
Cdd:COG2319 264 GSADGTVRLWDLAT-GELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGTVRLWdlATGKL-LRTLTGHTGAVRSVAFSPD 341
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 674574518 929 GAWFASTGKDSWLNAWRPPWGANLFRVKE-SLSVLSCDISADDRHLVTGSGDRRASIYEIN 988
Cdd:COG2319 342 GKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 689-986 | 3.14e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.51 E-value: 3.14e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 689 IQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAAATAEGAPTVSKSPLATMDCLQGGNYirsikmgqegnlLVVG 766
Cdd:cd00200 2 RRTLKgHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTY------------LASG 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 767 GEASVVSIWDMGglTPRPKGEIdIGV-QACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSAT 845
Cdd:cd00200 70 SSDKTIRLWDLE--TGECVRTL-TGHtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPD 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 846 CAQLWTGGLDKTVRNWDIREMHRqIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELFAPgRMDR--YRLRMHESCVLS 922
Cdd:cd00200 147 GTFVASSSQDGTIKLWDLRTGKC-VATLTgHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL-STGKclGTLRGHENGVNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 674574518 923 LKFAHSGAWFASTGKDSWLNAWRppwgANLFRVKESL-----SVLSCDISADDRHLVTGSGDRRASIYE 986
Cdd:cd00200 225 VAFSPDGYLLASGSEDGTIRVWD----LRTGECVQTLsghtnSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 823-862 | 3.89e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.83 E-value: 3.89e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 674574518 823 NQSQIRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWD 862
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 825-862 | 1.00e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.71 E-value: 1.00e-03
10 20 30
....*....|....*....|....*....|....*...
gi 674574518 825 SQIRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWD 862
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PPE | COG5651 | PPE-repeat protein [Function unknown]; | 81-286 | 2.20e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 2.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 81 HRSLQTEISKRLGAIIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNlisqhQLLFAGANPIGGQLAAAFPPPGIPGLP 160
Cdd:COG5651 180 LLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPG-----NTGFAGTGAAAGAAAAAAAAAAAAGAG 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 161 QPPSSTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGPNVSTTTPNAGAFLGAPTSTSPGFPNSNLAQS 240
Cdd:COG5651 255 ASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAA 334
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 674574518 241 QSAMAAAMAAMAAGLQTNSAAAFGFPNSQTPNSSPAGLAANSLLPN 286
Cdd:COG5651 335 AAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASG 380
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 152-440 | 3.74e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 152 PPPGIPGLPQPPSsTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGpnvSTTTPNAGAflGAPTSTSPG 231
Cdd:PHA03247 2712 PHALVSATPLPPG-PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP---APAPPAAPA--AGPPRRLTR 2785
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 232 FPNSNLAQSQSAMAAAMAAMAAglqtnSAAAFGFPNSQTPNSSPAGLAANsllPNMMSPTAlsalmgdksmdekqraaaa 311
Cdd:PHA03247 2786 PAVASLSESRESLPSPWDPADP-----PAAVLAPAAALPPAASPAGPLPP---PTSAQPTA------------------- 2838
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 312 AVAAAMTAAVAQQQCGGLTGVDPTAMMATianmSQSAPGTGFDLSHtacaaaaAAAAAAGVLPPPPPAPAPPQPPSGLPS 391
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPP----SRSPAAKPAAPAR-------PPVRRLARPAVSRSTESFALPPDQPER 2907
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 674574518 392 APAPLPEVPPTAAAVPPIPPLPSSAEPPRLHSPQALGASSPTQPARASS 440
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
|
|
|
| Name | Accession | Description | Interval | E-value |
| TLE_N | pfam03920 | Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... | 15-138 | 9.89e-66 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 216.14 E-value: 9.89e-66
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 15 IKFSVPDACDRIKDELSYLQQQCHTLKVECERLTQEKTELHRIYMVYYELAYGLNVEIHKqvnvffhrslQTEISKRLGA 94
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHK----------QTEIAKRLNA 70
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 674574518 95 IIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNLISQHQLLFA 138
Cdd:pfam03920 71 ICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQA 114
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 694-988 | 1.80e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 121.94 E-value: 1.80e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 694 HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATmdcLQG-GNYIRSIKMGQEGNLLVVGGEASV 771
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA----------TGKLLRT---LTGhSGAVTSVAFSPDGKLLASGSDDGT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 772 VSIWDMggLTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSATCAQLWT 851
Cdd:COG2319 186 VRLWDL--ATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLAS 263
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 852 GGLDKTVRNWDIREmHRQIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELF--APGRMdRYRLRMHESCVLSLKFAHS 928
Cdd:COG2319 264 GSADGTVRLWDLAT-GELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGTVRLWdlATGKL-LRTLTGHTGAVRSVAFSPD 341
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 674574518 929 GAWFASTGKDSWLNAWRPPWGANLFRVKE-SLSVLSCDISADDRHLVTGSGDRRASIYEIN 988
Cdd:COG2319 342 GKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 689-986 | 3.14e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.51 E-value: 3.14e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 689 IQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAAATAEGAPTVSKSPLATMDCLQGGNYirsikmgqegnlLVVG 766
Cdd:cd00200 2 RRTLKgHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTY------------LASG 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 767 GEASVVSIWDMGglTPRPKGEIdIGV-QACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSAT 845
Cdd:cd00200 70 SSDKTIRLWDLE--TGECVRTL-TGHtSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPD 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 846 CAQLWTGGLDKTVRNWDIREMHRqIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELFAPgRMDR--YRLRMHESCVLS 922
Cdd:cd00200 147 GTFVASSSQDGTIKLWDLRTGKC-VATLTgHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL-STGKclGTLRGHENGVNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 674574518 923 LKFAHSGAWFASTGKDSWLNAWRppwgANLFRVKESL-----SVLSCDISADDRHLVTGSGDRRASIYE 986
Cdd:cd00200 225 VAFSPDGYLLASGSEDGTIRVWD----LRTGECVQTLsghtnSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 690-988 | 1.96e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 113.08 E-value: 1.96e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 690 QTIEHGEVVCAVMINSSTRHIYTGGKGTVKLWDLAAATAEGAPTVSKSPlatmdclqggnyIRSIKMGQEGNLLVVGGEA 769
Cdd:COG2319 32 LLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA------------VLSVAFSPDGRLLASASAD 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 770 SVVSIWDMggLTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSATCAQL 849
Cdd:COG2319 100 GTVRLWDL--ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLL 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 850 WTGGLDKTVRNWDIREmHRQIAQFN-FNSQIFSLAKSPTEEWVAAGMESDEIELF-APGRMDRYRLRMHESCVLSLKFAH 927
Cdd:COG2319 178 ASGSDDGTVRLWDLAT-GKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWdLATGKLLRTLTGHSGSVRSVAFSP 256
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 674574518 928 SGAWFASTGKDSWLNAWRPPWGANLFRVKESLS-VLSCDISADDRHLVTGSGDRRASIYEIN 988
Cdd:COG2319 257 DGRLLASGSADGTVRLWDLATGELLRTLTGHSGgVNSVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 795-988 | 1.08e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.64 E-value: 1.08e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 795 CYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWDIR--EMHRQIAQ 872
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLEtgECVRTLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 873 fnFNSQIFSLAKSPTEEWVAAGMESDEIELF-APGRMDRYRLRMHESCVLSLKFAHSGaWFASTGK-DSWLNAWRppwgA 950
Cdd:cd00200 92 --HTSYVSSVAFSPDGRILSSSSRDKTIKVWdVETGKCLTTLRGHTDWVNSVAFSPDG-TFVASSSqDGTIKLWD----L 164
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 674574518 951 NLFRVKESL-----SVLSCDISADDRHLVTGSGDRRASIYEIN 988
Cdd:cd00200 165 RTGKCVATLtghtgEVNSVAFSPDGEKLLSSSSDGTIKLWDLS 207
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 685-862 | 5.52e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.32 E-value: 5.52e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 685 STKVIQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATMDclqgG--NYIRSIKMGQEG 760
Cdd:cd00200 124 TGKCLTTLRgHTDWVNSVAFSPDGTFVASSSQdGTIKLWDLR----------TGKCVATLT----GhtGEVNSVAFSPDG 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 761 NLLVVGGEASVVSIWDMggLTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCV 840
Cdd:cd00200 190 EKLLSSSSDGTIKLWDL--STGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSL 267
|
170 180
....*....|....*....|..
gi 674574518 841 DLSATCAQLWTGGLDKTVRNWD 862
Cdd:cd00200 268 AWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 684-894 | 2.51e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 91.51 E-value: 2.51e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 684 HSTKVIQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATmdcLQG-GNYIRSIKMGQEG 760
Cdd:COG2319 192 ATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSAdGTVRLWDLA----------TGKLLRT---LTGhSGSVRSVAFSPDG 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 761 NLLVVGGEASVVSIWDMGglTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCV 840
Cdd:COG2319 259 RLLASGSADGTVRLWDLA--TGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSV 336
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 674574518 841 DLSATCAQLWTGGLDKTVRNWDIREMHRQIAQFNFNSQIFSLAKSPTEEWVAAG 894
Cdd:COG2319 337 AFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASG 390
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 685-864 | 7.53e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 89.97 E-value: 7.53e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 685 STKVIQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATMDclQGGNYIRSIKMGQEGNL 762
Cdd:COG2319 235 TGKLLRTLTgHSGSVRSVAFSPDGRLLASGSAdGTVRLWDLA----------TGELLRTLT--GHSGGVNSVAFSPDGKL 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 763 LVVGGEASVVSIWDMGglTPRPKGEIDIGVQACYAVAVSNDSKLCYFCQSDGVISVWDLHNQSQIRRLQGHGDGASCVDL 842
Cdd:COG2319 303 LASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAF 380
|
170 180
....*....|....*....|..
gi 674574518 843 SATCAQLWTGGLDKTVRNWDIR 864
Cdd:COG2319 381 SPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 827-989 | 4.51e-10 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 61.97 E-value: 4.51e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 827 IRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWDIREMHRQIAQFNFNSQIFSLAKSPTEEWVAAGMESDEIELF-AP 905
Cdd:cd00200 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWdLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 906 GRMDRYRLRMHESCVLSLKFAHSGAWFASTGKDSWLNAWRPPWGANLFRVKE-SLSVLSCDISADDRHLVTGSGDRRASI 984
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFVASSSQDGTIKL 161
|
....*
gi 674574518 985 YEINY 989
Cdd:cd00200 162 WDLRT 166
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 823-862 | 3.89e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.83 E-value: 3.89e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 674574518 823 NQSQIRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWD 862
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 825-862 | 1.00e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.71 E-value: 1.00e-03
10 20 30
....*....|....*....|....*....|....*...
gi 674574518 825 SQIRRLQGHGDGASCVDLSATCAQLWTGGLDKTVRNWD 862
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PPE | COG5651 | PPE-repeat protein [Function unknown]; | 81-286 | 2.20e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 2.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 81 HRSLQTEISKRLGAIIAQVLPYLPQEHQAQVAAAVERAKQVTLPELNNlisqhQLLFAGANPIGGQLAAAFPPPGIPGLP 160
Cdd:COG5651 180 LLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPG-----NTGFAGTGAAAGAAAAAAAAAAAAGAG 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 161 QPPSSTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGPNVSTTTPNAGAFLGAPTSTSPGFPNSNLAQS 240
Cdd:COG5651 255 ASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAA 334
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 674574518 241 QSAMAAAMAAMAAGLQTNSAAAFGFPNSQTPNSSPAGLAANSLLPN 286
Cdd:COG5651 335 AAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASG 380
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 152-440 | 3.74e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 152 PPPGIPGLPQPPSsTVNGAGTLSSSPAGGVNSLSGTLLSLPGAPPNPFAPPITSASGpnvSTTTPNAGAflGAPTSTSPG 231
Cdd:PHA03247 2712 PHALVSATPLPPG-PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP---APAPPAAPA--AGPPRRLTR 2785
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 232 FPNSNLAQSQSAMAAAMAAMAAglqtnSAAAFGFPNSQTPNSSPAGLAANsllPNMMSPTAlsalmgdksmdekqraaaa 311
Cdd:PHA03247 2786 PAVASLSESRESLPSPWDPADP-----PAAVLAPAAALPPAASPAGPLPP---PTSAQPTA------------------- 2838
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 312 AVAAAMTAAVAQQQCGGLTGVDPTAMMATianmSQSAPGTGFDLSHtacaaaaAAAAAAGVLPPPPPAPAPPQPPSGLPS 391
Cdd:PHA03247 2839 PPPPPGPPPPSLPLGGSVAPGGDVRRRPP----SRSPAAKPAAPAR-------PPVRRLARPAVSRSTESFALPPDQPER 2907
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 674574518 392 APAPLPEVPPTAAAVPPIPPLPSSAEPPRLHSPQALGASSPTQPARASS 440
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 687-779 | 4.24e-03 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 40.66 E-value: 4.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 674574518 687 KVIQTIE-HGEVVCAVMINSSTRHIYTGGK-GTVKLWDLAaataegaptvSKSPLATmdcLQG-GNYIRSIKMGQEGNLL 763
Cdd:COG2319 321 KLLRTLTgHTGAVRSVAFSPDGKTLASGSDdGTVRLWDLA----------TGELLRT---LTGhTGAVTSVAFSPDGRTL 387
|
90
....*....|....*.
gi 674574518 764 VVGGEASVVSIWDMGG 779
Cdd:COG2319 388 ASGSADGTVRLWDLAT 403
|
|
|
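The per-hit statistics lines in the tables above can be read programmatically. The following is a minimal sketch, assuming every statistics line keeps the `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` layout seen in this listing, with an optional `[Multi-domain]` tag. The names `HitStats` and `parse_stats` are hypothetical and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    """Numeric fields from one hit's statistics line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the lines in this listing:
#   Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 121.94 E-value: 1.80e-29
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats(line: str) -> HitStats:
    """Parse one statistics line; raise ValueError if the layout differs."""
    m = _STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


if __name__ == "__main__":
    lines = [
        "Pssm-ID: 461094 Cd Length: 117 Bit Score: 216.14 E-value: 9.89e-66",
        "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 121.94 E-value: 1.80e-29",
        "Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.74e-03",
    ]
    for stats in map(parse_stats, lines):
        print(stats)
```

Parsed records like these could then be filtered by E-value, for example to separate the strong TLE_N and WD40 hits from the weak low-complexity matches (PPE, PHA03247) near 1e-03. Any such cutoff is the reader's choice and is not something this listing prescribes.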