|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
7.28e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization. :
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 7.28e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 564364087 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
485-770 |
3.49e-43 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.49e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 485 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 563
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 564 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 643
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 644 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 721
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 564364087 722 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 770
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-472 |
1.86e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 251 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 324
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 325 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTP----ISLTSSYAAPFAMMSHHEMNGSLTSP----------- 386
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTPtpnaTSPTVGETSPQANTTNHTLGGTSSTPvvtsppknats 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 387 SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPF 463
Cdd:pfam05109 628 AVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPG 704
|
....*....
gi 564364087 464 PHDALAGPG 472
Cdd:pfam05109 705 TTSQASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
7.28e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 7.28e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 564364087 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
485-770 |
3.49e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.49e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 485 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 563
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 564 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 643
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 644 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 721
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 564364087 722 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 770
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
484-769 |
5.93e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.02 E-value: 5.93e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 484 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 562
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 563 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 642
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 643 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 720
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 564364087 721 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 769
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
607-646 |
2.72e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.72e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 564364087 607 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 646
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
506-768 |
2.07e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 506 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 585
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 586 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 655
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 656 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 732
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 564364087 733 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 768
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
609-646 |
3.97e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.97e-06
10 20 30
....*....|....*....|....*....|....*...
gi 564364087 609 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 646
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-472 |
1.86e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 251 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 324
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 325 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTP----ISLTSSYAAPFAMMSHHEMNGSLTSP----------- 386
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTPtpnaTSPTVGETSPQANTTNHTLGGTSSTPvvtsppknats 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 387 SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPF 463
Cdd:pfam05109 628 AVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPG 704
|
....*....
gi 564364087 464 PHDALAGPG 472
Cdd:pfam05109 705 TTSQASGPG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-133 |
7.28e-91 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 280.08 E-value: 7.28e-91
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 564364087 98 QEHQQQVAQAVERAKQVTMTELNAIIGQ-QQLQAQHL 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQqQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
485-770 |
3.49e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 3.49e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 485 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 563
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 564 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 643
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 644 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 721
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 564364087 722 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 770
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
484-769 |
5.93e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.02 E-value: 5.93e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 484 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 562
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 563 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 642
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 643 RSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGK 720
Cdd:cd00200 160 KLWDLRTGKCVATlTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGSE 239
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 564364087 721 DNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 769
Cdd:cd00200 240 DGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
452-770 |
7.93e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.99 E-value: 7.93e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 452 VSADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIWDISQPGSKSPISQLdclnR 531
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 532 DNYIRSCKLLPDGRTLIVGGEASTLTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLV 611
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 612 RQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMESSNVEVLH- 687
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDl 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 688 HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISADDKYIVTGSGDKKAT 766
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 564364087 767 VYEV 770
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
478-730 |
4.42e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 149.68 E-value: 4.42e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 478 RQINTLS-HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 555
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 556 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWT 635
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 636 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGK 713
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDlATGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 564364087 714 WFVSTGKDNLLNAWRTP 730
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
579-770 |
3.53e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 106.27 E-value: 3.53e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 579 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 657
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 658 FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 735
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 564364087 736 --FQSkESSSVLSCDISADDKYIVTGSGDKKATVYEV 770
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
610-770 |
1.33e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 83.92 E-value: 1.33e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 610 LVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 686
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 687 H-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISADDKYIVTGSGDK 763
Cdd:cd00200 79 DlETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 564364087 764 KATVYEV 770
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
478-607 |
4.07e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 71.87 E-value: 4.07e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 478 RQINTLS-HGEVVCAVTISNPTRHVYTGGKGC-VKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 555
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 564364087 556 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHN 607
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
607-646 |
2.72e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.31 E-value: 2.72e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 564364087 607 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 646
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
506-768 |
2.07e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 506 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 585
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 586 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 655
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 656 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 732
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 564364087 733 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 768
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
609-646 |
3.97e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.97e-06
10 20 30
....*....|....*....|....*....|....*...
gi 564364087 609 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 646
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
251-472 |
1.86e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 41.83 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 251 DVSNEDPA-----TPRVSPAHSPPENGLD-KARGLKKDAPTSPASVASSSSTPSSKTKDlghNDKSSTPGLKSNTPTPRN 324
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTEsKAPDMTSPTSAVTTPTPNATSPTPAVTTP---TPNATSPTLGKTSPTSAV 548
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 325 DAPTP-GTSTTPGLrSMPGKPPGMDPIGIMA--SALRTP----ISLTSSYAAPFAMMSHHEMNGSLTSP----------- 386
Cdd:pfam05109 549 TTPTPnATSPTPAV-TTPTPNATIPTLGKTSptSAVTTPtpnaTSPTVGETSPQANTTNHTLGGTSSTPvvtsppknats 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 387 SAYAGLHNIPSQMSAAAAAAAAAYGR--SPMVSFGAVGfdpHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPF 463
Cdd:pfam05109 628 AVTTGQHNITSSSTSSMSLRPSSISEtlSPSTSDNSTS---HMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPG 704
|
....*....
gi 564364087 464 PHDALAGPG 472
Cdd:pfam05109 705 TTSQASGPG 713
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
565-604 |
2.38e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.14 E-value: 2.38e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 564364087 565 TPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWD 604
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
570-651 |
3.42e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 40.44 E-value: 3.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 564364087 570 AELTSSAPACYALAISPDAKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 642
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 564364087 643 RSWDLREGR 651
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
|