|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... |
17-133 |
3.79e-87 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 269.30 E-value: 3.79e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 17 FKFSILEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSGICAQIIPFLT 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 768003721 97 QEHQQQVLQAVERAKQVTVGELNSLIGQQQLQPLSHH 133
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQAQHL 117
|
|
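Note: the two rows of each alignment block (the `gi` query row and the `Cdd:` model row) can be compared column by column. The sketch below is an illustration, not part of the CD-Search output: it computes the fraction of identical columns for the last TLE_N block (query residues 97-133 against model positions 81-117). The function name `percent_identity`, the case-insensitive comparison, and the choice to skip gap columns are assumptions of this sketch.

```python
def percent_identity(query: str, consensus: str) -> float:
    """Fraction of ungapped aligned columns whose residues match.

    Comparison is case-insensitive; columns with '-' in either row are
    skipped. Both choices are assumptions made for this illustration.
    """
    if len(query) != len(consensus):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    identical = 0
    for q, c in zip(query, consensus):
        if q == "-" or c == "-":
            continue  # gap column: not counted
        compared += 1
        if q.upper() == c.upper():
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return identical / compared


# Last alignment block of the TLE_N hit, copied from the rows above.
query_block = "QEHQQQVLQAVERAKQVTVGELNSLIGQQQLQPLSHH"
model_block = "QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQAQHL"
print(f"{percent_identity(query_block, model_block):.1%}")
```

The same function can be applied to any other block, provided the two rows are pasted with their gap characters intact so that the columns stay in register.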
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
327-730 |
7.55e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.38 E-value: 7.55e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 327 LAAKPAPSTDSVALRSPLTLSSPFTTSFSLGSHSTLNGDLSVPSSYVSLHLSPQMAFESHPHLRGSSVSSSLPSIPGGKP 406
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 407 AYSFHVSADGQMQPVPFPSDALVGAGIPRHARQLHTLAHGEVVCAVTISGSTQHVYTGGK-GCVKVWDVGQPgakTPVAQ 485
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATG---KLLRT 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 486 LDclNRDNYIRSCKLLPDGRSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDL 565
Cdd:COG2319 158 LT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 566 QNQTMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSNVE 644
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 645 ILHVRKPEK-YQLHLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSKE-SSSVLSCDISRNNKYIVTGSGD 722
Cdd:COG2319 314 LWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSAD 393
|
....*...
gi 768003721 723 KKATVYEV 730
Cdd:COG2319 394 GTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
445-729 |
1.29e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 1.29e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 445 HGEVVCAVTISGSTQHVYTGGK-GCVKVWDVgqpgaKTPVAQLDCLNRDNYIRSCKLLPDGRSLIVGGEASTLSIWDLaa 523
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDL-----ETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 524 PTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVR 603
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 604 CWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSNVEILHVRKPE-KYQLHLHESCVLSLKFASCGRWFVSTG 679
Cdd:cd00200 161 LWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 768003721 680 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYE 729
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
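Note: the cd00200 description characterises a WD40 repeat as roughly 40 residues with a GH dipeptide near the N-terminus, a WD dipeptide at the C-terminus, and a conserved core in between. As a rough illustration only, the sketch below scans a sequence for GH ... WD candidates with a regular expression. The function name `find_gh_wd_spans` and the core-length bounds `min_gap` and `max_gap` are hypothetical values chosen for this example, not figures from CDD; real repeats often carry degenerate dipeptides, so this is no substitute for the profile search reported above.

```python
import re


def find_gh_wd_spans(seq: str, min_gap: int = 12, max_gap: int = 30):
    """Return 1-based inclusive (start, end) spans of GH ... WD candidates.

    min_gap and max_gap bound the number of residues between the GH and WD
    dipeptides; both defaults are assumptions made for this sketch.
    """
    # Lazy quantifier: stop at the first WD that satisfies the gap bounds.
    pattern = re.compile(r"GH[A-Z]{%d,%d}?WD" % (min_gap, max_gap))
    return [(m.start() + 1, m.end()) for m in pattern.finditer(seq.upper())]


# Query residues 571-606, the segment matched by smart00320 and pfam00400.
segment = "VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD"
offset = 571  # query coordinate of the first residue in `segment`
for start, end in find_gh_wd_spans(segment):
    print(offset + start - 1, offset + end - 1)
```

Widening or narrowing the gap bounds changes which candidates are reported, so any span found this way should be checked against the domain hits in the table.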
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
438-690 |
2.96e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.20 E-value: 2.96e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 438 RQLHTL-AHGEVVCAVTISGSTQHVYTGGK-GCVKVWDVGQPgakTPVAQLDclNRDNYIRSCKLLPDGRSLIVGGEAST 515
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLATG---KLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 516 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGTRLWT 595
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 596 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSNVEILHVR-KPEKYQLHLHESCVLSLKFASCGR 673
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 768003721 674 WFVSTGKDNLLNAWRTP 690
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
438-688 |
4.02e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 132.07 E-value: 4.02e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 438 RQLHTL-AHGEVVCAVTISGSTQHVYTGGK-GCVKVWDVGQPgaktpvaqlDCLNR----DNYIRSCKLLPDGRSLIVGG 511
Cdd:cd00200 42 ELLRTLkGHTGPVRDVAASADGTYLASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGRILSSSS 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 512 EASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGT 591
Cdd:cd00200 113 RDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGE 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 592 RLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSNVEILHVRKPE-KYQLHLHESCVLSLKFA 669
Cdd:cd00200 191 KLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSVTSLAWS 270
|
250
....*....|....*....
gi 768003721 670 SCGRWFVSTGKDNLLNAWR 688
Cdd:cd00200 271 PDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
539-730 |
9.90e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 104.72 E-value: 9.90e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 539 CYALAVSPDAKVCFSCCSDGNIVVWDLQNQTMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HD 617
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 618 FSSQIFSLGHCPNQDWLAVGMESSNVEILHVRKPE-KYQLHLHESCVLSLKFASCGRwFVSTGK-DNLLNAWRTPYGASI 695
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFSPDGT-FVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 768003721 696 --FQSkESSSVLSCDISRNNKYIVTGSGDKKATVYEV 730
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
570-730 |
1.17e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.01 E-value: 1.17e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 570 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSNVEIL 646
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 647 HVRKPEK-YQLHLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISRNNKYIVTGSGDK 723
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 768003721 724 KATVYEV 730
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
404-567 |
2.56e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 72.64 E-value: 2.56e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 404 GKPAYSFHVSADGQMqpvpfpsdaLVGAGIPRHAR--------QLHTLA-HGEVVCAVTISGSTQHVYTGGKGC-VKVWD 473
Cdd:COG2319 246 SGSVRSVAFSPDGRL---------LASGSADGTVRlwdlatgeLLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWD 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 474 VGQPgakTPVAQLDclNRDNYIRSCKLLPDGRSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFS 553
Cdd:COG2319 317 LATG---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
170
....*....|....
gi 768003721 554 CCSDGNIVVWDLQN 567
Cdd:COG2319 390 GSADGTVRLWDLAT 403
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
127-367 |
3.44e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.26 E-value: 3.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 127 LQPLSHHAPPVPLTPRPAGLVGGSATGLLALSGALAAQAQLAAAVKEDR----AGVEAEGSRVERAPSRSASPSPPESLV 202
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRrarrLGRAAQASSPPQRPRRRAARPTVGSLT 2696
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 203 EEERPSGPGGGGKQRADEKEPSGPYESDEDKSDYNLVVDEDQPSEPPSPAT--TPCGKVPICIPARRDLVDSPASLASSL 280
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPA 2776
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 281 GSPLPRAKELILNDLPASTPASKSCDSSPPQDASTPGPSSASHLCQLAAKPAPSTDSVALRSPLTLSSPFTTSFSLGSHS 360
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
|
....*..
gi 768003721 361 TLNGDLS 367
Cdd:PHA03247 2857 APGGDVR 2863
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
571-606 |
4.64e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.54 E-value: 4.64e-07
10 20 30
....*....|....*....|....*....|....*.
gi 768003721 571 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 606
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
571-606 |
6.91e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.49 E-value: 6.91e-06
10 20 30
....*....|....*....|....*....|....*.
gi 768003721 571 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 606
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
500-607 |
4.58e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 42.37 E-value: 4.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 500 LLPDGRSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQTMVR 572
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 768003721 573 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 607
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
525-564 |
1.21e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.91 E-value: 1.21e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 768003721 525 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 564
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
206-352 |
7.98e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 39.92 E-value: 7.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 206 RPSGPGgggkQRADEKEPSGPYESDEDKSDynlVVDEDQPSEPPSPATTPCGKVPICIPArrdlvDSPASLASSLGSPLP 285
Cdd:PHA03247 2576 RPSEPA----VTSRARRPDAPPQSARPRAP---VDDRGDPRGPAPPSPLPPDTHAPDPPP-----PSPSPAANEPDPHPP 2643
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768003721 286 RAKELILNDLPASTP------------ASKSCDSSPPQDASTPG-PSSASHLCQLAAKPAPSTDSVALRSPLTLSSPFTT 352
Cdd:PHA03247 2644 PTVPPPERPRDDPAPgrvsrprrarrlGRAAQASSPPQRPRRRAaRPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPP 2723
|
|
|
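Note: several accessions (COG2319, cd00200, smart00320, PHA03247) appear more than once in the full hit list, each time over a different query interval. One way to summarise the table is to keep only the lowest-E-value hit per accession below a cutoff. The sketch below does that with the Name, Accession, Interval, and E-value columns transcribed from the rows above; the `Hit` class, the function `best_per_accession`, and the `1e-5` cutoff are hypothetical choices for illustration, not CD-Search settings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    name: str
    accession: str
    start: int      # first query residue of the interval
    end: int        # last query residue of the interval
    evalue: float


# Transcribed from the full hit table above, in the order listed.
HITS = [
    Hit("TLE_N", "pfam03920", 17, 133, 3.79e-87),
    Hit("WD40", "COG2319", 327, 730, 7.55e-42),
    Hit("WD40", "cd00200", 445, 729, 1.29e-36),
    Hit("WD40", "COG2319", 438, 690, 2.96e-36),
    Hit("WD40", "cd00200", 438, 688, 4.02e-34),
    Hit("WD40", "cd00200", 539, 730, 9.90e-25),
    Hit("WD40", "cd00200", 570, 730, 1.17e-18),
    Hit("WD40", "COG2319", 404, 567, 2.56e-13),
    Hit("PHA03247", "PHA03247", 127, 367, 3.44e-08),
    Hit("WD40", "smart00320", 571, 606, 4.64e-07),
    Hit("WD40", "pfam00400", 571, 606, 6.91e-06),
    Hit("YncE", "COG3391", 500, 607, 4.58e-04),
    Hit("WD40", "smart00320", 525, 564, 1.21e-03),
    Hit("PHA03247", "PHA03247", 206, 352, 7.98e-03),
]


def best_per_accession(hits, max_evalue=1e-5):
    """Keep the lowest-E-value hit per accession, dropping hits above the cutoff.

    The default cutoff is an arbitrary value chosen for this example.
    """
    best = {}
    for hit in hits:
        if hit.evalue > max_evalue:
            continue
        current = best.get(hit.accession)
        if current is None or hit.evalue < current.evalue:
            best[hit.accession] = hit
    return sorted(best.values(), key=lambda h: h.evalue)


for hit in best_per_accession(HITS):
    print(f"{hit.accession:<11} {hit.start}-{hit.end}  {hit.evalue:.2e}")
```

Raising `max_evalue` brings weaker hits such as COG3391 back into the summary; the cutoff should be chosen to suit the analysis rather than taken from this example.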