| Name | Accession | Description | Interval | E-value |
| MIF4G | pfam02854 | MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ... | 876-1104 | 1.13e-62 |
|
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
Pssm-ID: 397130 Cd Length: 203 Bit Score: 212.61 E-value: 1.13e-62
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 876 FRKVRSILNKLTPQMFHQLMKQVTDLTINTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLATLKvptadkpntTVNFR 955
Cdd:pfam02854 1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLRN---------PTDFG 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 956 KLLLNRCQKEFERDkvddvvlerkqkeidsatsptekerlqEELEEAKDKARRRSTGNIKFIGELFKLKMLTEPIMHDCV 1035
Cdd:pfam02854 72 IHLLNRLQEEFEKR---------------------------FELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 528494466 1036 VKLLKNH-------DDESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLHN 1104
Cdd:pfam02854 125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
|
|
| MIF4G | smart00543 | Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ... | 877-1101 | 1.12e-51 |
|
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press)
Pssm-ID: 214713 Cd Length: 200 Bit Score: 181.02 E-value: 1.12e-51
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 877 RKVRSILNKLTPQMFHQLMKQVTDLTINTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLAtLKVPtadkpnttvNFRK 956
Cdd:smart00543 2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 957 LLLNRCQKEFERDkvddvvlerkqkeidsatsptekerlqeeLEEAKDKARRRSTGNIKFIGELFKLKMLTEPIMHDCVV 1036
Cdd:smart00543 72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 528494466 1037 KLLKNH-------DDESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1101
Cdd:smart00543 123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
|
|
| W2_eIF4G1_like | cd11559 | C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ... | 1547-1682 | 1.40e-47 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.
Pssm-ID: 211397 Cd Length: 134 Bit Score: 166.69 E-value: 1.40e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1547 LSPEELFKQLEQLLLEDMSSDEqIFDWIEANLDESQMSSSPFLRALMTAICKAAVKDESTsCRVDTAIIQKRLPILHKYF 1626
Cdd:cd11559 1 LPLLRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSL-PEKEKALLEKYAPLLQKYL 78
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 528494466 1627 DSDTERQLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPT 1682
Cdd:cd11559 79 DDDEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
|
|
| MA3 | pfam02847 | MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ... | 1348-1459 | 7.06e-34 |
|
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.
Pssm-ID: 397128 Cd Length: 113 Bit Score: 126.62 E-value: 7.06e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1348 IERKSKAIIDEFLHINDYKEAVQCVLEIEQPSMLCVFVRMGLESTLERSQKAREHMGLLYYQLIQKGILPHSQLYKGFSE 1427
Cdd:pfam02847 1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|..
gi 528494466 1428 MLEQADDMAIDIPFIWLYLAELLSPLLKEGGI 1459
Cdd:pfam02847 81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGL 112
|
|
| MA3 | smart00544 | Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ... | 1348-1460 | 2.52e-33 |
|
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press)
Pssm-ID: 214714 Cd Length: 113 Bit Score: 125.05 E-value: 2.52e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1348 IERKSKAIIDEFLHINDYKEAVQCVLEIEQPSMLCVFVRMGLESTLERSQKAREHMGLLYYQLIQKGILPHSQLYKGFSE 1427
Cdd:smart00544 1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
|
90 100 110
....*....|....*....|....*....|...
gi 528494466 1428 MLEQADDMAIDIPFIWLYLAELLSPLLKEGGIN 1460
Cdd:smart00544 81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
|
|
| W2 | pfam02020 | eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ... | 1633-1709 | 3.25e-25 |
|
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.
Pssm-ID: 460415 Cd Length: 76 Bit Score: 100.29 E-value: 3.25e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 528494466 1633 QLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPTEQlGKGVALKSVNAFFTWLREAEEESE 1709
Cdd:pfam02020 1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
|
|
| eIF5C | smart00515 | Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5; | 1620-1704 | 1.20e-24 |
|
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
Pssm-ID: 214705 Cd Length: 83 Bit Score: 99.29 E-value: 1.20e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1620 PILHKYFDSDTERQLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPTEqlGKGVALKSVNAFFT 1699
Cdd:smart00515 1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78
|
....*
gi 528494466 1700 WLREA 1704
Cdd:smart00515 79 WLQEA 83
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 4-424 | 9.35e-15 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 80.75 E-value: 9.35e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 4 PPKVVPKPAAVAVSGHVTGP-APPTQLRAaltsvSLPPGAQNAPPSAVPPTQIPRAALSLDErmfPAhsgvtavysvsrh 82
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPdAPPQSARP-----RAPVDDRGDPRGPAPPSPLPPDTHAPDP---PP------------- 2628
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 83 PGPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAGPSyRILKPWETGGAP--PYNPAQNAGSAPLVYSPQTQPMNVQ 160
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR-RLGRAAQASSPPqrPRRRAARPTVGSLTSLADPPPPPPT 2707
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 161 PQTRPFVTGPR-PTHHQFIHRSQMQPARPTLPTNNPS----------IRPGSQTPTATVYPPNQPIMMTMTPMPFATQTH 229
Cdd:PHA03247 2708 PEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVpagpatpggpARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 230 QYYIPQYRHSAPYVGPPQQYAVQPPGsgtfyPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPAPMPPPPTKREKKPI 309
Cdd:PHA03247 2788 VASLSESRESLPSPWDPADPPAAVLA-----PAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDV 2862
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 310 RIRDPNQG--GKDITEEIMFGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSgrlsstPTPPQRPSNCQTPEQTAYVNQNQ 387
Cdd:PHA03247 2863 RRRPPSRSpaAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQA------PPPPQPQPQPPPPPQPQPPPPPP 2936
|
410 420 430
....*....|....*....|....*....|....*..
gi 528494466 388 RLSESPAPMDGKPSLAIDDRPKMESGPIKSISPGPRP 424
Cdd:PHA03247 2937 PRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 11-425 | 1.61e-10 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 66.33 E-value: 1.61e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 11 PAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPraalsldermfPAHSGVTAVYSVSRHPGPPFPGH 90
Cdd:pfam03154 172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQP-----------PNQTQSTAAPHTLIQQTPTLHPQ 240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 91 DLSKTHPNLAGTPPghatSPALSQVSVPAGPSyrilkPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP 170
Cdd:pfam03154 241 RLPSPHPPLQPMTQ----PPPPSQVSPQPLPQ-----PSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 171 RPTHHQFIHRSQmqpaRPTLPTNNPSIRPGsQTPTATVYPPnQPIMMTMTPMPFATQTHQYYIPQ-YRHSAPYVGP-PQQ 248
Cdd:pfam03154 312 GPSPAAPGQSQQ----RIHTPPSQSQLQSQ-QPPREQPLPP-APLSMPHIKPPPTTPIPQLPNPQsHKHPPHLSGPsPFQ 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 249 YAVQPPGSGTFYPGPSPAEYPTPYAAGPPYY---TGQTVYPPSPpiivpapmpppptkreKKPIRIRDPNQGGKditeei 325
Cdd:pfam03154 386 MNSNLPPPPALKPLSSLSTHHPPSAHPPPLQlmpQSQQLPPPPA----------------QPPVLTQSQSLPPP------ 443
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 326 mfGSRNPTPPAGHPAstltppagrPSSTPTPPSGRLSSTPTPPQRPSNCQTPEQTAYVNQNQRLSESPA---PMDGKPSL 402
Cdd:pfam03154 444 --AASHPPTSGLHQV---------PSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSssgPVPAAVSC 512
|
410 420 430
....*....|....*....|....*....|...
gi 528494466 403 ----------AIDDRPKMESGPIKSISPGPRPS 425
Cdd:pfam03154 513 plppvqikeeALDEAEEPESPPPPPRSPSPEPT 545
|
|
| KLF1_N | cd21581 | N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as ... | 139-277 | 6.76e-03 |
|
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as Krueppel-like factor 1 or Erythroid Kruppel-like Factor/EKLF) was the first Kruppel-like factor discovered. It was found to be vitally important for embryonic erythropoiesis in promoting the switch from fetal hemoglobin (Hemoglobin F) to adult hemoglobin (Hemoglobin A) gene expression by binding to highly conserved CACCC domains. EKLF ablation in mouse embryos produces a lethal anemic phenotype, causing death by embryonic day 14, and natural mutations lead to beta+ thalassemia in humans. However, expression of embryonic hemoglobin and fetal hemoglobin genes is normal in EKLF-deficient mice, suggesting other factors may be involved. KLF1 functions as a transcriptional activator. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF1, which is related to the N-terminal domains of KLF2 and KLF4.
Pssm-ID: 409227 [Multi-domain] Cd Length: 278 Bit Score: 40.41 E-value: 6.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 139 PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP----------RPTHHQFIHRSQMQPAR-PTL-PTNNPSIRPGSQTPTA 206
Cdd:cd21581 93 EEQPGAYYEPPKKDQPGTEGLQVGGPGLMAELlspeestgwaPPEPHHGYPDAFVGPALfPAPaNVDQFGFPQGGSVDRR 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 207 TV------------YPPNQPIMMTMTPMPF----ATQT------HQYYIPQYRHSApyvGPPQQYAvQPPGSGTFYPGPS 264
Cdd:cd21581 173 GNlsksgswdfgsyYPQQHPSVVAFPDSRFgplsGPQAltpdpqHYGYFQLFRHNA---ALFPDYA-HSPGPGHLPLGQQ 248
|
170
....*....|....*
gi 528494466 265 P--AEYPTPYAAGPP 277
Cdd:cd21581 249 PllPDPPLPPGGAEG 263
|
|
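Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally tagged "[Multi-domain]". As a minimal sketch, assuming that layout holds for every hit in a listing like this one, such a line could be parsed in Python as follows. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern mirroring the summary lines in this listing, e.g.
# "Pssm-ID: 397130 Cd Length: 203 Bit Score: 212.61 E-value: 1.13e-62".
# The optional "[Multi-domain]" tag follows the Pssm-ID when present.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Turn one summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["multi"] is not None,
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 "
        "Bit Score: 80.75 E-value: 9.35e-15"
    )
    print(hit)
```

Parsed records can then be sorted by `e_value` to recover the ranking used in the table, or filtered on `multi_domain` to separate single-domain models from multi-domain ones.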
| W2 | cd11473 | C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ... | 1550-1676 | 5.67e-20 |
|
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211395 Cd Length: 135 Bit Score: 87.53 E-value: 5.67e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1550 EELFKQLEQLLLEDMSSDEQIFDWIEANLDESQMSSSPFLRALMTAIC---KAAVKDESTSCRVDTAIIQKRLPILHKYF 1626
Cdd:cd11473 4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVnavESADSISLTQKEQLVLVLKKYGPVLRELL 83
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 528494466 1627 DSDTERQLQALYALQ--SLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWE 1676
Cdd:cd11473 84 KLIKKDQLYLLLKIEklCLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
|
|
| W2_eIF2B_epsilon | cd11558 | C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ... | 1590-1709 | 2.34e-15 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are similar in sequence, and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211396 Cd Length: 169 Bit Score: 75.37 E-value: 2.34e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1590 RALMTAICKAAVKDESTS---CRVDTAIIQKRL-PILHKYFDSDTErQLQALYALQSLIVALDQPPNLLRMFFDCLYDED 1665
Cdd:cd11558 47 RAVVKALLELILEVSSTStaeLLEALKKLLSKWgPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 528494466 1666 VISEDAFYQWETSKDPTEQLGKGVALKSVNAFFTWLREAEEESE 1709
Cdd:cd11558 126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
|
|
| W2_eIF5 | cd11561 | C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ... | 1568-1709 | 7.11e-11 |
|
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211399 Cd Length: 157 Bit Score: 62.25 E-value: 7.11e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1568 EQIFDWIEANLDESQMSSspflralMTAICKAAVKDESTSCRV---------DTAI---IQKRLPILHKYFDSDtERQLQ 1635
Cdd:cd11561 9 DELGEFLKKNKDESGLSE-------LKEILKEAERLDVVKDKAvlvlaevlfDENIvkeIKKRKALLLKLVTDE-KAQKA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1636 ALYALQSLIV-----ALDQPPNLLRmffdCLYDEDVISEDAFYQWETsKDPTEQLGKGVA---LKSVNAFFTWLREAEEE 1707
Cdd:cd11561 81 LLGGIERFCGkhspeLLKKVPLILK----ALYDNDILEEEVILKWYE-KVSKKYVSKEKSkkvRKAAEPFVEWLEEAEEE 155
|
..
gi 528494466 1708 SE 1709
Cdd:cd11561 156 EE 157
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 2-357 | 9.23e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.19 E-value: 9.23e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 2 SLPPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSLDERMFPAHSGVTAVYSVSR 81
Cdd:PHA03247 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 82 HPGPPFPGHDLSKTHPNLAGTPPGHATSPAL-----SQVSVPAGPSYRILKPWETGG----------------------A 134
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLppptsAQPTAPPPPPGPPPPSLPLGGsvapggdvrrrppsrspaakpaA 2877
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 135 PPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPfvTGPRPTHHQFIHRSQMQPARPTLPTNNPSIRPGSQTPTATVYPPNQP 214
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERP--PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 215 IMMTMTPMPFATQTHQYYIPQYRHSAPyvGPPQQYAVQPPGSGTFYPGPSPAEYPTPYA-----AGPPYYTGQTVYPPSp 289
Cdd:PHA03247 2956 SGAVPQPWLGALVPGRVAVPRFRVPQP--APSREAPASSTPPLTGHSLSRVSSWASSLAlheetDPPPVSLKQTLWPPD- 3032
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 528494466 290 PIIVPAPMPPPPTKREKKPIRIRDPNQGgkditeeimfgsrNPTPPAGHPASTLTPPAGR---PSSTPTPP 357
Cdd:PHA03247 3033 DTEDSDADSLFDSDSERSDLEALDPLPP-------------EPHDPFAHEPDPATPEAGAresPSSQFGPP 3090
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
58-592 |
8.70e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 8.70e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 58 AALSLDERMFPAHSGVTAVYSVSRHPGPPFPGHDLSKTHPNLA------GTP-PGHATSPALSQVSVPAGPSYRILKPWE 130
Cdd:PHA03247 2445 AGLAADGDPFFARTILGAPFSLSLLLGELFPGAPVYRRPAEARfpfaagAAPdPGGGGPPDPDAPPAPSRLAPAILPDEP 2524
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 131 TGGAPPYN-----------PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGPRPTHHQFIHRSqmqpARPTLPtnnpsirP 199
Cdd:PHA03247 2525 VGEPVHPRmltwirgleelASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAP-------P 2593
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 200 GSQTPTATVYPPNQPimmtmtpmpfatqthqyyipqYRHSAPYVGPPQQYAVQPPGSGtfyPGPSPAEYPTPYAAGPPyy 279
Cdd:PHA03247 2594 QSARPRAPVDDRGDP---------------------RGPAPPSPLPPDTHAPDPPPPS---PSPAANEPDPHPPPTVP-- 2647
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 280 tgqTVYPPSPPIIVPAPMPPPPTKREKKPIRIRDPNQGGKDITEEIMFGS----RNPTPPAGHPAStltPPAGRPSSTPT 355
Cdd:PHA03247 2648 ---PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSltslADPPPPPPTPEP---APHALVSATPL 2721
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 356 P--PSGRLSSTPTPPQRPSNCQTPEQTAyvnqnqrLSESPAPMDGKPSLAIDDRPKMESGPIKSISPG-PRPSESCLekr 432
Cdd:PHA03247 2722 PpgPAAARQASPALPAAPAPPAVPAGPA-------TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRlTRPAVASL--- 2791
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 433 eissLPLLVSSSPEVDVSSHPTSgcIKPTAAGEPEFISPSATKAQTYQVISGEESVPEASPRLSASLSLRVVNGVNEPQT 512
Cdd:PHA03247 2792 ----SESRESLPSPWDPADPPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRR 2865
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 513 PSSYEEPEVQEAlkmSSSCEIQGTSFMEESGQEVPVALEELQAEHLPSLAAHVPliPGVQASSITSSTTSVLAPPPGLAP 592
Cdd:PHA03247 2866 PPSRSPAAKPAA---PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP--PQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
39-425 |
1.40e-08 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 60.07 E-value: 1.40e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 39 PPGAQNAPPSavPPTQIPRAALSLDERMFPAHSGVTAVYSVSRHPGPPFPGHDLSKTHPN-LAGTPPG------------ 105
Cdd:PHA03379 408 ASEPTYGTPR--PPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSMAPCpVAQLPPGplqdlepgdqlp 485
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 106 ---HATSPALSQVSVPAGPsyrILKPWETggappyNPAQNAGSAPLVYSPqtQPMNVQPQTRPFVTGPRPTHHQFIHRSQ 182
Cdd:PHA03379 486 gvvQDGRPACAPVPAPAGP---IVRPWEA------SLSQVPGVAFAPVMP--QPMPVEPVPVPTVALERPVCPAPPLIAM 554
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 183 MQPARPT-LPTNNPSIRPGSQTPTatvyPPNQPIMMTMTPMPFATQTHQYyipQYRHSApyvgppqqyAVQPPgSGTFYP 261
Cdd:PHA03379 555 QGPGETSgIVRVRERWRPAPWTPN----PPRSPSQMSVRDRLARLRAEAQ---PYQASV---------EVQPP-QLTQVS 617
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 262 GPSPAEYP-TPYAAGPPYYTGQTVYPPSPPIIVPAPMPPPPTKREKKPIRIRDPnqggkditEEIMFGSRNPTPPAGHPA 340
Cdd:PHA03379 618 PQQPMEYPlEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDLPLQQPISQGAP--------LAPLRASMGPVPPVPATQ 689
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 341 ST-LTPPAGRPSSTPTPPSGRLSSTP-TPPQRPSNCQTPEQTAYVNQNQRLSESP---APMD-----GKPSLAIDDRPKM 410
Cdd:PHA03379 690 PQyFDIPLTEPINQGASAAHFLPQQPmEGPLVPERWMFQGATLSQSVRPGVAQSQyfdLPLTqpinhGAPAAHFLHQPPM 769
|
410 420
....*....|....*....|.
gi 528494466 411 ------ESGPIKSISPGPRPS 425
Cdd:PHA03379 770 egpwvpEQWMFQGAPPSQGTD 790
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
11-368 |
2.98e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 52.23 E-value: 2.98e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 11 PAAVAVSGHVTGPAP--PTQLRAALTSVSlpPGAQNAPPSAVPPTQIPR--AALSLDERMFPAHSGVTAVYSVSRHPGPP 86
Cdd:pfam05109 449 PSSTHVPTNLTAPAStgPTVSTADVTSPT--PAGTTSGASPVTPSPSPRdnGTESKAPDMTSPTSAVTTPTPNATSPTPA 526
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 87 FPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAgPSYRILKPWET----GGAPPYN----PAQNAgSAPLV--YSPQTQP 156
Cdd:pfam05109 527 VTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT-PAVTTPTPNATiptlGKTSPTSavttPTPNA-TSPTVgeTSPQANT 604
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 157 MNVQ---PQTRPFVTGPRPTHHQFIHRSQMQPARPTlpTNNPSIRPGSQTPTATVYPPNQ-----PIMMTMTP------- 221
Cdd:pfam05109 605 TNHTlggTSSTPVVTSPPKNATSAVTTGQHNITSSS--TSSMSLRPSSISETLSPSTSDNstshmPLLTSAHPtggenit 682
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 222 --MPFATQTHQYYI----PQYRHSAPYVGPPQQYAVQPPGSGTFYPGPSPAEYPTPYAAgppyyTGQTVYPPSPPIIVPA 295
Cdd:pfam05109 683 qvTPASTSTHHVSTsspaPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAP-----SGQKTAVPTVTSTGGK 757
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 296 PMPPPPTKREKkpirirdpNQGGKDITEEIM-FGSRNPTPPAGHPASTLTPPAG----RPSSTPTPP--SGRLSSTPTPP 368
Cdd:pfam05109 758 ANSTTGGKHTT--------GHGARTSTEPTTdYGGDSTTPRTRYNATTYLPPSTssklRPRWTFTSPpvTTAQATVPVPP 829
|
|
| W2_eIF5C_like |
cd11560 |
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ... |
1548-1707 |
1.15e-05 |
|
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.
Pssm-ID: 211398 [Multi-domain] Cd Length: 194 Bit Score: 47.98 E-value: 1.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1548 SPEELFKQLEQLLLEDMSSDEqifdwIEANLDEsQMSSSPFL---------RALMTAICKAAVKDESTscrvDTAI--IQ 1616
Cdd:cd11560 37 IKKELQQELKEMIAEEEPVKE-----IIAAVKE-QMKKSSLPehevvgllwTALMDAVEWSKKEDQIA----EQALrhLK 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 1617 KRLPILhKYFDSDTERQLQALYALQslIVALDQPpNLLRMFFDC---LYDEDVISEDAFYQW--ETSKDPteqlGKGVAL 1691
Cdd:cd11560 107 KYAPLL-AAFCTTARAELALLNKIQ--EYCYENM-KFMKVFQKIvklLYKADVLSEDAILKWykKGHSPK----GKQVFL 178
|
170
....*....|....*.
gi 528494466 1692 KSVNAFFTWLREAEEE 1707
Cdd:cd11560 179 KQMEPFVEWLQEAEEE 194
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
80-379 |
1.27e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 50.45 E-value: 1.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 80 SRHPGPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAgPSYrILKPWetggaPPYNPAQNAGSaplvysPQTQ---P 156
Cdd:PHA03378 550 SDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSA-PSY-AQTPW-----PVPHPSQTPEP------PTTQshiP 616
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 157 MNVQPQTRPFVTGPRPthhqfIHRSQMQPArptlpTNNPSIRPGSQTPTATVYPPNQPIMMTMTPMPFATQTHQYYIPQY 236
Cdd:PHA03378 617 ETSAPRQWPMPLRPIP-----MRPLRMQPI-----TFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLP 686
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 237 RHSAPYvgppqqyAVQPPGSGtfyPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPiivpapmpppptkrekkPIRIRDPNq 316
Cdd:PHA03378 687 IQWAPG-------TMQPPPRA---PTPMRPPAAPPGRAQRPAAATGRARPPAAA-----------------PGRARPPA- 738
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 528494466 317 ggkditeeimfGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSSTPTPP--------QRPSNCQTPEQ 379
Cdd:PHA03378 739 -----------AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPpqappapqQRPRGAPTPQP 798
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
12-426 |
1.84e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 1.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 12 AAVAVSGHVTGPAPPTQLRAAltsvslPPGAQNAPPSAVPPTQIPRAAlsldermfPAHSGVTAVYSVSRHPGPPFPGhd 91
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAAA------PAAAPAPAAAAPAAAAAPAPA--------AAPQPAPAPAPAPAPPSPAGNA-- 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 92 lskthPNLAGTPPGHATSPALSQVSVPAGPSYRILKPW---ETGGAPPYNPAQNAGSAPlvysPQTQPMNVQPQ------ 162
Cdd:PRK07764 449 -----PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPApapPAAPAPAAAPAAPAAPAA----PAGADDAATLRerwpei 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 163 ----------------TRPFVTGPRPTHHQFIHRSQMQPARPTLPTNNPSIRP-------GSQTPTATVYPPNQPIMMTM 219
Cdd:PRK07764 520 laavpkrsrktwaillPEATVLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTalaeelgGDWQVEAVVGPAPGAAGGEG 599
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 220 TPMPFATQTHqyyiPQYRHSAPYVGPPQQyAVQPPGSGTFYPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPapmpp 299
Cdd:PRK07764 600 PPAPASSGPP----EEAARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW----- 669
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 300 pptkrekkPIRIRDPNQGGkditeeimfGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSSTPTPPQRPSNCQTPEQ 379
Cdd:PRK07764 670 --------PAKAGGAAPAA---------PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPS 732
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 528494466 380 TAYVNQnQRLSESPAPMDGKPSLAIDDRPKMESGPIKSISPGPRPSE 426
Cdd:PRK07764 733 PAADDP-VPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
63-363 |
3.67e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 48.93 E-value: 3.67e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 63 DERMFPAHSGVTA-----VYSVSRHPGPPFPGHDlskthPNLAG---TPPGHATSPALSQVSVPAGPSYRILKpwetggA 134
Cdd:PRK10263 275 DEEITYTARGVAAdpddvLFSGNRATQPEYDEYD-----PLLNGapiTEPVAVAAAATTATQSWAAPVEPVTQ------T 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 135 PPYNPAQNAGSAPLVySPQTQPmnvQPQTRPFVTGPRPTHHQfihrSQMQPARPTLPTNNPSIRP-GSQTPTATVYPPNQ 213
Cdd:PRK10263 344 PPVASVDVPPAQPTV-AWQPVP---GPQTGEPVIAPAPEGYP----QQSQYAQPAVQYNEPLQQPvQPQQPYYAPAAEQP 415
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 214 PIMMTMTPMPFATQTHQYYIPQYRHSA---PYVGPPQQYAVQPPGSGTFYpgpspAEYPTPYAAGPPYYTGQTVYPPSpp 290
Cdd:PRK10263 416 AQQPYYAPAPEQPAQQPYYAPAPEQPVagnAWQAEEQQSTFAPQSTYQTE-----QTYQQPAAQEPLYQQPQPVEQQP-- 488
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 528494466 291 iIVPAPMPPPPTKREKKPIRIRDPNQGGKDITEEIMFGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSS 363
Cdd:PRK10263 489 -VVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
7-196 |
3.91e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 48.49 E-value: 3.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 7 VVPKPAAVAVSGHVTGPAPPTQ------------LRAALTSVSLPPGAQNAPPSAVPPTQIPrAALSLDERMFPAHSGVT 74
Cdd:pfam09770 165 VAPKKAAAPAPAPQPAAQPASLpapsrkmmsleeVEAAMRAQAKKPAQQPAPAPAQPPAAPP-AQQAQQQQQFPPQIQQQ 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 75 AVYSVSRHPGPPFPGHDLSKT---HPNLAGTPPGHATSPALSQVSVPAGPSyrilkpwetggaPPYNPAQ-----NAGSA 146
Cdd:pfam09770 244 QQPQQQPQQPQQHPGQGHPVTilqRPQSPQPDPAQPSIQPQAQQFHQQPPP------------VPVQPTQilqnpNRLSA 311
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 528494466 147 PLVYSPQTQPMNVQPQtrpfvtgprPTHHQfiHRSQMQPARPTLPTNNPS 196
Cdd:pfam09770 312 ARVGYPQNPQPGVQPA---------PAHQA--HRQQGSFGRQAPIITHPQ 350
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
4-222 |
4.44e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.33 E-value: 4.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 4 PPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAAlsldERMFPAHSGVTAVYSVSRHP 83
Cdd:PRK12323 380 APVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAA----ARQASARGPGGAPAPAPAPA 455
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 84 GPPFPGhdlskTHPNLAGTPPGHATSPALSQVSVPAG---PSYRILKPWET--GGAPPYNPAQN-AGSAPLVYSPQTQPM 157
Cdd:PRK12323 456 AAPAAA-----ARPAAAGPRPVAAAAAAAPARAAPAAapaPADDDPPPWEElpPEFASPAPAQPdAAPAGWVAESIPDPA 530
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 528494466 158 NVQPqtrpfvTGPRPTHHQFIHRSQMQPARPTLPTNNPSIRPG-SQTPTATVYPPNQPIMMTMTPM 222
Cdd:PRK12323 531 TADP------DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRaSASGLPDMFDGDWPALAARLPV 590
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
110-361 |
5.14e-05 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 47.90 E-value: 5.14e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 110 PALSQ-VSVPAGPSYRILKPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGPRPTHHQFihrSQMQPARP 188
Cdd:PRK14086 68 PIISEtLSRELGRPIRIAITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQ---DQLPTARP 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 189 TLPTNNPSIRPGSQTPTATVYPPNQPIMMTMTPMPFATQTHQYyipqyrhsapyvgPPQQYAVQPPGSGTfypgpspAEY 268
Cdd:PRK14086 145 AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYA-------------PEQERDREPYDAGR-------PEY 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 269 PTPYaaGPPYYTGQTVYPPsppiivpapmpppptKREKKPIRIRDPnqGGKDITEEIMFGSRNPTPPAGHPASTL-TPPA 347
Cdd:PRK14086 205 DQRR--RDYDHPRPDWDRP---------------RRDRTDRPEPPP--GAGHVHRGGPGPPERDDAPVVPIRPSApGPLA 265
|
250
....*....|....*.
gi 528494466 348 GRPSSTPTP--PSGRL 361
Cdd:PRK14086 266 AQPAPAPGPgePTARL 281
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
4-254 |
2.34e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.21 E-value: 2.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 4 PPKVVPKPAAV--AVSGHV------TGPAPPTQLRAALTSVSLPPGAQN-APPSAVPPT--QIPRAALSldeRMFPAHSG 72
Cdd:PHA03378 654 PPQVEITPYKPtwTQIGHIpyqpspTGANTMLPIQWAPGTMQPPPRAPTpMRPPAAPPGraQRPAAATG---RARPPAAA 730
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 73 VTAVYSVSRHPGPPFPGHDLSKTHPNLAGTP----PGHATSPALSQVSVPAGPSYRILKPwetGGAPPYNPAQNAGSAPL 148
Cdd:PHA03378 731 PGRARPPAAAPGRARPPAAAPGRARPPAAAPgrarPPAAAPGAPTPQPPPQAPPAPQQRP---RGAPTPQPPPQAGPTSM 807
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 149 VYSPQTQPMNVQPQT---RPFVTGP----RPTHHQfihRSQMQPARPTLPTNNPSIRPGSQTPTATV-YPP-NQPIMMTM 219
Cdd:PHA03378 808 QLMPRAAPGQQGPTKqilRQLLTGGvkrgRPSLKK---PAALERQAAAGPTPSPGSGTSDKIVQAPVfYPPvLQPIQVMR 884
|
250 260 270
....*....|....*....|....*....|....*...
gi 528494466 220 ---TPMPFATQTHQYYIPQYRHSAPYVGPPQQYAVQPP 254
Cdd:PHA03378 885 qlgSVRAAAASTVTQAPTEYTGERRGVGPMHPTDIPPS 922
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
133-271 |
4.87e-04 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 45.00 E-value: 4.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 133 GAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRP---FVTGPRPTHHQFIHRSQMQPAR--------PTLPTNNPSIRPGS 201
Cdd:pfam09606 281 GQPMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQqqqQQGGNHPAAHQQQMNQSVGQGGqvvalgglNHLETWNPGNFGGL 360
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 528494466 202 QTPTATvypPNQPIMMTM-TPMPFAT----QTHQYYIPQYRHSAPYVGPPQQyavQPPGSGTFYPGPSPAEYPTP 271
Cdd:pfam09606 361 GANPMQ---RGQPGMMSSpSPVPGQQvrqvTPNQFMRQSPQPSVPSPQGPGS---QPPQSHPGGMIPSPALIPSP 429
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
11-398 |
1.10e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.01 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 11 PAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSLDERMFPAHSGVTAVYSVSRHPGPPFPGH 90
Cdd:PHA03307 31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 91 DLSKTHPNLAGTPPGHATSPAlSQVSVPAGPSYRILKPWETGGAPPYNPAQNAGSAPlvySPQTQPMNVQPQTRPFVTGP 170
Cdd:PHA03307 111 PSSPDPPPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDA---ASSRQAALPLSSPEETARAP 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 171 RPTHHQFIHRSQMQPARPTLPTNNPSIRPGSQTPTAT------VYPPNQPIMMTMTPMPFATQTHQYYIPQYRHSaPYVG 244
Cdd:PHA03307 187 SSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPApgrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPA-PITL 265
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 245 PPQQYAVQPPGSGTFYPGPSPAEYPTPYAAGPPyytgqtvyPPSppiivPAPMPPPPTKREKKPIRIRDPNqGGKDITEE 324
Cdd:PHA03307 266 PTRIWEASGWNGPSSRPGPASSSSSPRERSPSP--------SPS-----SPGSGPAPSSPRASSSSSSSRE-SSSSSTSS 331
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 325 IMFGSRNPTPPAG-----HPASTLTPPAGRPSSTPT--PPSGRLSSTPTPPQRPsncqTPEQTAY-VNQNQRLSESPAPM 396
Cdd:PHA03307 332 SSESSRGAAVSPGpspsrSPSPSRPPPPADPSSPRKrpRPSRAPSSPAASAGRP----TRRRARAaVAGRARRRDATGRF 407
|
..
gi 528494466 397 DG 398
Cdd:PHA03307 408 PA 409
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3-274 |
1.30e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.60 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 3 LPPKVVPKPAAVAVSGhvTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPraalslderMFPAHSGVTAVYSVSRH 82
Cdd:pfam03154 293 VPPQPFPLTPQSSQSQ--VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP---------LPPAPLSMPHIKPPPTT 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 83 PGPPFPGHDLSKTHPNLAGTPPGHATSpalsqvSVPAGPSYRILKPWETGGAPPYNPaqnagsAPLVYSPQTQPMNVQPQ 162
Cdd:pfam03154 362 PIPQLPNPQSHKHPPHLSGPSPFQMNS------NLPPPPALKPLSSLSTHHPPSAHP------PPLQLMPQSQQLPPPPA 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 163 TRPFVT-----GPRPTHH---QFIHRSQMQPARPT---LPTNNPSIRPGSQTPTAT------VYPPNQPIMMTMTPMPFA 225
Cdd:pfam03154 430 QPPVLTqsqslPPPAASHpptSGLHQVPSQSPFPQhpfVPGGPPPITPPSGPPTSTssampgIQPPSSASVSSSGPVPAA 509
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 528494466 226 TQTHQYYIpQYRHSAPYVGPPQQYAVQPPGSgtfyPGPSPAEYPTPYAA 274
Cdd:pfam03154 510 VSCPLPPV-QIKEEALDEAEEPESPPPPPRS----PSPEPTVVNTPSHA 553
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
4-121 |
1.82e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.16 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 4 PPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSldermfpahsgVTAVYSVSRHP 83
Cdd:PRK14951 386 AAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA-----------AVALAPAPPAQ 454
|
90 100 110
....*....|....*....|....*....|....*...
gi 528494466 84 GPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAGP 121
Cdd:PRK14951 455 AAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
4-147 |
1.97e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.91 E-value: 1.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 4 PPKVVPKPAAVAvsghvtGPAPPTQLRAALTSVSLPPGAQNAPPSAVP--------PTQIPRAALSLDERMFPAHSGVTA 75
Cdd:PRK07003 376 VAGAVPAPGARA------AAAVGASAVPAVTAVTGAAGAALAPKAAAAaaatraeaPPAAPAPPATADRGDDAADGDAPV 449
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 528494466 76 VYSVSRHPGPPFPGHDLS---KTHPNLAGTPPGHATSPALSQVSVPAGPSYRILKPWETGGAPPYNPAQNAGSAP 147
Cdd:PRK07003 450 PAKANARASADSRCDERDaqpPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAA 524
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
221-424 |
2.28e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 42.72 E-value: 2.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 221 PMPFATQTHQYYIPQYRHSAPYVGPPQQYAVQPPGSGtfY----------------------PGPSPAEYPTPYAAGPPY 278
Cdd:pfam09770 107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTG--YekykepepipdlqvdaslwgvaPKKAAAPAPAPQPAAQPA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 279 YTGQTV-----------------YPPSPPIIVPAPMPPPPTKREKKPIRIRDPNQGGKDITEEIMFGSRNPTPPAGHPAS 341
Cdd:pfam09770 185 SLPAPSrkmmsleeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVT 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 342 TLTppagRPSSTPTPPSGRLSSTPTPPQRPSNCQTPEQTAYVNQN-QRLSESPAPMDGKPSLAIDDRPKMESGPIKSISP 420
Cdd:pfam09770 265 ILQ----RPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNpNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFG 340
|
....
gi 528494466 421 GPRP 424
Cdd:pfam09770 341 RQAP 344
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
185-369 |
2.39e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 42.51 E-value: 2.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 185 PARPTLPTNNPSIRPGSQTPTATVYPpnQPIMMTMTPMPFATQTHQYYIPQyrhsAPYVGPPQQYAVQPPGSGTFYPGPS 264
Cdd:PRK14086 81 PIRIAITVDPSAGEPAPPPPHARRTS--EPELPRPGRRPYEGYGGPRADDR----PPGLPRQDQLPTARPAYPAYQQRPE 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 265 PAEYPTPYAAGPPYYTgQTVYPP--------SPPIIVPAPMPPPPTKREKKPIRIRDPNQGGKDITEEIMFGSRNPTPP- 335
Cdd:PRK14086 155 PGAWPRAADDYGWQQQ-RLGFPPrapyaspaSYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPPp 233
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 528494466 336 -AGHPASTLTPPAGRPS--------STPTPPSGRLSSTPTPPQ 369
Cdd:PRK14086 234 gAGHVHRGGPGPPERDDapvvpirpSAPGPLAAQPAPAPGPGE 276
|
|
| TYA |
pfam01021 |
Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a ... |
158-287 |
4.96e-03 |
|
Ty transposon capsid protein; Ty are yeast transposons. A 5.7 kb transcript codes for p3, a fusion protein of TYA and TYB. The TYA protein is analogous to the gag protein of retroviruses. TYA is cleaved to form a 46 kDa protein which can form mature virion-like particles. This entry corresponds to the capsid protein from Ty1 and Ty2 transposons.
Pssm-ID: 425992 Cd Length: 384 Bit Score: 41.10 E-value: 4.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 158 NVQPQTRPfVTGPRPTHHqfiHRSQMQPARPTLPTN--------------NPSIRPGSQTPTATVYPPNQpimmtMTPMP 223
Cdd:pfam01021 35 NSQQTTTP-GSSAVPENH---HHASPQPASVPPPQNgpysqqcmmtpnqaNPSGWPFYGHPSMMPYTPYQ-----MSPMY 105
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 528494466 224 FATQTHqYYIPQYrhsAPYVGPPqqYAVQPPGSGTFYPGPSPAEYPTPyaagppyYTGQTVYPP 287
Cdd:pfam01021 106 FPPGPQ-SQFPQY---PSSVGTP--LSTPSPESGNTFTDSSSAKSDMT-------STNKYVRPP 156
|
|
| KLF1_N |
cd21581 |
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as ... |
139-277 |
6.76e-03 |
|
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as Krueppel-like factor 1 or Erythroid Kruppel-like Factor/EKLF) was the first Kruppel-like factor discovered. It was found to be vitally important for embryonic erythropoiesis in promoting the switch from fetal hemoglobin (Hemoglobin F) to adult hemoglobin (Hemoglobin A) gene expression by binding to highly conserved CACCC domains. EKLF ablation in mouse embryos produces a lethal anemic phenotype, causing death by embryonic day 14, and natural mutations lead to beta+ thalassemia in humans. However, expression of embryonic hemoglobin and fetal hemoglobin genes is normal in EKLF-deficient mice, suggesting other factors may be involved. KLF1 functions as a transcriptional activator. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF1, which is related to the N-terminal domains of KLF2 and KLF4.
Pssm-ID: 409227 [Multi-domain] Cd Length: 278 Bit Score: 40.41 E-value: 6.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 139 PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP----------RPTHHQFIHRSQMQPAR-PTL-PTNNPSIRPGSQTPTA 206
Cdd:cd21581 93 EEQPGAYYEPPKKDQPGTEGLQVGGPGLMAELlspeestgwaPPEPHHGYPDAFVGPALfPAPaNVDQFGFPQGGSVDRR 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 528494466 207 TV------------YPPNQPIMMTMTPMPF----ATQT------HQYYIPQYRHSApyvGPPQQYAvQPPGSGTFYPGPS 264
Cdd:cd21581 173 GNlsksgswdfgsyYPQQHPSVVAFPDSRFgplsGPQAltpdpqHYGYFQLFRHNA---ALFPDYA-HSPGPGHLPLGQQ 248
|
170
....*....|....*
gi 528494466 265 P--AEYPTPYAAGPP 277
Cdd:cd21581 249 PllPDPPLPPGGAEG 263
|
|
|