Conserved domains on [gi|1956591577|ref|NP_001378829|]

eukaryotic translation initiation factor 4 gamma 3 isoform 9 [Homo sapiens]

Protein Classification

eukaryotic translation initiation factor 4 gamma 3 (domain architecture ID 10501431)

Eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) is a component of the eIF4F protein complex, which is involved in recognition of the mRNA cap, ATP-dependent unwinding of 5'-terminal secondary structure, and recruitment of mRNA to the ribosome.

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
754-982 4.84e-62

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 210.68  E-value: 4.84e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  754 FRKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 833
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  834 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 913
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1956591577  914 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 982
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1427-1555 9.00e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 167.08  E-value: 9.00e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577 1427 KRLEKLIIEDKANDEqIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTfRVDTAVIKQRVPILLKYLDSDTEKE 1506
Cdd:cd11559      8 AELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLP-EKEKALLEKYAPLLQKYLDDDEQLQ 85
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1956591577 1507 LQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1555
Cdd:cd11559     86 LQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1221-1333 6.58e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.10  E-value: 6.58e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577 1221 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1300
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1956591577 1301 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1333
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1534-1580 9.61e-05

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 9.61e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1956591577 1534 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1580
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
10-295 1.20e-04

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 1.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   10 PFFQRPQIqpPRATIPNSSPSIR----PGAQTPTAVYQanqhimmvnhlPMPYPVPQGPQYCIPQYRHSGP---PYvgpp 82
Cdd:PRK10263   339 PVTQTPPV--ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPV---- 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   83 qqypvqppgpgpfypgpGPGDFPNAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEIMSGG 162
Cdd:PRK10263   402 -----------------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQT 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  163 GSRNPTPPIGRPTSTPTPPQLPSQVPEHSPVVYGTVES-AHLAASTPVTAASDQKQEE-----KPKPDPVlKSPSPVlrl 236
Cdd:PRK10263   465 EQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPArPPLYYFEEVEEKRAREREQlaawyQPIPEPV-KEPEPI--- 540
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1956591577  237 vlsgekkeqegqtSETTAIVSIAELPLPPSPTTVSSVA---RSTIAAPTSSALSSQPIFTTA 295
Cdd:PRK10263   541 -------------KSSLKAPSVAAVPPVEAAAAVSPLAsgvKKATLATGAAATVAAPVFSLA 589
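
The summary line that precedes each alignment (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be read programmatically. The following is a minimal Python sketch, assuming that layout holds for every hit on the page. HitSummary, parse_summary and model_coverage are hypothetical names chosen for illustration; they are not part of any NCBI tool or API.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the lines on this page:
#   Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


def model_coverage(model_from: int, model_to: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    return (model_to - model_from + 1) / cd_length
```

For example, parse_summary applied to the pfam02854 summary line gives a Cd Length of 203, and model_coverage(1, 203, 203) shows that the alignment spans the whole model. The cd11560 alignment covers model positions 150-194 of 194, so the same helper reports that only a small C-terminal fraction of that model is matched.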
 
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
755-979 1.62e-49

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 174.47  E-value: 1.62e-49
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   755 RKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 834
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   835 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 914
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1956591577   915 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 979
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1221-1333 1.40e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press)


Pssm-ID: 214714  Cd Length: 113  Bit Score: 128.52  E-value: 1.40e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  1221 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1300
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1956591577  1301 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1333
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1493-1577 3.48e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 108.92  E-value: 3.48e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  1493 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqnGKGVALKSVTAFFT 1572
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1956591577  1573 WLREA 1577
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1506-1582 1.34e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 95.67  E-value: 1.34e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1956591577 1506 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQnGKGVALKSVTAFFTWLREAEEESE 1582
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1534-1580 9.61e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 9.61e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1956591577 1534 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1580
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
155-370 4.40e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 4.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  155 TEEIMSGGGSRNPTPPIGRPTSTPTPPQLP---------SQVPEHSPVVYGTVESAhlAASTPVTAASDQKQEEKP---- 221
Cdd:pfam05109  430 TSPTLNTTGFAAPNTTTGLPSSTHVPTNLTapastgptvSTADVTSPTPAGTTSGA--SPVTPSPSPRDNGTESKApdmt 507
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  222 -------KPDPVLKSPSPVLRLVLSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVARSTI-----AAPTSSALSSQ 289
Cdd:pfam05109  508 sptsavtTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIptlgkTSPTSAVTTPT 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  290 PiftTAIDDRCELSSPREDTIpipSLTSCTETSDPLPTNENDDdiCKKPCSVAPNDIPLVSSTNLINEINGVSEKLSATE 369
Cdd:pfam05109  588 P---NATSPTVGETSPQANTT---NHTLGGTSSTPVVTSPPKN--ATSAVTTGQHNITSSSTSSMSLRPSSISETLSPST 659

                   .
gi 1956591577  370 S 370
Cdd:pfam05109  660 S 660
 
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1423-1549 3.71e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.22  E-value: 3.71e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577 1423 EELYKRLEKLIIEDKANDEQIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTF---RVDTAVIKQRVPILLKYL 1499
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1956591577 1500 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1549
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1463-1582 2.01e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.76  E-value: 2.01e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577 1463 RALMTAVCK-AAIIADSSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1538
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1956591577 1539 VISEDAFYKWESSKDPAEQNGKGVALKSVTAFFTWLREAEEESE 1582
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1441-1582 2.79e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 54.54  E-value: 2.79e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577 1441 EQIFDWVEANLDEIQMSSptfLRALMTAV-------CKAAIIADSSTFRVDTA-VIKQRVPILLKYLDSDtekelQALYA 1512
Cdd:cd11561      9 DELGEFLKKNKDESGLSE---LKEILKEAerldvvkDKAVLVLAEVLFDENIVkEIKKRKALLLKLVTDE-----KAQKA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1956591577 1513 LQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKW--ESSKD--PAEQNGKgvALKSVTAFFTWLREAEEESE 1582
Cdd:cd11561     81 LLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKWyeKVSKKyvSKEKSKK--VRKAAEPFVEWLEEAEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1534-1580 9.61e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 9.61e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1956591577 1534 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1580
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PRK10263 PRK10263
DNA translocase FtsK; Provisional
10-295 1.20e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 1.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   10 PFFQRPQIqpPRATIPNSSPSIR----PGAQTPTAVYQanqhimmvnhlPMPYPVPQGPQYCIPQYRHSGP---PYvgpp 82
Cdd:PRK10263   339 PVTQTPPV--ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPV---- 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   83 qqypvqppgpgpfypgpGPGDFPNAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEIMSGG 162
Cdd:PRK10263   402 -----------------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQT 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  163 GSRNPTPPIGRPTSTPTPPQLPSQVPEHSPVVYGTVES-AHLAASTPVTAASDQKQEE-----KPKPDPVlKSPSPVlrl 236
Cdd:PRK10263   465 EQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPArPPLYYFEEVEEKRAREREQlaawyQPIPEPV-KEPEPI--- 540
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1956591577  237 vlsgekkeqegqtSETTAIVSIAELPLPPSPTTVSSVA---RSTIAAPTSSALSSQPIFTTA 295
Cdd:PRK10263   541 -------------KSSLKAPSVAAVPPVEAAAAVSPLAsgvKKATLATGAAATVAAPVFSLA 589
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
155-370 4.40e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 44.91  E-value: 4.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  155 TEEIMSGGGSRNPTPPIGRPTSTPTPPQLP---------SQVPEHSPVVYGTVESAhlAASTPVTAASDQKQEEKP---- 221
Cdd:pfam05109  430 TSPTLNTTGFAAPNTTTGLPSSTHVPTNLTapastgptvSTADVTSPTPAGTTSGA--SPVTPSPSPRDNGTESKApdmt 507
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  222 -------KPDPVLKSPSPVLRLVLSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVARSTI-----AAPTSSALSSQ 289
Cdd:pfam05109  508 sptsavtTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIptlgkTSPTSAVTTPT 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  290 PiftTAIDDRCELSSPREDTIpipSLTSCTETSDPLPTNENDDdiCKKPCSVAPNDIPLVSSTNLINEINGVSEKLSATE 369
Cdd:pfam05109  588 P---NATSPTVGETSPQANTT---NHTLGGTSSTPVVTSPPKN--ATSAVTTGQHNITSSSTSSMSLRPSSISETLSPST 659

                   .
gi 1956591577  370 S 370
Cdd:pfam05109  660 S 660
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-294 9.68e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 9.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577    5 PQTRSPFFQRPQIQPPRATIPNSSPSIRPGAQTPTAVYQANQHIMMVNHLPMPYPVPQGPQycipqyRHSGPPYVGPPQQ 84
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPR------DDPAPGRVSRPRR 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   85 YPVQPPGpgpfypgpgpgdfPNAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTiriRDPNQGGKDITEEIMSGGGS 164
Cdd:PHA03247  2667 ARRLGRA-------------AQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP---APHALVSATPLPPGPAAARQ 2730
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  165 RNPTPPIgrptsTPTPPQLPSqvpehSPVVYGTvESAHLAASTPVTAASDQKQEEKPKPDPVLKSPSPVLRLVLSGEKKE 244
Cdd:PHA03247  2731 ASPALPA-----APAPPAVPA-----GPATPGG-PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLP 2799
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1956591577  245 QEGQTSETTAIVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTT 294
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
5-343 3.84e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577    5 PQTRSPFFQRPQIQPPRATIPNSSPSIRPGAQTPtavyqanqhimmvnhlPMPYPVPQGPqyciPQYRHSGP--PYVGPP 82
Cdd:pfam03154  243 PSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMP----------------PMPHSLQTGP----SHMQHPVPpqPFPLTP 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   83 QQYPVQPPGPGPFYPGPGPGDFPNaygTPfyPSQPVYQSAPiivPTQQQP-PPAKREKKtiRIRDPnqggkditeeimsg 161
Cdd:pfam03154  303 QSSQSQVPPGPSPAAPGQSQQRIH---TP--PSQSQLQSQQ---PPREQPlPPAPLSMP--HIKPP-------------- 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  162 ggSRNPTPPIGRPTSTPTPPQL----PSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQeEKPKPDPVlkSPSPVLRLV 237
Cdd:pfam03154  359 --PTTPIPQLPNPQSHKHPPHLsgpsPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQ-LMPQSQQL--PPPPAQPPV 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  238 LSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCelSSPREDTIPIPSLTS 317
Cdd:pfam03154  434 LTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPS--SASVSSSGPVPAAVS 511
                          330       340
                   ....*....|....*....|....*.
gi 1956591577  318 CteTSDPLPTNENDDDICKKPCSVAP 343
Cdd:pfam03154  512 C--PLPPVQIKEEALDEAEEPESPPP 535
PHA03378 PHA03378
EBNA-3B; Provisional
13-291 4.47e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 4.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   13 QRPQIQPPRATIP-NSSPSIRPGAQTPTAVYQANQHIMMVNHLPMPYPvPQGPQYCIPQYRHS--GPPYVGPPQQYPVQP 89
Cdd:PHA03378   604 QTPEPPTTQSHIPeTSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTP-HQPPQVEITPYKPTwtQIGHIPYQPSPTGAN 682
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577   90 PGPGPFYPGPGPGDfPNAYGTPFYPSQ--PVYQSAPIIVPTQQQPPPA------KREKKTIRIRDPNQGGKDITEEIMSG 161
Cdd:PHA03378   683 TMLPIQWAPGTMQP-PPRAPTPMRPPAapPGRAQRPAAATGRARPPAAapgrarPPAAAPGRARPPAAAPGRARPPAAAP 761
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  162 GGSRNPTPPIGRPTST------PTPPQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLKSPSPVLR 235
Cdd:PHA03378   762 GRARPPAAAPGAPTPQpppqapPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLK 841
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1956591577  236 LVLSGEKKEqegqtsettaivsiaelPLPPSPTTVSSVARSTIAAPTSSALSSQPI 291
Cdd:PHA03378   842 KPAALERQA-----------------AAGPTPSPGSGTSDKIVQAPVFYPPVLQPI 880
PRK11901 PRK11901
hypothetical protein; Reviewed
114-240 7.27e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 40.44  E-value: 7.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  114 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPPI----GRPTSTPTP 180
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPAtvapSKGAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1956591577  181 PQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLKSPSPVLRLVLSG 240
Cdd:PRK11901   193 AETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAASARALSSAPASHYTLQLSS 252
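
Several of the hits listed above cover overlapping residue ranges on the query. The following is a minimal Python sketch, not part of the CD-Search output, that merges the intervals copied from the hit list in this section to show which stretches of the query they span. The helper names `parse_interval` and `merge_intervals` are hypothetical and chosen only for illustration.

```python
# Query intervals copied from the hit list above (CDD short name -> "from-to").
hits = {
    "W2_eIF5": "1441-1582",
    "W2_eIF5C_like": "1534-1580",
    "PRK10263": "10-295",
    "Herpes_BLLF1": "155-370",
    "PHA03247": "5-294",
    "Atrophin-1": "5-343",
    "PHA03378": "13-291",
    "PRK11901": "114-240",
}


def parse_interval(text):
    """Turn an interval string such as '1441-1582' into a (start, end) tuple."""
    start, end = text.split("-")
    return int(start), int(end)


def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into non-overlapping regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


regions = merge_intervals(parse_interval(v) for v in hits.values())
print(regions)  # [(5, 370), (1441, 1582)]
```

Under these inputs, the six multi-domain hits with E-values between 1.20e-04 and 7.27e-03 all fall within one N-terminal region (residues 5-370), while the two W2 hits fall within the C-terminal region 1441-1582.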
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
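
Each hit in this report carries a statistics line with a fixed layout (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`), so the reported E-values can be checked against the E-value threshold of 0.01 given in the preset options. The sketch below assumes the page has been saved as plain text; the function `parse_hit_stats` and the regular expression are illustrative assumptions, not an NCBI API.

```python
import re

# Illustrative pattern for statistics lines such as:
#   "Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.76  E-value: 2.01e-15"
# The "[Multi-domain]" tag is optional and appears only on some hits.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

E_VALUE_THRESHOLD = 0.01  # value taken from the preset options


def parse_hit_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Two statistics lines copied from this report.
lines = [
    "Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.76  E-value: 2.01e-15",
    "Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 40.44  E-value: 7.27e-03",
]
hits = [parse_hit_stats(line) for line in lines]
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]
```

Both example lines pass the threshold, as expected for a report generated with that cutoff.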
