Conserved domains on  [gi|2217271878|ref|XP_047289218|]

eukaryotic translation initiation factor 4 gamma 3 isoform X11 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
804-1032 1.63e-62

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 212.22  E-value: 1.63e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  804 FRKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 883
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  884 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 963
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217271878  964 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1032
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
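The paired rows above (query `gi 2217271878` against `Cdd:pfam02854`) can be summarized as a count of aligned columns and identities. What follows is a minimal sketch, assuming that lowercase letters and `-` mark columns that are not aligned between query and model; the function name `alignment_stats` is hypothetical and not part of any NCBI tool.

```python
def alignment_stats(query_row: str, model_row: str) -> dict:
    """Count aligned columns and identities for one pair of alignment rows.

    Assumption: a column counts as aligned only when both rows hold an
    uppercase residue; lowercase letters and '-' are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have equal length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    percent = 100.0 * identical / aligned if aligned else 0.0
    return {"aligned": aligned, "identical": identical, "percent_identity": percent}


# Final 18 columns of the pfam02854 alignment (query residues 1015-1032).
tail = alignment_stats("SSRIRFMLQDVIDLRLCN", "SSRLRFMLQDLIELRKNK")
print(tail)
```

Full rows can be passed in the same way once the leading label, start coordinate, and trailing end coordinate have been stripped from each line.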
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1477-1605 6.07e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 167.46  E-value: 6.07e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1477 KRLEKLIIEDKANDEqIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTfRVDTAVIKQRVPILLKYLDSDTEKE 1556
Cdd:cd11559      8 AELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLP-EKEKALLEKYAPLLQKYLDDDEQLQ 85
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2217271878 1557 LQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1605
Cdd:cd11559     86 LQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1271-1383 4.52e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.52e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1271 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1350
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 2217271878 1351 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1383
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1584-1630 8.48e-05

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.48e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2217271878 1584 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1630
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
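The summary line for this hit reports a model length of 194, while the alignment only spans model positions 150 to 194. The sketch below parses such a `Pssm-ID` line and computes the fraction of the model that is covered. The regular expression and the names `parse_pssm_line` and `model_coverage` are assumptions made for illustration, not an NCBI-provided parser.

```python
import re

# Assumed layout of the summary line, based only on the lines shown on this page.
PSSM_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_pssm_line(line: str) -> dict:
    """Extract Pssm-ID, model length, bit score, and E-value from one line."""
    match = PSSM_LINE.search(line)
    if match is None:
        raise ValueError("not a Pssm-ID summary line")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (inclusive ends)."""
    return (model_end - model_start + 1) / cd_length


info = parse_pssm_line(
    "Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.48e-05"
)
coverage = model_coverage(150, 194, info["cd_length"])
print(info, round(coverage, 3))
```

A low coverage value like this one indicates that only the C-terminal end of the model is matched, which is consistent with the weak E-value of the hit.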
PHA03378 super family cl33729
EBNA-3B; Provisional
2-276 1.32e-04

EBNA-3B; Provisional


The actual alignment was detected with superfamily member PHA03378:

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878    2 NSQPQTRSPPSRTVPIH------CTDNWKRRKVLEQTP-VYRSLAGRGWIKYCIFAAGPRPPHHQFFQRPQIQPPRATIP 74
Cdd:PHA03378   618 TSAPRQWPMPLRPIPMRplrmqpITFNVLVFPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPP 697
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   75 NSSPS-IRPGAQTPTAVYQANQhimmvnhLPMPYPVPQGPQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDF 153
Cdd:PHA03378   698 PRAPTpMRPPAAPPGRAQRPAA-------ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 770
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  154 PNAyGTPFYPSQ--PVYQSAPIIVPTQQQPPPAKREKKTIRIRD-PNQGG--KDITEEIMSGGGSRN------------- 215
Cdd:PHA03378   771 PGA-PTPQPPPQapPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAaPGQQGptKQILRQLLTGGVKRGrpslkkpaalerq 849
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217271878  216 ----PTPPIGRPTSTPT-------PPQQLPSQVPehspvvyGTVESAHLAASTPVTAASDQKQEEKPKPDPV 276
Cdd:PHA03378   850 aaagPTPSPGSGTSDKIvqapvfyPPVLQPIQVM-------RQLGSVRAAAASTVTQAPTEYTGERRGVGPM 914
PTZ00449 super family cl33186
104 kDa microneme/rhoptry antigen; Provisional
166-390 9.52e-03

104 kDa microneme/rhoptry antigen; Provisional


The actual alignment was detected with superfamily member PTZ00449:

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  166 PVYQSAPIIVPTQQQPPPAKREKKTIRirDPnqggkditEEIMSGGGSRNPTPPIGRPT-------STPTPPQQLPSQVP 238
Cdd:PTZ00449   563 PAKEHKPSKIPTLSKKPEFPKDPKHPK--DP--------EEPKKPKRPRSAQRPTRPKSpklpellDIPKSPKRPESPKS 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  239 EHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLK-------------SPSPVLRLVLSGEKKEQEGQT-SETTA 304
Cdd:PTZ00449   633 PKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKekfyddyldaaakSKETKTTVVLDESFESILKETlPETPG 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  305 IVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCELSSPREDTIPIPSLTSCTETSDPLPTNENDDDI 384
Cdd:PTZ00449   713 TPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEA 792

                   ....*.
gi 2217271878  385 CKKPCS 390
Cdd:PTZ00449   793 MKRPDS 798
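The interval and E-value rows of the hit list above lend themselves to simple post-processing, such as computing hit lengths or finding hits that overlap on the query. The following is a rough sketch under the assumption that each row has the form `start-end e_value`; the `Hit` class and the helper names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    name: str
    start: int
    end: int
    e_value: float

    @property
    def length(self) -> int:
        # Intervals on the page are inclusive at both ends.
        return self.end - self.start + 1


def parse_hit(name: str, interval_line: str) -> Hit:
    """Parse a row such as '804-1032 1.63e-62' into a Hit."""
    interval, e_value = interval_line.split()
    start, end = (int(x) for x in interval.split("-"))
    return Hit(name, start, end, float(e_value))


def overlaps(a: Hit, b: Hit) -> bool:
    """True when two inclusive query intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# Rows copied from the hit list above.
HITS = [
    parse_hit("MIF4G", "804-1032 1.63e-62"),
    parse_hit("W2_eIF4G1_like", "1477-1605 6.07e-48"),
    parse_hit("MA3", "1271-1383 4.52e-37"),
    parse_hit("W2 super family", "1584-1630 8.48e-05"),
    parse_hit("PHA03378 super family", "2-276 1.32e-04"),
    parse_hit("PTZ00449 super family", "166-390 9.52e-03"),
]

overlapping = [
    (a.name, b.name)
    for i, a in enumerate(HITS)
    for b in HITS[i + 1:]
    if overlaps(a, b)
]
print(overlapping)
```

The overlap check is a plain interval test and says nothing about which of two overlapping hits is the better annotation; that judgment still rests on the E-values and the alignments themselves.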
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
804-1032 1.63e-62

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 212.22  E-value: 1.63e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  804 FRKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 883
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  884 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 963
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217271878  964 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1032
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
805-1029 6.07e-50

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 176.01  E-value: 6.07e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   805 RKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 884
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   885 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 964
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217271878   965 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1029
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1477-1605 6.07e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 167.46  E-value: 6.07e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1477 KRLEKLIIEDKANDEqIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTfRVDTAVIKQRVPILLKYLDSDTEKE 1556
Cdd:cd11559      8 AELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLP-EKEKALLEKYAPLLQKYLDDDEQLQ 85
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2217271878 1557 LQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1605
Cdd:cd11559     86 LQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1271-1383 4.52e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.52e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1271 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1350
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 2217271878 1351 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1383
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1271-1383 9.60e-35

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 128.90  E-value: 9.60e-35
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  1271 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1350
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 2217271878  1351 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1383
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1543-1627 2.61e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.30  E-value: 2.61e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  1543 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqnGKGVALKSVTAFFT 1622
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 2217271878  1623 WLREA 1627
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1556-1632 9.34e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 9.34e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217271878 1556 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQnGKGVALKSVTAFFTWLREAEEESE 1632
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1584-1630 8.48e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.48e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2217271878 1584 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1630
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03378 PHA03378
EBNA-3B; Provisional
2-276 1.32e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878    2 NSQPQTRSPPSRTVPIH------CTDNWKRRKVLEQTP-VYRSLAGRGWIKYCIFAAGPRPPHHQFFQRPQIQPPRATIP 74
Cdd:PHA03378   618 TSAPRQWPMPLRPIPMRplrmqpITFNVLVFPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPP 697
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   75 NSSPS-IRPGAQTPTAVYQANQhimmvnhLPMPYPVPQGPQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDF 153
Cdd:PHA03378   698 PRAPTpMRPPAAPPGRAQRPAA-------ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 770
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  154 PNAyGTPFYPSQ--PVYQSAPIIVPTQQQPPPAKREKKTIRIRD-PNQGG--KDITEEIMSGGGSRN------------- 215
Cdd:PHA03378   771 PGA-PTPQPPPQapPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAaPGQQGptKQILRQLLTGGVKRGrpslkkpaalerq 849
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217271878  216 ----PTPPIGRPTSTPT-------PPQQLPSQVPehspvvyGTVESAHLAASTPVTAASDQKQEEKPKPDPV 276
Cdd:PHA03378   850 aaagPTPSPGSGTSDKIvqapvfyPPVLQPIQVM-------RQLGSVRAAAASTVTQAPTEYTGERRGVGPM 914
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
53-393 2.87e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 2.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   53 PRPPHHQFFQRPQIQPPRatIPNSSPSIRPGAQTPTAVYQANQhimmvnhlPMPYPVPQGPQYCIPQYRHSGPPYVGPPQ 132
Cdd:pfam03154  224 TAAPHTLIQQTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPV 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  133 QYPVQPPGPGPFYPGPGPGDFPNAYG-------TPfyPSQPVYQSAPiivPTQQQP-PPAKREKKTIRIRDPNQGGKDIT 204
Cdd:pfam03154  294 PPQPFPLTPQSSQSQVPPGPSPAAPGqsqqrihTP--PSQSQLQSQQ---PPREQPlPPAPLSMPHIKPPPTTPIPQLPN 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  205 EEIMSGGGSRNPTPPIGRPTSTPTPPQQLP-SQVPEHSPvvygtvESAHlaaSTPVTAASDQKQEEKPKPDPVLKSPSPV 283
Cdd:pfam03154  369 PQSHKHPPHLSGPSPFQMNSNLPPPPALKPlSSLSTHHP------PSAH---PPPLQLMPQSQQLPPPPAQPPVLTQSQS 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  284 LrlvlsgekKEQEGQTSETTAIVSIAelPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCelSSPREDTIPIP 363
Cdd:pfam03154  440 L--------PPPAASHPPTSGLHQVP--SQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPS--SASVSSSGPVP 507
                          330       340       350
                   ....*....|....*....|....*....|
gi 2217271878  364 SLTSCteTSDPLPTNENDDDICKKPCSVAP 393
Cdd:pfam03154  508 AAVSC--PLPPVQIKEEALDEAEEPESPPP 535
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
159-345 8.14e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.
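The consensus notation used in the description above (N is any nucleotide, R is A/G, Y is C/T) follows the IUPAC ambiguity codes and can be expanded into a regular expression. The sketch below covers only the codes that appear in the motif; the table `IUPAC` and the function `motif_to_regex` are illustrative names, and the test sequences are made up rather than taken from any promoter.

```python
import re

# Only the ambiguity codes needed for NNRCRCCYY, plus the four plain bases.
IUPAC = {
    "A": "A",
    "C": "C",
    "G": "G",
    "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def motif_to_regex(motif: str) -> re.Pattern:
    """Expand an IUPAC-coded DNA motif into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))


pattern = motif_to_regex("NNRCRCCYY")
print(pattern.pattern)

# Hypothetical sequences: the first satisfies every position of the motif,
# the second has a C where a purine (R) is required at position 3.
print(bool(pattern.fullmatch("TTGCACCCT")))
print(bool(pattern.fullmatch("TTCCACCCT")))
```

Since the description calls the consensus loose, an exact regular-expression match is a stricter test than the binding preference it stands for, and both strands of a sequence would need to be scanned in practice.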


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 44.15  E-value: 8.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  159 TPFYPSQPVYQSAPIIVPTQQQPPPAKR-------EKKTIRIRdPNQggkditeeIMSGGGS-------RNPTPPIGRPT 224
Cdd:cd22540     47 TPPAPPQPTPRKLVPIKPAPLPLGPGKNsigflsaKGNIIQLQ-GSQ--------LSSSAPGgqqvfaiQNPTMIIKGSQ 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  225 STPTPPQQL---PSQVPEHSPVVYGTVE---SAHLAASTPVTAASDQKQEEKP---KPDPVLKS---------PSPVLRL 286
Cdd:cd22540    118 TRSSTNQQYqisPQIQAAGQINNSGQIQiipGTNQAIITPVQVLQQPQQAHKPvpiKPAPLQTSntnsaslqvPGNVIKL 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217271878  287 VLSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVAR--STIAAPTSSALSSQP----IFTTA 345
Cdd:cd22540    198 QSGGNVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRkkSAQAAQPAVTVAEQVetvlIETTA 262
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
166-390 9.52e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  166 PVYQSAPIIVPTQQQPPPAKREKKTIRirDPnqggkditEEIMSGGGSRNPTPPIGRPT-------STPTPPQQLPSQVP 238
Cdd:PTZ00449   563 PAKEHKPSKIPTLSKKPEFPKDPKHPK--DP--------EEPKKPKRPRSAQRPTRPKSpklpellDIPKSPKRPESPKS 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  239 EHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLK-------------SPSPVLRLVLSGEKKEQEGQT-SETTA 304
Cdd:PTZ00449   633 PKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKekfyddyldaaakSKETKTTVVLDESFESILKETlPETPG 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  305 IVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCELSSPREDTIPIPSLTSCTETSDPLPTNENDDDI 384
Cdd:PTZ00449   713 TPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEA 792

                   ....*.
gi 2217271878  385 CKKPCS 390
Cdd:PTZ00449   793 MKRPDS 798
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
804-1032 1.63e-62

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 212.22  E-value: 1.63e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  804 FRKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 883
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  884 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 963
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217271878  964 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1032
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
805-1029 6.07e-50

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 176.01  E-value: 6.07e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   805 RKVRSILNKLTPQMFNQLMKQVSGLTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 884
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   885 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 964
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217271878   965 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1029
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1477-1605 6.07e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 167.46  E-value: 6.07e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1477 KRLEKLIIEDKANDEqIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTfRVDTAVIKQRVPILLKYLDSDTEKE 1556
Cdd:cd11559      8 AELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLP-EKEKALLEKYAPLLQKYLDDDEQLQ 85
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2217271878 1557 LQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1605
Cdd:cd11559     86 LQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1271-1383 4.52e-37

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 135.48  E-value: 4.52e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1271 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1350
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 2217271878 1351 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1383
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1271-1383 9.60e-35

Domain in DAP-5, eIF4G, MA-3 and other proteins; highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 128.90  E-value: 9.60e-35
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  1271 LERKSKSIIDEFLHINDFKEAMQCVEELNAQGLLHVFVRVGVESTLERSQITRDHMGQLLYQLVQSEKLSKQDFFKGFSE 1350
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 2217271878  1351 TLELADDMAIDIPHIWLYLAELVTPMLKEGGIS 1383
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1543-1627 2.61e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.30  E-value: 2.61e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  1543 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqnGKGVALKSVTAFFT 1622
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 2217271878  1623 WLREA 1627
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1556-1632 9.34e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 9.34e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217271878 1556 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQnGKGVALKSVTAFFTWLREAEEESE 1632
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1473-1599 2.20e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 85.99  E-value: 2.20e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1473 EELYKRLEKLIIEDKANDEQIFDWVEANLDEIQMSSPTFLRALMTAVCKAAIIADSSTF---RVDTAVIKQRVPILLKYL 1549
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217271878 1550 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1599
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1513-1632 1.44e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are similar in sequence and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 76.14  E-value: 1.44e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1513 RALMTAVCK-AAIIADSSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1588
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2217271878 1589 VISEDAFYKWESSKDPAEQNGKGVALKSVTAFFTWLREAEEESE 1632
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1491-1632 1.74e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 55.31  E-value: 1.74e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878 1491 EQIFDWVEANLDEIQMSSptfLRALMTAV-------CKAAIIADSSTFRVDTA-VIKQRVPILLKYLDSDtekelQALYA 1562
Cdd:cd11561      9 DELGEFLKKNKDESGLSE---LKEILKEAerldvvkDKAVLVLAEVLFDENIVkEIKKRKALLLKLVTDE-----KAQKA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217271878 1563 LQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKW--ESSKD--PAEQNGKgvALKSVTAFFTWLREAEEESE 1632
Cdd:cd11561     81 LLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKWyeKVSKKyvSKEKSKK--VRKAAEPFVEWLEEAEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1584-1630 8.48e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 45.28  E-value: 8.48e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2217271878 1584 LYDEEVISEDAFYKWesSKDPAEQNGKGVALKSVTAFFTWLREAEEE 1630
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03378 PHA03378
EBNA-3B; Provisional
2-276 1.32e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878    2 NSQPQTRSPPSRTVPIH------CTDNWKRRKVLEQTP-VYRSLAGRGWIKYCIFAAGPRPPHHQFFQRPQIQPPRATIP 74
Cdd:PHA03378   618 TSAPRQWPMPLRPIPMRplrmqpITFNVLVFPTPHQPPqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPP 697
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   75 NSSPS-IRPGAQTPTAVYQANQhimmvnhLPMPYPVPQGPQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDF 153
Cdd:PHA03378   698 PRAPTpMRPPAAPPGRAQRPAA-------ATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAA 770
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  154 PNAyGTPFYPSQ--PVYQSAPIIVPTQQQPPPAKREKKTIRIRD-PNQGG--KDITEEIMSGGGSRN------------- 215
Cdd:PHA03378   771 PGA-PTPQPPPQapPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAaPGQQGptKQILRQLLTGGVKRGrpslkkpaalerq 849
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217271878  216 ----PTPPIGRPTSTPT-------PPQQLPSQVPehspvvyGTVESAHLAASTPVTAASDQKQEEKPKPDPV 276
Cdd:PHA03378   850 aaagPTPSPGSGTSDKIvqapvfyPPVLQPIQVM-------RQLGSVRAAAASTVTQAPTEYTGERRGVGPM 914
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
53-393 2.87e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.91  E-value: 2.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   53 PRPPHHQFFQRPQIQPPRatIPNSSPSIRPGAQTPTAVYQANQhimmvnhlPMPYPVPQGPQYCIPQYRHSGPPYVGPPQ 132
Cdd:pfam03154  224 TAAPHTLIQQTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPV 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  133 QYPVQPPGPGPFYPGPGPGDFPNAYG-------TPfyPSQPVYQSAPiivPTQQQP-PPAKREKKTIRIRDPNQGGKDIT 204
Cdd:pfam03154  294 PPQPFPLTPQSSQSQVPPGPSPAAPGqsqqrihTP--PSQSQLQSQQ---PPREQPlPPAPLSMPHIKPPPTTPIPQLPN 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  205 EEIMSGGGSRNPTPPIGRPTSTPTPPQQLP-SQVPEHSPvvygtvESAHlaaSTPVTAASDQKQEEKPKPDPVLKSPSPV 283
Cdd:pfam03154  369 PQSHKHPPHLSGPSPFQMNSNLPPPPALKPlSSLSTHHP------PSAH---PPPLQLMPQSQQLPPPPAQPPVLTQSQS 439
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  284 LrlvlsgekKEQEGQTSETTAIVSIAelPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCelSSPREDTIPIP 363
Cdd:pfam03154  440 L--------PPPAASHPPTSGLHQVP--SQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPS--SASVSSSGPVP 507
                          330       340       350
                   ....*....|....*....|....*....|
gi 2217271878  364 SLTSCteTSDPLPTNENDDDICKKPCSVAP 393
Cdd:pfam03154  508 AAVSC--PLPPVQIKEEALDEAEEPESPPP 535
PRK10263 PRK10263
DNA translocase FtsK; Provisional
69-345 3.37e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 3.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   69 PRATIPNSSPSIR----PGAQTPTAVYQanqhimmvnhlPMPYPVPQGPQYCIPQYRHSGP---PYvgppqqypvqppgp 141
Cdd:PRK10263   347 ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPV-------------- 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  142 gpfypgpGPGDFPNAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsggGSRNPTPPIG 221
Cdd:PRK10263   402 -------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ-----STYQTEQTYQ 469
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  222 RPtsTPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLKS-----PSPVlrlvlsgekKEQE 296
Cdd:PRK10263   470 QP--AAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAwyqpiPEPV---------KEPE 538
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217271878  297 GQTSETTAIVSIAELPLPPSPtTVSSVA---RSTIAAPTSSALSSQPIFTTA 345
Cdd:PRK10263   539 PIKSSLKAPSVAAVPPVEAAA-AVSPLAsgvKKATLATGAAATVAAPVFSLA 589
PRK11901 PRK11901
hypothetical protein; Reviewed
163-290 6.99e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 43.90  E-value: 6.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  163 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPPIGRPTSTPTPPQQL 233
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  234 PSQVPEHSPVVYGT--VESAHLAASTPVTAASDQKQEEKPKPDPVLKS-PSPVLRLVLSG 290
Cdd:PRK11901   193 AETHPTPPQKPATKkpAVNHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PHA03247 PHA03247
large tegument protein UL36; Provisional
53-344 7.99e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 7.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   53 PRPPHHQFFQR-------PQIQPPRATIPNSSPSIRPGAQTPTAVyqanqhimmVNHLPMPYPVPQGPQYCIPQYRHSGP 125
Cdd:PHA03247  2575 PRPSEPAVTSRarrpdapPQSARPRAPVDDRGDPRGPAPPSPLPP---------DTHAPDPPPPSPSPAANEPDPHPPPT 2645
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  126 PYVGPPQQYPVQPPGPGPFYPGPGPGDFPNAYGTPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTirirdpnqggkdiTE 205
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEP-------------AP 2712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  206 EIMSGGGSRNPTPPIGRPTSTPTPPQQLPSQVPEhSPVVYGTvESAHLAASTPVTAASDQKQEEKPKPDPVLKSPSPVLR 285
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPA-GPATPGG-PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2217271878  286 LVLSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTT 344
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
159-345 8.14e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 44.15  E-value: 8.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  159 TPFYPSQPVYQSAPIIVPTQQQPPPAKR-------EKKTIRIRdPNQggkditeeIMSGGGS-------RNPTPPIGRPT 224
Cdd:cd22540     47 TPPAPPQPTPRKLVPIKPAPLPLGPGKNsigflsaKGNIIQLQ-GSQ--------LSSSAPGgqqvfaiQNPTMIIKGSQ 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  225 STPTPPQQL---PSQVPEHSPVVYGTVE---SAHLAASTPVTAASDQKQEEKP---KPDPVLKS---------PSPVLRL 286
Cdd:cd22540    118 TRSSTNQQYqisPQIQAAGQINNSGQIQiipGTNQAIITPVQVLQQPQQAHKPvpiKPAPLQTSntnsaslqvPGNVIKL 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217271878  287 VLSGEKKEQEGQTSETTAIVSIAELPLPPSPTTVSSVAR--STIAAPTSSALSSQP----IFTTA 345
Cdd:cd22540    198 QSGGNVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRkkSAQAAQPAVTVAEQVetvlIETTA 262
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
154-357 9.26e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 43.50  E-value: 9.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  154 PNAYGTPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPTSTPTPPQQL 233
Cdd:pfam05539  181 PTEVSHPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRG 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  234 PSQVPEHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVLKSPSPVLRLVLSGEKKEQEGQTSETtaivsia 309
Cdd:pfam05539  243 PSGSPQHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPT------- 302
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2217271878  310 elplpPSPTTVSSVARStiaAPTSSALSSQPIFTTAIDDRCELSSPRE 357
Cdd:pfam05539  303 -----PRPTATTQSGSS---PPHSSPPGVQANPTTQNLVDCKELDPPK 342
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-376 1.13e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878    4 QPQTRSPPSRTVPihctdnwKRRKVLEQTPVYRSLAGRGWIKYcifAAGPRPPHHQFFQRPqiqPPRATIPNSSP-SIRP 82
Cdd:PHA03247  2651 RPRDDPAPGRVSR-------PRRARRLGRAAQASSPPQRPRRR---AARPTVGSLTSLADP---PPPPPTPEPAPhALVS 2717
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878   83 GAQTPTAVYQANQHIMMVNHLPMPYPVPQGPQYCIPQYRHSGPPYVGPPQQYPVqppgpgpfypgpgpgdfPNAYGTPFY 162
Cdd:PHA03247  2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAP-----------------PAAPAAGPP 2780
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  163 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQggkdiTEEIMSGGGSRNPTPPIGRPTSTPTPPQQLPSQVPEHSP 242
Cdd:PHA03247  2781 RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA-----ALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  243 VVYG---------------TVESAHLAASTPVTAASDQKQEEKPKPDPVLKSPSPVLRLVLSGEKKEQEGQTSETTAIVS 307
Cdd:PHA03247  2856 VAPGgdvrrrppsrspaakPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPP 2935
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217271878  308 IAELPLPPSPTTVSSVA---RSTIAAPTSSALSSQPIFTTaiddRCELSSPReDTIPIPSLTSCTETSDPLP 376
Cdd:PHA03247  2936 PPRPQPPLAPTTDPAGAgepSGAVPQPWLGALVPGRVAVP----RFRVPQPA-PSREAPASSTPPLTGHSLS 3002
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
166-390 9.52e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  166 PVYQSAPIIVPTQQQPPPAKREKKTIRirDPnqggkditEEIMSGGGSRNPTPPIGRPT-------STPTPPQQLPSQVP 238
Cdd:PTZ00449   563 PAKEHKPSKIPTLSKKPEFPKDPKHPK--DP--------EEPKKPKRPRSAQRPTRPKSpklpellDIPKSPKRPESPKS 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  239 EHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVLK-------------SPSPVLRLVLSGEKKEQEGQT-SETTA 304
Cdd:PTZ00449   633 PKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKekfyddyldaaakSKETKTTVVLDESFESILKETlPETPG 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217271878  305 IVSIAELPLPPSPTTVSSVARSTIAAPTSSALSSQPIFTTAIDDRCELSSPREDTIPIPSLTSCTETSDPLPTNENDDDI 384
Cdd:PTZ00449   713 TPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEA 792

                   ....*.
gi 2217271878  385 CKKPCS 390
Cdd:PTZ00449   793 MKRPDS 798
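
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally flagged "[Multi-domain]". The following is a minimal Python sketch for pulling those fields out of saved page text. The helper name parse_hit_summary and the regular expression are illustrative assumptions based only on the lines shown on this page; they are not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
# The "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>\d+(?:\.\d+)?)"
    r"\s+E-value:\s*(?P<e_value>[0-9.eE+-]+)"
)


def parse_hit_summary(line):
    """Hypothetical helper: return the summary fields as a dict, or None."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Summary line copied from the W2_eIF5C_like (cd11560) hit above.
    line = ("Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  "
            "Bit Score: 45.28  E-value: 8.48e-05")
    print(parse_hit_summary(line))
```

The sketch returns None for lines that do not match, so it can be applied line by line to the whole page without special-casing the alignment rows.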
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
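
The E-value threshold of 0.01 acts as a cutoff: hits whose E-value exceeds it are not listed. The Python sketch below applies that cutoff to a few accession, interval and E-value triples copied from this page. The names hits and passes_threshold are illustrative, and treating a value of exactly 0.01 as passing is an assumption, since the page does not state whether the comparison is inclusive.

```python
# From "E-value threshold: 0.01" in the search parameters.
E_VALUE_THRESHOLD = 0.01

# (accession, interval, E-value) copied from hits listed on this page.
hits = [
    ("cd11559", "1477-1605", 6.07e-48),
    ("pfam02847", "1271-1383", 4.52e-37),
    ("cd11560", "1584-1630", 8.48e-05),
    ("PHA03378", "2-276", 1.32e-04),
    ("PTZ00449", "166-390", 9.52e-03),
]


def passes_threshold(e_value, threshold=E_VALUE_THRESHOLD):
    """Hypothetical helper: True if the hit would be kept (assumed inclusive)."""
    return e_value <= threshold


kept = [hit for hit in hits if passes_threshold(hit[2])]

if __name__ == "__main__":
    for accession, interval, e_value in kept:
        print(f"{accession}\t{interval}\t{e_value:.2e}")
```

Every hit in the sample is kept, consistent with all of them appearing on the page; the weakest one, PTZ00449 at 9.52e-03, sits just under the cutoff.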
