NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|125656165|ref|NP_001075019|]
View 

THO complex subunit 2 [Homo sapiens]

Protein Classification

Thoc2 and Tho2 domain-containing protein( domain architecture ID 13162558)

protein containing domains THOC2_N, Thoc2, and Tho2

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
874-1173 1.77e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


:

Pssm-ID: 463251  Cd Length: 304  Bit Score: 402.00  E-value: 1.77e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   874 DDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHV 949
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   950 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1027
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1028 VASCTENEASRYGRFLCCMLETVTRWHSDRATYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1104
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 125656165  1105 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1173
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
THOC2_N super family cl24644
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
11-566 2.14e-87

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


The actual alignment was detected with superfamily member pfam16134:

Pssm-ID: 465032  Cd Length: 614  Bit Score: 298.77  E-value: 2.14e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    11 EWIKNWEKSGRGEflhLCRILSENKSHDSSTyrDFQQALYELSYHVIKGNLKHEQASNVLSDI-SEFREDMPSILADVFC 89
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDED--ELSDLFQELIRSVLDGRLDPEDAGSFLKEIiKEEPTDSSEDVAKLFL 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    90 ilDIEtNCLEEKSKRDYFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREEN 169
Cdd:pfam16134   78 --DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREES 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   170 EGYAKLIAELgQDLSGSITSDLI---LENIKSLIGCFNLDPNRVLDVILEVFECR-PEHDDFFISLL------------- 232
Cdd:pfam16134  152 EGYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVKFLrasswwprteesd 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   233 --ESYMSMCEP--QTLCHILGFKFKFYQ-EPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNcIMDEHKREI-AEA 306
Cdd:pfam16134  231 wiSSTKTLPPGgnRVAAQLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDE-EMEALKEEYkKEL 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   307 KQIVRK-----LTMV-VLS-------------SEKMDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQHAQNIMDQ 367
Cdd:pfam16134  310 EEESMEgganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   368 MpPYYAASHKLIALAICKLIHITIEPLYR-----------RVGVPKGAK-----GSPVNALQNKRAPK------------ 419
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNIvrldeNPPRRLLRWPKTDKpffdlgtkyrfy 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   420 ---------QAESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEfqsDGSKQEDKEKTevilsclLSITDQV 490
Cdd:pfam16134  469 ydewkdnlpVCQTVDDLFTLSHEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRW-------IDYLRRF 538
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 125656165   491 LLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 566
Cdd:pfam16134  539 IFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
568-642 2.63e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TRE complex is comprised of the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), Thoc2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


:

Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.69  E-value: 2.63e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 125656165   568 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 642
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg super family cl36941
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
1448-1556 1.22e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


The actual alignment was detected with superfamily member TIGR01642:

Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1448 SKSKEREMDKkDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTM-----GVSKHKSESPCESPY 1522
Cdd:TIGR01642    8 EREKSRGRDR-DRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLryssvRRSRDRPRRRSRSVR 86
                           90       100       110
                   ....*....|....*....|....*....|....
gi 125656165  1523 PNEKDKEKNKSKSSGKEKGSDSFKSEKMDKISSG 1556
Cdd:TIGR01642   87 SIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKPPG 120
PTZ00121 super family cl31754
MAEBL; Provisional
1291-1593 1.08e-03

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.98  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1291 EARVLGKDGKEKPK-EERPNKDEKARETKE-RTPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRErd 1368
Cdd:PTZ00121 1458 KAEEAKKKAEEAKKaDEAKKKAEEAKKADEaKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK-- 1535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1369 iAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIPEPEREQKRRKIdthpSPSHSSTVKDSLIELKESSAKLYinhtpppls 1448
Cdd:PTZ00121 1536 -ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----ALRKAEEAKKAEEARIEEVMKLY--------- 1601
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1449 kSKEREMDKKDLDKSRERSRERE--KKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEK 1526
Cdd:PTZ00121 1602 -EEEKKMKAEEAKKAEEAKIKAEelKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1527 DK---EKNKSKSSGKEKGSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR 1593
Cdd:PTZ00121 1681 KKaeeDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
 
Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
874-1173 1.77e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


Pssm-ID: 463251  Cd Length: 304  Bit Score: 402.00  E-value: 1.77e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   874 DDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHV 949
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   950 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1027
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1028 VASCTENEASRYGRFLCCMLETVTRWHSDRATYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1104
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 125656165  1105 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1173
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
THOC2_N pfam16134
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
11-566 2.14e-87

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


Pssm-ID: 465032  Cd Length: 614  Bit Score: 298.77  E-value: 2.14e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    11 EWIKNWEKSGRGEflhLCRILSENKSHDSSTyrDFQQALYELSYHVIKGNLKHEQASNVLSDI-SEFREDMPSILADVFC 89
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDED--ELSDLFQELIRSVLDGRLDPEDAGSFLKEIiKEEPTDSSEDVAKLFL 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    90 ilDIEtNCLEEKSKRDYFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREEN 169
Cdd:pfam16134   78 --DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREES 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   170 EGYAKLIAELgQDLSGSITSDLI---LENIKSLIGCFNLDPNRVLDVILEVFECR-PEHDDFFISLL------------- 232
Cdd:pfam16134  152 EGYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVKFLrasswwprteesd 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   233 --ESYMSMCEP--QTLCHILGFKFKFYQ-EPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNcIMDEHKREI-AEA 306
Cdd:pfam16134  231 wiSSTKTLPPGgnRVAAQLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDE-EMEALKEEYkKEL 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   307 KQIVRK-----LTMV-VLS-------------SEKMDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQHAQNIMDQ 367
Cdd:pfam16134  310 EEESMEgganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   368 MpPYYAASHKLIALAICKLIHITIEPLYR-----------RVGVPKGAK-----GSPVNALQNKRAPK------------ 419
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNIvrldeNPPRRLLRWPKTDKpffdlgtkyrfy 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   420 ---------QAESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEfqsDGSKQEDKEKTevilsclLSITDQV 490
Cdd:pfam16134  469 ydewkdnlpVCQTVDDLFTLSHEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRW-------IDYLRRF 538
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 125656165   491 LLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 566
Cdd:pfam16134  539 IFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
568-642 2.63e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TRE complex is comprised of the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), Thoc2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.69  E-value: 2.63e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 125656165   568 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 642
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
1448-1556 1.22e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1448 SKSKEREMDKkDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTM-----GVSKHKSESPCESPY 1522
Cdd:TIGR01642    8 EREKSRGRDR-DRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLryssvRRSRDRPRRRSRSVR 86
                           90       100       110
                   ....*....|....*....|....*....|....
gi 125656165  1523 PNEKDKEKNKSKSSGKEKGSDSFKSEKMDKISSG 1556
Cdd:TIGR01642   87 SIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKPPG 120
PTZ00121 PTZ00121
MAEBL; Provisional
1291-1593 1.08e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.98  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1291 EARVLGKDGKEKPK-EERPNKDEKARETKE-RTPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRErd 1368
Cdd:PTZ00121 1458 KAEEAKKKAEEAKKaDEAKKKAEEAKKADEaKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK-- 1535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1369 iAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIPEPEREQKRRKIdthpSPSHSSTVKDSLIELKESSAKLYinhtpppls 1448
Cdd:PTZ00121 1536 -ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----ALRKAEEAKKAEEARIEEVMKLY--------- 1601
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1449 kSKEREMDKKDLDKSRERSRERE--KKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEK 1526
Cdd:PTZ00121 1602 -EEEKKMKAEEAKKAEEAKIKAEelKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1527 DK---EKNKSKSSGKEKGSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR 1593
Cdd:PTZ00121 1681 KKaeeDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
EIF4E-T pfam10477
Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter ...
1348-1593 1.14e-03

Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter protein for shuttling the mRNA cap-binding protein EIF4E protein, targeting it for nuclear import. EIF4E-T contains several key binding domains including two functional leucine-rich NESs (nuclear export signals) between residues 438-447 and 613-638 in the human protein. The other two binding domains are an EIF4E-binding site, between residues 27-42 in Q9EST3, and a bipartite NLS (nuclear localization signals) between 194-211, and these lie in family EIF4E-T_N. EIF4E is the eukaryotic translation initiation factor 4E that is the rate-limiting factor for cap-dependent translation initiation.


Pssm-ID: 371079  Cd Length: 646  Bit Score: 43.46  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1348 NAESKSTQEREREKEPSRERDIAKEMKSKENVKGGEKTpVSGSLKSPVPRSDIPEPEREQKRRKIDTHPSPSHSSTVKDS 1427
Cdd:pfam10477   46 NKRSGLVSSRRALRNRERANNYYQRFDPRDRVLGHKES-QDDVVLSPQRRSFGGGCHVVPQDHEQPVASSPTFSSSREGS 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1428 LIELKE---SSAKLYINHTPPPLSKSK--EREMDKKDLDKSRERSRERekkDEKDRKERKRDHSNNDREVPPDLTKRRKE 1502
Cdd:pfam10477  125 PLNLQErriGSGRLLPRDYKNQPADEAriEREKDRSPNGSGSSRQDEE---DQRNNQQNLRTFSGRHNERGRDGDRRSKR 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1503 ENGTMGVSKHKSESPCESPYPNEkDKEKNKSKSSGKEKGS----DSFKSEKMDKISSGGKkeSRHDKEkiekkekrDSSG 1578
Cdd:pfam10477  202 NRRTFGRFQYDGKKPVDRGRRIS-NKDKFSFQDRGKRYNSrrrnDSYTEEEPEWFSAGPT--SQHETI--------ELHG 270
                          250
                   ....*....|....*
gi 125656165  1579 GKEEKKHHKSSDKHR 1593
Cdd:pfam10477  271 FDDKEEDNKRRSRTR 285
 
Name Accession Description Interval E-value
Tho2 pfam11262
Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex ...
874-1173 1.77e-128

Transcription factor/nuclear export subunit protein 2; THO and TREX form a eukaryotic complex which functions in messenger ribonucleoprotein metabolism and plays a role in preventing the transcription-associated genetic instability. Tho2, along with four other subunits forms THO


Pssm-ID: 463251  Cd Length: 304  Bit Score: 402.00  E-value: 1.77e-128
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   874 DDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQMKAI----DDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHV 949
Cdd:pfam11262    1 EYISPEFYVTFWQLSLYDIYVPTESYEAEIERLKKQIRELsrdrSDMSRAGASKKKKEKKRLEALIDKLKEELKEHIEHV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   950 QRVLQRLKLEKDNWLLA-KSTKNETITKFLQLCIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSD-IIYT 1027
Cdd:pfam11262   81 EKTRKRLQKEKDSWFPGsKAKKNALIDAFLQHCILPRALLSPADALYCAKFIKLLHELGTPNFSTLLLYDRLFKDnLRSL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1028 VASCTENEASRYGRFLCCMLETVTRWHSDRATYEKEC---GNYPGFLTILRAtgfDGGNKADQLDYENFRHVVHKWHYKL 1104
Cdd:pfam11262  161 IFSCTEREAENLGRFLNEILKDLSRWHADEAVYEKEAlgkKNLPGFATKFND---DDGKPTDFLSYEDFRRLLYKWHKKL 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 125656165  1105 TKASVHCLETGEYTHIRNILIVLTKILPWYPKVLNLGQALERRVHKICqeEKEKRPDLYALAMGYSGQL 1173
Cdd:pfam11262  238 TSALKSCLESGEYMHIRNAIIVLKKILPVFPAVDFMGEALLKAVEKLA--EREKREDLKVLANSYLGLL 304
THOC2_N pfam16134
THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit ...
11-566 2.14e-87

THO complex subunit 2 N-terminus; This family represents the N-terminus of THO complex subunit 2.


Pssm-ID: 465032  Cd Length: 614  Bit Score: 298.77  E-value: 2.14e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    11 EWIKNWEKSGRGEflhLCRILSENKSHDSSTyrDFQQALYELSYHVIKGNLKHEQASNVLSDI-SEFREDMPSILADVFC 89
Cdd:pfam16134    3 ERINNWGGSGRQE---LIEQLKLARNDEDED--ELSDLFQELIRSVLDGRLDPEDAGSFLKEIiKEEPTDSSEDVAKLFL 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165    90 ilDIEtNCLEEKSKRDYFTQLVLACLylVSDTVLKERLDPETLESLGLIKQsQQFNQKSVKIKTKLFYKQQKFNLLREEN 169
Cdd:pfam16134   78 --DVL-STFSDSEDMLALRDLLAATI--ISPSLMRLELDTKLLQELGLVRD-TTFHRMLIRKSTNLLYRQKKYNLLREES 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   170 EGYAKLIAELgQDLSGSITSDLI---LENIKSLIGCFNLDPNRVLDVILEVFECR-PEHDDFFISLL------------- 232
Cdd:pfam16134  152 EGYSKLITEL-FTTSDNDTFEKVdytFERVKALIGKFDLDPGRVLDVILDVFAAFlVKHYRFFVKFLrasswwprteesd 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   233 --ESYMSMCEP--QTLCHILGFKFKFYQ-EPNGETPSSLYRVAAVLLQFNLIDLDDLYVHLLPADNcIMDEHKREI-AEA 306
Cdd:pfam16134  231 wiSSTKTLPPGgnRVAAQLLGFKLRFYSsDADDELPENLIYLAALLIKEGFISFGDLYPHLSPDDE-EMEALKEEYkKEL 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   307 KQIVRK-----LTMV-VLS-------------SEKMDEREKEKEKEEEKVEKPPDNQKLGLLEALLKIGDWQHAQNIMDQ 367
Cdd:pfam16134  310 EEESMEgganaLAMAgALPddddtlppakedeAAASKKAPTKEEEKKEKEPEPKDNQKIQLLKSLLAIGALPESLFILGR 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   368 MpPYYAASHKLIALAICKLIHITIEPLYR-----------RVGVPKGAK-----GSPVNALQNKRAPK------------ 419
Cdd:pfam16134  390 Y-PWLALVDPEIPELIHRILEHSIEPLYEstrsvplssrpESGLPKGNIvrldeNPPRRLLRWPKTDKpffdlgtkyrfy 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165   420 ---------QAESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEfqsDGSKQEDKEKTevilsclLSITDQV 490
Cdd:pfam16134  469 ydewkdnlpVCQTVDDLFTLSHEFLNLIGVNLGQDPSLLSKLCRIGVKDLEN---SDESEENRDRW-------IDYLRRF 538
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 125656165   491 LLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKENVKPSGR 566
Cdd:pfam16134  539 IFPALSLLEANPIVVDEVYELLKLFPFETRYFLYGEWYEKLTKRNPLIKIAFNKAEKETKDILKRLSKDNIRPMAR 614
Thoc2 pfam11732
Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- ...
568-642 2.63e-39

Transcription- and export-related complex subunit; The THO/TREX complex is the transcription- and export-related complex associated with spliceosomes that preferentially deal with spliced mRNAs as opposed to unspliced mRNAs. Thoc2 plays a role in RNA polymerase II (RNA pol II)-dependent transcription and is required for the stability of DNA repeats. In humans, the TRE complex is comprised of the exon-junction-associated proteins Aly/REF and UAP56 together with the THO proteins THOC1 (hHpr1/p84), Thoc2 (hRlr1), THOC3 (hTex1), THOC5 (fSAP79), THOC6 (fSAP35), and THOC7 (fSAP24). Although much evidence indicates that the function of the TREX complex as an adaptor between the mRNA and components of the export machinery is conserved among eukaryotes, in Drosophila the majority of mRNAs can be exported from the nucleus independently of the THO complex.


Pssm-ID: 463334  Cd Length: 75  Bit Score: 140.69  E-value: 2.63e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 125656165   568 IGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKHDDTTISSWLQSL 642
Cdd:pfam11732    1 LAKLSHSNPLIVFEVALNQIESYDNLIEPVVDALKYFTDLGYDVLTYCLLERLTNPGRSRVKDDGTNISPWLQSL 75
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
1448-1556 1.22e-04

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 46.42  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1448 SKSKEREMDKkDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTM-----GVSKHKSESPCESPY 1522
Cdd:TIGR01642    8 EREKSRGRDR-DRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSRSPRSLryssvRRSRDRPRRRSRSVR 86
                           90       100       110
                   ....*....|....*....|....*....|....
gi 125656165  1523 PNEKDKEKNKSKSSGKEKGSDSFKSEKMDKISSG 1556
Cdd:TIGR01642   87 SIEQHRRRLRDRSPSNQWRKDDKKRSLWDIKPPG 120
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
1448-1549 4.72e-04

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 44.52  E-value: 4.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1448 SKSKEREMDK---------KDLDKSRERSREREKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGtmgvskhksespc 1518
Cdd:TIGR01622    2 YRDRERERLRdsssagdrdRRRDKGRERSRDRSRDRERSRSRRRDRHRDRDYYRGRERRSRSRRPNR------------- 68
                           90       100       110
                   ....*....|....*....|....*....|.
gi 125656165  1519 espYPNEKDKEKNKSKSSGKEKGSDSFKSEK 1549
Cdd:TIGR01622   69 ---RYRPREKRRRRGDSYRRRRDDRRSRREK 96
PTZ00121 PTZ00121
MAEBL; Provisional
1291-1593 1.08e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.98  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1291 EARVLGKDGKEKPK-EERPNKDEKARETKE-RTPKSDKEKEKFKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRErd 1368
Cdd:PTZ00121 1458 KAEEAKKKAEEAKKaDEAKKKAEEAKKADEaKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKK-- 1535
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1369 iAKEMKSKENVKGGEKTPVSGSLKSPVPRSDIPEPEREQKRRKIdthpSPSHSSTVKDSLIELKESSAKLYinhtpppls 1448
Cdd:PTZ00121 1536 -ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM----ALRKAEEAKKAEEARIEEVMKLY--------- 1601
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1449 kSKEREMDKKDLDKSRERSRERE--KKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEK 1526
Cdd:PTZ00121 1602 -EEEKKMKAEEAKKAEEAKIKAEelKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEA 1680
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165 1527 DK---EKNKSKSSGKEKGSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR 1593
Cdd:PTZ00121 1681 KKaeeDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK 1750
EIF4E-T pfam10477
Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter ...
1348-1593 1.14e-03

Nucleocytoplasmic shuttling protein for mRNA cap-binding EIF4E; EIF4E-T is the transporter protein for shuttling the mRNA cap-binding protein EIF4E protein, targeting it for nuclear import. EIF4E-T contains several key binding domains including two functional leucine-rich NESs (nuclear export signals) between residues 438-447 and 613-638 in the human protein. The other two binding domains are an EIF4E-binding site, between residues 27-42 in Q9EST3, and a bipartite NLS (nuclear localization signals) between 194-211, and these lie in family EIF4E-T_N. EIF4E is the eukaryotic translation initiation factor 4E that is the rate-limiting factor for cap-dependent translation initiation.


Pssm-ID: 371079  Cd Length: 646  Bit Score: 43.46  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1348 NAESKSTQEREREKEPSRERDIAKEMKSKENVKGGEKTpVSGSLKSPVPRSDIPEPEREQKRRKIDTHPSPSHSSTVKDS 1427
Cdd:pfam10477   46 NKRSGLVSSRRALRNRERANNYYQRFDPRDRVLGHKES-QDDVVLSPQRRSFGGGCHVVPQDHEQPVASSPTFSSSREGS 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1428 LIELKE---SSAKLYINHTPPPLSKSK--EREMDKKDLDKSRERSRERekkDEKDRKERKRDHSNNDREVPPDLTKRRKE 1502
Cdd:pfam10477  125 PLNLQErriGSGRLLPRDYKNQPADEAriEREKDRSPNGSGSSRQDEE---DQRNNQQNLRTFSGRHNERGRDGDRRSKR 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 125656165  1503 ENGTMGVSKHKSESPCESPYPNEkDKEKNKSKSSGKEKGS----DSFKSEKMDKISSGGKkeSRHDKEkiekkekrDSSG 1578
Cdd:pfam10477  202 NRRTFGRFQYDGKKPVDRGRRIS-NKDKFSFQDRGKRYNSrrrnDSYTEEEPEWFSAGPT--SQHETI--------ELHG 270
                          250
                   ....*....|....*
gi 125656165  1579 GKEEKKHHKSSDKHR 1593
Cdd:pfam10477  271 FDDKEEDNKRRSRTR 285
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH