NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|578836633|ref|XP_006724099|]
View 

SR-related and CTD-associated factor 4 isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 3.26e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


:

Pssm-ID: 340802  Cd Length: 131  Bit Score: 279.93  E-value: 3.26e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 578836633   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
500-580 8.42e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


:

Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 8.42e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  500 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 579
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 578836633  580 G 580
Cdd:cd12461    81 G 81
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
133-281 5.81e-05

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK07994:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 647  Bit Score: 47.17  E-value: 5.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  133 DMAAGTSNAAPVAENVTNNEGSPPPPVKVSSEPPTQATPNSVPAV-PQLPSSDAFAAVAQLFQTTQGQQLQQILQTFQQP 211
Cdd:PRK07994  369 EVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVpLPETTSQLLAARQQLQRAQGATKAKKSEPAAASR 448
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578836633  212 PKPQSPALDNAVMAQVQAITAQLKTTPTQPSEQKAAFPPPEQKT--AFDKKLLDRFDYDDEPE----AVEESKKED 281
Cdd:PRK07994  449 ARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEpvATPKALKKALEHEKTPElaakLAAEAIERD 524
PRK12678 super family cl36163
transcription termination factor Rho; Provisional
958-1027 9.65e-05

transcription termination factor Rho; Provisional


The actual alignment was detected with superfamily member PRK12678:

Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.44  E-value: 9.65e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  958 RNDNRQQFNSGRDQERFGRRSFGNRVEN-DRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERNRRS 1027
Cdd:PRK12678  196 RGRDGDDRDRRDRREQGDRREERGRRDGgDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRF 266
 
Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 3.26e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340802  Cd Length: 131  Bit Score: 279.93  E-value: 3.26e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 578836633   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
500-580 8.42e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 8.42e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  500 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 579
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 578836633  580 G 580
Cdd:cd12461    81 G 81
RPR smart00582
domain present in proteins, which are involved in regulation of nuclear pre-mRNA;
6-134 1.08e-36

domain present in proteins, which are involved in regulation of nuclear pre-mRNA;


Pssm-ID: 214731  Cd Length: 124  Bit Score: 134.33  E-value: 1.08e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633      6 AFNQELFSLMdmkppISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTD-KDVF 84
Cdd:smart00582    1 AFEQKLESLN-----NSQESIQTLTKWAIEHASHAKEIVELWEKYIKKAPVPRKLPLLYLLDSIVQNSKRKYGSEfGDEL 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 578836633     85 GPRFSKNITATFQYlylCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:smart00582   76 GPVFQDALRRVLGA---APEELKKKIRRLLNIWEERGIFPPEVLRPLREK 122
CID pfam04818
CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase ...
6-128 3.87e-30

CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase II. This domain is known as the CTD-interacting domain (CID).


Pssm-ID: 461442  Cd Length: 117  Bit Score: 115.38  E-value: 3.87e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633     6 AFNQELFSLMDmkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFG 85
Cdd:pfam04818    2 ALEKKLSSLNN-----SQESIQTLSKWILFHRKHAKAIVEVWEKYLKKAKPEKKLHLLYLANDVLQNSRKKGKSE---FA 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 578836633    86 PRFSKNITATFQYLYL-CPSEDKSKIVRVLNLWQKNGVFKIEII 128
Cdd:pfam04818   74 DAFEPVLPEAFASAYKkCDEKLKKKLERLLNIWEERNVFSPEVI 117
RRM smart00360
RNA recognition motif;
505-573 7.81e-12

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 61.84  E-value: 7.81e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578836633    505 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 573
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdketgkSKGFAFVEFESEEDAEKALEAL--NGKELDGRPLKV 73
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
506-572 3.49e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 51.46  E-value: 3.49e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633   506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIK 572
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRdetgrSKGFAFVEFEDEEDAEKAIEALN--GKELGGRELK 70
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
505-575 3.28e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 3.28e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633   505 TLWVGQLDKRTTQQDVASLLEEFGPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 575
Cdd:TIGR01628    2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVrvcrDSVTRRslGYGYVNFQNPADAERALETM---NFKrLGGKPIRIMW 76
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
133-281 5.81e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.17  E-value: 5.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  133 DMAAGTSNAAPVAENVTNNEGSPPPPVKVSSEPPTQATPNSVPAV-PQLPSSDAFAAVAQLFQTTQGQQLQQILQTFQQP 211
Cdd:PRK07994  369 EVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVpLPETTSQLLAARQQLQRAQGATKAKKSEPAAASR 448
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578836633  212 PKPQSPALDNAVMAQVQAITAQLKTTPTQPSEQKAAFPPPEQKT--AFDKKLLDRFDYDDEPE----AVEESKKED 281
Cdd:PRK07994  449 ARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEpvATPKALKKALEHEKTPElaakLAAEAIERD 524
PRK12678 PRK12678
transcription termination factor Rho; Provisional
958-1027 9.65e-05

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.44  E-value: 9.65e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  958 RNDNRQQFNSGRDQERFGRRSFGNRVEN-DRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERNRRS 1027
Cdd:PRK12678  196 RGRDGDDRDRRDRREQGDRREERGRRDGgDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRF 266
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
969-1060 2.26e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.83  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   969 RDQERFGRRSFGNRVENDRERYGNRNDDRDNSNRDRREwgrRSPDRDRHRD--------LEERNRRSSGHRDRERDSRDR 1040
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS---RSRRRDRHRDrdyyrgreRRSRSRRPNRRYRPREKRRRR 79
                           90       100
                   ....*....|....*....|...
gi 578836633  1041 ESRREKE---EARGKEKPEVTDR 1060
Cdd:TIGR01622   80 GDSYRRRrddRRSRREKPRARDG 102
 
Name Accession Description Interval E-value
CID_SFRS15_SCAF4 cd17005
CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor ...
5-135 3.26e-88

CID (CTD-Interacting Domain) of Splicing factor arginine serine rich 15; Splicing factor arginine serine rich 15 (SFRS15) is also called CTD-binding SR-like protein RA4 or SR-related and CTD-associated factor 4 (SCAF4). It may act to physically and functionally link transcription and pre-mRNA processing. SFRS15/SCAF4 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340802  Cd Length: 131  Bit Score: 279.93  E-value: 3.26e-88
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    5 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVF 84
Cdd:cd17005     1 NAFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGADKDVF 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 578836633   85 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17005    81 GPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 131
CID_SCAF8_like cd16983
CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8 and similar proteins; ...
6-135 2.32e-78

CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8 and similar proteins; This subfamily includes SR-related and CTD-associated factors 8 (SCAF8) and 4 (SCAF4), and similar proteins. SCAF4 is also called Splicing factor arginine serine rich 15 (SFRS15). Members may play roles in mRNA processing. Both SCAF4 and SCAF8 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340780  Cd Length: 131  Bit Score: 252.53  E-value: 2.32e-78
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    6 AFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVFG 85
Cdd:cd16983     2 EFNKELSSLLDSKPPVSKSKINAITKLAIKAIKFYKHVVQSVEKFIQKCKPEYKLPGLYVIDSIIRQSRHQYGKEKDVYA 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 578836633   86 PRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd16983    82 PRFAKNLSKTFLNLLKCPEKDKPKVKRVLNLWQKNGVFPKEIIQPLLDAA 131
CID_SCAF8 cd17004
CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8; SR-related and ...
6-135 1.23e-76

CID (CTD-Interacting Domain) of SR-related and CTD-associated factor 8; SR-related and CTD-associated factor 8 (SCAF8) is also called CDC5L complex-associated protein 7 (CCAP7) or RNA-binding motif protein 16 (RBM16). It may play a role in mRNA processing. SCAF8 contains a CTD-interacting domain (CID) at the amino terminus and a Ser/Arg-rich domain followed by an RNA recognition motif. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340801  Cd Length: 131  Bit Score: 248.02  E-value: 1.23e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    6 AFNQELFSLMDMKPPISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDKDVFG 85
Cdd:cd17004     2 TFNSELYSLNDYKPPISKAKMTQITKAAIKAIKFYKHVVQSVEKFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQEKDVFA 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 578836633   86 PRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDMA 135
Cdd:cd17004    82 PRFSNNIISTFQNLYRCPGDDKSKIVRVLNLWQKNNVFKSEIIQPLLDMA 131
RRM_SCAF4 cd12461
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and ...
500-580 8.42e-56

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4) and similar proteins; The CD corresponds to the RRM of SCAF4 (also termed splicing factor, arginine/serine-rich 15 or SFR15, or CTD-binding SR-like protein RA4) that belongs to a new class of SCAFs (SR-like CTD-associated factors). Although its biological function remains unclear, SCAF4 shows high sequence similarity to SCAF8 that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II) and may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF4 and SCAF8 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409894 [Multi-domain]  Cd Length: 81  Bit Score: 187.56  E-value: 8.42e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  500 SVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 579
Cdd:cd12461     1 SVCSTTLWVGQLDKRTTQQDVASLLEEFGQIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNK 80

                  .
gi 578836633  580 G 580
Cdd:cd12461    81 G 81
RRM_SCAF8 cd12462
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 8 (SCAF8) and ...
502-580 9.34e-44

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 8 (SCAF8) and similar proteins; This subgroup corresponds to the RRM of SCAF8 (also termed CDC5L complex-associated protein 7, or RNA-binding motif protein 16, or CTD-binding SR-like protein RA8), a nuclear matrix protein that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II). The pol II CTD plays a role in coupling transcription and pre-mRNA processing. SCAF8 co-localizes primarily with transcription sites that are enriched in nuclear matrix fraction, which is known to contain proteins involved in pre-mRNA processing. Thus, SCAF8 may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF8, together with SCAF4, represents a new class of SCAFs (SR-like CTD-associated factors). They contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409895 [Multi-domain]  Cd Length: 79  Bit Score: 152.92  E-value: 9.34e-44
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  502 CSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWALNKG 580
Cdd:cd12462     1 CSTTLWVGQVDKKATQQDLTNLFEEFGQIESINMIPPRGCAYVCMVHRQDAYRALQKLSTGSFKIGSKIIKIAWALNKG 79
RRM_SCAF4_SCAF8 cd12227
RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4), ...
502-580 1.13e-43

RNA recognition motif (RRM) found in SR-related and CTD-associated factor 4 (SCAF4), SR-related and CTD-associated factor 8 (SCAF8) and similar proteins; This subfamily corresponds to the RRM in a new class of SCAFs (SR-like CTD-associated factors), including SCAF4, SCAF8 and similar proteins. The biological role of SCAF4 remains unclear, but it shows high sequence similarity to SCAF8 (also termed CDC5L complex-associated protein 7, or RNA-binding motif protein 16, or CTD-binding SR-like protein RA8). SCAF8 is a nuclear matrix protein that interacts specifically with a highly serine-phosphorylated form of the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II (pol II). The pol II CTD plays a role in coupling transcription and pre-mRNA processing. In addition, SCAF8 co-localizes primarily with transcription sites that are enriched in nuclear matrix fraction, which is known to contain proteins involved in pre-mRNA processing. Thus, SCAF8 may play a direct role in coupling with both, transcription and pre-mRNA processing, processes. SCAF8 and SCAF4 both contain a conserved N-terminal CTD-interacting domain (CID), an atypical RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNPs (ribonucleoprotein domain), and serine/arginine-rich motifs.


Pssm-ID: 409674 [Multi-domain]  Cd Length: 77  Bit Score: 152.59  E-value: 1.13e-43
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  502 CSTTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWALNKG 580
Cdd:cd12227     1 CSTTLWVGHLSKKVTQEELKNLFEEYGEIQSIDMIPPRGCAYVCMKTRQDAHRALQKLK--NHKLRGKSIKIAWAPNKG 77
CID cd03562
CID (CTD-Interacting Domain) family; The CTD-Interacting Domain (CID) is present in several ...
6-134 2.06e-43

CID (CTD-Interacting Domain) family; The CTD-Interacting Domain (CID) is present in several eukaryotic RNA-processing factors including yeast proteins, Pcf11 and Nrd1, and vertebrate proteins, CTD-associated factors 8 (SCAF8) and Regulation of nuclear pre-mRNA domain-containing proteins (such as RPRD1 and RPRD2). Pcf11 is a conserved and essential subunit of the yeast cleavage factor IA, which is required for polyadenylation-dependent 3'-RNA processing and transcription termination. Nrd1 is implicated in polyadenylation-independent 3'-RNA processing. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340766  Cd Length: 123  Bit Score: 153.83  E-value: 2.06e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    6 AFNQELFSLMDMkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdKDVFG 85
Cdd:cd03562     2 AFNSKLEELSDL----SQQSITTLTKWAIHHIKHSRPIVTVIEREIRKCKPNRKLTFLYLIDSIIRNSKRK----GPEFT 73
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 578836633   86 PRFSKNITATFQYLY-LCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:cd03562    74 KDFSPVIVELFKHVYsETDEDCKKKLGRVLSIWEERNVFENSVLEQLKQA 123
RPR smart00582
domain present in proteins, which are involved in regulation of nuclear pre-mRNA;
6-134 1.08e-36

domain present in proteins, which are involved in regulation of nuclear pre-mRNA;


Pssm-ID: 214731  Cd Length: 124  Bit Score: 134.33  E-value: 1.08e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633      6 AFNQELFSLMdmkppISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTD-KDVF 84
Cdd:smart00582    1 AFEQKLESLN-----NSQESIQTLTKWAIEHASHAKEIVELWEKYIKKAPVPRKLPLLYLLDSIVQNSKRKYGSEfGDEL 75
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|
gi 578836633     85 GPRFSKNITATFQYlylCPSEDKSKIVRVLNLWQKNGVFKIEIIQPLLDM 134
Cdd:smart00582   76 GPVFQDALRRVLGA---APEELKKKIRRLLNIWEERGIFPPEVLRPLREK 122
CID pfam04818
CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase ...
6-128 3.87e-30

CID domain; This domain binds to the phosphorylated C-terminal domain (CTD) of RNA polymerase II. This domain is known as the CTD-interacting domain (CID).


Pssm-ID: 461442  Cd Length: 117  Bit Score: 115.38  E-value: 3.87e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633     6 AFNQELFSLMDmkppiSRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFG 85
Cdd:pfam04818    2 ALEKKLSSLNN-----SQESIQTLSKWILFHRKHAKAIVEVWEKYLKKAKPEKKLHLLYLANDVLQNSRKKGKSE---FA 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 578836633    86 PRFSKNITATFQYLYL-CPSEDKSKIVRVLNLWQKNGVFKIEII 128
Cdd:pfam04818   74 DAFEPVLPEAFASAYKkCDEKLKKKLERLLNIWEERNVFSPEVI 117
CID_Nrd1_like cd16984
CID (CTD-Interacting Domain) of Nrd1 and similar proteins; This subfamily includes ...
7-131 9.76e-14

CID (CTD-Interacting Domain) of Nrd1 and similar proteins; This subfamily includes Saccharomyces cerevisiae protein Nrd1, Schizosaccharomyces pombe Rpb7-binding protein Seb1, and similar proteins. Nrd1 cooperates with Nab3 and Sen1, also called the Nrd1-Nab3-Sen1 (NNS) complex, to terminate the transcription by RNA polymerase (Pol) II (RNAPII) of many noncoding RNAs (ncRNAs), including small nuclear RNAs (snRNAs), small nucleolar RNAs (snoRNAs), and cryptic unstable transcripts (CUTs). Schizosaccharomyces pombe Seb1 does not function in an NNS-like termination pathway but promotes polyadenylation site selection of coding and noncoding genes. It cotranscriptionally controls alternative polyadenylation. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. Nrd1 CID preferentially interacts with CTD phosphorylated at Ser5. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340781  Cd Length: 145  Bit Score: 69.56  E-value: 9.76e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633    7 FNQELFSLMDMKPP-ISRAKMILITKAAIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQ---FGTDKD 82
Cdd:cd16984     3 FEATLKSLQALKPPgVSGSKIKKLTDIAVDNVQSESQIVSKLYRYFKKAPPTHKLGVLYVVDSVVRAWIDQakkNGQSID 82
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633   83 VFGP---------RFSKNITATFQ-YLYLCPSEDKSKIVRVLNLWQKNGVFKIEIIQPL 131
Cdd:cd16984    83 SSAPdgtfaagvyKITELIESLMNdAIQSAPEDQKEKIKKLVDIWEKGNTFPAEMLNSI 141
RRM smart00360
RNA recognition motif;
505-573 7.81e-12

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 61.84  E-value: 7.81e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578836633    505 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 573
Cdd:smart00360    1 TLFVGNLPPDTTEEELRELFSKFGKVESVRLVRdketgkSKGFAFVEFESEEDAEKALEAL--NGKELDGRPLKV 73
CID_Pcf11 cd16982
CID (CTD-Interacting Domain) of Pcf11; Pcf11 is conserved across eukaryotes. The best studied ...
33-117 1.54e-11

CID (CTD-Interacting Domain) of Pcf11; Pcf11 is conserved across eukaryotes. The best studied protein is Saccharomyces cerevisiae Pcf11, also called protein 1 of CF I, an essential subunit of the cleavage factor IA (CFIA) complex which is required for polyadenylation-dependent pre-mRNA 3'-end processing and RNA polymerase (Pol) II (RNAP II) transcription termination. Human Pcf11, also referred to as pre-mRNA cleavage complex 2 protein Pcf11, has been shown to enhance degradation of RNAP II-associated nascent RNA and transcriptional termination. The family also includes plant PCFS4 (Pcf11-similar-4 protein or Polyadenylation and cleavage factor homolog 4) and Caenorhabditis elegans Polyadenylation and cleavage factor homolog 11. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. Pcf11 CID preferentially interacts with CTD phosphorylated at Ser2. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340779  Cd Length: 127  Bit Score: 62.58  E-value: 1.54e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   33 AIKAIKLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRqsrhqfgTDKDVFGPRFSKNITATFQYLY-LCPSEDKSKIV 111
Cdd:cd16982    26 AEENIQAAQAIVEAIEERIRKVPPEQKLPALYLLDSIVK-------NVGGPYTSLFSPNLVDLFLDAYrLVDEKTRKKLE 98

                  ....*.
gi 578836633  112 RVLNLW 117
Cdd:cd16982    99 KLLNTW 104
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
506-573 2.55e-10

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 57.29  E-value: 2.55e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 573
Cdd:cd00590     1 LFVGNLPPDTTEEDLRELFSKFGEVVSVRIVRdrdgkSKGFAFVEFESPEDAEKALEAL--NGTELGGRPLKV 71
RRM3_NGR1_NAM8_like cd12346
RNA recognition motif 3 (RRM3) found in yeast negative growth regulatory protein NGR1 (RBP1), ...
504-575 2.99e-10

RNA recognition motif 3 (RRM3) found in yeast negative growth regulatory protein NGR1 (RBP1), yeast protein NAM8 and similar proteins; This subfamily corresponds to the RRM3 of NGR1 and NAM8. NGR1, also termed RNA-binding protein RBP1, is a putative glucose-repressible protein that binds both RNA and single-stranded DNA (ssDNA) in yeast. It may function in regulating cell growth in early log phase, possibly through its participation in RNA metabolism. NGR1 contains two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), followed by a glutamine-rich stretch that may be involved in transcriptional activity. In addition, NGR1 has an asparagine-rich region near the carboxyl terminus which also harbors a methionine-rich region. The family also includes protein NAM8, which is a putative RNA-binding protein that acts as a suppressor of mitochondrial splicing deficiencies when overexpressed in yeast. It may be a non-essential component of the mitochondrial splicing machinery. Like NGR1, NAM8 contains two RRMs.


Pssm-ID: 409782 [Multi-domain]  Cd Length: 72  Bit Score: 57.33  E-value: 2.99e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 575
Cdd:cd12346     2 TTVFVGGLDPNVTEEDLRVLFGPFGEIVYVKIPPGKGCGFVQFVNRASAEAAIQKLQ--GTPIGGSRIRLSW 71
RRM_RBM22 cd12224
RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This ...
504-576 9.40e-10

RNA recognition motif (RRM) found in Pre-mRNA-splicing factor RBM22 and similar proteins; This subgroup corresponds to the RRM of RBM22 (also known as RNA-binding motif protein 22, or Zinc finger CCCH domain-containing protein 16), a newly discovered RNA-binding motif protein which belongs to the SLT11 gene family. SLT11 gene encoding protein (Slt11p) is a splicing factor in yeast, which is required for spliceosome assembly. Slt11p has two distinct biochemical properties: RNA-annealing and RNA-binding activities. RBM22 is the homolog of SLT11 in vertebrate. It has been reported to be involved in pre-splicesome assembly and to interact with the Ca2+-signaling protein ALG-2. It also plays an important role in embryogenesis. RBM22 contains a conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a zinc finger of the unusual type C-x8-C-x5-C-x3-H, and a C-terminus that is unusually rich in the amino acids Gly and Pro, including sequences of tetraprolines.


Pssm-ID: 409671 [Multi-domain]  Cd Length: 74  Bit Score: 55.75  E-value: 9.40e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrGNYKVNQKSIKIAWA 576
Cdd:cd12224     2 TTLYVGGLGDKITEKDLRDHFYQFGEIRSITVVARQQCAFVQFTTRQAAERAAERTF-NKLIIKGRRLKVKWG 73
CID_Rtt103 cd17003
CID (CTD-Interacting Domain) of yeast transcription termination factor Rtt103 and similar ...
40-131 3.73e-09

CID (CTD-Interacting Domain) of yeast transcription termination factor Rtt103 and similar proteins; Yeast transcription termination factor Rtt103 is a CID (CTD-Interacting Domain) containing protein that functions in DNA damage response. It associates with sites of DNA breaks and is essential for recovery from DNA double strand breaks in the chromosome. CID binds tightly to the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II (RNAP II). Rtt103 CID preferentially interacts with CTD phosphorylated at Ser2. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340800  Cd Length: 127  Bit Score: 55.69  E-value: 3.73e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   40 YKH---VVQIVEKFIKK--CKPEYKVPGLYVIDSIVRQSRHQfgtDKDVFGPRFSKNITATFQYLYL-CPSEDKSKIVRV 113
Cdd:cd17003    29 YRHadeIAEIWSDYLLKssVNSRRKLLLIYLANDVVQQAKAK---KKTEFIDAFSKVLPEVLEKIYPsLPSDIKKKIKRV 105
                          90
                  ....*....|....*...
gi 578836633  114 LNLWQKNGVFKIEIIQPL 131
Cdd:cd17003   106 VNVWKQRQIFSKDVIDDI 123
RRM_1 pfam00076
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic ...
506-572 3.49e-08

RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteriztic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.


Pssm-ID: 425453 [Multi-domain]  Cd Length: 70  Bit Score: 51.46  E-value: 3.49e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633   506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIK 572
Cdd:pfam00076    1 LFVGNLPPDTTEEDLKDLFSKFGPIKSIRLVRdetgrSKGFAFVEFEDEEDAEKAIEALN--GKELGGRELK 70
RRM2_4_MRN1 cd12262
RNA recognition motif 2 (RRM2) and 4 (RRM4) found in RNA-binding protein MRN1 and similar ...
503-571 3.50e-08

RNA recognition motif 2 (RRM2) and 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM2 and RRM4 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, and is an RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409706 [Multi-domain]  Cd Length: 78  Bit Score: 51.63  E-value: 3.50e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSI 571
Cdd:cd12262     3 SRNVYVGNLDDSLTEEEIRGILEKYGEIESIKILKEKNCAFVNYLNIANAIKAVQELPIKNPKFKKVRI 71
RRM_SR140 cd12223
RNA recognition motif (RRM) found in U2-associated protein SR140 and similar proteins; This ...
503-576 6.71e-08

RNA recognition motif (RRM) found in U2-associated protein SR140 and similar proteins; This subgroup corresponds to the RRM of SR140 (also termed U2 snRNP-associated SURP motif-containing protein orU2SURP, or 140 kDa Ser/Arg-rich domain protein) which is a putative splicing factor mainly found in higher eukaryotes. Although it is initially identified as one of the 17S U2 snRNP-associated proteins, the molecular and physiological function of SR140 remains unclear. SR140 contains an N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), a SWAP/SURP domain that is found in a number of pre-mRNA splicing factors in the middle region, and a C-terminal arginine/serine-rich domain (RS domain).


Pssm-ID: 409670 [Multi-domain]  Cd Length: 84  Bit Score: 51.14  E-value: 6.71e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRG---------CAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKI 573
Cdd:cd12223     1 TTNLYVGNLPPSVTEEVLLREFGRFGPLASVKIMWPRTeeerrrnrnCGFVAFMSRADAERAMRELN--GKDVMGYELKL 78

                  ...
gi 578836633  574 AWA 576
Cdd:cd12223    79 GWG 81
RRM_RBM18 cd12355
RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; ...
506-576 8.38e-08

RNA recognition motif (RRM) found in eukaryotic RNA-binding protein 18 and similar proteins; This subfamily corresponds to the RRM of RBM18, a putative RNA-binding protein containing a well-conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The biological role of RBM18 remains unclear.


Pssm-ID: 409791 [Multi-domain]  Cd Length: 80  Bit Score: 50.38  E-value: 8.38e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI---------PPRGCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 575
Cdd:cd12355     2 LWIGNLDPRLTEYHLLKLLSKYGKIKKFDFLfhktgplkgQPRGYCFVTFETKEEAEKAIECL---NGKlALGKKLVVRW 78

                  .
gi 578836633  576 A 576
Cdd:cd12355    79 A 79
RRM3_PUB1 cd12622
RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated ...
504-575 2.53e-07

RNA recognition motif 3 (RRM3) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subfamily corresponds to the RRM3 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410033 [Multi-domain]  Cd Length: 74  Bit Score: 48.99  E-value: 2.53e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 575
Cdd:cd12622     1 TTVYVGNLPPEVTQADLIPLFQNFGVIEEVRVQRDKGFGFVKYDTHEEAALAIQQLN--GQPFLGRPIKCSW 70
RRM1_2_CELF1-6_like cd12361
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding ...
506-559 3.09e-07

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in CELF/Bruno-like family of RNA binding proteins and plant flowering time control protein FCA; This subfamily corresponds to the RRM1 and RRM2 domains of the CUGBP1 and ETR-3-like factors (CELF) as well as plant flowering time control protein FCA. CELF, also termed BRUNOL (Bruno-like) proteins, is a family of structurally related RNA-binding proteins involved in regulation of pre-mRNA splicing in the nucleus, and control of mRNA translation and deadenylation in the cytoplasm. The family contains six members: CELF-1 (also known as BRUNOL-2, CUG-BP1, NAPOR, EDEN-BP), CELF-2 (also known as BRUNOL-3, ETR-3, CUG-BP2, NAPOR-2), CELF-3 (also known as BRUNOL-1, TNRC4, ETR-1, CAGH4, ER DA4), CELF-4 (BRUNOL-4), CELF-5 (BRUNOL-5) and CELF-6 (BRUNOL-6). They all contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The low sequence conservation of the linker region is highly suggestive of a large variety in the co-factors that associate with the various CELF family members. Based on both, sequence similarity and function, the CELF family can be divided into two subfamilies, the first containing CELFs 1 and 2, and the second containing CELFs 3, 4, 5, and 6. The different CELF proteins may act through different sites on at least some substrates. Furthermore, CELF proteins may interact with each other in varying combinations to influence alternative splicing in different contexts. This subfamily also includes plant flowering time control protein FCA that functions in the posttranscriptional regulation of transcripts involved in the flowering process. FCA contains two RRMs, and a WW protein interaction domain.


Pssm-ID: 409796 [Multi-domain]  Cd Length: 77  Bit Score: 48.77  E-value: 3.09e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12361     2 LFVGMIPKTASEEDVRPLFEQFGNIEEVQILrdkqtgQSKGCAFVTFSTREEALRAIEAL 61
RRM1_TIA1_like cd12352
RNA recognition motif 1 (RRM1) found in granule-associated RNA binding proteins p40-TIA-1 and ...
506-576 1.35e-06

RNA recognition motif 1 (RRM1) found in granule-associated RNA binding proteins p40-TIA-1 and TIAR; This subfamily corresponds to the RRM1 of nucleolysin TIA-1 isoform p40 (p40-TIA-1 or TIA-1) and nucleolysin TIA-1-related protein (TIAR), both of which are granule-associated RNA binding proteins involved in inducing apoptosis in cytotoxic lymphocyte (CTL) target cells. TIA-1 and TIAR share high sequence similarity. They are expressed in a wide variety of cell types. TIA-1 can be phosphorylated by a serine/threonine kinase that is activated during Fas-mediated apoptosis.TIAR is mainly localized in the nucleus of hematopoietic and nonhematopoietic cells. It is translocated from the nucleus to the cytoplasm in response to exogenous triggers of apoptosis. Both, TIA-1 and TIAR, bind specifically to poly(A) but not to poly(C) homopolymers. They are composed of three N-terminal highly homologous RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a glutamine-rich C-terminal auxiliary domain containing a lysosome-targeting motif. TIA-1 and TIAR interact with RNAs containing short stretches of uridylates and their RRM2 can mediate the specific binding to uridylate-rich RNAs. The C-terminal auxiliary domain may be responsible for interacting with other proteins. In addition, TIA-1 and TIAR share a potential serine protease-cleavage site (Phe-Val-Arg) localized at the junction between their RNA binding domains and their C-terminal auxiliary domains.


Pssm-ID: 409788 [Multi-domain]  Cd Length: 73  Bit Score: 47.02  E-value: 1.35e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGC----AYIVMVHRQDAYRALQKLSRgnYKVNQKSIKIAWA 576
Cdd:cd12352     1 LYVGNLDRQVTEDLILQLFSQIGPCKSCKMITEHGGndpyCFVEFYEHNHAAAALQAMNG--RKILGKEVKVNWA 73
RRM_snRNP70 cd12236
RNA recognition motif (RRM) found in U1 small nuclear ribonucleoprotein 70 kDa (U1-70K) and ...
505-555 3.53e-06

RNA recognition motif (RRM) found in U1 small nuclear ribonucleoprotein 70 kDa (U1-70K) and similar proteins; This subfamily corresponds to the RRM of U1-70K, also termed snRNP70, a key component of the U1 snRNP complex, which is one of the key factors facilitating the splicing of pre-mRNA via interaction at the 5' splice site, and is involved in regulation of polyadenylation of some viral and cellular genes, enhancing or inhibiting efficient poly(A) site usage. U1-70K plays an essential role in targeting the U1 snRNP to the 5' splice site through protein-protein interactions with regulatory RNA-binding splicing factors, such as the RS protein ASF/SF2. Moreover, U1-70K protein can specifically bind to stem-loop I of the U1 small nuclear RNA (U1 snRNA) contained in the U1 snRNP complex. It also mediates the binding of U1C, another U1-specific protein, to the U1 snRNP complex. U1-70K contains a conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by an adjacent glycine-rich region at the N-terminal half, and two serine/arginine-rich (SR) domains at the C-terminal half. The RRM is responsible for the binding of stem-loop I of U1 snRNA molecule. Additionally, the most prominent immunodominant region that can be recognized by auto-antibodies from autoimmune patients may be located within the RRM. The SR domains are involved in protein-protein interaction with SR proteins that mediate 5' splice site recognition. For instance, the first SR domain is necessary and sufficient for ASF/SF2 Binding. The family also includes Drosophila U1-70K that is an essential splicing factor required for viability in flies, but its SR domain is dispensable. The yeast U1-70k doesn't contain easily recognizable SR domains and shows low sequence similarity in the RRM region with other U1-70k proteins and therefore not included in this family. The RRM domain is dispensable for yeast U1-70K function.


Pssm-ID: 409682 [Multi-domain]  Cd Length: 91  Bit Score: 46.46  E-value: 3.53e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQD---AYRA 555
Cdd:cd12236     3 TLFVARLSYDTTESKLRREFEKYGPIKRVRLVrdkktgKSRGYAFIEFEHERDmkaAYKH 62
RRM3_TIA1_like cd12354
RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and ...
504-575 5.93e-06

RNA recognition motif 2 (RRM2) found in granule-associated RNA binding proteins (p40-TIA-1 and TIAR), and yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1; This subfamily corresponds to the RRM3 of TIA-1, TIAR, and PUB1. Nucleolysin TIA-1 isoform p40 (p40-TIA-1 or TIA-1) and nucleolysin TIA-1-related protein (TIAR) are granule-associated RNA binding proteins involved in inducing apoptosis in cytotoxic lymphocyte (CTL) target cells. They share high sequence similarity and are expressed in a wide variety of cell types. TIA-1 can be phosphorylated by a serine/threonine kinase that is activated during Fas-mediated apoptosis.TIAR is mainly localized in the nucleus of hematopoietic and nonhematopoietic cells. It is translocated from the nucleus to the cytoplasm in response to exogenous triggers of apoptosis. Both TIA-1 and TIAR bind specifically to poly(A) but not to poly(C) homopolymers. They are composed of three N-terminal highly homologous RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a glutamine-rich C-terminal auxiliary domain containing a lysosome-targeting motif. TIA-1 and TIAR interact with RNAs containing short stretches of uridylates and their RRM2 can mediate the specific binding to uridylate-rich RNAs. The C-terminal auxiliary domain may be responsible for interacting with other proteins. In addition, TIA-1 and TIAR share a potential serine protease-cleavage site (Phe-Val-Arg) localized at the junction between their RNA binding domains and their C-terminal auxiliary domains. This subfamily also includes a yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1, termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein, which has been identified as both a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP). It may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. PUB1 is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RRMs, and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 409790 [Multi-domain]  Cd Length: 71  Bit Score: 44.97  E-value: 5.93e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAW 575
Cdd:cd12354     1 TTVYVGNITKGLTEALLQQTFSPFGQILEVRVFPDKGYAFIRFDSHEAATHAIVSVN--GTIINGQAVKCSW 70
RRM_eIF3G_like cd12408
RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G ...
505-565 1.04e-05

RNA recognition motif (RRM) found in eukaryotic translation initiation factor 3 subunit G (eIF-3G) and similar proteins; This subfamily corresponds to the RRM of eIF-3G and similar proteins. eIF-3G, also termed eIF-3 subunit 4, or eIF-3-delta, or eIF3-p42, or eIF3-p44, is the RNA-binding subunit of eIF3, a large multisubunit complex that plays a central role in the initiation of translation by binding to the 40 S ribosomal subunit and promoting the binding of methionyl-tRNAi and mRNA. eIF-3G binds 18 S rRNA and beta-globin mRNA, and therefore appears to be a nonspecific RNA-binding protein. eIF-3G is one of the cytosolic targets and interacts with mature apoptosis-inducing factor (AIF). eIF-3G contains one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). This family also includes yeast eIF3-p33, a homolog of vertebrate eIF-3G, plays an important role in the initiation phase of protein synthesis in yeast. It binds both, mRNA and rRNA, fragments due to an RRM near its C-terminus.


Pssm-ID: 409842 [Multi-domain]  Cd Length: 76  Bit Score: 44.42  E-value: 1.04e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEFGPIESINmIP-------PRGCAYIVMVHRQDAYRALQKLSRGNYK 565
Cdd:cd12408     1 TIRVTNLSEDATEEDLRELFRPFGPISRVY-LAkdketgqSKGFAFVTFETREDAERAIEKLNGFGYD 67
RRM1_Crp79 cd21619
RNA recognition motif 1 (RRM1) found in Schizosaccharomyces pombe mRNA export factor Crp79 and ...
503-574 1.19e-05

RNA recognition motif 1 (RRM1) found in Schizosaccharomyces pombe mRNA export factor Crp79 and similar proteins; Crp79, also called meiotic expression up-regulated protein 5 (Mug5), or polyadenylate-binding protein crp79, or PABP, or poly(A)-binding protein, is an auxiliary mRNA export factor that binds the poly(A) tail of mRNA and is involved in the export of mRNA from the nucleus to the cytoplasm. Members in this family contain three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). The model corresponds to the first RRM motif.


Pssm-ID: 410198 [Multi-domain]  Cd Length: 78  Bit Score: 44.44  E-value: 1.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESInMIPP---------RGCAYIVMVHRQDAYRALQKLsrGNYKVNQKSIKI 573
Cdd:cd21619     1 SNTIYVGNIDMTINEDALEKIFSRYGQVESV-RRPPihtdkadrtTGFGFIKYTDAESAERAMQQA--DGILLGRRRLVV 77

                  .
gi 578836633  574 A 574
Cdd:cd21619    78 R 78
CID_RPRD_like cd16981
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing proteins; ...
38-134 1.29e-05

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing proteins; This family is composed of Regulation of nuclear pre-mRNA domain-containing proteins 1A (RPRD1A), 1B (RPRD1B), 2 (RPRD2), yeast Rtt103, and similar proteins. RPRD1A, RPRD1B, and RPRD2 are CID (CTD-Interacting Domain) containing proteins that co-purify with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. Yeast transcription termination factor Rtt103 is a CID containing protein that functions in DNA damage response. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340778  Cd Length: 125  Bit Score: 45.65  E-value: 1.29e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   38 KLYKHVVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQFGTDkdvFGPRFSKNITATFQYLYLCPSED-KSKIVRVLNL 116
Cdd:cd16981    30 KHAKQIVKIWLKELKKAKPERKLTLLYLANDVLQNSRRKGAPE---FVEAFKKVLPEALALVRSEGDESvRKKVLRVLNI 106
                          90
                  ....*....|....*...
gi 578836633  117 WQKNGVFKIEIIQPLLDM 134
Cdd:cd16981   107 WEERNVFGSEFLAELRAI 124
RRM2_CELF3_4_5_6 cd12635
RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, ...
506-559 2.23e-05

RNA recognition motif 2 (RRM2) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, CELF-6 and similar proteins; This subgroup corresponds to the RRM2 of CELF-3, CELF-4, CELF-5, and CELF-6, all of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that display dual nuclear and cytoplasmic localizations and have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-3, expressed in brain and testis only, is also known as bruno-like protein 1 (BRUNOL-1), or CAG repeat protein 4, or CUG-BP- and ETR-3-like factor 3, or embryonic lethal abnormal vision (ELAV)-type RNA-binding protein 1 (ETR-1), or expanded repeat domain protein CAG/CTG 4, or trinucleotide repeat-containing gene 4 protein (TNRC4). It plays an important role in the pathogenesis of tauopathies. CELF-3 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The effect of CELF-3 on tau splicing is mediated mainly by the RNA-binding activity of RRM2. The divergent linker region might mediate the interaction of CELF-3 with other proteins regulating its activity or involved in target recognition. CELF-4, being highly expressed throughout the brain and in glandular tissues, moderately expressed in heart, skeletal muscle, and liver, is also known as bruno-like protein 4 (BRUNOL-4), or CUG-BP- and ETR-3-like factor 4. Like CELF-3, CELF-4 also contain three highly conserved RRMs. The splicing activation or repression activity of CELF-4 on some specific substrates is mediated by its RRM1/RRM2. On the other hand, both RRM1 and RRM2 of CELF-4 can activate cardiac troponin T (cTNT) exon 5 inclusion. CELF-5, expressed in brain, is also known as bruno-like protein 5 (BRUNOL-5), or CUG-BP- and ETR-3-like factor 5. Although its biological role remains unclear, CELF-5 shares same domain architecture with CELF-3. CELF-6, being strongly expressed in kidney, brain, and testis, is also known as bruno-like protein 6 (BRUNOL-6), or CUG-BP- and ETR-3-like factor 6. It activates exon inclusion of a cardiac troponin T minigene in transient transfection assays in a muscle-specific splicing enhancer (MSE)-dependent manner and can activate inclusion via multiple copies of a single element, MSE2. CELF-6 also promotes skipping of exon 11 of insulin receptor, a known target of CELF activity that is expressed in kidney. In addition to three highly conserved RRMs, CELF-6 also possesses numerous potential phosphorylation sites, a potential nuclear localization signal (NLS) at the C terminus, and an alanine-rich region within the divergent linker region.


Pssm-ID: 410043 [Multi-domain]  Cd Length: 81  Bit Score: 43.56  E-value: 2.23e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI-----PPRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12635     4 LFVGMLGKQQSEDDVRRLFEPFGSIEECTILrgpdgNSKGCAFVKFSSHAEAQAAINAL 62
CID_RPRD1A cd17011
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1A; ...
41-131 3.02e-05

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1A; Regulation of nuclear pre-mRNA domain-containing protein 1A (RPRD1A) is also called Cyclin-dependent kinase inhibitor 2B-related protein or p15INK4B-related protein (P15RS). RPRD1A is a CID (CTD-Interacting Domain) containing protein that co-purifies with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. RPRD1A form homodimers and heterodimers with RPRD1B through their coiled-coil domains. Both RPRD1A and RPRD1B associate directly with RPAP2 phosphatase and serve as CTD scaffolds to coordinate the dephosphorylation of phospho-S5 by RPAP2. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340808  Cd Length: 128  Bit Score: 44.65  E-value: 3.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   41 KH---VVQIVEKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdkdvfGPRFSKN----ITATFQYLYLCPSED-KSKIVR 112
Cdd:cd17011    32 KHsrpIVTVWERELRKAKPNRKLTFLYLANDVIQNSKRK--------GPEFTKDfapvIVEAFKHVSSETDEScKKHLGR 103
                          90
                  ....*....|....*....
gi 578836633  113 VLNLWQKNGVFKIEIIQPL 131
Cdd:cd17011   104 VLSIWEERSVYENDVLEQL 122
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
505-575 3.28e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 3.28e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633   505 TLWVGQLDKRTTQQDVASLLEEFGPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 575
Cdd:TIGR01628    2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVrvcrDSVTRRslGYGYVNFQNPADAERALETM---NFKrLGGKPIRIMW 76
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
133-281 5.81e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.17  E-value: 5.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  133 DMAAGTSNAAPVAENVTNNEGSPPPPVKVSSEPPTQATPNSVPAV-PQLPSSDAFAAVAQLFQTTQGQQLQQILQTFQQP 211
Cdd:PRK07994  369 EVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVpLPETTSQLLAARQQLQRAQGATKAKKSEPAAASR 448
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578836633  212 PKPQSPALDNAVMAQVQAITAQLKTTPTQPSEQKAAFPPPEQKT--AFDKKLLDRFDYDDEPE----AVEESKKED 281
Cdd:PRK07994  449 ARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEpvATPKALKKALEHEKTPElaakLAAEAIERD 524
PRK12678 PRK12678
transcription termination factor Rho; Provisional
958-1027 9.65e-05

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.44  E-value: 9.65e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  958 RNDNRQQFNSGRDQERFGRRSFGNRVEN-DRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERNRRS 1027
Cdd:PRK12678  196 RGRDGDDRDRRDRREQGDRREERGRRDGgDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRF 266
RRM_TRA2 cd12363
RNA recognition motif (RRM) found in transformer-2 protein homolog TRA2-alpha, TRA2-beta and ...
503-559 1.63e-04

RNA recognition motif (RRM) found in transformer-2 protein homolog TRA2-alpha, TRA2-beta and similar proteins; This subfamily corresponds to the RRM of two mammalian homologs of Drosophila transformer-2 (Tra2), TRA2-alpha, TRA2-beta (also termed SFRS10), and similar proteins found in eukaryotes. TRA2-alpha is a 40-kDa serine/arginine-rich (SR) protein that specifically binds to gonadotropin-releasing hormone (GnRH) exonic splicing enhancer on exon 4 (ESE4) and is necessary for enhanced GnRH pre-mRNA splicing. It strongly stimulates GnRH intron A excision in a dose-dependent manner. In addition, TRA2-alpha can interact with either 9G8 or SRp30c, which may also be crucial for ESE-dependent GnRH pre-mRNA splicing. TRA2-beta is a serine/arginine-rich (SR) protein that controls the pre-mRNA alternative splicing of the calcitonin/calcitonin gene-related peptide (CGRP), the survival motor neuron 1 (SMN1) protein and the tau protein. Both, TRA2-alpha and TRA2-beta, contains a well conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), flanked by the N- and C-terminal arginine/serine (RS)-rich regions.


Pssm-ID: 409798 [Multi-domain]  Cd Length: 80  Bit Score: 41.06  E-value: 1.63e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12363     1 SRCLGVFGLSLYTTERDLREVFSRYGPIEKVQVVydqqtgRSRGFGFVYFESVEDAKEAKERL 63
RRM1_3_MRN1 cd12261
RNA recognition motif 1 (RRM1) and 3 (RRM3) found in RNA-binding protein MRN1 and similar ...
505-576 1.69e-04

RNA recognition motif 1 (RRM1) and 3 (RRM3) found in RNA-binding protein MRN1 and similar proteins; This subfamily corresponds to the RRM1 and RRM3 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is an RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 240707 [Multi-domain]  Cd Length: 73  Bit Score: 41.05  E-value: 1.69e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEfGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWA 576
Cdd:cd12261     2 TVYLGNLPEDTTIRDILSAIRG-GPLESIKLLPTKNSATVSFLDEAAAEAFYAYARNNGFYINGKRIKVGWG 72
RRM2_MRN1 cd12523
RNA recognition motif 2 (RRM2) found in RNA-binding protein MRN1 and similar proteins; This ...
503-560 2.73e-04

RNA recognition motif 2 (RRM2) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM2 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409943 [Multi-domain]  Cd Length: 78  Bit Score: 40.50  E-value: 2.73e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLS 560
Cdd:cd12523     3 SRNVYLGNLPESITEEELREDLEKFGPIDQIKIVKEKNIAFVHFLSIANAIKVVTTLP 60
PRK12678 PRK12678
transcription termination factor Rho; Provisional
958-1027 3.71e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 44.51  E-value: 3.71e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  958 RNDNRQQFNSGRDQERFGRRSFGNRVENDRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERNRRS 1027
Cdd:PRK12678  180 DRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGD 249
RRM1_MRN1 cd12520
RNA recognition motif 1 (RRM1) found in RNA-binding protein MRN1 and similar proteins; This ...
503-575 4.02e-04

RNA recognition motif 1 (RRM1) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM1 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa,which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 240964 [Multi-domain]  Cd Length: 74  Bit Score: 40.12  E-value: 4.02e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEeFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAW 575
Cdd:cd12520     1 SRTVYLGNLPPNTTVKELLSHVR-SGPIENVRILPEKNCAFISFLDPSAATAFHSDAILKRLSIKGVELKIGW 72
RRM1_PUB1 cd12614
RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated ...
506-576 4.28e-04

RNA recognition motif 1 (RRM1) found in yeast nuclear and cytoplasmic polyadenylated RNA-binding protein PUB1 and similar proteins; This subgroup corresponds to the RRM1 of yeast protein PUB1, also termed ARS consensus-binding protein ACBP-60, or poly uridylate-binding protein, or poly(U)-binding protein. PUB1 has been identified as both, a heterogeneous nuclear RNA-binding protein (hnRNP) and a cytoplasmic mRNA-binding protein (mRNP), which may be stably bound to a translationally inactive subpopulation of mRNAs within the cytoplasm. It is distributed in both, the nucleus and the cytoplasm, and binds to poly(A)+ RNA (mRNA or pre-mRNA). Although it is one of the major cellular proteins cross-linked by UV light to polyadenylated RNAs in vivo, PUB1 is nonessential for cell growth in yeast. PUB1 also binds to T-rich single stranded DNA (ssDNA); however, there is no strong evidence implicating PUB1 in the mechanism of DNA replication. PUB1 contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a GAR motif (glycine and arginine rich stretch) that is located between RRM2 and RRM3.


Pssm-ID: 410026 [Multi-domain]  Cd Length: 74  Bit Score: 39.73  E-value: 4.28e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR---GCAYiVMVH---RQDAYRALQKLsrGNYKVNQKSIKIAWA 576
Cdd:cd12614     1 LYVGNLDPRVTEDLLQEIFAVTGPVENCKIIPDKnskGVNY-GFVEyydRRSAEIAIQTL--NGRQIFGQEIKVNWA 74
RRM1_Hu_like cd12375
RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), ...
506-576 4.58e-04

RNA recognition motif 1 (RRM1) found in the Hu proteins family, Drosophila sex-lethal (SXL), and similar proteins; This subfamily corresponds to the RRM1 of Hu proteins and SXL. The Hu proteins family represents a group of RNA-binding proteins involved in diverse biological processes. Since the Hu proteins share high homology with the Drosophila embryonic lethal abnormal vision (ELAV) protein, the Hu family is sometimes referred to as the ELAV family. Drosophila ELAV is exclusively expressed in neurons and is required for the correct differentiation and survival of neurons in flies. The neuronal members of the Hu family include Hu-antigen B (HuB or ELAV-2 or Hel-N1), Hu-antigen C (HuC or ELAV-3 or PLE21), and Hu-antigen D (HuD or ELAV-4), which play important roles in neuronal differentiation, plasticity and memory. HuB is also expressed in gonads. Hu-antigen R (HuR or ELAV-1 or HuA) is ubiquitously expressed Hu family member. It has a variety of biological functions mostly related to the regulation of cellular response to DNA damage and other types of stress. Hu proteins perform their cytoplasmic and nuclear molecular functions by coordinately regulating functionally related mRNAs. In the cytoplasm, Hu proteins recognize and bind to AU-rich RNA elements (AREs) in the 3' untranslated regions (UTRs) of certain target mRNAs, such as GAP-43, vascular epithelial growth factor (VEGF), the glucose transporter GLUT1, eotaxin and c-fos, and stabilize those ARE-containing mRNAs. They also bind and regulate the translation of some target mRNAs, such as neurofilament M, GLUT1, and p27. In the nucleus, Hu proteins function as regulators of polyadenylation and alternative splicing. Each Hu protein contains three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). RRM1 and RRM2 may cooperate in binding to an ARE. RRM3 may help to maintain the stability of the RNA-protein complex, and might also bind to poly(A) tails or be involved in protein-protein interactions. This family also includes the sex-lethal protein (SXL) from Drosophila melanogaster. SXL governs sexual differentiation and X chromosome dosage compensation in flies. It induces female-specific alternative splicing of the transformer (tra) pre-mRNA by binding to the tra uridine-rich polypyrimidine tract at the non-sex-specific 3' splice site during the sex-determination process. SXL binds to its own pre-mRNA and promotes female-specific alternative splicing. It contains an N-terminal Gly/Asn-rich domain that may be responsible for the protein-protein interaction, and tandem RRMs that show high preference to bind single-stranded, uridine-rich target RNA transcripts.


Pssm-ID: 409810 [Multi-domain]  Cd Length: 76  Bit Score: 39.70  E-value: 4.58e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWA 576
Cdd:cd12375     2 LIVNYLPQSMTQEELRSLFGAIGPIESCKLVRDKitgqslGYGFVNYRDPNDARKAINTLN--GLDLENKRLKVSYA 76
RRM1_Nop4p cd12674
RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; ...
504-576 5.53e-04

RNA recognition motif 1 (RRM1) found in yeast nucleolar protein 4 (Nop4p) and similar proteins; This subgroup corresponds to the RRM1 of Nop4p (also known as Nop77p), encoded by YPL043W from Saccharomyces cerevisiae. It is an essential nucleolar protein involved in processing and maturation of 27S pre-rRNA and biogenesis of 60S ribosomal subunits. Nop4p has four RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 410075 [Multi-domain]  Cd Length: 80  Bit Score: 39.76  E-value: 5.53e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIAWA 576
Cdd:cd12674     1 TTLFVRNLPFDVTLESLTDFFSDIGPVKHAVVVtdpetkKSRGYGFVSFSTHDDAEEALAKLK--NRKLSGHILKLDFA 77
RRM1_CELF3_4_5_6 cd12632
RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, ...
503-555 5.86e-04

RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-3, CELF-4, CELF-5, CELF-6 and similar proteins; This subfamily corresponds to the RRM1 of CELF-3, CELF-4, CELF-5, CELF-6, all of which belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that display dual nuclear and cytoplasmic localizations and have been implicated in the regulation of pre-mRNA splicing and in the control of mRNA translation and deadenylation. CELF-3, expressed in brain and testis only, is also known as bruno-like protein 1 (BRUNOL-1), or CAG repeat protein 4, or CUG-BP- and ETR-3-like factor 3, or embryonic lethal abnormal vision (ELAV)-type RNA-binding protein 1 (ETR-1), or expanded repeat domain protein CAG/CTG 4, or trinucleotide repeat-containing gene 4 protein (TNRC4). It plays an important role in the pathogenesis of tauopathies. CELF-3 contains three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein.The effect of CELF-3 on tau splicing is mediated mainly by the RNA-binding activity of RRM2. The divergent linker region might mediate the interaction of CELF-3 with other proteins regulating its activity or involved in target recognition. CELF-4, highly expressed throughout the brain and in glandular tissues, moderately expressed in heart, skeletal muscle, and liver, is also known as bruno-like protein 4 (BRUNOL-4), or CUG-BP- and ETR-3-like factor 4. Like CELF-3, CELF-4 also contain three highly conserved RRMs. The splicing activation or repression activity of CELF-4 on some specific substrates is mediated by its RRM1/RRM2. On the other hand, both RRM1 and RRM2 of CELF-4 can activate cardiac troponin T (cTNT) exon 5 inclusion. CELF-5, expressed in brain, is also known as bruno-like protein 5 (BRUNOL-5), or CUG-BP- and ETR-3-like factor 5. Although its biological role remains unclear, CELF-5 shares same domain architecture with CELF-3. CELF-6, strongly expressed in kidney, brain, and testis, is also known as bruno-like protein 6 (BRUNOL-6), or CUG-BP- and ETR-3-like factor 6. It activates exon inclusion of a cardiac troponin T minigene in transient transfection assays in an muscle-specific splicing enhancer (MSE)-dependent manner and can activate inclusion via multiple copies of a single element, MSE2. CELF-6 also promotes skipping of exon 11 of insulin receptor, a known target of CELF activity that is expressed in kidney. In additiona to three highly conserved RRMs, CELF-6 also possesses numerous potential phosphorylation sites, a potential nuclear localization signal (NLS) at the C terminus, and an alanine-rich region within the divergent linker region.


Pssm-ID: 410041 [Multi-domain]  Cd Length: 87  Bit Score: 40.09  E-value: 5.86e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  503 STTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRA 555
Cdd:cd12632     5 AIKLFIGQIPRNLEEKDLRPLFEQFGKIYELTVLKDKytgmhkGCAFLTYCARESALKA 63
RRM_SNP1_like cd21615
RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ...
505-561 6.63e-04

RNA recognition motif (RRM) found in Saccharomyces cerevisiae U1 small nuclear ribonucleoprotein SNP1 and similar proteins; SNP1, also called U1 snRNP protein SNP1, or U1 small nuclear ribonucleoprotein 70 kDa homolog, or U1 70K, or U1 snRNP 70 kDa homolog, interacts with mRNA and is involved in nuclear mRNA splicing. It is a component of the spliceosome, where it is associated with snRNP U1 by binding stem loop I of U1 snRNA. Members in this family contain an N-terminal U1snRNP70 domain and an RNA recognition motif (RRM), also called RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 410194 [Multi-domain]  Cd Length: 118  Bit Score: 40.76  E-value: 6.63e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKLSR 561
Cdd:cd21615    20 TLFVGRLDYSLTELELQKKFSKFGEIEKIRIVrdketgKSRGYAFIVFKSESDAKNAFKEGNG 82
RRM4_RBM19_RRM3_MRD1 cd12317
RNA recognition motif 4 (RRM4) found in RNA-binding protein 19 (RBM19) and RNA recognition ...
504-565 7.89e-04

RNA recognition motif 4 (RRM4) found in RNA-binding protein 19 (RBM19) and RNA recognition motif 3 (RRM3) found in multiple RNA-binding domain-containing protein 1 (MRD1); This subfamily corresponds to the RRM4 of RBM19 and the RRM3 of MRD1. RBM19, also termed RNA-binding domain-1 (RBD-1), is a nucleolar protein conserved in eukaryotes involved in ribosome biogenesis by processing rRNA and is essential for preimplantation development. It has a unique domain organization containing 6 conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). MRD1 is encoded by a novel yeast gene MRD1 (multiple RNA-binding domain). It is well conserved in yeast and its homologues exist in all eukaryotes. MRD1 is present in the nucleolus and the nucleoplasm. It interacts with the 35 S precursor rRNA (pre-rRNA) and U3 small nucleolar RNAs (snoRNAs). MRD1 is essential for the initial processing at the A0-A2 cleavage sites in the 35 S pre-rRNA. MRD1 contains 5 conserved RRMs, which may play an important structural role in organizing specific rRNA processing events.


Pssm-ID: 409756 [Multi-domain]  Cd Length: 72  Bit Score: 39.16  E-value: 7.89e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYK 565
Cdd:cd12317     1 TVILVKNLPFGATEEELRELFEKFGTLGRLLLPPSRTIALVEFLEPQDARRAFKKLAYKRFK 62
RRM2_Bruno_like cd12636
RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar ...
506-559 9.14e-04

RNA recognition motif 2 (RRM2) found in Drosophila melanogaster Bruno protein and similar proteins; This subgroup corresponds to the RRM2 of Bruno, a Drosophila RNA recognition motif (RRM)-containing protein that plays a central role in regulation of Oskar (Osk) expression. It mediates repression by binding to regulatory Bruno response elements (BREs) in the Osk mRNA 3' UTR. The full-length Bruno protein contains three RRMs, two located in the N-terminal half of the protein and the third near the C-terminus, separated by a linker region.


Pssm-ID: 410044 [Multi-domain]  Cd Length: 81  Bit Score: 39.09  E-value: 9.14e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI-----PPRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12636     4 LFVGMLSKKCNESDVRIMFSPYGSIEECTVLrdqngKSRGCAFVTFTSRQCAVNAIKAM 62
RRM1_I_PABPs cd12378
RNA recognition motif 1 (RRM1) found in type I polyadenylate-binding proteins; This subfamily ...
506-576 1.15e-03

RNA recognition motif 1 (RRM1) found in type I polyadenylate-binding proteins; This subfamily corresponds to the RRM1 of type I poly(A)-binding proteins (PABPs), highly conserved proteins that bind to the poly(A) tail present at the 3' ends of most eukaryotic mRNAs. They have been implicated in the regulation of poly(A) tail length during the polyadenylation reaction, translation initiation, mRNA stabilization by influencing the rate of deadenylation and inhibition of mRNA decapping. The family represents type I polyadenylate-binding proteins (PABPs), including polyadenylate-binding protein 1 (PABP-1 or PABPC1), polyadenylate-binding protein 3 (PABP-3 or PABPC3), polyadenylate-binding protein 4 (PABP-4 or APP-1 or iPABP), polyadenylate-binding protein 5 (PABP-5 or PABPC5), polyadenylate-binding protein 1-like (PABP-1-like or PABPC1L), polyadenylate-binding protein 1-like 2 (PABPC1L2 or RBM32), polyadenylate-binding protein 4-like (PABP-4-like or PABPC4L), yeast polyadenylate-binding protein, cytoplasmic and nuclear (PABP or ACBP-67), and similar proteins. PABP-1 is a ubiquitously expressed multifunctional protein that may play a role in 3' end formation of mRNA, translation initiation, mRNA stabilization, protection of poly(A) from nuclease activity, mRNA deadenylation, inhibition of mRNA decapping, and mRNP maturation. Although PABP-1 is thought to be a cytoplasmic protein, it is also found in the nucleus. PABP-1 may be involved in nucleocytoplasmic trafficking and utilization of mRNP particles. PABP-1 contains four copies of RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), a less well conserved linker region, and a proline-rich C-terminal conserved domain (CTD). PABP-3 is a testis-specific poly(A)-binding protein specifically expressed in round spermatids. It is mainly found in mammalian and may play an important role in the testis-specific regulation of mRNA homeostasis. PABP-3 shows significant sequence similarity to PABP-1. However, it binds to poly(A) with a lower affinity than PABP-1. Moreover, PABP-1 possesses an A-rich sequence in its 5'-UTR and allows binding of PABP and blockage of translation of its own mRNA. In contrast, PABP-3 lacks the A-rich sequence in its 5'-UTR. PABP-4 is an inducible poly(A)-binding protein (iPABP) that is primarily localized to the cytoplasm. It shows significant sequence similarity to PABP-1 as well. The RNA binding properties of PABP-1 and PABP-4 appear to be identical. PABP-5 is encoded by PABPC5 gene within the X-specific subinterval, and expressed in fetal brain and in a range of adult tissues in mammals, such as ovary and testis. It may play an important role in germ cell development. Moreover, unlike other PABPs, PABP-5 contains only four RRMs, but lacks both the linker region and the CTD. PABP-1-like and PABP-1-like 2 are the orthologs of PABP-1. PABP-4-like is the ortholog of PABP-5. Their cellular functions remain unclear. The family also includes yeast PABP, a conserved poly(A) binding protein containing poly(A) tails that can be attached to the 3'-ends of mRNAs. The yeast PABP and its homologs may play important roles in the initiation of translation and in mRNA decay. Like vertebrate PABP-1, the yeast PABP contains four RRMs, a linker region, and a proline-rich CTD as well. The first two RRMs are mainly responsible for specific binding to poly(A). The proline-rich region may be involved in protein-protein interactions.


Pssm-ID: 409812 [Multi-domain]  Cd Length: 80  Bit Score: 38.77  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  506 LWVGQLDKRTTQqdvASLLEEF---GPIESI----NMIPPR--GCAYIVMVHRQDAYRALQKLsrgNYK-VNQKSIKIAW 575
Cdd:cd12378     2 LYVGDLHPDVTE---AMLYEKFspaGPVLSIrvcrDAVTRRslGYAYVNFQQPADAERALDTL---NFDvIKGKPIRIMW 75

                  .
gi 578836633  576 A 576
Cdd:cd12378    76 S 76
RRM_cwf2 cd12360
RNA recognition motif (RRM) found in yeast pre-mRNA-splicing factor Cwc2 and similar proteins; ...
505-552 1.36e-03

RNA recognition motif (RRM) found in yeast pre-mRNA-splicing factor Cwc2 and similar proteins; This subfamily corresponds to the RRM of yeast protein Cwc2, also termed Complexed with CEF1 protein 2, or PRP19-associated complex protein 40 (Ntc40), or synthetic lethal with CLF1 protein 3, one of the components of the Prp19-associated complex [nineteen complex (NTC)] that can bind to RNA. NTC is composed of the scaffold protein Prp19 and a number of associated splicing factors, and plays a crucial role in intron removal during premature mRNA splicing in eukaryotes. Cwc2 functions as an RNA-binding protein that can bind both small nuclear RNAs (snRNAs) and pre-mRNA in vitro. It interacts directly with the U6 snRNA to link the NTC to the spliceosome during pre-mRNA splicing. In the N-terminal half, Cwc2 contains a CCCH-type zinc finger (ZnF domain), a RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and an intervening loop, also termed RNA-binding loop or RB loop, between ZnF and RRM, all of which are necessary and sufficient for RNA binding. The ZnF is also responsible for mediating protein-protein interaction. The C-terminal flexible region of Cwc2 interacts with the WD40 domain of Prp19.


Pssm-ID: 409795 [Multi-domain]  Cd Length: 79  Bit Score: 38.78  E-value: 1.36e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 578836633  505 TLWVGQLDKRTTQ-QDVASLLE----EFGPIESINMIPPRGCAYIVMVHRQDA 552
Cdd:cd12360     3 TLYVGGIKAASNKlAQIEEILRrhfgEWGEIERIRVLPSKGIAFVRYKNRANA 55
CID_RPRD1B cd17012
CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1B; ...
41-131 2.13e-03

CID (CTD-Interacting Domain) of Regulation of nuclear pre-mRNA domain-containing protein 1B; Regulation of nuclear pre-mRNA domain-containing protein 1B (RPRD1B) is also called Cell cycle-related and expression-elevated protein in tumor (CREPT). RPRD1B is a CID (CTD-Interacting Domain) containing protein that co-purifies with RNA polymerase (Pol) II (RNAP II) and three other RNAP II-associated proteins, RPAP2, GRINL1A and RECQL5, but not with the Mediator complex. CID binds tightly to the carboxy-terminal domain (CTD) of RNAP II. During transcription, RNAP II synthesizes eukaryotic messenger RNA. Transcription is coupled to RNA processing through the CTD, which consists of up to 52 repeats of the sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. RPRD1B form homodimers and heterodimers with RPRD1A through their coiled-coil domains. Both RPRD1A and RPRD1B associate directly with RPAP2 phosphatase and serve as CTD scaffolds to coordinate the dephosphorylation of phospho-S5 by RPAP2. RPRD1B is highly expressed during tumorigenesis and in endometrial cancer, has been shown to promote tumor growth by accelerating the cell cycle. CID contains eight alpha-helices in a right-handed superhelical arrangement, which closely resembles that of the VHS domains and ARM (Armadillo) repeat proteins, except for its two amino-terminal helices.


Pssm-ID: 340809  Cd Length: 129  Bit Score: 39.60  E-value: 2.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   41 KHVVQIV---EKFIKKCKPEYKVPGLYVIDSIVRQSRHQfgtdkdvfGPRFSKNITATFQYLYLCPSED-----KSKIVR 112
Cdd:cd17012    33 KHAGPIVsvwHRELRKAKSSRKLTFLYLANDVIQNSKRK--------GPEFTREFESVLVDAFSHVAREadegcKKPLER 104
                          90
                  ....*....|....*....
gi 578836633  113 VLNLWQKNGVFKIEIIQPL 131
Cdd:cd17012   105 LLNIWQERSVYGGDFIQQL 123
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
969-1060 2.26e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 41.83  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633   969 RDQERFGRRSFGNRVENDRERYGNRNDDRDNSNRDRREwgrRSPDRDRHRD--------LEERNRRSSGHRDRERDSRDR 1040
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDKGRERSRDRSRDRERS---RSRRRDRHRDrdyyrgreRRSRSRRPNRRYRPREKRRRR 79
                           90       100
                   ....*....|....*....|...
gi 578836633  1041 ESRREKE---EARGKEKPEVTDR 1060
Cdd:TIGR01622   80 GDSYRRRrddRRSRREKPRARDG 102
RRM1_AtRSp31_like cd12234
RNA recognition motif (RRM) found in Arabidopsis thaliana arginine/serine-rich-splicing factor ...
506-576 2.40e-03

RNA recognition motif (RRM) found in Arabidopsis thaliana arginine/serine-rich-splicing factor RSp31 and similar proteins from plants; This subfamily corresponds to the RRM1in a family that represents a novel group of arginine/serine (RS) or serine/arginine (SR) splicing factors existing in plants, such as A. thaliana RSp31, RSp35, RSp41 and similar proteins. Like vertebrate RS splicing factors, these proteins function as plant splicing factors and play crucial roles in constitutive and alternative splicing in plants. They all contain two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), at their N-terminus, and an RS domain at their C-terminus.


Pssm-ID: 409680 [Multi-domain]  Cd Length: 72  Bit Score: 37.90  E-value: 2.40e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMipPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIAWA 576
Cdd:cd12234     3 VFCGNFEYDARQSEIERLFGKYGRVDRVDM--KSGYAFVYMEDERDAEDAIRGLDNFEFGRQRRRLRVEWT 71
RRM_Aly_REF_like cd12418
RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM ...
504-559 2.42e-03

RNA recognition motif (RRM) found in the Aly/REF family; This subfamily corresponds to the RRM of Aly/REF family which includes THO complex subunit 4 (THOC4, also termed Aly/REF), S6K1 Aly/REF-like target (SKAR, also termed PDIP3 or PDIP46) and similar proteins. THOC4 is an mRNA transporter protein with a well conserved RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). It is involved in RNA transportation from the nucleus, and was initially identified as a transcription coactivator of LEF-1 and AML-1 for the TCRalpha enhancer function. In addition, THOC4 specifically binds to rhesus (RH) promoter in erythroid, and might be a novel transcription cofactor for erythroid-specific genes. SKAR shows high sequence homology with THOC4 and possesses one RRM as well. SKAR is widely expressed and localizes to the nucleus. It may be a critical player in the function of S6K1 in cell and organism growth control by binding the activated, hyperphosphorylated form of S6K1 but not S6K2. Furthermore, SKAR functions as a protein partner of the p50 subunit of DNA polymerase delta. In addition, SKAR may have particular importance in pancreatic beta cell size determination and insulin secretion.


Pssm-ID: 409852 [Multi-domain]  Cd Length: 75  Bit Score: 37.94  E-value: 2.42e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR-----GCAYIVMVHRQDAYRALQKL 559
Cdd:cd12418     1 TRVRVSNLHPDVTEEDLRELFGRVGPVKSVKINYDRsgrstGTAYVVFERPEDAEKAIKQF 61
RRM2_SART3 cd12392
RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells ...
505-574 2.55e-03

RNA recognition motif 2 (RRM2) found in squamous cell carcinoma antigen recognized by T-cells 3 (SART3) and similar proteins; This subfamily corresponds to the RRM2 of SART3, also termed Tat-interacting protein of 110 kDa (Tip110), is an RNA-binding protein expressed in the nucleus of the majority of proliferating cells, including normal cells and malignant cells, but not in normal tissues except for the testes and fetal liver. It is involved in the regulation of mRNA splicing probably via its complex formation with RNA-binding protein with a serine-rich domain (RNPS1), a pre-mRNA-splicing factor. SART3 has also been identified as a nuclear Tat-interacting protein that regulates Tat transactivation activity through direct interaction and functions as an important cellular factor for HIV-1 gene expression and viral replication. In addition, SART3 is required for U6 snRNP targeting to Cajal bodies. It binds specifically and directly to the U6 snRNA, interacts transiently with the U6 and U4/U6 snRNPs, and promotes the reassembly of U4/U6 snRNPs after splicing in vitro. SART3 contains an N-terminal half-a-tetratricopeptide repeat (HAT)-rich domain, a nuclearlocalization signal (NLS) domain, and two C-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains).


Pssm-ID: 409826 [Multi-domain]  Cd Length: 81  Bit Score: 38.08  E-value: 2.55e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEFGPIESINMIP-----PRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKIA 574
Cdd:cd12392     4 KLFVKGLPFSCTKEELEELFKQHGTVKDVRLVTyrngkPKGLAYVEYENEADASQAVLKTD--GTEIKDHTISVA 76
RRM_HP0827_like cd12399
RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; ...
506-559 3.22e-03

RNA recognition motif (RRM) found in Helicobacter pylori HP0827 protein and similar proteins; This subfamily corresponds to the RRM of H. pylori HP0827, a putative ssDNA-binding protein 12rnp2 precursor, containing one RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain). The ssDNA binding may be important in activation of HP0827.


Pssm-ID: 409833 [Multi-domain]  Cd Length: 75  Bit Score: 37.50  E-value: 3.22e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PPRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12399     1 LYVGNLPYSASEEQLKSLFGQFGAVFDVKLPmdretkRPRGFGFVELQEEESAEKAIAKL 60
RRM2_FCA cd12637
RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar ...
506-565 3.24e-03

RNA recognition motif 2 (RRM2) found in plant flowering time control protein FCA and similar proteins; This subgroup corresponds to the RRM2 of FCA, a gene controlling flowering time in Arabidopsis, which encodes a flowering time control protein that functions in the posttranscriptional regulation of transcripts involved in the flowering process. The flowering time control protein FCA contains two RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNP (ribonucleoprotein domains), and a WW protein interaction domain.


Pssm-ID: 410045 [Multi-domain]  Cd Length: 81  Bit Score: 37.75  E-value: 3.24e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIPPR------GCAYIVMVHRQDAYRALQKLSrGNYK 565
Cdd:cd12637     2 LFVGSLPKTATEQEVRDLFEAYGEVEEVYLMKDPvtqqgtGCAFVKFAYKEEALAAIRSLN-GTVT 66
RRM4_MRN1 cd12522
RNA recognition motif 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This ...
503-567 4.55e-03

RNA recognition motif 4 (RRM4) found in RNA-binding protein MRN1 and similar proteins; This subgroup corresponds to the RRM4 of MRN1, also termed multicopy suppressor of RSC-NHP6 synthetic lethality protein 1, or post-transcriptional regulator of 69 kDa, which is a RNA-binding protein found in yeast. Although its specific biological role remains unclear, MRN1 might be involved in translational regulation. Members in this family contain four copies of conserved RNA recognition motif (RRM), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain).


Pssm-ID: 409942 [Multi-domain]  Cd Length: 81  Bit Score: 37.12  E-value: 4.55e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578836633  503 STTLWVGQLDKRT--TQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRG----NYKVN 567
Cdd:cd12522     3 SRNVYIGNIDDVRvlTEERLRHDFSQYGEIEQVNFLREKNCAFVNFTNIANAIKAIDKIKSKpyykDLKIN 73
RRM1_2_CoAA_like cd12343
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator ...
506-573 4.66e-03

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in RRM-containing coactivator activator/modulator (CoAA) and similar proteins; This subfamily corresponds to the RRM in CoAA (also known as RBM14 or PSP2) and RNA-binding protein 4 (RBM4). CoAA is a heterogeneous nuclear ribonucleoprotein (hnRNP)-like protein identified as a nuclear receptor coactivator. It mediates transcriptional coactivation and RNA splicing effects in a promoter-preferential manner, and is enhanced by thyroid hormone receptor-binding protein (TRBP). CoAA contains two N-terminal RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a TRBP-interacting domain. RBM4 is a ubiquitously expressed splicing factor with two isoforms, RBM4A (also known as Lark homolog) and RBM4B (also known as RBM30), which are very similar in structure and sequence. RBM4 may also function as a translational regulator of stress-associated mRNAs as well as play a role in micro-RNA-mediated gene regulation. RBM4 contains two N-terminal RRMs, a CCHC-type zinc finger, and three alanine-rich regions within their C-terminal regions. This family also includes Drosophila RNA-binding protein lark (Dlark), a homolog of human RBM4. It plays an important role in embryonic development and in the circadian regulation of adult eclosion. Dlark shares high sequence similarity with RBM4 at the N-terminal region. However, Dlark has three proline-rich segments instead of three alanine-rich segments within the C-terminal region.


Pssm-ID: 409779 [Multi-domain]  Cd Length: 66  Bit Score: 36.82  E-value: 4.66e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIppRGCAYIVMVHRQDAYRALQKLSrgNYKVNQKSIKI 573
Cdd:cd12343     2 IFVGNLPDAATSEELRALFEKYGKVTECDIV--KNYAFVHMEKEEDAEDAIKALN--GYEFMGSRINV 65
RRM3_Prp24 cd12298
RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar ...
511-558 4.75e-03

RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar proteins; This subfamily corresponds to the RRM3 of Prp24, also termed U4/U6 snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an RNA-binding protein with four well conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It facilitates U6 RNA base-pairing with U4 RNA during spliceosome assembly. Prp24 specifically binds free U6 RNA primarily with RRMs 1 and 2 and facilitates pairing of U6 RNA bases with U4 RNA bases. Additionally, it may also be involved in dissociation of the U4/U6 complex during spliceosome activation.


Pssm-ID: 409739 [Multi-domain]  Cd Length: 78  Bit Score: 36.86  E-value: 4.75e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633  511 LDKRTTQQDVASLLEEFGPIESINMiPP----------RGCAYIVMVHRQDAYRALQK 558
Cdd:cd12298     8 LDFELDEEALRGIFEKFGEIESINI-PKkqknrkgrhnNGFAFVTFEDADSAESALQL 64
RRM_SRSF10_SRSF12 cd12312
RNA recognition motif (RRM) found in serine/arginine-rich splicing factor SRSF10, SRSF12 and ...
504-576 5.03e-03

RNA recognition motif (RRM) found in serine/arginine-rich splicing factor SRSF10, SRSF12 and similar proteins; This subfamily corresponds to the RRM of SRSF10 and SRSF12. SRSF10, also termed 40 kDa SR-repressor protein (SRrp40), or FUS-interacting serine-arginine-rich protein 1 (FUSIP1), or splicing factor SRp38, or splicing factor, arginine/serine-rich 13A (SFRS13A), or TLS-associated protein with Ser-Arg repeats (TASR). It is a serine-arginine (SR) protein that acts as a potent and general splicing repressor when dephosphorylated. It mediates global inhibition of splicing both in M phase of the cell cycle and in response to heat shock. SRSF10 emerges as a modulator of cholesterol homeostasis through the regulation of low-density lipoprotein receptor (LDLR) splicing efficiency. It also regulates cardiac-specific alternative splicing of triadin pre-mRNA and is required for proper Ca2+ handling during embryonic heart development. In contrast, the phosphorylated SRSF10 functions as a sequence-specific splicing activator in the presence of a nuclear cofactor. It activates distal alternative 5' splice site of adenovirus E1A pre-mRNA in vivo. Moreover, SRSF10 strengthens pre-mRNA recognition by U1 and U2 snRNPs. SRSF10 localizes to the nuclear speckles and can shuttle between nucleus and cytoplasm. SRSF12, also termed 35 kDa SR repressor protein (SRrp35), or splicing factor, arginine/serine-rich 13B (SFRS13B), or splicing factor, arginine/serine-rich 19 (SFRS19), is a serine/arginine (SR) protein-like alternative splicing regulator that antagonizes authentic SR proteins in the modulation of alternative 5' splice site choice. For instance, it activates distal alternative 5' splice site of the adenovirus E1A pre-mRNA in vivo. Both, SRSF10 and SRSF12, contain a single N-terminal RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), followed by a C-terminal RS domain rich in serine-arginine dipeptides.


Pssm-ID: 240758 [Multi-domain]  Cd Length: 84  Bit Score: 37.35  E-value: 5.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  504 TTLWVGQLDKRTTQQDVASLLEEFGPIESInMIP-------PRGCAYIVMVHRQDAYRALQKLSRGNykVNQKSIKIAWA 576
Cdd:cd12312     1 TSLFVRNVADDTRPDDLRREFGRYGPIVDV-YIPldfytrrPRGFAYIQFEDVRDAEDALYYLDRTR--FLGREIEIQFA 77
RRM1_CELF1_2_Bruno cd12631
RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-1, CELF-2, ...
506-555 5.29e-03

RNA recognition motif 1 (RRM1) found in CUGBP Elav-like family member CELF-1, CELF-2, Drosophila melanogaster Bruno protein and similar proteins; This subgroup corresponds to the RRM1 of CELF-1, CELF-2 and Bruno protein. CELF-1 (also termed BRUNOL-2, or CUG-BP1, or EDEN-BP) and CELF-2 (also termed BRUNOL-3, or ETR-3, or CUG-BP2, or NAPOR) belong to the CUGBP1 and ETR-3-like factors (CELF) or BRUNOL (Bruno-like) family of RNA-binding proteins that have been implicated in regulation of pre-mRNA splicing, and control of mRNA translation and deadenylation. CELF-1 is strongly expressed in all adult and fetal tissues tested. The human CELF-1 is a nuclear and cytoplasmic RNA-binding protein that regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of type 1 myotonic dystrophy (DM1), a neuromuscular disease associated with an unstable CUG triplet expansion in the 3'-UTR (3'-untranslated region) of the DMPK (myotonic dystrophy protein kinase) gene; it preferentially targets UGU-rich mRNA elements. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. The Xenopus homolog embryo deadenylation element-binding protein (EDEN-BP) mediates sequence-specific deadenylation of Eg5 mRNA. It binds specifically to the EDEN motif in the 3'-untranslated regions of maternal mRNAs and targets these mRNAs for deadenylation and translational repression. CELF-1 contain three highly conserved RNA recognition motifs (RRMs), also known as RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains): two consecutive RRMs (RRM1 and RRM2) situated in the N-terminal region followed by a linker region and the third RRM (RRM3) close to the C-terminus of the protein. The two N-terminal RRMs of EDEN-BP are necessary for the interaction with EDEN as well as a part of the linker region (between RRM2 and RRM3). Oligomerization of EDEN-BP is required for specific mRNA deadenylation and binding. CELF-2 is expressed in all tissues at some level, but highest in brain, heart, and thymus. It has been implicated in the regulation of nuclear and cytoplasmic RNA processing events, including alternative splicing, RNA editing, stability and translation. CELF-2 shares high sequence identity with CELF-1, but shows different binding specificity; it binds preferentially to sequences with UG repeats and UGUU motifs. It has been shown to bind to a Bruno response element, a cis-element involved in translational control of oskar mRNA in Drosophila, and share sequence similarity to Bruno, the Drosophila protein that mediates this process. It also binds to the 3'-UTR of cyclooxygenase-2 messages, affecting both translation and mRNA stability, and binds to apoB mRNA, regulating its C to U editing. CELF-2 also contains three highly conserved RRMs. It binds to RNA via the first two RRMs, which are also important for localization in the cytoplasm. The splicing activation or repression activity of CELF-2 on some specific substrates is mediated by RRM1/RRM2. Both, RRM1 and RRM2 of CELF-2, can activate cardiac troponin T (cTNT) exon 5 inclusion. In addition, CELF-2 possesses a typical arginine and lysine-rich nuclear localization signal (NLS) in the C-terminus, within RRM3. This subgroup also includes Drosophila melanogaster Bruno protein, which plays a central role in regulation of Oskar (Osk) expression in flies. It mediates repression by binding to regulatory Bruno response elements (BREs) in the Osk mRNA 3' UTR. The full-length Bruno protein contains three RRMs, two located in the N-terminal half of the protein and the third near the C-terminus, separated by a linker region.


Pssm-ID: 410040 [Multi-domain]  Cd Length: 84  Bit Score: 37.11  E-value: 5.29e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMI------PP--RGCAYIVMVHRQDAYRA 555
Cdd:cd12631     4 MFVGQIPRSWSEKELRELFEQYGAVYQINVLrdrsqnPPqsKGCCFVTFYTRKAALEA 61
RRM2_RBM23_RBM39 cd12284
RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and ...
506-559 6.47e-03

RNA recognition motif 2 (RRM2) found in vertebrate RNA-binding protein RBM23, RBM39 and similar proteins; This subfamily corresponds to the RRM2 of RBM39 (also termed HCC1), a nuclear autoantigen that contains an N-terminal arginine/serine rich (RS) motif and three RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). An octapeptide sequence called the RS-ERK motif is repeated six times in the RS region of RBM39. Although the cellular function of RBM23 remains unclear, it shows high sequence homology to RBM39 and contains two RRMs. It may possibly function as a pre-mRNA splicing factor.


Pssm-ID: 409726 [Multi-domain]  Cd Length: 78  Bit Score: 36.84  E-value: 6.47e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578836633  506 LWVGQLDKRTTQQDVASLLEEFGPIESINMIP------PRGCAYIVMVHRQDAYRALQKL 559
Cdd:cd12284     1 LYVGSLHFNITEDMLRGIFEPFGKIEFVQLQKdpetgrSKGYGFIQFRDAEDAKKALEQL 60
U2AF_lg TIGR01642
U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of ...
969-1027 6.68e-03

U2 snRNP auxilliary factor, large subunit, splicing factor; These splicing factors consist of an N-terminal arginine-rich low complexity domain followed by three tandem RNA recognition motifs (pfam00076). The well-characterized members of this family are auxilliary components of the U2 small nuclear ribonuclearprotein splicing factor (U2AF). These proteins are closely related to the CC1-like subfamily of splicing factors (TIGR01622). Members of this subfamily are found in plants, metazoa and fungi.


Pssm-ID: 273727 [Multi-domain]  Cd Length: 509  Bit Score: 40.26  E-value: 6.68e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633   969 RDQERfgRRSFGNRVENDRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERNRRS 1027
Cdd:TIGR01642    1 RDEEP--DREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRR 57
RRM1_2_MATR3_like cd12436
RNA recognition motif 1 (RRM1) and 2 (RRM2) found in the matrin 3 family of nuclear proteins; ...
516-574 8.76e-03

RNA recognition motif 1 (RRM1) and 2 (RRM2) found in the matrin 3 family of nuclear proteins; This subfamily corresponds to the RRM of the matrin 3 family of nuclear proteins consisting of Matrin 3 (MATR3), nuclear protein 220 (NP220) and similar proteins. MATR3 is a highly conserved inner nuclear matrix protein that has been implicated in various biological processes. NP220 is a large nucleoplasmic DNA-binding protein that binds to cytidine-rich sequences, such as CCCCC (G/C), in double-stranded DNA (dsDNA). Both, Matrin 3 and NP220, contain two RNA recognition motif (RRM), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and a Cys2-His2 zinc finger-like motif at the C-terminal region.


Pssm-ID: 409870 [Multi-domain]  Cd Length: 76  Bit Score: 36.17  E-value: 8.76e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 578836633  516 TQQDVASLLEEFGPIESINMIPPRGCAYIVMVHRQDAYRALQKLSRGNYKVNQKSIKIA 574
Cdd:cd12436    14 SEEDVLKLAEPFGKVNNVLLIRSKREAFIEMETAEDAQAMLSYCKTKPITIKGKKVKVS 72
RRM1_RBM34 cd12394
RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; ...
505-556 8.90e-03

RNA recognition motif 1 (RRM1) found in RNA-binding protein 34 (RBM34) and similar proteins; This subfamily corresponds to the RRM1 of RBM34, a putative RNA-binding protein containing two RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). Although the function of RBM34 remains unclear currently, its RRM domains may participate in mRNA processing. RBM34 may act as an mRNA processing-related protein.


Pssm-ID: 409828 [Multi-domain]  Cd Length: 91  Bit Score: 36.80  E-value: 8.90e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578836633  505 TLWVGQLDKRTTQQDVASLLEEFGPIESINM---------IPPRG--------------CAYIVMVHRQDAYRAL 556
Cdd:cd12394     2 TVFVGNLPVTVKKKALKKLFKEFGKIESVRFrsvavanpkLPKKVavikkkfhpkrdsmNAYVVFKEEESAQKAL 76
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH