NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|206729892|sp|O14802|]
View 

RecName: Full=DNA-directed RNA polymerase III subunit RPC1; Short=RNA polymerase III subunit C1; AltName: Full=DNA-directed RNA polymerase III largest subunit; AltName: Full=DNA-directed RNA polymerase III subunit A; AltName: Full=RNA polymerase III 155 kDa subunit; Short=RPC155; AltName: Full=RNA polymerase III subunit C160

Protein Classification

DNA-directed RNA polymerase III subunit RPC1( domain architecture ID 10118853)

DNA-directed RNA polymerase III subunit RPC1 is the largest and is a catalytic core component of RNA polymerase III which synthesizes small RNAs, such as 5S rRNA and tRNAs

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
RNAP_III_RPC1_N cd02583
Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 ...
24-891 0e+00

Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 (C160) subunit forms part of the active site region of RNAP III. RNAP III is one of the three distinct classes of nuclear RNAP in eukaryotes that is responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA genes, and some others. RNAP III is the largest nuclear RNA polymerase with 17 subunits. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shaped structure. The N-terminal domain of Rpb1, the largest subunit of RNAP II in yeast, forms part of the active site, making up the head and core of the one clamp, as well as the pore and funnel structures of RNAP II. The strong homology between Rpc1 and Rpb1 suggests a similar functional and structural role.


:

Pssm-ID: 259847 [Multi-domain]  Cd Length: 816  Bit Score: 1668.46  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   24 SPEEMRQQAHIQVVSKNLYSQDNQHaPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIG 103
Cdd:cd02583     2 SPEDIIRLSEVEVTNRNLYDIETRK-PLPYGVLDPRLGTSDKDGICETCGLNLADCVGHFGYIKLELPVFHIGYFKAIIN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  104 ILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGafngtvkkcgllkiihekyktnk 183
Cdd:cd02583    81 ILQCICKTCSRVLLPEEEKRKFLKRLRRPNLDNLQKKALKKKILEKCKKVRKCPHCG----------------------- 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  184 kvvdpivsnflqsfetaiehnkevepLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPPLCIR 263
Cdd:cd02583   138 --------------------------LLKKAQEDLNPLKVLNLFKNIPPEDVELLLMNPLAGRPENLILTRIPVPPLCIR 191
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  264 PSVVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFVQ 343
Cdd:cd02583   192 PSVVMDEKSGTNEDDLTVKLSEIIFLNDVIKKHLEKGAKTQKIMEDWDFLQLQCALYINSELPGLPLSMQPKKPIRGFCQ 271
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  344 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQQ 423
Cdd:cd02583   272 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDQVGVPEHVAKILTYPERVTRYNIEKLRKLVLNGPDVHPGANFVIK 351
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 RHTQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:cd02583   352 RDGGKKKFLKYGNRRKIARELKIGDIVERHLEDGDIVLFNRQPSLHRLSIMAHRAKVMPWRTFRFNECVCTPYNADFDGD 431
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILvgkDEKIKVRL 583
Cdd:cd02583   432 EMNLHVPQTEEARAEALELMGVKNNLVTPRNGEPLIAATQDFLTASYLLTSKDVFFDRAQFCQLCSYML---DGEIKIDL 508
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  584 PPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCGKGEDLCANDSYVTIQNSELMSGSMDKGTLGSGSKNNI 663
Cdd:cd02583   509 PPPAILKPVELWTGKQIFSLLLRPNKKSPVLVNLEAKEKSYTKKSPDMCPNDGYVVIRNSELLCGRLDKSTLGSGSKNSL 588
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  664 FYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCT 743
Cdd:cd02583   589 FYVLLRDYGPEAAAAAMNRLAKLSSRWLSNRGFSIGIDDVTPSKELLKKKEELVDNGYAKCDEYIKQYKKGKLELQPGCT 668
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  744 AEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEK 823
Cdd:cd02583   669 AEQTLEAKISGELSKIREDAGKACLKELHKSNSPLIMALCGSKGSNINISQMIACVGQQIISGKRIPNGFEDRTLPHFPR 748
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  824 HSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVR 891
Cdd:cd02583   749 NSKTPAAKGFVANSFYSGLTPTEFFFHTMSGREGLVDTAVKTAETGYMQRRLMKALEDLSVQYDGTVR 816
RNAP_III_Rpc1_C cd02736
Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; ...
1021-1360 0e+00

Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; Eukaryotic RNA polymerase III (RNAP III) is a large multi-subunit complex responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA, among others. Rpc1 is also known as C160 in yeast. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shape structure. The C-terminal domain of Rpb1, the largest subunit of RNAP II, makes up part of the foot and jaw structures of RNAP II. The similarity between this domain and the C-terminal domain of Rpb1, its counterpart in RNAP II, suggests a similar functional and structural role.


:

Pssm-ID: 132723 [Multi-domain]  Cd Length: 300  Bit Score: 565.31  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1021 KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKDDDADYARL 1100
Cdd:cd02736     1 KYMRAKVEPGTAVGAIAAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKNISTPIITAKLENDRDEKSARI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1101 VKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLevnaetvrysictsklrvkpgdvavhgeavvcvtpren 1180
Cdd:cd02736    81 VKGRIEKTYLGEVASYIEEVYSPDDCYILIKLDKKIIEKLQL-------------------------------------- 122
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1181 SKSSMYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKekYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGI 1260
Cdd:cd02736   123 SKSNLYFLLQSLKRKLPDVVVSGIPEVKRAVINKDKKKGK--YKLLVEGYGLRAVMNTPGVIGTRTTSNHIMEVEKVLGI 200
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1261 EAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSV 1340
Cdd:cd02736   201 EAARSTIINEIQYTMKSHGMSIDPRHIMLLADLMTFKGEVLGITRFGIAKMKESVLMLASFEKTTDHLFNAALHGRKDSI 280
                         330       340
                  ....*....|....*....|
gi 206729892 1341 CGVSECIIMGIPMNIGTGLF 1360
Cdd:cd02736   281 EGVSECIIMGKPMPIGTGLF 300
rpoC2 super family cl33332
RNA polymerase beta'' subunit; Reviewed
841-1058 1.63e-08

RNA polymerase beta'' subunit; Reviewed


The actual alignment was detected with superfamily member CHL00117:

Pssm-ID: 214368 [Multi-domain]  Cd Length: 1364  Bit Score: 59.57  E-value: 1.63e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  841 GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDL------CsqydLTVRSSTgdiiqfiyggdgLDPAAMEG 914
Cdd:CHL00117  172 GLSLTEYIISCYGARKGVVDTAVRTADAGYLTRRLVEVVQHIvvretdC----GTTRGIS------------VSPRNGMM 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  915 KDEPLEFK---RVL-DNIKavfpcpsepalSKNELILTTEsimkkseflccQDSFLQEIKKFIKgvsekikktrdkygin 990
Cdd:CHL00117  236 IERILIQTligRVLaDDIY-----------IGSRCIATRN-----------QDIGIGLANRFIT---------------- 277
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  991 dngtteprvlYQLDRI---TPTqvekfleTCRDkyMRA--QM------------EPGSAVGALCAQSIGEPGTQMTLKTF 1053
Cdd:CHL00117  278 ----------FRAQPIsirSPL-------TCRS--TSWicQLcygwslahgdlvELGEAVGIIAGQSIGEPGTQLTLRTF 338

                  ....*
gi 206729892 1054 HFAGV 1058
Cdd:CHL00117  339 HTGGV 343
 
Name Accession Description Interval E-value
RNAP_III_RPC1_N cd02583
Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 ...
24-891 0e+00

Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 (C160) subunit forms part of the active site region of RNAP III. RNAP III is one of the three distinct classes of nuclear RNAP in eukaryotes that is responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA genes, and some others. RNAP III is the largest nuclear RNA polymerase with 17 subunits. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shaped structure. The N-terminal domain of Rpb1, the largest subunit of RNAP II in yeast, forms part of the active site, making up the head and core of the one clamp, as well as the pore and funnel structures of RNAP II. The strong homology between Rpc1 and Rpb1 suggests a similar functional and structural role.


Pssm-ID: 259847 [Multi-domain]  Cd Length: 816  Bit Score: 1668.46  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   24 SPEEMRQQAHIQVVSKNLYSQDNQHaPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIG 103
Cdd:cd02583     2 SPEDIIRLSEVEVTNRNLYDIETRK-PLPYGVLDPRLGTSDKDGICETCGLNLADCVGHFGYIKLELPVFHIGYFKAIIN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  104 ILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGafngtvkkcgllkiihekyktnk 183
Cdd:cd02583    81 ILQCICKTCSRVLLPEEEKRKFLKRLRRPNLDNLQKKALKKKILEKCKKVRKCPHCG----------------------- 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  184 kvvdpivsnflqsfetaiehnkevepLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPPLCIR 263
Cdd:cd02583   138 --------------------------LLKKAQEDLNPLKVLNLFKNIPPEDVELLLMNPLAGRPENLILTRIPVPPLCIR 191
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  264 PSVVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFVQ 343
Cdd:cd02583   192 PSVVMDEKSGTNEDDLTVKLSEIIFLNDVIKKHLEKGAKTQKIMEDWDFLQLQCALYINSELPGLPLSMQPKKPIRGFCQ 271
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  344 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQQ 423
Cdd:cd02583   272 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDQVGVPEHVAKILTYPERVTRYNIEKLRKLVLNGPDVHPGANFVIK 351
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 RHTQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:cd02583   352 RDGGKKKFLKYGNRRKIARELKIGDIVERHLEDGDIVLFNRQPSLHRLSIMAHRAKVMPWRTFRFNECVCTPYNADFDGD 431
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILvgkDEKIKVRL 583
Cdd:cd02583   432 EMNLHVPQTEEARAEALELMGVKNNLVTPRNGEPLIAATQDFLTASYLLTSKDVFFDRAQFCQLCSYML---DGEIKIDL 508
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  584 PPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCGKGEDLCANDSYVTIQNSELMSGSMDKGTLGSGSKNNI 663
Cdd:cd02583   509 PPPAILKPVELWTGKQIFSLLLRPNKKSPVLVNLEAKEKSYTKKSPDMCPNDGYVVIRNSELLCGRLDKSTLGSGSKNSL 588
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  664 FYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCT 743
Cdd:cd02583   589 FYVLLRDYGPEAAAAAMNRLAKLSSRWLSNRGFSIGIDDVTPSKELLKKKEELVDNGYAKCDEYIKQYKKGKLELQPGCT 668
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  744 AEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEK 823
Cdd:cd02583   669 AEQTLEAKISGELSKIREDAGKACLKELHKSNSPLIMALCGSKGSNINISQMIACVGQQIISGKRIPNGFEDRTLPHFPR 748
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  824 HSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVR 891
Cdd:cd02583   749 NSKTPAAKGFVANSFYSGLTPTEFFFHTMSGREGLVDTAVKTAETGYMQRRLMKALEDLSVQYDGTVR 816
PRK08566 PRK08566
DNA-directed RNA polymerase subunit A'; Validated
7-930 0e+00

DNA-directed RNA polymerase subunit A'; Validated


Pssm-ID: 236292 [Multi-domain]  Cd Length: 882  Bit Score: 955.84  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    7 RETDVAKKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNqhAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYI 86
Cdd:PRK08566    1 SMMMIPKRIGSIKFGLLSPEEIRKMSVTKIITADTYDDDG--YPIDGGLMDPRLGVIDPGLRCKTCGGRAGECPGHFGHI 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   87 DLELPCFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAfngtv 166
Cdd:PRK08566   79 ELARPVIHVGFAKLIYKLLRATCRECGRLKLTEEEIEEYLEKLERLKEWGSLADDLIKEVKKEAAKRMVCPHCGE----- 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  167 KKcglLKIIHEKyktnkkvvdPIvsnflqsfeTAIEHNKEVEpllgraqENLNPLVVLNLFKRIPAEDVPLLLMNPEAGK 246
Cdd:PRK08566  154 KQ---YKIKFEK---------PT---------TFYEERKEGL-------VKLTPSDIRERLEKIPDEDLELLGINPEVAR 205
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  247 PSDLILTRLLVPPLCIRPSVVsdLKSG-TNEDDLTMKLTEIIFLNDVIKKHRISGAKtQMIMED-WDFLQLQCALYINSE 324
Cdd:PRK08566  206 PEWMVLTVLPVPPVTVRPSIT--LETGqRSEDDLTHKLVDIIRINQRLKENIEAGAP-QLIIEDlWELLQYHVTTYFDNE 282
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  325 LSGIPlnmaP-----KKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKA 399
Cdd:PRK08566  283 IPGIP----ParhrsGRPLKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPNLSINEVGVPEAIAKELTVPERVTEW 358
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  400 NINFLRKLVQNGPEVHPGANFIqqRHTQMKRF-LKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLA 478
Cdd:PRK08566  359 NIEELREYVLNGPEKHPGANYV--IRPDGRRIkLTDKNKEELAEKLEPGWIVERHLIDGDIVLFNRQPSLHRMSIMAHRV 436
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  479 RVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTF 558
Cdd:PRK08566  437 RVLPGKTFRLNLAVCPPYNADFDGDEMNLHVPQTEEARAEARILMLVQEHILSPRYGGPIIGGIQDHISGAYLLTRKSTL 516
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  559 FDRAKACQIIASILVGKDEkikvrLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRanlRTKGKQYCGKGED-LCANDSY 637
Cdd:PRK08566  517 FTKEEALDLLRAAGIDELP-----EPEPAIENGKPYWTGKQIFSLFL-PKDLNLEF---KAKICSGCDECKKeDCEHDAY 587
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  638 VTIQNSELMSGSMDKGTLGSGsKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVT-PGQGLLKAKyEL 716
Cdd:PRK08566  588 VVIKNGKLLEGVIDKKAIGAE-QGSILDRIVKEYGPERARRFLDSVTRLAIRFIMLRGFTTGIDDEDiPEEAKEEID-EI 665
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  717 LNAGYKKCDEYIEALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMI 796
Cdd:PRK08566  666 IEEAEKRVEELIEAYENGELEPLPGRTLEETLEMKIMQVLGKARDEAGEIAEKYLGLDNPAVIMARTGARGSMLNLTQMA 745
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  797 ACVGQQAISGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLV 876
Cdd:PRK08566  746 ACVGQQSVRGERIRRGYRDRTLPHFKPGDLGAEARGFVRSSYKSGLTPTEFFFHAMGGREGLVDTAVRTSQSGYMQRRLI 825
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|....
gi 206729892  877 KSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGkDEPLEFKRVLDNIKA 930
Cdd:PRK08566  826 NALQDLKVEYDGTVRDTRGNIVQFKYGEDGVDPMKSDH-GKPVDVDRIIERVLG 878
RNA_pol_rpoA1 TIGR02390
DNA-directed RNA polymerase subunit A'; This family consists of the archaeal A' subunit of the ...
13-925 0e+00

DNA-directed RNA polymerase subunit A'; This family consists of the archaeal A' subunit of the DNA-directed RNA polymerase. The example from Methanocaldococcus jannaschii contains an intein.


Pssm-ID: 274106 [Multi-domain]  Cd Length: 868  Bit Score: 884.44  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    13 KKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNqhAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPC 92
Cdd:TIGR02390    2 KKIGSIKFGLLSPEEIRKMSVVEVVTADTYDDDG--YPIEGGLMDPRLGVIEPGLRCKTCGGKVGECPGHFGHIELARPV 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    93 FHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLD-YLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAfngtvkkcGL 171
Cdd:TIGR02390   80 VHVGFAKEIYKILRATCRKCGRITLTEEEIEQYLEkINKLKEEGGDLASTLIEKIVKEAAKRMKCPHCGE--------EQ 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   172 LKIIHEKyktnkkvvdPivSNFLQsfetaiehnkevepLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLI 251
Cdd:TIGR02390  152 KKIKFEK---------P--TYFYE--------------EGKEGDVKLTPSEIRERLEKIPDEDAELLGINPKVARPEWMV 206
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   252 LTRLLVPPLCIRPSVVsdLKSGT-NEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIP- 329
Cdd:TIGR02390  207 LTVLPVPPVTVRPSIT--LETGErSEDDLTHKLVDIIRINQRLKENIEAGAPQLIIEDLWELLQYHVATYFDNELPGIPp 284
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   330 LNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQ 409
Cdd:TIGR02390  285 ARHRSGRPLKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPNISINEVGVPEQIAKELTVPERVTPWNIDELREYVL 364
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   410 NGPEVHPGANFIQQrhTQMKRF-LKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRF 488
Cdd:TIGR02390  365 NGPDSWPGANYVIR--PDGRRIkIRDENKEELAERLEPGWVVERHLIDGDIVLFNRQPSLHRMSMMGHKVKVLPGKTFRL 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   489 NECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQII 568
Cdd:TIGR02390  443 NLAVCPPYNADFDGDEMNLHVPQTEEARAEARELMLVEEHILTPRYGGPIIGGIHDYISGAYLLTHKSTLFTKEEVQTIL 522
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   569 ASIlvgkdeKIKVRLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRANLRTKGKQYCGKGEdlCANDSYVTIQNSELMSG 648
Cdd:TIGR02390  523 GVA------GYFGDPPEPAIEKPKEYWTGKQIFSAFL-PEDLNFEGRAKICSGSDACKKEE--CPHDAYVVIKNGKLLKG 593
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   649 SMDKGTLGSgSKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYI 728
Cdd:TIGR02390  594 VIDKKAIGA-EKGKILHRIVREYGPEAARRFLDSVTRLFIRFITLRGFTTGIDDIDIPKEAKEEIEELIEKAEKRVDNLI 672
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   729 EALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSR 808
Cdd:TIGR02390  673 ERYRNGELEPLPGRTVEETLEMKIMEVLGKARDEAGEVAEKYLDPENHAVIMARTGARGSLLNITQMAAMVGQQSVRGGR 752
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   809 VPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDL 888
Cdd:TIGR02390  753 IRRGYRNRTLPHFKKGDIGAKARGFVRSSFKKGLDPTEYFFHAAGGREGLVDTAVRTSQSGYMQRRLINALQDLYVEYDG 832
                          890       900       910
                   ....*....|....*....|....*....|....*...
gi 206729892   889 TVRSSTGDIIQFIYGGDGLDPAAME-GKdePLEFKRVL 925
Cdd:TIGR02390  833 TVRDTRGNLIQFKYGEDGVDPMKSDhGK--PVDVKKIF 868
RNAP_III_Rpc1_C cd02736
Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; ...
1021-1360 0e+00

Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; Eukaryotic RNA polymerase III (RNAP III) is a large multi-subunit complex responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA, among others. Rpc1 is also known as C160 in yeast. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shape structure. The C-terminal domain of Rpb1, the largest subunit of RNAP II, makes up part of the foot and jaw structures of RNAP II. The similarity between this domain and the C-terminal domain of Rpb1, its counterpart in RNAP II, suggests a similar functional and structural role.


Pssm-ID: 132723 [Multi-domain]  Cd Length: 300  Bit Score: 565.31  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1021 KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKDDDADYARL 1100
Cdd:cd02736     1 KYMRAKVEPGTAVGAIAAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKNISTPIITAKLENDRDEKSARI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1101 VKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLevnaetvrysictsklrvkpgdvavhgeavvcvtpren 1180
Cdd:cd02736    81 VKGRIEKTYLGEVASYIEEVYSPDDCYILIKLDKKIIEKLQL-------------------------------------- 122
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1181 SKSSMYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKekYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGI 1260
Cdd:cd02736   123 SKSNLYFLLQSLKRKLPDVVVSGIPEVKRAVINKDKKKGK--YKLLVEGYGLRAVMNTPGVIGTRTTSNHIMEVEKVLGI 200
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1261 EAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSV 1340
Cdd:cd02736   201 EAARSTIINEIQYTMKSHGMSIDPRHIMLLADLMTFKGEVLGITRFGIAKMKESVLMLASFEKTTDHLFNAALHGRKDSI 280
                         330       340
                  ....*....|....*....|
gi 206729892 1341 CGVSECIIMGIPMNIGTGLF 1360
Cdd:cd02736   281 EGVSECIIMGKPMPIGTGLF 300
RPOLA_N smart00663
RNA polymerase I subunit A N-terminus;
248-550 8.40e-150

RNA polymerase I subunit A N-terminus;


Pssm-ID: 214767 [Multi-domain]  Cd Length: 295  Bit Score: 455.44  E-value: 8.40e-150
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    248 SDLILTRLLVPPLCIRPSVVSDLkSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSElSG 327
Cdd:smart00663    1 EWMILTVLPVPPPCLRPSVQLDG-GRFAEDDLTHLLRDIIKRNNRLKRLLELGAPSIIIRNEKRLLQEAVDTLIDNE-GL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    328 IPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKL 407
Cdd:smart00663   79 PRANQKSGRPLKSLSQRLKGKEGRFRQNLLGKRVDFSARSVITPDPNLKLNEVGVPKEIALELTFPEIVTPLNIDKLRKL 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    408 VQNGPevhPGANFIQQrhtQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFR 487
Cdd:smart00663  159 VRNGP---NGAKYIIR---GKKTNLKLAKKSKIANHLKIGDIVERHVIDGDVVLFNRQPTLHRMSIQAHRVRVLEGKTIR 232
                           250       260       270       280       290       300
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 206729892    488 FNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAY 550
Cdd:smart00663  233 LNPLVCSPYNADFDGDEMNLHVPQSLEARAEARELMLVPNNILSPKNGKPIIGPIQDMLLGLY 295
RNA_pol_Rpb1_1 pfam04997
RNA polymerase Rpb1, domain 1; RNA polymerases catalyze the DNA dependent polymerization of ...
12-356 4.12e-132

RNA polymerase Rpb1, domain 1; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 1, represents the clamp domain, which a mobile domain involved in positioning the DNA, maintenance of the transcription bubble and positioning of the nascent RNA strand.


Pssm-ID: 398595  Cd Length: 320  Bit Score: 409.76  E-value: 4.12e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    12 AKKISHICFGMKSPEEMRQQAHIQVVSKNLYSqDNQHAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELP 91
Cdd:pfam04997    1 LKKIKEIQFGIASPEEIRKWSVGEVTKPETYN-YGSLKPEEGGLLDERMGTIDKDYECETCGKKKKDCPGHFGHIELAKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    92 CFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRglKKKISDKCRKKNICHHCGAFNGTVKKcgl 171
Cdd:pfam04997   80 VFHIGFFKKTLKILECVCKYCSKLLLDPGKPKLFNKDKKRLGLENLKMG--AKAILELCKKKDLCEHCGGKNGVCGS--- 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   172 lkiihekyktnkkvVDPIVSNFLQSFETAIEHNKEVEpllgrAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLI 251
Cdd:pfam04997  155 --------------QQPVSRKEGLKLKAAIKKSKEEE-----EKEILNPEKVLKIFKRISDEDVEILGFNPSGSRPEWMI 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   252 LTRLLVPPLCIRPSVVSDLKsGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPL- 330
Cdd:pfam04997  216 LTVLPVPPPCIRPSVQLDGG-RRAEDDLTHKLRDIIKRNNRLKKLLELGAPSHIIREEWRLLQEHVATLFDNEIPGLPPa 294
                          330       340
                   ....*....|....*....|....*.
gi 206729892   331 NMAPKKWTRGFVQRLKGKQGRFRGNL 356
Cdd:pfam04997  295 LQKSKRPLKSISQRLKGKEGRFRGNL 320
RNA_pol_Rpb1_5 pfam04998
RNA polymerase Rpb1, domain 5; RNA polymerases catalyze the DNA dependent polymerization of ...
841-1316 2.78e-117

RNA polymerase Rpb1, domain 5; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 5, represents the discontinuous cleft domain that is required to from the central cleft or channel where the DNA is bound.


Pssm-ID: 398596 [Multi-domain]  Cd Length: 516  Bit Score: 377.46  E-value: 2.78e-117
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   841 GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKDePLE 920
Cdd:pfam04998    1 GLTPQEFFFHTMGGREGLIDTAVKTAESGYLQRRLVKALEDLVVTYDDTVRNSGGEIVQFLYGEDGLDPLKIEKQG-RFT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   921 FKRVLDNIKAVFPCP-SEPALSKNELILTTESIMKKSEFLCCQDSFLQEIKKFIKGVSEKI-------KKTRDKYGINDN 992
Cdd:pfam04998   80 IEFSDLKLEDKFKNDlLDDLLLLSEFSLSYKKEILVRDSKLGRDRLSKEAQERATLLFELLlksglesKRVRSELTCNSK 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   993 gtteprvlyqldritptQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKE 1072
Cdd:pfam04998  160 -----------------AFVCLLCYGRLLYQQSLINPGEAVGIIAAQSIGEPGTQMTLNTFHFAGVASKNVTLGVPRLKE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1073 IINASKAISTPIITAQL--DKDDDADYARLVKGRIEKTLLGEISEYIE------------------------------EV 1120
Cdd:pfam04998  223 IINVSKNIKSPSLTVYLfdEVGRELEKAKKVYGAIEKVTLGSVVESGEilydpdpfntpiisdvkgvvkffdiidevtNE 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1121 FLPDDCFILVKLSLERIRLLRLEVNAETVRYSIcTSKLRVKpgDVAVHGEAVVCVTPRENSKSSMYY-------VLQFLK 1193
Cdd:pfam04998  303 EEIDPETGLLILVIRLLKILNKSIKKVVKSEVI-PRSIRNK--VDEGRDIAIGEITAFIIKISKKIRqdtgglrRVDELF 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1194 ED------------LPKVVVQGIPEVSRAVIHIDEqSGKEK--YKLLVEGDNLRAVMATHG-VKGTRTTSNNTYEVEKTL 1258
Cdd:pfam04998  380 MEedpklailvaslLGNITLRGIPGIKRILVNEDD-KGKVEpdWVLETEGVNLLRVLLVPGfVDAGRILSNDIHEILEIL 458
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  1259 GIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVL 1316
Cdd:pfam04998  459 GIEAARNALLNEIRNVYRFQGIYINDRHLELIADQMTRKGYIMAIGRHGINKAELSAL 516
PRK04309 PRK04309
DNA-directed RNA polymerase subunit A''; Validated
1006-1363 1.03e-96

DNA-directed RNA polymerase subunit A''; Validated


Pssm-ID: 235277 [Multi-domain]  Cd Length: 383  Bit Score: 316.02  E-value: 1.03e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1006 ITPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPII 1085
Cdd:PRK04309   35 LTEEEVEEIIEEVVREYLRSLVEPGEAVGVVAAQSIGEPGTQMTMRTFHYAGVAEINVTLGLPRLIEIVDARKEPSTPMM 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1086 TAQLDKD--DDADYARLVKGRIEKTLLGEISEYIEevFLPDDCFILVKLSLERI--RLLRLEVNAETVRysictsklRVK 1161
Cdd:PRK04309  115 TIYLKDEyaYDREKAEEVARKIEATTLENLAKDIS--VDLANMTIIIELDEEMLedRGLTVDDVKEAIE--------KKK 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1162 PGDVAVHGEAVVcVTPRENSkssmYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEqsgkEKYKLLVEGDNLRAVMATHGV 1241
Cdd:PRK04309  185 GGEVEIEGNTLI-ISPKEPS----YRELRKLAEKIRNIKIKGIKGIKRVIIRKEG----DEYVIYTEGSNLKEVLKVEGV 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1242 KGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASF 1321
Cdd:PRK04309  256 DATRTTTNNIHEIEEVLGIEAARNAIIEEIKNTLEEQGLDVDIRHIMLVADMMTWDGEVRQIGRHGVSGEKASVLARAAF 335
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 206729892 1322 EKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLL 1363
Cdd:PRK04309  336 EVTVKHLLDAAVRGEVDELKGVTENIIVGQPIPLGTGDVELT 377
RNA_pol_rpoA2 TIGR02389
DNA-directed RNA polymerase, subunit A''; This family consists of the archaeal A'' subunit of ...
1010-1365 1.49e-85

DNA-directed RNA polymerase, subunit A''; This family consists of the archaeal A'' subunit of the DNA-directed RNA polymerase. The example from Methanocaldococcus jannaschii contains an intein. [Transcription, DNA-dependent RNA polymerase]


Pssm-ID: 274105 [Multi-domain]  Cd Length: 367  Bit Score: 283.87  E-value: 1.49e-85
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1010 QVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQL 1089
Cdd:TIGR02389   24 ELDEIIKRVEEEYLRSLIDPGEAVGIVAAQSIGEPGTQMTMRTFHYAGVAELNVTLGLPRLIEIVDARKTPSTPSMTIYL 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1090 DKDD--DADYARLVKGRIEKTLLGEISEYIEevflpddcfilVKLSLERIrllRLEVNAETVRYSICTSKL------RVK 1161
Cdd:TIGR02389  104 EDEYekDREKAEEVAKKIEATKLEDVAKDIS-----------IDLADMTV---IIELDEEQLKERGITVDDvekaikKAK 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1162 PGDVAV--HGEAVVCVTPRENSkssmYYVLQFLKEDLPKVVVQGIPEVSRAVIhideQSGKEKYKLLVEGDNLRAVMATH 1239
Cdd:TIGR02389  170 LGKVIEidMDNNTITIKPGNPS----LKELRKLKEKIKNLHIKGIKGIKRVVI----RKEGDEYVIYTEGSNLKEVLKLE 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1240 GVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLA 1319
Cdd:TIGR02389  242 GVDKTRTTTNDIHEIAEVLGIEAARNAIIEEIKRTLEEQGLDVDIRHLMLVADLMTWDGEVRQIGRHGISGEKASVLARA 321
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 206729892  1320 SFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLLHK 1365
Cdd:TIGR02389  322 AFEVTVKHLLDAAIRGEVDELKGVIENIIVGQPIPLGTGDVDLVMD 367
RpoC COG0086
DNA-directed RNA polymerase, beta' subunit/160 kD subunit [Transcription]; DNA-directed RNA ...
345-1120 5.43e-53

DNA-directed RNA polymerase, beta' subunit/160 kD subunit [Transcription]; DNA-directed RNA polymerase, beta' subunit/160 kD subunit is part of the Pathway/BioSystem: RNA polymerase


Pssm-ID: 439856 [Multi-domain]  Cd Length: 1165  Bit Score: 203.85  E-value: 5.43e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  345 LKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvnkaninFL-RKLVQNGpevhpGANFIQQ 423
Cdd:COG0086   321 LKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLHQCGLPKKMALELFKP---------FIyRKLEERG-----LATTIKS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 rhtqMKRFLkygnrEKMAQELKygDIVERhLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:COG0086   387 ----AKKMV-----EREEPEVW--DILEE-VIKEHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCTAFNADFDGD 454
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKD--------TFFDRAKACQIIASILVGK 575
Cdd:COG0086   455 QMAVHVPLSLEAQLEARLLMLSTNNILSPANGKPIIVPSQDMVLGLYYLTRERegakgegmIFADPEEVLRAYENGAVDL 534
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  576 DEKIKVRLPPPTILKPVTLWT--GKQIFSVILrPSDdnpvranlrtkgkqycgkgedlcandsyVTIQNSElmsgsMDKG 653
Cdd:COG0086   535 HARIKVRITEDGEQVGKIVETtvGRYLVNEIL-PQE----------------------------VPFYNQV-----INKK 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  654 TLGsgsknNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGD-VTPgqgllKAKYELLNAGYKKCDEYIEALN 732
Cdd:COG0086   581 HIE-----VIIRQMYRRCGLKETVIFLDRLKKLGFKYATRAGISIGLDDmVVP-----KEKQEIFEEANKEVKEIEKQYA 650
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  733 TGKLqqqpgcTAEETLEALILkelsvIRDHAG----SACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAisgsr 808
Cdd:COG0086   651 EGLI------TEPERYNKVID-----GWTKASleteSFLMAAFSSQNTTYMMADSGARGSADQLRQLAGMRGLMA----- 714
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  809 VPDG--FENRslphfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLcsqy 886
Cdd:COG0086   715 KPSGniIETP-----------------IGSNFREGLGVLEYFISTHGARKGLADTALKTADSGYLTRRLVDVAQDV---- 773
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  887 dltvrsstgdIIQFIYGG--DGLD-PAAMEGKD--EPLEfKRVLDNIKA---VFPCPSEPALSKNELILTtesimkksef 958
Cdd:COG0086   774 ----------IVTEEDCGtdRGITvTAIKEGGEviEPLK-ERILGRVAAedvVDPGTGEVLVPAGTLIDE---------- 832
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  959 lccqdsflqEIKKFIKGVSEKIKKTRdkygindngtteprvlyqldriTPTQVEKFLETCRDKYMR--AQMEP---GSAV 1033
Cdd:COG0086   833 ---------EVAEIIEEAGIDSVKVR----------------------SVLTCETRGGVCAKCYGRdlARGHLvniGEAV 881
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1034 GALCAQSIGEPGTQMTLKTFHFAGVASmnitlgvPRIKEIINASKAISTPIITAQLDKDDDADYARLVKGRIEKTLLGEI 1113
Cdd:COG0086   882 GVIAAQSIGEPGTQLTMRTFHIGGAAS-------RAAEESSIEAKAGGIVRLNNLKVVVNEEGKGVVVSRNSELVIVDDG 954

                  ....*..
gi 206729892 1114 SEYIEEV 1120
Cdd:COG0086   955 GRREEEY 961
rpoC2 CHL00117
RNA polymerase beta'' subunit; Reviewed
841-1058 1.63e-08

RNA polymerase beta'' subunit; Reviewed


Pssm-ID: 214368 [Multi-domain]  Cd Length: 1364  Bit Score: 59.57  E-value: 1.63e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  841 GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDL------CsqydLTVRSSTgdiiqfiyggdgLDPAAMEG 914
Cdd:CHL00117  172 GLSLTEYIISCYGARKGVVDTAVRTADAGYLTRRLVEVVQHIvvretdC----GTTRGIS------------VSPRNGMM 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  915 KDEPLEFK---RVL-DNIKavfpcpsepalSKNELILTTEsimkkseflccQDSFLQEIKKFIKgvsekikktrdkygin 990
Cdd:CHL00117  236 IERILIQTligRVLaDDIY-----------IGSRCIATRN-----------QDIGIGLANRFIT---------------- 277
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  991 dngtteprvlYQLDRI---TPTqvekfleTCRDkyMRA--QM------------EPGSAVGALCAQSIGEPGTQMTLKTF 1053
Cdd:CHL00117  278 ----------FRAQPIsirSPL-------TCRS--TSWicQLcygwslahgdlvELGEAVGIIAGQSIGEPGTQLTLRTF 338

                  ....*
gi 206729892 1054 HFAGV 1058
Cdd:CHL00117  339 HTGGV 343
 
Name Accession Description Interval E-value
RNAP_III_RPC1_N cd02583
Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 ...
24-891 0e+00

Largest subunit (RPC1) of eukaryotic RNA polymerase III (RNAP III), N-terminal domain; Rpc1 (C160) subunit forms part of the active site region of RNAP III. RNAP III is one of the three distinct classes of nuclear RNAP in eukaryotes that is responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA genes, and some others. RNAP III is the largest nuclear RNA polymerase with 17 subunits. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shaped structure. The N-terminal domain of Rpb1, the largest subunit of RNAP II in yeast, forms part of the active site, making up the head and core of the one clamp, as well as the pore and funnel structures of RNAP II. The strong homology between Rpc1 and Rpb1 suggests a similar functional and structural role.


Pssm-ID: 259847 [Multi-domain]  Cd Length: 816  Bit Score: 1668.46  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   24 SPEEMRQQAHIQVVSKNLYSQDNQHaPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVIG 103
Cdd:cd02583     2 SPEDIIRLSEVEVTNRNLYDIETRK-PLPYGVLDPRLGTSDKDGICETCGLNLADCVGHFGYIKLELPVFHIGYFKAIIN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  104 ILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGafngtvkkcgllkiihekyktnk 183
Cdd:cd02583    81 ILQCICKTCSRVLLPEEEKRKFLKRLRRPNLDNLQKKALKKKILEKCKKVRKCPHCG----------------------- 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  184 kvvdpivsnflqsfetaiehnkevepLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPPLCIR 263
Cdd:cd02583   138 --------------------------LLKKAQEDLNPLKVLNLFKNIPPEDVELLLMNPLAGRPENLILTRIPVPPLCIR 191
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  264 PSVVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFVQ 343
Cdd:cd02583   192 PSVVMDEKSGTNEDDLTVKLSEIIFLNDVIKKHLEKGAKTQKIMEDWDFLQLQCALYINSELPGLPLSMQPKKPIRGFCQ 271
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  344 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQQ 423
Cdd:cd02583   272 RLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDQVGVPEHVAKILTYPERVTRYNIEKLRKLVLNGPDVHPGANFVIK 351
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 RHTQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:cd02583   352 RDGGKKKFLKYGNRRKIARELKIGDIVERHLEDGDIVLFNRQPSLHRLSIMAHRAKVMPWRTFRFNECVCTPYNADFDGD 431
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILvgkDEKIKVRL 583
Cdd:cd02583   432 EMNLHVPQTEEARAEALELMGVKNNLVTPRNGEPLIAATQDFLTASYLLTSKDVFFDRAQFCQLCSYML---DGEIKIDL 508
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  584 PPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCGKGEDLCANDSYVTIQNSELMSGSMDKGTLGSGSKNNI 663
Cdd:cd02583   509 PPPAILKPVELWTGKQIFSLLLRPNKKSPVLVNLEAKEKSYTKKSPDMCPNDGYVVIRNSELLCGRLDKSTLGSGSKNSL 588
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  664 FYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGCT 743
Cdd:cd02583   589 FYVLLRDYGPEAAAAAMNRLAKLSSRWLSNRGFSIGIDDVTPSKELLKKKEELVDNGYAKCDEYIKQYKKGKLELQPGCT 668
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  744 AEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEK 823
Cdd:cd02583   669 AEQTLEAKISGELSKIREDAGKACLKELHKSNSPLIMALCGSKGSNINISQMIACVGQQIISGKRIPNGFEDRTLPHFPR 748
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  824 HSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVR 891
Cdd:cd02583   749 NSKTPAAKGFVANSFYSGLTPTEFFFHTMSGREGLVDTAVKTAETGYMQRRLMKALEDLSVQYDGTVR 816
RNAP_archeal_A' cd02582
A' subunit of archaeal RNA polymerase (RNAP); A' is the largest subunit of the archaeal RNA ...
12-910 0e+00

A' subunit of archaeal RNA polymerase (RNAP); A' is the largest subunit of the archaeal RNA polymerase (RNAP). Archaeal RNAP is closely related to RNA polymerases in eukaryotes based on the subunit compositions. Archaeal RNAP is a large multi-protein complex, made up of 11 to 13 subunits, depending on the species, that are responsible for the synthesis of RNA. Structure studies suggest that RNAP complexes from different organisms share a crab-claw-shaped structure. The largest eukaryotic RNAP subunit is encoded by two separate archaeal subunits (A' and A'') which correspond to the N- and C-terminal domains of eukaryotic RNAP II Rpb1, respectively. The N-terminal domain of Rpb1 forms part of the active site and includes the head and the core of one clamp as well as the pore and funnel structures of RNAP II. Based on a structural comparison among the archaeal, bacterial and eukaryotic RNAPs the DNA binding channel and the active site are part of A' subunit which is conserved. The strong similarity between subunit A' and the N-terminal domain of Rpb1 suggests a similar functional and structural role for these two proteins.


Pssm-ID: 259846 [Multi-domain]  Cd Length: 861  Bit Score: 969.79  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   12 AKKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNqhAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELP 91
Cdd:cd02582     1 PKRIKGIKFGLLSPEEIRKMSVVEIITPDTYDEDG--YPIEGGLMDPRLGVIEPGLRCKTCGNTAGECPGHFGHIELARP 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   92 CFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKR-PGLTYLQKRGLKKKISDKCRKKNICHHCGAfngtvkkcG 170
Cdd:cd02582    79 VIHVGFAKHIYDLLRATCRSCGRILLPEEEIEKYLERIRRlKEKWPELVKRVIEKVKKKAKKRKVCPHCGA--------P 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  171 LLKIIHEKyktnkkvvdpiVSNFLQSFEtaIEHNKevepllgraqenLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDL 250
Cdd:cd02582   151 QYKIKLEK-----------PTTFYEEKE--EGEVK------------LTPSEIRERLEKIPDEDLELLGIDPKTARPEWM 205
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  251 ILTRLLVPPLCIRPSVVsdLKSG-TNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIP 329
Cdd:cd02582   206 VLTVLPVPPVTVRPSIT--LETGeRSEDDLTHKLVDIIRINQRLKENIEAGAPQLIIEDLWDLLQYHVTTYFDNEIPGIP 283
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  330 lnMAPKKWTR---GFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRK 406
Cdd:cd02582   284 --PARHRSGRplkTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPNLSINEVGVPEDIAKELTVPERVTEWNIEKMRK 361
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  407 LVQNGPEVHPGANFIQQRHTQMKRfLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTF 486
Cdd:cd02582   362 LVLNGPDKWPGANYVIRPDGRRIR-LRYVNREELAERLEPGWIVERHLIDGDIVLFNRQPSLHRMSIMAHRVRVLPGKTF 440
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  487 RFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQ 566
Cdd:cd02582   441 RLNLAVCPPYNADFDGDEMNLHVPQSEEARAEARELMLVQEHILSPRYGGPIIGGIQDYISGAYLLTRKTTLFTKEEALQ 520
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  567 IIASIlvgkdeKIKVRLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRanlRTKGKQYCGKGED-LCANDSYVTIQNSEL 645
Cdd:cd02582   521 LLSAA------GYDGLLPEPAILEPKPLWTGKQLFSLFL-PKDLNFEG---KAKVCSGCSECKDeDCPNDGYVVIKNGKL 590
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  646 MSGSMDKGTLGSGSKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCD 725
Cdd:cd02582   591 LEGVIDKKAIGAEQPGSLLHRIAKEYGNEVARRFLDSVTRLAIRFIELRGFTIGIDDEDIPEEARKEIEEIIKEAEKKVY 670
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  726 EYIEALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAIS 805
Cdd:cd02582   671 ELIEQYKNGELEPLPGRTLEETLEMKIMQVLGKARDEAGKVASKYLDPFNNAVIMARTGARGSMLNLTQMAACLGQQSVR 750
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  806 GSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQ 885
Cdd:cd02582   751 GERINRGYRNRTLPHFKPGDLGPEARGFVRSSFRDGLSPTEFFFHAMGGREGLVDTAVRTSQSGYMQRRLINALQDLYVE 830
                         890       900
                  ....*....|....*....|....*
gi 206729892  886 YDLTVRSSTGDIIQFIYGGDGLDPA 910
Cdd:cd02582   831 YDGTVRDSRGNIIQFKYGEDGVDPA 855
PRK08566 PRK08566
DNA-directed RNA polymerase subunit A'; Validated
7-930 0e+00

DNA-directed RNA polymerase subunit A'; Validated


Pssm-ID: 236292 [Multi-domain]  Cd Length: 882  Bit Score: 955.84  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    7 RETDVAKKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNqhAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYI 86
Cdd:PRK08566    1 SMMMIPKRIGSIKFGLLSPEEIRKMSVTKIITADTYDDDG--YPIDGGLMDPRLGVIDPGLRCKTCGGRAGECPGHFGHI 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   87 DLELPCFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAfngtv 166
Cdd:PRK08566   79 ELARPVIHVGFAKLIYKLLRATCRECGRLKLTEEEIEEYLEKLERLKEWGSLADDLIKEVKKEAAKRMVCPHCGE----- 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  167 KKcglLKIIHEKyktnkkvvdPIvsnflqsfeTAIEHNKEVEpllgraqENLNPLVVLNLFKRIPAEDVPLLLMNPEAGK 246
Cdd:PRK08566  154 KQ---YKIKFEK---------PT---------TFYEERKEGL-------VKLTPSDIRERLEKIPDEDLELLGINPEVAR 205
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  247 PSDLILTRLLVPPLCIRPSVVsdLKSG-TNEDDLTMKLTEIIFLNDVIKKHRISGAKtQMIMED-WDFLQLQCALYINSE 324
Cdd:PRK08566  206 PEWMVLTVLPVPPVTVRPSIT--LETGqRSEDDLTHKLVDIIRINQRLKENIEAGAP-QLIIEDlWELLQYHVTTYFDNE 282
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  325 LSGIPlnmaP-----KKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKA 399
Cdd:PRK08566  283 IPGIP----ParhrsGRPLKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPNLSINEVGVPEAIAKELTVPERVTEW 358
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  400 NINFLRKLVQNGPEVHPGANFIqqRHTQMKRF-LKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLA 478
Cdd:PRK08566  359 NIEELREYVLNGPEKHPGANYV--IRPDGRRIkLTDKNKEELAEKLEPGWIVERHLIDGDIVLFNRQPSLHRMSIMAHRV 436
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  479 RVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTF 558
Cdd:PRK08566  437 RVLPGKTFRLNLAVCPPYNADFDGDEMNLHVPQTEEARAEARILMLVQEHILSPRYGGPIIGGIQDHISGAYLLTRKSTL 516
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  559 FDRAKACQIIASILVGKDEkikvrLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRanlRTKGKQYCGKGED-LCANDSY 637
Cdd:PRK08566  517 FTKEEALDLLRAAGIDELP-----EPEPAIENGKPYWTGKQIFSLFL-PKDLNLEF---KAKICSGCDECKKeDCEHDAY 587
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  638 VTIQNSELMSGSMDKGTLGSGsKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVT-PGQGLLKAKyEL 716
Cdd:PRK08566  588 VVIKNGKLLEGVIDKKAIGAE-QGSILDRIVKEYGPERARRFLDSVTRLAIRFIMLRGFTTGIDDEDiPEEAKEEID-EI 665
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  717 LNAGYKKCDEYIEALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMI 796
Cdd:PRK08566  666 IEEAEKRVEELIEAYENGELEPLPGRTLEETLEMKIMQVLGKARDEAGEIAEKYLGLDNPAVIMARTGARGSMLNLTQMA 745
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  797 ACVGQQAISGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLV 876
Cdd:PRK08566  746 ACVGQQSVRGERIRRGYRDRTLPHFKPGDLGAEARGFVRSSYKSGLTPTEFFFHAMGGREGLVDTAVRTSQSGYMQRRLI 825
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|....
gi 206729892  877 KSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGkDEPLEFKRVLDNIKA 930
Cdd:PRK08566  826 NALQDLKVEYDGTVRDTRGNIVQFKYGEDGVDPMKSDH-GKPVDVDRIIERVLG 878
RNA_pol_rpoA1 TIGR02390
DNA-directed RNA polymerase subunit A'; This family consists of the archaeal A' subunit of the ...
13-925 0e+00

DNA-directed RNA polymerase subunit A'; This family consists of the archaeal A' subunit of the DNA-directed RNA polymerase. The example from Methanocaldococcus jannaschii contains an intein.


Pssm-ID: 274106 [Multi-domain]  Cd Length: 868  Bit Score: 884.44  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    13 KKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNqhAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPC 92
Cdd:TIGR02390    2 KKIGSIKFGLLSPEEIRKMSVVEVVTADTYDDDG--YPIEGGLMDPRLGVIEPGLRCKTCGGKVGECPGHFGHIELARPV 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    93 FHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLD-YLKRPGLTYLQKRGLKKKISDKCRKKNICHHCGAfngtvkkcGL 171
Cdd:TIGR02390   80 VHVGFAKEIYKILRATCRKCGRITLTEEEIEQYLEkINKLKEEGGDLASTLIEKIVKEAAKRMKCPHCGE--------EQ 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   172 LKIIHEKyktnkkvvdPivSNFLQsfetaiehnkevepLLGRAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLI 251
Cdd:TIGR02390  152 KKIKFEK---------P--TYFYE--------------EGKEGDVKLTPSEIRERLEKIPDEDAELLGINPKVARPEWMV 206
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   252 LTRLLVPPLCIRPSVVsdLKSGT-NEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIP- 329
Cdd:TIGR02390  207 LTVLPVPPVTVRPSIT--LETGErSEDDLTHKLVDIIRINQRLKENIEAGAPQLIIEDLWELLQYHVATYFDNELPGIPp 284
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   330 LNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQ 409
Cdd:TIGR02390  285 ARHRSGRPLKTLAQRLKGKEGRFRGNLSGKRVNFSARTVISPDPNISINEVGVPEQIAKELTVPERVTPWNIDELREYVL 364
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   410 NGPEVHPGANFIQQrhTQMKRF-LKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRF 488
Cdd:TIGR02390  365 NGPDSWPGANYVIR--PDGRRIkIRDENKEELAERLEPGWVVERHLIDGDIVLFNRQPSLHRMSMMGHKVKVLPGKTFRL 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   489 NECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQII 568
Cdd:TIGR02390  443 NLAVCPPYNADFDGDEMNLHVPQTEEARAEARELMLVEEHILTPRYGGPIIGGIHDYISGAYLLTHKSTLFTKEEVQTIL 522
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   569 ASIlvgkdeKIKVRLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRANLRTKGKQYCGKGEdlCANDSYVTIQNSELMSG 648
Cdd:TIGR02390  523 GVA------GYFGDPPEPAIEKPKEYWTGKQIFSAFL-PEDLNFEGRAKICSGSDACKKEE--CPHDAYVVIKNGKLLKG 593
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   649 SMDKGTLGSgSKNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYI 728
Cdd:TIGR02390  594 VIDKKAIGA-EKGKILHRIVREYGPEAARRFLDSVTRLFIRFITLRGFTTGIDDIDIPKEAKEEIEELIEKAEKRVDNLI 672
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   729 EALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSR 808
Cdd:TIGR02390  673 ERYRNGELEPLPGRTVEETLEMKIMEVLGKARDEAGEVAEKYLDPENHAVIMARTGARGSLLNITQMAAMVGQQSVRGGR 752
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   809 VPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDL 888
Cdd:TIGR02390  753 IRRGYRNRTLPHFKKGDIGAKARGFVRSSFKKGLDPTEYFFHAAGGREGLVDTAVRTSQSGYMQRRLINALQDLYVEYDG 832
                          890       900       910
                   ....*....|....*....|....*....|....*...
gi 206729892   889 TVRSSTGDIIQFIYGGDGLDPAAME-GKdePLEFKRVL 925
Cdd:TIGR02390  833 TVRDTRGNLIQFKYGEDGVDPMKSDhGK--PVDVKKIF 868
PRK14977 PRK14977
bifunctional DNA-directed RNA polymerase A'/A'' subunit; Provisional
13-1363 0e+00

bifunctional DNA-directed RNA polymerase A'/A'' subunit; Provisional


Pssm-ID: 184940 [Multi-domain]  Cd Length: 1321  Bit Score: 880.51  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   13 KKISHICFGMKSPEEMRQQAHIQVVSKNLYSQDNQhaPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPC 92
Cdd:PRK14977    7 KAIDGIIFGLISPADARKIGFAEITAPEAYDEDGL--PVQGGLLDGRLGTIEPGQKCLTCGNLAANCPGHFGHIELAEPV 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   93 FHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFlDYLKRPGLTYL---QKR---GLKKKISDKC----RKKNICHHCGAF 162
Cdd:PRK14977   85 IHIAFIDNIKDLLNSTCHKCAKLKLPQEDLNVF-KLIEEAHAAARdipEKRiddEIIEEVRDQVkvyaKKAKECPHCGAP 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  163 NGTVkkcgllkIIHEKYKTNKKVvdpivsnflqsfetaiehnkEVEPllgraqENLNPLVVLNLFKRIPAEDVPLLLMNP 242
Cdd:PRK14977  164 QHEL-------EFEEPTIFIEKT--------------------EIEE------HRLLPIEIRDIFEKIIDDDLELIGFDP 210
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  243 EAGKPSDLILTRLLVPPLCIRPSVVsdLKSGT-NEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYI 321
Cdd:PRK14977  211 KKARPEWAVLQAFLVPPLTARPSII--LETGErSEDDLTHILVDIIKANQKLKESKDAGAPPLIVEDEVDHLQYHTSTFF 288
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  322 NSELSGIPLNM--APKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKA 399
Cdd:PRK14977  289 DNATAGIPQAHhkGSGRPLKSLFQRLKGKEGRFRGNLIGKRVDFSARTVISPDPMIDIDEVGVPEAIAMKLTIPEIVNEN 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  400 NINFLRKLVQNGPEVHPGANFIQQ------RHTQMKRFLKYGNREkMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSI 473
Cdd:PRK14977  369 NIEKMKELVINGPDEFPGANAIRKgdgtkiRLDFLEDKGKDALRE-AAEQLEIGDIVERHLADGDIVIFNRQPSLHKLSI 447
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  474 MAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLT 553
Cdd:PRK14977  448 LAHRVKVLPGATFRLHPAVCPPYNADFDGDEMNLHVPQIEDARAEAIELMGVKDNLISPRTGGPIIGALQDFITAAYLIT 527
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  554 LKDTFFDRAKACQIIAsiLVGkdekIKVRLPPPTI-LKPVTLWTGKQIFSVILrPSDDNPVRANLRTKGKQycGKGED-L 631
Cdd:PRK14977  528 KDDALFDKNEASNIAM--LAG----ITDPLPEPAIkTKDGPAWTGKQLFSLFL-PKDFNFEGIAKWSAGKA--GEAKDpS 598
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  632 CANDSYVTIQNSELMSGSMDKGTLGSGSKN--NIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQgl 709
Cdd:PRK14977  599 CLGDGYVLIKEGELISGVIDDNIIGALVEEpeSLIDRIAKDYGEAVAIEFLNKILIIAKKEILHYGFSNGPGDLIIPD-- 676
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  710 lKAKYELLNAGYKKCDEYIEALNT-----------GKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPL 778
Cdd:PRK14977  677 -EAKQEIEDDIQGMKDEVSDLIDQrkitrkitiykGKEELLRGMKEEEALEADIVNELDKARDKAGSSANDCIDADNAGK 755
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  779 TMALCGSKGSFINISQMIACVGQQAI--------SGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFH 850
Cdd:PRK14977  756 IMAKTGARGSMANLAQIAGALGQQKRktrigfvlTGGRLHEGYKDRALSHFQEGDDNPDAHGFVKNNYREGLNAAEFFFH 835
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  851 TMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEgKDEPLEFKRVLDNIKA 930
Cdd:PRK14977  836 AMGGREGLIDKARRTEDSGYFQRRLANALEDIRLEYDETVRDPHGHIIQFKFGEDGIDPQKLD-HGEAFNLERIIEKQKI 914
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  931 VfpcPSEPALSKNELilttESIMKKSEflccqdsfLQEIKKFIKGVSEKIKKTRDKygindngtteprvlyqldritPTQ 1010
Cdd:PRK14977  915 E---DRGKGASKDEI----EELAKEYT--------KTFNANLPKLLADAIHGAELK---------------------EDE 958
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1011 VEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLD 1090
Cdd:PRK14977  959 LEAICAEGKEGFEKAKVEPGQAIGIISAQSIAEPGTQMTLRTFHAAGIKAMDVTHGLERFIELVDARAKPSTPTMDIYLD 1038
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1091 KDDDADYARLVKgrIEKTLLG-EISEYIEEVFLPDDCFILVKLSLERIR---LLRLEVNAETVRysiCTSKlrVKPGDVA 1166
Cdd:PRK14977 1039 DECKEDIEKAIE--IARNLKElKVRALIADSAIDNANEIKLIKPDKRALengCIPMERFAEIEA---ALAK--GKKFEME 1111
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1167 VHGEAVVCVTPRENSKSSMYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKEKYKLLVEGDNLRAVMATHGVKGTRT 1246
Cdd:PRK14977 1112 LEDDLIILDLVEAADRDKPLATLIAIRNKILDKPVKGVPDIERAWVELVEKDGRDEWIIQTSGSNLAAVLEMKCIDIANT 1191
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1247 TSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGI------TRFGLAKMKESVLMLAS 1320
Cdd:PRK14977 1192 ITNDCFEIAGTLGIEAARNAIFNELASILEDQGLEVDNRYIMLVADIMCSRGTIEAIglqaagVRHGFAGEKDSPLAKAA 1271
                        1370      1380      1390      1400
                  ....*....|....*....|....*....|....*....|...
gi 206729892 1321 FEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLL 1363
Cdd:PRK14977 1272 FEITTHTIAHAALGGEIEKIKGILDALIMGQNIPIGSGKVDLL 1314
RNAP_II_RPB1_N cd02733
Largest subunit (Rpb1) of eukaryotic RNA polymerase II (RNAP II), N-terminal domain; The two ...
20-887 0e+00

Largest subunit (Rpb1) of eukaryotic RNA polymerase II (RNAP II), N-terminal domain; The two largest subunits of RNA polymerase II (RNAP II), Rpb1 and Rpb2, form the active site, DNA entry channel and RNA exit channel. RNAP II is a large multi-subunit complex responsible for the synthesis of mRNA in eukaryotes. RNAP II consists of a 10-subunit core enzyme and a peripheral heterodimer of two subunits. Structure studies suggest that RNAP complexes from different organisms share a crab-claw-shape structure. In yeast, Rpb1 and Rpb2, each makes up one clamp, one jaw, and part of the cleft. Rpb1_N contains part of the active site, forms the head and core of the one clamp, and makes up the pore and funnel regions of RNAP II.


Pssm-ID: 259848 [Multi-domain]  Cd Length: 751  Bit Score: 852.60  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   20 FGMKSPEEMRQQAHIQVVSKnlYSQDNQHAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFR 99
Cdd:cd02733     5 FGILSPDEIRAMSVAEIEHP--ETYENGGGPKLGGLNDPRMGTIDRNSRCQTCGGDMKECPGHFGHIELAKPVFHIGFLT 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  100 AVIGILQMICKtcchIMLSQEEkkqfldylkrpgltylqkrglkkkisdkcrkknichhcgafngtvkkcgllkiiheky 179
Cdd:cd02733    83 KILKILRCVCK----RELSAER---------------------------------------------------------- 100
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  180 ktnkkvvdpivsnflqsfetaiehnkevepllgraqenlnplvVLNLFKRIPAEDVPLLLMNPEAGKPSDLILTRLLVPP 259
Cdd:cd02733   101 -------------------------------------------VLEIFKRISDEDCRILGFDPKFSRPDWMILTVLPVPP 137
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  260 LCIRPSVVSDLkSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPlnMAPKKWTR 339
Cdd:cd02733   138 PAVRPSVVMDG-SARSEDDLTHKLADIIKANNQLKRQEQNGAPAHIIEEDEQLLQFHVATYMDNEIPGLP--QATQKSGR 214
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  340 ---GFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHP 416
Cdd:cd02733   215 plkSIRQRLKGKEGRIRGNLMGKRVDFSARTVITPDPNLELDQVGVPRSIAMNLTFPEIVTPFNIDRLQELVRNGPNEYP 294
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  417 GANFIQqRHTQMKRFLKYGNReKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPY 496
Cdd:cd02733   295 GAKYII-RDDGERIDLRYLKK-ASDLHLQYGYIVERHLQDGDVVLFNRQPSLHKMSMMGHRVKVLPYSTFRLNLSVTTPY 372
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  497 NADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASIlvgkd 576
Cdd:cd02733   373 NADFDGDEMNLHVPQSLETRAELKELMMVPRQIVSPQSNKPVMGIVQDTLLGVRKLTKRDTFLEKDQVMNLLMWL----- 447
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  577 EKIKVRLPPPTILKPVTLWTGKQIFSVILrPSDDNPVRANLRTKGKQYcgkgeDLCANDSYVTIQNSELMSGSMDKGTLG 656
Cdd:cd02733   448 PDWDGKIPQPAILKPKPLWTGKQIFSLII-PKINNLIRSSSHHDGDKK-----WISPGDTKVIIENGELLSGILCKKTVG 521
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  657 SGSkNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKL 736
Cdd:cd02733   522 ASS-GGLIHVIWLEYGPEAARDFIGNIQRVVNNWLLHNGFSIGIGDTIADKETMKKIQETIKKAKRDVIKLIEKAQNGEL 600
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  737 QQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENR 816
Cdd:cd02733   601 EPQPGKTLRESFENKVNRILNKARDKAGKSAQKSLSEDNNFKAMVTAGSKGSFINISQIIACVGQQNVEGKRIPFGFRRR 680
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 206729892  817 SLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYD 887
Cdd:cd02733   681 TLPHFIKDDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTAETGYIQRRLVKAMEDVMVKYD 751
RNAP_largest_subunit_N cd00399
Largest subunit of RNA polymerase (RNAP), N-terminal domain; This region represents the ...
23-887 0e+00

Largest subunit of RNA polymerase (RNAP), N-terminal domain; This region represents the N-terminal domain of the largest subunit of RNA polymerase (RNAP). RNAP is a large multi-protein complex responsible for the synthesis of RNA. It is the principle enzyme of the transcription process, and is a final target in many regulatory pathways that control gene expression in all living cells. At least three distinct RNAP complexes are found in eukaryotic nuclei; RNAP I transcribes the ribosomal RNA precursor, RNAP II the mRNA precursor, and RNAP III the 5S and tRNA genes. A single distinct RNAP complex is found in prokaryotes and archaea, respectively, which may be responsible for the synthesis of all RNAs. Structure studies reveal that prokaryotic and eukaryotic RNAPs share a conserved crab-claw-shaped structure. The largest and the second largest subunits each make up one clamp, one jaw, and part of the cleft. All RNAPs are metalloenzymes. At least one Mg2+ ion is bound in the catalytic center. In addition, all cellular RNAPs contain several tightly bound zinc ions to different subunits that vary between RNAPs from prokaryotic to eukaryotic lineages. This domain represents the N-terminal region of the largest subunit of RNAP, and includes part of the active site. In archaea and some of the photosynthetic organisms or cellular organelle, however, this domain exists as a separate subunit.


Pssm-ID: 259843 [Multi-domain]  Cd Length: 528  Bit Score: 709.59  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   23 KSPEEMRQQAHIQVVSKNLYSQDNQHAPLlYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFRAVI 102
Cdd:cd00399     1 MSPEEIRKWSVAKVIKPETIDNRTLKAER-GGKYDPRLGSIDRCEKCGTCGTGLNDCPGHFGHIELAKPVFHVGFIKKVP 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  103 GILQmicktcchimlsqeekkqfldylkrpgltylqkrglkkkisdkcrkknichhcgafngtvkkcgllkiihekyktn 182
Cdd:cd00399    80 SFLG---------------------------------------------------------------------------- 83
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  183 kkvvdpivsnflqsfetaiehnkevepllgraqenlnplvvlnlfkripaedvplllmnpeagkPSDLILTRLLVPPLCI 262
Cdd:cd00399    84 ----------------------------------------------------------------PEWMILTCLPVPPPCL 99
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  263 RPSVvsdlksgtneddltmklteiiflndvikkhrisgaktqMIMEDWDFLQLQCALYINSELSGIPLNMAPKKWTRGFV 342
Cdd:cd00399   100 RPSV--------------------------------------IIEERWRLLQEHVDTYLDNGIAGQPQTQKSGRPLRSLA 141
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  343 QRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILtfpekvnkaninflrklvqngpevhpganfiq 422
Cdd:cd00399   142 QRLKGKEGRFRGNLMGKRVDFSGRSVISPDPNLRLDQVGVPKSIALTL-------------------------------- 189
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  423 qrhtqmkrflkygnrekmaqelkygdiverhliDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDG 502
Cdd:cd00399   190 ---------------------------------DGDPVLFNRQPSLHKLSIMAHRVRVLPGSTFRLNPLVCSPYNADFDG 236
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  503 DEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLkdtffdrakacqiiasilvgkdekikvr 582
Cdd:cd00399   237 DEMNLHVPQSEEARAEARELMLVPNNILSPQNGEPLIGLSQDTLLGAYLLTL---------------------------- 288
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  583 lppptilkpvtlwtGKQIFSVILrpsddnpvranlrtkgkqycgkgedlcandsyvtiqnselmsgsmdkgtlgsgsKNN 662
Cdd:cd00399   289 --------------GKQIVSAAL------------------------------------------------------PGG 300
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  663 IFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTPGQGLLKAKYELLNAGYKKCDEYIEALNTGKLQQQPGC 742
Cdd:cd00399   301 LLHTVTRELGPEKAAKLLSNLQRVGFVFLTTSGFSVGIGDVIDDGVIPEEKTELIEEAKKKVDEVEEAFQAGLLTAQEGM 380
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  743 TAEETLEALILKELSVIRDHAGSACLRELD---KSNSPLTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLP 819
Cdd:cd00399   381 TLEESLEDNILDFLNEARDKAGSAASVNLDlvsKFNSIYVMAMSGAKGSFINIRQMSACVGQQSVEGKRIPRGFSDRTLP 460
                         810       820       830       840       850       860
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  820 HFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYD 887
Cdd:cd00399   461 HFSKDDYSPEAKGFIRNSFLEGLTPLEYFFHAMGGREGLVDTAVKTAESGYLQRRLVKALEDLVVHYD 528
RNAP_I_RPA1_N cd01435
Largest subunit (RPA1) of eukaryotic RNA polymerase I (RNAP I), N-terminal domain; RPA1 is the ...
20-887 0e+00

Largest subunit (RPA1) of eukaryotic RNA polymerase I (RNAP I), N-terminal domain; RPA1 is the largest subunit of the eukaryotic RNA polymerase I (RNAP I). RNAP I is a multi-subunit protein complex responsible for the synthesis of rRNA precursors. RNAP I consists of at least 14 different subunits, the largest being homologous to subunit Rpb1 of yeast RNAP II and subunit beta' of bacterial RNAP. The yeast member of this family is known as Rpb190. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shaped structure. The N-terminal domain of Rpb1, the largest subunit of RNAP II in yeast, forms part of the active site. It makes up the head and core of one clamp, as well as the pore and funnel structures of RNAP II. The strong homology between RPA1 and Rpb1 suggests a similar functional and structural role.


Pssm-ID: 259844 [Multi-domain]  Cd Length: 779  Bit Score: 574.13  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   20 FGMKSPEEMRQQAHIQVVSKNLYsqDNQHAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELPCFHVGYFR 99
Cdd:cd01435     2 FSFYSAEEIRKLSVKEITNPVTF--DSLGHPVPGGLYDPALGPLDKDDICSTCGLNYLNCPGHFGHIELPLPVYNPLFFD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  100 AVIGILQMICKTCCHIMLSQEEKKQFldylkrpgltylqkrglkkkisdkcrkknichhcgafngtVKKCGLLkiiheky 179
Cdd:cd01435    80 LLYKLLRGSCFYCHRFRISKWEVKLF----------------------------------------VAKLKLL------- 112
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  180 ktnkkvvdpivsnflqsfetaiEHNKEVEpllgrAQENLNPLvvlNLFkripaedvplllmnpeagkpsdlILTRLLVPP 259
Cdd:cd01435   113 ----------------------DKGLLVE-----AAELDFGY---DMF-----------------------FLDVLLVPP 139
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  260 LCIRPsvVSDLKSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWD-------------FLQLQCA--LYINSE 324
Cdd:cd01435   140 NRFRP--PSFLGDKVFENPQNVLLSKILKDNQQIRDLLASMRQAESQSKLDLisgktnseklinaWLQLQSAvnELFDST 217
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  325 LSGIPLNMAPKkwtrGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFL 404
Cdd:cd01435   218 KAPKSGKKSPP----GIKQLLEKKEGLFRMNMMGKRVNYAARSVISPDPFIETNEIGIPLVFAKKLTFPEPVTPFNVEEL 293
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  405 RKLVQNGPEVHPGANFIQQR-----------HTQMKRFLKYGNREKMAQELKYG-DIVERHLIDGDVVLFNRQPSLHKLS 472
Cdd:cd01435   294 RQAVINGPDVYPGANAIEDEdgrlillsalsEERRKALAKLLLLLSSAKLLLNGpKKVYRHLLDGDVVLLNRQPTLHKPS 373
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  473 IMAHLARV-KPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYL 551
Cdd:cd01435   374 IMAHKVRVlPGEKTLRLHYANCKSYNADFDGDEMNLHFPQSELARAEAYYIASTDNQYLVPTDGKPLRGLIQDHVVSGVL 453
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  552 LTLKDTFFDRAKACQIIASILVGK---DEKIKVRLPPPTILKPVTLWTGKQIFSVILRPSDDNPVRANLRTKGKQYCGKG 628
Cdd:cd01435   454 LTSRDTFFTREEYQQLVYAALRPLftsDKDGRIKLLPPAILKPKPLWTGKQVISTILKNLIPGNAPLLNLSGKKKTKKKV 533
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  629 EDLC----ANDSYVTIQNSELMSGSMDKGTLGSgSKNNI---FYILlrdWGQNEAADAMSRLARLAPVYLSNRGFSIGIG 701
Cdd:cd01435   534 GGGKwgggSEESQVIIRNGELLTGVLDKSQFGA-SAYGLvhaVYEL---YGGETAGKLLSALGRLFTAYLQMRGFTCGIE 609
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  702 DVtpgqgLLKAKYELlnagyKKCDEYIEALNTGKlqqqpgCTAEETLEALILKELSVIRDhagsACLRE--LDK--SNSP 777
Cdd:cd01435   610 DL-----LLTPKADE-----KRRKILRKAKKLGL------EAAAEFLGLKLNKVTSSIIK----ACLPKglLKPfpENNL 669
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  778 LTMALCGSKGSFINISQMIACVGQQAISGSRVPDGFENRSLPHFEKHSKLPAAKGFVANSFYSGLTPTEFFFHTMAGREG 857
Cdd:cd01435   670 QLMVQSGAKGSMVNASQISCLLGQQELEGRRVPLMVSGKTLPSFPPYDTSPRAGGFITDRFLTGIRPQEYFFHCMAGREG 749
                         890       900       910
                  ....*....|....*....|....*....|
gi 206729892  858 LVDTAVKTAETGYMQRRLVKSLEDLCSQYD 887
Cdd:cd01435   750 LIDTAVKTSRSGYLQRCLIKHLEGLKVNYD 779
RNAP_III_Rpc1_C cd02736
Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; ...
1021-1360 0e+00

Largest subunit (Rpc1) of Eukaryotic RNA polymerase III (RNAP III), C-terminal domain; Eukaryotic RNA polymerase III (RNAP III) is a large multi-subunit complex responsible for the synthesis of tRNAs, 5SrRNA, Alu-RNA, U6 snRNA, among others. Rpc1 is also known as C160 in yeast. Structure studies suggest that different RNA polymerase complexes share a similar crab-claw-shape structure. The C-terminal domain of Rpb1, the largest subunit of RNAP II, makes up part of the foot and jaw structures of RNAP II. The similarity between this domain and the C-terminal domain of Rpb1, its counterpart in RNAP II, suggests a similar functional and structural role.


Pssm-ID: 132723 [Multi-domain]  Cd Length: 300  Bit Score: 565.31  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1021 KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKDDDADYARL 1100
Cdd:cd02736     1 KYMRAKVEPGTAVGAIAAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKNISTPIITAKLENDRDEKSARI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1101 VKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLevnaetvrysictsklrvkpgdvavhgeavvcvtpren 1180
Cdd:cd02736    81 VKGRIEKTYLGEVASYIEEVYSPDDCYILIKLDKKIIEKLQL-------------------------------------- 122
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1181 SKSSMYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKekYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGI 1260
Cdd:cd02736   123 SKSNLYFLLQSLKRKLPDVVVSGIPEVKRAVINKDKKKGK--YKLLVEGYGLRAVMNTPGVIGTRTTSNHIMEVEKVLGI 200
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1261 EAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSV 1340
Cdd:cd02736   201 EAARSTIINEIQYTMKSHGMSIDPRHIMLLADLMTFKGEVLGITRFGIAKMKESVLMLASFEKTTDHLFNAALHGRKDSI 280
                         330       340
                  ....*....|....*....|
gi 206729892 1341 CGVSECIIMGIPMNIGTGLF 1360
Cdd:cd02736   281 EGVSECIIMGKPMPIGTGLF 300
RPOLA_N smart00663
RNA polymerase I subunit A N-terminus;
248-550 8.40e-150

RNA polymerase I subunit A N-terminus;


Pssm-ID: 214767 [Multi-domain]  Cd Length: 295  Bit Score: 455.44  E-value: 8.40e-150
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    248 SDLILTRLLVPPLCIRPSVVSDLkSGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSElSG 327
Cdd:smart00663    1 EWMILTVLPVPPPCLRPSVQLDG-GRFAEDDLTHLLRDIIKRNNRLKRLLELGAPSIIIRNEKRLLQEAVDTLIDNE-GL 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    328 IPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKL 407
Cdd:smart00663   79 PRANQKSGRPLKSLSQRLKGKEGRFRQNLLGKRVDFSARSVITPDPNLKLNEVGVPKEIALELTFPEIVTPLNIDKLRKL 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    408 VQNGPevhPGANFIQQrhtQMKRFLKYGNREKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFR 487
Cdd:smart00663  159 VRNGP---NGAKYIIR---GKKTNLKLAKKSKIANHLKIGDIVERHVIDGDVVLFNRQPTLHRMSIQAHRVRVLEGKTIR 232
                           250       260       270       280       290       300
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 206729892    488 FNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAY 550
Cdd:smart00663  233 LNPLVCSPYNADFDGDEMNLHVPQSLEARAEARELMLVPNNILSPKNGKPIIGPIQDMLLGLY 295
RNA_pol_Rpb1_1 pfam04997
RNA polymerase Rpb1, domain 1; RNA polymerases catalyze the DNA dependent polymerization of ...
12-356 4.12e-132

RNA polymerase Rpb1, domain 1; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 1, represents the clamp domain, which a mobile domain involved in positioning the DNA, maintenance of the transcription bubble and positioning of the nascent RNA strand.


Pssm-ID: 398595  Cd Length: 320  Bit Score: 409.76  E-value: 4.12e-132
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    12 AKKISHICFGMKSPEEMRQQAHIQVVSKNLYSqDNQHAPLLYGVLDHRMGTSEKDRPCETCGKNLADCLGHYGYIDLELP 91
Cdd:pfam04997    1 LKKIKEIQFGIASPEEIRKWSVGEVTKPETYN-YGSLKPEEGGLLDERMGTIDKDYECETCGKKKKDCPGHFGHIELAKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892    92 CFHVGYFRAVIGILQMICKTCCHIMLSQEEKKQFLDYLKRPGLTYLQKRglKKKISDKCRKKNICHHCGAFNGTVKKcgl 171
Cdd:pfam04997   80 VFHIGFFKKTLKILECVCKYCSKLLLDPGKPKLFNKDKKRLGLENLKMG--AKAILELCKKKDLCEHCGGKNGVCGS--- 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   172 lkiihekyktnkkvVDPIVSNFLQSFETAIEHNKEVEpllgrAQENLNPLVVLNLFKRIPAEDVPLLLMNPEAGKPSDLI 251
Cdd:pfam04997  155 --------------QQPVSRKEGLKLKAAIKKSKEEE-----EKEILNPEKVLKIFKRISDEDVEILGFNPSGSRPEWMI 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   252 LTRLLVPPLCIRPSVVSDLKsGTNEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQCALYINSELSGIPL- 330
Cdd:pfam04997  216 LTVLPVPPPCIRPSVQLDGG-RRAEDDLTHKLRDIIKRNNRLKKLLELGAPSHIIREEWRLLQEHVATLFDNEIPGLPPa 294
                          330       340
                   ....*....|....*....|....*.
gi 206729892   331 NMAPKKWTRGFVQRLKGKQGRFRGNL 356
Cdd:pfam04997  295 LQKSKRPLKSISQRLKGKEGRFRGNL 320
RNA_pol_Rpb1_5 pfam04998
RNA polymerase Rpb1, domain 5; RNA polymerases catalyze the DNA dependent polymerization of ...
841-1316 2.78e-117

RNA polymerase Rpb1, domain 5; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 5, represents the discontinuous cleft domain that is required to from the central cleft or channel where the DNA is bound.


Pssm-ID: 398596 [Multi-domain]  Cd Length: 516  Bit Score: 377.46  E-value: 2.78e-117
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   841 GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLCSQYDLTVRSSTGDIIQFIYGGDGLDPAAMEGKDePLE 920
Cdd:pfam04998    1 GLTPQEFFFHTMGGREGLIDTAVKTAESGYLQRRLVKALEDLVVTYDDTVRNSGGEIVQFLYGEDGLDPLKIEKQG-RFT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   921 FKRVLDNIKAVFPCP-SEPALSKNELILTTESIMKKSEFLCCQDSFLQEIKKFIKGVSEKI-------KKTRDKYGINDN 992
Cdd:pfam04998   80 IEFSDLKLEDKFKNDlLDDLLLLSEFSLSYKKEILVRDSKLGRDRLSKEAQERATLLFELLlksglesKRVRSELTCNSK 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   993 gtteprvlyqldritptQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKE 1072
Cdd:pfam04998  160 -----------------AFVCLLCYGRLLYQQSLINPGEAVGIIAAQSIGEPGTQMTLNTFHFAGVASKNVTLGVPRLKE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1073 IINASKAISTPIITAQL--DKDDDADYARLVKGRIEKTLLGEISEYIE------------------------------EV 1120
Cdd:pfam04998  223 IINVSKNIKSPSLTVYLfdEVGRELEKAKKVYGAIEKVTLGSVVESGEilydpdpfntpiisdvkgvvkffdiidevtNE 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1121 FLPDDCFILVKLSLERIRLLRLEVNAETVRYSIcTSKLRVKpgDVAVHGEAVVCVTPRENSKSSMYY-------VLQFLK 1193
Cdd:pfam04998  303 EEIDPETGLLILVIRLLKILNKSIKKVVKSEVI-PRSIRNK--VDEGRDIAIGEITAFIIKISKKIRqdtgglrRVDELF 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1194 ED------------LPKVVVQGIPEVSRAVIHIDEqSGKEK--YKLLVEGDNLRAVMATHG-VKGTRTTSNNTYEVEKTL 1258
Cdd:pfam04998  380 MEedpklailvaslLGNITLRGIPGIKRILVNEDD-KGKVEpdWVLETEGVNLLRVLLVPGfVDAGRILSNDIHEILEIL 458
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  1259 GIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVL 1316
Cdd:pfam04998  459 GIEAARNALLNEIRNVYRFQGIYINDRHLELIADQMTRKGYIMAIGRHGINKAELSAL 516
RNA_pol_Rpb1_2 pfam00623
RNA polymerase Rpb1, domain 2; RNA polymerases catalyze the DNA dependent polymerization of ...
358-525 5.33e-100

RNA polymerase Rpb1, domain 2; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 2, contains the active site. The invariant motif -NADFDGD- binds the active site magnesium ion.


Pssm-ID: 395498  Cd Length: 166  Bit Score: 316.17  E-value: 5.33e-100
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   358 GKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPEVHPGANFIQqRHTQMKRFLKYGNR 437
Cdd:pfam00623    1 GKRVDFSARTVISPDPNLKLDEVGVPISFAKTLTFPEIVTPYNIKRLRQLVENGPNVYPGANYII-RINGARRDLRYQKR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   438 eKMAQELKYGDIVERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKA 517
Cdd:pfam00623   80 -RLDKELEIGDIVERHVIDGDVVLFNRQPSLHRLSIMGHRVRVLPGKTFRLNLSVTTPYNADFDGDEMNLHVPQSEEARA 158

                   ....*...
gi 206729892   518 EALVLMGT 525
Cdd:pfam00623  159 EAEELMLV 166
RNAP_A'' cd06528
A'' subunit of Archaeal RNA Polymerase (RNAP); Archaeal RNA polymerase (RNAP), like bacterial ...
978-1363 4.27e-99

A'' subunit of Archaeal RNA Polymerase (RNAP); Archaeal RNA polymerase (RNAP), like bacterial RNAP, is a large multi-subunit complex responsible for the synthesis of all RNAs in the cell. The relative positioning of the RNAP core is highly conserved between archaeal RNAP and the three classes of eukaryotic RNAPs. In archaea, the largest subunit is split into two polypeptides, A' and A'', which are encoded by separate genes in an operon. Sequence alignments reveal that the archaeal A'' subunit corresponds to the C-terminal one-third of the RNAPII largest subunit (Rpb1). In subunit A'', several loops in the jaw domain are shorter. The RNAPII Rpb1 interacts with the second-largest subunit (Rpb2) to form the DNA entry and RNA exit channels in addition to the catalytic center of RNA synthesis.


Pssm-ID: 132725 [Multi-domain]  Cd Length: 363  Bit Score: 321.51  E-value: 4.27e-99
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  978 EKIKKTRDKYGIndngtteprvlyqldriTPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAG 1057
Cdd:cd06528     5 EKLEEVLKEHGL-----------------TLSEAEEIIKEVLREYLRSLIEPGEAVGIVAAQSIGEPGTQMTLRTFHYAG 67
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1058 VASMNITLGVPRIKEIINASKAISTPIITAQLDKD--DDADYARLVKGRIEKTLLGEISEYIEevFLPDDCFILVKLSLE 1135
Cdd:cd06528    68 VAEINVTLGLPRLIEIVDARKEPSTPTMTIYLEEEykYDREKAEEVARKIEETTLENLAEDIS--IDLFNMRITIELDEE 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1136 RI--RLLRLEVNAETVRysictsklRVKPGDVAVHGEAVVCVTPRENSKssmYYVLQFLKEDLPKVVVQGIPEVSRAVIh 1213
Cdd:cd06528   146 MLedRGITVDDVLKAIE--------KLKKGKVGEEGDVTLIVLKAEEPS---IKELRKLAEKILNTKIKGIKGIKRVIV- 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1214 idEQSGKEkYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDL 1293
Cdd:cd06528   214 --RKEEDE-YVIYTEGSNLKAVLKVEGVDPTRTTTNNIHEIEEVLGIEAARNAIINEIKRTLEEQGLDVDIRHIMLVADI 290
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1294 MTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLL 1363
Cdd:cd06528   291 MTYDGEVRQIGRHGIAGEKPSVLARAAFEVTVKHLLDAAVRGEVDELRGVIENIIVGQPIPLGTGDVELT 360
PRK04309 PRK04309
DNA-directed RNA polymerase subunit A''; Validated
1006-1363 1.03e-96

DNA-directed RNA polymerase subunit A''; Validated


Pssm-ID: 235277 [Multi-domain]  Cd Length: 383  Bit Score: 316.02  E-value: 1.03e-96
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1006 ITPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPII 1085
Cdd:PRK04309   35 LTEEEVEEIIEEVVREYLRSLVEPGEAVGVVAAQSIGEPGTQMTMRTFHYAGVAEINVTLGLPRLIEIVDARKEPSTPMM 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1086 TAQLDKD--DDADYARLVKGRIEKTLLGEISEYIEevFLPDDCFILVKLSLERI--RLLRLEVNAETVRysictsklRVK 1161
Cdd:PRK04309  115 TIYLKDEyaYDREKAEEVARKIEATTLENLAKDIS--VDLANMTIIIELDEEMLedRGLTVDDVKEAIE--------KKK 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1162 PGDVAVHGEAVVcVTPRENSkssmYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEqsgkEKYKLLVEGDNLRAVMATHGV 1241
Cdd:PRK04309  185 GGEVEIEGNTLI-ISPKEPS----YRELRKLAEKIRNIKIKGIKGIKRVIIRKEG----DEYVIYTEGSNLKEVLKVEGV 255
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1242 KGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASF 1321
Cdd:PRK04309  256 DATRTTTNNIHEIEEVLGIEAARNAIIEEIKNTLEEQGLDVDIRHIMLVADMMTWDGEVRQIGRHGVSGEKASVLARAAF 335
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 206729892 1322 EKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLL 1363
Cdd:PRK04309  336 EVTVKHLLDAAVRGEVDELKGVTENIIVGQPIPLGTGDVELT 377
RNA_pol_rpoA2 TIGR02389
DNA-directed RNA polymerase, subunit A''; This family consists of the archaeal A'' subunit of ...
1010-1365 1.49e-85

DNA-directed RNA polymerase, subunit A''; This family consists of the archaeal A'' subunit of the DNA-directed RNA polymerase. The example from Methanocaldococcus jannaschii contains an intein. [Transcription, DNA-dependent RNA polymerase]


Pssm-ID: 274105 [Multi-domain]  Cd Length: 367  Bit Score: 283.87  E-value: 1.49e-85
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1010 QVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQL 1089
Cdd:TIGR02389   24 ELDEIIKRVEEEYLRSLIDPGEAVGIVAAQSIGEPGTQMTMRTFHYAGVAELNVTLGLPRLIEIVDARKTPSTPSMTIYL 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1090 DKDD--DADYARLVKGRIEKTLLGEISEYIEevflpddcfilVKLSLERIrllRLEVNAETVRYSICTSKL------RVK 1161
Cdd:TIGR02389  104 EDEYekDREKAEEVAKKIEATKLEDVAKDIS-----------IDLADMTV---IIELDEEQLKERGITVDDvekaikKAK 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1162 PGDVAV--HGEAVVCVTPRENSkssmYYVLQFLKEDLPKVVVQGIPEVSRAVIhideQSGKEKYKLLVEGDNLRAVMATH 1239
Cdd:TIGR02389  170 LGKVIEidMDNNTITIKPGNPS----LKELRKLKEKIKNLHIKGIKGIKRVVI----RKEGDEYVIYTEGSNLKEVLKLE 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1240 GVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLA 1319
Cdd:TIGR02389  242 GVDKTRTTTNDIHEIAEVLGIEAARNAIIEEIKRTLEEQGLDVDIRHLMLVADLMTWDGEVRQIGRHGISGEKASVLARA 321
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 206729892  1320 SFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLLHK 1365
Cdd:TIGR02389  322 AFEVTVKHLLDAAIRGEVDELKGVIENIIVGQPIPLGTGDVDLVMD 367
PRK14897 PRK14897
unknown domain/DNA-directed RNA polymerase subunit A'' fusion protein; Provisional
935-1365 4.72e-85

unknown domain/DNA-directed RNA polymerase subunit A'' fusion protein; Provisional


Pssm-ID: 237853 [Multi-domain]  Cd Length: 509  Bit Score: 287.86  E-value: 4.72e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  935 PSEPALSKNELILTTESIMKK---------------SEFLCCQDSFLQEIKKFIKGVSEKiKKTRDKYGINDNGTTEPRV 999
Cdd:PRK14897   53 KIAPYSNSNGIIKKKKPVLKTvleieseekieaidlMEFKRLFGRILDENMSFSTGELLT-AEEKEYYEENSNEDVLKVI 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1000 LYQLD---RITPTQVEKFLETCRDK-----------------YMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVA 1059
Cdd:PRK14897  132 DDVKKlgfRLPPSVIEEIAKAMKKKelsddeyeeilrrireeYERARVDPYEAVGIVAAQSIGEPGTQMTMRTFHYAGVA 211
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1060 SMNITLGVPRIKEIINASKAISTPIITAQLDKD--DDADYARLVKGRIEKTLLGEISEYIEEVflpDDCFILVKLSLERI 1137
Cdd:PRK14897  212 EMNVTLGLPRLIEIVDARKKPSTPTMTIYLKKDyrEDEEKVREVAKKIENTTLIDVADIITDI---AEMSVVVELDEEKM 288
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1138 rllrlevNAETVRYSICTSKLRVKPGDVAVHGEAVVCVTPRENSkssmYYVLQFLKEDLPKVVVQGIPEVSRAVIHIDeq 1217
Cdd:PRK14897  289 -------KERLIEYDDILAAISKLTFKTVEIDDGIIRLKPQQPS----FKKLYLLAEKVKSLTIKGIKGIKRAIARKE-- 355
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1218 sGKEK-YKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTY 1296
Cdd:PRK14897  356 -NDERrWVIYTQGSNLKDVLEIDEVDPTRTYTNDIIEIATVLGIEAARNAIIHEAKRTLQEQGLNVDIRHIMLVADMMTF 434
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 206729892 1297 KGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFKLLHK 1365
Cdd:PRK14897  435 DGSVKAIGRHGISGEKSSVLARAAFEITGKHLLRAGILGEVDKLAGVAENIIVGQPITLGTGAVSLVYK 503
RNAP_IV_RPD1_N cd10506
Largest subunit (NRPD1) of higher plant RNA polymerase IV, N-terminal domain; NRPD1 and NRPE1 ...
55-891 4.38e-84

Largest subunit (NRPD1) of higher plant RNA polymerase IV, N-terminal domain; NRPD1 and NRPE1 are the largest subunits of plant DNA-dependent RNA polymerase IV and V that, together with second largest subunits (NRPD2 and NRPE2), form the active site region of the DNA entry and RNA exit channel. Higher plants have five multi-subunit nuclear RNA polymerases; RNAP I, RNAP II and RNAP III, which are essential for viability, plus the two isoforms of the non-essential polymerase RNAP IV and V, which specialize in small RNA-mediated gene silencing pathways. RNAP IV and/or V might be involved in RNA-directed DNA methylation of endogenous repetitive elements, silencing of transgenes, regulation of flowering-time genes, inducible regulation of adjacent gene pairs, and spreading of mobile silencing signals. The subunit compositions of RNAP IV and V reveal that they evolved from RNAP II.


Pssm-ID: 259849 [Multi-domain]  Cd Length: 744  Bit Score: 292.00  E-value: 4.38e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   55 VLDHRMGTSEKDRPCETCG-KNLADCLGHYGYIDLELPCFHVGYFRAVIGILQMICKTCchimlsqeekkqfldylkrpg 133
Cdd:cd10506    20 VTNPRLGLPNESGQCTTCGaKDNKKCEGHFGVIKLPVTIYHPYFISEVAQILNKICPGC--------------------- 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  134 ltylqkrglkkkisdkcrkknichhcgafngtvkkcglLKIIHEKYKTNKKVVDPIVSNFLQSFETaiEHNKEVEPLLgr 213
Cdd:cd10506    79 --------------------------------------KSIKQKKKKPPRETLPPDYWDFIPKDGQ--QEESCVTKNL-- 116
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  214 aqenlnPLVVLNLFKRIPAEDVPLLLMN--PeagKPSDLILTRLLVPPLCIRpsvVSDLKSGTNEddlTMKLTEIIFLND 291
Cdd:cd10506   117 ------PILSLAQVKKILKEIDPKLIAKglP---RQEGLFLKCLPVPPNCHR---VTEFTHGFST---GSRLIFDERTRA 181
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  292 VIKKhrisgaktqmimedwdflqlqcALYINSELSGIPLNMAPKKWtrgfvqrlkgkqgrFRGNLSGKRVDFSGRTVISP 371
Cdd:cd10506   182 YKKL----------------------VDFIGTANESAASKKSGLKW--------------MKDLLLGKRSGHSFRSVVVG 225
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  372 DPNLRIDEVAVPVHVAKILTFPEKVNKANINFLRKLVQNGPevhpganfiQQRHTQMKRflKYGN--REKMAQELKYGDI 449
Cdd:cd10506   226 DPYLELNEIGIPCEIAERLTVSERVSSWNRERLQEYCDLTL---------LLKGVIGVR--RNGRlvGVRSHNTLQIGDV 294
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  450 VERHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHR-TFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKAN 528
Cdd:cd10506   295 IHRPLVDGDVVLVNRPPSIHQHSLIALSVKVLPTNsVVSINPLCCSPFRGDFDGDCLHGYIPQSLQARAELEELVALPKQ 374
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  529 LVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIiaSILVGKdekikvRLPPPTILKPVT----LWTGKQIFSVI 604
Cdd:cd10506   375 LISSQSGQNLLSLTQDSLLAAHLMTERGVFLDKAQMQQL--QMLCPS------QLPPPAIIKSPPsngpLWTGKQLFQML 446
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  605 LrPSDDNpvranlrtkgkqycgkgedlCANDSY-VTIQNSELMSGSmDKGTLGSGSKNNIFYILLRDwGQNEAADAMSRL 683
Cdd:cd10506   447 L-PTDLD--------------------YSFPSNlVFISDGELISSS-GGSSWLRDSEGNLFSILVKH-GPGKALDFLDSA 503
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  684 ARLAPVYLSNRGFSIGIGDVTPGQGLLKAK--YELLNAGY--KKCDEYIEALNTGKLQQQPGCTAEETLEALILKELSVI 759
Cdd:cd10506   504 QGLLCEWLSMRGFSVSLSDLYLSSDSYSRQkmIEEISLGLreAEIACNIKQLLVDSRKDFLSGSGEENDVSSDVERVIYE 583
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  760 RDHAGSAC------------------LRELDKSNSPLTMALCGSKGSFINISQMIACVGQQ--------AISGSRVPDGF 813
Cdd:cd10506   584 RQKSAALSqasvsafkqvfrdiqnlvYKYASKDNSLLAMIKAGSKGSLLKLVQQSGCLGLQlslvklsyRIPRQLSCAAW 663
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  814 ENRSLPHFEKHSK-----LPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDtavKTAET-GYMQRRLVKSLEDLCSQYD 887
Cdd:cd10506   664 NSQKSPRVIEKDGsecteSYIPYGVVESSFLDGLNPLECFVHSITSRDSSFS---SNADLpGTLFRKLMFFMRDIYVAYD 740

                  ....
gi 206729892  888 LTVR 891
Cdd:cd10506   741 GTVR 744
RNAP_II_Rpb1_C cd02584
Largest subunit (Rpb1) of Eukaryotic RNA polymerase II (RNAP II), C-terminal domain; RNA ...
1005-1363 5.11e-81

Largest subunit (Rpb1) of Eukaryotic RNA polymerase II (RNAP II), C-terminal domain; RNA polymerase II (RNAP II) is a large multi-subunit complex responsible for the synthesis of mRNA. RNAP II consists of a 10-subunit core enzyme and a peripheral heterodimer of two subunits. The largest core subunit (Rpb1) of yeast RNAP II is the best characterized member of this family. Structure studies suggest that RNAP complexes from different organisms share a crab-claw-shape structure. In yeast, Rpb1 and Rpb2, the largest and the second largest subunits, each makes up one clamp, one jaw, and part of the cleft. Rpb1 interacts with Rpb2 to form the DNA entry and RNA exit channels in addition to the catalytic center of RNA synthesis. The C-terminal domain of Rpb1 makes up part of the foot and jaw structures.


Pssm-ID: 132720 [Multi-domain]  Cd Length: 410  Bit Score: 272.93  E-value: 5.11e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1005 RITPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPI 1084
Cdd:cd02584     2 RLNKEAFDWILGEIETRFNRSLVHPGEMVGTIAAQSIGEPATQMTLNTFHFAGVSAKNVTLGVPRLKEIINVAKNIKTPS 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1085 ITAQLDKDD--DADYARLVKGRIEKTLLGEISEYIEEVFLPDDCFILVK----------------LSLERIR--LLRLEV 1144
Cdd:cd02584    82 LTVYLEPGFakDEEKAKKIQSRLEHTTLKDVTAATEIYYDPDPQNTVIEedkefvesyfefpdedVEQDRLSpwLLRIEL 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1145 NAETVrysicTSKlRVKPGDVA-----VHGEAVVCVTPRENS---------------KSSMYYVLQFLKED----LPKVV 1200
Cdd:cd02584   162 DRKKM-----TDK-KLSMEQIAkkikeEFKDDLNVIFSDDNAeklviririinddeeKEEDSEDDVFLKKIesnmLSDMT 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1201 VQGIPEVSRAVIH------IDEQSGKEK----YKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGIEAARTTIINE 1270
Cdd:cd02584   236 LKGIEGIRKVFIReenkkkVDIETGEFKkreeWVLETDGVNLREVLSHPGVDPTRTTSNDIVEIFEVLGIEAARKALLKE 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1271 IQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMG 1350
Cdd:cd02584   316 LRNVISFDGSYVNYRHLALLCDVMTQRGHLMAITRHGINRQDTGPLMRCSFEETVDILLEAAAFGETDDLKGVSENIMLG 395
                         410
                  ....*....|...
gi 206729892 1351 IPMNIGTGLFKLL 1363
Cdd:cd02584   396 QLAPIGTGCFDLL 408
rpoC_TIGR TIGR02386
DNA-directed RNA polymerase, beta' subunit, predominant form; Bacteria have a single ...
246-1359 5.03e-80

DNA-directed RNA polymerase, beta' subunit, predominant form; Bacteria have a single DNA-directed RNA polymerase, with required subunits that include alpha, beta, and beta-prime. This model describes the predominant architecture of the beta-prime subunit in most bacteria. This model excludes from among the bacterial mostly sequences from the cyanobacteria, where RpoC is replaced by two tandem genes homologous to it but also encoding an additional domain. [Transcription, DNA-dependent RNA polymerase]


Pssm-ID: 274103 [Multi-domain]  Cd Length: 1140  Bit Score: 288.10  E-value: 5.03e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   246 KPSDLILTRLLVPPLCIRPSV--------VSDLksgtneDDLTMKlteIIFLNDVIKKHRISGA-------KTQMIMEDW 310
Cdd:TIGR02386  215 RPEWMVLDVIPVIPPELRPMVqldggrfaTSDL------NDLYRR---VINRNNRLKRLLELGApeiivrnEKRMLQEAV 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   311 DflqlqcALYINSElSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKIL 390
Cdd:TIGR02386  286 D------ALFDNGR-RGKPVVGKNNRPLKSLSDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKMYQCGLPKKMALEL 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   391 TFPEKVNK-------ANINFLRKLVQNG-PEVHpganfiqqrhtqmkrflkygnrekmaqelkygDIVERhLIDGDVVLF 462
Cdd:TIGR02386  359 FKPFIIKRlidrelaANIKSAKKMIEQEdPEVW--------------------------------DVLED-VIKEHPVLL 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   463 NRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAI 542
Cdd:TIGR02386  406 NRAPTLHRLGIQAFEPVLVEGKAIRLHPLVCTAFNADFDGDQMAVHVPLSPEAQAEARALMLASNNILNPKDGKPIVTPS 485
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   543 QDFLTGAYLLTL--------KDTFFDRAKACQIIASILVGKDEKIKVRLPPPTILKPVtlwtGKQIFSVILrpsddnPVr 614
Cdd:TIGR02386  486 QDMVLGLYYLTTekpgakgeGKIFSNVDEAIRAYDNGKVHLHALIGVRTSGEILETTV----GRVIFNEIL------PE- 554
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   615 anlrtkgkqycgkgedlcaNDSYVTIQNselmsgSMDKGTLGSgsknnIFYILLRDWGQNEAADAMSRLARLAPVYLSNR 694
Cdd:TIGR02386  555 -------------------GFPYINDNE------PLSKKEISS-----LIDLLYEVHGIEETAEMLDKIKALGFKYATKS 604
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   695 GFSIGIGDV-TPGQgllkaKYELLNAGYKKCDEYIEALNTGKLqqqpgcTAEETLEALIlKELSVIRDHAGSACLRELDK 773
Cdd:TIGR02386  605 GTTISASDIvVPDE-----KYEILKEADKEVAKIQKFYNKGLI------TDEERYRKVV-SIWSETKDKVTDAMMKLLKK 672
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   774 S----NSPLTMALCGSKGSFINISQMIACVGQQAisgsrVPDGfENRSLPhfekhsklpaakgfVANSFYSGLTPTEFFF 849
Cdd:TIGR02386  673 DtykfNPIFMMADSGARGNISQFRQLAGMRGLMA-----KPSG-DIIELP--------------IKSSFREGLTVLEYFI 732
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   850 HTMAGREGLVDTAVKTAETGYMQRRLVKSledlcSQyDLTVR-----SSTGDIIqfiyggdgldPAAMEGKDEPLE--FK 922
Cdd:TIGR02386  733 STHGARKGLADTALKTADSGYLTRRLVDV-----AQ-DVVVReedcgTEEGIEV----------EAIVEGKDEIIEslKD 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   923 RVLDNIKA---VFPCPSEPALSKNELIltTESIMKKSEFLccqdsflqeikkfikGVSE-KIKKT---RDKYGINdngtt 995
Cdd:TIGR02386  797 RIVGRYSAedvYDPDTGKLIAEANTLI--TEEIAEKIENS---------------GIEKvKVRSVltcESEHGVC----- 854
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   996 epRVLYQLDRITPTQVEKfletcrdkymraqmepGSAVGALCAQSIGEPGTQMTLKTFHFAGVAS--MNITLGVPRIKEI 1073
Cdd:TIGR02386  855 --QKCYGRDLATGKLVEI----------------GEAVGVIAAQSIGEPGTQLTMRTFHTGGVAGasGDITQGLPRVKEL 916
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1074 INAskaiSTPiitaqldkDDDADYARlVKGRIEktllgeiseyieevFLPDDcfilVKlslERIRLLRLEVNAETVRYSI 1153
Cdd:TIGR02386  917 FEA----RTP--------KDKAVIAE-VDGTVE--------------IIEDI----VK---NKRVVVIKDENDEEKKYTI 962
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1154 -CTSKLRVKPGDVAVHGEAVV--CVTPREnskssmyyVLQFLK-EDLPKVVVQGIPEVSRAV-IHIDeqsgkEKYKLLVE 1228
Cdd:TIGR02386  963 pFGAQLRVKDGDSVSAGDKLTegSIDPHD--------LLRIKGiQAVQEYLVKEVQKVYRLQgVEIN-----DKHIEVIV 1029
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  1229 GDNLRAVMAT-HG----VKGTRTTSNNTYEVEKTLgIEAARTTIIneiqytmvnhgmsidrrhvmllsdlmtYKGEVLGI 1303
Cdd:TIGR02386 1030 RQMLRKVRITdSGdsnlLPGELIDIHEFNEENRKL-LEQGKKPAS---------------------------AIPQLLGI 1081
                         1130      1140      1150      1160      1170
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 206729892  1304 TRFGLAkmKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMG--IPMniGTGL 1359
Cdd:TIGR02386 1082 TKASLN--TESFLSAASFQETTKVLTDAAIKGKVDYLLGLKENVIIGnlIPA--GTGL 1135
RNAP_I_Rpa1_C cd02735
Largest subunit (Rpa1) of Eukaryotic RNA polymerase I (RNAP I), C-terminal domain; RNA ...
1021-1363 2.47e-68

Largest subunit (Rpa1) of Eukaryotic RNA polymerase I (RNAP I), C-terminal domain; RNA polymerase I (RNAP I) is a multi-subunit protein complex responsible for the synthesis of rRNA precursor. It consists of at least 14 different subunits, and the largest one is homologous to subunit Rpb1 of yeast RNAP II and subunit beta' of bacterial RNAP. Rpa1 is also known as Rpa190 in yeast. Structure studies suggest that different RNAP complexes share a similar crab-claw-shape structure. The C-terminal domain of Rpb1, the largest subunit of RNAP II, makes up part of the foot and jaw structures of RNAP II. The similarity between this domain and the C-terminal domain of Rpb1, its counterpart in RNAP II, suggests a similar functional and structural role.


Pssm-ID: 132722 [Multi-domain]  Cd Length: 309  Bit Score: 232.85  E-value: 2.47e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1021 KYMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEII-NASKAISTPIITAQLDKDDDADYAR 1099
Cdd:cd02735     1 KYMRSLVEPGEAVGLLAAQSIGEPSTQMTLNTFHFAGRGEMNVTLGIPRLREILmTASKNIKTPSMTLPLKNGKSAERAE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1100 LVKGRIEKTLLGEISEYIE--EVFlpdDCFILV-KLSLERirllRLEVnaeTVRYSICTSKLrvkpgdvavhgeavvcvt 1176
Cdd:cd02735    81 TLKKRLSRVTLSDVVEKVEvtEIL---KTIERVfKKLLGK----WCEV---TIKLPLSSPKL------------------ 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1177 prenskssmyYVLQFLKEDLPKVVVQGIPEVSRAVIHIDEQSGKEKYKLLVEGDNLRAVMATHG-VKGTRTTSNNTYEVE 1255
Cdd:cd02735   133 ----------LLLSIVEKLARKAVIREIPGITRCFVVEEDKGGKTKYLVITEGVNLAALWKFSDiLDVNRIYTNDIHAML 202
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1256 KTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKmKESVLMLASFEKTADHLFDAAYFG 1335
Cdd:cd02735   203 NTYGIEAARRAIVKEISNVFKVYGIAVDPRHLSLIADYMTFEGGYRPFNRIGMES-STSPLQKMSFETTLAFLKKATLNG 281
                         330       340
                  ....*....|....*....|....*...
gi 206729892 1336 QKDSVCGVSECIIMGIPMNIGTGLFKLL 1363
Cdd:cd02735   282 DIDNLSSPSSRLVVGKPVNGGTGLFDLL 309
PRK14898 PRK14898
DNA-directed RNA polymerase subunit A''; Provisional
1046-1369 1.63e-66

DNA-directed RNA polymerase subunit A''; Provisional


Pssm-ID: 237854 [Multi-domain]  Cd Length: 858  Bit Score: 242.88  E-value: 1.63e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1046 TQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIITAQLDKD--DDADYARLVKGRIEKTLLGEISEYIEEVFLP 1123
Cdd:PRK14898  541 THNTMRTFHYAGVAEINVTLGLPRMIEIVDARKEPSTPIMTVHLKGEyaTDREKAEEVAKKIESLTLGDVATSIAIDLWT 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1124 DDcfILVKLSLERI--RLLRLEVNAETVRysictSKLRVKpgdVAVHGeAVVCVTPRENSkssmYYVLQFLKEDLPKVVV 1201
Cdd:PRK14898  621 QS--IKVELDEETLadRGLTIESVEEAIE-----KKLGVK---IDRKG-TVLYLKPKTPS----YKALRKRIPKIKNIVL 685
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1202 QGIPEVSRAVIHIDEQSGKEKYKLLVEGDNLRAVMATHGVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMS 1281
Cdd:PRK14898  686 KGIPGIERVLVKKEEHENDEEYVLYTQGSNLREVFKIEGVDTSRTTTNNIIEIQEVLGIEAARNAIINEMMNTLEQQGLE 765
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1282 IDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGLFK 1361
Cdd:PRK14898  766 VDIRHLMLVADIMTADGEVKPIGRHGVAGEKGSVLARAAFEETVKHLYDAAEHGEVDKLKGVIENVIVGKPIKLGTGCVD 845

                  ....*...
gi 206729892 1362 LLHKADRD 1369
Cdd:PRK14898  846 LRIDREYE 853
PRK00566 PRK00566
DNA-directed RNA polymerase subunit beta'; Provisional
345-1359 4.72e-63

DNA-directed RNA polymerase subunit beta'; Provisional


Pssm-ID: 234794 [Multi-domain]  Cd Length: 1156  Bit Score: 235.73  E-value: 4.72e-63
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  345 LKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvnkaninF-LRKLVQNGpevhpganfIQQ 423
Cdd:PRK00566  321 LKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLHQCGLPKKMALELFKP---------FiMKKLVERG---------LAT 382
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 RHTQMKRFLkygnrEKMAQELkyGDIVErHLIDGDVVLFNRQPSLHKLSIMA-------------H-Larvkphrtfrfn 489
Cdd:PRK00566  383 TIKSAKKMV-----EREDPEV--WDVLE-EVIKEHPVLLNRAPTLHRLGIQAfepvliegkaiqlHpL------------ 442
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  490 ecVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKD--------TFFDR 561
Cdd:PRK00566  443 --VCTAFNADFDGDQMAVHVPLSLEAQAEARVLMLSSNNILSPANGKPIIVPSQDMVLGLYYLTRERegakgegmVFSSP 520
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  562 AKACQIIASILVGKDEKIKVRLPPPTILKpVTLwtGKQIFSVILrPSD---DNPVRAnlrtkgkqycgkgedlcandsyv 638
Cdd:PRK00566  521 EEALRAYENGEVDLHARIKVRITSKKLVE-TTV--GRVIFNEIL-PEGlpfINVNKP----------------------- 573
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  639 tiqnselmsgsMDKGTLGsgsknNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTpgqgLLKAKYELLN 718
Cdd:PRK00566  574 -----------LKKKEIS-----KIINEVYRRYGLKETVIFLDKIKDLGFKYATRSGISIGIDDIV----IPPEKKEIIE 633
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  719 AGYKKCDEYIEALNTGKLqqqpgcTAEETLEALIlKELSVIRDHAGSACLRELDKSNSPL----TMALCGSKGSFINISQ 794
Cdd:PRK00566  634 EAEKEVAEIEKQYRRGLI------TDGERYNKVI-DIWSKATDEVAKAMMKNLSKDQESFnpiyMMADSGARGSASQIRQ 706
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  795 miacvgqqaISGSRvpdgfenrslphfekhsklpaakGFVAN------------SFYSGLTPTEFFFHTMAGREGLVDTA 862
Cdd:PRK00566  707 ---------LAGMR-----------------------GLMAKpsgeiietpiksNFREGLTVLEYFISTHGARKGLADTA 754
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  863 VKTAETGYMQRRLVksleDLcSQyDLTVR-----SSTGDIIqfiyggdgldPAAMEGKD--EPLE---FKRVLdnIKAVF 932
Cdd:PRK00566  755 LKTADSGYLTRRLV----DV-AQ-DVIVReddcgTDRGIEV----------TAIIEGGEviEPLEeriLGRVL--AEDVV 816
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  933 -PCPSEPALSKNELIltTESIMKKSEflccqDSFLQEIKkfI---------KGVSEKIkktrdkYGINdngtteprvlyq 1002
Cdd:PRK00566  817 dPETGEVIVPAGTLI--DEEIADKIE-----EAGIEEVK--IrsvltcetrHGVCAKC------YGRD------------ 869
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1003 LDRITPTQVekfletcrdkymraqmepGSAVGALCAQSIGEPGTQMTLKTFHFAGVasmNITLGVPRIKEIINASK---- 1078
Cdd:PRK00566  870 LATGKLVNI------------------GEAVGVIAAQSIGEPGTQLTMRTFHTGGV---DITGGLPRVAELFEARKpkgp 928
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1079 AISTPIitaqldkdddadyarlvKGRIEKtllGEISEYIEEVFLPDDcfilvklslerirllrlevNAETVRYSICTSK- 1157
Cdd:PRK00566  929 AIIAEI-----------------DGTVSF---GKETKGKRRIVITPD-------------------DGEEREYLIPKGKh 969
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1158 LRVKPGDVAVHGEAvvcvtprenskssmyyvlqflkedlpkvvvqgipevsravihideqsgkekyklLVEGdnlravma 1237
Cdd:PRK00566  970 LLVQEGDHVEAGDK------------------------------------------------------LTDG-------- 987
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1238 thgvkgtrttSNNTYEVEKTLGIEAARTTIINEIQ--YTMvnHGMSIDRRHV------MLL---------SDLM------ 1294
Cdd:PRK00566  988 ----------SIDPHDILRVLGVEAVQNYLVNEVQkvYRL--QGVKINDKHIevivrqMLRkvritdpgdTDFLpgelvd 1055
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1295 -------------------TYKGEVLGITRFGLAkmKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMG--IPM 1353
Cdd:PRK00566 1056 rsefeeenrkliaegkepaTGRPVLLGITKASLA--TESFLSAASFQETTRVLTEAAIKGKVDPLRGLKENVIIGrlIPA 1133

                  ....*.
gi 206729892 1354 niGTGL 1359
Cdd:PRK00566 1134 --GTGL 1137
RNAP_largest_subunit_C cd00630
Largest subunit of RNA polymerase (RNAP), C-terminal domain; RNA polymerase (RNAP) is a large ...
1030-1359 3.49e-61

Largest subunit of RNA polymerase (RNAP), C-terminal domain; RNA polymerase (RNAP) is a large multi-subunit complex responsible for the synthesis of RNA. It is the principal enzyme of the transcription process, and is the final target in many regulatory pathways that control gene expression in all living cells. At least three distinct RNAP complexes are found in eukaryotic nuclei, RNAP I, RNAP II, and RNAP III, for the synthesis of ribosomal RNA precursor, mRNA precursor, and 5S and tRNA, respectively. A single distinct RNAP complex is found in prokaryotes and archaea, which may be responsible for the synthesis of all RNAs. Structure studies revealed that prokaryotic and eukaryotic RNAPs share a conserved crab-claw-shape structure. The largest and the second largest subunits each make up one clamp, one jaw, and part of the cleft. The largest RNAP subunit (Rpb1) interacts with the second-largest RNAP subunit (Rpb2) to form the DNA entry and RNA exit channels in addition to the catalytic center of RNA synthesis. The region covered by this domain makes up part of the foot and jaw structures. In archaea, some photosynthetic organisms, and some organelles, this domain exists as a separate subunit, while it forms the C-terminal region of the RNAP largest subunit in eukaryotes and bacteria.


Pssm-ID: 132719 [Multi-domain]  Cd Length: 158  Bit Score: 206.11  E-value: 3.49e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1030 GSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASkaistpiitaqldkdddadyarlvkgriektl 1109
Cdd:cd00630     1 GEAVGVLAAQSIGEPGTQMTLRTFHFAGVASMNVTLGLPRLKEILNAA-------------------------------- 48
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1110 lgeiseyieevflpddcfilvklslerirllrlevnaetvrysictsklrvkpgdvavhgeavvcvtprenskssmyyvl 1189
Cdd:cd00630       --------------------------------------------------------------------------------
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1190 qflkedlpkvvvqgipevsravihideqsgkekykllvegdnlravmathgvkgtrttsnNTYEVEKTLGIEAARTTIIN 1269
Cdd:cd00630    49 ------------------------------------------------------------SIHEMLEALGIEAARETIIR 68
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1270 EIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVLGITRFGLAKMKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIM 1349
Cdd:cd00630    69 EIQKVLASQGVSVDRRHIELIADVMTYSGGLRGVTRSGFRASKTSPLMRASFEKTTKHLLDAAAAGEKDELEGVSENIIL 148
                         330
                  ....*....|
gi 206729892 1350 GIPMNIGTGL 1359
Cdd:cd00630   149 GRPAPLGTGS 158
PRK14906 PRK14906
DNA-directed RNA polymerase subunit beta';
246-1359 1.50e-58

DNA-directed RNA polymerase subunit beta';


Pssm-ID: 184899 [Multi-domain]  Cd Length: 1460  Bit Score: 222.82  E-value: 1.50e-58
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  246 KPSDLILTRLLVPPLCIRPSVvsDLKSGT-NEDDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQC-ALYINS 323
Cdd:PRK14906  311 DPADMILDVIPVIPPDLRPMV--QLDGGRfATSDLNDLYRRVINRNNRLKRLLDLGAPEIIVNNEKRMLQEAVdSLFDNG 388
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  324 ElSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvnkaniNF 403
Cdd:PRK14906  389 R-RGRPVTGPGNRPLKSLADMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPHLKLHQCGLPSAMALELFKP--------FV 459
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  404 LRKLVqngpEVHPGANfIQQRHTQMKRFLKYgnrekmaqelkYGDIVERhLIDGDVVLFNRQPSLHKLSIMAHLARVKPH 483
Cdd:PRK14906  460 MKRLV----ELEYAAN-IKAAKRAVDRGASY-----------VWDVLEE-VIQDHPVLLNRAPTLHRLGIQAFEPVLVEG 522
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  484 RTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLT-LKDTF---- 558
Cdd:PRK14906  523 KAIKLHPLVCTAFNADFDGDQMAVHVPLSTQAQAEARVLMLSSNNIKSPAHGRPLTVPTQDMIIGVYYLTtERDGFegeg 602
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  559 -----FDRAKACqIIASILVGKDEKIKVRLPPPTIL-------------KPVTLWTGKQIFSVILrpSDDNPVrANLRTK 620
Cdd:PRK14906  603 rtfadFDDALNA-YDARADLDLQAKIVVRLSRDMTVrgsygdleetkagERIETTVGRIIFNQVL--PEDYPY-LNYKMV 678
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  621 GKQYCGKGEDLCanDSYVTIQnselMSGSMDkgtlgsGSKNNIFYillrdwgqneaadamsrlarlapvYLSNRGFSIGI 700
Cdd:PRK14906  679 KKDIGRLVNDCC--NRYSTAE----VEPILD------GIKKTGFH------------------------YATRAGLTVSV 722
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  701 GDVTpgqgLLKAKYELLNAGYKKCDEYIEALNTGKLqqqpgctAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTM 780
Cdd:PRK14906  723 YDAT----IPDDKPEILAEADEKVAAIDEDYEDGFL-------SERERHKQVVDIWTEATEEVGEAMLAGFDEDNPIYMM 791
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  781 ALCGSKGSFINISQMIACVGQQAISGSRVPDgfenrsLPhfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVD 860
Cdd:PRK14906  792 ADSGARGNIKQIRQLAGMRGLMADMKGEIID------LP--------------IKANFREGLSVLEYFISTHGARKGLVD 851
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  861 TAVKTAETGYMQRRLVKSLEdlcsqyDLTVRSstgdiiqfiyggdgLDPAAMEGKDEPLEFKRVLDNIKAVFPCPSEPAL 940
Cdd:PRK14906  852 TALRTADSGYLTRRLVDVAQ------DVIVRE--------------EDCGTDEGVTYPLVKPKGDVDTNLIGRCLLEDVC 911
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  941 SKNeliltTESIMKKSEFLCCQDSFLQEIKKFIKGVSEKIKKT-RDKYGINDNgtteprvLYQLDRITptqvekfletcr 1019
Cdd:PRK14906  912 DPN-----GEVLLSAGDYIESMDDLKRLVEAGVTKVQIRTLMTcHAEYGVCQK-------CYGWDLAT------------ 967
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1020 dkymRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASMNITLGVPRIKEIINASKAISTPIItaqldkdddADYAr 1099
Cdd:PRK14906  968 ----RRPVNIGTAVGIIAAQSIGEPGTQLTMRTFHSGGVAGDDITQGLPRVAELFEARKPKGEAVL---------AEIS- 1033
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1100 lvkGRIEKTllGEISEYIEEVFLPDDcfilvklsleRIRLLRleVNAETVRYSICTSKLRVKPGDVAVHGEavvcVTPRE 1179
Cdd:PRK14906 1034 ---GTLQIT--GDKTEKTLTIHDQDG----------NSREYV--VSARVQFMPGVEDGVEVRVGQQITRGS----VNPHD 1092
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1180 -----NSKSSMYYVLQFLKEdlpKVVVQGIpEVSRAVIHIDEQSGKEKYKLLVEGDNL----RAVmathgvkgtrttsnN 1250
Cdd:PRK14906 1093 llrltDPNTTLRYIVSQVQD---VYVSQGV-DINDKHIEVIARQMLRKVAVTNPGDSDylpgRQV--------------N 1154
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1251 TYEVEKTlgieaarttiINEIqytmvnhgmsidrrhvmLLSDLMTYKGE--VLGITRFGLAkmKESVLMLASFEKTADHL 1328
Cdd:PRK14906 1155 RYEFEDT----------ANNL-----------------ILEGKQPPVGQplLLGITKASLA--TDSWLSAASFQETTKVL 1205
                        1130      1140      1150
                  ....*....|....*....|....*....|.
gi 206729892 1329 FDAAYFGQKDSVCGVSECIIMGIPMNIGTGL 1359
Cdd:PRK14906 1206 TDAAIEGKVDHLAGLKENVIIGKPIPAGTGL 1236
RNAP_beta'_N cd01609
Largest subunit (beta') of bacterial DNA-dependent RNA polymerase (RNAP), N-terminal domain; ...
345-876 3.83e-53

Largest subunit (beta') of bacterial DNA-dependent RNA polymerase (RNAP), N-terminal domain; Beta' is the largest subunit of bacterial DNA-dependent RNA polymerase (RNAP). This family also includes the eukaryotic plastid-encoded RNAP beta' subunit. Bacterial RNAP is a large multi-subunit complex responsible for the synthesis of all RNAs in the cell. Structure studies suggest that RNA polymerase complexes from different organisms share a crab-claw-shaped structure with two "pincers" defining a central cleft. Beta' and beta, the largest and the second largest subunits of bacterial RNAP, each makes up one pincer and part of the base of the cleft. Beta' contains part of the active site and binds two zinc ions that have a structural role in the formation of the active polymerase.


Pssm-ID: 259845 [Multi-domain]  Cd Length: 659  Bit Score: 198.90  E-value: 3.83e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  345 LKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvnkaninFL-RKLVQNGpevhpGANFIQQ 423
Cdd:cd01609   236 LKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLHQCGLPKEMALELFKP---------FViRELIERG-----LAPNIKS 301
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 rhtqMKRFLkygnrEKMAQELkyGDIVErHLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:cd01609   302 ----AKKMI-----ERKDPEV--WDILE-EVIKGHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCTAFNADFDGD 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTffdrakacqiiasilVGKDEKIKVRL 583
Cdd:cd01609   370 QMAVHVPLSLEAQAEARVLMLSSNNILSPASGKPIVTPSQDMVLGLYYLTKERK---------------GDKGEGIIETT 434
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  584 PpptilkpvtlwtGKQIFSVILRPsddnpvraNLRtkgkqycgkgedlcandsYVTIqnselmsgSMDKGTLGsgsknNI 663
Cdd:cd01609   435 V------------GRVIFNEILPE--------GLP------------------FINK--------TLKKKVLK-----KL 463
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  664 FYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGD-VTPgqgllKAKYELLNAGYKKCDEYIEALNTGKLqqqpgc 742
Cdd:cd01609   464 INECYDRYGLEETAELLDDIKELGFKYATRSGISISIDDiVVP-----PEKKEIIKEAEEKVKEIEKQYEKGLL------ 532
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  743 TAEETLEALILKELSViRDHAGSACLRELDKS--NSPLTMALCGSKGSFINISQMIACVGQQA-ISGSRVPdgfenrsLP 819
Cdd:cd01609   533 TEEERYNKVIEIWTEV-TEKVADAMMKNLDKDpfNPIYMMADSGARGSKSQIRQLAGMRGLMAkPSGKIIE-------LP 604
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 206729892  820 hfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLV 876
Cdd:cd01609   605 --------------IKSNFREGLTVLEYFISTHGARKGLADTALKTADSGYLTRRLV 647
RpoC COG0086
DNA-directed RNA polymerase, beta' subunit/160 kD subunit [Transcription]; DNA-directed RNA ...
345-1120 5.43e-53

DNA-directed RNA polymerase, beta' subunit/160 kD subunit [Transcription]; DNA-directed RNA polymerase, beta' subunit/160 kD subunit is part of the Pathway/BioSystem: RNA polymerase


Pssm-ID: 439856 [Multi-domain]  Cd Length: 1165  Bit Score: 203.85  E-value: 5.43e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  345 LKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvnkaninFL-RKLVQNGpevhpGANFIQQ 423
Cdd:COG0086   321 LKGKQGRFRQNLLGKRVDYSGRSVIVVGPELKLHQCGLPKKMALELFKP---------FIyRKLEERG-----LATTIKS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  424 rhtqMKRFLkygnrEKMAQELKygDIVERhLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPYNADFDGD 503
Cdd:COG0086   387 ----AKKMV-----EREEPEVW--DILEE-VIKEHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCTAFNADFDGD 454
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  504 EMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKD--------TFFDRAKACQIIASILVGK 575
Cdd:COG0086   455 QMAVHVPLSLEAQLEARLLMLSTNNILSPANGKPIIVPSQDMVLGLYYLTRERegakgegmIFADPEEVLRAYENGAVDL 534
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  576 DEKIKVRLPPPTILKPVTLWT--GKQIFSVILrPSDdnpvranlrtkgkqycgkgedlcandsyVTIQNSElmsgsMDKG 653
Cdd:COG0086   535 HARIKVRITEDGEQVGKIVETtvGRYLVNEIL-PQE----------------------------VPFYNQV-----INKK 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  654 TLGsgsknNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGD-VTPgqgllKAKYELLNAGYKKCDEYIEALN 732
Cdd:COG0086   581 HIE-----VIIRQMYRRCGLKETVIFLDRLKKLGFKYATRAGISIGLDDmVVP-----KEKQEIFEEANKEVKEIEKQYA 650
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  733 TGKLqqqpgcTAEETLEALILkelsvIRDHAG----SACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAisgsr 808
Cdd:COG0086   651 EGLI------TEPERYNKVID-----GWTKASleteSFLMAAFSSQNTTYMMADSGARGSADQLRQLAGMRGLMA----- 714
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  809 VPDG--FENRslphfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDLcsqy 886
Cdd:COG0086   715 KPSGniIETP-----------------IGSNFREGLGVLEYFISTHGARKGLADTALKTADSGYLTRRLVDVAQDV---- 773
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  887 dltvrsstgdIIQFIYGG--DGLD-PAAMEGKD--EPLEfKRVLDNIKA---VFPCPSEPALSKNELILTtesimkksef 958
Cdd:COG0086   774 ----------IVTEEDCGtdRGITvTAIKEGGEviEPLK-ERILGRVAAedvVDPGTGEVLVPAGTLIDE---------- 832
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  959 lccqdsflqEIKKFIKGVSEKIKKTRdkygindngtteprvlyqldriTPTQVEKFLETCRDKYMR--AQMEP---GSAV 1033
Cdd:COG0086   833 ---------EVAEIIEEAGIDSVKVR----------------------SVLTCETRGGVCAKCYGRdlARGHLvniGEAV 881
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1034 GALCAQSIGEPGTQMTLKTFHFAGVASmnitlgvPRIKEIINASKAISTPIITAQLDKDDDADYARLVKGRIEKTLLGEI 1113
Cdd:COG0086   882 GVIAAQSIGEPGTQLTMRTFHIGGAAS-------RAAEESSIEAKAGGIVRLNNLKVVVNEEGKGVVVSRNSELVIVDDG 954

                  ....*..
gi 206729892 1114 SEYIEEV 1120
Cdd:COG0086   955 GRREEEY 961
PRK09603 PRK09603
DNA-directed RNA polymerase subunit beta/beta';
246-1060 5.79e-52

DNA-directed RNA polymerase subunit beta/beta';


Pssm-ID: 181983 [Multi-domain]  Cd Length: 2890  Bit Score: 202.08  E-value: 5.79e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  246 KPSDLILTRLLVPPLCIRPSV--------VSDLksgtneDDLTMKlteIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQC 317
Cdd:PRK09603 1622 RPEWMMLTVLPVLPPDLRPLValdggkfaVSDV------NELYRR---VINRNQRLKRLMELGAPEIIVRNEKRMLQEAV 1692
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  318 ALYINSELSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPekvn 397
Cdd:PRK09603 1693 DVLFDNGRSTNAVKGANKRPLKSLSEIIKGKQGRFRQNLLGKRVDFSGRSVIVVGPNLKMDECGLPKNMALELFKP---- 1768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  398 kaniNFLRKLVQNGpevhpganfiqqRHTQMKRflkygnREKMAQElKYGDIVE--RHLIDGDVVLFNRQPSLHKLSIMA 475
Cdd:PRK09603 1769 ----HLLSKLEERG------------YATTLKQ------AKRMIEQ-KSNEVWEclQEITEGYPVLLNRAPTLHKQSIQA 1825
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  476 HLARVKPHRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTL- 554
Cdd:PRK09603 1826 FHPKLIDGKAIQLHPLVCSAFNADFDGDQMAVHVPLSQEAIAECKVLMLSSMNILLPASGKAVAIPSQDMVLGLYYLSLe 1905
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  555 KDTFFDRAKACQIIASILVGKDEK---IKVRLPPPTILKPVTLWTGKQIFSVILrpSDDNPVRANLRTKGKQycgkgedl 631
Cdd:PRK09603 1906 KSGVKGEHKLFSSVNEIITAIDTKeldIHAKIRVLDQGNIIATSAGRMIIKSIL--PDFIPTDLWNRPMKKK-------- 1975
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  632 candsyvtiqnselmsgsmDKGTLgsgsknnIFYIlLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDV-TPgqgll 710
Cdd:PRK09603 1976 -------------------DIGVL-------VDYV-HKVGGIGITATFLDNLKTLGFRYATKAGISISMEDIiTP----- 2023
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  711 KAKYELLNAGYKKCDEYIEALNTGKLqqqpgcTAEETLEALIlKELSVIRDHAGSACLR--ELDKS--NSPLTMALCGSK 786
Cdd:PRK09603 2024 KDKQKMVEKAKVEVKKIQQQYDQGLL------TDQERYNKII-DTWTEVNDKMSKEMMTaiAKDKEgfNSIYMMADSGAR 2096
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  787 GSFINISQMIACVGQQAisgsrVPDGfenrslphfekhsklPAAKGFVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTA 866
Cdd:PRK09603 2097 GSAAQIRQLSAMRGLMT-----KPDG---------------SIIETPIISNFKEGLNVLEYFNSTHGARKGLADTALKTA 2156
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  867 ETGYMQRRL------VKSLEDLCSQY------DLTVRSstgDIIqfiyggdgldpaamegkdEPLE---FKRVLDNiKAV 931
Cdd:PRK09603 2157 NAGYLTRKLidvsqnVKVVSDDCGTHegieitDIAVGS---ELI------------------EPLEeriFGRVLLE-DVI 2214
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  932 FPCPSEPALSKNELILTT----------ESIMKKSEFLCcqdsflqeikKFIKGVSEKIkktrdkYGINdngTTEPRVLY 1001
Cdd:PRK09603 2215 DPITNEILLYADTLIDEEgakkvveagiKSITIRTPVTC----------KAPKGVCAKC------YGLN---LGEGKMSY 2275
                         810       820       830       840       850
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 206729892 1002 qldritptqvekfletcrdkymraqmePGSAVGALCAQSIGEPGTQMTLKTFHFAGVAS 1060
Cdd:PRK09603 2276 ---------------------------PGEAVGVVAAQSIGEPGTQLTLRTFHVGGTAS 2307
RNA_pol_Rpb1_3 pfam04983
RNA polymerase Rpb1, domain 3; RNA polymerases catalyze the DNA dependent polymerization of ...
528-703 4.02e-51

RNA polymerase Rpb1, domain 3; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 3, represents the pore domain. The 3' end of RNA is positioned close to this domain. The pore delimited by this domain is thought to act as a channel through which nucleotides enter the active site and/or where the 3' end of the RNA may be extruded during back-tracking.


Pssm-ID: 461507  Cd Length: 158  Bit Score: 177.44  E-value: 4.02e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   528 NLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDRAKACQIIASILVgkdekikvrLPPPTILKPVT-LWTGKQIFSVILR 606
Cdd:pfam04983    1 NILSPQNGKPIIGPSQDMVLGAYLLTREDTFFDREEVMQLLMYGIV---------LPHPAILKPIKpLWTGKQTFSRLLP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   607 PsddnpvRANLRTKGKQYCgkgEDLCANDSYVTIQNSELMSGSMDKGTLGsGSKNNIFYILLRDWGQNEAADAMSRLARL 686
Cdd:pfam04983   72 N------EINPKGKPKTNE---EDLCENDSYVLINNGELISGVIDKKTVG-KSLGSLIHIIYKEYGPEETAKFLDRLQKL 141
                          170
                   ....*....|....*..
gi 206729892   687 APVYLSNRGFSIGIGDV 703
Cdd:pfam04983  142 GFRYLTKSGFSIGIDDI 158
PRK14844 PRK14844
DNA-directed RNA polymerase subunit beta/beta';
246-1359 1.88e-49

DNA-directed RNA polymerase subunit beta/beta';


Pssm-ID: 173305 [Multi-domain]  Cd Length: 2836  Bit Score: 194.07  E-value: 1.88e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  246 KPSDLILTRLLVPPLCIRPSVvsDLKSGTNE-DDLTMKLTEIIFLNDVIKKHRISGAKTQMIMEDWDFLQLQC-ALYINS 323
Cdd:PRK14844 1665 RPEWMILTTIPILPPDLRPLV--SLESGRPAvSDLNHHYRTIINRNNRLRKLLSLNPPEIMIRNEKRMLQEAVdSLFDNS 1742
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  324 ELSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNKAninf 403
Cdd:PRK14844 1743 RRNALVNKAGAVGYKKSISDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPTLKLNQCGLPKRMALELFKPFVYSKL---- 1818
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  404 lrKLVQNGPEVhpganfiqqrhtqmkrflKYGNREKMAQELKYGDIVErHLIDGDVVLFNRQPSLHKLSIMAHLARVKPH 483
Cdd:PRK14844 1819 --KMYGMAPTI------------------KFASKLIRAEKPEVWDMLE-EVIKEHPVLLNRAPTLHRLGIQAFEPILIEG 1877
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  484 RTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLTLKDTFFDR-- 561
Cdd:PRK14844 1878 KAIQLHPLVCTAFNADFDGDQMAVHVPISLEAQLEARVLMMSTNNVLSPSNGRPIIVPSKDIVLGIYYLTLQEPKEDDlp 1957
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  562 --AKACQIIASILVGK---DEKIKVRL-----PPPTILKPVTLWTGKQIFSVILrpsddnPVRANLrtkgkqycgkGEDL 631
Cdd:PRK14844 1958 sfGAFCEVEHSLSDGTlhiHSSIKYRMeyinsSGETHYKTICTTPGRLILWQIF------PKHENL----------GFDL 2021
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  632 CanDSYVTIQnsELMSgsmdkgtlgsgsknnIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTpgqgLLK 711
Cdd:PRK14844 2022 I--NQVLTVK--EITS---------------IVDLVYRNCGQSATVAFSDKLMVLGFEYATFSGVSFSRCDMV----IPE 2078
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  712 AKYELLNAGYKKCDEYiealntgKLQQQPGCTAEETLEALILKELSVIRDHAGSACLREL------DKSNSPLTMALCGS 785
Cdd:PRK14844 2079 TKATHVDHARGEIKKF-------SMQYQDGLITRSERYNKVIDEWSKCTDMIANDMLKAIsiydgnSKYNSVYMMVNSGA 2151
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  786 KGSfinISQMIACVGQQAISGSrvPDGfENRSLPhfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVDTAVKT 865
Cdd:PRK14844 2152 RGS---TSQMKQLAGMRGLMTK--PSG-EIIETP--------------IISNFREGLNVFEYFNSTHGARKGLADTALKT 2211
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  866 AETGYMQRRLVKSLED-LCSQYDltVRSSTGDIIQFIYGGDGLdPAAMEGKdeplefkrVLDNIKAvfpcpsepalskNE 944
Cdd:PRK14844 2212 ANSGYLTRRLVDVSQNcIVTKHD--CKTKNGLVVRATVEGSTI-VASLESV--------VLGRTAA------------ND 2268
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  945 LI--LTTESIMKKSEFLccqdsflqeikkfikgVSEKIKKtrdkygINDNGTTEPRVLYQLD-RITPTQVEkfLETCRDK 1021
Cdd:PRK14844 2269 IYnpVTKELLVKAGELI----------------DEDKVKQ------INIAGLDVVKIRSPLTcEISPGVCS--LCYGRDL 2324
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1022 YMRAQMEPGSAVGALCAQSIGEPGTQMTLKTFHFAGV-----------ASMNI--------------------------- 1063
Cdd:PRK14844 2325 ATGKIVSIGEAVGVIAAQSVGEPGTQLTMRTFHIGGVmtrgvessniiASINAkiklnnsniiidkngnkivisrscevv 2404
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1064 ---TLGVPRIKE--------IINASKAI------------STPIITaqlDKDDDADYARLVKGrIEKTLLGEISEYIEEV 1120
Cdd:PRK14844 2405 lidSLGSEKLKHsvpygaklYVDEGGSVkigdkvaewdpyTLPIIT---EKTGTVSYQDLKDG-ISITEVMDESTGISSK 2480
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1121 FLPD--------DCFILVKLSLERIRLLRLEVNAETVRYSICTSKLRVKPGDvAVHGEAVVCVTPRENSKS-----SMYY 1187
Cdd:PRK14844 2481 VVKDwklysggaNLRPRIVLLDDNGKVMTLASGVEACYFIPIGAVLNVQDGQ-KVHAGDVITRTPRESVKTrditgGLPR 2559
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1188 VLQFLKEDLPK--VVVQGIP------------EVSRAVIHIDEQS-------GKEKYKLLVEGDNLRavmathgvKGTRT 1246
Cdd:PRK14844 2560 VIELFEARRPKehAIVSEIDgyvafsekdrrgKRSILIKPVDEQIspveylvSRSKHVIVNEGDFVR--------KGDLL 2631
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1247 TSNNT--YEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEVL----------------------- 1301
Cdd:PRK14844 2632 MDGDPdlHDILRVLGLEALAHYMISEIQQVYRLQGVRIDNKHLEVILKQMLQKVEITdpgdtmylvgesidklevdrend 2711
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 206729892 1302 -----------------GITRFGLAkmKESVLMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTGL 1359
Cdd:PRK14844 2712 amsnsgkrpahylpilqGITRASLE--TSSFISAASFQETTKVLTEAAFCGKSDPLSGLKENVIVGRLIPAGTGL 2784
RNA_pol_Rpb1_4 pfam05000
RNA polymerase Rpb1, domain 4; RNA polymerases catalyze the DNA dependent polymerization of ...
729-834 1.31e-40

RNA polymerase Rpb1, domain 4; RNA polymerases catalyze the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 4, represents the funnel domain. The funnel contain the binding site for some elongation factors.


Pssm-ID: 398598  Cd Length: 108  Bit Score: 145.20  E-value: 1.31e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   729 EALNTGKLQQQPGCTAEETLEALILKELSVIRDHAGSACLRELDKSNSPLTMALCGSKGSFINISQMIACVGQQAISGSR 808
Cdd:pfam05000    3 DAERYGKLEDIWGMTLEESFEALINNILNKARDPAGNIASKSLDPNNSIYMMADSGAKGSIINISQIAGCRGQQNVEGKR 82
                           90       100
                   ....*....|....*....|....*.
gi 206729892   809 VPDGFENRSLPHFEKHSKLPAAKGFV 834
Cdd:pfam05000   83 IPFGFSGRTLPHFKKDDEGPESRGFV 108
rpoC1 PRK02625
DNA-directed RNA polymerase subunit gamma; Provisional
345-553 6.26e-33

DNA-directed RNA polymerase subunit gamma; Provisional


Pssm-ID: 235055 [Multi-domain]  Cd Length: 627  Bit Score: 136.80  E-value: 6.26e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  345 LKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAKILTFPEKVNK-------ANINFLRKLVQNG-PEVHp 416
Cdd:PRK02625  339 IEGKQGRFRQNLLGKRVDYSGRSVIVVGPKLKMHQCGLPKEMAIELFQPFVIHRlirqgivNNIKAAKKLIQRAdPEVW- 417
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  417 ganfiqqrhtqmkrflkygnrekmaqelkygDIVERhLIDGDVVLFNRQPSLHKLSIMAHLARVKPHRTFRFNECVCTPY 496
Cdd:PRK02625  418 -------------------------------QVLEE-VIEGHPVLLNRAPTLHRLGIQAFEPILVEGRAIQLHPLVCPAF 465
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 206729892  497 NADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTGAYLLT 553
Cdd:PRK02625  466 NADFDGDQMAVHVPLSLEAQAEARLLMLASNNILSPATGEPIVTPSQDMVLGCYYLT 522
rpoC1 CHL00018
RNA polymerase beta' subunit
315-554 2.13e-32

RNA polymerase beta' subunit


Pssm-ID: 214336 [Multi-domain]  Cd Length: 663  Bit Score: 135.42  E-value: 2.13e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  315 LQCAL--YINSELSGIPLNMAPKKWTRGFVQRLKGKQGRFRGNLSGKRVDFSGRTVISPDPNLRIDEVAVPVHVAkILTF 392
Cdd:CHL00018  328 LQEAVdaLLDNGIRGQPMRDGHNKPYKSFSDVIEGKEGRFRENLLGKRVDYSGRSVIVVGPSLSLHQCGLPREIA-IELF 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  393 PEKVNKANINflRKLVQNgpeVHPGANFIQQRHTQMKRFLKygnrekmaqelkygdiverHLIDGDVVLFNRQPSLHKLS 472
Cdd:CHL00018  407 QPFVIRGLIR--QHLASN---IRAAKSKIREKEPIVWEILQ-------------------EVMQGHPVLLNRAPTLHRLG 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  473 IMAhlarVKP----HRTFRFNECVCTPYNADFDGDEMNLHLPQTEEAKAEALVLMGTKANLVTPRNGEPLIAAIQDFLTG 548
Cdd:CHL00018  463 IQA----FQPilveGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQAEARLLMFSHMNLLSPAIGDPISVPSQDMLLG 538

                  ....*.
gi 206729892  549 AYLLTL 554
Cdd:CHL00018  539 LYVLTI 544
rpoC2_cyan TIGR02388
DNA-directed RNA polymerase, beta'' subunit; The family consists of the product of the rpoC2 ...
650-1058 1.09e-17

DNA-directed RNA polymerase, beta'' subunit; The family consists of the product of the rpoC2 gene, a subunit of DNA-directed RNA polymerase of cyanobacteria and chloroplasts. RpoC2 corresponds largely to the C-terminal region of the RpoC (the beta' subunit) of other bacteria. Members of this family are designated beta'' in chloroplasts/plastids, and beta' (confusingly) in Cyanobacteria, where RpoC1 is called beta' in chloroplasts/plastids and gamma in Cyanobacteria. We prefer to name this family beta'', after its organellar members, to emphasize that this RpoC1 and RpoC2 together replace RpoC in other bacteria. [Transcription, DNA-dependent RNA polymerase]


Pssm-ID: 274104 [Multi-domain]  Cd Length: 1227  Bit Score: 89.52  E-value: 1.09e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   650 MDKGTLgsgskNNIFYILLRDWGQNEAADAMSRLARLAPVYLSNRGFSIGIGDVTpgqgLLKAKYELLNAGYKKCDEYIE 729
Cdd:TIGR02388    7 VDKKAL-----KNLISWAYKTYGTARTAAMADKLKDLGFRYATRAGVSISVDDLK----VPPAKQDLLEAAEKEIRATEE 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   730 ALNTGKLQ-----QQPGCTAEETLEALILKelsVIRDhagsacLRELDKSNSPLTMALCGSKGsfiNISQMIACVGQQAI 804
Cdd:TIGR02388   78 RYRRGEITeverfQKVIDTWNGTNEELKDE---VVNN------FRQTDPLNSVYMMAFSGARG---NMSQVRQLVGMRGL 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   805 SGSrvPDGfENRSLPhfekhsklpaakgfVANSFYSGLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDL-- 882
Cdd:TIGR02388  146 MAN--PQG-EIIDLP--------------IKTNFREGLTVTEYVISSYGARKGLVDTALRTADSGYLTRRLVDVSQDViv 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   883 ----C-SQYDLTVRSST-GDiiQFIYGGDGLdpaamegkdepleFKRVLdnIKAVFPCPSEPALSKNELIlttesimkks 956
Cdd:TIGR02388  209 reedCgTERSIVVRAMTeGD--KKISLGDRL-------------LGRLV--AEDVLHPEGEVIVPKNTAI---------- 261
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892   957 eflccqdsflqeikkfikgvSEKIKKTrdkygINDNGTTEPRVLyqldriTPTQVEKFLETCRDKYMRA-----QMEPGS 1031
Cdd:TIGR02388  262 --------------------DPDLAKT-----IETAGISEVVVR------SPLTCEAARSVCRKCYGWSlahahLVDLGE 310
                          410       420
                   ....*....|....*....|....*..
gi 206729892  1032 AVGALCAQSIGEPGTQMTLKTFHFAGV 1058
Cdd:TIGR02388  311 AVGIIAAQSIGEPGTQLTMRTFHTGGV 337
RNAP_beta'_C cd02655
Largest subunit (beta') of Bacterial DNA-dependent RNA polymerase (RNAP), C-terminal domain; ...
1028-1078 1.14e-14

Largest subunit (beta') of Bacterial DNA-dependent RNA polymerase (RNAP), C-terminal domain; Bacterial RNA polymerase (RNAP) is a large multi-subunit complex responsible for the synthesis of all RNAs in the cell. This family also includes the eukaryotic plastid-encoded RNAP beta" subunit. Structure studies suggest that RNAP complexes from different organisms share a crab-claw-shape structure with two pincers defining a central cleft. Beta' and beta, the largest and the second largest subunits of bacterial RNAP, each makes up one pincer and part of the base of the cleft. The C-terminal domain includes a G loop that forms part of the floor of the downstream DNA-binding cavity. The position of the G loop may determine the switch of the bridge helix between flipped-out and normal alpha-helical conformations.


Pssm-ID: 132721 [Multi-domain]  Cd Length: 204  Bit Score: 74.10  E-value: 1.14e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 206729892 1028 EPGSAVGALCAQSIGEPGTQMTLKTFHFAGVASmNITLGVPRIKEIINASK 1078
Cdd:cd02655     4 ELGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAT-DITQGLPRVEELFEARK 53
rpoC2 PRK02597
DNA-directed RNA polymerase subunit beta'; Provisional
769-1058 1.74e-14

DNA-directed RNA polymerase subunit beta'; Provisional


Pssm-ID: 235052 [Multi-domain]  Cd Length: 1331  Bit Score: 78.88  E-value: 1.74e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  769 RELDKSNSPLTMALCGSKGsfiNISQMIACVGQQAISGSrvPDGfENRSLPhfekhsklpaakgfVANSFYSGLTPTEFF 848
Cdd:PRK02597  114 RQNDPLNSVYMMAFSGARG---NMSQVRQLVGMRGLMAN--PQG-EIIDLP--------------IKTNFREGLTVTEYV 173
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  849 FHTMAGREGLVDTAVKTAETGYMQRRLVksleDLcSQyDLTVRSStgdiiqfiyggD-----GLDPAAMEGKDeplefkR 923
Cdd:PRK02597  174 ISSYGARKGLVDTALRTADSGYLTRRLV----DV-SQ-DVIVREE-----------DcgttrGIVVEAMDDGD------R 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  924 VLDNIKavfpcpsepalskNELI--LTTESIM-KKSEFLCCQDsflQEIKKfikGVSEKIKKTrdkygindnGTTEPRVl 1000
Cdd:PRK02597  231 VLIPLG-------------DRLLgrVLAEDVVdPEGEVIAERN---TAIDP---DLAKKIEKA---------GVEEVMV- 281
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 206729892 1001 yqldRiTPTQVEKFLETCRDKY----MRAQM-EPGSAVGALCAQSIGEPGTQMTLKTFHFAGV 1058
Cdd:PRK02597  282 ----R-SPLTCEAARSVCRKCYgwslAHNHLvDLGEAVGIIAAQSIGEPGTQLTMRTFHTGGV 339
RNAP_IV_NRPD1_C cd02737
Largest subunit (NRPD1) of Higher plant RNA polymerase IV, C-terminal domain; Higher plants ...
1030-1363 7.96e-11

Largest subunit (NRPD1) of Higher plant RNA polymerase IV, C-terminal domain; Higher plants have five multi-subunit nuclear RNA polymerases: RNAP I, RNAP II and RNAP III, which are essential for viability; plus the two isoforms of the non-essential polymerase RNAP IV (IVa and IVb), which specialize in small RNA-mediated gene silencing pathways. RNAP IVa and/or RNAP IVb might be involved in RNA-directed DNA methylation of endogenous repetitive elements, silencing of transgenes, regulation of flowering-time genes, inducible regulation of adjacent gene pairs, and spreading of mobile silencing signals. NRPD1a is the largest subunit of RNAP IVa, whereas NRPD1b is the largest subunit of RNAP IVb. The full subunit compositions of RNAP IVa and RNAP IVb are not known, nor are their templates or enzymatic products. However, it has been shown that RNAP IVa and, to a lesser extent, RNAP IVb are crucial for several RNA-mediated gene silencing phenomena.


Pssm-ID: 132724 [Multi-domain]  Cd Length: 381  Bit Score: 65.52  E-value: 7.96e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1030 GSAVGALCAQSIGEPGTQMTLKTfhfagvASMNITLGVPRIKEIINASKAISTP------IITAQLDKDD---DADYARL 1100
Cdd:cd02737     1 GEPVGSLAATAISEPAYKALLDP------PQSLESSPLELLKEVLECRSKSKSKendrrvILSLHLCKCDhgfEYERAAL 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1101 -VKGRIEKTLLGEISEYIEEVFLPDDCFILVKLSLERIRLLRLEVNAEtvRYSICTSKLRVKPGDVAVH-------GEAV 1172
Cdd:cd02737    75 eVKNHLERVTLEDLATTSMIKYSPQATEAIVGEIGDQLNTKKKGKKKA--IFSTSLKITKFSPWVCHFHldkecqkLSDG 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1173 VCVT---PRENSKSSMYyVLQFLKE----DLPKVVVQGIPEV-----------SRAVIHIDEQSGKEKYKLLV------- 1227
Cdd:cd02737   153 PCLTfsvSKEVSKSSEE-LLDVLRDriipFLLETVIKGDERIksvnilwedspSTSWVKSVGKSSRGELVLEVtveesck 231
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892 1228 --EGDNLRAVMAT-----HGVKGTRTTSNNTYEVEKTLGIEAARTTIINEIQYTMVNHGMSIDRRHVMLLSDLMTYKGEV 1300
Cdd:cd02737   232 ktRGNAWNVVMDAcipvmDLIDWERSMPYSIQQIKSVLGIDAAFEQFVQRLESAVSMTGKSVLREHLLLVADSMTYSGEF 311
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 206729892 1301 LGITRFGLAKMKESV-----LMLASFEKTADHLFDAAYFGQKDSVCGVSECIIMGIPMNIGTG-LFKLL 1363
Cdd:cd02737   312 VGLNAKGYKAQRRSLkisapFTEACFSSPIKCFLKAAKKGASDSLSGVLDACAWGKEAPVGTGsKFEIL 380
rpoC2 CHL00117
RNA polymerase beta'' subunit; Reviewed
841-1058 1.63e-08

RNA polymerase beta'' subunit; Reviewed


Pssm-ID: 214368 [Multi-domain]  Cd Length: 1364  Bit Score: 59.57  E-value: 1.63e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  841 GLTPTEFFFHTMAGREGLVDTAVKTAETGYMQRRLVKSLEDL------CsqydLTVRSSTgdiiqfiyggdgLDPAAMEG 914
Cdd:CHL00117  172 GLSLTEYIISCYGARKGVVDTAVRTADAGYLTRRLVEVVQHIvvretdC----GTTRGIS------------VSPRNGMM 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  915 KDEPLEFK---RVL-DNIKavfpcpsepalSKNELILTTEsimkkseflccQDSFLQEIKKFIKgvsekikktrdkygin 990
Cdd:CHL00117  236 IERILIQTligRVLaDDIY-----------IGSRCIATRN-----------QDIGIGLANRFIT---------------- 277
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 206729892  991 dngtteprvlYQLDRI---TPTqvekfleTCRDkyMRA--QM------------EPGSAVGALCAQSIGEPGTQMTLKTF 1053
Cdd:CHL00117  278 ----------FRAQPIsirSPL-------TCRS--TSWicQLcygwslahgdlvELGEAVGIIAGQSIGEPGTQLTLRTF 338

                  ....*
gi 206729892 1054 HFAGV 1058
Cdd:CHL00117  339 HTGGV 343
PRK14898 PRK14898
DNA-directed RNA polymerase subunit A''; Provisional
1004-1050 3.40e-07

DNA-directed RNA polymerase subunit A''; Provisional


Pssm-ID: 237854 [Multi-domain]  Cd Length: 858  Bit Score: 54.90  E-value: 3.40e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 206729892 1004 DRITPTQVEKFLETCRDKYMRAQMEPGSAVGALCAQSIGEPGTQMTL 1050
Cdd:PRK14898   31 DGVTEEMVEEIIDEVVSAYLNALVEPYEAVGIVAAQSIGEPGTQMSL 77
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH