|
Name |
Accession |
Description |
Interval |
E-value |
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
2-1302 |
7.58e-131 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 435.63 E-value: 7.58e-131
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 2 SAIYKLSIQGIRSF--DSNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGGVFIHDPKITGEKDIRAQ 79
Cdd:TIGR00606 1 AKFLKMSILGVRSFgiEDKDKQIIDFFSPLTILVGPNGAGKTTIIECLKYICTGDFPPGTKGNTFVHDPKVAQETDVRAQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 80 VKLAFTSANGLNMIVTRNIQLLMKKTTTTFKTLEGqLVAINNSGDRSTLSTRSLELDAQVPLYLGVPKAILEYVIFCHQE 159
Cdd:TIGR00606 81 IRLQFRDVNGEECAVVRSMVCTQKTKKTEFKTLEG-VITRYKHGEKVSLSSKCAEIDREMISHLGVSKAVLNNVIFCHQE 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 160 DSLWPLSEPSNLKKKFDEIFQAMKFTKALDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKAMKLNIHQLQTKIDQYNEE 239
Cdd:TIGR00606 160 DSNWPLSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREI 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 240 VSEIESQLNEITEKSDKLFKSNQDFQKILSKVENLKNTKLSISDQVKRLSNS-IDILDLSKPDLQNLLANFSKVLMDKNN 318
Cdd:TIGR00606 240 VKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKmEKVFQGTDEQLNDLYHNHQRTVREKER 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 319 QLRDLETDISSLKDRQSSLQSLSNSLIRRQGELEAGKETYEKNRNHLSSLKEAFQHKFQgLSNIENSdmAQVNHEMSQFK 398
Cdd:TIGR00606 320 ELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLE-LDGFERG--PFSERQIKNFH 396
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 399 AFISQDLTD---TIDQFAKDIQLKETNLSDLIKSITVDSQNLEYNKKDRS-KLIHDSEELAEKLKSFKSL--STQDSLNH 472
Cdd:TIGR00606 397 TLVIERQEDeakTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKeILEKKQEELKFVIKELQQLegSSDRILEL 476
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 473 ELENLKTYKEKLQSWESENIIPKLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQADLYAKLGLIKKsintKLDELQKIt 552
Cdd:TIGR00606 477 DQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKD----KMDKDEQI- 551
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 553 eklqNDSRIRQVFPLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHELDRRYTNALYNLNTIEKDLqdnqKSKEKVIQL 632
Cdd:TIGR00606 552 ----RKIKSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNEL----ESKEEQLSS 623
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 633 LSENLPEDCTIDEYNDVLEETELSYKTALENLKMHQTTLEFNRKALE--IAERDSCCYLCSRKFENESFKSKLLQELKTK 710
Cdd:TIGR00606 624 YEDKLFDVCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITqlTDENQSCCPVCQRVFQTEAELQEFISDLQSK 703
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 711 TDA---NFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKEL 787
Cdd:TIGR00606 704 LRLapdKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEES 783
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 788 AESeirpLIEKFTYLEKELKDLENSSKTISEELSIYNTSeDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRE 867
Cdd:TIGR00606 784 AKV----CLTDVTIMERFQMELKDVERKIAQQAAKLQGS-DLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQ 858
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 868 NSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRN 947
Cdd:TIGR00606 859 IQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKK 938
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 948 KQKTVADINRLIDRFQTIYNEVVDFEAKGFDElQTTIKELELNK-AQMLELKEQLDLKSNEVNEEKRKLADSNN-EEKNL 1025
Cdd:TIGR00606 939 AQDKVNDIKEKVKNIHGYMKDIENKIQDGKDD-YLKQKETELNTvNAQLEECEKHQEKINEDMRLMRQDIDTQKiQERWL 1017
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1026 KQNLELIELKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLRT-DYK 1104
Cdd:TIGR00606 1018 QDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREpQFR 1097
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1105 DIEKNYHKEWVELQTRSFVTDDIDVYSKALDSAIMKYHGLKMQDINRIIDELWKRTYSGTDIDTIKIRSD---EVSSTVK 1181
Cdd:TIGR00606 1098 DAEEKYREMMIVMRTTELVNKDLDIYYKTLDQAIMKFHSMKMEEINKIIRDLWRSTYRGQDIEYIEIRSDadeNVSASDK 1177
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1182 GKSYNYRVVMYKQDVELDMRGRCSAGQKVLASIIIRLALSETFGANCGVIALDEPTTNLDEENIESLAKSLHNIINMRRH 1261
Cdd:TIGR00606 1178 RRNYNYRVVMLKGDTALDMRGRCSAGQKVLASLIIRLALAETFCLNCGIIALDEPTTNLDRENIESLAHALVEIIKSRSQ 1257
|
1290 1300 1310 1320
....*....|....*....|....*....|....*....|.
gi 6324079 1262 QKNFQLIVITHDEKFLGHMNAAAFTDHFFKVKRDDRQKSQI 1302
Cdd:TIGR00606 1258 QRNFQLLVITHDEDFVELLGRSEYVEKFYRLKKNEDQCSEI 1298
|
|
| ABC_Rad50 |
cd03240 |
ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ... |
1198-1300 |
9.50e-43 |
|
ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence.
Pssm-ID: 213207 [Multi-domain] Cd Length: 204 Bit Score: 155.07 E-value: 9.50e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1198 LDMRGRCSAGQKVLASIIIRLALSETFGANCGVIALDEPTTNLDEENIEslaKSLHNIINMRRHQKNFQLIVITHDEKFL 1277
Cdd:cd03240 110 LDMRGRCSGGEKVLASLIIRLALAETFGSNCGILALDEPTTNLDEENIE---ESLAEIIEERKSQKNFQLIVITHDEELV 186
|
90 100
....*....|....*....|...
gi 6324079 1278 GHMnaaaftDHFFKVKRDDRQKS 1300
Cdd:cd03240 187 DAA------DHIYRVEKDGRQKS 203
|
|
| ABC_Rad50 |
cd03240 |
ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ... |
4-168 |
1.49e-39 |
|
ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence.
Pssm-ID: 213207 [Multi-domain] Cd Length: 204 Bit Score: 145.83 E-value: 1.49e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 4 IYKLSIQGIRSFDsnDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGGVfiHDPKITGEKDIRAQVKLA 83
Cdd:cd03240 1 IDKLSIRNIRSFH--ERSEIEFFSPLTLIVGQNGAGKTTIIEALKYALTGELPPNSKGGA--HDPKLIREGEVRAQVKLA 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 84 FTSANGLNMIVTRNIqllmkkttttfktlegqlvainnsgdrstlstrsleldaqvplylgvpkAILEYVIFCHQEDSLW 163
Cdd:cd03240 77 FENANGKKYTITRSL-------------------------------------------------AILENVIFCHQGESNW 107
|
....*
gi 6324079 164 PLSEP 168
Cdd:cd03240 108 PLLDM 112
|
|
| AAA_23 |
pfam13476 |
AAA domain; |
7-196 |
7.92e-21 |
|
AAA domain;
Pssm-ID: 463890 [Multi-domain] Cd Length: 190 Bit Score: 91.79 E-value: 7.92e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 7 LSIQGIRSFDSndrETIEFGKPLTLIVGMNGSGKTTIIECLKYATTG---DLPPNSKGGVFIHDPKITGEKDIRAQVKLA 83
Cdd:pfam13476 1 LTIENFRSFRD---QTIDFSKGLTLITGPNGSGKTTILDAIKLALYGktsRLKRKSGGGFVKGDIRIGLEGKGKAYVEIT 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 84 FTSANGLNMIVTRNIQLLMKKTTTTFKTLEGQLVAINNSGDrsTLSTRSLELDAQVPLYLGVPKAILEYVIFCHQEDSLW 163
Cdd:pfam13476 78 FENNDGRYTYAIERSRELSKKKGKTKKKEILEILEIDELQQ--FISELLKSDKIILPLLVFLGQEREEEFERKEKKERLE 155
|
170 180 190
....*....|....*....|....*....|...
gi 6324079 164 PLSEPSNLKKKFDEIFQAMKftKALDNLKSIKK 196
Cdd:pfam13476 156 ELEKALEEKEDEKKLLEKLL--QLKEKKKELEE 186
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
531-1274 |
3.04e-15 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 81.26 E-value: 3.04e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 531 YAKLGLIKKSINTKLDELQKITEKlqndsrirqvfpltqefqradlEMDFQKLFINMQKNIAINNKKMHELDRRYTNALY 610
Cdd:PRK03918 164 YKNLGEVIKEIKRRIERLEKFIKR----------------------TENIEELIKEKEKELEEVLREINEISSELPELRE 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 611 NLNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEETELSyktaLENLKMHQTTLEFNRKALEIAERDSCCYLC 690
Cdd:PRK03918 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEER----IEELKKEIEELEEKVKELKELKEKAEEYIK 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 691 SRKFenesfKSKLLQELktktdANFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKT---S 767
Cdd:PRK03918 298 LSEF-----YEEYLDEL-----REIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELyeeA 367
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 768 KSKLDELEvdstklKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEEL-SIYNTSEDGIQTVDELRDQQRK---- 842
Cdd:PRK03918 368 KAKKEELE------RLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIgELKKEIKELKKAIEELKKAKGKcpvc 441
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 843 ------------MNDSLRELRKTISDLQ--MEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRS------- 901
Cdd:PRK03918 442 grelteehrkelLEEYTAELKRIEKELKeiEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKynleele 521
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 902 -KRENINDIDSRVKELEARIISLKNKKDEAQSvLDKVKNERDIQVRNKQKTVADINRlidrfqtiynEVVDFEAKGFDEL 980
Cdd:PRK03918 522 kKAEEYEKLKEKLIKLKGEIKSLKKELEKLEE-LKKKLAELEKKLDELEEELAELLK----------ELEELGFESVEEL 590
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 981 QTTIKELELNKAQMLELKEQLDlksnEVNEEKRKLAdsnneeknlKQNLELIELKSQLQHIESEISRLDVQNAEAERDKY 1060
Cdd:PRK03918 591 EERLKELEPFYNEYLELKDAEK----ELEREEKELK---------KLEEELDKAFEELAETEKRLEELRKELEELEKKYS 657
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1061 QEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSlthqlrtdykdIEKNYHKEWVELQTRSFVTDDIDVYSKALD----- 1135
Cdd:PRK03918 658 EEEYEELREEYLELSRELAGLRAELEELEKRREE-----------IKKTLEKLKEELEEREKAKKELEKLEKALErveel 726
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1136 -------SAIMKYHGL-KMQDI-NRIIDELWKRTYSGtdidtIKIRSDEVSSTVKgksynyrVVMYKQDVELDMrgrCSA 1206
Cdd:PRK03918 727 rekvkkyKALLKERALsKVGEIaSEIFEELTEGKYSG-----VRVKAEENKVKLF-------VVYQGKERPLTF---LSG 791
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 6324079 1207 GQKVLASIIIRLALSETFGANCGVIALDEPTTNLDEENieslAKSLHNIINmRRHQKNFQLIVITHDE 1274
Cdd:PRK03918 792 GERIALGLAFRLALSLYLAGNIPLLILDEPTPFLDEER----RRKLVDIME-RYLRKIPQVIIVSHDE 854
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
417-1110 |
3.39e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 74.71 E-value: 3.39e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 417 QLKETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKSFKSLSTQdsLNHELENLKTYKEKLQS--WESENIIP 494
Cdd:TIGR02168 221 ELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEE--LRLEVSELEEEIEELQKelYALANEIS 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 495 KLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQADLYAK-LGLIKKSINTKLDELQKITEKLQNDSRIRQVFPLTQEFQR 573
Cdd:TIGR02168 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEeLAELEEKLEELKEELESLEAELEELEAELEELESRLEELE 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 574 ADLEmDFQKLFINMQKNIAINNKKMHELDRRytnalynLNTIEKDLQDNQKSKEKVIQLLSENLPEDC--TIDEYNDVLE 651
Cdd:TIGR02168 379 EQLE-TLRSKVAQLELQIASLNNEIERLEAR-------LERLEDRRERLQQEIEELLKKLEEAELKELqaELEELEEELE 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 652 ETELSYKTALENLKMHQTTLEFNRKALEIAERD-----SCCYLCSRKFENESFKSKLLQELKT----------------K 710
Cdd:TIGR02168 451 ELQEELERLEEALEELREELEEAEQALDAAERElaqlqARLDSLERLQENLEGFSEGVKALLKnqsglsgilgvlseliS 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 711 TDANFEKTL---------KDTVQNEKEYLHSLRLLEKH------IITLNSINE-KIDNSQKCLEKAKEETKTSKSKLDEL 774
Cdd:TIGR02168 531 VDEGYEAAIeaalggrlqAVVVENLNAAKKAIAFLKQNelgrvtFLPLDSIKGtEIQGNDREILKNIEGFLGVAKDLVKF 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 775 EVDSTKL----------KDEKELAESEIRPLIEKFTYL------------------EKELKDLENSSKtiSEELSiyNTS 826
Cdd:TIGR02168 611 DPKLRKAlsyllggvlvVDDLDNALELAKKLRPGYRIVtldgdlvrpggvitggsaKTNSSILERRRE--IEELE--EKI 686
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 827 EDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKEltvSEIESSLTQKQNIDDSIRSKRENI 906
Cdd:TIGR02168 687 EELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLE---AEVEQLEERIAQLSKELTELEAEI 763
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 907 NDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKgFDELQTTIKE 986
Cdd:TIGR02168 764 EELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERR-IAATERRLED 842
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 987 LELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLE--LIELKSQLQHIESEIsrldvQNAEAERDKYQEES 1064
Cdd:TIGR02168 843 LEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEeaLALLRSELEELSEEL-----RELESKRSELRREL 917
|
730 740 750 760
....*....|....*....|....*....|....*....|....*..
gi 6324079 1065 LRLRTRFEKLSSenagKLGEMKQ-LQNQIDSLTHQLRTDYKDIEKNY 1110
Cdd:TIGR02168 918 EELREKLAQLEL----RLEGLEVrIDNLQERLSEEYSLTLEEAEALE 960
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
835-1108 |
7.53e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.55 E-value: 7.53e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 835 ELRDQQRKMN-----DSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESsltQKQNIDDSIRSKRENINDI 909
Cdd:TIGR02168 217 ELKAELRELElallvLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRL---EVSELEEEIEELQKELYAL 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 910 DSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKgFDELQTTIKELEl 989
Cdd:TIGR02168 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE-LEELEAELEELE- 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 990 nkAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLEliELKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRT 1069
Cdd:TIGR02168 372 --SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE--RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEE 447
|
250 260 270
....*....|....*....|....*....|....*....
gi 6324079 1070 RFEKLSSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEK 1108
Cdd:TIGR02168 448 ELEELQEELERLEEALEELREELEEAEQALDAAERELAQ 486
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
347-975 |
2.50e-12 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 71.63 E-value: 2.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 347 RQGELEAGKETYEKnrnhlsslKEAFQHKFQGLSNIENSD------MAQVNHEMSQFKAFISQdlTDTIDQFAKDIQLKE 420
Cdd:PRK03918 133 RQGEIDAILESDES--------REKVVRQILGLDDYENAYknlgevIKEIKRRIERLEKFIKR--TENIEELIKEKEKEL 202
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 421 TNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLksfkslstqDSLNHELENLKTYKEKLqswesENIIPKLNQKI 500
Cdd:PRK03918 203 EEVLREINEISSELPELREELEKLEKEVKELEELKEEI---------EELEKELESLEGSKRKL-----EEKIRELEERI 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 501 EEKNNEMIILENQIEkfqdRIMKTNQQADLYAKLGLIKKSINTKLDELQKITEKLQndSRIRQVFPLTQEFQRADLEM-D 579
Cdd:PRK03918 269 EELKKEIEELEEKVK----ELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLE--EEINGIEERIKELEEKEERLeE 342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 580 FQKLFINMQKNIAInNKKMHELDRRYTNALYNLNTIEKDLQDNqkSKEKVIQLLSEnlpedctIDEYNDVLEETELSYKT 659
Cdd:PRK03918 343 LKKKLKELEKRLEE-LEERHELYEEAKAKKEELERLKKRLTGL--TPEKLEKELEE-------LEKAKEEIEEEISKITA 412
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 660 ALENLKMHQTTLEFNRKALEIAERDscCYLCSRKFENESfKSKLLQELkTKTDANFEKTLKDTVQNEKEYLHSLRLLEK- 738
Cdd:PRK03918 413 RIGELKKEIKELKKAIEELKKAKGK--CPVCGRELTEEH-RKELLEEY-TAELKRIEKELKEIEEKERKLRKELRELEKv 488
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 739 -----HIITLNSINEKIDNSQKCL--------EKAKEETKTSKSKLDELEVDSTKLKDEKElaesEIRPLIEKFTYLEKE 805
Cdd:PRK03918 489 lkkesELIKLKELAEQLKELEEKLkkynleelEKKAEEYEKLKEKLIKLKGEIKSLKKELE----KLEELKKKLAELEKK 564
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 806 LKDLENSSKTISEELSI--YNTSEDGIQTVDELRDQQRKMN------DSLRELRKTISDLQMEKDEKVRENSRMINLIKE 877
Cdd:PRK03918 565 LDELEEELAELLKELEElgFESVEELEERLKELEPFYNEYLelkdaeKELEREEKELKKLEEELDKAFEELAETEKRLEE 644
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 878 KELTVSEIESSLTQKQNiddsiRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDiQVRNKQKTVADINR 957
Cdd:PRK03918 645 LRKELEELEKKYSEEEY-----EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELE-EREKAKKELEKLEK 718
|
650
....*....|....*...
gi 6324079 958 LIDRFQTIYNEVVDFEAK 975
Cdd:PRK03918 719 ALERVEELREKVKKYKAL 736
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
348-1091 |
2.55e-12 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 71.69 E-value: 2.55e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 348 QGELEAGKETYEKNRNHLSSLKEAFQHKFQGLsNIENSDMAQVNHEMSQFKAFISQDLTDTIDQFAKDIQLKETNLSDli 427
Cdd:pfam15921 91 QRRLNESNELHEKQKFYLRQSVIDLQTKLQEM-QMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLED-- 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 428 ksitvdsQNLEYNKKDRSKLIHDS--EELAEKLKSFKSLSTQDSlnHELENLKTYKEKLQSWESENIIPKLNQKIEEKNN 505
Cdd:pfam15921 168 -------SNTQIEQLRKMMLSHEGvlQEIRSILVDFEEASGKKI--YEHDSMSTMHFRSLGSAISKILRELDTEISYLKG 238
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 506 EMIILENQIE---------------KFQDRIMKTNQQADLYAKlGLIKK--SINTKLDELQKITEKLQNDSRIRQVFPLT 568
Cdd:pfam15921 239 RIFPVEDQLEalksesqnkielllqQHQDRIEQLISEHEVEIT-GLTEKasSARSQANSIQSQLEIIQEQARNQNSMYMR 317
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 569 QefqRADLEMDFQKLFINMQKNIAINNKKMHELDRRYTnaLYNLNTIEKDLQDNQKSKE---------KVIQLLSENLPE 639
Cdd:pfam15921 318 Q---LSDLESTVSQLRSELREAKRMYEDKIEELEKQLV--LANSELTEARTERDQFSQEsgnlddqlqKLLADLHKREKE 392
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 640 DCTIDEYNDVLEETELSYKTALENLKMhqttlEFNRKALEIAERDSccylcsrkfenesfkskLLQELKTKTDANFEKTL 719
Cdd:pfam15921 393 LSLEKEQNKRLWDRDTGNSITIDHLRR-----ELDDRNMEVQRLEA-----------------LLKAMKSECQGQMERQM 450
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 720 KdTVQNEKEYLHSLrllekhiitlNSINEKIDNSQKCLEKAKEETKTSKSKLDELE---VDSTKLKDEKELA----ESEI 792
Cdd:pfam15921 451 A-AIQGKNESLEKV----------SSLTAQLESTKEMLRKVVEELTAKKMTLESSErtvSDLTASLQEKERAieatNAEI 519
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 793 RPLIEKFTYLEKELKDLEN------SSKTISEELSIYNTSEDgiQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVR 866
Cdd:pfam15921 520 TKLRSRVDLKLQELQHLKNegdhlrNVQTECEALKLQMAEKD--KVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEK 597
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 867 EnsrminlIKEKELTVSEIEsslTQKQNIDDSIRskreninDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDiQVR 946
Cdd:pfam15921 598 E-------INDRRLELQEFK---ILKDKKDAKIR-------ELEARVSDLELEKVKLVNAGSERLRAVKDIKQERD-QLL 659
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 947 NKQKTV-ADINRLIDRFQTIYNevvDFEAKGfDELQTTIKELELN-KAQMLELKEQLD-LKSNEVNE-EKRKLADSNNEE 1022
Cdd:pfam15921 660 NEVKTSrNELNSLSEDYEVLKR---NFRNKS-EEMETTTNKLKMQlKSAQSELEQTRNtLKSMEGSDgHAMKVAMGMQKQ 735
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 1023 KNLKQNlELIELKSQLQHIESEISrldvqNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQ 1091
Cdd:pfam15921 736 ITAKRG-QIDALQSKIQFLEEAMT-----NANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQ 798
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
727-1100 |
7.76e-12 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 70.10 E-value: 7.76e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 727 KEYLHSLRLLEKHIITLNSINEKIDNSQkclEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKEL 806
Cdd:TIGR02169 705 DELSQELSDASRKIGEIEKEIEQLEQEE---EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL 781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 807 KDLENSsktiseelsiyntseDGIQTVDELRDQQRKMNDSLRELRKTISDLQmekdekvrensrminlikekeltvSEIE 886
Cdd:TIGR02169 782 NDLEAR---------------LSHSRIPEIQAELSKLEEEVSRIEARLREIE------------------------QKLN 822
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 887 SSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNErdiqVRNKQKTVADINRLIDRFQTIY 966
Cdd:TIGR02169 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA----LRDLESRLGDLKKERDELEAQL 898
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 967 NEVvdfeAKGFDELQTTIKELELNkaqMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLkqnLELIELKSQLQHIESEIS 1046
Cdd:TIGR02169 899 REL----ERKIEELEAQIEKKRKR---LSELKAKLEALEEELSEIEDPKGEDEEIPEEE---LSLEDVQAELQRVEEEIR 968
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 6324079 1047 RLDVQNAEAErDKYQEESLRLRTRFEKLSSENAgklgEMKQLQNQIDSLTHQLR 1100
Cdd:TIGR02169 969 ALEPVNMLAI-QEYEEVLKRLDELKEKRAKLEE----ERKAILERIEEYEKKKR 1017
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
855-1057 |
1.06e-11 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 68.25 E-value: 1.06e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 855 SDLQMEKDEKVRENSRMINLIKEK-ELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSV 933
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKElAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 934 LDKVKNERDIQVRNKQKT-----------VADINRLIDRFQtIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLD 1002
Cdd:COG4942 99 LEAQKEELAELLRALYRLgrqpplalllsPEDFLDAVRRLQ-YLKYLAPARREQAEELRADLAELAALRAELEAERAELE 177
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 6324079 1003 LKSNEVNEEKRKLADSNNEEKNLKQNL---------ELIELKSQLQHIESEISRLDVQNAEAER 1057
Cdd:COG4942 178 ALLAELEEERAALEALKAERQKLLARLekelaelaaELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| Rad50_zn_hook |
pfam04423 |
Rad50 zinc hook motif; The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal ... |
666-718 |
1.31e-11 |
|
Rad50 zinc hook motif; The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.
Pssm-ID: 427940 [Multi-domain] Cd Length: 52 Bit Score: 60.67 E-value: 1.31e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 6324079 666 MHQTTLEFNRKALEIAERDSCCYLCSRKFENESfKSKLLQELKTKTDANFEKT 718
Cdd:pfam04423 1 LHQETLELNKKIEELKEAEGCCPLCGRPLDEEH-RSELIKELQSKLERLPEEL 52
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
205-1002 |
1.52e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.32 E-value: 1.52e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 205 LKQSVEHLKLD--KDRSKAMKLNIHQLQTKIDQYNEEVSEIESQLNE----ITEKSDKLFKSNQDFQKILSKVENLKNTK 278
Cdd:TIGR02168 218 LKAELRELELAllVLRLEELREELEELQEELKEAEEELEELTAELQEleekLEELRLEVSELEEEIEELQKELYALANEI 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 279 LSISDQVKRLSNSIDILDLSKPDLQNLLANFSKVLMDKNNQLRDLETDISSLKDRQSSLQSLSNSLIRRQGELEAGKETY 358
Cdd:TIGR02168 298 SRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEEL 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 359 EKNRNHLSSlkEAFQHKFQGLSN---IEN--SDMAQVNHEMSQFKAFISQDLTDTIDQFAKDIQL----KETNLSDLIKS 429
Cdd:TIGR02168 378 EEQLETLRS--KVAQLELQIASLnneIERleARLERLEDRRERLQQEIEELLKKLEEAELKELQAeleeLEEELEELQEE 455
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 430 ITVDSQNLEYNKKDRSKLIHDSEELAEKLKSFKSLstQDSLNHELENLKTYKE-----KLQSWESENIIPKLNQKIEEKN 504
Cdd:TIGR02168 456 LERLEEALEELREELEEAEQALDAAERELAQLQAR--LDSLERLQENLEGFSEgvkalLKNQSGLSGILGVLSELISVDE 533
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 505 NEMIILENQIEKF-QDRIMKTNQQADLyaKLGLIKKSINTKLD--ELQKITEKlQNDSRIRQVFPLTQEFQRADLEM--- 578
Cdd:TIGR02168 534 GYEAAIEAALGGRlQAVVVENLNAAKK--AIAFLKQNELGRVTflPLDSIKGT-EIQGNDREILKNIEGFLGVAKDLvkf 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 579 --DFQKLFINMQKNIAI-NNKKMHELDRRYTNALYNLNTIEKDLQD-----NQKSKEKVIQLLS-----ENLPEDCtide 645
Cdd:TIGR02168 611 dpKLRKALSYLLGGVLVvDDLDNALELAKKLRPGYRIVTLDGDLVRpggviTGGSAKTNSSILErrreiEELEEKI---- 686
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 646 ynDVLEETELSYKTALENLKMHQTTLEFNRKALEIAERDSCCYLCSRKFENESFKSKllQELKTKTDANFEKTLKDTVQN 725
Cdd:TIGR02168 687 --EELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAE--VEQLEERIAQLSKELTELEAE 762
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 726 EKEYLHSLRLLEKHIITLNsinEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKE 805
Cdd:TIGR02168 763 IEELEERLEEAEEELAEAE---AEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERR 839
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 806 LKDLENSSKTISEELSiyntsedgiqtvdELRDQQRKMNDSLRELRKTISDLQMEKDEkvrENSRMINLIKEKELTVSEI 885
Cdd:TIGR02168 840 LEDLEEQIEELSEDIE-------------SLAAEIEELEELIEELESELEALLNERAS---LEEALALLRSELEELSEEL 903
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 886 ESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVL----DKVKNERDIQVRNKQKTVADINRLIDR 961
Cdd:TIGR02168 904 RELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTleeaEALENKIEDDEEEARRRLKRLENKIKE 983
|
810 820 830 840
....*....|....*....|....*....|....*....|....
gi 6324079 962 FQTIYNEVVD-FEA--KGFDELQTTIKELELNKAQMLELKEQLD 1002
Cdd:TIGR02168 984 LGPVNLAAIEeYEElkERYDFLTAQKEDLTEAKETLEEAIEEID 1027
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
757-1101 |
1.55e-11 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 69.33 E-value: 1.55e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 757 LEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEiRPLIEKFTYLEKELKDLEnssktISEELSIYNTSEDGIQTVDEL 836
Cdd:TIGR02169 172 KEKALEELEEVEENIERLDLIIDEKRQQLERLRRE-REKAERYQALLKEKREYE-----GYELLKEKEALERQKEAIERQ 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 837 RDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKE-LTV-SEIESSLTQKQNIDDSIRSKRENINDIDSRVK 914
Cdd:TIGR02169 246 LASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEqLRVkEKIGELEAEIASLERSIAEKERELEDAEERLA 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 915 ELEARIISLKNKKDEAQSVLDKVKNERDiqvrnkqKTVADINRLIDRFQTIYNEVVDFEAKG---FDELQTTIKELELNK 991
Cdd:TIGR02169 326 KLEAEIDKLLAEIEELEREIEEERKRRD-------KLTEEYAELKEELEDLRAELEEVDKEFaetRDELKDYREKLEKLK 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 992 AQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQnlELIELKSQLQHIESEISRLD--VQNAEAERDKYQEESLRLRT 1069
Cdd:TIGR02169 399 REINELKRELDRLQEELQRLSEELADLNAAIAGIEA--KINELEEEKEDKALEIKKQEwkLEQLAADLSKYEQELYDLKE 476
|
330 340 350
....*....|....*....|....*....|..
gi 6324079 1070 RFEKLSSenagklgEMKQLQNQIDSLTHQLRT 1101
Cdd:TIGR02169 477 EYDRVEK-------ELSKLQRELAEAEAQARA 501
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
315-1106 |
1.63e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.32 E-value: 1.63e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 315 DKNNQLRDLETDIS-----SLKDRQSSLQSLSNSLIRRQGELEAGKETYE-------KNRNHLSSLKEAFQHKFQGLSNI 382
Cdd:TIGR02168 217 ELKAELRELELALLvlrleELREELEELQEELKEAEEELEELTAELQELEekleelrLEVSELEEEIEELQKELYALANE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 383 ENsdmaqvnhEMSQFKAFISQdltdtidqfakdiqlKETNLSDLIKSITVDSQNLEynkkdrSKLIHDSEELAEKLKSFK 462
Cdd:TIGR02168 297 IS--------RLEQQKQILRE---------------RLANLERQLEELEAQLEELE------SKLDELAEELAELEEKLE 347
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 463 SLSTQ--------DSLNHELENLKTYKEKLQS-WESEN-IIPKLNQKIEEKNNEMIILENQIEKFQDRIMKT--NQQADL 530
Cdd:TIGR02168 348 ELKEElesleaelEELEAELEELESRLEELEEqLETLRsKVAQLELQIASLNNEIERLEARLERLEDRRERLqqEIEELL 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 531 YAKLGLIKKSINTKLDELQKITEKLQ--NDSRIRQVFPLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHELDRRYTNA 608
Cdd:TIGR02168 428 KKLEEAELKELQAELEELEEELEELQeeLERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEG 507
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 609 LynlntieKDLQDNQKSKEKVIQLLSENLPEDctiDEY------------NDVLEETELSYKTALENLKMHqttlEFNRK 676
Cdd:TIGR02168 508 V-------KALLKNQSGLSGILGVLSELISVD---EGYeaaieaalggrlQAVVVENLNAAKKAIAFLKQN----ELGRV 573
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 677 ALEIAERDSCCYLCSRKFENESFKSKLLQELKT--KTDANFEKTLK---------DTVQNEKEYLHSLRLLEKhIITL-- 743
Cdd:TIGR02168 574 TFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDlvKFDPKLRKALSyllggvlvvDDLDNALELAKKLRPGYR-IVTLdg 652
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 744 -----------------NSINEK---IDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLE 803
Cdd:TIGR02168 653 dlvrpggvitggsaktnSSILERrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALR 732
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 804 KELKDLENSSKTISEELSiyntsedgiQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVS 883
Cdd:TIGR02168 733 KDLARLEAEVEQLEERIA---------QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 884 EIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQ 963
Cdd:TIGR02168 804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 964 TIyNEVVDFEAKGFDELQTTIKELElnkAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNL-------------- 1029
Cdd:TIGR02168 884 SL-EEALALLRSELEELSEELRELE---SKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeysltleeaeal 959
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1030 ------ELIELKSQLQHIESEISRLDVQNAEAErDKYQEeslrLRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLRTDY 1103
Cdd:TIGR02168 960 enkiedDEEEARRRLKRLENKIKELGPVNLAAI-EEYEE----LKERYDFLTAQKEDLTEAKETLEEAIEEIDREARERF 1034
|
...
gi 6324079 1104 KDI 1106
Cdd:TIGR02168 1035 KDT 1037
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
835-1118 |
5.81e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 67.27 E-value: 5.81e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 835 ELRDQ--QRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSR 912
Cdd:COG1196 217 ELKEElkELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 913 VKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAkgfdELQTTIKELELNKA 992
Cdd:COG1196 297 LARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEA----ELAEAEEALLEAEA 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 993 QMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFE 1072
Cdd:COG1196 373 ELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEA 452
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 6324079 1073 KLSSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEKNYHKEWVELQ 1118
Cdd:COG1196 453 ELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLE 498
|
|
| ABC_Class2 |
cd03227 |
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ... |
1200-1294 |
2.62e-10 |
|
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.
Pssm-ID: 213194 [Multi-domain] Cd Length: 162 Bit Score: 60.45 E-value: 2.62e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1200 MRGRCSAGQKVLASIIIRLALSETfgANCGVIALDEPTTNLDEENIESLAkslhNIInMRRHQKNFQLIVITHDEKFlgh 1279
Cdd:cd03227 74 TRLQLSGGEKELSALALILALASL--KPRPLYILDEIDRGLDPRDGQALA----EAI-LEHLVKGAQVIVITHLPEL--- 143
|
90
....*....|....*
gi 6324079 1280 mnaAAFTDHFFKVKR 1294
Cdd:cd03227 144 ---AELADKLIHIKK 155
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
356-1277 |
4.02e-10 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 64.61 E-value: 4.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 356 ETYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSQFKAfISQDLTDTIDQFAKDIQLKETNLSDLIKSITVDSQ 435
Cdd:pfam02463 236 EERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEK-EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEE 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 436 NLEYNKKDRSKLIHD----SEELAEKLKSFKSLSTQDSLNHELENLKTYKEKLQSWESENIIPKLNQKIEEKNNEMIILE 511
Cdd:pfam02463 315 KLKESEKEKKKAEKElkkeKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKE 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 512 NQIEKFQDRIMKTNQQADLYAKLGLIKKSINTKLDELQKITEKLQNDSRIRQVFPLTQEF--------------QRADLE 577
Cdd:pfam02463 395 EELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEkqelkllkdelelkKSEDLL 474
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 578 MDFQKLFINMQKNIAINNKKMHELDRRYTNALYNLNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEETELSY 657
Cdd:pfam02463 475 KETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVS 554
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 658 KTA-------------LENLKMHQTTLEFNRKALEIAERDSCCYLCSRKFENESFKSKLLQELKTKTDANFEKTLKDTVQ 724
Cdd:pfam02463 555 ATAdeveerqklvralTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTEL 634
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 725 nekeylhsLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEK 804
Cdd:pfam02463 635 --------TKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQ 706
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 805 ELKDLENSSKTISEELSIYNTSEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKE--LTV 882
Cdd:pfam02463 707 REKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREktEKL 786
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 883 SEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERdiQVRNKQKTVADINRLIDRF 962
Cdd:pfam02463 787 KVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEE--QKLEKLAEEELERLEEEIT 864
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 963 QTIYNEVVDFEakgfdELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHIE 1042
Cdd:pfam02463 865 KEELLQELLLK-----EEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEE 939
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1043 SEISRLDVQNAEAE-RDKYQEESLRLRTRFEKLSSENAGKLGEMK---QLQNQIDSLTHQLRTDYKDIEKNYHKEwveLQ 1118
Cdd:pfam02463 940 LLLEEADEKEKEENnKEEEEERNKRLLLAKEELGKVNLMAIEEFEekeERYNKDELEKERLEEEKKKLIRAIIEE---TC 1016
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1119 TRSFVTDDIDVYSKALDSAIMKYhgLKMQDINRIIDELWKRTYSGtdidtikirSDEVSSTVKGKSYNYRVVMykqdvel 1198
Cdd:pfam02463 1017 QRLKEFLELFVSINKGWNKVFFY--LELGGSAELRLEDPDDPFSG---------GIEISARPPGKGVKNLDLL------- 1078
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 1199 dmrgrcSAGQKVLASIIIRLALSEtFGANCGVIaLDEPTTNLDEENIESLAKSLHNiinmrrHQKNFQLIVITHDEKFL 1277
Cdd:pfam02463 1079 ------SGGEKTLVALALIFAIQK-YKPAPFYL-LDEIDAALDDQNVSRVANLLKE------LSKNAQFIVISLREEML 1143
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
222-1083 |
6.97e-10 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 63.84 E-value: 6.97e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 222 MKLNIHQLQTKIDQYNEEVSEIESQLNEITEKSDKLFKSNQDFQKILSKVENLKNTKLSISDQVKRLSNSIDILDLSKPD 301
Cdd:pfam02463 232 LKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVD 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 302 LQNLLANfskvlmdKNNQLRDLETDISSLKDRQSSLQSLSNSLIRRQGELEAGKETYEKNRNHLSSLKEAFQHKFQGLSN 381
Cdd:pfam02463 312 DEEKLKE-------SEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESE 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 382 IENSDMAQVNHEMSQF-KAFISQDLTDTIDQFAKDIQLKETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKS 460
Cdd:pfam02463 385 RLSSAAKLKEEELELKsEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDE 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 461 FKSlsTQDSLNHELENLKTYKEKLQSWESENIIPKLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQ----ADLYAKLGL 536
Cdd:pfam02463 465 LEL--KKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGrlgdLGVAVENYK 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 537 IKKSINTKLDELQKITEKLQNDSRIRQVFPLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHELDRRYTNALYNLNTIE 616
Cdd:pfam02463 543 VAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRA 622
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 617 KDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEETELSYKTALENLKMhqttlefnrkaleiaerdsccylcSRKFEN 696
Cdd:pfam02463 623 KVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELT------------------------KELLEI 678
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 697 ESFKSKLLQELKTKTDANFEKTLKDTVQNEKEYLHSLRLLEKHIitlnsineKIDNSQKCLEKAKEETKTSKSKLDELEV 776
Cdd:pfam02463 679 QELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEEL--------LADRVQEAQDKINEELKLLKQKIDEEEE 750
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 777 DSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTiseelsiyntsedgiqtvDELRDQQRKMNDSLRELRKTiSD 856
Cdd:pfam02463 751 EEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVE------------------EEKEEKLKAQEEELRALEEE-LK 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 857 LQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQ---NIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSV 933
Cdd:pfam02463 812 EEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLeklAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESK 891
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 934 LDKVKNERDIQVRNKQKTVADINR-----LIDRFQTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEV 1008
Cdd:pfam02463 892 EEKEKEEKKELEEESQKLNLLEEKeneieERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEE 971
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 6324079 1009 NEEKRKLADSNNEEKNLKQNLELIELKSQLQHIES---EISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLG 1083
Cdd:pfam02463 972 LGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKlirAIIEETCQRLKEFLELFVSINKGWNKVFFYLELGGSAELR 1049
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
883-1064 |
7.86e-10 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 62.54 E-value: 7.86e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 883 SEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKT----------- 951
Cdd:COG3883 30 AELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYRSggsvsyldvll 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 952 -VADINRLIDRFQTIyNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLadsnnEEKNLKQNLE 1030
Cdd:COG3883 110 gSESFSDFLDRLSAL-SKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL-----EAQQAEQEAL 183
|
170 180 190
....*....|....*....|....*....|....
gi 6324079 1031 LIELKSQLQHIESEISRLDVQNAEAERDKYQEES 1064
Cdd:COG3883 184 LAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAA 217
|
|
| SbcC |
COG0419 |
DNA repair exonuclease SbcCD ATPase subunit [Replication, recombination and repair]; |
4-96 |
1.07e-09 |
|
DNA repair exonuclease SbcCD ATPase subunit [Replication, recombination and repair];
Pssm-ID: 440188 [Multi-domain] Cd Length: 204 Bit Score: 59.64 E-value: 1.07e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 4 IYKLSIQGIRSFdsNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDLppnSKGGVFIHDPKITGEKdiRAQVKLA 83
Cdd:COG0419 2 LLRLRLENFRSY--RDTETIDFDDGLNLIVGPNGAGKSTILEAIRYALYGKA---RSRSKLRSDLINVGSE--EASVELE 74
|
90
....*....|...
gi 6324079 84 FtSANGLNMIVTR 96
Cdd:COG0419 75 F-EHGGKRYRIER 86
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
895-1114 |
1.46e-09 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 62.63 E-value: 1.46e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 895 IDDSIRSKRENI--NDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDiqvrnkqkTVADINRLIDRFQTIYNEVVDF 972
Cdd:COG4913 592 KDDRRRIRSRYVlgFDNRAKLAALEAELAELEEELAEAEERLEALEAELD--------ALQERREALQRLAEYSWDEIDV 663
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 973 EA--KGFDELQTTIKELELNKAQMLELKEQLDlksnEVNEEKRKLadsnnEEKNLKQNLELIELKSQLQHIESEISRLDV 1050
Cdd:COG4913 664 ASaeREIAELEAELERLDASSDDLAALEEQLE----ELEAELEEL-----EEELDELKGEIGRLEKELEQAEEELDELQD 734
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 6324079 1051 QNAEAERDKYQEESLRLRTRFEKLSSENAGKlGEMKQLQNQIDSLTHQLRTDYKDIEKN---YHKEW 1114
Cdd:COG4913 735 RLEAAEDLARLELRALLEERFAAALGDAVER-ELRENLEERIDALRARLNRAEEELERAmraFNREW 800
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
178-880 |
1.56e-09 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 62.35 E-value: 1.56e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 178 IFQAMKFTKALDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKAMKLNIHQLQTKIDQYNeevseiesqlNEITEKSDKL 257
Cdd:TIGR04523 29 NKQDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLN----------DKLKKNKDKI 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 258 FKSNQDFQKILSKVENLKNTKLSISDQVKRLSNSIDILDLSKPDLQNLLANFSKVLMDKNNQLRDLETDISSLKDrqssl 337
Cdd:TIGR04523 99 NKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELEN----- 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 338 qslsnslirrqgELEAGKETYEKNRNHLSSLKEAFQHKFQGLSNIE--NSDMAQVNHEMSQFKAfISQDLTDTIDQFAKD 415
Cdd:TIGR04523 174 ------------ELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKkkIQKNKSLESQISELKK-QNNQLKDNIEKKQQE 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 416 IQLKETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKsfkslstqdSLNHELENLKTYKEKLQSWESENIIPK 495
Cdd:TIGR04523 241 INEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIK---------ELEKQLNQLKSEISDLNNQKEQDWNKE 311
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 496 LNQKIEEKNNEMIILENQIEKFQDRIMKTNQQ--------ADLYAKLGLIKKSINTKLDELQKIteKLQNDSRIRQVFPL 567
Cdd:TIGR04523 312 LKSELKNQEKKLEEIQNQISQNNKIISQLNEQisqlkkelTNSESENSEKQRELEEKQNEIEKL--KKENQSYKQEIKNL 389
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 568 TQEFQRADLEMDFQ-KLFINMQKNIAINNKKMHELDRRYTNaLYNLNTIEKDLQDNQKSKEKVIQLLSENLpeDCTIDEY 646
Cdd:TIGR04523 390 ESQINDLESKIQNQeKLNQQKDEQIKKLQQEKELLEKEIER-LKETIIKNNSEIKDLTNQDSVKELIIKNL--DNTRESL 466
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 647 NDVLEETELSYKTALENLKMHQTTLEFNRK---ALEIAERDSCCYLCSRKFENESFKSKLLQELKTKTDANFEKTLKDTV 723
Cdd:TIGR04523 467 ETQLKVLSRSINKIKQNLEQKQKELKSKEKelkKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDE 546
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 724 QNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTsksKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLE 803
Cdd:TIGR04523 547 LNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQE---LIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAK 623
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 804 KELKDLENSSKTISEELSIYNTSEDGIQ-TVDELRDQQRKMNDSLRELRKTISDL-----QMEKDEKVRENSRMINLIKE 877
Cdd:TIGR04523 624 KENEKLSSIIKNIKSKKNKLKQEVKQIKeTIKEIRNKWPEIIKKIKESKTKIDDIielmkDWLKELSLHYKKYITRMIRI 703
|
...
gi 6324079 878 KEL 880
Cdd:TIGR04523 704 KDL 706
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
768-1272 |
2.63e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 62.01 E-value: 2.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 768 KSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIyntsedgiqtvdeLRDQQRKMNDSL 847
Cdd:TIGR02169 673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQ-------------LEQEEEKLKERL 739
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 848 RELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSL---------TQKQNIDDSIRSKRENINDIDSRVKELEA 918
Cdd:TIGR02169 740 EELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALndlearlshSRIPEIQAELSKLEEEVSRIEARLREIEQ 819
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 919 RIislkNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELK 998
Cdd:TIGR02169 820 KL----NRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 999 EQLDLKSNEVNEEKRKLADSNNEEKNLKQNLEliELKSQLQHIESEISR-LDVQNAEAERDKYQEESLRLRTRFEKLSSE 1077
Cdd:TIGR02169 896 AQLRELERKIEELEAQIEKKRKRLSELKAKLE--ALEEELSEIEDPKGEdEEIPEEELSLEDVQAELQRVEEEIRALEPV 973
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1078 NAGKLGEMKQLQNQIDSLTHQLRTdykdieknYHKEWVELQTRsfvtddIDVYSKALDSAIMKyhglKMQDINRIIDELW 1157
Cdd:TIGR02169 974 NMLAIQEYEEVLKRLDELKEKRAK--------LEEERKAILER------IEEYEKKKREVFME----AFEAINENFNEIF 1035
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1158 KRTYSGTDIDTIKIRSD------EVSSTVKGKSYNYRVVMykqdveldmrgrcSAGQKVLASIIIRLALSETFGANcgVI 1231
Cdd:TIGR02169 1036 AELSGGTGELILENPDDpfagglELSAKPKGKPVQRLEAM-------------SGGEKSLTALSFIFAIQRYKPSP--FY 1100
|
490 500 510 520
....*....|....*....|....*....|....*....|.
gi 6324079 1232 ALDEPTTNLDEENIESLAKSLhniinmRRHQKNFQLIVITH 1272
Cdd:TIGR02169 1101 AFDEVDMFLDGVNVERVAKLI------REKAGEAQFIVVSL 1135
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
748-1103 |
1.06e-08 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 59.67 E-value: 1.06e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 748 EKIDNSQKCLEKAKEETKTSKSKLDELE--------------VDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSS 813
Cdd:PRK02224 363 EEAAELESELEEAREAVEDRREEIEELEeeieelrerfgdapVDLGNAEDFLEELREERDELREREAELEATLRTARERV 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 814 KTISEELSIYNTSEDG--------IQTVDELRDQQRKMNDSLRELRKTISDLQMEKD------------EKVRENSRMIN 873
Cdd:PRK02224 443 EEAEALLEAGKCPECGqpvegsphVETIEEDRERVEELEAELEDLEEEVEEVEERLEraedlveaedriERLEERREDLE 522
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 874 -LIKEKELTVSE----IESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQV--- 945
Cdd:PRK02224 523 eLIAERRETIEEkrerAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAaia 602
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 946 ---------RNKQKTVADINRLI-DRFQTIYNEVVDFEAKgFDElqTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKL 1015
Cdd:PRK02224 603 daedeierlREKREALAELNDERrERLAEKRERKRELEAE-FDE--ARIEEAREDKERAEEYLEQVEEKLDELREERDDL 679
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1016 ADSNNEEKNLKQNLEliELKSQLQHIESEISRLDV--QNAEAERDKYQEESLRLRTR-FEKLSSenagKLGEMKQLQNQI 1092
Cdd:PRK02224 680 QAEIGAVENELEELE--ELRERREALENRVEALEAlyDEAEELESMYGDLRAELRQRnVETLER----MLNETFDLVYQN 753
|
410
....*....|..
gi 6324079 1093 DSLTH-QLRTDY 1103
Cdd:PRK02224 754 DAYSHiELDGEY 765
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
612-939 |
1.35e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.69 E-value: 1.35e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 612 LNTIEKDLQDNQKSKEKVIQLLsenlpedctidEYNDVLEETELSYKTALenLKMHQTTLEFNRKALEIAERDsccylcs 691
Cdd:TIGR02168 195 LNELERQLKSLERQAEKAERYK-----------ELKAELRELELALLVLR--LEELREELEELQEELKEAEEE------- 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 692 rKFENESFKSKLLQELKTKTDANFEKTLKDTVQNEKEYLHSLRL--LEKHIItlnSINEKIDNSQKCLEKAKEETKTSKS 769
Cdd:TIGR02168 255 -LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEIsrLEQQKQ---ILRERLANLERQLEELEAQLEELES 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 770 KLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTSEDGIQ--------TVDELRDQQR 841
Cdd:TIGR02168 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLElqiaslnnEIERLEARLE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 842 KMNDSLRELRKTISDLQMEKDEKVRENSRM--INLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEAR 919
Cdd:TIGR02168 411 RLEDRRERLQQEIEELLKKLEEAELKELQAelEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
|
330 340
....*....|....*....|
gi 6324079 920 IISLKNKKDEAQSVLDKVKN 939
Cdd:TIGR02168 491 LDSLERLQENLEGFSEGVKA 510
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
727-1044 |
1.55e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.30 E-value: 1.55e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 727 KEYLHSLRLLEKHIIT--LNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEK 804
Cdd:TIGR02168 216 KELKAELRELELALLVlrLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 805 ELKDLENSSKTISEELSIYNTSEDGIQTVDELRDQQR-KMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVS 883
Cdd:TIGR02168 296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLdELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE 375
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 884 EIESSLTQKQNiddSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTvadinrlidrfq 963
Cdd:TIGR02168 376 ELEEQLETLRS---KVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQA------------ 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 964 tiynevvdfeakgfdELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELieLKSQLQHIES 1043
Cdd:TIGR02168 441 ---------------ELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS--LERLQENLEG 503
|
.
gi 6324079 1044 E 1044
Cdd:TIGR02168 504 F 504
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
618-1283 |
1.90e-08 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 59.14 E-value: 1.90e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 618 DLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEETELSYKTALENLKMHQTTLefNRKALEIAERDSccylcsrKFENE 697
Cdd:PRK01156 198 ELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMK--NRYESEIKTAES-------DLSME 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 698 SFKSKLLQELKTKtdanFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKID--------NSQKC--LEKAKEETKTS 767
Cdd:PRK01156 269 LEKNNYYKELEER----HMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDaeinkyhaIIKKLsvLQKDYNDYIKK 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 768 KSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTSEDGIQTV-DELRDQQRKMNDS 846
Cdd:PRK01156 345 KSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKElNEINVKLQDISSK 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 847 LRELRKTISDLQMEKDEkVRENSRMINlikekELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNK 926
Cdd:PRK01156 425 VSSLNQRIRALRENLDE-LSRNMEMLN-----GQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEK 498
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 927 KDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKgFDELQTTIKELEL----------NKAQMLE 996
Cdd:PRK01156 499 IVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELKDKHDK-YEEIKNRYKSLKLedldskrtswLNALAVI 577
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 997 LKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIE--LKSQLQHIESEISRLDVQNAEAERDKYQEEslRLRTRFEKL 1074
Cdd:PRK01156 578 SLIDIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKsyIDKSIREIENEANNLNNKYNEIQENKILIE--KLRGKIDNY 655
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1075 SSENAGKLGEMKQlQNQIDSLTHQLRTDYKDIEKNYHKEWVELQTRSFVTDDIDVYSKALDSAIMKYHGL--KMQDINRI 1152
Cdd:PRK01156 656 KKQIAEIDSIIPD-LKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETleSMKKIKKA 734
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1153 IDEL--WKRTYSGTDIDTIKIRSDEVSSTVKGKSYNYRVVMYKQDVELD-------MRG-------RCSAGQKVLASIII 1216
Cdd:PRK01156 735 IGDLkrLREAFDKSGVPAMIRKSASQAMTSLTRKYLFEFNLDFDDIDVDqdfnitvSRGgmvegidSLSGGEKTAVAFAL 814
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 1217 RLALSETFGANCGVIALDEPTTNLDEENIESLAkslhNIINMRRHQKNF--QLIVITHDEKFLGHMNAA 1283
Cdd:PRK01156 815 RVAVAQFLNNDKSLLIMDEPTAFLDEDRRTNLK----DIIEYSLKDSSDipQVIMISHHRELLSVADVA 879
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
791-1112 |
2.67e-08 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 58.70 E-value: 2.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 791 EIRPLIEKFTYLEKELKDLENSSKTISEELSiyntsEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSR 870
Cdd:pfam12128 238 KIRPEFTKLQQEFNTLESAELRLSHLHFGYK-----SDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGELSA 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 871 MINLIKEKELTVSEIESSLTQKQNID-DSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDiqvrnkQ 949
Cdd:pfam12128 313 ADAAVAKDRSELEALEDQHGAFLDADiETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIK------E 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 950 KTVADINRLIDRFQTIYNEVVDFEAKGFDELQttikelelnkAQMLELKEQLDLKSNEVNEEKRKLAdSNNEEKNLKQNL 1029
Cdd:pfam12128 387 QNNRDIAGIKDKLAKIREARDRQLAVAEDDLQ----------ALESELREQLEAGKLEFNEEEYRLK-SRLGELKLRLNQ 455
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1030 ELI--ELKSQLQHIESEISRLD--VQNAEAERDKYQEESLRLRTRFEKLS----------SENAGKLGE-MKQLQNQIDS 1094
Cdd:pfam12128 456 ATAtpELLLQLENFDERIERAReeQEAANAEVERLQSELRQARKRRDQASealrqasrrlEERQSALDElELQLFPQAGT 535
|
330
....*....|....*...
gi 6324079 1095 LTHQLRTDYKDIEKNYHK 1112
Cdd:pfam12128 536 LLHFLRKEAPDWEQSIGK 553
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
197-1107 |
3.97e-08 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 58.14 E-value: 3.97e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 197 DMSVDIKLLKQSVEHLKLDKDRSKAMKLNIHQLQTKIDQYNEEVSEIESQLNEITEKSDKLFKSNQDFQKI-LSKVENLK 275
Cdd:TIGR01612 531 DQNIKAKLYKEIEAGLKESYELAKNWKKLIHEIKKELEEENEDSIHLEKEIKDLFDKYLEIDDEIIYINKLkLELKEKIK 610
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 276 NtklsISDQVKRLSNSIDIldlskpdlqnllanfsKVLMDKNNQLRDletdisslkdrqsslqslsnslirrqgelEAGK 355
Cdd:TIGR01612 611 N----ISDKNEYIKKAIDL----------------KKIIENNNAYID-----------------------------ELAK 641
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 356 ETYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSqfkAFISQDLTDTIDQFAK--DIQLKETNLSDLIKSI--- 430
Cdd:TIGR01612 642 ISPYQVPEHLKNKDKIYSTIKSELSKIYEDDIDALYNELS---SIVKENAIDNTEDKAKldDLKSKIDKEYDKIQNMeta 718
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 431 TVDSQ--NLEYNKKDRSKLIHD---------SEELAEKLKSFKSLSTQdsLNHELENLKTYKEKLQSWESEniipklNQK 499
Cdd:TIGR01612 719 TVELHlsNIENKKNELLDIIVEikkhihgeiNKDLNKILEDFKNKEKE--LSNKINDYAKEKDELNKYKSK------ISE 790
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 500 IEEKNNEMIILENqiekfqdriMKTNQQADLYAKLGLIKKSINTKLDELQKITEKLQNdsrirqvfpLTQEFqradleMD 579
Cdd:TIGR01612 791 IKNHYNDQINIDN---------IKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKF---------MKDDF------LN 846
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 580 FQKLFINMQKNIAINNKKMHELDRRYTNALYN------LNTIEKDLQDNQ---KSKEKVIQLLSENLPEDCTIDEYNDVL 650
Cdd:TIGR01612 847 KVDKFINFENNCKEKIDSEHEQFAELTNKIKAeisddkLNDYEKKFNDSKsliNEINKSIEEEYQNINTLKKVDEYIKIC 926
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 651 EETelsyKTALENLKMHQTTLE--FNRKALEIAERDSCCYLCSRKFENEsfksklLQELKTKTDANF-EKTLKDTVQNEK 727
Cdd:TIGR01612 927 ENT----KESIEKFHNKQNILKeiLNKNIDTIKESNLIEKSYKDKFDNT------LIDKINELDKAFkDASLNDYEAKNN 996
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 728 EYLHSLRLLEKHIITL--NSINEKIDNSQKCLEKAKEETKTSKSKLDELE-VDSTKLKDEKELAESEIRPLIEKFT--YL 802
Cdd:TIGR01612 997 ELIKYFNDLKANLGKNkeNMLYHQFDEKEKATNDIEQKIEDANKNIPNIEiAIHTSIYNIIDEIEKEIGKNIELLNkeIL 1076
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 803 EKELKDLENSSKtISEELSIYNTSEDGIQTVDELRDQQRKMNDSLRELrktisdlqmekDEKVRENSRMINLIKEK-ELT 881
Cdd:TIGR01612 1077 EEAEINITNFNE-IKEKLKHYNFDDFGKEENIKYADEINKIKDDIKNL-----------DQKIDHHIKALEEIKKKsENY 1144
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 882 VSEIESSLTQKQNIDDSIRSKrENINDIDSRVKELEARIISLKNKKDEAQSVLDKV-KNERDiqvRNKQKTVADINRLID 960
Cdd:TIGR01612 1145 IDEIKAQINDLEDVADKAISN-DDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIaEIEKD---KTSLEEVKGINLSYG 1220
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 961 R-FQTIYNEVVDFEAKgfdELQTTIKELELNKAQMLELKEQLDLKSNE------VNEEKRKLADSNNEEK-------NLK 1026
Cdd:TIGR01612 1221 KnLGKLFLEKIDEEKK---KSEHMIKAMEAYIEDLDEIKEKSPEIENEmgiemdIKAEMETFNISHDDDKdhhiiskKHD 1297
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1027 QNLELIELKSqLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSE-----NAGKLGEMKQLQNQIDSLTHQLRT 1101
Cdd:TIGR01612 1298 ENISDIREKS-LKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLYLNEianiyNILKLNKIKKIIDEVKEYTKEIEE 1376
|
....*.
gi 6324079 1102 DYKDIE 1107
Cdd:TIGR01612 1377 NNKNIK 1382
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
408-1030 |
4.00e-08 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 57.72 E-value: 4.00e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 408 TIDQFAKDIQLKETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKSFKSlstQDSLNHELENLKTYKEKLQSW 487
Cdd:TIGR04523 79 ILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKK---ENKKNIDKFLTEIKKKEKELE 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 488 ESENIIPKLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQADLYAKLGLIKKSINTKLDELQKITEKL--QNDSRIRQVF 565
Cdd:TIGR04523 156 KLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELkkQNNQLKDNIE 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 566 PLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHEldrRYTNALYNLNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDE 645
Cdd:TIGR04523 236 KKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLS---EKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKEL 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 646 YNDVLEETELSYKTALENLKMHQTTLEFNRKALEIAERdsccylcsrKFENESFKSKLLQELKTKtdanfEKTLKDTVQN 725
Cdd:TIGR04523 313 KSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKE---------LTNSESENSEKQRELEEK-----QNEIEKLKKE 378
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 726 EKEYLHSLRLLEKHIitlNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKE 805
Cdd:TIGR04523 379 NQSYKQEIKNLESQI---NDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELI 455
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 806 LKDLENSSKTISEELSIYNtsedgiqtvDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEI 885
Cdd:TIGR04523 456 IKNLDNTRESLETQLKVLS---------RSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKI 526
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 886 ESSLTQKQNIDDSIRSKRENINDIDSRVKELEariisLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTI 965
Cdd:TIGR04523 527 EKLESEKKEKESKISDLEDELNKDDFELKKEN-----LEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDL 601
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 966 YNEVVDFEAKgFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLE 1030
Cdd:TIGR04523 602 IKEIEEKEKK-ISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWPEII 665
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
166-916 |
5.74e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 57.37 E-value: 5.74e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 166 SEPSNLKKKFDEIfqamkfTKALDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKAmklNIHQLQTKIDQYNEEVS---- 241
Cdd:TIGR02168 274 LEVSELEEEIEEL------QKELYALANEISRLEQQKQILRERLANLERQLEELEA---QLEELESKLDELAEELAelee 344
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 242 -------EIESQLNEITEKSDKLFKSNQDFQKILSKVENLKNTKLSISDQVKRLSNSIDILDLSKPDLQNLLANFSKVL- 313
Cdd:TIGR02168 345 kleelkeELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIe 424
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 314 -MDKNNQLRDLETDISSLKDRQSSLQSLSNSLIRRQGELEAGKETYEKNRNHLSSLKEAFQH---KFQGLSNIENS---- 385
Cdd:TIGR02168 425 eLLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQlqaRLDSLERLQENlegf 504
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 386 --DMAQVNHEMSQFKAFISQdLTDTIDQFAKDIQLKETNLSDLIKSITVDSQN--------LEYNKKDRSKLIHDSEELA 455
Cdd:TIGR02168 505 seGVKALLKNQSGLSGILGV-LSELISVDEGYEAAIEAALGGRLQAVVVENLNaakkaiafLKQNELGRVTFLPLDSIKG 583
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 456 EKLKSFK--SLSTQDSLNHELENLKTYKEKLQSW-------------------------ESENIIP-------------- 494
Cdd:TIGR02168 584 TEIQGNDreILKNIEGFLGVAKDLVKFDPKLRKAlsyllggvlvvddldnalelakklrPGYRIVTldgdlvrpggvitg 663
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 495 ---KLNQKIEEKNNEMIILENQIEKFQDRImktnqqadlyaklglikKSINTKLDELQKITEKLQNDSRIRQVFPLTQEF 571
Cdd:TIGR02168 664 gsaKTNSSILERRREIEELEEKIEELEEKI-----------------AELEKALAELRKELEELEEELEQLRKELEELSR 726
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 572 QRADLEMDFQKLfinmQKNIAINNKKMHELDRRYTNALYNLNTIEKDLQDNQ---KSKEKVIQLLSEnlpedcTIDEYND 648
Cdd:TIGR02168 727 QISALRKDLARL----EAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEeelAEAEAEIEELEA------QIEQLKE 796
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 649 VLEETElsykTALENLKMHQTTLefNRKALEIAERdsccylcsrkFENESFKSKLLQELKTKTDANFEKTLKDTVQNEKE 728
Cdd:TIGR02168 797 ELKALR----EALDELRAELTLL--NEEAANLRER----------LESLERRIAATERRLEDLEEQIEELSEDIESLAAE 860
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 729 YLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAEseirpliEKFTYLEKELKD 808
Cdd:TIGR02168 861 IEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELR-------EKLAQLELRLEG 933
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 809 LENSSKTISEELsiyntSEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESs 888
Cdd:TIGR02168 934 LEVRIDNLQERL-----SEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEEYEELKERYDFLTA- 1007
|
810 820
....*....|....*....|....*...
gi 6324079 889 ltQKQNIDDSIRSKRENINDIDSRVKEL 916
Cdd:TIGR02168 1008 --QKEDLTEAKETLEEAIEEIDREARER 1033
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
742-931 |
6.69e-08 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.38 E-value: 6.69e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 742 TLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDL-------ENSSK 814
Cdd:COG3883 24 ELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERaralyrsGGSVS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 815 TISEELSIYNTSE--DGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVREnsrMINLIKEKELTVSEIESSLTQK 892
Cdd:COG3883 104 YLDVLLGSESFSDflDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAE---LEALKAELEAAKAELEAQQAEQ 180
|
170 180 190
....*....|....*....|....*....|....*....
gi 6324079 893 QNIddsirskrenINDIDSRVKELEARIISLKNKKDEAQ 931
Cdd:COG3883 181 EAL----------LAQLSAEEAAAEAQLAELEAELAAAE 209
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
189-912 |
1.19e-07 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 56.67 E-value: 1.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 189 DNLKSIKKDMSVDIKLLKQsvEHLKLDKDRSKAMKLNIHQLQTKIDQYNEEVSEIESQLNEITEKSDKlfkSNQDFQKIL 268
Cdd:pfam15921 245 DQLEALKSESQNKIELLLQ--QHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARN---QNSMYMRQL 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 269 SKVEN--------LKNTKLSISDQVKRLSNSIDILDLSKPDLQNLLANFSKVLMDKNNQLRDLETDissLKDRQSSLQSL 340
Cdd:pfam15921 320 SDLEStvsqlrseLREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLAD---LHKREKELSLE 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 341 SNSLIRRQGELEAGKETYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSQFKAfiSQDLTDTIDQFAKDIQLKE 420
Cdd:pfam15921 397 KEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQG--KNESLEKVSSLTAQLESTK 474
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 421 TNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKSFKSLSTQ-----DSLNHELENLKTYKEKLQSWESEniIPK 495
Cdd:pfam15921 475 EMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKlrsrvDLKLQELQHLKNEGDHLRNVQTE--CEA 552
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 496 LNQKIEEKNNEMIILENQIEKFQDRIMKTNQQAD-LYAKLGLIKKSINTKLDELQKI-TEKLQNDSRIRQVfpltqEFQR 573
Cdd:pfam15921 553 LKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGaMQVEKAQLEKEINDRRLELQEFkILKDKKDAKIREL-----EARV 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 574 ADLEMDFQKLF------INMQKNIAINNKKMHELDRRYTNALYNLNTIEKDLQDNQKSKEKVIQLLSENLpeDCTIDEYN 647
Cdd:pfam15921 628 SDLELEKVKLVnagserLRAVKDIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKL--KMQLKSAQ 705
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 648 DVLEETELSYKTaLENLKMHQTTLEFNRKALEIAERDSCCYLCSR-KFENESF-------------KSKLLQELKTKTda 713
Cdd:pfam15921 706 SELEQTRNTLKS-MEGSDGHAMKVAMGMQKQITAKRGQIDALQSKiQFLEEAMtnankekhflkeeKNKLSQELSTVA-- 782
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 714 nfekTLKDTVQNEKEYLHSLRllekhiitlNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLK-----DEKEL- 787
Cdd:pfam15921 783 ----TEKNKMAGELEVLRSQE---------RRLKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKlqhtlDVKELq 849
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 788 -----AESEIRPLIEKFTYLEKELKDLENSSKTISeELSIYNTSEDGIQtvdelRDQQRKMNDSLRELRKTISDLQMEKD 862
Cdd:pfam15921 850 gpgytSNSSMKPRLLQPASFTRTHSNVPSSQSTAS-FLSHHSRKTNALK-----EDPTRDLKQLLQELRSVINEEPTVQL 923
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|...
gi 6324079 863 EKVRENSRMINLIK-EKELTVSEIESSLTQK--QNIDDSIRSKRENINDIDSR 912
Cdd:pfam15921 924 SKAEDKGRAPSLGAlDDRVRDCIIESSLRSDicHSSSNSLQTEGSKSSETCSR 976
|
|
| ABCF_EF-3 |
cd03221 |
ATP-binding cassette domain of elongation factor 3, subfamily F; Elongation factor 3 (EF-3) is ... |
1205-1277 |
1.31e-07 |
|
ATP-binding cassette domain of elongation factor 3, subfamily F; Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions.
Pssm-ID: 213188 [Multi-domain] Cd Length: 144 Bit Score: 52.07 E-value: 1.31e-07
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 1205 SAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENIESLAKSLhniinmrrhqKNFQ--LIVITHDEKFL 1277
Cdd:cd03221 72 SGGEKM------RLALAKLLLENPNLLLLDEPTNHLDLESIEALEEAL----------KEYPgtVILVSHDRYFL 130
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
770-1068 |
2.03e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 55.71 E-value: 2.03e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 770 KLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNtsedgiqtvDELRDQQRKMNDSLRE 849
Cdd:COG1196 233 KLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQ---------AEEYELLAELARLEQD 303
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 850 LRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDE 929
Cdd:COG1196 304 IARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 930 AQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAkgfDELQTTIKELELNKAQMLELKEQLDLKSNEVN 1009
Cdd:COG1196 384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEE---ALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 1010 EEKRKLADsnnEEKNLKQNLELIELKSQLQHIESEISRLdvQNAEAERDKYQEESLRLR 1068
Cdd:COG1196 461 LLELLAEL---LEEAALLEAALAELLEELAEAAARLLLL--LEAEADYEGFLEGVKAAL 514
|
|
| ABC_ATPase |
cd00267 |
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large ... |
1205-1294 |
3.33e-07 |
|
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.
Pssm-ID: 213179 [Multi-domain] Cd Length: 157 Bit Score: 51.48 E-value: 3.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1205 SAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENIESLAKSLHNIinmrrHQKNFQLIVITHDEKFlghmnAAA 1284
Cdd:cd00267 82 SGGQRQ------RVALARALLLNPDLLLLDEPTSGLDPASRERLLELLREL-----AEEGRTVIIVTHDPEL-----AEL 145
|
90
....*....|
gi 6324079 1285 FTDHFFKVKR 1294
Cdd:cd00267 146 AADRVIVLKD 155
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
12-849 |
3.38e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.07 E-value: 3.38e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 12 IRSFDSNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGgVFIHDPKITGEKDIRAQVKLAFtsaNGLN 91
Cdd:PRK03918 8 IKNFRSHKSSVVEFDDGINLIIGQNGSGKSSILEAILVGLYWGHGSKPKG-LKKDDFTRIGGSGTEIELKFEK---NGRK 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 92 MIVTRNIqllmKKTTTTFKTLEGQLVAINNSGDRSTLSTRSleldaqvplylgVPKAILEYVIFCHQEDSLWPLSEPSNL 171
Cdd:PRK03918 84 YRIVRSF----NRGESYLKYLDGSEVLEEGDSSVREWVERL------------IPYHVFLNAIYIRQGEIDAILESDESR 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 172 KKKFDEIFQAMKFTKALDNLKSIkkdmsvdIKLLKQSVEHLKLDKDRSKAMKLNIHQLQTKIDQYNEEVSEIESQLNEIT 251
Cdd:PRK03918 148 EKVVRQILGLDDYENAYKNLGEV-------IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELR 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 252 EKSDKLFKSNQDFQKILSKVENLKNTKLSISDQVKRLsnsidildlsKPDLQNLlanfskvlmdkNNQLRDLETDISSLK 331
Cdd:PRK03918 221 EELEKLEKEVKELEELKEEIEELEKELESLEGSKRKL----------EEKIREL-----------EERIEELKKEIEELE 279
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 332 DRqsslqslsnslIRRQGELEAGKETYEKNRNHLSSLKEAFQHKFQGLSNIENsdmaqvnhemsqfkafisqdLTDTIDQ 411
Cdd:PRK03918 280 EK-----------VKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEE--------------------EINGIEE 328
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 412 FAKDIQLKETNLSDLIKSITVDSQNLEYNKKDRSKL--IHDSEELAEKLKSFKSLSTQDSLNHELENLKTYKEKLQSWES 489
Cdd:PRK03918 329 RIKELEEKEERLEELKKKLKELEKRLEELEERHELYeeAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 490 ENI--IPKLNQKIEEKNNEMIILENQIEK--FQDRIMKTNQQADLYAKLGLIKKSINTKLDELQKITEKLQNDSRirqvf 565
Cdd:PRK03918 409 KITarIGELKKEIKELKKAIEELKKAKGKcpVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELR----- 483
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 566 pltqefqraDLEMdfqklFINMQKNIaINNKKMHELDRRYTNAL--YNLNTIEKDLQDNQKSKEKVIQLLSEnlpedctI 643
Cdd:PRK03918 484 ---------ELEK-----VLKKESEL-IKLKELAEQLKELEEKLkkYNLEELEKKAEEYEKLKEKLIKLKGE-------I 541
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 644 DEYNDVLEETELsYKTALENLKMHQTTLEFNRKALEiaerdsccylcsRKFENESFKSKLLQELKTKTDANFEKTLKDTV 723
Cdd:PRK03918 542 KSLKKELEKLEE-LKKKLAELEKKLDELEEELAELL------------KELEELGFESVEELEERLKELEPFYNEYLELK 608
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 724 QNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDELE---VDSTKLKDEKELA--ESEIRPLIEK 798
Cdd:PRK03918 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEyeeLREEYLELSRELAglRAELEELEKR 688
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|.
gi 6324079 799 FTYLEKELKDLENSSKTISEELSIYNTSEDGIQTVDELRDQQRKMNDSLRE 849
Cdd:PRK03918 689 REEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLKE 739
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
899-1109 |
5.63e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.29 E-value: 5.63e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 899 IRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQvrnKQKTVADINRL---IDRFQTIYNEVVDFEAK 975
Cdd:TIGR02168 181 LERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELE---LALLVLRLEELreeLEELQEELKEAEEELEE 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 976 GFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLEliELKSQLQHIESEI----SRLDVQ 1051
Cdd:TIGR02168 258 LTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA--NLERQLEELEAQLeeleSKLDEL 335
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 1052 NAEAERDKYQEESLR-LRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEKN 1109
Cdd:TIGR02168 336 AEELAELEEKLEELKeELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQ 394
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
743-946 |
7.04e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.23 E-value: 7.04e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 743 LNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELS- 821
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAe 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 822 -IYNTSEDGIQTVDELRDQQRKMNDSLR----------ELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIEsslT 890
Cdd:COG4942 109 lLRALYRLGRQPPLALLLSPEDFLDAVRrlqylkylapARREQAEELRADLAELAALRAELEAERAELEALLAELE---E 185
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 6324079 891 QKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVR 946
Cdd:COG4942 186 ERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
758-1069 |
7.26e-07 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 53.89 E-value: 7.26e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 758 EKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDL-------ENSSKTISEELSIYNTSEDGI 830
Cdd:PRK02224 247 EERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLlaeagldDADAEAVEARREELEDRDEEL 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 831 Q-TVDELRDQQRKMNDSLRELRKTISDLQMEKDEKvRENSRmiNLIKEKELTVSEIESSLTQKQNIDDSIRSKR------ 903
Cdd:PRK02224 327 RdRLEECRVAAQAHNEEAESLREDADDLEERAEEL-REEAA--ELESELEEAREAVEDRREEIEELEEEIEELRerfgda 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 904 ---------------ENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKN---ERDIQVRNKQKTVAD----INRLIDR 961
Cdd:PRK02224 404 pvdlgnaedfleelrEERDELREREAELEATLRTARERVEEAEALLEAGKCpecGQPVEGSPHVETIEEdrerVEELEAE 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 962 FQTIYNEVVDFEAK--GFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQ 1039
Cdd:PRK02224 484 LEDLEEEVEEVEERleRAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563
|
330 340 350
....*....|....*....|....*....|
gi 6324079 1040 HIESEISRLDVQNAEAERDKYQEESLRLRT 1069
Cdd:PRK02224 564 EEEAEEAREEVAELNSKLAELKERIESLER 593
|
|
| ABC_cobalt_CbiO_domain2 |
cd03226 |
Second domain of the ATP-binding cassette component of cobalt transport system; Domain II of ... |
1204-1278 |
9.47e-07 |
|
Second domain of the ATP-binding cassette component of cobalt transport system; Domain II of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. The CbiMNQO family ABC transport system is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.
Pssm-ID: 213193 [Multi-domain] Cd Length: 205 Bit Score: 51.10 E-value: 9.47e-07
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 1204 CSAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENIESLAKSLHNIINMRRhqknfQLIVITHDEKFLG 1278
Cdd:cd03226 127 LSGGQKQ------RLAIAAALLSGKDLLIFDEPTSGLDYKNMERVGELIRELAAQGK-----AVIVITHDYEFLA 190
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
600-943 |
1.02e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 53.53 E-value: 1.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 600 ELDRRYTNALYNLNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEET-ELSYKTALENLKMHQTTLEFNRKAL 678
Cdd:TIGR02169 167 EFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKrEYEGYELLKEKEALERQKEAIERQL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 679 EIAERDsccylcsrKFENESFKSKLLQELKTKTDANFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLE 758
Cdd:TIGR02169 247 ASLEEE--------LEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 759 KAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIyntsedgiqTVDELRD 838
Cdd:TIGR02169 319 DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAE---------TRDELKD 389
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 839 QQRKM---NDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQnidDSIRSKRENINDIDSRVKE 915
Cdd:TIGR02169 390 YREKLeklKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKA---LEIKKQEWKLEQLAADLSK 466
|
330 340
....*....|....*....|....*...
gi 6324079 916 LEARIISLKNKKDEAQSVLDKVKNERDI 943
Cdd:TIGR02169 467 YEQELYDLKEEYDRVEKELSKLQRELAE 494
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
454-1112 |
1.23e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.14 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 454 LAEKLKSFKSLSTQDSLNHELENLKTYKEKLQSWESENIIPKLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQadlYAK 533
Cdd:TIGR02168 195 LNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEK---LEE 271
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 534 LGLIKKSINTKLDELQKITEKLQNDSRIrqvfpLTQEFQRADLEMDfqklfiNMQKNIAINNKKMHELDRRYTNALYNLN 613
Cdd:TIGR02168 272 LRLEVSELEEEIEELQKELYALANEISR-----LEQQKQILRERLA------NLERQLEELEAQLEELESKLDELAEELA 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 614 TIEKDLQdnqkSKEKVIQLLSENLPEDC--------TIDEYNDVLEETELSYKTALENLKMHQTTLEFNRKALEIAERds 685
Cdd:TIGR02168 341 ELEEKLE----ELKEELESLEAELEELEaeleelesRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLED-- 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 686 ccylcsrkfENESFKSKLLQELKTKTDANFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETK 765
Cdd:TIGR02168 415 ---------RRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA 485
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 766 TSKSKLDELEVDSTKLKDE----KELAESEIR------PLIEKF-----------TYLEKELKDL----ENSSKTISEEL 820
Cdd:TIGR02168 486 QLQARLDSLERLQENLEGFsegvKALLKNQSGlsgilgVLSELIsvdegyeaaieAALGGRLQAVvvenLNAAKKAIAFL 565
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 821 SIYNT-----------SEDGIQTVD-ELRDQQRKMNDSLRELRKTISDLQ-------------------MEKDEKVRENS 869
Cdd:TIGR02168 566 KQNELgrvtflpldsiKGTEIQGNDrEILKNIEGFLGVAKDLVKFDPKLRkalsyllggvlvvddldnaLELAKKLRPGY 645
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 870 RMINL----------------------------IKEKELTVSEIESSLTQKQNIDDSIRSKRENINDidsRVKELEARII 921
Cdd:TIGR02168 646 RIVTLdgdlvrpggvitggsaktnssilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEE---ELEQLRKELE 722
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 922 SLKNKKDEAQSVLDKVKNErdiqVRNKQKTVADINRLIDRFQTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQL 1001
Cdd:TIGR02168 723 ELSRQISALRKDLARLEAE----VEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEEL 798
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1002 DL-------KSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFEKL 1074
Cdd:TIGR02168 799 KAlrealdeLRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL 878
|
730 740 750
....*....|....*....|....*....|....*...
gi 6324079 1075 SSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEKNYHK 1112
Cdd:TIGR02168 879 LNERASLEEALALLRSELEELSEELRELESKRSELRRE 916
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
138-1004 |
2.18e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 52.43 E-value: 2.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 138 QVPLY------LGVPKAILEYVIFCHQEDSLWPLS-EPSNLKKKFDE---IFQAMKF--TKALDNLKS------IKKDMS 199
Cdd:pfam15921 50 QIPIFpkyeveLDSPRKIIAYPGKEHIERVLEEYShQVKDLQRRLNEsneLHEKQKFylRQSVIDLQTklqemqMERDAM 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 200 VDIKLLK-QSVEHLKldkdrsKAMKLNIHQLQT----KIDQYNEEVSEIEsQLNEITEKSDKLFksnQDFQKILSKVENL 274
Cdd:pfam15921 130 ADIRRREsQSQEDLR------NQLQNTVHELEAakclKEDMLEDSNTQIE-QLRKMMLSHEGVL---QEIRSILVDFEEA 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 275 KNTKLSISDqvkrlsnsidilDLSKPDLQNLLANFSKVLmdknnqlRDLETDISSLKDRQ-SSLQSLSNSLIRRQGELEA 353
Cdd:pfam15921 200 SGKKIYEHD------------SMSTMHFRSLGSAISKIL-------RELDTEISYLKGRIfPVEDQLEALKSESQNKIEL 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 354 gkeTYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSQFKAFISQ-------------DLTDTIDQFAKDI---- 416
Cdd:pfam15921 261 ---LLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQarnqnsmymrqlsDLESTVSQLRSELreak 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 417 QLKETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKsfKSLSTQDSLNHELENLKTYKEKLQSWESEN--IIP 494
Cdd:pfam15921 338 RMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQ--KLLADLHKREKELSLEKEQNKRLWDRDTGNsiTID 415
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 495 KLNQKIEEKNNEMIILENQIekfqdRIMKTNQQADLYAKLGLIKKSiNTKLDELQKITEKLQNdsrirqvfplTQEFQRA 574
Cdd:pfam15921 416 HLRRELDDRNMEVQRLEALL-----KAMKSECQGQMERQMAAIQGK-NESLEKVSSLTAQLES----------TKEMLRK 479
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 575 DLEmdfqklfinmqkniAINNKKMheldrrytnALYNLNTIEKDLQDNQKSKEKVIQLLSENLPE-DCTIDeyndvLEET 653
Cdd:pfam15921 480 VVE--------------ELTAKKM---------TLESSERTVSDLTASLQEKERAIEATNAEITKlRSRVD-----LKLQ 531
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 654 ELSY-KTALENLKMHQTTLEFNRkaLEIAERDSCCYLCSRKFENesFKSKLLQELKTKTDANFEKT-LKDTVQNEKEYLH 731
Cdd:pfam15921 532 ELQHlKNEGDHLRNVQTECEALK--LQMAEKDKVIEILRQQIEN--MTQLVGQHGRTAGAMQVEKAqLEKEINDRRLELQ 607
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 732 SLRLLEkhiitlnsinekidnsqkclEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLEN 811
Cdd:pfam15921 608 EFKILK--------------------DKKDAKIRELEARVSDLELEKVKLVNAGSERLRAVKDIKQERDQLLNEVKTSRN 667
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 812 SSKTISEELSIY-----NTSEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMinlikekeltvseie 886
Cdd:pfam15921 668 ELNSLSEDYEVLkrnfrNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGM--------------- 732
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 887 ssltQKQniddsIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIY 966
Cdd:pfam15921 733 ----QKQ-----ITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLK 803
|
890 900 910 920
....*....|....*....|....*....|....*....|....
gi 6324079 967 NEVVDFEAK------GFDELQTTIKELElNKAQMLELKEQLDLK 1004
Cdd:pfam15921 804 EKVANMEVAldkaslQFAECQDIIQRQE-QESVRLKLQHTLDVK 846
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
922-1101 |
2.48e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 51.94 E-value: 2.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 922 SLKNKKDEAQSVLDKVKNerdiQVRNKQKTVADINRLIDRFQTIYNeVVDFEakgfDELQTTIKELELNKAQMLELKEQL 1001
Cdd:COG3206 165 NLELRREEARKALEFLEE----QLPELRKELEEAEAALEEFRQKNG-LVDLS----EEAKLLLQQLSELESQLAEARAEL 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1002 DLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHIESEISRL---------DVQNAEAERDKYQEE-SLRLRTRF 1071
Cdd:COG3206 236 AEAEARLAALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELsarytpnhpDVIALRAQIAALRAQlQQEAQRIL 315
|
170 180 190
....*....|....*....|....*....|
gi 6324079 1072 EKLSSENAGKLGEMKQLQNQIDSLTHQLRT 1101
Cdd:COG3206 316 ASLEAELEALQAREASLQAQLAQLEARLAE 345
|
|
| ABC_Class2 |
cd03227 |
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ... |
6-87 |
2.70e-06 |
|
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.
Pssm-ID: 213194 [Multi-domain] Cd Length: 162 Bit Score: 48.90 E-value: 2.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 6 KLSIQGIRSFDsnDRETIEFGKP-LTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGGVFihdpkitGEKDIRAQVKLAF 84
Cdd:cd03227 1 KIVLGRFPSYF--VPNDVTFGEGsLTIITGPNGSGKSTILDAIGLALGGAQSATRRRSGV-------KAGCIVAAVSAEL 71
|
...
gi 6324079 85 TSA 87
Cdd:cd03227 72 IFT 74
|
|
| YbjD |
COG3593 |
Predicted ATP-dependent endonuclease of the OLD family, contains P-loop ATPase and TOPRIM ... |
4-86 |
2.90e-06 |
|
Predicted ATP-dependent endonuclease of the OLD family, contains P-loop ATPase and TOPRIM domains [Replication, recombination and repair];
Pssm-ID: 442812 [Multi-domain] Cd Length: 359 Bit Score: 51.16 E-value: 2.90e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 4 IYKLSIQGIRSFDsndRETIEFGKPLTLIVGMNGSGKTTIIECLKYAttgdLPPNSKGGVFIHD-PKITGEKDIRAQVKL 82
Cdd:COG3593 3 LEKIKIKNFRSIK---DLSIELSDDLTVLVGENNSGKSSILEALRLL----LGPSSSRKFDEEDfYLGDDPDLPEIEIEL 75
|
....
gi 6324079 83 AFTS 86
Cdd:COG3593 76 TFGS 79
|
|
| ABC_cobalt_CbiO_domain1 |
cd03225 |
First domain of the ATP-binding cassette component of cobalt transport system; Domain I of the ... |
1205-1277 |
3.29e-06 |
|
First domain of the ATP-binding cassette component of cobalt transport system; Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.
Pssm-ID: 213192 [Multi-domain] Cd Length: 211 Bit Score: 49.39 E-value: 3.29e-06
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 6324079 1205 SAGQKVLASIIIRLALsetfgaNCGVIALDEPTTNLDEENIESLAKSLHNIinmrrHQKNFQLIVITHDEKFL 1277
Cdd:cd03225 136 SGGQKQRVAIAGVLAM------DPDILLLDEPTAGLDPAGRRELLELLKKL-----KAEGKTIIIVTHDLDLL 197
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
232-964 |
3.81e-06 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 51.59 E-value: 3.81e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 232 KIDQYNEEVSEIESQLNEITEKSDKLFKSNQDFQKILSKVENLKNTKLSISD----QVK---RLSNSIDILDLSKPDLQN 304
Cdd:TIGR01612 884 KLNDYEKKFNDSKSLINEINKSIEEEYQNINTLKKVDEYIKICENTKESIEKfhnkQNIlkeILNKNIDTIKESNLIEKS 963
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 305 LLANFSKVLMDKNNQLRDLETDISslkdrQSSLQSLSNSLIRRQGELEAGKETYEKNrnhlsSLKEAFQHKFQGLSNIEN 384
Cdd:TIGR01612 964 YKDKFDNTLIDKINELDKAFKDAS-----LNDYEAKNNELIKYFNDLKANLGKNKEN-----MLYHQFDEKEKATNDIEQ 1033
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 385 SdMAQVNHEMSQFKAFISQDLTDTIDQFAKDI---------------------------QLKETNLSDLIKSITVDSQNl 437
Cdd:TIGR01612 1034 K-IEDANKNIPNIEIAIHTSIYNIIDEIEKEIgkniellnkeileeaeinitnfneikeKLKHYNFDDFGKEENIKYAD- 1111
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 438 EYNK---------KDRSKLIHDSEELAEKLKSF-KSLSTQDSLNHELENLKTYKEKLQSWES--ENIIPKLNQK--IEEK 503
Cdd:TIGR01612 1112 EINKikddiknldQKIDHHIKALEEIKKKSENYiDEIKAQINDLEDVADKAISNDDPEEIEKkiENIVTKIDKKknIYDE 1191
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 504 NNEMIILENQIEKFQDRIMKTNQQADLYAK-LGLI---------KKSINT---------KLDELQKITEKLQNDSRIRQv 564
Cdd:TIGR01612 1192 IKKLLNEIAEIEKDKTSLEEVKGINLSYGKnLGKLflekideekKKSEHMikameayieDLDEIKEKSPEIENEMGIEM- 1270
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 565 fPLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHELDRRYTNALY---NLNTIEKDLQ----DNQKSKEKVIQLLSE-- 635
Cdd:TIGR01612 1271 -DIKAEMETFNISHDDDKDHHIISKKHDENISDIREKSLKIIEDFSeesDINDIKKELQknllDAQKHNSDINLYLNEia 1349
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 636 NLPEDCTIDEYNDVLEETELSYKTALENLKMHQTTLEFNRKALEIAERDSCCYLCSRKFE---NESFKSKLLQELK-TKT 711
Cdd:TIGR01612 1350 NIYNILKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLEECKSKIEstlDDKDIDECIKKIKeLKN 1429
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 712 DANFEKTLKDT-VQNEKEYLHSLRLLEKHI-----------------------ITLNSINEKIDNSQKC---LEKAKEET 764
Cdd:TIGR01612 1430 HILSEESNIDTyFKNADENNENVLLLFKNIemadnksqhilkikkdnatndhdFNINELKEHIDKSKGCkdeADKNAKAI 1509
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 765 KTSKSKLDELEVDSTKL----------------KDEKELAESEIRPLIEKFTY----LEKELKDLENSSKTISEELSIYN 824
Cdd:TIGR01612 1510 EKNKELFEQYKKDVTELlnkysalaiknkfaktKKDSEIIIKEIKDAHKKFILeaekSEQKIKEIKKEKFRIEDDAAKND 1589
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 825 TSED---GIQT-VDELRDQQRKMNDslreLRKTISDLQMEKDEKVRENSRMInlIKEKELTVSEIESSLTQKQNIDDSIR 900
Cdd:TIGR01612 1590 KSNKaaiDIQLsLENFENKFLKISD----IKKKINDCLKETESIEKKISSFS--IDSQDTELKENGDNLNSLQEFLESLK 1663
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 6324079 901 SKRENINDIDSRVKELEARIISLKN-----KKDEAQSVLDKVkNERDIQVRNKQKTVAD-----INRLIDRFQT 964
Cdd:TIGR01612 1664 DQKKNIEDKKKELDELDSEIEKIEIdvdqhKKNYEIGIIEKI-KEIAIANKEEIESIKElieptIENLISSFNT 1736
|
|
| SbcC_Walker_B |
pfam13558 |
SbcC/RAD50-like, Walker B motif; This entry represents the Walker B domain of RAD50 from ... |
1184-1254 |
3.86e-06 |
|
SbcC/RAD50-like, Walker B motif; This entry represents the Walker B domain of RAD50 from eukaryotes and the prokaryotic homolog SbcCD complex subunit C. RAD50-ATPase forms a complex with Mre11-nuclease that detects and processes diverse and obstructed DNA ends. This domain is separated of the Walker A domain by a long coiled-coil domain and forms the nucleotide-binding domain (NBD) when the coiled coils fold back on themselves and bring together Walker A and B domains. Two RAD50-NBDs forms heterotetramers with a Mre11 nuclease dimer that assemble as catalytic head module that binds and cleaves DNA in an ATP-dependent reaction. Through secondary structural analysis, it has been suggested that there is a wide structural conservation in the Rad50/SMC protein family as seen in structural similarities between RAD50's hook and ABC-ATPase MukB's elbow region.
Pssm-ID: 463921 [Multi-domain] Cd Length: 90 Bit Score: 46.46 E-value: 3.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1184 SYNYRVVMY-KQDVEL-DMRGRCSAGQK-VLASIIIRLALSETFGAN------CGVIALDEPTTNLDEENIESLAKSLHN 1254
Cdd:pfam13558 11 SFEVEVRDEdGSEVETyRRSGGLSGGEKqLLAYLPLAAALAAQYGSAegrppaPRLVFLDEAFAKLDEENIRTALELLRA 90
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
170-383 |
3.86e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 51.61 E-value: 3.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 170 NLKKKFDEIFQAMKFTKA-LDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKamklnIHQLQTKIDQYNEEVSEIESQLN 248
Cdd:TIGR02169 741 ELEEDLSSLEQEIENVKSeLKELEARIEELEEDLHKLEEALNDLEARLSHSR-----IPEIQAELSKLEEEVSRIEARLR 815
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 249 EITEKsdkLFKSNQDFQKILSKVENLKNTKLSISDQVKRLSNSIDILDLSKPDLQNLLANfskvlmdKNNQLRDLETDIS 328
Cdd:TIGR02169 816 EIEQK---LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEE-------LEAALRDLESRLG 885
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 329 SLKDRQSSLQSLSNSLIRRQGELEAgkeTYEKNRNHLSSLKEAFQHKFQGLSNIE 383
Cdd:TIGR02169 886 DLKKERDELEAQLRELERKIEELEA---QIEKKRKRLSELKAKLEALEEELSEIE 937
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
760-1006 |
3.97e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.53 E-value: 3.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 760 AKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELsiyNTSEDGIQtvdELRDQ 839
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQEL---AALEAELA---ELEKE 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 840 QRKMNDSLRELRKTISDLqmekdekVRENSRMINLIKEKELTVSE-IESSLTQKQNIDDSIRSKRENINDIDSRVKELEA 918
Cdd:COG4942 92 IAELRAELEAQKEELAEL-------LRALYRLGRQPPLALLLSPEdFLDAVRRLQYLKYLAPARREQAEELRADLAELAA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 919 RIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKgFDELQTTIKELELNKAQMLELK 998
Cdd:COG4942 165 LRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE-AEELEALIARLEAEAAAAAERT 243
|
....*...
gi 6324079 999 EQLDLKSN 1006
Cdd:COG4942 244 PAAGFAAL 251
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
748-1109 |
7.12e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.54 E-value: 7.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 748 EKIDNSQKCLEKAKEetKTSKSKLDELEVDSTKLKDEKElAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTSE 827
Cdd:COG4717 49 ERLEKEADELFKPQG--RKPELNLKELKELEEELKEAEE-KEEEYAELQEELEELEEELEELEAELEELREELEKLEKLL 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 828 DGIQTVDELRDQQRKMN------DSLRELRKTISDLQMEKDEKVRENSRminliKEKELTVSEIESSLTQKQNIDDSIrs 901
Cdd:COG4717 126 QLLPLYQELEALEAELAelperlEELEERLEELRELEEELEELEAELAE-----LQEELEELLEQLSLATEEELQDLA-- 198
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 902 krENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQK-----------TVADINRLIDRFQTIYNEVV 970
Cdd:COG4717 199 --EELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLkearlllliaaALLALLGLGGSLLSLILTIA 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 971 DF---------------------EAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNL 1029
Cdd:COG4717 277 GVlflvlgllallflllarekasLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREA 356
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1030 ELIELKSQLQHIESEISRL----DVQNAEA--ERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDslTHQLRTDY 1103
Cdd:COG4717 357 EELEEELQLEELEQEIAALlaeaGVEDEEElrAALEQAEEYQELKEELEELEEQLEELLGELEELLEALD--EEELEEEL 434
|
....*.
gi 6324079 1104 KDIEKN 1109
Cdd:COG4717 435 EELEEE 440
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
695-1274 |
8.02e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.15 E-value: 8.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 695 ENESFKSKLLQE----LKTKTDANFEKTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSK 770
Cdd:COG4717 31 PNEAGKSTLLAFiramLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAE 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 771 LDELEVDSTKLKDEKELaeseiRPLIEKFTYLEKELKDLEnssktiseelSIYNTSEDGIQTVDELRDQQRKMNDSLREL 850
Cdd:COG4717 111 LEELREELEKLEKLLQL-----LPLYQELEALEAELAELP----------ERLEELEERLEELRELEEELEELEAELAEL 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 851 RKTISDLQMEKDEKVRENSRMinlikekelTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKK--D 928
Cdd:COG4717 176 QEELEELLEQLSLATEEELQD---------LAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEErlK 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 929 EAQSVL-------------------------------------------DKVKNERDIQVRNKQKTVADINRL-IDRFQT 964
Cdd:COG4717 247 EARLLLliaaallallglggsllsliltiagvlflvlgllallflllarEKASLGKEAEELQALPALEELEEEeLEELLA 326
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 965 IYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEvnEEKRKL---ADSNNEE---KNLKQNLELIELKSQL 1038
Cdd:COG4717 327 ALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELE--QEIAALlaeAGVEDEEelrAALEQAEEYQELKEEL 404
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1039 QHIESEISRL-----------DVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNqiDSLTHQLRTDYKDIE 1107
Cdd:COG4717 405 EELEEQLEELlgeleellealDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEE--DGELAELLQELEELK 482
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1108 ---KNYHKEWVELQtrsfvtddidVYSKALDSAIMKYHGLK----MQDINRIIDELWKRTYSGTDID-TIKIRSDEVSST 1179
Cdd:COG4717 483 aelRELAEEWAALK----------LALELLEEAREEYREERlppvLERASEYFSRLTDGRYRLIRIDeDLSLKVDTEDGR 552
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1180 VKGKSynyrvvmykqdvELdmrgrcSAGQKVLASIIIRLALSETF-GANCGVIaLDEPTTNLDEENIESLAKSLHNIinm 1258
Cdd:COG4717 553 TRPVE------------EL------SRGTREQLYLALRLALAELLaGEPLPLI-LDDAFVNFDDERLRAALELLAEL--- 610
|
650
....*....|....*.
gi 6324079 1259 rrhQKNFQLIVITHDE 1274
Cdd:COG4717 611 ---AKGRQVIYFTCHE 623
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
475-972 |
1.31e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 49.65 E-value: 1.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 475 ENLKTYKEKLQSWES-ENIIPKLNQKIEEKNNEMIILENQIEKFQDRIMK-TNQQADLYAKLGLIKKSINT---KLDELQ 549
Cdd:PRK02224 241 EVLEEHEERREELETlEAEIEDLRETIAETEREREELAEEVRDLRERLEElEEERDDLLAEAGLDDADAEAveaRREELE 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 550 KITEKLQndSRIRQVFPLTQEFQradlemdfqklfinmqKNIAINNKKMHELDRRYTNALYNLNTIEKDLQDNQ----KS 625
Cdd:PRK02224 321 DRDEELR--DRLEECRVAAQAHN----------------EEAESLREDADDLEERAEELREEAAELESELEEAReaveDR 382
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 626 KEKVIQLLSE---------NLPEDctIDEYNDVLEETELSYKTALENLKMHQTTLEFNRKALEIAER---DSCCYLCSRK 693
Cdd:PRK02224 383 REEIEELEEEieelrerfgDAPVD--LGNAEDFLEELREERDELREREAELEATLRTARERVEEAEAlleAGKCPECGQP 460
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 694 FENESFKSKL--LQELKTKTDANFE--KTLKDTVQNEKEYLHSLRLLEKHIitlNSINEKIDNSQKCLEKAKEETKTSKS 769
Cdd:PRK02224 461 VEGSPHVETIeeDRERVEELEAELEdlEEEVEEVEERLERAEDLVEAEDRI---ERLEERREDLEELIAERRETIEEKRE 537
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 770 KLDELEVDSTKLKDEKEL-------AESEIRPLIEKFTYLEKELKDLEN---SSKTISEELSIYNTSEDGIQTVDELRDQ 839
Cdd:PRK02224 538 RAEELRERAAELEAEAEEkreaaaeAEEEAEEAREEVAELNSKLAELKErieSLERIRTLLAAIADAEDEIERLREKREA 617
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 840 QRKMNDS----LRELRKTISDLQMEKDEKVRENSRminliKEKELTVSEIEssltqkqNIDDSIRSKRENINDIDSRVKE 915
Cdd:PRK02224 618 LAELNDErrerLAEKRERKRELEAEFDEARIEEAR-----EDKERAEEYLE-------QVEEKLDELREERDDLQAEIGA 685
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 6324079 916 LEARIISLKNKKDEAQSVLDKVKNERDI--QVRNKQKTVADI-----NRLIDRFQTIYNEVVDF 972
Cdd:PRK02224 686 VENELEELEELRERREALENRVEALEALydEAEELESMYGDLraelrQRNVETLERMLNETFDL 749
|
|
| COG3950 |
COG3950 |
Predicted ATP-binding protein involved in virulence [General function prediction only]; |
4-50 |
1.84e-05 |
|
Predicted ATP-binding protein involved in virulence [General function prediction only];
Pssm-ID: 443150 [Multi-domain] Cd Length: 276 Bit Score: 48.07 E-value: 1.84e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 6324079 4 IYKLSIQGIRSFDSndrETIEFGKP--LTLIVGMNGSGKTTIIECLKYA 50
Cdd:COG3950 3 IKSLTIENFRGFED---LEIDFDNPprLTVLVGENGSGKTTLLEAIALA 48
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
766-1307 |
1.85e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.14 E-value: 1.85e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 766 TSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYnTSEDGIQTVDELRDQQRKMND 845
Cdd:COG4913 607 DNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVA-SAEREIAELEAELERLDASSD 685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 846 SLRELRKTISDLQMEKDEKVREnsrminlIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDS--RVKELEARIISL 923
Cdd:COG4913 686 DLAALEEQLEELEAELEELEEE-------LDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARleLRALLEERFAAA 758
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 924 KNKKDEAQsVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIY-NEVVDFEAkGFDELQTTIKELElnkaqmlELKEQlD 1002
Cdd:COG4913 759 LGDAVERE-LRENLEERIDALRARLNRAEEELERAMRAFNREWpAETADLDA-DLESLPEYLALLD-------RLEED-G 828
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1003 LKSNEVNEEKRKLADSNNEEKNLKQnleliELKSQLQHIESEISRLdvqNAEAERDKYQEESlRLRTRFEKLSSEnagkl 1082
Cdd:COG4913 829 LPEYEERFKELLNENSIEFVADLLS-----KLRRAIREIKERIDPL---NDSLKRIPFGPGR-YLRLEARPRPDP----- 894
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1083 gEMKQLQNQIDSLT-HQLRTDYKDIEKNYH--KEWVE-LQTRSFVTDdidvyskaldsaimkyhglkMQDINRIIDelwK 1158
Cdd:COG4913 895 -EVREFRQELRAVTsGASLFDEELSEARFAalKRLIErLRSEEEESD--------------------RRWRARVLD---V 950
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1159 RTYSGTDIDTIKIRSDEVSSTvkgksynyrvvmYKqdvelDMRGRcSAGQK-VLASIIIRLALSETFGANC------GVI 1231
Cdd:COG4913 951 RNHLEFDAEEIDREDGEEVET------------YS-----SSGGK-SGGEKqKLAYFILAAALAYQLGLEGrgrpsfRTV 1012
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 6324079 1232 ALDEPTTNLDEENIESLAKSLHNIinmrrhqkNFQLIVITHDEKFlghMNAAAFTDHFFKVKRDDRQKSQIEWVDI 1307
Cdd:COG4913 1013 VLDEAFSKMDEEFARRALRLFKEL--------GLQLLIATPLDKI---QAIEPYVGSVLVVHKDDGRRSRVRELTI 1077
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
172-1107 |
2.05e-05 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 49.28 E-value: 2.05e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 172 KKKFDEIFQAM-KFTKALDNLK----SIKKDMSVDIKLLKQ-SVEHLKLDKDR-----SKAMKLNIHQLQTKIDQYNEEV 240
Cdd:TIGR01612 1235 KKKSEHMIKAMeAYIEDLDEIKekspEIENEMGIEMDIKAEmETFNISHDDDKdhhiiSKKHDENISDIREKSLKIIEDF 1314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 241 SEiESQLNEITEKSDKLF----KSNQDFQKILSKVENLKNT-KLS----ISDQVKRLSNSIDildLSKPDLQNLLANFSK 311
Cdd:TIGR01612 1315 SE-ESDINDIKKELQKNLldaqKHNSDINLYLNEIANIYNIlKLNkikkIIDEVKEYTKEIE---ENNKNIKDELDKSEK 1390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 312 VL--MDKNNQLRDLETDISSLKDRQSSLQSLSNSLIRRQGEL--EAGKETYEKNRnhlsslKEAFQHKFQGLSNIENSDm 387
Cdd:TIGR01612 1391 LIkkIKDDINLEECKSKIESTLDDKDIDECIKKIKELKNHILseESNIDTYFKNA------DENNENVLLLFKNIEMAD- 1463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 388 AQVNHEMSQFKAFISQDLTDTIDQFAKDIQLK---ETNLSDLIKSITVDSQNLEYNKKDRSKLIHDSEELAEKLKSFKSL 464
Cdd:TIGR01612 1464 NKSQHILKIKKDNATNDHDFNINELKEHIDKSkgcKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALAIKNKFAKTK 1543
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 465 STQDSLNHELENLKTyKEKLQSWESENIIPKL-NQKI--------EEKNNEMII-LENQIEKFQDRIMKTNQqadlyakl 534
Cdd:TIGR01612 1544 KDSEIIIKEIKDAHK-KFILEAEKSEQKIKEIkKEKFrieddaakNDKSNKAAIdIQLSLENFENKFLKISD-------- 1614
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 535 glIKKSINTKLDELQKITEKLQNDSRIRQVFPLTQEFQRADLEMDFQKLFINMQKNIAINNKKMHELDRRytnalynLNT 614
Cdd:TIGR01612 1615 --IKKKINDCLKETESIEKKISSFSIDSQDTELKENGDNLNSLQEFLESLKDQKKNIEDKKKELDELDSE-------IEK 1685
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 615 IEKDLQDNQKSKE-KVIQLLSEnlpedcTIDEYNDVLEETELSYKTALENLkmhqtTLEFNRKALEIAErdsccylcsrk 693
Cdd:TIGR01612 1686 IEIDVDQHKKNYEiGIIEKIKE------IAIANKEEIESIKELIEPTIENL-----ISSFNTNDLEGID----------- 1743
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 694 fENESfksklLQELKTKTDANFEKTLKdtvqNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDE 773
Cdd:TIGR01612 1744 -PNEK-----LEEYNTEIGDIYEEFIE----LYNIIAGCLETVSKEPITYDEIKNTRINAQNEFLKIIEIEKKSKSYLDD 1813
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 774 LEVDS---------TKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISeELSIYNTSEDGIQTVdeLRDQQRKMN 844
Cdd:TIGR01612 1814 IEAKEfdriinhfkKKLDHVNDKFTKEYSKINEGFDDISKSIENVKNSTDENL-LFDILNKTKDAYAGI--IGKKYYSYK 1890
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 845 DSLRELRKTISDLQMEKDEKVrENSRMINLIKEKELTV-----SEIESSLT------QKQNIDDSIRSKRENINDIDSRV 913
Cdd:TIGR01612 1891 DEAEKIFINISKLANSINIQI-QNNSGIDLFDNINIAIlssldSEKEDTLKfipspeKEPEIYTKIRDSYDTLLDIFKKS 1969
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 914 KELEARIISLKNKKDEAQSVLDKVKNERDIqvrnkQKTVADINRlidRFQTIYNEVVDFEAKgFDELQTTIKELElNKAQ 993
Cdd:TIGR01612 1970 QDLHKKEQDTLNIIFENQQLYEKIQASNEL-----KDTLSDLKY---KKEKILNDVKLLLHK-FDELNKLSCDSQ-NYDT 2039
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 994 MLELKEQLDLKSNEVNEEKrkladsnnEEKNLKQNLELIELKSQLQHIESEISRLDVQNAEAERDKYQeeslrlrtrfek 1073
Cdd:TIGR01612 2040 ILELSKQDKIKEKIDNYEK--------EKEKFGIDFDVKAMEEKFDNDIKDIEKFENNYKHSEKDNHD------------ 2099
|
970 980 990
....*....|....*....|....*....|....
gi 6324079 1074 LSSENAGKLgemkQLQNQIDSLTHQLRTDYKDIE 1107
Cdd:TIGR01612 2100 FSEEKDNII----QSKKKLKELTEAFNTEIKIIE 2129
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
704-930 |
2.19e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.22 E-value: 2.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 704 LQELKTKTDANfEKTLKDTVQNEKEYLHSLRLLEKHIITLnsiNEKIDNSQKCLEKAKEETKTSKSKLDELEvdsTKLKD 783
Cdd:COG4942 29 LEQLQQEIAEL-EKELAALKKEEKALLKQLAALERRIAAL---ARRIRALEQELAALEAELAELEKEIAELR---AELEA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 784 EKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNT-SEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKD 862
Cdd:COG4942 102 QKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYlAPARREQAEELRADLAELAALRAELEAERAELEALLA 181
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 6324079 863 EKVRENSRMINLIKEKELTVSEIESSLTQKQNiddSIRSKRENINDIDSRVKELEARIISLKNKKDEA 930
Cdd:COG4942 182 ELEEERAALEALKAERQKLLARLEKELAELAA---ELAELQQEAEELEALIARLEAEAAAAAERTPAA 246
|
|
| CcmA |
COG4133 |
ABC-type transport system involved in cytochrome c biogenesis, ATPase component ... |
1202-1277 |
3.18e-05 |
|
ABC-type transport system involved in cytochrome c biogenesis, ATPase component [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443308 [Multi-domain] Cd Length: 206 Bit Score: 46.32 E-value: 3.18e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 6324079 1202 GRCSAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENIESLAKSlhniinMRRHQKNFQLIVI-THDEKFL 1277
Cdd:COG4133 130 RQLSAGQKR------RVALARLLLSPAPLWLLDEPFTALDAAGVALLAEL------IAAHLARGGAVLLtTHQPLEL 194
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
726-1072 |
3.26e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 48.20 E-value: 3.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 726 EKEYLHSLRLLEKHiitLNSINEKidNSQKCLEKAKEET--KTSKSKLDELEvDSTKLKDEKELAESEIRPliEKFTYLE 803
Cdd:pfam17380 267 ENEFLNQLLHIVQH---QKAVSER--QQQEKFEKMEQERlrQEKEEKAREVE-RRRKLEEAEKARQAEMDR--QAAIYAE 338
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 804 KELKDLENSSKTISEELsiyntsEDGIQTVDELRDQQRKMNDS-LRELRKtisdLQME---KDEKVR---ENSRMINLIK 876
Cdd:pfam17380 339 QERMAMERERELERIRQ------EERKRELERIRQEEIAMEISrMRELER----LQMErqqKNERVRqelEAARKVKILE 408
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 877 EkeltvsEIESSLTQKQNIDDSIRSKRENINDIDSRVKElEARIISLKNKKDEAQsvldkvknERDIQVRNKQKTVADIN 956
Cdd:pfam17380 409 E------ERQRKIQQQKVEMEQIRAEQEEARQREVRRLE-EERAREMERVRLEEQ--------ERQQQVERLRQQEEERK 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 957 RlidrfQTIYNEVVDFEAKGFDELQTTI--KELELNKAQMLE-------LKEQLDLKSNEVNEEKRKlADSNNEEKNLKQ 1027
Cdd:pfam17380 474 R-----KKLELEKEKRDRKRAEEQRRKIleKELEERKQAMIEeerkrklLEKEMEERQKAIYEEERR-REAEEERRKQQE 547
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 6324079 1028 NLELIELKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFE 1072
Cdd:pfam17380 548 MEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARAEYE 592
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
444-1017 |
3.51e-05 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 48.30 E-value: 3.51e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 444 RSKLIHDSEELAEKLKSFK-SLSTQDS----LNHELENLKtykEKLQSWESENIiPKLNQKIEEK---NNEMIILENQIE 515
Cdd:pfam12128 289 NQLLRTLDDQWKEKRDELNgELSAADAavakDRSELEALE---DQHGAFLDADI-ETAAADQEQLpswQSELENLEERLK 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 516 KFQDRIMKTNQQADlyAKLGLIKKSINTKLDELQKITEKlQNDSRIRQvfpltqefqRADLEMDFQKL--FINMQKNIAI 593
Cdd:pfam12128 365 ALTGKHQDVTAKYN--RRRSKIKEQNNRDIAGIKDKLAK-IREARDRQ---------LAVAEDDLQALesELREQLEAGK 432
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 594 NNKKMHELdrRYTNALYNLN------TIEKDLQDNQ---------------KSKEKVIQLLSENLPEDCTIDEYNDVLEE 652
Cdd:pfam12128 433 LEFNEEEY--RLKSRLGELKlrlnqaTATPELLLQLenfderierareeqeAANAEVERLQSELRQARKRRDQASEALRQ 510
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 653 TE---LSYKTALENLkMHQ------TTLEFNRKAL-----EIAERDSCCYLC-----------SRKFENESFKSKL-LQE 706
Cdd:pfam12128 511 ASrrlEERQSALDEL-ELQlfpqagTLLHFLRKEApdweqSIGKVISPELLHrtdldpevwdgSVGGELNLYGVKLdLKR 589
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 707 LKTKTDANFEKTLKDTVQNEKEYLHSLRLLEKHIIT-LNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEK 785
Cdd:pfam12128 590 IDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEqLVQANGELEKASREETFARTALKNARLDLRRLFDEKQSEKDKK 669
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 786 ELA-ESEIRPLIEKFTYLEKELKDLENSSKTISEelsiyntsedgiQTVDELRDQQRKMNDSLRELrktisdlqmEKDEK 864
Cdd:pfam12128 670 NKAlAERKDSANERLNSLEAQLKQLDKKHQAWLE------------EQKEQKREARTEKQAYWQVV---------EGALD 728
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 865 VRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQ 944
Cdd:pfam12128 729 AQLALLKAAIAARRSGAKAELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQR 808
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 6324079 945 VRNKQKTVADINRLIDRFQtiynevvdfeakgfDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLAD 1017
Cdd:pfam12128 809 RPRLATQLSNIERAISELQ--------------QQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRC 867
|
|
| ABC_SMC_barmotin |
cd03278 |
ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a ... |
6-50 |
3.65e-05 |
|
ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).
Pssm-ID: 213245 [Multi-domain] Cd Length: 197 Bit Score: 45.92 E-value: 3.65e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 6324079 6 KLSIQGIRSFdsNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYA 50
Cdd:cd03278 3 KLELKGFKSF--ADKTTIPFPPGLTAIVGPNGSGKSNIIDAIRWV 45
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
742-1081 |
3.69e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.11 E-value: 3.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 742 TLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEV---DSTKLKDEKELAESEIRPLIEKFTYLEKELKDLEnssktise 818
Cdd:PRK02224 207 RLNGLESELAELDEEIERYEEQREQARETRDEADEvleEHEERREELETLEAEIEDLRETIAETEREREELA-------- 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 819 elsiyntsedgiqtvDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDS 898
Cdd:PRK02224 279 ---------------EEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 899 IRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDI---QVRNKQKTVADINRLIDRFQTIYNEVVDF--E 973
Cdd:PRK02224 344 AESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEEleeEIEELRERFGDAPVDLGNAEDFLEELREErdE 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 974 AKG-FDELQTTIKELE--LNKAQML----------------ELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLE---- 1030
Cdd:PRK02224 424 LRErEAELEATLRTARerVEEAEALleagkcpecgqpvegsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLEraed 503
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 1031 LIELKSQLQHIESEIS----RLDVQNAEAERDKYQEESlrLRTRFEKLSSENAGK 1081
Cdd:PRK02224 504 LVEAEDRIERLEERREdleeLIAERRETIEEKRERAEE--LRERAAELEAEAEEK 556
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
802-1095 |
4.96e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 4.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 802 LEKELKDLENSSKTISEELSIYntsEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELT 881
Cdd:PRK03918 160 YENAYKNLGEVIKEIKRRIERL---EKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEEL 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 882 VSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSvLDKVKNERDIQVRNKQKTVADINRLIDR 961
Cdd:PRK03918 237 KEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKE-LKEKAEEYIKLSEFYEEYLDELREIEKR 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 962 FQTIYNEVvdfeakgfDELQTTIKELELNKAQMLELKEQLDlksnEVNEEKRKLadsnneEKNLKQNLELIELKSQLQHI 1041
Cdd:PRK03918 316 LSRLEEEI--------NGIEERIKELEEKEERLEELKKKLK----ELEKRLEEL------EERHELYEEAKAKKEELERL 377
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 6324079 1042 ESEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSL 1095
Cdd:PRK03918 378 KKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEEL 431
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
719-944 |
7.66e-05 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 47.23 E-value: 7.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 719 LKDTVQNEK-EYLHSLRL-LEKHIITLNSINEKidnsqkclEKAKEETKTSKSKLDELEVdSTKLKDEKELAESEIRPLI 796
Cdd:PRK05771 36 LKEELSNERlRKLRSLLTkLSEALDKLRSYLPK--------LNPLREEKKKVSVKSLEEL-IKDVEEELEKIEKEIKELE 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 797 EKFTYLEKELKDLENSS------KTISEELSIYNTSEDGIQTVDELRDQQrkmnDSLRELRKTISDLQMEKDEKvreNSR 870
Cdd:PRK05771 107 EEISELENEIKELEQEIerlepwGNFDLDLSLLLGFKYVSVFVGTVPEDK----LEELKLESDVENVEYISTDK---GYV 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 871 MINLIKEKELTV---SEIESSLTQKQNIDDS------IRSKRENINDIDSRVKELEARIISLKNK-KDEAQSVLDKVKNE 940
Cdd:PRK05771 180 YVVVVVLKELSDeveEELKKLGFERLELEEEgtpselIREIKEELEEIEKERESLLEELKELAKKyLEELLALYEYLEIE 259
|
....
gi 6324079 941 RDIQ 944
Cdd:PRK05771 260 LERA 263
|
|
| COG4637 |
COG4637 |
Predicted ATPase [General function prediction only]; |
4-83 |
1.08e-04 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443675 [Multi-domain] Cd Length: 371 Bit Score: 46.08 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 4 IYKLSIQGIRSFDSndrETIEFGkPLTLIVGMNGSGKTTIIEC---LKYATTGDLPP--NSKGGV--FIHDPKITGEKDI 76
Cdd:COG4637 2 ITRIRIKNFKSLRD---LELPLG-PLTVLIGANGSGKSNLLDAlrfLSDAARGGLQDalARRGGLeeLLWRGPRTITEPI 77
|
....*..
gi 6324079 77 RAQVKLA 83
Cdd:COG4637 78 RLELEFA 84
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
841-1077 |
1.10e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 46.46 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 841 RKMNDSLRELRKT----ISDL----QMEKDEKVREN----SRMINLIKE--KELTVSEIESSLTQKQNIDDSIRSKRENI 906
Cdd:PRK05771 16 SYKDEVLEALHELgvvhIEDLkeelSNERLRKLRSLltklSEALDKLRSylPKLNPLREEKKKVSVKSLEELIKDVEEEL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 907 NDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVrnkqktvaDINRLID-RFQTIYNEVVDFEAKGFDELQTTIK 985
Cdd:PRK05771 96 EKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDL--------DLSLLLGfKYVSVFVGTVPEDKLEELKLESDVE 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 986 ELELNKaqmlELKE-------QLDLKSNEVNEEKRKLADSNNEEKNLKQNLELI-ELKSQLQHIESEISRLDVQnAEAER 1057
Cdd:PRK05771 168 NVEYIS----TDKGyvyvvvvVLKELSDEVEEELKKLGFERLELEEEGTPSELIrEIKEELEEIEKERESLLEE-LKELA 242
|
250 260
....*....|....*....|
gi 6324079 1058 DKYQEESLRLRtrfEKLSSE 1077
Cdd:PRK05771 243 KKYLEELLALY---EYLEIE 259
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
884-1080 |
1.11e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 45.30 E-value: 1.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 884 EIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKqktvadinrlidrfq 963
Cdd:COG1579 32 ELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEALQK--------------- 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 964 tiynevvdfeakgfdELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQnleliELKSQLQHIES 1043
Cdd:COG1579 97 ---------------EIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKA-----ELDEELAELEA 156
|
170 180 190
....*....|....*....|....*....|....*...
gi 6324079 1044 EISRLdvqnaEAERDKYQEE-SLRLRTRFEKLSSENAG 1080
Cdd:COG1579 157 ELEEL-----EAEREELAAKiPPELLALYERIRKRKNG 189
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
491-1109 |
1.24e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 46.55 E-value: 1.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 491 NIIPKLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQADLYAKlglIKKSINTKLDELQKITEKLQNDsrirqvfpLTQE 570
Cdd:TIGR04523 26 NIANKQDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEE---KINNSNNKIKILEQQIKDLNDK--------LKKN 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 571 FQRADlEMDFQKLFINMQKNIAINNKKMHELDrrytnalynLNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVL 650
Cdd:TIGR04523 95 KDKIN-KLNSDLSKINSEIKNDKEQKNKLEVE---------LNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDL 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 651 EETELSYKTALENLKMHQTTLEFNRKALEIAERDSCCYLCSRKFENESFKSKLLQELKTKTDANFEKTLKDTVQNEKEYL 730
Cdd:TIGR04523 165 KKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEK 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 731 HS-LRLLEKHIITL----NSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKElaeseirpliekftylEKE 805
Cdd:TIGR04523 245 TTeISNTQTQLNQLkdeqNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKE----------------QDW 308
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 806 LKDLENSSKTISEELsiyntsedgiqtvDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTV--- 882
Cdd:TIGR04523 309 NKELKSELKNQEKKL-------------EEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIekl 375
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 883 -SEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQvrnkQKTVADINRLIDR 961
Cdd:TIGR04523 376 kKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKN----NSEIKDLTNQDSV 451
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 962 FQTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLEliELKSQLQHI 1041
Cdd:TIGR04523 452 KELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKIS--SLKEKIEKL 529
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 6324079 1042 ESEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEKN 1109
Cdd:TIGR04523 530 ESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKE 597
|
|
| RecF |
COG1195 |
Recombinational DNA repair ATPase RecF [Replication, recombination and repair]; |
6-53 |
1.53e-04 |
|
Recombinational DNA repair ATPase RecF [Replication, recombination and repair];
Pssm-ID: 440808 [Multi-domain] Cd Length: 352 Bit Score: 45.53 E-value: 1.53e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 6324079 6 KLSIQGIRSFDSNDretIEFGKPLTLIVGMNGSGKTTIIECLKYATTG 53
Cdd:COG1195 4 RLSLTNFRNYESLE---LEFSPGINVLVGPNGQGKTNLLEAIYLLATG 48
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
750-1181 |
2.16e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 45.66 E-value: 2.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 750 IDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTSEDG 829
Cdd:PRK01156 185 IDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESD 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 830 IQTVDElrdQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNID---DSIRSKRENI 906
Cdd:PRK01156 265 LSMELE---KNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIkklSVLQKDYNDY 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 907 NDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERdIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKGFDELQTTIKE 986
Cdd:PRK01156 342 IKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLK-KKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQD 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 987 LELNKAQMLELKEQLDLKSNEVneeKRKLADSNNEEKNLKQNLELIELKSQ--LQHIESEISRL--DVQNAEAERDKYQE 1062
Cdd:PRK01156 421 ISSKVSSLNQRIRALRENLDEL---SRNMEMLNGQSVCPVCGTTLGEEKSNhiINHYNEKKSRLeeKIREIEIEVKDIDE 497
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1063 ESLRLRTRFEKLsseNAGKLGEMKQLQNQIDSLTHQLRTDYKDIEknyhkewvELQTRSFVTDDIDVYSKALDSAIMKYH 1142
Cdd:PRK01156 498 KIVDLKKRKEYL---ESEEINKSINEYNKIESARADLEDIKIKIN--------ELKDKHDKYEEIKNRYKSLKLEDLDSK 566
|
410 420 430
....*....|....*....|....*....|....*....
gi 6324079 1143 GLKMQDINRIIDELwkrtysgtDIDTIKIRSDEVSSTVK 1181
Cdd:PRK01156 567 RTSWLNALAVISLI--------DIETNRSRSNEIKKQLN 597
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
12-810 |
2.31e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 45.66 E-value: 2.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 12 IRSFDSNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDlppnsKGGVFIHDPKITGEKDIraQVKLAFTSAnGLN 91
Cdd:PRK01156 8 LKNFLSHDDSEIEFDTGINIITGKNGAGKSSIVDAIRFALFTD-----KRTEKIEDMIKKGKNNL--EVELEFRIG-GHV 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 92 MIVTRNIQLLMKKTTTTFktlegqLVAINNSGDRSTLSTRSLELDAQVplyLGVPKAILEYVIFCHQEDSLWPLS-EPSN 170
Cdd:PRK01156 80 YQIRRSIERRGKGSRREA------YIKKDGSIIAEGFDDTTKYIEKNI---LGISKDVFLNSIFVGQGEMDSLISgDPAQ 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 171 LKKKFDEIFQAMKFTKALDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKAMKLNIHQLQTKIDQYNEEVSEIESqlnEI 250
Cdd:PRK01156 151 RKKILDEILEINSLERNYDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSI---EY 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 251 TEKSDKLFKSNQDFQKILSKVENLKNTKLSISDQVKRLSNSIDILDLSKPDLQNLLANFSKVLMDKNNQLRDLETDISSL 330
Cdd:PRK01156 228 NNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDI 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 331 KDRQSSLQSlsnslirrqgeLEAGKETYEKNRNHLSSLkEAFQHKFQglsnIENSDMAQVNHEMSQFKAFIS--QDLTDT 408
Cdd:PRK01156 308 ENKKQILSN-----------IDAEINKYHAIIKKLSVL-QKDYNDYI----KKKSRYDDLNNQILELEGYEMdyNSYLKS 371
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 409 IDQFAKDIQLKETNLSDLIKSItvdSQNLEYNKKDRSKLIHDSEELAEKLKSFKS--LSTQDSLNHELENLKTYKEKLQS 486
Cdd:PRK01156 372 IESLKKKIEEYSKNIERMSAFI---SEILKIQEIDPDAIKKELNEINVKLQDISSkvSSLNQRIRALRENLDELSRNMEM 448
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 487 WESENIIP---------KLNQKIEEKNNEMIILENQIEKFQDRIMKTNQQADLYAKLgliKKSINTKldelqKITEKLQN 557
Cdd:PRK01156 449 LNGQSVCPvcgttlgeeKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKR---KEYLESE-----EINKSINE 520
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 558 DSRIRQVFPLTQEFQRADLEMDFQKLFINMQKNiAINNKKMHELDRRYTNALYNLNTIEK-DLQDNQKSKEKVIQllsen 636
Cdd:PRK01156 521 YNKIESARADLEDIKIKINELKDKHDKYEEIKN-RYKSLKLEDLDSKRTSWLNALAVISLiDIETNRSRSNEIKK----- 594
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 637 lpedcTIDEYNDVLEETELSYktalenlkmhQTTLEFNRKALEIAERDsccylcSRKFENesfKSKLLQELKTKTDanfe 716
Cdd:PRK01156 595 -----QLNDLESRLQEIEIGF----------PDDKSYIDKSIREIENE------ANNLNN---KYNEIQENKILIE---- 646
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 717 kTLKDTVQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLI 796
Cdd:PRK01156 647 -KLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETL 725
|
810
....*....|....
gi 6324079 797 EKFTYLEKELKDLE 810
Cdd:PRK01156 726 ESMKKIKKAIGDLK 739
|
|
| CydD |
COG4988 |
ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease ... |
1217-1281 |
3.59e-04 |
|
ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 444012 [Multi-domain] Cd Length: 563 Bit Score: 44.75 E-value: 3.59e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 1217 RLALSETFGANCGVIALDEPTTNLDEENIESLAKSLHNIinmRRHQknfQLIVITHDEKFLGHMN 1281
Cdd:COG4988 481 RLALARALLRDAPLLLLDEPTAHLDAETEAEILQALRRL---AKGR---TVILITHRLALLAQAD 539
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
810-1089 |
4.43e-04 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 43.75 E-value: 4.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 810 ENSSKTISEELSIYNTSEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESsl 889
Cdd:COG1340 1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVKELKE-- 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 890 tQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQ--------SVLDKvKNERDI-----QVRNKQKTVADIN 956
Cdd:COG1340 79 -ERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIErlewrqqtEVLSP-EEEKELvekikELEKELEKAKKAL 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 957 RLIDRFQTIYNEVVDFEAKG---FDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQnlELIE 1033
Cdd:COG1340 157 EKNEKLKELRAELKELRKEAeeiHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADELHE--EIIE 234
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 6324079 1034 LKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTR-FEKLSSENAGKLGEMKQLQ 1089
Cdd:COG1340 235 LQKELRELRKELKKLRKKQRALKREKEKEELEEKAEEiFEKLKKGEKLTTEELKLLQ 291
|
|
| ABC_RecF |
cd03242 |
ATP-binding cassette domain of RecF; RecF is a recombinational DNA repair ATPase that ... |
13-53 |
4.93e-04 |
|
ATP-binding cassette domain of RecF; RecF is a recombinational DNA repair ATPase that maintains replication in the presence of DNA damage. When replication is prematurely disrupted by DNA damage, several recF pathway gene products play critical roles processing the arrested replication fork, allowing it to resume and complete its task. This CD represents the nucleotide binding domain of RecF. RecF belongs to a large superfamily of ABC transporters involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases with a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.
Pssm-ID: 213209 [Multi-domain] Cd Length: 270 Bit Score: 43.44 E-value: 4.93e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 6324079 13 RSFDSNDRETIEFGKPLTLIVGMNGSGKTTIIECLKYATTG 53
Cdd:cd03242 7 RNFRNYAELELEFEPGVTVLVGENAQGKTNLLEAISLLATG 47
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
875-1083 |
5.62e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.24 E-value: 5.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 875 IKEKELTVSEIESSLT---QKQNIDDSirskRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKT 951
Cdd:COG3206 184 LPELRKELEEAEAALEefrQKNGLVDL----SEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPEL 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 952 VADinrliDRFQTIYNEVVDFEAKgFDELQTTIKElelNKAQMLELKEQLD-LKSNEVNEEKRKLADSNNEEKNLKQNLE 1030
Cdd:COG3206 260 LQS-----PVIQQLRAQLAELEAE-LAELSARYTP---NHPDVIALRAQIAaLRAQLQQEAQRILASLEAELEALQAREA 330
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 6324079 1031 liELKSQLQHIESEISRLDV---------QNAEAERDKYQEeslrLRTRFEKLSSENAGKLG 1083
Cdd:COG3206 331 --SLQAQLAQLEARLAELPEleaelrrleREVEVARELYES----LLQRLEEARLAEALTVG 386
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
726-1109 |
9.30e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 43.56 E-value: 9.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 726 EKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKLDElevdSTKLKDEKelaeseIRPLIEKFTYLEKE 805
Cdd:pfam05483 228 EEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEE----KTKLQDEN------LKELIEKKDHLTKE 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 806 LKDLENSsktiseelsiyntsedgiqtVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMinlIKEKELTVSEI 885
Cdd:pfam05483 298 LEDIKMS--------------------LQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKA---KAAHSFVVTEF 354
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 886 ESSLTQKQNIDDSIRSKRENINDidsrvkELEARIISLKNKKDEAQSvLDKVKNERDIQVRNKQKTVADINRLID---RF 962
Cdd:pfam05483 355 EATTCSLEELLRTEQQRLEKNED------QLKIITMELQKKSSELEE-MTKFKNNKEVELEELKKILAEDEKLLDekkQF 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 963 QTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHIE 1042
Cdd:pfam05483 428 EKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQE 507
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 6324079 1043 SEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLRTDYKDIEKN 1109
Cdd:pfam05483 508 ASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEEN 574
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
903-1100 |
1.00e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.77 E-value: 1.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 903 RENINDIDSRVKELEARIISLKNKKDEAQS--VLDKVKNERDIQVRnkqktVADINRLIDRFQTIYNEVvdfeakgfDEL 980
Cdd:COG1196 185 EENLERLEDILGELERQLEPLERQAEKAERyrELKEELKELEAELL-----LLKLRELEAELEELEAEL--------EEL 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 981 QTTIKELElnkAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQnlELIELKSQLQHIESEIsrldvQNAEAERDKY 1060
Cdd:COG1196 252 EAELEELE---AELAELEAELEELRLELEELELELEEAQAEEYELLA--ELARLEQDIARLEERR-----RELEERLEEL 321
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 6324079 1061 QEESLRLRTRFEKLSSENAGKLGEMKQLQNQIDSLTHQLR 1100
Cdd:COG1196 322 EEELAELEEELEELEEELEELEEELEEAEEELEEAEAELA 361
|
|
| COG3910 |
COG3910 |
Predicted ATPase [General function prediction only]; |
4-45 |
1.06e-03 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443116 [Multi-domain] Cd Length: 239 Bit Score: 42.06 E-value: 1.06e-03
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 6324079 4 IYKLSIQGIRSFdsndrETIEFGKPLTLIVGMNGSGKTTIIE 45
Cdd:COG3910 19 AYPFNLPAVRNL-----EGLEFHPPVTFFVGENGSGKSTLLE 55
|
|
| CydD |
TIGR02857 |
thiol reductant ABC exporter, CydD subunit; The gene pair cydCD encodes an ABC-family ... |
1195-1275 |
1.36e-03 |
|
thiol reductant ABC exporter, CydD subunit; The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain (pfam00664) and a C-terminal ATP-binding domain (pfam00005). In E. coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in export of redox-active thiol compounds such as cysteine and glutathione. The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this model are all either found in the gammproteobacterial context or the CydABCD context. All members of this family scoring above trusted at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature adopted based on this operon in B. subtilis assigns cydC to the third gene in the operon where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologs in this family in accordance with the precedence of publication of the E. coli name, CydD
Pssm-ID: 274323 [Multi-domain] Cd Length: 529 Bit Score: 43.04 E-value: 1.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1195 DVELDMRGR-CSAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENiESLAKSlhniiNMRRHQKNFQLIVITHD 1273
Cdd:TIGR02857 449 DTPIGEGGAgLSGGQAQ------RLALARAFLRDAPLLLLDEPTAHLDAET-EAEVLE-----ALRALAQGRTVLLVTHR 516
|
..
gi 6324079 1274 EK 1275
Cdd:TIGR02857 517 LA 518
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
220-1043 |
1.74e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 43.12 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 220 KAMKLNIHQLQTKIDQYNEEVSEIESQLNE--------ITEKSDKLFKSNQdfqkilskvENLKNTKLSISDQVKRLSNS 291
Cdd:TIGR01612 1663 KDQKKNIEDKKKELDELDSEIEKIEIDVDQhkknyeigIIEKIKEIAIANK---------EEIESIKELIEPTIENLISS 1733
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 292 IDILDLSKPDLQNLLANFSKVLMDKNNQLRDLETDIS----SLKDRQSSLQSLSNSLIRRQGELEAGKETYEKNRNHLSS 367
Cdd:TIGR01612 1734 FNTNDLEGIDPNEKLEEYNTEIGDIYEEFIELYNIIAgcleTVSKEPITYDEIKNTRINAQNEFLKIIEIEKKSKSYLDD 1813
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 368 LK--------EAFQHKFQGLSNIENSDMAQVNHEMSQFKAFISQDLTDTIDQFAKDI--QLKETNLSDLIKSITVDSQNL 437
Cdd:TIGR01612 1814 IEakefdriiNHFKKKLDHVNDKFTKEYSKINEGFDDISKSIENVKNSTDENLLFDIlnKTKDAYAGIIGKKYYSYKDEA 1893
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 438 EYNKKDRSKL-----IHDSEELAEKLKSFKSLSTQDSLNHELENL-----------KTYKEKLQSWESENIIPKLNQKIE 501
Cdd:TIGR01612 1894 EKIFINISKLansinIQIQNNSGIDLFDNINIAILSSLDSEKEDTlkfipspekepEIYTKIRDSYDTLLDIFKKSQDLH 1973
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 502 EKNNEMIILENQIEKFQDRIMKTNQQADLYAKLGLIKKSINT-------KLDELQKITEKLQNDSRIRQVFPLTQEFQRA 574
Cdd:TIGR01612 1974 KKEQDTLNIIFENQQLYEKIQASNELKDTLSDLKYKKEKILNdvklllhKFDELNKLSCDSQNYDTILELSKQDKIKEKI 2053
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 575 D-LEMDFQKLFINMQKNIainnkkmheLDRRYTNalyNLNTIEKdLQDNQKSKEKVIQLLSEnlpEDCTIDEYNDVLEET 653
Cdd:TIGR01612 2054 DnYEKEKEKFGIDFDVKA---------MEEKFDN---DIKDIEK-FENNYKHSEKDNHDFSE---EKDNIIQSKKKLKEL 2117
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 654 ELSYKTALEnlKMHQTTLEFNRKALEIAERDSCCYLCSRKFENESFKSKLLQELKTKTDA-NFEKTLKDTVQNEKEYL-- 730
Cdd:TIGR01612 2118 TEAFNTEIK--IIEDKIIEKNDLIDKLIEMRKECLLFSYATLVETLKSKVINHSEFITSAaKFSKDFFEFIEDISDSLnd 2195
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 731 --------HSLRLLEKHIITLnsINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLkDEKELAESEIrplieKFTYL 802
Cdd:TIGR01612 2196 didalqikYNLNQTKKHMISI--LADATKDHNNLIEKEKEATKIINNLTELFTIDFNNA-DADILHNNKI-----QIIYF 2267
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 803 EKELKDLENSSKTISEELSIYNTSEdgIQTVDELRDQQRKMNDSLrelrktisdLQMEKDeKVRENsrmINLIKEkeltv 882
Cdd:TIGR01612 2268 NSELHKSIESIKKLYKKINAFKLLN--ISHINEKYFDISKEFDNI---------IQLQKH-KLTEN---LNDLKE----- 2327
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 883 seIESSLTQKQNIddSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKvknerdiQVRNKQKTVADINRLIDRF 962
Cdd:TIGR01612 2328 --IDQYISDKKNI--FLHALNENTNFNFNALKEIYDDIINRENKADEIENINNK-------ENENIMQYIDTITKLTEKI 2396
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 963 QTIYNEVVDFE------------------AKGFDELQTTIKELELNKAQMLELKEQLdLKSNEVNEEKRKLADSNNEEKN 1024
Cdd:TIGR01612 2397 QDILIFVTTYEndnniikqhiqdndendvSKIKDNLKKTIQSFQEILNKIDEIKAQF-YGGNNINNIIITISQNANDVKN 2475
|
890 900
....*....|....*....|....
gi 6324079 1025 L-KQNL----ELIELKSQLQHIES 1043
Cdd:TIGR01612 2476 HfSKDLtienELIQIQKRLEDIKN 2499
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
658-1108 |
1.80e-03 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 42.81 E-value: 1.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 658 KTALENLK--MHQTTLEFNRkalEIAERDSCCYLCSRKFENESFKSKLLQ---ELKTKTDANFEKTLKDTVQNEKEYLHS 732
Cdd:pfam05557 8 KARLSQLQneKKQMELEHKR---ARIELEKKASALKRQLDRESDRNQELQkriRLLEKREAEAEEALREQAELNRLKKKY 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 733 LRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKlDELEVDSTKLK--------DEKELAESEIRPLIEKFTYLEK 804
Cdd:pfam05557 85 LEALNKKLNEKESQLADAREVISCLKNELSELRRQIQR-AELELQSTNSEleelqerlDLLKAKASEAEQLRQNLEKQQS 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 805 ELKDLENSSKTISEELSIYNTSEDGIQTVDE----------LRDQQRKMNDSLRELRKTISDLQ---------MEKDEKV 865
Cdd:pfam05557 164 SLAEAEQRIKELEFEIQSQEQDSEIVKNSKSelaripelekELERLREHNKHLNENIENKLLLKeevedlkrkLEREEKY 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 866 REnsRMINLIKEKELTVSEIESSLTQKQNIDDSIRS-----------KRENIN------DIDSRVKELEARIISLKNKKD 928
Cdd:pfam05557 244 RE--EAATLELEKEKLEQELQSWVKLAQDTGLNLRSpedlsrrieqlQQREIVlkeensSLTSSARQLEKARRELEQELA 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 929 EAQSVLDKVKNER---DIQVRNKQKTVADINRLIDRFQTIYnEVVDFE---AKGFDELQTTIKELE--LNKAQML--ELK 998
Cdd:pfam05557 322 QYLKKIEDLNKKLkrhKALVRRLQRRVLLLTKERDGYRAIL-ESYDKEltmSNYSPQLLERIEEAEdmTQKMQAHneEME 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 999 EQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIE----------LKSQLQHIESEISRLDVQN----AEAERDKYQEES 1064
Cdd:pfam05557 401 AQLSVAEEELGGYKQQAQTLERELQALRQQESLADpsyskeevdsLRRKLETLELERQRLREQKneleMELERRCLQGDY 480
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 6324079 1065 LRLRTRFEKLSSENAGKL-----GEMKQLQNQIDSLTHQLRTDYKDIEK 1108
Cdd:pfam05557 481 DPKKTKVLHLSMNPAAEAyqqrkNQLEKLQAEIERLKRLLKKLEDDLEQ 529
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
845-1084 |
1.81e-03 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 42.69 E-value: 1.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 845 DSLRELRKTISDLQMEKD---EKVRENSRMINLIKEKEltvseiESSLTQKQNIDDSIRskrENINDIDSRVKELEARII 921
Cdd:PHA02562 174 DKIRELNQQIQTLDMKIDhiqQQIKTYNKNIEEQRKKN------GENIARKQNKYDELV---EEAKTIKAEIEELTDELL 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 922 SLKNKKDEAQSVLDKVKNERdIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKG--FDELQTTIKELELNKAQMLELKE 999
Cdd:PHA02562 245 NLVMDIEDPSAALNKLNTAA-AKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPdrITKIKDKLKELQHSLEKLDTAID 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1000 QLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIElkSQLQHIESEISRLdvqnaEAERDKYQEESLRLRTRFEKLSSENA 1079
Cdd:PHA02562 324 ELEEIMDEFNEQSKKLLELKNKISTNKQSLITLV--DKAKKVKAAIEEL-----QAEFVDNAEELAKLQDELDKIVKTKS 396
|
....*
gi 6324079 1080 GKLGE 1084
Cdd:PHA02562 397 ELVKE 401
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
789-940 |
1.99e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.45 E-value: 1.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 789 ESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTsedgiqTVDELRDQQRKMNDSLRELRKTIsdlqmEKDEKvren 868
Cdd:COG1579 16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKT------ELEDLEKEIKRLELEIEEVEARI-----KKYEE---- 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 6324079 869 sRMINLIKEKELTV--SEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNE 940
Cdd:COG1579 81 -QLGNVRNNKEYEAlqKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAE 153
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
743-993 |
2.13e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 41.82 E-value: 2.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 743 LNSINEKIDNSQKCLEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSI 822
Cdd:COG1340 17 IEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVKELKEERDELNEKLNELREELDE 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 823 YNtsedgiqtvdELRDQQRKMNDSLRELRKTISDLQMEKDEKV---RENSRMINLIKEKELTVSEIESSLTQKQNIDD-- 897
Cdd:COG1340 97 LR----------KELAELNKAGGSIDKLRKEIERLEWRQQTEVlspEEEKELVEKIKELEKELEKAKKALEKNEKLKElr 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 898 -SIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDI---QVRNKQKTVADINRLIDRFQTIYNEVVDFE 973
Cdd:COG1340 167 aELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADElhkEIVEAQEKADELHEEIIELQKELRELRKEL 246
|
250 260
....*....|....*....|
gi 6324079 974 AKGFDELQTTIKELELNKAQ 993
Cdd:COG1340 247 KKLRKKQRALKREKEKEELE 266
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
179-420 |
2.22e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 179 FQAMKFTKALDNLKSIKKDMSVDIKLLKQSVEHLKLDKDRSKAmklNIHQLQTKIDQYNEEVSEIESQLNEITEKSDKLF 258
Cdd:COG4942 13 LAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLK---QLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 259 KSNQDFQKILSKVENL------KNTKLSISDQVKRLSNSIDILDLSKpdlqnLLANFSKVLMDKNNQLRDLETDISSLKD 332
Cdd:COG4942 90 KEIAELRAELEAQKEElaellrALYRLGRQPPLALLLSPEDFLDAVR-----RLQYLKYLAPARREQAEELRADLAELAA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 333 RQSSLQslsnsliRRQGELEAGKETYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSQfkafiSQDLTDTIDQF 412
Cdd:COG4942 165 LRAELE-------AERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE-----AEELEALIARL 232
|
....*...
gi 6324079 413 AKDIQLKE 420
Cdd:COG4942 233 EAEAAAAA 240
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
612-1112 |
2.39e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 42.26 E-value: 2.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 612 LNTIEKDLQDNQKSKEKVIQLLSENLPEDCTIDEYNDVLEETElSYKTALENLKMHQTTLEFNRKALEIAERDSCCYLCS 691
Cdd:TIGR00618 228 LKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIE-ELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 692 RKFENESFKSKLLQELKTKTDANFEKTLKDTvQNEKEYLHSLRLLEKHIITLNSINEKIDNSQKCLEKAKEETKTSKSKL 771
Cdd:TIGR00618 307 QQAQRIHTELQSKMRSRAKLLMKRAAHVKQQ-SSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQ 385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 772 DELEVDSTKLKDEKELAESE------IRPLIEKFTYLEKELKDLENSSKTISE-----ELSIYNTSEDGIQTVDELRDQQ 840
Cdd:TIGR00618 386 QQKTTLTQKLQSLCKELDILqreqatIDTRTSAFRDLQGQLAHAKKQQELQQRyaelcAAAITCTAQCEKLEKIHLQESA 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 841 RKMnDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARI 920
Cdd:TIGR00618 466 QSL-KEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSE 544
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 921 ISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDFEAKGFDELQTTIKELELNKAQMLELKEQ 1000
Cdd:TIGR00618 545 EDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPE 624
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1001 LDlksnevNEEKRKladsnnEEKNLKQNLELieLKSQLQHIESEISRLDVQNAEAERDKYQEESLRLRTRFEKLSSENAG 1080
Cdd:TIGR00618 625 QD------LQDVRL------HLQQCSQELAL--KLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKE 690
|
490 500 510
....*....|....*....|....*....|..
gi 6324079 1081 KLGEMKQLQNQIDSLTHQLRTDYKDIEKNYHK 1112
Cdd:TIGR00618 691 QLTYWKEMLAQCQTLLRELETHIEEYDREFNE 722
|
|
| Uup |
COG0488 |
ATPase components of ABC transporters with duplicated ATPase domains [General function ... |
1202-1277 |
2.51e-03 |
|
ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only];
Pssm-ID: 440254 [Multi-domain] Cd Length: 520 Bit Score: 41.97 E-value: 2.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 1202 GRCSAGQKVlasiiiRLALSETF--GANcgVIALDEPTTNLDEENIESLAKSLhniinmrrhqKNFQ--LIVITHDEKFL 1277
Cdd:COG0488 431 GVLSGGEKA------RLALAKLLlsPPN--VLLLDEPTNHLDIETLEALEEAL----------DDFPgtVLLVSHDRYFL 492
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
834-1073 |
2.58e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 2.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 834 DELRDQQRKMnDSLRELRKTISDLQmEKDEKVRENSRMINLI------KEKELTVSEIESSLTQKQNIDDSIRSKRENIN 907
Cdd:COG4913 242 EALEDAREQI-ELLEPIRELAERYA-AARERLAELEYLRAALrlwfaqRRLELLEAELEELRAELARLEAELERLEARLD 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 908 DIDSRVKELEARIislknkkdeAQSVLDKVKN-ERDIqvRNKQKTVADINRLIDRFQTiynevvDFEAKGFdELQTTIKE 986
Cdd:COG4913 320 ALREELDELEAQI---------RGNGGDRLEQlEREI--ERLERELEERERRRARLEA------LLAALGL-PLPASAEE 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 987 LELNKAQMLELKEQLDLKSNEVNEEKRKLADsnneeknlkqnlELIELKSQLQHIESEISRLdvqnaEAERDKYQEESLR 1066
Cdd:COG4913 382 FAALRAEAAALLEALEEELEALEEALAEAEA------------ALRDLRRELRELEAEIASL-----ERRKSNIPARLLA 444
|
....*..
gi 6324079 1067 LRTRFEK 1073
Cdd:COG4913 445 LRDALAE 451
|
|
| ABC_subfamily_A |
cd03263 |
ATP-binding cassette domain of the lipid transporters, subfamily A; The ABCA subfamily ... |
22-86 |
2.77e-03 |
|
ATP-binding cassette domain of the lipid transporters, subfamily A; The ABCA subfamily mediates the transport of a variety of lipid compounds. Mutations of members of ABCA subfamily are associated with human genetic diseases, such as, familial high-density lipoprotein (HDL) deficiency, neonatal surfactant deficiency, degenerative retinopathies, and congenital keratinization disorders. The ABCA1 protein is involved in disorders of cholesterol transport and high-density lipoprotein (HDL) biosynthesis. The ABCA4 (ABCR) protein transports vitamin A derivatives in the outer segments of photoreceptor cells, and therefore, performs a crucial step in the visual cycle. The ABCA genes are not present in yeast. However, evolutionary studies of ABCA genes indicate that they arose as transporters that subsequently duplicated and that certain sets of ABCA genes were lost in different eukaryotic lineages.
Pssm-ID: 213230 [Multi-domain] Cd Length: 220 Bit Score: 40.95 E-value: 2.77e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 22 TIEFGKPLTLiVGMNGSGKTTIIECLkyatTGDLPPNSkGGVFIHDpkitgeKDIRAQVKLAFTS 86
Cdd:cd03263 24 NVYKGEIFGL-LGHNGAGKTTTLKML----TGELRPTS-GTAYING------YSIRTDRKAARQS 76
|
|
| ABC_sbcCD |
cd03279 |
ATP-binding cassette domain of sbcCD; SbcCD and other Mre11/Rad50 (MR) complexes are ... |
6-100 |
2.98e-03 |
|
ATP-binding cassette domain of sbcCD; SbcCD and other Mre11/Rad50 (MR) complexes are implicated in the metabolism of DNA ends. They cleave ends sealed by hairpin structures and are thought to play a role in removing protein bound to DNA termini.
Pssm-ID: 213246 [Multi-domain] Cd Length: 213 Bit Score: 40.72 E-value: 2.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 6 KLSIQGIRSFdsNDRETIEF----GKPLTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGGVfIHDPKITGEKdiRAQVK 81
Cdd:cd03279 5 KLELKNFGPF--REEQVIDFtgldNNGLFLICGPTGAGKSTILDAITYALYGKTPRYGRQEN-LRSVFAPGED--TAEVS 79
|
90
....*....|....*....
gi 6324079 82 LAFtSANGLNMIVTRNIQL 100
Cdd:cd03279 80 FTF-QLGGKKYRVERSRGL 97
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
845-1070 |
3.11e-03 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 41.60 E-value: 3.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 845 DSLRELRKTISdlqmEKDEKVREN----SRMINLIKEKELTVSEIESSLTQKQNiddsirskreNINDIDSRVKELEARI 920
Cdd:PRK11637 47 DQLKSIQQDIA----AKEKSVRQQqqqrASLLAQLKKQEEAISQASRKLRETQN----------TLNQLNKQIDELNASI 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 921 ISLKNKKDEAQSVLDKvknERDIQVRNKQKTVAdinRLI---------DRFQTIYNEVVDFEAKGFDELQTTIKELELNK 991
Cdd:PRK11637 113 AKLEQQQAAQERLLAA---QLDAAFRQGEHTGL---QLIlsgeesqrgERILAYFGYLNQARQETIAELKQTREELAAQK 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 992 AQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLE--LIELKSQLQHIESEISRLDVQNAEAER------DKYQEE 1063
Cdd:PRK11637 187 AELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLEssLQKDQQQLSELRANESRLRDSIARAEReakaraEREARE 266
|
....*..
gi 6324079 1064 SLRLRTR 1070
Cdd:PRK11637 267 AARVRDK 273
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
829-1092 |
3.78e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.81 E-value: 3.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 829 GIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENIND 908
Cdd:PRK01156 181 EISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKT 260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 909 IDSRVKELEARIISLKNKKDEaqsvLDKVKNERDIQVRNK-QKTVADINRLIDRFQTIYNevVDFEAKGFDELQTTIKEL 987
Cdd:PRK01156 261 AESDLSMELEKNNYYKELEER----HMKIINDPVYKNRNYiNDYFKYKNDIENKKQILSN--IDAEINKYHAIIKKLSVL 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 988 ELNKAQMLELKEQLDLKSNEVNEEKRKLADSNN-----EEKNLKQNLELIELKSQLQHIeSEISRLDVQNAEAERDKYQE 1062
Cdd:PRK01156 335 QKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSylksiESLKKKIEEYSKNIERMSAFI-SEILKIQEIDPDAIKKELNE 413
|
250 260 270
....*....|....*....|....*....|
gi 6324079 1063 ESLRLRTRFEKLSSENAgKLGEMKQLQNQI 1092
Cdd:PRK01156 414 INVKLQDISSKVSSLNQ-RIRALRENLDEL 442
|
|
| RloC |
COG4694 |
Wobble nucleotide-excising tRNase [Translation, ribosomal structure and biogenesis]; |
8-77 |
3.85e-03 |
|
Wobble nucleotide-excising tRNase [Translation, ribosomal structure and biogenesis];
Pssm-ID: 443729 [Multi-domain] Cd Length: 692 Bit Score: 41.65 E-value: 3.85e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 8 SIQGIRSFDSNDreTIEFGKPLTLIVGMNGSGKTTIIECLKYATTGDLPPNSKGGVFIHDPKITGEKDIR 77
Cdd:COG4694 7 KLKNVGAFKDFG--WLAFFKKLNLIYGENGSGKSTLSRILRSLELGDTSSEVIAEFEIEAGGSAPNPSVR 74
|
|
| AAA_13 |
pfam13166 |
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ... |
537-929 |
4.01e-03 |
|
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.
Pssm-ID: 463796 [Multi-domain] Cd Length: 712 Bit Score: 41.58 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 537 IKKSINTKLDELQKITEKLQNdsrirqvfpltQEFQRADLEMDFQKLFINMQKNIAINNKKMHelDRRYTNAL----YNL 612
Cdd:pfam13166 94 IQEKIAKLKKEIKDHEEKLDA-----------AEANLQKLDKEKEKLEADFLDECWKKIKRKK--NSALSEALngfkYEA 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 613 NTIEKDLQDNQKS-------------KEKVIQLLSENLPE----DCTIDEYNDVLEETELSYKT-----ALENLKMHQTT 670
Cdd:pfam13166 161 NFKSRLLREIEKDnfnagvllsdedrKAALATVFSDNKPEiaplTFNVIDFDALEKAEILIQKVigkssAIEELIKNPDL 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 671 LEFNRKALEIAERDS-CCYLCsrkfenesfKSKLLQELKTKTDANFEKTLKDTVQNEKEylhslrLLEKHIITLNSINEK 749
Cdd:pfam13166 241 ADWVEQGLELHKAHLdTCPFC---------GQPLPAERKAALEAHFDDEFTEFQNRLQK------LIEKVESAISSLLAQ 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 750 IDNsqkclekakeeTKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKftyLEKELKDLENSSKTISEELSIYNTSEDg 829
Cdd:pfam13166 306 LPA-----------VSDLASLLSAFELDVEDIESEAEVLNSQLDGLRRA---LEAKRKDPFKSIELDSVDAKIESINDL- 370
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 830 IQTVDELRDQQRKMNDSLRElRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKREnindi 909
Cdd:pfam13166 371 VASINELIAKHNEITDNFEE-EKNKAKKKLRLHLVEEFKSEIDEYKDKYAGLEKAINSLEKEIKNLEAEIKKLRE----- 444
|
410 420
....*....|....*....|
gi 6324079 910 dsRVKELEARIISLKNKKDE 929
Cdd:pfam13166 445 --EIKELEAQLRDHKPGADE 462
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
201-399 |
4.11e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.97 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 201 DIKLLKQSVEHLKLDKDRSKAmklNIHQLQTKIDQYNEEVSEIESQLNEITEKSDKLfksNQDFQKILSKVENLKNTkls 280
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQA---ELDALQAELEELNEEYNELQAELEALQAEIDKL---QAEIAEAEAEIEERREE--- 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 281 ISDQVKRL---SNSIDILD--LSKPDLQNLLANFSKV--LMDKNNQ-LRDLETDISSLKDRQSSLQSLsnslirrQGELE 352
Cdd:COG3883 88 LGERARALyrsGGSVSYLDvlLGSESFSDFLDRLSALskIADADADlLEELKADKAELEAKKAELEAK-------LAELE 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 6324079 353 AGKETYEKNRNHLSSLKEAFQHKFQGLSNIENSDMAQVNHEMSQFKA 399
Cdd:COG3883 161 ALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAA 207
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
444-852 |
4.17e-03 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 41.36 E-value: 4.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 444 RSKLIHDSEELAEKLKSFKSLSTQDSLNhELENLK---TYKEKLQSWESE------NIIPKLNQKIEEknnemiiLENQI 514
Cdd:PRK04778 24 RKRNYKRIDELEERKQELENLPVNDELE-KVKKLNltgQSEEKFEEWRQKwdeivtNSLPDIEEQLFE-------AEELN 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 515 EKFqdRIMKTNQQ-ADLYAKLGLIKKSINTKLDELQKITEK-LQNDSRIRQVFPLTQEFQRADLEMDFQ-----KLFINM 587
Cdd:PRK04778 96 DKF--RFRKAKHEiNEIESLLDLIEEDIEQILEELQELLESeEKNREEVEQLKDLYRELRKSLLANRFSfgpalDELEKQ 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 588 QKNIAINNKKMHEL--DRRYTNALYNLNTIEKDLQDNQKSKEKVIQLLSEnlpedcTIDEYNDVLEETELSYKTALENlK 665
Cdd:PRK04778 174 LENLEEEFSQFVELteSGDYVEAREILDQLEEELAALEQIMEEIPELLKE------LQTELPDQLQELKAGYRELVEE-G 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 666 MHQTTLEFNRKALEI-AERDSCcylcsrkfenesfkSKLLQELKTKtdaNFEKTLKDtVQNEKEYLHSLrlLEKHIITLN 744
Cdd:PRK04778 247 YHLDHLDIEKEIQDLkEQIDEN--------------LALLEELDLD---EAEEKNEE-IQERIDQLYDI--LEREVKARK 306
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 745 SINEKIDNSQKCLEKAKEETKTSKSKLDELevdstklKDEKELAESEIrpliEKFTYLEKELKDLENSSKTISEEL---- 820
Cdd:PRK04778 307 YVEKNSDTLPDFLEHAKEQNKELKEEIDRV-------KQSYTLNESEL----ESVRQLEKQLESLEKQYDEITERIaeqe 375
|
410 420 430
....*....|....*....|....*....|....*.
gi 6324079 821 ----SIYNTSEDGIQTVDELRDQQRKMNDSLRELRK 852
Cdd:PRK04778 376 iaysELQEELEEILKQLEEIEKEQEKLSEMLQGLRK 411
|
|
| COG4938 |
COG4938 |
Predicted ATPase [General function prediction only]; |
4-44 |
4.41e-03 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443965 [Multi-domain] Cd Length: 277 Bit Score: 40.72 E-value: 4.41e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 6324079 4 IYKLSIQGIRSFDSndrETIEFgKPLTLIVGMNGSGKTTII 44
Cdd:COG4938 1 IKSISIKNFGPFKE---AELEL-KPLTLLIGPNGSGKSTLI 37
|
|
| ABC_Mj1267_LivG_branched |
cd03219 |
ATP-binding cassette component of branched chain amino acids transport system; The Mj1267/LivG ... |
29-79 |
4.56e-03 |
|
ATP-binding cassette component of branched chain amino acids transport system; The Mj1267/LivG ABC transporter subfamily is involved in the transport of the hydrophobic amino acids leucine, isoleucine and valine. MJ1267 is a branched-chain amino acid transporter with 29% similarity to both the LivF and LivG components of the E. coli branched-chain amino acid transporter. MJ1267 contains an insertion from residues 114 to 123 characteristic of LivG (Leucine-Isoleucine-Valine) homologs. The branched-chain amino acid transporter from E. coli comprises a heterodimer of ABCs (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ).
Pssm-ID: 213186 [Multi-domain] Cd Length: 236 Bit Score: 40.11 E-value: 4.56e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 6324079 29 LTLIVGMNGSGKTTIIECLkyatTGDLPPnSKGGVFIHDPKITGEK-DIRAQ 79
Cdd:cd03219 28 IHGLIGPNGAGKTTLFNLI----SGFLRP-TSGSVLFDGEDITGLPpHEIAR 74
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
843-1125 |
5.12e-03 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 41.15 E-value: 5.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 843 MNDSLRELRKTISDL-------QMEK--DEKVRENSRMINLIkekELTVSEIESSL-TQKQNIDDSIRSKRENINDIDSR 912
Cdd:PHA02562 145 MQLSAPARRKLVEDLldisvlsEMDKlnKDKIRELNQQIQTL---DMKIDHIQQQIkTYNKNIEEQRKKNGENIARKQNK 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 913 VKELEARIISLKNKKDEAQSVLDKVknerdiqVRNKQKTVADINRLIDRFQTIYNEVVDF--EAKGFDE----------L 980
Cdd:PHA02562 222 YDELVEEAKTIKAEIEELTDELLNL-------VMDIEDPSAALNKLNTAAAKIKSKIEQFqkVIKMYEKggvcptctqqI 294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 981 QTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNEEKNLKqnleliELKSQLQHIESEISRLDVQNaeaerdky 1060
Cdd:PHA02562 295 SEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLL------ELKNKISTNKQSLITLVDKA-------- 360
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 6324079 1061 qeesLRLRTRFEKLSSENAGKLGEMKQLQNQIDSLThqlrTDYKDIEKnyhkewvELQTRSFVTD 1125
Cdd:PHA02562 361 ----KKVKAAIEELQAEFVDNAEELAKLQDELDKIV----KTKSELVK-------EKYHRGIVTD 410
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
715-1030 |
5.13e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 41.26 E-value: 5.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 715 FEKTLKDTVQNEKEYLhsLRLLEKHiitlnsinEKIDNSQKCLEKAKEETKTSKSKLDEL------EVDSTKLKDEK--- 785
Cdd:pfam17380 293 FEKMEQERLRQEKEEK--AREVERR--------RKLEEAEKARQAEMDRQAAIYAEQERMamererELERIRQEERKrel 362
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 786 -ELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSiyntSEDGIQTVDELRdqQRKMNDSLRElrktisdlqMEKDEK 864
Cdd:pfam17380 363 eRIRQEEIAMEISRMRELERLQMERQQKNERVRQELE----AARKVKILEEER--QRKIQQQKVE---------MEQIRA 427
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 865 VRENSRMINLIKEKELTVSEIESSLTQKQNIDDSIRSKRENINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERD-- 942
Cdd:pfam17380 428 EQEEARQREVRRLEEERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKqa 507
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 943 -IQVRNKQKTVAdiNRLIDRFQTIYnevvdfeakgfDELQTTIKELELNKAQMLELKEQLDLKSNEVNEEKRKLADSNNE 1021
Cdd:pfam17380 508 mIEEERKRKLLE--KEMEERQKAIY-----------EEERRREAEEERRKQQEMEERRRIQEQMRKATEERSRLEAMERE 574
|
....*....
gi 6324079 1022 EKNLKQNLE 1030
Cdd:pfam17380 575 REMMRQIVE 583
|
|
| ABCC_Protease_Secretion |
cd03246 |
ATP-binding cassette domain of PrtD, subfamily C; This family represents the ABC component of ... |
1205-1272 |
5.23e-03 |
|
ATP-binding cassette domain of PrtD, subfamily C; This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substrates ranges from inorganic ions, nutrients such as amino acids, sugars, or peptides, hydrophobic drugs, to large polypeptides, such as HlyA.
Pssm-ID: 213213 [Multi-domain] Cd Length: 173 Bit Score: 39.51 E-value: 5.23e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 6324079 1205 SAGQKVlasiiiRLALSETFGANCGVIALDEPTTNLDEENIESLaksLHNIINMRRHQKNfqLIVITH 1272
Cdd:cd03246 98 SGGQRQ------RLGLARALYGNPRILVLDEPNSHLDVEGERAL---NQAIAALKAAGAT--RIVIAH 154
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
757-1077 |
5.95e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 40.65 E-value: 5.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 757 LEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEELSIYNTSedgiqtVDEL 836
Cdd:pfam07888 82 VAELKEELRQSREKHEELEEKYKELSASSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETE------LERM 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 837 RDQQRKMNDSLRELRKTISDLQMEKDEKVRENSRMINLIKEKELTVSEIESSLTQKQnidDSIRSKRENINDIDSRVKEL 916
Cdd:pfam07888 156 KERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQ---DTITTLTQKLTTAHRKEAEN 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 917 EARIISLKNKKDEAQSVLDKV---KNERDIQVRNKQKTVADINRLIDRFQTIYNEVVDF-------EAKGFDELQTTIKE 986
Cdd:pfam07888 233 EALLEELRSLQERLNASERKVeglGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADAslalregRARWAQERETLQQS 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 987 LELNKAQMLELKEQLDLKSNEVNEE---KRKLADSNNEEK--NLKQ----NLELIELKSQLQHIESEISRLdvqnaEAER 1057
Cdd:pfam07888 313 AEADKDRIEKLSAELQRLEERLQEErmeREKLEVELGREKdcNRVQlsesRRELQELKASLRVAQKEKEQL-----QAEK 387
|
330 340
....*....|....*....|
gi 6324079 1058 DKYQEESLRLRTRFEKLSSE 1077
Cdd:pfam07888 388 QELLEYIRQLEQRLETVADA 407
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
818-962 |
7.40e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 40.61 E-value: 7.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 818 EELSIYNTSEDGIQTVDELRDQQRKMNDSLRELRKTISDLQMEKDEKVREnsrminlIKEKELTVSEIessltqKQNIDD 897
Cdd:COG2433 393 EEPEAEREKEHEERELTEEEEEIRRLEEQVERLEAEVEELEAELEEKDER-------IERLERELSEA------RSEERR 459
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 6324079 898 SIRSKREnINDIDSRVKELEARIISLKNKKDEAQSVLDKVKNERDIQVRNK--------QKTVADINRLIDRF 962
Cdd:COG2433 460 EIRKDRE-ISRLDREIERLERELEEERERIEELKRKLERLKELWKLEHSGElvpvkvveKFTKEAIRRLEEEY 531
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
757-1068 |
9.15e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 40.51 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 757 LEKAKEETKTSKSKLDELEVDSTKLKDEKELAESEIRPLIEKFTYLEKELKDLENSSKTISEElsiyntsedgIQTVDEL 836
Cdd:PTZ00121 1652 LKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE----------KKKAEEL 1721
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 837 RDQQRKMNDSLRELRKtisdlQMEKDEKvrensrminliKEKELTVSEIESSltQKQNIDDSIRSKRENINDIDSRVKEL 916
Cdd:PTZ00121 1722 KKAEEENKIKAEEAKK-----EAEEDKK-----------KAEEAKKDEEEKK--KIAHLKKEEEKKAEEIRKEKEAVIEE 1783
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6324079 917 EARIISLKNKKDEAQSVLDKVKNERDIQVRNKQKTVAdINRLIDRFQTIYNEVVDfeakgfdelqttIKELELNKAQmlE 996
Cdd:PTZ00121 1784 ELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLV-INDSKEMEDSAIKEVAD------------SKNMQLEEAD--A 1848
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 6324079 997 LKEQLDLKSNEVNEEKRKLADSNNEEKNLKQNLELIELKSQLQHI-----ESEISRLDV--QNAEAERDKYQEESLRLR 1068
Cdd:PTZ00121 1849 FEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIdkddiEREIPNNNMagKNNDIIDDKLDKDEYIKR 1927
|
|
|