|
Name |
Accession |
Description |
Interval |
E-value |
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
370-726 |
5.72e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 72.78 E-value: 5.72e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEENhhstAEELQATLQELSDQQqmvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQE 449
Cdd:TIGR02168 679 IEELEEKIEELEEK----IAELEKALAELRKEL---EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQ 751
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 450 RvkNEEPTTQEGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHN 529
Cdd:TIGR02168 752 L--SKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 530 NNMMAKTLEECRVTLEGLKMENGSLK--------SHLQGEKQKATEASAVEQ-TAESCEVQEMLKVARAEKDLLELSCNE 600
Cdd:TIGR02168 826 LESLERRIAATERRLEDLEEQIEELSedieslaaEIEELEELIEELESELEAlLNERASLEEALALLRSELEELSEELRE 905
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 601 LRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS-RTSLKLQ---EKASESDAEIKDMKETIFELEDQVEQHR 676
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSeEYSLTLEeaeALENKIEDDEEEARRRLKRLENKIKELG 985
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1034602044 677 AVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWR-RFQA 726
Cdd:TIGR02168 986 PVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREAReRFKD 1036
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
367-771 |
1.92e-11 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 67.74 E-value: 1.92e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEEnhhstaeeLQATLQELSDQQQmvqELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:TIGR04523 200 ELLLSNLKKKIQKNKS--------LESQISELKKQNN---QLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQErvKNEEPTTQEGKIIELEQKCTGI-LEQGRFEREKLLNIQQQLTCSLRKVEEENQGA-------LEMIKRLKEENEK 518
Cdd:TIGR04523 269 LSE--KQKELEQNNKKIKELEKQLNQLkSEISDLNNQKEQDWNKELKSELKNQEKKLEEIqnqisqnNKIISQLNEQISQ 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKAT--EASAVEQTAESCEVQEMLKVARAEKDLLEL 596
Cdd:TIGR04523 347 LKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINdlESKIQNQEKLNQQKDEQIKKLQQEKELLEK 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 597 SCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTS------------------------LKLQEKAS 652
Cdd:TIGR04523 427 EIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSrsinkikqnleqkqkelkskekelKKLNEEKK 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 653 ESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSdlERQLKTLTKQMKEETEEWRRFQADLQTAV 732
Cdd:TIGR04523 507 ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQ 584
|
410 420 430
....*....|....*....|....*....|....*....
gi 1034602044 733 VVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:TIGR04523 585 EEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAK 623
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
462-767 |
1.92e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 64.69 E-value: 1.92e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 462 KIIELEQKCTGI---LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLE 538
Cdd:TIGR02168 678 EIEELEEKIEELeekIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 539 ECRVTLEGLKMENGSLKSHLQgekqkateasavEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSL 618
Cdd:TIGR02168 758 ELEAEIEELEERLEEAEEELA------------EAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 619 LAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEE 698
Cdd:TIGR02168 826 LESLERRIAATER----RLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044 699 QKSDLERQLKTLTKQMKEETEEWRRFQADLQtavvvandikcEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR02168 902 ELRELESKRSELRRELEELREKLAQLELRLE-----------GLEVRIDNLQERLSEEYSLTLEEAEAL 959
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
509-767 |
1.60e-09 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 61.49 E-value: 1.60e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 IKRLKEENEKLNEFLELErhnnnmmaKTLEECRVTLEGLKMEngslksHLQGEKQKAtEASAVEQTAESCEVQEMLKVAR 588
Cdd:COG1196 202 LEPLERQAEKAERYRELK--------EELKELEAELLLLKLR------ELEAELEEL-EAELEELEAELEELEAELAELE 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFEL 668
Cdd:COG1196 267 AELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEE----RRRELEERLEELEEELAELEEELEELEEELEEL 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 669 EDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKcEAQQELRT 748
Cdd:COG1196 343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALL-ERLERLEE 421
|
250
....*....|....*....
gi 1034602044 749 VKRKLLEEEEKNARLQKEL 767
Cdd:COG1196 422 ELEELEEALAELEEEEEEE 440
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
370-638 |
3.97e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 60.46 E-value: 3.97e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEENHHSTAEELQATLQELSD-------QQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEK 442
Cdd:TIGR02168 241 LEELQEELKEAEEELEELTAELQELEEKLEElrlevseLEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 443 LMNLLQE-----RVKNEEPTTQEGKIIELEQKCTGILEQGrferEKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENE 517
Cdd:TIGR02168 321 LEAQLEElesklDELAEELAELEEKLEELKEELESLEAEL----EELEAELEELESRLEELEEQLETLRSKVAQLELQIA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 518 KLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELS 597
Cdd:TIGR02168 397 SLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1034602044 598 CNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAE 638
Cdd:TIGR02168 477 LDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSG 517
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
495-777 |
5.00e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 60.07 E-value: 5.00e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 495 LRKVEEENQGALeMIKRLKEENEKLNEFLELERHNNNMM---AKTLEECRVTLEGLKMENGSLKSHLQgEKQKATEASAV 571
Cdd:TIGR02168 218 LKAELRELELAL-LVLRLEELREELEELQEELKEAEEELeelTAELQELEEKLEELRLEVSELEEEIE-ELQKELYALAN 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 572 EQTaescEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQE-- 649
Cdd:TIGR02168 296 EIS----RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEle 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 650 -KASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQ--------MKEETEE 720
Cdd:TIGR02168 372 sRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKelqaeleeLEEELEE 451
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1034602044 721 WRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVV 777
Cdd:TIGR02168 452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGV 508
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
367-627 |
5.20e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.87 E-value: 5.20e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:COG1196 252 EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEE 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQERVKNEEptTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLtcsLRKVEEENQGALEMIKRLKEENEKLNEFLELE 526
Cdd:COG1196 332 LEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAEL---AEAEEELEELAEELLEALRAAAELAAQLEELE 406
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 527 RHNNNmMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELscNELRQELL 606
Cdd:COG1196 407 EAEEA-LLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL--EAALAELL 483
|
250 260
....*....|....*....|.
gi 1034602044 607 KANGEIKHVSSLLAKVEKDYS 627
Cdd:COG1196 484 EELAEAAARLLLLLEAEADYE 504
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
367-772 |
9.98e-08 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 55.82 E-value: 9.98e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHH---STAEELQATLQELSDQQQMVQELTAENEKLVD-------EKTILETSFHQHRERAEQL 436
Cdd:PRK02224 212 ESELAELDEEIERYEEQREqarETRDEADEVLEEHEERREELETLEAEIEDLREtiaeterEREELAEEVRDLRERLEEL 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 437 SQENEKLmnLLQERVKNEEPTTQEGKIIELEQKCTGI---LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLK 513
Cdd:PRK02224 292 EEERDDL--LAEAGLDDADAEAVEARREELEDRDEELrdrLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELE 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 514 EENEKLNEFLELERHNNNMMAKTLEECR-------VTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESC--EVQEML 584
Cdd:PRK02224 370 SELEEAREAVEDRREEIEELEEEIEELRerfgdapVDLGNAEDFLEELREERDELREREAELEATLRTARERveEAEALL 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 585 KV---------------------ARAEKDLLELSCNELRQELLKANGEIKHVSSLlAKVEKDYSYLKE-------ICDHQ 636
Cdd:PRK02224 450 EAgkcpecgqpvegsphvetieeDRERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEErredleeLIAER 528
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 637 AEQLSRTSLKLQEK---ASESDAEIKDMKETIFELEDQVEQHR-AVKLHNNQL---------ISELESSVIKLEEQKSDL 703
Cdd:PRK02224 529 RETIEEKRERAEELrerAAELEAEAEEKREAAAEAEEEAEEAReEVAELNSKLaelkeriesLERIRTLLAAIADAEDEI 608
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 704 ER------QLKTLTKQMKEETEEWRRFQADLQ-----TAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQG 772
Cdd:PRK02224 609 ERlrekreALAELNDERRERLAEKRERKRELEaefdeARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVEN 688
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
376-765 |
1.04e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.84 E-value: 1.04e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 376 KIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETsFHQHRERAEQLSQENEKlmnlLQERVKNEE 455
Cdd:PRK03918 311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE-RHELYEEAKAKKEELER----LKKRLTGLT 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 456 PTTQEGKIIELEQKCTGIleqgrfeREKLLNIQQQLTcSLRKVEEENQGALEMIKRLKEENEKLNEFLElERHNNNMMA- 534
Cdd:PRK03918 386 PEKLEKELEELEKAKEEI-------EEEISKITARIG-ELKKEIKELKKAIEELKKAKGKCPVCGRELT-EEHRKELLEe 456
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 535 ---------KTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELScnELRQEL 605
Cdd:PRK03918 457 ytaelkrieKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYE--KLKEKL 534
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 606 LKANGEIKHVSSLLAKV---EKDYSYLKEICDHQAEQLSRTSLKLQEKASESdaeIKDMKETIFELE----------DQV 672
Cdd:PRK03918 535 IKLKGEIKSLKKELEKLeelKKKLAELEKKLDELEEELAELLKELEELGFES---VEELEERLKELEpfyneylelkDAE 611
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 673 EQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLT-----------KQMKEETEEWRRFQADLQTAVVVANDIKCE 741
Cdd:PRK03918 612 KELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEelekkyseeeyEELREEYLELSRELAGLRAELEELEKRREE 691
|
410 420
....*....|....*....|....
gi 1034602044 742 AQQELRTVKRKLLEEEEKNARLQK 765
Cdd:PRK03918 692 IKKTLEKLKEELEEREKAKKELEK 715
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
356-720 |
1.43e-07 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.46 E-value: 1.43e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 356 STAGSSPNSVSEL-SLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAE 434
Cdd:TIGR02169 661 APRGGILFSRSEPaELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLE 740
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 435 QLSqenEKLMNLLQERVKNE-EPTTQEGKIIELEQKCTGI-LEQGRFEREKLLNIQQQLTCSLRKVEEENQgalEMIKRL 512
Cdd:TIGR02169 741 ELE---EDLSSLEQEIENVKsELKELEARIEELEEDLHKLeEALNDLEARLSHSRIPEIQAELSKLEEEVS---RIEARL 814
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 513 KEENEKLNEfLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKaTEASAVEQTAESCEVQEMLKVARAEKD 592
Cdd:TIGR02169 815 REIEQKLNR-LTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE-LEEELEELEAALRDLESRLGDLKKERD 892
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 593 LLELSCNELRQELLKANGEI----KHVSSLLAKVEKDYSYLKEICDHQAEQLSrtslklqekASESDAEIKDMKETIFEL 668
Cdd:TIGR02169 893 ELEAQLRELERKIEELEAQIekkrKRLSELKAKLEALEEELSEIEDPKGEDEE---------IPEEELSLEDVQAELQRV 963
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 1034602044 669 EDQVEQHRAVklhNNQLISELESSVIKLEEQKSDLERqLKTLTKQMKEETEE 720
Cdd:TIGR02169 964 EEEIRALEPV---NMLAIQEYEEVLKRLDELKEKRAK-LEEERKAILERIEE 1011
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
374-768 |
3.72e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 53.91 E-value: 3.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 374 TEKIQKMEENHHstaEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhQHRERAEQLSQENEKL---MNLLQER 450
Cdd:PRK03918 188 TENIEELIKEKE---KELEEVLREINEISSELPELREELEKLEKEVKELE----ELKEEIEELEKELESLegsKRKLEEK 260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 451 VKN------------EEPTTQEGKIIELEQKCTGILEQGRFeREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEK 518
Cdd:PRK03918 261 IREleerieelkkeiEELEEKVKELKELKEKAEEYIKLSEF-YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEER 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNNNMMAKtLEECRVTLEGLKMENGSLKSHLQGEKQKATEasaveqtaescEVQEMLKVARAEKDLLELSC 598
Cdd:PRK03918 340 LEELKKKLKELEKRLEE-LEERHELYEEAKAKKEELERLKKRLTGLTPE-----------KLEKELEELEKAKEEIEEEI 407
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 599 NELRQELLKANGEIKHVSSLLAKVEKdysyLKEICDHQAEQLSRTSLKlqEKASESDAEIKDMKETIFELEDQVEQHRAV 678
Cdd:PRK03918 408 SKITARIGELKKEIKELKKAIEELKK----AKGKCPVCGRELTEEHRK--ELLEEYTAELKRIEKELKEIEEKERKLRKE 481
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 679 KLHNNQLISElESSVIKLE---EQKSDLERQLKTLT-KQMKEETEEWRRFQADLQT----AVVVANDIK--CEAQQELRT 748
Cdd:PRK03918 482 LRELEKVLKK-ESELIKLKelaEQLKELEEKLKKYNlEELEKKAEEYEKLKEKLIKlkgeIKSLKKELEklEELKKKLAE 560
|
410 420
....*....|....*....|
gi 1034602044 749 VKRKLLEEEEKNARLQKELG 768
Cdd:PRK03918 561 LEKKLDELEEELAELLKELE 580
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
364-766 |
3.74e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 53.87 E-value: 3.74e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 364 SVSELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQL-SQENEK 442
Cdd:TIGR04523 120 NKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIkNKLLKL 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 443 LMNLLQERVKNEEPTTQEGKIIELEQKctgileqgrfeREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEF 522
Cdd:TIGR04523 200 ELLLSNLKKKIQKNKSLESQISELKKQ-----------NNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 523 L-----ELERHNNNMMAKT--LEECRVTLEGLKME-----NGSLKSHLQGEKQKATEA-SAVEQTAESC-EVQEMLKVAR 588
Cdd:TIGR04523 269 LsekqkELEQNNKKIKELEkqLNQLKSEISDLNNQkeqdwNKELKSELKNQEKKLEEIqNQISQNNKIIsQLNEQISQLK 348
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVssllaKVEKDySYLKEICD--HQAEQLSRTSLKLQEKASESDAEIKDMKETIF 666
Cdd:TIGR04523 349 KELTNSESENSEKQRELEEKQNEIEKL-----KKENQ-SYKQEIKNleSQINDLESKIQNQEKLNQQKDEQIKKLQQEKE 422
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 667 ELEDQVEQHRAVKLHNNQLISELES-------SVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVvvandik 739
Cdd:TIGR04523 423 LLEKEIERLKETIIKNNSEIKDLTNqdsvkelIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKE------- 495
|
410 420
....*....|....*....|....*..
gi 1034602044 740 ceaqQELRTVKRKLLEEEEKNARLQKE 766
Cdd:TIGR04523 496 ----KELKKLNEEKKELEEKVKDLTKK 518
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
506-758 |
9.19e-07 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 9.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 506 LEMIKRLKEENEKLNEFL--ELERHNNNMMAKT--LEECRVTLEGLKMENGSLKSHLQGEKQKATEA----SAVEQTAEs 577
Cdd:TIGR02169 676 LQRLRERLEGLKRELSSLqsELRRIENRLDELSqeLSDASRKIGEIEKEIEQLEQEEEKLKERLEELeedlSSLEQEIE- 754
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 578 cEVQEMLKVARAEKDLLELSCNELRQELLK-----ANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLK---LQE 649
Cdd:TIGR02169 755 -NVKSELKELEARIEELEEDLHKLEEALNDlearlSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEkeyLEK 833
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 650 KASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQ 729
Cdd:TIGR02169 834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913
|
250 260 270
....*....|....*....|....*....|..
gi 1034602044 730 TAVVVANDIKC---EAQQELRTVKRKLLEEEE 758
Cdd:TIGR02169 914 KKRKRLSELKAkleALEEELSEIEDPKGEDEE 945
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
366-773 |
1.85e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 51.71 E-value: 1.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 366 SELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKL-- 443
Cdd:pfam01576 24 AESELKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEERSQQLQNEKKKMqq 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 444 -MNLLQERVKNEEPTTQ---------EGKIIELEQKCTGILEQ-GRFEREKLLNIQQQLTCSLRKVEEENQGalEMIKRL 512
Cdd:pfam01576 104 hIQDLEEQLDEEEAARQklqlekvttEAKIKKLEEDILLLEDQnSKLSKERKLLEERISEFTSNLAEEEEKA--KSLSKL 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 513 KEENEKLNEFLELERHNNNMMAKTLEECRVTLEG------------------LKMENGSLKSHLQGEKQKATEASAV--- 571
Cdd:pfam01576 182 KNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGestdlqeqiaelqaqiaeLRAQLAKKEEELQAALARLEEETAQknn 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 572 ------EQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKdysyLKEICDHQAEQLSRTsl 645
Cdd:pfam01576 262 alkkirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTLDTTAAQQE----LRSKREQEVTELKKA-- 335
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 646 kLQEKASESDAEIKDMKE----TIFELEDQVEQHRAVKlhnnqliSELESSVIKLEEQKSDLERQLKTLTKQMKEETEEW 721
Cdd:pfam01576 336 -LEEETRSHEAQLQEMRQkhtqALEELTEQLEQAKRNK-------ANLEKAKQALESENAELQAELRTLQQAKQDSEHKR 407
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034602044 722 RRFQADLQTAVVVANDIK----------CEAQQELRTVKRKLLEEEEKNARLQKELGDVQGH 773
Cdd:pfam01576 408 KKLEGQLQELQARLSESErqraelaeklSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQ 469
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
370-764 |
1.90e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 51.31 E-value: 1.90e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEENH---HSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILET--SFHQHRERAEQLSQENEKLM 444
Cdd:COG4717 73 LKELEEELKEAEEKEeeyAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLyqELEALEAELAELPERLEELE 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 445 NLLQERVKNEEP-TTQEGKIIELEQKCTGILEQGRFEREKLLniqQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEfl 523
Cdd:COG4717 153 ERLEELRELEEElEELEAELAELQEELEELLEQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELEELEE-- 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 524 ELERHNNNMMAKTLEEC-------------RVTLEGLKMENGSLKSHLQG----------------EKQKATEASAVEQT 574
Cdd:COG4717 228 ELEQLENELEAAALEERlkearlllliaaaLLALLGLGGSLLSLILTIAGvlflvlgllallflllAREKASLGKEAEEL 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 575 AESCEVQEMLKVARAE-KDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYS--YLKEICDHQAEQLSR---TSLKLQ 648
Cdd:COG4717 308 QALPALEELEEEELEElLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEelQLEELEQEIAALLAEagvEDEEEL 387
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 649 EKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELEssvikLEEQKSDLERQLKTLTKQMKEETEEWRRFQADL 728
Cdd:COG4717 388 RAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEE-----LEEELEELEEELEELEEELEELREELAELEAEL 462
|
410 420 430
....*....|....*....|....*....|....*.
gi 1034602044 729 QTAvvVANDIKCEAQQELRTVKRKLLEEEEKNARLQ 764
Cdd:COG4717 463 EQL--EEDGELAELLQELEELKAELRELAEEWAALK 496
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
431-789 |
2.06e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.48 E-value: 2.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 431 ERAEQLSQENEKL---MNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTcSLRKVEEENQGALE 507
Cdd:COG1196 213 ERYRELKEELKELeaeLLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELE-ELELELEEAQAEEY 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 508 MIKRLKEENEKLNEFLELERHNNnmmaktleecRVTLEGLKMENGSLKSHLQGEKQKATEAsaveqtaescevQEMLKVA 587
Cdd:COG1196 292 ELLAELARLEQDIARLEERRREL----------EERLEELEEELAELEEELEELEEELEEL------------EEELEEA 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 588 RAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFE 667
Cdd:COG1196 350 EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEE 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 668 LEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIK-CEAQQEL 746
Cdd:COG1196 426 LEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLeAEADYEG 505
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1034602044 747 RTVKRKLLEEEEKNARLQKELGDVQGHGRVVTsRAAPPSLGSV 789
Cdd:COG1196 506 FLEGVKAALLLAGLRGLAGAVAVLIGVEAAYE-AALEAALAAA 547
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
579-780 |
2.13e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.45 E-value: 2.13e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 579 EVQEMLKVARAEKDLLELSCnELRQELLKANGEIKHVSSLLAKV-----EKDYSYLKEICDHQAEQLSRtslkLQEKASE 653
Cdd:COG4913 239 RAHEALEDAREQIELLEPIR-ELAERYAAARERLAELEYLRAALrlwfaQRRLELLEAELEELRAELAR----LEAELER 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 654 SDAEIKDMKETIFELEDQVEQHRAVKLHN-NQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAV 732
Cdd:COG4913 314 LEARLDALREELDELEAQIRGNGGDRLEQlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALL 393
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 1034602044 733 VVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSR 780
Cdd:COG4913 394 EALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPAR 441
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
367-651 |
8.48e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 49.55 E-value: 8.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEEnhhsTAEELQATLQELSDQqqmVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:COG1196 238 EAELEELEAELEELEA----ELEELEAELAELEAE---LEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEER 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQERVKNEEptTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELE 526
Cdd:COG1196 311 RRELEERLE--ELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 527 ---RHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELScNELRQ 603
Cdd:COG1196 389 leaLRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALL-ELLAE 467
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1034602044 604 ELLKANGEIKHVSSLLAKVEKDYS--YLKEICDHQAEQLSRTSLKLQEKA 651
Cdd:COG1196 468 LLEEAALLEAALAELLEELAEAAArlLLLLEAEADYEGFLEGVKAALLLA 517
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
370-767 |
9.36e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 49.20 E-value: 9.36e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKmeeNHHSTAEELQATLQELSD-QQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENE------K 442
Cdd:TIGR00618 181 LALMEFAKKK---SLHGKAELLTLRSQLLTLcTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREaqeeqlK 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 443 LMNLLQERVKNEEPTTQEGKIIELEQKCTGI-------------LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:TIGR00618 258 KQQLLKQLRARIEELRAQEAVLEETQERINRarkaaplaahikaVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 510 KRLKEENEKLNEFLELERHNnnmmAKTLEECRVTLEGLKMENgSLKSHLQGEKQKATEASAVEQTAesCEVQEMLKVARA 589
Cdd:TIGR00618 338 SSIEEQRRLLQTLHSQEIHI----RDAHEVATSIREISCQQH-TLTQHIHTLQQQKTTLTQKLQSL--CKELDILQREQA 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 590 EKDLLELSCNELRQELLKANGEI----KHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQ-----EKASESDAEIKD 660
Cdd:TIGR00618 411 TIDTRTSAFRDLQGQLAHAKKQQelqqRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQqlqtkEQIHLQETRKKA 490
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 661 MKETIfeLEDQVEQHRAVK---LHNNQ-------------LISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRF 724
Cdd:TIGR00618 491 VVLAR--LLELQEEPCPLCgscIHPNParqdidnpgpltrRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEI 568
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 1034602044 725 QADLQTAVVVANDIKCEAQQeLRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR00618 569 QQSFSILTQCDNRSKEDIPN-LQNITVRLQDLTEKLSEAEDML 610
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
507-767 |
1.63e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 48.58 E-value: 1.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 507 EMIKRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATE-ASAVEQTAESCEVQEMLK 585
Cdd:pfam15921 82 EYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDlRNQLQNTVHELEAAKCLK 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 586 varaeKDLLELS---CNELRQELLKANGEIKHVSSLLAKVEKDYSylKEIcdHQAEQLSRTSLK-----LQEKASESDAE 657
Cdd:pfam15921 162 -----EDMLEDSntqIEQLRKMMLSHEGVLQEIRSILVDFEEASG--KKI--YEHDSMSTMHFRslgsaISKILRELDTE 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 658 IKDMKETIFELEDQVEQHRA---------VKLHNN---QLISELESSVIKLEE-------QKSDLERQLKTLTKQMKEET 718
Cdd:pfam15921 233 ISYLKGRIFPVEDQLEALKSesqnkiellLQQHQDrieQLISEHEVEITGLTEkassarsQANSIQSQLEIIQEQARNQN 312
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1034602044 719 EEWRRFQADLQTAVvvandikCEAQQELRTVKRKLleeEEKNARLQKEL 767
Cdd:pfam15921 313 SMYMRQLSDLESTV-------SQLRSELREAKRMY---EDKIEELEKQL 351
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
562-788 |
2.60e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 47.13 E-value: 2.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 562 KQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS 641
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 642 RTSlklqekASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEw 721
Cdd:COG3883 97 RSG------GSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAA- 169
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034602044 722 rrfQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSRAAPPSLGS 788
Cdd:COG3883 170 ---KAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAA 233
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
388-773 |
3.75e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 47.27 E-value: 3.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 388 AEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhqhreRAEQLSQENEKLMNLLQErvkneEPTTQEGKIIELE 467
Cdd:TIGR00618 448 TCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHL--------QETRKKAVVLARLLELQE-----EPCPLCGSCIHPN 514
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 468 QKCTGILEQGRFER--EKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKL-NEFLELERHNN------NMMAKTLE 538
Cdd:TIGR00618 515 PARQDIDNPGPLTRrmQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIqQSFSILTQCDNrskediPNLQNITV 594
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 539 ECRVTLEGLKMENGSLKSHLQGEKQKATEASA---VEQTAESCEVQEMLKVARAEKDLLELSCNELRQELL--------- 606
Cdd:TIGR00618 595 RLQDLTEKLSEAEDMLACEQHALLRKLQPEQDlqdVRLHLQQCSQELALKLTALHALQLTLTQERVREHALsirvlpkel 674
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 607 ---------KANGEIKHVSSLLAKVEKDYSYLKEICDH------QAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQ 671
Cdd:TIGR00618 675 lasrqlalqKMQSEKEQLTYWKEMLAQCQTLLRELETHieeydrEFNEIENASSSLGSDLAAREDALNQSLKELMHQART 754
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 672 VEQHRAVKLHNNqliSELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKR 751
Cdd:TIGR00618 755 VLKARTEAHFNN---NEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEE 831
|
410 420
....*....|....*....|..
gi 1034602044 752 KLLEEEEKNARLQKELGDVQGH 773
Cdd:TIGR00618 832 QFLSRLEEKSATLGEITHQLLK 853
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
351-766 |
5.62e-05 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 46.64 E-value: 5.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 351 KSSKCSTAGSSPNSVSELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTA----------ENEKLVDEKT 420
Cdd:pfam05483 339 ELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKfknnkeveleELKKILAEDE 418
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 421 ILETSFHQHRERAEQLSQENEKLMNLLQERVKN-EEPTTQEGKIIELEQKCTGILEQGR--FEREKLLNIQQQLTCSLRK 497
Cdd:pfam05483 419 KLLDEKKQFEKIAEELKGKEQELIFLLQAREKEiHDLEIQLTAIKTSEEHYLKEVEDLKteLEKEKLKNIELTAHCDKLL 498
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 498 VEEEN--QGALEMIKRLKEENEKLNEFLELERHnnnmMAKTLEECRVTLEGLKMENGSLKSHLqgeKQKATEASAVEQTA 575
Cdd:pfam05483 499 LENKEltQEASDMTLELKKHQEDIINCKKQEER----MLKQIENLEEKEMNLRDELESVREEF---IQKGDEVKCKLDKS 571
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 576 ESCEVQEMLKVARAEKDL--LELSCNELRQELLKANGEIKHVSSLLAKVEKDYSY------LKEICDHQAE-QLSRTSLK 646
Cdd:pfam05483 572 EENARSIEYEVLKKEKQMkiLENKCNNLKKQIENKNKNIEELHQENKALKKKGSAenkqlnAYEIKVNKLElELASAKQK 651
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 647 LQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVI-KLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:pfam05483 652 FEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDKRCQhKIAEMVALMEKHKHQYDKIIEERDSELGLYK 731
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1034602044 726 ADLQTAVVVANDIKCE---AQQELRTVKRKLLEEEEKNARLQKE 766
Cdd:pfam05483 732 NKEQEQSSAKAALEIElsnIKAELLSLKKQLEIEKEEKEKLKME 775
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
509-767 |
1.12e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 45.82 E-value: 1.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 IKRLKEENEKLNEFLELErhnnnmmaKTLEECRVTLEGLKMEngSLKSHLQgEKQKATEASAVEQTAESCEVQE------ 582
Cdd:TIGR02168 202 LKSLERQAEKAERYKELK--------AELRELELALLVLRLE--ELREELE-ELQEELKEAEEELEELTAELQEleekle 270
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 583 --MLKVARAEKDLLELS--CNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEI 658
Cdd:TIGR02168 271 elRLEVSELEEEIEELQkeLYALANEISRLEQQKQILRERLANLERQLEELEA----QLEELESKLDELAEELAELEEKL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 659 KDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQ----------MKEETEEWRRFQADL 728
Cdd:TIGR02168 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEierlearlerLEDRRERLQQEIEEL 426
|
250 260 270
....*....|....*....|....*....|....*....
gi 1034602044 729 QTAVVVANdiKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR02168 427 LKKLEEAE--LKELQAELEELEEELEELQEELERLEEAL 463
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
388-766 |
1.23e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.90 E-value: 1.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 388 AEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhqHRERAEQLSQENEKLMNLLQERVKNEEPTTQEgkiiELE 467
Cdd:PTZ00121 1393 ADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE-----EKKKADEAKKKAEEAKKADEAKKKAEEAKKAE----EAK 1463
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 468 QKCTgilEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGL 547
Cdd:PTZ00121 1464 KKAE---EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 548 KMEngslkshlqgEKQKATEASAVEQTAESCEVQEMLKVARAEKD--LLELSCNELRQELLKANGEIKHVSSLLAKVEKD 625
Cdd:PTZ00121 1541 KAE----------EKKKADELKKAEELKKAEEKKKAEEAKKAEEDknMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE 1610
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 626 YSYLKEICDHQAEQLSrtslKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLE- 704
Cdd:PTZ00121 1611 EAKKAEEAKIKAEELK----KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEd 1686
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044 705 ---------------RQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKN--ARLQKE 766
Cdd:PTZ00121 1687 ekkaaealkkeaeeaKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKkiAHLKKE 1765
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
396-754 |
1.33e-04 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 45.21 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 396 QELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLM-NLLQERvkneeptTQEGKIIELeqkctgiL 474
Cdd:PRK04778 105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRkSLLANR-------FSFGPALDE-------L 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 475 EqgrferEKLLNIQQQLTcslRKVEEENQG----ALEMIKRLKEENEKLNEFLE----LERHNNNMMAKTLEECRVTLEG 546
Cdd:PRK04778 171 E------KQLENLEEEFS---QFVELTESGdyveAREILDQLEEELAALEQIMEeipeLLKELQTELPDQLQELKAGYRE 241
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 547 LKMENGSLKsHLQGEKqkateasaveqtaescEVQEML-KVARAEKDLLELSCNELRQELLKANGEIKHVSSLLakvEKD 625
Cdd:PRK04778 242 LVEEGYHLD-HLDIEK----------------EIQDLKeQIDENLALLEELDLDEAEEKNEEIQERIDQLYDIL---ERE 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 626 YSYLKEIcDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQK---SD 702
Cdd:PRK04778 302 VKARKYV-EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEiaySE 380
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1034602044 703 LERQLKTLTKQMKEETEEWRRFQADLQT---AVVVANDIKCEAQQELRTVKRKLL 754
Cdd:PRK04778 381 LQEELEEILKQLEEIEKEQEKLSEMLQGlrkDELEAREKLERYRNKLHEIKRYLE 435
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
367-756 |
1.49e-04 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 45.49 E-value: 1.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHHSTAEeLQATLQELsdqQQMVQELTAENEKLVD-EKTILE-TSFHQHRERA-EQLSQENEKL 443
Cdd:pfam15921 447 ERQMAAIQGKNESLEKVSSLTAQ-LESTKEML---RKVVEELTAKKMTLESsERTVSDlTASLQEKERAiEATNAEITKL 522
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 444 MNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNI-QQQLTCSLRKVEEENQGA----LEMIKRLKEENEK 518
Cdd:pfam15921 523 RSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKDKVIEIlRQQIENMTQLVGQHGRTAgamqVEKAQLEKEINDR 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNNNMMAKTLE-ECRVtleglkmengslkSHLQGEKQKATEASAveqtaescevqEMLKVARaekdllels 597
Cdd:pfam15921 603 RLELQEFKILKDKKDAKIRElEARV-------------SDLELEKVKLVNAGS-----------ERLRAVK--------- 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 598 cnELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRA 677
Cdd:pfam15921 650 --DIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMK 727
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044 678 VKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQeLRTVKRKLLEE 756
Cdd:pfam15921 728 VAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEV-LRSQERRLKEK 805
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
389-715 |
1.53e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 45.40 E-value: 1.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 389 EELQATLQELSDQQQMVQELTAENEKLVDEKTILETsfhQHRERAEQLSQENEKLMNLLQER-VKNEEPTTQEGKIIELE 467
Cdd:TIGR04523 321 KKLEEIQNQISQNNKIISQLNEQISQLKKELTNSES---ENSEKQRELEEKQNEIEKLKKENqSYKQEIKNLESQINDLE 397
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 468 QKctgileqgrFEREKLLNiqQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFL-ELERHNN------NMMAKTLEEC 540
Cdd:TIGR04523 398 SK---------IQNQEKLN--QQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIkDLTNQDSvkeliiKNLDNTRESL 466
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 541 RVTLEGLKMENGSLKSHLQGEKQ--KATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIkhvSSL 618
Cdd:TIGR04523 467 ETQLKVLSRSINKIKQNLEQKQKelKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKI---SDL 543
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 619 LAKVEKDYSYLKEicdhqaEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEE 698
Cdd:TIGR04523 544 EDELNKDDFELKK------ENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEK 617
|
330
....*....|....*..
gi 1034602044 699 QKSDLERQLKTLTKQMK 715
Cdd:TIGR04523 618 ELEKAKKENEKLSSIIK 634
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
600-782 |
1.60e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 45.29 E-value: 1.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 600 ELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIcdhqaeqlsRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAvk 679
Cdd:COG4913 614 ALEAELAELEEELAEAEERLEALEAELDALQER---------REALQRLAEYSWDEIDVASAEREIAELEAELERLDA-- 682
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 680 lhNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEK 759
Cdd:COG4913 683 --SSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALG 760
|
170 180
....*....|....*....|...
gi 1034602044 760 NARLQKELGDVQGHGRVVTSRAA 782
Cdd:COG4913 761 DAVERELRENLEERIDALRARLN 783
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
475-771 |
1.68e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 45.35 E-value: 1.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 475 EQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKT----LEECRVTLEGLKME 550
Cdd:pfam02463 169 RKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDylklNEERIDLLQELLRD 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 551 NGSLKSHLQGEKQKATEASAVEQTAESCEVQEMlKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKdysylk 630
Cdd:pfam02463 249 EQEEIESSKQEIEKEEEKLAQVLKENKEEEKEK-KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK------ 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 631 eicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQlKTL 710
Cdd:pfam02463 322 -----EKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL-KEE 395
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034602044 711 TKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:pfam02463 396 ELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQ 456
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
389-776 |
3.44e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 44.29 E-value: 3.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 389 EELQATLQELsdqqQMVQELTAENEKLVDEKtiletsfhqhRERAEQLSQENEKLMNL--LQERVKNEEPTTQEGKIIEL 466
Cdd:TIGR02169 170 RKKEKALEEL----EEVEENIERLDLIIDEK----------RQQLERLRREREKAERYqaLLKEKREYEGYELLKEKEAL 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 467 EQKctgiLEQGRFEREKLLNIQQQLTcslRKVEEENQGALEMIKRLKEENEKLNEFLELERhnnNMMAKTLEECRVTLEG 546
Cdd:TIGR02169 236 ERQ----KEAIERQLASLEEELEKLT---EEISELEKRLEEIEQLLEELNKKIKDLGEEEQ---LRVKEKIGELEAEIAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 547 LKMENGSLKSHLQ--GEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAK-VE 623
Cdd:TIGR02169 306 LERSIAEKERELEdaEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAEtRD 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 624 KDYSYLKEICD--HQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKS 701
Cdd:TIGR02169 386 ELKDYREKLEKlkREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034602044 702 DLERQLKTLTKQMKEETEEWRRFQADLQTAvvvandiKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRV 776
Cdd:TIGR02169 466 KYEQELYDLKEEYDRVEKELSKLQRELAEA-------EAQARASEERVRGGRAVEEVLKASIQGVHGTVAQLGSV 533
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
377-667 |
3.76e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.19 E-value: 3.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 377 IQKMEENHHSTAEELQATLQELSDQQQMvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVKNEEP 456
Cdd:TIGR00618 600 TEKLSEAEDMLACEQHALLRKLQPEQDL-QDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASR 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 457 TTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFL-ELERHNNNMMAK 535
Cdd:TIGR00618 679 QLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLkELMHQARTVLKA 758
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 536 TLEECRVTLEGLKME--NGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIK 613
Cdd:TIGR00618 759 RTEAHFNNNEEVTAAlqTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLE 838
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1034602044 614 HVSSLLAKVEKDYSYLKEiCDHQAEQLSRTSLKLQEKasESDAEIKDMKETIFE 667
Cdd:TIGR00618 839 EKSATLGEITHQLLKYEE-CSKQLAQLTQEQAKIIQL--SDKLNGINQIKIQFD 889
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
440-771 |
4.92e-04 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 43.91 E-value: 4.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 440 NEKLMNLLQERVKNEEPTTQEGKiieLEQKCTGILEQGRFEREKL-LNIQQQLTCSLRKVEEENQgalEMIKRLKEENEK 518
Cdd:COG5022 775 QVIQHGFRLRRLVDYELKWRLFI---KLQPLLSLLGSRKEYRSYLaCIIKLQKTIKREKKLRETE---EVEFSLKAEVLI 848
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNnnmMAKTLEECRVTLEGL-KMENgsLKSHLQGEKQKATEASAVEQTAEscevqemlkvaRAEKDLLELS 597
Cdd:COG5022 849 QKFGRSLKAKK---RFSLLKKETIYLQSAqRVEL--AERQLQELKIDVKSISSLKLVNL-----------ELESEIIELK 912
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 598 CNElrQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTsLKLQEkasesdaEIKDMKETIFELEDQVEQHRA 677
Cdd:COG5022 913 KSL--SSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPEL-NKLHE-------VESKLKETSEEYEDLLKKSTI 982
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 678 VKLHNNQLISELESSVIKLEEQKSDLERqLKTLTKQMKEEteewRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEE 757
Cdd:COG5022 983 LVREGNKANSELKNFKKELAELSKQYGA-LQESTKQLKEL----PVEVAELQSASKIISSESTELSILKPLQKLKGLLLL 1057
|
330
....*....|....
gi 1034602044 758 EKNaRLQKELGDVQ 771
Cdd:COG5022 1058 ENN-QLQARYKALK 1070
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
372-716 |
5.39e-04 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 43.49 E-value: 5.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 372 SLTEKIQKMEENH---HSTAEELQATLQ----ELSDQQQMVQELTAE----NEKLVDEKTILETSFHQHRERAEQLSQEN 440
Cdd:PRK02224 346 SLREDADDLEERAeelREEAAELESELEeareAVEDRREEIEELEEEieelRERFGDAPVDLGNAEDFLEELREERDELR 425
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 441 EKLMNL------LQERVKNEEPTTQEGKIIELEQKCTG-----ILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEM- 508
Cdd:PRK02224 426 EREAELeatlrtARERVEEAEALLEAGKCPECGQPVEGsphveTIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLv 505
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 -----IKRLKEENEKLNEFLELERhnnnmmaKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEAsavEQTAEscEVQEM 583
Cdd:PRK02224 506 eaedrIERLEERREDLEELIAERR-------ETIEEKRERAEELRERAAELEAEAEEKREAAAEA---EEEAE--EAREE 573
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 584 LKvaraekdllelSCNELRQELLKANGEIKHVSSLLAKVEKdysylkeiCDHQAEQLSRTSLKLQEKASESDAEIKDMKE 663
Cdd:PRK02224 574 VA-----------ELNSKLAELKERIESLERIRTLLAAIAD--------AEDEIERLREKREALAELNDERRERLAEKRE 634
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 1034602044 664 TIFELEDQVEQHRAVKLHNN-----QLISELESSVIKLEEQKSDLERQLKTLTKQMKE 716
Cdd:PRK02224 635 RKRELEAEFDEARIEEAREDkeraeEYLEQVEEKLDELREERDDLQAEIGAVENELEE 692
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
355-596 |
8.08e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.44 E-value: 8.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 355 CSTAGSSPNSVSELS--LASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRER 432
Cdd:COG4942 12 ALAAAAQADAAAEAEaeLEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKE 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 433 AEQLSQENEKLMNLLQERVKNeepttqegkiieleqkctgILEQGRFEREKLLNIQQQLTCSLRkveeenqgALEMIKRL 512
Cdd:COG4942 92 IAELRAELEAQKEELAELLRA-------------------LYRLGRQPPLALLLSPEDFLDAVR--------RLQYLKYL 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 513 KEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKD 592
Cdd:COG4942 145 APARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
|
....
gi 1034602044 593 LLEL 596
Cdd:COG4942 225 LEAL 228
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
565-788 |
8.44e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.44 E-value: 8.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 565 ATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTS 644
Cdd:COG4942 17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEA----ELAELEKEI 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 645 LKLQEKASESDAEIKDM------------------KETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQ 706
Cdd:COG4942 93 AELRAELEAQKEELAELlralyrlgrqpplalllsPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAE 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 707 LKTLTKQMKEETEEwrrfQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSRAAPPSL 786
Cdd:COG4942 173 RAELEALLAELEEE----RAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGF 248
|
..
gi 1034602044 787 GS 788
Cdd:COG4942 249 AA 250
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
430-766 |
8.64e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 43.04 E-value: 8.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 430 RERAEQLSQENEKLMNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:pfam02463 172 KEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQE 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 510 KRLKEENEKLNEFLELERHNNNMMAKTLEEcrvtlEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARA 589
Cdd:pfam02463 252 EIESSKQEIEKEEEKLAQVLKENKEEEKEK-----KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKA 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 590 EKDLLELSCNELRQELLKANGEIKhvsSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELE 669
Cdd:pfam02463 327 EKELKKEKEEIEELEKELKELEIK---REAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEE 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 670 DQVEQHRAVKLHNNQLISELESSVI--KLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELR 747
Cdd:pfam02463 404 EKEAQLLLELARQLEDLLKEEKKEEleILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQ 483
|
330
....*....|....*....
gi 1034602044 748 TVKRKLLEEEEKNARLQKE 766
Cdd:pfam02463 484 EQLELLLSRQKLEERSQKE 502
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
371-730 |
9.81e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 42.73 E-value: 9.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 371 ASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQER 450
Cdd:TIGR00606 691 AELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQ 770
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 451 VKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTC------------SLRKVEEENQGALEMIKRLKEENEK 518
Cdd:TIGR00606 771 ETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAklqgsdldrtvqQVNQEKQEKQHELDTVVSKIELNRK 850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNE-----FLELERHNNNM------MAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAES--CEVQEMLK 585
Cdd:TIGR00606 851 LIQdqqeqIQHLKSKTNELkseklqIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKdqQEKEELIS 930
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 586 VARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEI-----------CDHQAEQLSR---------TSL 645
Cdd:TIGR00606 931 SKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETelntvnaqleeCEKHQEKINEdmrlmrqdiDTQ 1010
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 646 KLQEKASESDAEIKDMKETIFELEDQVEQHraVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:TIGR00606 1011 KIQERWLQDNLTLRKRENELKEVEEELKQH--LKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFK 1088
|
....*
gi 1034602044 726 ADLQT 730
Cdd:TIGR00606 1089 KELRE 1093
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
534-766 |
1.00e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.59 E-value: 1.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 534 AKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAEscevqEMLKVARAEKDLLELscNELRQELLKANGEIK 613
Cdd:COG4913 616 EAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSW-----DEIDVASAEREIAEL--EAELERLDASSDDLA 688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 614 HVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASE--------SDAEIKDMKETIFELEDQVEQHrAVKLHNNQL 685
Cdd:COG4913 689 ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEEldelqdrlEAAEDLARLELRALLEERFAAA-LGDAVEREL 767
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 686 ISELESSVIKLEEQKSDLERQLktlTKQMKEETEEWRRFQADLQTAVVVANDIkceaQQELRTVKRKLLEE-EEKNARLQ 764
Cdd:COG4913 768 RENLEERIDALRARLNRAEEEL---ERAMRAFNREWPAETADLDADLESLPEY----LALLDRLEEDGLPEyEERFKELL 840
|
..
gi 1034602044 765 KE 766
Cdd:COG4913 841 NE 842
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
534-739 |
1.08e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 534 AKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAES--CEVQEMLKVARAEKDLLELSCNELRQELLKANGE 611
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERriAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 612 IKHVSSLLAKV-------------------------EKDYSYLKEICDH---QAEQLSRTSLKLQEKASESDAEIKDMKE 663
Cdd:COG4942 99 LEAQKEELAELlralyrlgrqpplalllspedfldaVRRLQYLKYLAPArreQAEELRADLAELAALRAELEAERAELEA 178
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034602044 664 TIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIK 739
Cdd:COG4942 179 LLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
480-770 |
1.16e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 42.36 E-value: 1.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 480 EREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNnnmmAKTLEECRVTLEGLKMENGSLKSHLQ 559
Cdd:PRK03918 180 RLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE----VKELEELKEEIEELEKELESLEGSKR 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 560 G--EKQKATEASAVEQTAESCEVQEmlKVARAEK-DLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDhQ 636
Cdd:PRK03918 256 KleEKIRELEERIEELKKEIEELEE--KVKELKElKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK-E 332
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 637 AEQLSRTSLKLQEKASESDAEIKDMK------ETIFELEDQVEQHRAVKLHNNqlISELESSVIKLEEQKSDLERQLKTL 710
Cdd:PRK03918 333 LEEKEERLEELKKKLKELEKRLEELEerhelyEEAKAKKEELERLKKRLTGLT--PEKLEKELEELEKAKEEIEEEISKI 410
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034602044 711 TkqmkEETEEWRRFQADLQTAVvvaNDIKcEAQQELRTVKRKLLEEEEKN--ARLQKELGDV 770
Cdd:PRK03918 411 T----ARIGELKKEIKELKKAI---EELK-KAKGKCPVCGRELTEEHRKEllEEYTAELKRI 464
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
370-772 |
1.20e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 42.36 E-value: 1.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEENHH---STAEELQATLQELSDQQQMVQEltaENEKLVDEKTILETSFHQHRERAEQLSQENEKL--- 443
Cdd:TIGR02169 352 RDKLTEEYAELKEELEdlrAELEEVDKEFAETRDELKDYRE---KLEKLKREINELKRELDRLQEELQRLSEELADLnaa 428
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 444 -------MNLLQERV--KNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCS-------------------- 494
Cdd:TIGR02169 429 iagieakINELEEEKedKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLqrelaeaeaqaraseervrg 508
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 495 ---------------------LRKVEEENQGALEM--------------------IKRLKEENEKLNEFLELER------ 527
Cdd:TIGR02169 509 graveevlkasiqgvhgtvaqLGSVGERYATAIEVaagnrlnnvvveddavakeaIELLKRRKAGRATFLPLNKmrderr 588
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 528 -------------------------------HNNNMMAKTLEECR--------VTLEGL------KMENGSLKSHLQGEK 562
Cdd:TIGR02169 589 dlsilsedgvigfavdlvefdpkyepafkyvFGDTLVVEDIEAARrlmgkyrmVTLEGElfeksgAMTGGSRAPRGGILF 668
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 563 QKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSR 642
Cdd:TIGR02169 669 SRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 643 TSLKLQEKASESD---AEIKDMKETIFELEDQVEQHRAVKLHnnQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETE 719
Cdd:TIGR02169 749 LEQEIENVKSELKeleARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034602044 720 EWRRFQADLQTAVVVANDIK------CEAQQELRTVKRKLLEEEEKNA----RLQKELGDVQG 772
Cdd:TIGR02169 827 EKEYLEKEIQELQEQRIDLKeqiksiEKEIENLNGKKEELEEELEELEaalrDLESRLGDLKK 889
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
561-767 |
1.27e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 1.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 561 EKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQL 640
Cdd:COG4942 27 AELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEEL 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 641 SRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQ---LKTLTKQMKEE 717
Cdd:COG4942 107 AELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAEraeLEALLAELEEE 186
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1034602044 718 TEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:COG4942 187 RAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
551-717 |
1.42e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 42.00 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 551 NGSLKSHLQGEKQKATEASAVEQTAEScEVQEMLKVARAE-KDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYL 629
Cdd:PRK12705 18 LGVLVVLLKKRQRLAKEAERILQEAQK-EAEEKLEAALLEaKELLLRERNQQRQEARREREELQREEERLVQKEEQLDAR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 630 KEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRavklhNNQLISELEssvikleeqkSDLERQLKT 709
Cdd:PRK12705 97 AEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQA-----RKLLLKLLD----------AELEEEKAQ 161
|
....*...
gi 1034602044 710 LTKQMKEE 717
Cdd:PRK12705 162 RVKKIEEE 169
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
381-772 |
2.83e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 41.31 E-value: 2.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 381 EENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFH----------QHRERAEQLSQENEKLMNLLQER 450
Cdd:pfam01576 4 EEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQaetelcaeaeEMRARLAARKQELEEILHELESR 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 451 VKNEEPTTQ---------EGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEE------------------NQ 503
Cdd:pfam01576 84 LEEEEERSQqlqnekkkmQQHIQDLEEQ----LDEEEAARQKLQLEKVTTEAKIKKLEEDillledqnsklskerkllEE 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 504 GALEMIKRLKEENEKLNEFLELeRHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEkqkATEASavEQTAESCEVQEM 583
Cdd:pfam01576 160 RISEFTSNLAEEEEKAKSLSKL-KNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGE---STDLQ--EQIAELQAQIAE 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 584 LKVARAEKDllelscNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIC-DHQAEQLSRTSLKLQEK--ASESDAEIKD 660
Cdd:pfam01576 234 LRAQLAKKE------EELQAALARLEEETAQKNNALKKIRELEAQISELQeDLESERAARNKAEKQRRdlGEELEALKTE 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 661 MKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKC 740
Cdd:pfam01576 308 LEDTLDTTAAQQELRSKREQEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENA 387
|
410 420 430
....*....|....*....|....*....|..
gi 1034602044 741 EAQQELRTVKRKLLEEEEKNARLQKELGDVQG 772
Cdd:pfam01576 388 ELQAELRTLQQAKQDSEHKRKKLEGQLQELQA 419
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
366-649 |
2.89e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 41.26 E-value: 2.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 366 SELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRER---AEQLSQENEK 442
Cdd:pfam15921 494 SERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKdkvIEILRQQIEN 573
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 443 LMNL-----------------LQERVKNEEPTTQEGKII---------ELEQKCTGIleqgRFEREKLLNIQQQLTCSLR 496
Cdd:pfam15921 574 MTQLvgqhgrtagamqvekaqLEKEINDRRLELQEFKILkdkkdakirELEARVSDL----ELEKVKLVNAGSERLRAVK 649
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 497 KVEEENQGALEMIKRLKEENEKLNEFLELERHNnnmMAKTLEECRVTLEGLKMENGSLKSHLQGEK-------------- 562
Cdd:pfam15921 650 DIKQERDQLLNEVKTSRNELNSLSEDYEVLKRN---FRNKSEEMETTTNKLKMQLKSAQSELEQTRntlksmegsdgham 726
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 563 ------QKATEASAVEQTAESCEVQ---EMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIC 633
Cdd:pfam15921 727 kvamgmQKQITAKRGQIDALQSKIQfleEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKV 806
|
330
....*....|....*.
gi 1034602044 634 DHQAEQLSRTSLKLQE 649
Cdd:pfam15921 807 ANMEVALDKASLQFAE 822
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
369-767 |
3.22e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 41.18 E-value: 3.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 369 SLASLTEKIQKMEE-NHHSTAEELQATLQELSDQqqmVQELTAENEKLVDEKTILETSFHQHRERAEQLS---QENEKLM 444
Cdd:PRK02224 188 SLDQLKAQIEEKEEkDLHERLNGLESELAELDEE---IERYEEQREQARETRDEADEVLEEHEERREELEtleAEIEDLR 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 445 NLLQERVKN-----EEPTTQEGKIIELEQKCTGILEQGRFE----------REKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:PRK02224 265 ETIAETEREreelaEEVRDLRERLEELEEERDDLLAEAGLDdadaeavearREELEDRDEELRDRLEECRVAAQAHNEEA 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 510 KRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLkshlqgEKQKATEASAVEQTAESCE-VQEMLKVAR 588
Cdd:PRK02224 345 ESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEEL------EEEIEELRERFGDAPVDLGnAEDFLEELR 418
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVSSLLAKVE--------KDYSYLKEICDHqaeqlsrtslklQEKASESDAEIKD 660
Cdd:PRK02224 419 EERDELREREAELEATLRTARERVEEAEALLEAGKcpecgqpvEGSPHVETIEED------------RERVEELEAELED 486
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 661 MKETIFELEDQVEqhRAVKLhnnqliSELESSVIKLEEQKSDLERQL---KTLTKQMKEETEEWRRFQADLQTAVVVAND 737
Cdd:PRK02224 487 LEEEVEEVEERLE--RAEDL------VEAEDRIERLEERREDLEELIaerRETIEEKRERAEELRERAAELEAEAEEKRE 558
|
410 420 430
....*....|....*....|....*....|
gi 1034602044 738 IKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:PRK02224 559 AAAEAEEEAEEAREEVAELNSKLAELKERI 588
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
424-762 |
3.32e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 41.04 E-value: 3.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 424 TSFHQHRERAE-QLSQENEKLMNLLQERVKNEEPTT--QEGKIIEL---EQKCTGILEQGRFER----EKLLNIQQQLTC 493
Cdd:PLN02939 91 TSSDDDHNRASmQRDEAIAAIDNEQQTNSKDGEQLSdfQLEDLVGMiqnAEKNILLLNQARLQAledlEKILTEKEALQG 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 494 SLRKVEEENQGALEMIKRLKEEN---EKLNEFLELERHNNNMMAKTLEECRVT----LEGLKMENGSLKSHLQGEKqkaT 566
Cdd:PLN02939 171 KINILEMRLSETDARIKLAAQEKihvEILEEQLEKLRNELLIRGATEGLCVHSlskeLDVLKEENMLLKDDIQFLK---A 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 567 EASAVEQTAESCEVQEmlkvarAEKDLLELSCNELRQELLKANGEIKHVSSL-----LAKVEKdysyLKEICDHQAEQLS 641
Cdd:PLN02939 248 ELIEVAETEERVFKLE------KERSLLDASLRELESKFIVAQEDVSKLSPLqydcwWEKVEN----LQDLLDRATNQVE 317
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 642 RTSLKLQEKasesdaeiKDMKETIFELEDQVEqhravklhnnqliselESSVIKLEEQKSDLerqlktLTKQMKEETEEW 721
Cdd:PLN02939 318 KAALVLDQN--------QDLRDKVDKLEASLK----------------EANVSKFSSYKVEL------LQQKLKLLEERL 367
|
330 340 350 360
....*....|....*....|....*....|....*....|.
gi 1034602044 722 RRFQADLQTAVVVANDIKCEAQQELrtvkRKLLEEEEKNAR 762
Cdd:PLN02939 368 QASDHEIHSYIQLYQESIKEFQDTL----SKLKEESKKRSL 404
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
683-766 |
4.10e-03 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 40.58 E-value: 4.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 683 NQLISELESSVIKLEEQKSDLE---RQLKTLTKQMKEETEEWR-RFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEE 758
Cdd:PRK00409 519 NELIASLEELERELEQKAEEAEallKEAEKLKEELEEKKEKLQeEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQK 598
|
....*...
gi 1034602044 759 KNARLQKE 766
Cdd:PRK00409 599 GGYASVKA 606
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
370-768 |
4.52e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 40.80 E-value: 4.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEEN-HHSTAEELQATLQELSDQQQMVQELTAENEklvdektilETSFHQHRERAEQLSQENEKLMNLLQ 448
Cdd:TIGR00606 593 LAKLNKELASLEQNkNHINNELESKEEQLSSYEDKLFDVCGSQDE---------ESDLERLKEEIEKSSKQRAMLAGATA 663
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 449 ERVKNEEPTTQEgkiielEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERH 528
Cdd:TIGR00606 664 VYSQFITQLTDE------NQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQS 737
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 529 NNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESC--------EVQEMLK---------VARAEK 591
Cdd:TIGR00606 738 IIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCltdvtimeRFQMELKdverkiaqqAAKLQG 817
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 592 DLLELSCNELRQELLKANGEIKHVSS---LLAKVEKDY--------SYLKEICDHQAeQLSRTSLKLQEKASESDAEIKD 660
Cdd:TIGR00606 818 SDLDRTVQQVNQEKQEKQHELDTVVSkieLNRKLIQDQqeqiqhlkSKTNELKSEKL-QIGTNLQRRQQFEEQLVELSTE 896
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 661 MKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAvvvANDIKC 740
Cdd:TIGR00606 897 VQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDG---KDDYLK 973
|
410 420
....*....|....*....|....*...
gi 1034602044 741 EAQQELRTVKRKLLEEEEKNARLQKELG 768
Cdd:TIGR00606 974 QKETELNTVNAQLEECEKHQEKINEDMR 1001
|
|
| PRK10884 |
PRK10884 |
SH3 domain-containing protein; Provisional |
647-771 |
4.56e-03 |
|
SH3 domain-containing protein; Provisional
Pssm-ID: 182809 [Multi-domain] Cd Length: 206 Bit Score: 39.25 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 647 LQEKASESDAEIKDMKEtifeledqveqhRAVKLHNNQLiseleSSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQA 726
Cdd:PRK10884 59 LQVNANTNYAQIRDSKG------------RTAWIPLKQL-----STTPSLRTRVPDLENQVKTLTDKLNNIDNTWNQRTA 121
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1034602044 727 DLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:PRK10884 122 EMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQ 166
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
390-725 |
4.74e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 40.54 E-value: 4.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 390 ELQATLQELSDQ-QQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVKNEEPTTQEGKIIE-LE 467
Cdd:pfam01576 226 ELQAQIAELRAQlAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEELEaLK 305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 468 QKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEE----NQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLEecrvt 543
Cdd:pfam01576 306 TELEDTLDTTAAQQELRSKREQEVTELKKALEEEtrshEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQ----- 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 544 leGLKMENGSLKSHLQGEKQKATEASAVEQTAEScEVQE-MLKVARAEKDLlelscNELRQELLKANGEIKHVSSLLAKV 622
Cdd:pfam01576 381 --ALESENAELQAELRTLQQAKQDSEHKRKKLEG-QLQElQARLSESERQR-----AELAEKLSKLQSELESVSSLLNEA 452
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 623 EKDYSYLKEICDHQAEQLSRTSLKLQEKASES---DAEIKDMKETIFELEDQVEQH--------RAVKLHNNQL------ 685
Cdd:pfam01576 453 EGKNIKLSKDVSSLESQLQDTQELLQEETRQKlnlSTRLRQLEDERNSLQEQLEEEeeakrnveRQLSTLQAQLsdmkkk 532
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1034602044 686 ISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:pfam01576 533 LEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLE 572
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
369-720 |
5.07e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 40.34 E-value: 5.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 369 SLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQ 448
Cdd:TIGR00618 543 SEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQ 622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 449 ERVKNEEPTTQEGKIIELEQKctgileqgrferEKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERH 528
Cdd:TIGR00618 623 PEQDLQDVRLHLQQCSQELAL------------KLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKE 690
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 529 NNNMMAKTLEECRVTLEGLKMENGSLKSHLQgEKQKATEASAVEQTAESCEVQEMLKVARAEKDllelscnelrqELLKA 608
Cdd:TIGR00618 691 QLTYWKEMLAQCQTLLRELETHIEEYDREFN-EIENASSSLGSDLAAREDALNQSLKELMHQAR-----------TVLKA 758
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 609 NGEIKHVSSLLAKVE----KDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDmKETIFELEDQVEQHRAVKLHNnq 684
Cdd:TIGR00618 759 RTEAHFNNNEEVTAAlqtgAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPS-DEDILNLQCETLVQEEEQFLS-- 835
|
330 340 350
....*....|....*....|....*....|....*....
gi 1034602044 685 LISELESSVIKLEEQKSDLE---RQLKTLTKQMKEETEE 720
Cdd:TIGR00618 836 RLEEKSATLGEITHQLLKYEecsKQLAQLTQEQAKIIQL 874
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
363-768 |
6.43e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 40.33 E-value: 6.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 363 NSVSELSlASLTEKIQKMEENHHS----TAEELQATLQElsDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLsQ 438
Cdd:PRK04863 796 EELAERY-ATLSFDVQKLQRLHQAfsrfIGSHLAVAFEA--DPEAELRQLNRRRVELERALADHESQEQQQRSQLEQA-K 871
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 439 ENEKLMNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFERE--KLLNIQQQLTCSLRKVEEEnqgaLEMIKRLKEEN 516
Cdd:PRK04863 872 EGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQhgNALAQLEPIVSVLQSDPEQ----FEQLKQDYQQA 947
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 517 EKlneflelERHNNNMMAKTLEECRVTLEGLKMENgslkshlqgekqkateasAVEQTAESCEVQEMLkvaRAEKDLLEL 596
Cdd:PRK04863 948 QQ-------TQRDAKQQAFALTEVVQRRAHFSYED------------------AAEMLAKNSDLNEKL---RQRLEQAEQ 999
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 597 SCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDmketifELEDQVEQHR 676
Cdd:PRK04863 1000 ERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEERARARRD------ELHARLSANR 1073
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 677 AVKlhnNQLiselessviklEEQKSDLERQLKTLTKQMKEETEEWRrfqaDLQTAVVVANDIKCEAQQELRT--VKRKLL 754
Cdd:PRK04863 1074 SRR---NQL-----------EKQLTFCEAEMDNLTKKLRKLERDYH----EMREQVVNAKAGWCAVLRLVKDngVERRLH 1135
|
410 420
....*....|....*....|.
gi 1034602044 755 EEE-------EKNARLQKELG 768
Cdd:PRK04863 1136 RRElaylsadELRSMSDKALG 1156
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
373-540 |
6.77e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 39.69 E-value: 6.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 373 LTEKIQKMEENHHSTAEELQATLQeLSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVK 452
Cdd:PRK12705 31 LAKEAERILQEAQKEAEEKLEAAL-LEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEE 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 453 NEEP-TTQEGKIIELEQKCTGILEQ-GRFEREKllniQQQLTCSLRKVEEENQGALEmIKRLKEENEklnefLELERHNN 530
Cdd:PRK12705 110 REKAlSARELELEELEKQLDNELYRvAGLTPEQ----ARKLLLKLLDAELEEEKAQR-VKKIEEEAD-----LEAERKAQ 179
|
170
....*....|
gi 1034602044 531 NMMAKTLEEC 540
Cdd:PRK12705 180 NILAQAMQRI 189
|
|
| HMMR_N |
pfam15905 |
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ... |
474-727 |
6.87e-03 |
|
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.
Pssm-ID: 464932 [Multi-domain] Cd Length: 329 Bit Score: 39.41 E-value: 6.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 474 LEQGRFEREKLLniqQQLTCSLRKVEEENQGALEMIKRLKEENEKLN-EFLELERHNNNMMAKTLEE-CRVTLEGLKMEN 551
Cdd:pfam15905 85 LVQERGEQDKRL---QALEEELEKVEAKLNAAVREKTSLSASVASLEkQLLELTRVNELLKAKFSEDgTQKKMSSLSMEL 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 552 GSLKSHLQGeKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQE----------LLKANGEIKHVSSLLAK 621
Cdd:pfam15905 162 MKLRNKLEA-KMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEkieeksetekLLEYITELSCVSEQVEK 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 622 VEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEdqveqhravklhnnqliSELESSVIKLEEQKS 701
Cdd:pfam15905 241 YKLDIAQLEELLKEKNDEIESLKQSLEEKEQELSKQIKDLNEKCKLLE-----------------SEKEELLREYEEKEQ 303
|
250 260
....*....|....*....|....*.
gi 1034602044 702 DLERQLKTLTKQMKEETEEWRRFQAD 727
Cdd:pfam15905 304 TLNAELEELKEKLTLEEQEHQKLQQK 329
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
435-767 |
8.26e-03 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 39.70 E-value: 8.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 435 QLSQENEKLMNL---LQERVKNEEPTTQEG-KIIELEQKCtgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIK 510
Cdd:pfam05483 82 KLYKEAEKIKKWkvsIEAELKQKENKLQENrKIIEAQRKA---IQELQFENEKVSLKLEEEIQENKDLIKENNATRHLCN 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 511 RLKE----ENEKLNEFlELERH---------NNNM--MAKTLEECRVTLEGLKME-NGSLK------SHLQGEKQKATEA 568
Cdd:pfam05483 159 LLKEtcarSAEKTKKY-EYEREetrqvymdlNNNIekMILAFEELRVQAENARLEmHFKLKedhekiQHLEEEYKKEIND 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 569 SAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIcdhqaeQLSRTSLKLQ 648
Cdd:pfam05483 238 KEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDI------KMSLQRSMST 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 649 EKASESDAEI--KDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQA 726
Cdd:pfam05483 312 QKALEEDLQIatKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSS 391
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1034602044 727 DLQTAVVVANDIKCEAqQELRTV---KRKLLEEEEKNARLQKEL 767
Cdd:pfam05483 392 ELEEMTKFKNNKEVEL-EELKKIlaeDEKLLDEKKQFEKIAEEL 434
|
|
|