NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034602044|ref|XP_016880814|]
View 

cytospin-B isoform X11 [Homo sapiens]

Protein Classification

coiled-coil domain-containing protein( domain architecture ID 1000095)

coiled-coil domain-containing protein contains a region with alpha-helical coiled-coil sequence signatures that is being annotated by a variety of protein family models, not necessarily indicating family membership

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-726 5.72e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.78  E-value: 5.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEENhhstAEELQATLQELSDQQqmvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQE 449
Cdd:TIGR02168  679 IEELEEKIEELEEK----IAELEKALAELRKEL---EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  450 RvkNEEPTTQEGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHN 529
Cdd:TIGR02168  752 L--SKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  530 NNMMAKTLEECRVTLEGLKMENGSLK--------SHLQGEKQKATEASAVEQ-TAESCEVQEMLKVARAEKDLLELSCNE 600
Cdd:TIGR02168  826 LESLERRIAATERRLEDLEEQIEELSedieslaaEIEELEELIEELESELEAlLNERASLEEALALLRSELEELSEELRE 905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  601 LRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS-RTSLKLQ---EKASESDAEIKDMKETIFELEDQVEQHR 676
Cdd:TIGR02168  906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSeEYSLTLEeaeALENKIEDDEEEARRRLKRLENKIKELG 985
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034602044  677 AVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWR-RFQA 726
Cdd:TIGR02168  986 PVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREAReRFKD 1036
 
Name Accession Description Interval E-value
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-726 5.72e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.78  E-value: 5.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEENhhstAEELQATLQELSDQQqmvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQE 449
Cdd:TIGR02168  679 IEELEEKIEELEEK----IAELEKALAELRKEL---EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  450 RvkNEEPTTQEGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHN 529
Cdd:TIGR02168  752 L--SKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  530 NNMMAKTLEECRVTLEGLKMENGSLK--------SHLQGEKQKATEASAVEQ-TAESCEVQEMLKVARAEKDLLELSCNE 600
Cdd:TIGR02168  826 LESLERRIAATERRLEDLEEQIEELSedieslaaEIEELEELIEELESELEAlLNERASLEEALALLRSELEELSEELRE 905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  601 LRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS-RTSLKLQ---EKASESDAEIKDMKETIFELEDQVEQHR 676
Cdd:TIGR02168  906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSeEYSLTLEeaeALENKIEDDEEEARRRLKRLENKIKELG 985
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034602044  677 AVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWR-RFQA 726
Cdd:TIGR02168  986 PVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREAReRFKD 1036
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
509-767 1.60e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.49  E-value: 1.60e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 IKRLKEENEKLNEFLELErhnnnmmaKTLEECRVTLEGLKMEngslksHLQGEKQKAtEASAVEQTAESCEVQEMLKVAR 588
Cdd:COG1196   202 LEPLERQAEKAERYRELK--------EELKELEAELLLLKLR------ELEAELEEL-EAELEELEAELEELEAELAELE 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFEL 668
Cdd:COG1196   267 AELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEE----RRRELEERLEELEEELAELEEELEELEEELEEL 342
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 669 EDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKcEAQQELRT 748
Cdd:COG1196   343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALL-ERLERLEE 421
                         250
                  ....*....|....*....
gi 1034602044 749 VKRKLLEEEEKNARLQKEL 767
Cdd:COG1196   422 ELEELEEALAELEEEEEEE 440
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
367-772 9.98e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 55.82  E-value: 9.98e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHH---STAEELQATLQELSDQQQMVQELTAENEKLVD-------EKTILETSFHQHRERAEQL 436
Cdd:PRK02224  212 ESELAELDEEIERYEEQREqarETRDEADEVLEEHEERREELETLEAEIEDLREtiaeterEREELAEEVRDLRERLEEL 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 437 SQENEKLmnLLQERVKNEEPTTQEGKIIELEQKCTGI---LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLK 513
Cdd:PRK02224  292 EEERDDL--LAEAGLDDADAEAVEARREELEDRDEELrdrLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELE 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 514 EENEKLNEFLELERHNNNMMAKTLEECR-------VTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESC--EVQEML 584
Cdd:PRK02224  370 SELEEAREAVEDRREEIEELEEEIEELRerfgdapVDLGNAEDFLEELREERDELREREAELEATLRTARERveEAEALL 449
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 585 KV---------------------ARAEKDLLELSCNELRQELLKANGEIKHVSSLlAKVEKDYSYLKE-------ICDHQ 636
Cdd:PRK02224  450 EAgkcpecgqpvegsphvetieeDRERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEErredleeLIAER 528
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 637 AEQLSRTSLKLQEK---ASESDAEIKDMKETIFELEDQVEQHR-AVKLHNNQL---------ISELESSVIKLEEQKSDL 703
Cdd:PRK02224  529 RETIEEKRERAEELrerAAELEAEAEEKREAAAEAEEEAEEAReEVAELNSKLaelkeriesLERIRTLLAAIADAEDEI 608
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 704 ER------QLKTLTKQMKEETEEWRRFQADLQ-----TAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQG 772
Cdd:PRK02224  609 ERlrekreALAELNDERRERLAEKRERKRELEaefdeARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVEN 688
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
366-773 1.85e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 51.71  E-value: 1.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  366 SELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKL-- 443
Cdd:pfam01576   24 AESELKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEERSQQLQNEKKKMqq 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  444 -MNLLQERVKNEEPTTQ---------EGKIIELEQKCTGILEQ-GRFEREKLLNIQQQLTCSLRKVEEENQGalEMIKRL 512
Cdd:pfam01576  104 hIQDLEEQLDEEEAARQklqlekvttEAKIKKLEEDILLLEDQnSKLSKERKLLEERISEFTSNLAEEEEKA--KSLSKL 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  513 KEENEKLNEFLELERHNNNMMAKTLEECRVTLEG------------------LKMENGSLKSHLQGEKQKATEASAV--- 571
Cdd:pfam01576  182 KNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGestdlqeqiaelqaqiaeLRAQLAKKEEELQAALARLEEETAQknn 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  572 ------EQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKdysyLKEICDHQAEQLSRTsl 645
Cdd:pfam01576  262 alkkirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTLDTTAAQQE----LRSKREQEVTELKKA-- 335
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  646 kLQEKASESDAEIKDMKE----TIFELEDQVEQHRAVKlhnnqliSELESSVIKLEEQKSDLERQLKTLTKQMKEETEEW 721
Cdd:pfam01576  336 -LEEETRSHEAQLQEMRQkhtqALEELTEQLEQAKRNK-------ANLEKAKQALESENAELQAELRTLQQAKQDSEHKR 407
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034602044  722 RRFQADLQTAVVVANDIK----------CEAQQELRTVKRKLLEEEEKNARLQKELGDVQGH 773
Cdd:pfam01576  408 KKLEGQLQELQARLSESErqraelaeklSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQ 469
 
Name Accession Description Interval E-value
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-726 5.72e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 72.78  E-value: 5.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEENhhstAEELQATLQELSDQQqmvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQE 449
Cdd:TIGR02168  679 IEELEEKIEELEEK----IAELEKALAELRKEL---EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  450 RvkNEEPTTQEGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHN 529
Cdd:TIGR02168  752 L--SKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  530 NNMMAKTLEECRVTLEGLKMENGSLK--------SHLQGEKQKATEASAVEQ-TAESCEVQEMLKVARAEKDLLELSCNE 600
Cdd:TIGR02168  826 LESLERRIAATERRLEDLEEQIEELSedieslaaEIEELEELIEELESELEAlLNERASLEEALALLRSELEELSEELRE 905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  601 LRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS-RTSLKLQ---EKASESDAEIKDMKETIFELEDQVEQHR 676
Cdd:TIGR02168  906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSeEYSLTLEeaeALENKIEDDEEEARRRLKRLENKIKELG 985
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034602044  677 AVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWR-RFQA 726
Cdd:TIGR02168  986 PVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREAReRFKD 1036
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
367-771 1.92e-11

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 67.74  E-value: 1.92e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEEnhhstaeeLQATLQELSDQQQmvqELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:TIGR04523 200 ELLLSNLKKKIQKNKS--------LESQISELKKQNN---QLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQErvKNEEPTTQEGKIIELEQKCTGI-LEQGRFEREKLLNIQQQLTCSLRKVEEENQGA-------LEMIKRLKEENEK 518
Cdd:TIGR04523 269 LSE--KQKELEQNNKKIKELEKQLNQLkSEISDLNNQKEQDWNKELKSELKNQEKKLEEIqnqisqnNKIISQLNEQISQ 346
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKAT--EASAVEQTAESCEVQEMLKVARAEKDLLEL 596
Cdd:TIGR04523 347 LKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINdlESKIQNQEKLNQQKDEQIKKLQQEKELLEK 426
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 597 SCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTS------------------------LKLQEKAS 652
Cdd:TIGR04523 427 EIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSrsinkikqnleqkqkelkskekelKKLNEEKK 506
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 653 ESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSdlERQLKTLTKQMKEETEEWRRFQADLQTAV 732
Cdd:TIGR04523 507 ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQ 584
                         410       420       430
                  ....*....|....*....|....*....|....*....
gi 1034602044 733 VVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:TIGR04523 585 EEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAK 623
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
462-767 1.92e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.69  E-value: 1.92e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  462 KIIELEQKCTGI---LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLE 538
Cdd:TIGR02168  678 EIEELEEKIEELeekIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  539 ECRVTLEGLKMENGSLKSHLQgekqkateasavEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSL 618
Cdd:TIGR02168  758 ELEAEIEELEERLEEAEEELA------------EAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRER 825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  619 LAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEE 698
Cdd:TIGR02168  826 LESLERRIAATER----RLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSE 901
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044  699 QKSDLERQLKTLTKQMKEETEEWRRFQADLQtavvvandikcEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR02168  902 ELRELESKRSELRRELEELREKLAQLELRLE-----------GLEVRIDNLQERLSEEYSLTLEEAEAL 959
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
509-767 1.60e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 61.49  E-value: 1.60e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 IKRLKEENEKLNEFLELErhnnnmmaKTLEECRVTLEGLKMEngslksHLQGEKQKAtEASAVEQTAESCEVQEMLKVAR 588
Cdd:COG1196   202 LEPLERQAEKAERYRELK--------EELKELEAELLLLKLR------ELEAELEEL-EAELEELEAELEELEAELAELE 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFEL 668
Cdd:COG1196   267 AELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEE----RRRELEERLEELEEELAELEEELEELEEELEEL 342
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 669 EDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKcEAQQELRT 748
Cdd:COG1196   343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALL-ERLERLEE 421
                         250
                  ....*....|....*....
gi 1034602044 749 VKRKLLEEEEKNARLQKEL 767
Cdd:COG1196   422 ELEELEEALAELEEEEEEE 440
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-638 3.97e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 60.46  E-value: 3.97e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEENHHSTAEELQATLQELSD-------QQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEK 442
Cdd:TIGR02168  241 LEELQEELKEAEEELEELTAELQELEEKLEElrlevseLEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  443 LMNLLQE-----RVKNEEPTTQEGKIIELEQKCTGILEQGrferEKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENE 517
Cdd:TIGR02168  321 LEAQLEElesklDELAEELAELEEKLEELKEELESLEAEL----EELEAELEELESRLEELEEQLETLRSKVAQLELQIA 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  518 KLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELS 597
Cdd:TIGR02168  397 SLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1034602044  598 CNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAE 638
Cdd:TIGR02168  477 LDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSG 517
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
495-777 5.00e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 60.07  E-value: 5.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  495 LRKVEEENQGALeMIKRLKEENEKLNEFLELERHNNNMM---AKTLEECRVTLEGLKMENGSLKSHLQgEKQKATEASAV 571
Cdd:TIGR02168  218 LKAELRELELAL-LVLRLEELREELEELQEELKEAEEELeelTAELQELEEKLEELRLEVSELEEEIE-ELQKELYALAN 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  572 EQTaescEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQE-- 649
Cdd:TIGR02168  296 EIS----RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEle 371
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  650 -KASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQ--------MKEETEE 720
Cdd:TIGR02168  372 sRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKelqaeleeLEEELEE 451
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034602044  721 WRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVV 777
Cdd:TIGR02168  452 LQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGV 508
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
367-627 5.20e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 56.87  E-value: 5.20e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:COG1196   252 EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEE 331
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQERVKNEEptTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLtcsLRKVEEENQGALEMIKRLKEENEKLNEFLELE 526
Cdd:COG1196   332 LEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAEL---AEAEEELEELAEELLEALRAAAELAAQLEELE 406
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 527 RHNNNmMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELscNELRQELL 606
Cdd:COG1196   407 EAEEA-LLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL--EAALAELL 483
                         250       260
                  ....*....|....*....|.
gi 1034602044 607 KANGEIKHVSSLLAKVEKDYS 627
Cdd:COG1196   484 EELAEAAARLLLLLEAEADYE 504
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
367-772 9.98e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 55.82  E-value: 9.98e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEENHH---STAEELQATLQELSDQQQMVQELTAENEKLVD-------EKTILETSFHQHRERAEQL 436
Cdd:PRK02224  212 ESELAELDEEIERYEEQREqarETRDEADEVLEEHEERREELETLEAEIEDLREtiaeterEREELAEEVRDLRERLEEL 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 437 SQENEKLmnLLQERVKNEEPTTQEGKIIELEQKCTGI---LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLK 513
Cdd:PRK02224  292 EEERDDL--LAEAGLDDADAEAVEARREELEDRDEELrdrLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELE 369
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 514 EENEKLNEFLELERHNNNMMAKTLEECR-------VTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESC--EVQEML 584
Cdd:PRK02224  370 SELEEAREAVEDRREEIEELEEEIEELRerfgdapVDLGNAEDFLEELREERDELREREAELEATLRTARERveEAEALL 449
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 585 KV---------------------ARAEKDLLELSCNELRQELLKANGEIKHVSSLlAKVEKDYSYLKE-------ICDHQ 636
Cdd:PRK02224  450 EAgkcpecgqpvegsphvetieeDRERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEErredleeLIAER 528
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 637 AEQLSRTSLKLQEK---ASESDAEIKDMKETIFELEDQVEQHR-AVKLHNNQL---------ISELESSVIKLEEQKSDL 703
Cdd:PRK02224  529 RETIEEKRERAEELrerAAELEAEAEEKREAAAEAEEEAEEAReEVAELNSKLaelkeriesLERIRTLLAAIADAEDEI 608
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 704 ER------QLKTLTKQMKEETEEWRRFQADLQ-----TAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQG 772
Cdd:PRK02224  609 ERlrekreALAELNDERRERLAEKRERKRELEaefdeARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVEN 688
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
376-765 1.04e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 55.84  E-value: 1.04e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 376 KIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETsFHQHRERAEQLSQENEKlmnlLQERVKNEE 455
Cdd:PRK03918  311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEE-RHELYEEAKAKKEELER----LKKRLTGLT 385
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 456 PTTQEGKIIELEQKCTGIleqgrfeREKLLNIQQQLTcSLRKVEEENQGALEMIKRLKEENEKLNEFLElERHNNNMMA- 534
Cdd:PRK03918  386 PEKLEKELEELEKAKEEI-------EEEISKITARIG-ELKKEIKELKKAIEELKKAKGKCPVCGRELT-EEHRKELLEe 456
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 535 ---------KTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELScnELRQEL 605
Cdd:PRK03918  457 ytaelkrieKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYE--KLKEKL 534
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 606 LKANGEIKHVSSLLAKV---EKDYSYLKEICDHQAEQLSRTSLKLQEKASESdaeIKDMKETIFELE----------DQV 672
Cdd:PRK03918  535 IKLKGEIKSLKKELEKLeelKKKLAELEKKLDELEEELAELLKELEELGFES---VEELEERLKELEpfyneylelkDAE 611
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 673 EQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLT-----------KQMKEETEEWRRFQADLQTAVVVANDIKCE 741
Cdd:PRK03918  612 KELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEelekkyseeeyEELREEYLELSRELAGLRAELEELEKRREE 691
                         410       420
                  ....*....|....*....|....
gi 1034602044 742 AQQELRTVKRKLLEEEEKNARLQK 765
Cdd:PRK03918  692 IKKTLEKLKEELEEREKAKKELEK 715
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
356-720 1.43e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.46  E-value: 1.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  356 STAGSSPNSVSEL-SLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAE 434
Cdd:TIGR02169  661 APRGGILFSRSEPaELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLE 740
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  435 QLSqenEKLMNLLQERVKNE-EPTTQEGKIIELEQKCTGI-LEQGRFEREKLLNIQQQLTCSLRKVEEENQgalEMIKRL 512
Cdd:TIGR02169  741 ELE---EDLSSLEQEIENVKsELKELEARIEELEEDLHKLeEALNDLEARLSHSRIPEIQAELSKLEEEVS---RIEARL 814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  513 KEENEKLNEfLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKaTEASAVEQTAESCEVQEMLKVARAEKD 592
Cdd:TIGR02169  815 REIEQKLNR-LTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE-LEEELEELEAALRDLESRLGDLKKERD 892
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  593 LLELSCNELRQELLKANGEI----KHVSSLLAKVEKDYSYLKEICDHQAEQLSrtslklqekASESDAEIKDMKETIFEL 668
Cdd:TIGR02169  893 ELEAQLRELERKIEELEAQIekkrKRLSELKAKLEALEEELSEIEDPKGEDEE---------IPEEELSLEDVQAELQRV 963
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1034602044  669 EDQVEQHRAVklhNNQLISELESSVIKLEEQKSDLERqLKTLTKQMKEETEE 720
Cdd:TIGR02169  964 EEEIRALEPV---NMLAIQEYEEVLKRLDELKEKRAK-LEEERKAILERIEE 1011
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
374-768 3.72e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 53.91  E-value: 3.72e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 374 TEKIQKMEENHHstaEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhQHRERAEQLSQENEKL---MNLLQER 450
Cdd:PRK03918  188 TENIEELIKEKE---KELEEVLREINEISSELPELREELEKLEKEVKELE----ELKEEIEELEKELESLegsKRKLEEK 260
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 451 VKN------------EEPTTQEGKIIELEQKCTGILEQGRFeREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEK 518
Cdd:PRK03918  261 IREleerieelkkeiEELEEKVKELKELKEKAEEYIKLSEF-YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEER 339
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 519 LNEFLELERHNNNMMAKtLEECRVTLEGLKMENGSLKSHLQGEKQKATEasaveqtaescEVQEMLKVARAEKDLLELSC 598
Cdd:PRK03918  340 LEELKKKLKELEKRLEE-LEERHELYEEAKAKKEELERLKKRLTGLTPE-----------KLEKELEELEKAKEEIEEEI 407
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 599 NELRQELLKANGEIKHVSSLLAKVEKdysyLKEICDHQAEQLSRTSLKlqEKASESDAEIKDMKETIFELEDQVEQHRAV 678
Cdd:PRK03918  408 SKITARIGELKKEIKELKKAIEELKK----AKGKCPVCGRELTEEHRK--ELLEEYTAELKRIEKELKEIEEKERKLRKE 481
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 679 KLHNNQLISElESSVIKLE---EQKSDLERQLKTLT-KQMKEETEEWRRFQADLQT----AVVVANDIK--CEAQQELRT 748
Cdd:PRK03918  482 LRELEKVLKK-ESELIKLKelaEQLKELEEKLKKYNlEELEKKAEEYEKLKEKLIKlkgeIKSLKKELEklEELKKKLAE 560
                         410       420
                  ....*....|....*....|
gi 1034602044 749 VKRKLLEEEEKNARLQKELG 768
Cdd:PRK03918  561 LEKKLDELEEELAELLKELE 580
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
364-766 3.74e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 53.87  E-value: 3.74e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 364 SVSELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQL-SQENEK 442
Cdd:TIGR04523 120 NKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIkNKLLKL 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 443 LMNLLQERVKNEEPTTQEGKIIELEQKctgileqgrfeREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEF 522
Cdd:TIGR04523 200 ELLLSNLKKKIQKNKSLESQISELKKQ-----------NNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQ 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 523 L-----ELERHNNNMMAKT--LEECRVTLEGLKME-----NGSLKSHLQGEKQKATEA-SAVEQTAESC-EVQEMLKVAR 588
Cdd:TIGR04523 269 LsekqkELEQNNKKIKELEkqLNQLKSEISDLNNQkeqdwNKELKSELKNQEKKLEEIqNQISQNNKIIsQLNEQISQLK 348
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVssllaKVEKDySYLKEICD--HQAEQLSRTSLKLQEKASESDAEIKDMKETIF 666
Cdd:TIGR04523 349 KELTNSESENSEKQRELEEKQNEIEKL-----KKENQ-SYKQEIKNleSQINDLESKIQNQEKLNQQKDEQIKKLQQEKE 422
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 667 ELEDQVEQHRAVKLHNNQLISELES-------SVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVvvandik 739
Cdd:TIGR04523 423 LLEKEIERLKETIIKNNSEIKDLTNqdsvkelIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKE------- 495
                         410       420
                  ....*....|....*....|....*..
gi 1034602044 740 ceaqQELRTVKRKLLEEEEKNARLQKE 766
Cdd:TIGR04523 496 ----KELKKLNEEKKELEEKVKDLTKK 518
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
506-758 9.19e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 52.76  E-value: 9.19e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  506 LEMIKRLKEENEKLNEFL--ELERHNNNMMAKT--LEECRVTLEGLKMENGSLKSHLQGEKQKATEA----SAVEQTAEs 577
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLqsELRRIENRLDELSqeLSDASRKIGEIEKEIEQLEQEEEKLKERLEELeedlSSLEQEIE- 754
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  578 cEVQEMLKVARAEKDLLELSCNELRQELLK-----ANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLK---LQE 649
Cdd:TIGR02169  755 -NVKSELKELEARIEELEEDLHKLEEALNDlearlSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEkeyLEK 833
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  650 KASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQ 729
Cdd:TIGR02169  834 EIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIE 913
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1034602044  730 TAVVVANDIKC---EAQQELRTVKRKLLEEEE 758
Cdd:TIGR02169  914 KKRKRLSELKAkleALEEELSEIEDPKGEDEE 945
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
366-773 1.85e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 51.71  E-value: 1.85e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  366 SELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKL-- 443
Cdd:pfam01576   24 AESELKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEERSQQLQNEKKKMqq 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  444 -MNLLQERVKNEEPTTQ---------EGKIIELEQKCTGILEQ-GRFEREKLLNIQQQLTCSLRKVEEENQGalEMIKRL 512
Cdd:pfam01576  104 hIQDLEEQLDEEEAARQklqlekvttEAKIKKLEEDILLLEDQnSKLSKERKLLEERISEFTSNLAEEEEKA--KSLSKL 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  513 KEENEKLNEFLELERHNNNMMAKTLEECRVTLEG------------------LKMENGSLKSHLQGEKQKATEASAV--- 571
Cdd:pfam01576  182 KNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGestdlqeqiaelqaqiaeLRAQLAKKEEELQAALARLEEETAQknn 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  572 ------EQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKdysyLKEICDHQAEQLSRTsl 645
Cdd:pfam01576  262 alkkirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTLDTTAAQQE----LRSKREQEVTELKKA-- 335
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  646 kLQEKASESDAEIKDMKE----TIFELEDQVEQHRAVKlhnnqliSELESSVIKLEEQKSDLERQLKTLTKQMKEETEEW 721
Cdd:pfam01576  336 -LEEETRSHEAQLQEMRQkhtqALEELTEQLEQAKRNK-------ANLEKAKQALESENAELQAELRTLQQAKQDSEHKR 407
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034602044  722 RRFQADLQTAVVVANDIK----------CEAQQELRTVKRKLLEEEEKNARLQKELGDVQGH 773
Cdd:pfam01576  408 KKLEGQLQELQARLSESErqraelaeklSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQ 469
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
370-764 1.90e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 51.31  E-value: 1.90e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 370 LASLTEKIQKMEENH---HSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILET--SFHQHRERAEQLSQENEKLM 444
Cdd:COG4717    73 LKELEEELKEAEEKEeeyAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLyqELEALEAELAELPERLEELE 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 445 NLLQERVKNEEP-TTQEGKIIELEQKCTGILEQGRFEREKLLniqQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEfl 523
Cdd:COG4717   153 ERLEELRELEEElEELEAELAELQEELEELLEQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELEELEE-- 227
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 524 ELERHNNNMMAKTLEEC-------------RVTLEGLKMENGSLKSHLQG----------------EKQKATEASAVEQT 574
Cdd:COG4717   228 ELEQLENELEAAALEERlkearlllliaaaLLALLGLGGSLLSLILTIAGvlflvlgllallflllAREKASLGKEAEEL 307
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 575 AESCEVQEMLKVARAE-KDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYS--YLKEICDHQAEQLSR---TSLKLQ 648
Cdd:COG4717   308 QALPALEELEEEELEElLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEelQLEELEQEIAALLAEagvEDEEEL 387
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 649 EKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELEssvikLEEQKSDLERQLKTLTKQMKEETEEWRRFQADL 728
Cdd:COG4717   388 RAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEE-----LEEELEELEEELEELEEELEELREELAELEAEL 462
                         410       420       430
                  ....*....|....*....|....*....|....*.
gi 1034602044 729 QTAvvVANDIKCEAQQELRTVKRKLLEEEEKNARLQ 764
Cdd:COG4717   463 EQL--EEDGELAELLQELEELKAELRELAEEWAALK 496
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
431-789 2.06e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 51.48  E-value: 2.06e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 431 ERAEQLSQENEKL---MNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTcSLRKVEEENQGALE 507
Cdd:COG1196   213 ERYRELKEELKELeaeLLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELE-ELELELEEAQAEEY 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 508 MIKRLKEENEKLNEFLELERHNNnmmaktleecRVTLEGLKMENGSLKSHLQGEKQKATEAsaveqtaescevQEMLKVA 587
Cdd:COG1196   292 ELLAELARLEQDIARLEERRREL----------EERLEELEEELAELEEELEELEEELEEL------------EEELEEA 349
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 588 RAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFE 667
Cdd:COG1196   350 EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEE 425
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 668 LEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIK-CEAQQEL 746
Cdd:COG1196   426 LEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLeAEADYEG 505
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 1034602044 747 RTVKRKLLEEEEKNARLQKELGDVQGHGRVVTsRAAPPSLGSV 789
Cdd:COG1196   506 FLEGVKAALLLAGLRGLAGAVAVLIGVEAAYE-AALEAALAAA 547
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
579-780 2.13e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 51.45  E-value: 2.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  579 EVQEMLKVARAEKDLLELSCnELRQELLKANGEIKHVSSLLAKV-----EKDYSYLKEICDHQAEQLSRtslkLQEKASE 653
Cdd:COG4913    239 RAHEALEDAREQIELLEPIR-ELAERYAAARERLAELEYLRAALrlwfaQRRLELLEAELEELRAELAR----LEAELER 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  654 SDAEIKDMKETIFELEDQVEQHRAVKLHN-NQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAV 732
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNGGDRLEQlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALL 393
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1034602044  733 VVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSR 780
Cdd:COG4913    394 EALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPAR 441
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
367-651 8.48e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 49.55  E-value: 8.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 367 ELSLASLTEKIQKMEEnhhsTAEELQATLQELSDQqqmVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNL 446
Cdd:COG1196   238 EAELEELEAELEELEA----ELEELEAELAELEAE---LEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEER 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 447 LQERVKNEEptTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELE 526
Cdd:COG1196   311 RRELEERLE--ELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 527 ---RHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELScNELRQ 603
Cdd:COG1196   389 leaLRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALL-ELLAE 467
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034602044 604 ELLKANGEIKHVSSLLAKVEKDYS--YLKEICDHQAEQLSRTSLKLQEKA 651
Cdd:COG1196   468 LLEEAALLEAALAELLEELAEAAArlLLLLEAEADYEGFLEGVKAALLLA 517
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
370-767 9.36e-06

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 49.20  E-value: 9.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKmeeNHHSTAEELQATLQELSD-QQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENE------K 442
Cdd:TIGR00618  181 LALMEFAKKK---SLHGKAELLTLRSQLLTLcTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREaqeeqlK 257
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  443 LMNLLQERVKNEEPTTQEGKIIELEQKCTGI-------------LEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:TIGR00618  258 KQQLLKQLRARIEELRAQEAVLEETQERINRarkaaplaahikaVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQ 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  510 KRLKEENEKLNEFLELERHNnnmmAKTLEECRVTLEGLKMENgSLKSHLQGEKQKATEASAVEQTAesCEVQEMLKVARA 589
Cdd:TIGR00618  338 SSIEEQRRLLQTLHSQEIHI----RDAHEVATSIREISCQQH-TLTQHIHTLQQQKTTLTQKLQSL--CKELDILQREQA 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  590 EKDLLELSCNELRQELLKANGEI----KHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQ-----EKASESDAEIKD 660
Cdd:TIGR00618  411 TIDTRTSAFRDLQGQLAHAKKQQelqqRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQqlqtkEQIHLQETRKKA 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  661 MKETIfeLEDQVEQHRAVK---LHNNQ-------------LISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRF 724
Cdd:TIGR00618  491 VVLAR--LLELQEEPCPLCgscIHPNParqdidnpgpltrRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEI 568
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1034602044  725 QADLQTAVVVANDIKCEAQQeLRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR00618  569 QQSFSILTQCDNRSKEDIPN-LQNITVRLQDLTEKLSEAEDML 610
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
507-767 1.63e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 48.58  E-value: 1.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  507 EMIKRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATE-ASAVEQTAESCEVQEMLK 585
Cdd:pfam15921   82 EYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDlRNQLQNTVHELEAAKCLK 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  586 varaeKDLLELS---CNELRQELLKANGEIKHVSSLLAKVEKDYSylKEIcdHQAEQLSRTSLK-----LQEKASESDAE 657
Cdd:pfam15921  162 -----EDMLEDSntqIEQLRKMMLSHEGVLQEIRSILVDFEEASG--KKI--YEHDSMSTMHFRslgsaISKILRELDTE 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  658 IKDMKETIFELEDQVEQHRA---------VKLHNN---QLISELESSVIKLEE-------QKSDLERQLKTLTKQMKEET 718
Cdd:pfam15921  233 ISYLKGRIFPVEDQLEALKSesqnkiellLQQHQDrieQLISEHEVEITGLTEkassarsQANSIQSQLEIIQEQARNQN 312
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1034602044  719 EEWRRFQADLQTAVvvandikCEAQQELRTVKRKLleeEEKNARLQKEL 767
Cdd:pfam15921  313 SMYMRQLSDLESTV-------SQLRSELREAKRMY---EDKIEELEKQL 351
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
562-788 2.60e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.13  E-value: 2.60e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 562 KQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLS 641
Cdd:COG3883    17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 642 RTSlklqekASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEw 721
Cdd:COG3883    97 RSG------GSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAA- 169
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034602044 722 rrfQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSRAAPPSLGS 788
Cdd:COG3883   170 ---KAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAA 233
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
388-773 3.75e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 47.27  E-value: 3.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  388 AEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhqhreRAEQLSQENEKLMNLLQErvkneEPTTQEGKIIELE 467
Cdd:TIGR00618  448 TCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHL--------QETRKKAVVLARLLELQE-----EPCPLCGSCIHPN 514
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  468 QKCTGILEQGRFER--EKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKL-NEFLELERHNN------NMMAKTLE 538
Cdd:TIGR00618  515 PARQDIDNPGPLTRrmQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIqQSFSILTQCDNrskediPNLQNITV 594
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  539 ECRVTLEGLKMENGSLKSHLQGEKQKATEASA---VEQTAESCEVQEMLKVARAEKDLLELSCNELRQELL--------- 606
Cdd:TIGR00618  595 RLQDLTEKLSEAEDMLACEQHALLRKLQPEQDlqdVRLHLQQCSQELALKLTALHALQLTLTQERVREHALsirvlpkel 674
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  607 ---------KANGEIKHVSSLLAKVEKDYSYLKEICDH------QAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQ 671
Cdd:TIGR00618  675 lasrqlalqKMQSEKEQLTYWKEMLAQCQTLLRELETHieeydrEFNEIENASSSLGSDLAAREDALNQSLKELMHQART 754
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  672 VEQHRAVKLHNNqliSELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKR 751
Cdd:TIGR00618  755 VLKARTEAHFNN---NEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEE 831
                          410       420
                   ....*....|....*....|..
gi 1034602044  752 KLLEEEEKNARLQKELGDVQGH 773
Cdd:TIGR00618  832 QFLSRLEEKSATLGEITHQLLK 853
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
351-766 5.62e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 46.64  E-value: 5.62e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 351 KSSKCSTAGSSPNSVSELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTA----------ENEKLVDEKT 420
Cdd:pfam05483 339 ELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKfknnkeveleELKKILAEDE 418
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 421 ILETSFHQHRERAEQLSQENEKLMNLLQERVKN-EEPTTQEGKIIELEQKCTGILEQGR--FEREKLLNIQQQLTCSLRK 497
Cdd:pfam05483 419 KLLDEKKQFEKIAEELKGKEQELIFLLQAREKEiHDLEIQLTAIKTSEEHYLKEVEDLKteLEKEKLKNIELTAHCDKLL 498
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 498 VEEEN--QGALEMIKRLKEENEKLNEFLELERHnnnmMAKTLEECRVTLEGLKMENGSLKSHLqgeKQKATEASAVEQTA 575
Cdd:pfam05483 499 LENKEltQEASDMTLELKKHQEDIINCKKQEER----MLKQIENLEEKEMNLRDELESVREEF---IQKGDEVKCKLDKS 571
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 576 ESCEVQEMLKVARAEKDL--LELSCNELRQELLKANGEIKHVSSLLAKVEKDYSY------LKEICDHQAE-QLSRTSLK 646
Cdd:pfam05483 572 EENARSIEYEVLKKEKQMkiLENKCNNLKKQIENKNKNIEELHQENKALKKKGSAenkqlnAYEIKVNKLElELASAKQK 651
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 647 LQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVI-KLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:pfam05483 652 FEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIDKRCQhKIAEMVALMEKHKHQYDKIIEERDSELGLYK 731
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....
gi 1034602044 726 ADLQTAVVVANDIKCE---AQQELRTVKRKLLEEEEKNARLQKE 766
Cdd:pfam05483 732 NKEQEQSSAKAALEIElsnIKAELLSLKKQLEIEKEEKEKLKME 775
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
509-767 1.12e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.82  E-value: 1.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  509 IKRLKEENEKLNEFLELErhnnnmmaKTLEECRVTLEGLKMEngSLKSHLQgEKQKATEASAVEQTAESCEVQE------ 582
Cdd:TIGR02168  202 LKSLERQAEKAERYKELK--------AELRELELALLVLRLE--ELREELE-ELQEELKEAEEELEELTAELQEleekle 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  583 --MLKVARAEKDLLELS--CNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTSLKLQEKASESDAEI 658
Cdd:TIGR02168  271 elRLEVSELEEEIEELQkeLYALANEISRLEQQKQILRERLANLERQLEELEA----QLEELESKLDELAEELAELEEKL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  659 KDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQ----------MKEETEEWRRFQADL 728
Cdd:TIGR02168  347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEierlearlerLEDRRERLQQEIEEL 426
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1034602044  729 QTAVVVANdiKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:TIGR02168  427 LKKLEEAE--LKELQAELEELEEELEELQEELERLEEAL 463
PTZ00121 PTZ00121
MAEBL; Provisional
388-766 1.23e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.90  E-value: 1.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  388 AEELQATLQELSDQQQMVQELTAENEKLVDEKTILEtsfhqHRERAEQLSQENEKLMNLLQERVKNEEPTTQEgkiiELE 467
Cdd:PTZ00121  1393 ADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAE-----EKKKADEAKKKAEEAKKADEAKKKAEEAKKAE----EAK 1463
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  468 QKCTgilEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGL 547
Cdd:PTZ00121  1464 KKAE---EAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  548 KMEngslkshlqgEKQKATEASAVEQTAESCEVQEMLKVARAEKD--LLELSCNELRQELLKANGEIKHVSSLLAKVEKD 625
Cdd:PTZ00121  1541 KAE----------EKKKADELKKAEELKKAEEKKKAEEAKKAEEDknMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE 1610
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  626 YSYLKEICDHQAEQLSrtslKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLE- 704
Cdd:PTZ00121  1611 EAKKAEEAKIKAEELK----KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEd 1686
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044  705 ---------------RQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKN--ARLQKE 766
Cdd:PTZ00121  1687 ekkaaealkkeaeeaKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKkiAHLKKE 1765
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
396-754 1.33e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 45.21  E-value: 1.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 396 QELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLM-NLLQERvkneeptTQEGKIIELeqkctgiL 474
Cdd:PRK04778  105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRkSLLANR-------FSFGPALDE-------L 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 475 EqgrferEKLLNIQQQLTcslRKVEEENQG----ALEMIKRLKEENEKLNEFLE----LERHNNNMMAKTLEECRVTLEG 546
Cdd:PRK04778  171 E------KQLENLEEEFS---QFVELTESGdyveAREILDQLEEELAALEQIMEeipeLLKELQTELPDQLQELKAGYRE 241
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 547 LKMENGSLKsHLQGEKqkateasaveqtaescEVQEML-KVARAEKDLLELSCNELRQELLKANGEIKHVSSLLakvEKD 625
Cdd:PRK04778  242 LVEEGYHLD-HLDIEK----------------EIQDLKeQIDENLALLEELDLDEAEEKNEEIQERIDQLYDIL---ERE 301
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 626 YSYLKEIcDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQK---SD 702
Cdd:PRK04778  302 VKARKYV-EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEiaySE 380
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1034602044 703 LERQLKTLTKQMKEETEEWRRFQADLQT---AVVVANDIKCEAQQELRTVKRKLL 754
Cdd:PRK04778  381 LQEELEEILKQLEEIEKEQEKLSEMLQGlrkDELEAREKLERYRNKLHEIKRYLE 435
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
367-756 1.49e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 45.49  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  367 ELSLASLTEKIQKMEENHHSTAEeLQATLQELsdqQQMVQELTAENEKLVD-EKTILE-TSFHQHRERA-EQLSQENEKL 443
Cdd:pfam15921  447 ERQMAAIQGKNESLEKVSSLTAQ-LESTKEML---RKVVEELTAKKMTLESsERTVSDlTASLQEKERAiEATNAEITKL 522
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  444 MNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNI-QQQLTCSLRKVEEENQGA----LEMIKRLKEENEK 518
Cdd:pfam15921  523 RSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKDKVIEIlRQQIENMTQLVGQHGRTAgamqVEKAQLEKEINDR 602
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  519 LNEFLELERHNNNMMAKTLE-ECRVtleglkmengslkSHLQGEKQKATEASAveqtaescevqEMLKVARaekdllels 597
Cdd:pfam15921  603 RLELQEFKILKDKKDAKIRElEARV-------------SDLELEKVKLVNAGS-----------ERLRAVK--------- 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  598 cnELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRA 677
Cdd:pfam15921  650 --DIKQERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMK 727
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034602044  678 VKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQeLRTVKRKLLEE 756
Cdd:pfam15921  728 VAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEV-LRSQERRLKEK 805
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
389-715 1.53e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 45.40  E-value: 1.53e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 389 EELQATLQELSDQQQMVQELTAENEKLVDEKTILETsfhQHRERAEQLSQENEKLMNLLQER-VKNEEPTTQEGKIIELE 467
Cdd:TIGR04523 321 KKLEEIQNQISQNNKIISQLNEQISQLKKELTNSES---ENSEKQRELEEKQNEIEKLKKENqSYKQEIKNLESQINDLE 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 468 QKctgileqgrFEREKLLNiqQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFL-ELERHNN------NMMAKTLEEC 540
Cdd:TIGR04523 398 SK---------IQNQEKLN--QQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIkDLTNQDSvkeliiKNLDNTRESL 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 541 RVTLEGLKMENGSLKSHLQGEKQ--KATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIkhvSSL 618
Cdd:TIGR04523 467 ETQLKVLSRSINKIKQNLEQKQKelKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKI---SDL 543
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 619 LAKVEKDYSYLKEicdhqaEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEE 698
Cdd:TIGR04523 544 EDELNKDDFELKK------ENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEK 617
                         330
                  ....*....|....*..
gi 1034602044 699 QKSDLERQLKTLTKQMK 715
Cdd:TIGR04523 618 ELEKAKKENEKLSSIIK 634
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
600-782 1.60e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.29  E-value: 1.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  600 ELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIcdhqaeqlsRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAvk 679
Cdd:COG4913    614 ALEAELAELEEELAEAEERLEALEAELDALQER---------REALQRLAEYSWDEIDVASAEREIAELEAELERLDA-- 682
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  680 lhNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEK 759
Cdd:COG4913    683 --SSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALG 760
                          170       180
                   ....*....|....*....|...
gi 1034602044  760 NARLQKELGDVQGHGRVVTSRAA 782
Cdd:COG4913    761 DAVERELRENLEERIDALRARLN 783
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
475-771 1.68e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 45.35  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  475 EQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNNNMMAKT----LEECRVTLEGLKME 550
Cdd:pfam02463  169 RKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDylklNEERIDLLQELLRD 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  551 NGSLKSHLQGEKQKATEASAVEQTAESCEVQEMlKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKdysylk 630
Cdd:pfam02463  249 EQEEIESSKQEIEKEEEKLAQVLKENKEEEKEK-KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEK------ 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  631 eicdhQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQlKTL 710
Cdd:pfam02463  322 -----EKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL-KEE 395
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034602044  711 TKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:pfam02463  396 ELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQ 456
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
389-776 3.44e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 44.29  E-value: 3.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  389 EELQATLQELsdqqQMVQELTAENEKLVDEKtiletsfhqhRERAEQLSQENEKLMNL--LQERVKNEEPTTQEGKIIEL 466
Cdd:TIGR02169  170 RKKEKALEEL----EEVEENIERLDLIIDEK----------RQQLERLRREREKAERYqaLLKEKREYEGYELLKEKEAL 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  467 EQKctgiLEQGRFEREKLLNIQQQLTcslRKVEEENQGALEMIKRLKEENEKLNEFLELERhnnNMMAKTLEECRVTLEG 546
Cdd:TIGR02169  236 ERQ----KEAIERQLASLEEELEKLT---EEISELEKRLEEIEQLLEELNKKIKDLGEEEQ---LRVKEKIGELEAEIAS 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  547 LKMENGSLKSHLQ--GEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAK-VE 623
Cdd:TIGR02169  306 LERSIAEKERELEdaEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAEtRD 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  624 KDYSYLKEICD--HQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKS 701
Cdd:TIGR02169  386 ELKDYREKLEKlkREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLS 465
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034602044  702 DLERQLKTLTKQMKEETEEWRRFQADLQTAvvvandiKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRV 776
Cdd:TIGR02169  466 KYEQELYDLKEEYDRVEKELSKLQRELAEA-------EAQARASEERVRGGRAVEEVLKASIQGVHGTVAQLGSV 533
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
377-667 3.76e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.19  E-value: 3.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  377 IQKMEENHHSTAEELQATLQELSDQQQMvQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVKNEEP 456
Cdd:TIGR00618  600 TEKLSEAEDMLACEQHALLRKLQPEQDL-QDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASR 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  457 TTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFL-ELERHNNNMMAK 535
Cdd:TIGR00618  679 QLALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLkELMHQARTVLKA 758
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  536 TLEECRVTLEGLKME--NGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIK 613
Cdd:TIGR00618  759 RTEAHFNNNEEVTAAlqTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLE 838
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034602044  614 HVSSLLAKVEKDYSYLKEiCDHQAEQLSRTSLKLQEKasESDAEIKDMKETIFE 667
Cdd:TIGR00618  839 EKSATLGEITHQLLKYEE-CSKQLAQLTQEQAKIIQL--SDKLNGINQIKIQFD 889
COG5022 COG5022
Myosin heavy chain [General function prediction only];
440-771 4.92e-04

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 43.91  E-value: 4.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  440 NEKLMNLLQERVKNEEPTTQEGKiieLEQKCTGILEQGRFEREKL-LNIQQQLTCSLRKVEEENQgalEMIKRLKEENEK 518
Cdd:COG5022    775 QVIQHGFRLRRLVDYELKWRLFI---KLQPLLSLLGSRKEYRSYLaCIIKLQKTIKREKKLRETE---EVEFSLKAEVLI 848
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  519 LNEFLELERHNnnmMAKTLEECRVTLEGL-KMENgsLKSHLQGEKQKATEASAVEQTAEscevqemlkvaRAEKDLLELS 597
Cdd:COG5022    849 QKFGRSLKAKK---RFSLLKKETIYLQSAqRVEL--AERQLQELKIDVKSISSLKLVNL-----------ELESEIIELK 912
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  598 CNElrQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTsLKLQEkasesdaEIKDMKETIFELEDQVEQHRA 677
Cdd:COG5022    913 KSL--SSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPEL-NKLHE-------VESKLKETSEEYEDLLKKSTI 982
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  678 VKLHNNQLISELESSVIKLEEQKSDLERqLKTLTKQMKEEteewRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEE 757
Cdd:COG5022    983 LVREGNKANSELKNFKKELAELSKQYGA-LQESTKQLKEL----PVEVAELQSASKIISSESTELSILKPLQKLKGLLLL 1057
                          330
                   ....*....|....
gi 1034602044  758 EKNaRLQKELGDVQ 771
Cdd:COG5022   1058 ENN-QLQARYKALK 1070
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
372-716 5.39e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 43.49  E-value: 5.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 372 SLTEKIQKMEENH---HSTAEELQATLQ----ELSDQQQMVQELTAE----NEKLVDEKTILETSFHQHRERAEQLSQEN 440
Cdd:PRK02224  346 SLREDADDLEERAeelREEAAELESELEeareAVEDRREEIEELEEEieelRERFGDAPVDLGNAEDFLEELREERDELR 425
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 441 EKLMNL------LQERVKNEEPTTQEGKIIELEQKCTG-----ILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEM- 508
Cdd:PRK02224  426 EREAELeatlrtARERVEEAEALLEAGKCPECGQPVEGsphveTIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLv 505
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 509 -----IKRLKEENEKLNEFLELERhnnnmmaKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEAsavEQTAEscEVQEM 583
Cdd:PRK02224  506 eaedrIERLEERREDLEELIAERR-------ETIEEKRERAEELRERAAELEAEAEEKREAAAEA---EEEAE--EAREE 573
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 584 LKvaraekdllelSCNELRQELLKANGEIKHVSSLLAKVEKdysylkeiCDHQAEQLSRTSLKLQEKASESDAEIKDMKE 663
Cdd:PRK02224  574 VA-----------ELNSKLAELKERIESLERIRTLLAAIAD--------AEDEIERLREKREALAELNDERRERLAEKRE 634
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034602044 664 TIFELEDQVEQHRAVKLHNN-----QLISELESSVIKLEEQKSDLERQLKTLTKQMKE 716
Cdd:PRK02224  635 RKRELEAEFDEARIEEAREDkeraeEYLEQVEEKLDELREERDDLQAEIGAVENELEE 692
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
355-596 8.08e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.44  E-value: 8.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 355 CSTAGSSPNSVSELS--LASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRER 432
Cdd:COG4942    12 ALAAAAQADAAAEAEaeLEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKE 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 433 AEQLSQENEKLMNLLQERVKNeepttqegkiieleqkctgILEQGRFEREKLLNIQQQLTCSLRkveeenqgALEMIKRL 512
Cdd:COG4942    92 IAELRAELEAQKEELAELLRA-------------------LYRLGRQPPLALLLSPEDFLDAVR--------RLQYLKYL 144
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 513 KEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARAEKD 592
Cdd:COG4942   145 APARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224

                  ....
gi 1034602044 593 LLEL 596
Cdd:COG4942   225 LEAL 228
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
565-788 8.44e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.44  E-value: 8.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 565 ATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEicdhQAEQLSRTS 644
Cdd:COG4942    17 AQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEA----ELAELEKEI 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 645 LKLQEKASESDAEIKDM------------------KETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQ 706
Cdd:COG4942    93 AELRAELEAQKEELAELlralyrlgrqpplalllsPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAE 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 707 LKTLTKQMKEETEEwrrfQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQGHGRVVTSRAAPPSL 786
Cdd:COG4942   173 RAELEALLAELEEE----RAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGF 248

                  ..
gi 1034602044 787 GS 788
Cdd:COG4942   249 AA 250
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
430-766 8.64e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 43.04  E-value: 8.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  430 RERAEQLSQENEKLMNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:pfam02463  172 KEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  510 KRLKEENEKLNEFLELERHNNNMMAKTLEEcrvtlEGLKMENGSLKSHLQGEKQKATEASAVEQTAESCEVQEMLKVARA 589
Cdd:pfam02463  252 EIESSKQEIEKEEEKLAQVLKENKEEEKEK-----KLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKA 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  590 EKDLLELSCNELRQELLKANGEIKhvsSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELE 669
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIK---REAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEE 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  670 DQVEQHRAVKLHNNQLISELESSVI--KLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKCEAQQELR 747
Cdd:pfam02463  404 EKEAQLLLELARQLEDLLKEEKKEEleILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQ 483
                          330
                   ....*....|....*....
gi 1034602044  748 TVKRKLLEEEEKNARLQKE 766
Cdd:pfam02463  484 EQLELLLSRQKLEERSQKE 502
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
371-730 9.81e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.73  E-value: 9.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  371 ASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQER 450
Cdd:TIGR00606  691 AELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQ 770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  451 VKNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTC------------SLRKVEEENQGALEMIKRLKEENEK 518
Cdd:TIGR00606  771 ETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAklqgsdldrtvqQVNQEKQEKQHELDTVVSKIELNRK 850
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  519 LNE-----FLELERHNNNM------MAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAES--CEVQEMLK 585
Cdd:TIGR00606  851 LIQdqqeqIQHLKSKTNELkseklqIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKdqQEKEELIS 930
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  586 VARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEI-----------CDHQAEQLSR---------TSL 645
Cdd:TIGR00606  931 SKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETelntvnaqleeCEKHQEKINEdmrlmrqdiDTQ 1010
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  646 KLQEKASESDAEIKDMKETIFELEDQVEQHraVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:TIGR00606 1011 KIQERWLQDNLTLRKRENELKEVEEELKQH--LKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFK 1088

                   ....*
gi 1034602044  726 ADLQT 730
Cdd:TIGR00606 1089 KELRE 1093
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
534-766 1.00e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.59  E-value: 1.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  534 AKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAEscevqEMLKVARAEKDLLELscNELRQELLKANGEIK 613
Cdd:COG4913    616 EAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSW-----DEIDVASAEREIAEL--EAELERLDASSDDLA 688
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  614 HVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASE--------SDAEIKDMKETIFELEDQVEQHrAVKLHNNQL 685
Cdd:COG4913    689 ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEEldelqdrlEAAEDLARLELRALLEERFAAA-LGDAVEREL 767
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  686 ISELESSVIKLEEQKSDLERQLktlTKQMKEETEEWRRFQADLQTAVVVANDIkceaQQELRTVKRKLLEE-EEKNARLQ 764
Cdd:COG4913    768 RENLEERIDALRARLNRAEEEL---ERAMRAFNREWPAETADLDADLESLPEY----LALLDRLEEDGLPEyEERFKELL 840

                   ..
gi 1034602044  765 KE 766
Cdd:COG4913    841 NE 842
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
534-739 1.08e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.06  E-value: 1.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 534 AKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAES--CEVQEMLKVARAEKDLLELSCNELRQELLKANGE 611
Cdd:COG4942    19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERriAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 612 IKHVSSLLAKV-------------------------EKDYSYLKEICDH---QAEQLSRTSLKLQEKASESDAEIKDMKE 663
Cdd:COG4942    99 LEAQKEELAELlralyrlgrqpplalllspedfldaVRRLQYLKYLAPArreQAEELRADLAELAALRAELEAERAELEA 178
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034602044 664 TIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIK 739
Cdd:COG4942   179 LLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
480-770 1.16e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.36  E-value: 1.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 480 EREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERHNnnmmAKTLEECRVTLEGLKMENGSLKSHLQ 559
Cdd:PRK03918  180 RLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKE----VKELEELKEEIEELEKELESLEGSKR 255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 560 G--EKQKATEASAVEQTAESCEVQEmlKVARAEK-DLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDhQ 636
Cdd:PRK03918  256 KleEKIRELEERIEELKKEIEELEE--KVKELKElKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIK-E 332
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 637 AEQLSRTSLKLQEKASESDAEIKDMK------ETIFELEDQVEQHRAVKLHNNqlISELESSVIKLEEQKSDLERQLKTL 710
Cdd:PRK03918  333 LEEKEERLEELKKKLKELEKRLEELEerhelyEEAKAKKEELERLKKRLTGLT--PEKLEKELEELEKAKEEIEEEISKI 410
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034602044 711 TkqmkEETEEWRRFQADLQTAVvvaNDIKcEAQQELRTVKRKLLEEEEKN--ARLQKELGDV 770
Cdd:PRK03918  411 T----ARIGELKKEIKELKKAI---EELK-KAKGKCPVCGRELTEEHRKEllEEYTAELKRI 464
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
370-772 1.20e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 42.36  E-value: 1.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEENHH---STAEELQATLQELSDQQQMVQEltaENEKLVDEKTILETSFHQHRERAEQLSQENEKL--- 443
Cdd:TIGR02169  352 RDKLTEEYAELKEELEdlrAELEEVDKEFAETRDELKDYRE---KLEKLKREINELKRELDRLQEELQRLSEELADLnaa 428
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  444 -------MNLLQERV--KNEEPTTQEGKIIELEQKCTGILEQGRFEREKLLNIQQQLTCS-------------------- 494
Cdd:TIGR02169  429 iagieakINELEEEKedKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLqrelaeaeaqaraseervrg 508
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  495 ---------------------LRKVEEENQGALEM--------------------IKRLKEENEKLNEFLELER------ 527
Cdd:TIGR02169  509 graveevlkasiqgvhgtvaqLGSVGERYATAIEVaagnrlnnvvveddavakeaIELLKRRKAGRATFLPLNKmrderr 588
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  528 -------------------------------HNNNMMAKTLEECR--------VTLEGL------KMENGSLKSHLQGEK 562
Cdd:TIGR02169  589 dlsilsedgvigfavdlvefdpkyepafkyvFGDTLVVEDIEAARrlmgkyrmVTLEGElfeksgAMTGGSRAPRGGILF 668
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  563 QKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSR 642
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSS 748
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  643 TSLKLQEKASESD---AEIKDMKETIFELEDQVEQHRAVKLHnnQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETE 719
Cdd:TIGR02169  749 LEQEIENVKSELKeleARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTL 826
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034602044  720 EWRRFQADLQTAVVVANDIK------CEAQQELRTVKRKLLEEEEKNA----RLQKELGDVQG 772
Cdd:TIGR02169  827 EKEYLEKEIQELQEQRIDLKeqiksiEKEIENLNGKKEELEEELEELEaalrDLESRLGDLKK 889
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
561-767 1.27e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.06  E-value: 1.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 561 EKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQL 640
Cdd:COG4942    27 AELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEEL 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 641 SRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQ---LKTLTKQMKEE 717
Cdd:COG4942   107 AELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAEraeLEALLAELEEE 186
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034602044 718 TEEWRRFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:COG4942   187 RAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236
PRK12705 PRK12705
hypothetical protein; Provisional
551-717 1.42e-03

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 42.00  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 551 NGSLKSHLQGEKQKATEASAVEQTAEScEVQEMLKVARAE-KDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYL 629
Cdd:PRK12705   18 LGVLVVLLKKRQRLAKEAERILQEAQK-EAEEKLEAALLEaKELLLRERNQQRQEARREREELQREEERLVQKEEQLDAR 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 630 KEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEDQVEQHRavklhNNQLISELEssvikleeqkSDLERQLKT 709
Cdd:PRK12705   97 AEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLTPEQA-----RKLLLKLLD----------AELEEEKAQ 161

                  ....*...
gi 1034602044 710 LTKQMKEE 717
Cdd:PRK12705  162 RVKKIEEE 169
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
381-772 2.83e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 41.31  E-value: 2.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  381 EENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFH----------QHRERAEQLSQENEKLMNLLQER 450
Cdd:pfam01576    4 EEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQaetelcaeaeEMRARLAARKQELEEILHELESR 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  451 VKNEEPTTQ---------EGKIIELEQKctgiLEQGRFEREKLLNIQQQLTCSLRKVEEE------------------NQ 503
Cdd:pfam01576   84 LEEEEERSQqlqnekkkmQQHIQDLEEQ----LDEEEAARQKLQLEKVTTEAKIKKLEEDillledqnsklskerkllEE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  504 GALEMIKRLKEENEKLNEFLELeRHNNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEkqkATEASavEQTAESCEVQEM 583
Cdd:pfam01576  160 RISEFTSNLAEEEEKAKSLSKL-KNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGE---STDLQ--EQIAELQAQIAE 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  584 LKVARAEKDllelscNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIC-DHQAEQLSRTSLKLQEK--ASESDAEIKD 660
Cdd:pfam01576  234 LRAQLAKKE------EELQAALARLEEETAQKNNALKKIRELEAQISELQeDLESERAARNKAEKQRRdlGEELEALKTE 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  661 MKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAVVVANDIKC 740
Cdd:pfam01576  308 LEDTLDTTAAQQELRSKREQEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENA 387
                          410       420       430
                   ....*....|....*....|....*....|..
gi 1034602044  741 EAQQELRTVKRKLLEEEEKNARLQKELGDVQG 772
Cdd:pfam01576  388 ELQAELRTLQQAKQDSEHKRKKLEGQLQELQA 419
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
366-649 2.89e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 41.26  E-value: 2.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  366 SELSLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRER---AEQLSQENEK 442
Cdd:pfam15921  494 SERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKdkvIEILRQQIEN 573
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  443 LMNL-----------------LQERVKNEEPTTQEGKII---------ELEQKCTGIleqgRFEREKLLNIQQQLTCSLR 496
Cdd:pfam15921  574 MTQLvgqhgrtagamqvekaqLEKEINDRRLELQEFKILkdkkdakirELEARVSDL----ELEKVKLVNAGSERLRAVK 649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  497 KVEEENQGALEMIKRLKEENEKLNEFLELERHNnnmMAKTLEECRVTLEGLKMENGSLKSHLQGEK-------------- 562
Cdd:pfam15921  650 DIKQERDQLLNEVKTSRNELNSLSEDYEVLKRN---FRNKSEEMETTTNKLKMQLKSAQSELEQTRntlksmegsdgham 726
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  563 ------QKATEASAVEQTAESCEVQ---EMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIC 633
Cdd:pfam15921  727 kvamgmQKQITAKRGQIDALQSKIQfleEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKV 806
                          330
                   ....*....|....*.
gi 1034602044  634 DHQAEQLSRTSLKLQE 649
Cdd:pfam15921  807 ANMEVALDKASLQFAE 822
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
369-767 3.22e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 41.18  E-value: 3.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 369 SLASLTEKIQKMEE-NHHSTAEELQATLQELSDQqqmVQELTAENEKLVDEKTILETSFHQHRERAEQLS---QENEKLM 444
Cdd:PRK02224  188 SLDQLKAQIEEKEEkDLHERLNGLESELAELDEE---IERYEEQREQARETRDEADEVLEEHEERREELEtleAEIEDLR 264
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 445 NLLQERVKN-----EEPTTQEGKIIELEQKCTGILEQGRFE----------REKLLNIQQQLTCSLRKVEEENQGALEMI 509
Cdd:PRK02224  265 ETIAETEREreelaEEVRDLRERLEELEEERDDLLAEAGLDdadaeavearREELEDRDEELRDRLEECRVAAQAHNEEA 344
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 510 KRLKEENEKLNEFLELERHNNNMMAKTLEECRVTLEGLKMENGSLkshlqgEKQKATEASAVEQTAESCE-VQEMLKVAR 588
Cdd:PRK02224  345 ESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEEL------EEEIEELRERFGDAPVDLGnAEDFLEELR 418
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 589 AEKDLLELSCNELRQELLKANGEIKHVSSLLAKVE--------KDYSYLKEICDHqaeqlsrtslklQEKASESDAEIKD 660
Cdd:PRK02224  419 EERDELREREAELEATLRTARERVEEAEALLEAGKcpecgqpvEGSPHVETIEED------------RERVEELEAELED 486
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 661 MKETIFELEDQVEqhRAVKLhnnqliSELESSVIKLEEQKSDLERQL---KTLTKQMKEETEEWRRFQADLQTAVVVAND 737
Cdd:PRK02224  487 LEEEVEEVEERLE--RAEDL------VEAEDRIERLEERREDLEELIaerRETIEEKRERAEELRERAAELEAEAEEKRE 558
                         410       420       430
                  ....*....|....*....|....*....|
gi 1034602044 738 IKCEAQQELRTVKRKLLEEEEKNARLQKEL 767
Cdd:PRK02224  559 AAAEAEEEAEEAREEVAELNSKLAELKERI 588
PLN02939 PLN02939
transferase, transferring glycosyl groups
424-762 3.32e-03

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 41.04  E-value: 3.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 424 TSFHQHRERAE-QLSQENEKLMNLLQERVKNEEPTT--QEGKIIEL---EQKCTGILEQGRFER----EKLLNIQQQLTC 493
Cdd:PLN02939   91 TSSDDDHNRASmQRDEAIAAIDNEQQTNSKDGEQLSdfQLEDLVGMiqnAEKNILLLNQARLQAledlEKILTEKEALQG 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 494 SLRKVEEENQGALEMIKRLKEEN---EKLNEFLELERHNNNMMAKTLEECRVT----LEGLKMENGSLKSHLQGEKqkaT 566
Cdd:PLN02939  171 KINILEMRLSETDARIKLAAQEKihvEILEEQLEKLRNELLIRGATEGLCVHSlskeLDVLKEENMLLKDDIQFLK---A 247
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 567 EASAVEQTAESCEVQEmlkvarAEKDLLELSCNELRQELLKANGEIKHVSSL-----LAKVEKdysyLKEICDHQAEQLS 641
Cdd:PLN02939  248 ELIEVAETEERVFKLE------KERSLLDASLRELESKFIVAQEDVSKLSPLqydcwWEKVEN----LQDLLDRATNQVE 317
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 642 RTSLKLQEKasesdaeiKDMKETIFELEDQVEqhravklhnnqliselESSVIKLEEQKSDLerqlktLTKQMKEETEEW 721
Cdd:PLN02939  318 KAALVLDQN--------QDLRDKVDKLEASLK----------------EANVSKFSSYKVEL------LQQKLKLLEERL 367
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 1034602044 722 RRFQADLQTAVVVANDIKCEAQQELrtvkRKLLEEEEKNAR 762
Cdd:PLN02939  368 QASDHEIHSYIQLYQESIKEFQDTL----SKLKEESKKRSL 404
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
683-766 4.10e-03

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 40.58  E-value: 4.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 683 NQLISELESSVIKLEEQKSDLE---RQLKTLTKQMKEETEEWR-RFQADLQTAVVVANDIKCEAQQELRTVKRKLLEEEE 758
Cdd:PRK00409  519 NELIASLEELERELEQKAEEAEallKEAEKLKEELEEKKEKLQeEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQK 598

                  ....*...
gi 1034602044 759 KNARLQKE 766
Cdd:PRK00409  599 GGYASVKA 606
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
370-768 4.52e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 40.80  E-value: 4.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  370 LASLTEKIQKMEEN-HHSTAEELQATLQELSDQQQMVQELTAENEklvdektilETSFHQHRERAEQLSQENEKLMNLLQ 448
Cdd:TIGR00606  593 LAKLNKELASLEQNkNHINNELESKEEQLSSYEDKLFDVCGSQDE---------ESDLERLKEEIEKSSKQRAMLAGATA 663
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  449 ERVKNEEPTTQEgkiielEQKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERH 528
Cdd:TIGR00606  664 VYSQFITQLTDE------NQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQS 737
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  529 NNNMMAKTLEECRVTLEGLKMENGSLKSHLQGEKQKATEASAVEQTAESC--------EVQEMLK---------VARAEK 591
Cdd:TIGR00606  738 IIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCltdvtimeRFQMELKdverkiaqqAAKLQG 817
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  592 DLLELSCNELRQELLKANGEIKHVSS---LLAKVEKDY--------SYLKEICDHQAeQLSRTSLKLQEKASESDAEIKD 660
Cdd:TIGR00606  818 SDLDRTVQQVNQEKQEKQHELDTVVSkieLNRKLIQDQqeqiqhlkSKTNELKSEKL-QIGTNLQRRQQFEEQLVELSTE 896
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  661 MKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQADLQTAvvvANDIKC 740
Cdd:TIGR00606  897 VQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDG---KDDYLK 973
                          410       420
                   ....*....|....*....|....*...
gi 1034602044  741 EAQQELRTVKRKLLEEEEKNARLQKELG 768
Cdd:TIGR00606  974 QKETELNTVNAQLEECEKHQEKINEDMR 1001
PRK10884 PRK10884
SH3 domain-containing protein; Provisional
647-771 4.56e-03

SH3 domain-containing protein; Provisional


Pssm-ID: 182809 [Multi-domain]  Cd Length: 206  Bit Score: 39.25  E-value: 4.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 647 LQEKASESDAEIKDMKEtifeledqveqhRAVKLHNNQLiseleSSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQA 726
Cdd:PRK10884   59 LQVNANTNYAQIRDSKG------------RTAWIPLKQL-----STTPSLRTRVPDLENQVKTLTDKLNNIDNTWNQRTA 121
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1034602044 727 DLQTAVVVANDIKCEAQQELRTVKRKLLEEEEKNARLQKELGDVQ 771
Cdd:PRK10884  122 EMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQ 166
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
390-725 4.74e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 40.54  E-value: 4.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  390 ELQATLQELSDQ-QQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVKNEEPTTQEGKIIE-LE 467
Cdd:pfam01576  226 ELQAQIAELRAQlAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEELEaLK 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  468 QKCTGILEQGRFEREKLLNIQQQLTCSLRKVEEE----NQGALEMIKRLKEENEKLNEFLELERHNNNMMAKTLEecrvt 543
Cdd:pfam01576  306 TELEDTLDTTAAQQELRSKREQEVTELKKALEEEtrshEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQ----- 380
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  544 leGLKMENGSLKSHLQGEKQKATEASAVEQTAEScEVQE-MLKVARAEKDLlelscNELRQELLKANGEIKHVSSLLAKV 622
Cdd:pfam01576  381 --ALESENAELQAELRTLQQAKQDSEHKRKKLEG-QLQElQARLSESERQR-----AELAEKLSKLQSELESVSSLLNEA 452
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  623 EKDYSYLKEICDHQAEQLSRTSLKLQEKASES---DAEIKDMKETIFELEDQVEQH--------RAVKLHNNQL------ 685
Cdd:pfam01576  453 EGKNIKLSKDVSSLESQLQDTQELLQEETRQKlnlSTRLRQLEDERNSLQEQLEEEeeakrnveRQLSTLQAQLsdmkkk 532
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1034602044  686 ISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQ 725
Cdd:pfam01576  533 LEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLE 572
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
369-720 5.07e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 40.34  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  369 SLASLTEKIQKMEENHHSTAEELQATLQELSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQ 448
Cdd:TIGR00618  543 SEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQ 622
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  449 ERVKNEEPTTQEGKIIELEQKctgileqgrferEKLLNIQQQLTCSLRKVEEENQGALEMIKRLKEENEKLNEFLELERH 528
Cdd:TIGR00618  623 PEQDLQDVRLHLQQCSQELAL------------KLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKE 690
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  529 NNNMMAKTLEECRVTLEGLKMENGSLKSHLQgEKQKATEASAVEQTAESCEVQEMLKVARAEKDllelscnelrqELLKA 608
Cdd:TIGR00618  691 QLTYWKEMLAQCQTLLRELETHIEEYDREFN-EIENASSSLGSDLAAREDALNQSLKELMHQAR-----------TVLKA 758
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  609 NGEIKHVSSLLAKVE----KDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDmKETIFELEDQVEQHRAVKLHNnq 684
Cdd:TIGR00618  759 RTEAHFNNNEEVTAAlqtgAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPS-DEDILNLQCETLVQEEEQFLS-- 835
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1034602044  685 LISELESSVIKLEEQKSDLE---RQLKTLTKQMKEETEE 720
Cdd:TIGR00618  836 RLEEKSATLGEITHQLLKYEecsKQLAQLTQEQAKIIQL 874
mukB PRK04863
chromosome partition protein MukB;
363-768 6.43e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 40.33  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  363 NSVSELSlASLTEKIQKMEENHHS----TAEELQATLQElsDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLsQ 438
Cdd:PRK04863   796 EELAERY-ATLSFDVQKLQRLHQAfsrfIGSHLAVAFEA--DPEAELRQLNRRRVELERALADHESQEQQQRSQLEQA-K 871
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  439 ENEKLMNLLQERVKNEEPTTQEGKIIELEQKCTGILEQGRFERE--KLLNIQQQLTCSLRKVEEEnqgaLEMIKRLKEEN 516
Cdd:PRK04863   872 EGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQhgNALAQLEPIVSVLQSDPEQ----FEQLKQDYQQA 947
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  517 EKlneflelERHNNNMMAKTLEECRVTLEGLKMENgslkshlqgekqkateasAVEQTAESCEVQEMLkvaRAEKDLLEL 596
Cdd:PRK04863   948 QQ-------TQRDAKQQAFALTEVVQRRAHFSYED------------------AAEMLAKNSDLNEKL---RQRLEQAEQ 999
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  597 SCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDmketifELEDQVEQHR 676
Cdd:PRK04863  1000 ERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEERARARRD------ELHARLSANR 1073
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044  677 AVKlhnNQLiselessviklEEQKSDLERQLKTLTKQMKEETEEWRrfqaDLQTAVVVANDIKCEAQQELRT--VKRKLL 754
Cdd:PRK04863  1074 SRR---NQL-----------EKQLTFCEAEMDNLTKKLRKLERDYH----EMREQVVNAKAGWCAVLRLVKDngVERRLH 1135
                          410       420
                   ....*....|....*....|.
gi 1034602044  755 EEE-------EKNARLQKELG 768
Cdd:PRK04863  1136 RRElaylsadELRSMSDKALG 1156
PRK12705 PRK12705
hypothetical protein; Provisional
373-540 6.77e-03

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 39.69  E-value: 6.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 373 LTEKIQKMEENHHSTAEELQATLQeLSDQQQMVQELTAENEKLVDEKTILETSFHQHRERAEQLSQENEKLMNLLQERVK 452
Cdd:PRK12705   31 LAKEAERILQEAQKEAEEKLEAAL-LEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEE 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 453 NEEP-TTQEGKIIELEQKCTGILEQ-GRFEREKllniQQQLTCSLRKVEEENQGALEmIKRLKEENEklnefLELERHNN 530
Cdd:PRK12705  110 REKAlSARELELEELEKQLDNELYRvAGLTPEQ----ARKLLLKLLDAELEEEKAQR-VKKIEEEAD-----LEAERKAQ 179
                         170
                  ....*....|
gi 1034602044 531 NMMAKTLEEC 540
Cdd:PRK12705  180 NILAQAMQRI 189
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
474-727 6.87e-03

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 39.41  E-value: 6.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 474 LEQGRFEREKLLniqQQLTCSLRKVEEENQGALEMIKRLKEENEKLN-EFLELERHNNNMMAKTLEE-CRVTLEGLKMEN 551
Cdd:pfam15905  85 LVQERGEQDKRL---QALEEELEKVEAKLNAAVREKTSLSASVASLEkQLLELTRVNELLKAKFSEDgTQKKMSSLSMEL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 552 GSLKSHLQGeKQKATEASAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQE----------LLKANGEIKHVSSLLAK 621
Cdd:pfam15905 162 MKLRNKLEA-KMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEkieeksetekLLEYITELSCVSEQVEK 240
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 622 VEKDYSYLKEICDHQAEQLSRTSLKLQEKASESDAEIKDMKETIFELEdqveqhravklhnnqliSELESSVIKLEEQKS 701
Cdd:pfam15905 241 YKLDIAQLEELLKEKNDEIESLKQSLEEKEQELSKQIKDLNEKCKLLE-----------------SEKEELLREYEEKEQ 303
                         250       260
                  ....*....|....*....|....*.
gi 1034602044 702 DLERQLKTLTKQMKEETEEWRRFQAD 727
Cdd:pfam15905 304 TLNAELEELKEKLTLEEQEHQKLQQK 329
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
435-767 8.26e-03

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 39.70  E-value: 8.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 435 QLSQENEKLMNL---LQERVKNEEPTTQEG-KIIELEQKCtgiLEQGRFEREKLLNIQQQLTCSLRKVEEENQGALEMIK 510
Cdd:pfam05483  82 KLYKEAEKIKKWkvsIEAELKQKENKLQENrKIIEAQRKA---IQELQFENEKVSLKLEEEIQENKDLIKENNATRHLCN 158
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 511 RLKE----ENEKLNEFlELERH---------NNNM--MAKTLEECRVTLEGLKME-NGSLK------SHLQGEKQKATEA 568
Cdd:pfam05483 159 LLKEtcarSAEKTKKY-EYEREetrqvymdlNNNIekMILAFEELRVQAENARLEmHFKLKedhekiQHLEEEYKKEIND 237
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 569 SAVEQTAESCEVQEMLKVARAEKDLLELSCNELRQELLKANGEIKHVSSLLAKVEKDYSYLKEIcdhqaeQLSRTSLKLQ 648
Cdd:pfam05483 238 KEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDI------KMSLQRSMST 311
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034602044 649 EKASESDAEI--KDMKETIFELEDQVEQHRAVKLHNNQLISELESSVIKLEEQKSDLERQLKTLTKQMKEETEEWRRFQA 726
Cdd:pfam05483 312 QKALEEDLQIatKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSS 391
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 1034602044 727 DLQTAVVVANDIKCEAqQELRTV---KRKLLEEEEKNARLQKEL 767
Cdd:pfam05483 392 ELEEMTKFKNNKEVEL-EELKKIlaeDEKLLDEKKQFEKIAEEL 434
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH