Conserved domains on [gi|568982046|ref|XP_006516842|]

cancer-associated gene 1 protein homolog isoform X4 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
CAGE1 pfam15066
Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in ...
1-526 0e+00

Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in tumour tissues compared with surrounding tissues. The CAGE-1 gene shows testis-specific expression among normal tissues and wide expression in a variety of cancer cell lines and cancer tissues. CAGE-1 is predominantly expressed during post-meiotic stages. It localizes to the acrosomal matrix and acrosomal granule, indicating that it is a component of the acrosome of mammalian spermatids and spermatozoa.



Pssm-ID: 464481  Cd Length: 528  Bit Score: 806.36  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046    1 MSESETINVNGPSQDFSYSDSPFCMEASFSSSDLLQSETKNVKRGNESTHTFSEDIYSTEGSLLGDINLGNYPESEQNQP 80
Cdd:pfam15066   1 MSESDAMNVSGLSQDLTHSDSPLCMETSSTTSDLPQNEIKNVKRENESKFTLSEDIYSTLDNLLGDINIGSYSQNVLIQP 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   81 ANTRLSSLRQFEPICKFHWIEAFNDEM-TVEDLRGAFSYSEKPELPSQVYNDAADGSEKPDPFKEESSVESSISENKDEL 159
Cdd:pfam15066  81 VDTSISSLRQFEPICKFHWTEAFNDEMtTFQNLTEGFSYTEKPELQSHVYNYAKDTNIKQDSFKEENPVETSISTNKDQL 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  160 VPAPVRKSPRSLCLNYYRGEAQPLTEAPFVRSAVVDVGLNISQPQSFLDKENVCKNGDNSSDRENCFEQLDLRAIYKAEE 239
Cdd:pfam15066 161 ANECVRQSSRSPPLIHCSGETLPFTEKSLAKSTAKESALNPSQPQSFLYEENVPRNVEKPFYKENSFSLLDLRANYKTEE 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  240 PEVSSKEVQNSGEISELSVSHQEEVTEDGVDSLAITSPWSPAGI-FKGSGPQDNSLRPDREVSCEGLEPLEEDMALNEAL 318
Cdd:pfam15066 241 TEVSSKEIQNSGEIPEMSVSHQKEVTEEGVESPEIASTWSPAGIsWSSGASQENCKTPDTEQSFESLQPLEEDMALNEVL 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  319 QKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVTKQHVLVDIINKLKVNIEELINDKYNVILEKNDINKKLQDLQEAS 398
Cdd:pfam15066 321 QKLKHTNRKQQMQIQDLQCSNLYLEKKVKELQMKITKQQVFVDIINKLKENVEELIEDKYNVILEKNDINKTLQNLQEIL 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  399 AHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELEKATSSA 478
Cdd:pfam15066 401 ANTQKHLQESRKEKETLQLELKKIKVNYVHLQERYITEMQQKNKSVSQCLEMDKTLSKKEEEVERLQQLKGELEKATTSA 480
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*...
gi 568982046  479 LDLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVK 526
Cdd:pfam15066 481 LDLLKREKETREQEFLSLQEEFQKHEKENLEERQKLKSRLEKLVAQVK 528
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
385-691 2.01e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.51  E-value: 2.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   385 NDINKKLQDLQEASAHTKKhLQESKKDKESLQLQVkkIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRL 464
Cdd:TIGR02168  196 NELERQLKSLERQAEKAER-YKELKAELRELELAL--LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   465 QRHKGELEKATSSA---LDLLKREKEIREQEFLSFQEEFQRREKESLK---ERRKLKSRVEKLVAQVKSLLFTCESERAQ 538
Cdd:TIGR02168  273 RLEVSELEEEIEELqkeLYALANEISRLEQQKQILRERLANLERQLEEleaQLEELESKLDELAEELAELEEKLEELKEE 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   539 TMALQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEAVEPDITQETKGTHcnLFLNRSSCKENLELQPLKK 618
Cdd:TIGR02168  353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER--LEDRRERLQQEIEELLKKL 430
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568982046   619 TSPLASGIHSLLALRIGLLTcqDLATPDAELCQESKKANdimlQRLKDCQLKKKDLDKELLKHKNRIATLKEL 691
Cdd:TIGR02168  431 EEAELKELQAELEELEEELE--ELQEELERLEEALEELR----EELEEAEQALDAAERELAQLQARLDSLERL 497
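
Each hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, assuming that layout holds for every hit on the page, the fields could be pulled out with a regular expression. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers introduced for illustration; they are not part of any NCBI tool or API.

```python
import re

# Hypothetical pattern for the per-hit summary line shown on this page.
# The "[Multi-domain]" flag is optional and only appears on some hits.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        # "0e+00" parses to 0.0; the page rounds very small E-values to zero.
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    # The two summary lines from the hit list above, copied verbatim.
    lines = [
        "Pssm-ID: 464481  Cd Length: 528  Bit Score: 806.36  E-value: 0e+00",
        "Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.51  E-value: 2.01e-05",
    ]
    for line in lines:
        print(parse_summary(line))
```

Returning `None` for non-matching lines lets the function be mapped over the whole page text, so alignment rows and description paragraphs are simply skipped.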
 
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
313-567 3.22e-10

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.80  E-value: 3.22e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 313 ALNEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVTKQHvlvDIINKLKVNIEELINDKYNVILEKNDINKKLQ 392
Cdd:COG1196  243 ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQ---AEEYELLAELARLEQDIARLEERRRELEERLE 319
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 393 DLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELE 472
Cdd:COG1196  320 ELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELA 399
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 473 KATSSALDLLKREKEiREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVKSLLFTCESERAQTMALQRQVEELKLE 552
Cdd:COG1196  400 AQLEELEEAEEALLE-RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAA 478
                        250
                 ....*....|....*
gi 568982046 553 NLELRQLAAKREAQA 567
Cdd:COG1196  479 LAELLEELAEAAARL 493
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
309-546 1.01e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 59.30  E-value: 1.01e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   309 EEDMALNEALQKLKQTNKKQ---ELQIQDLHGKNLNLENRVQELQTKV----TKQHVLVDIINKLKVNIEELINDKYNVI 381
Cdd:TIGR02168  257 ELTAELQELEEKLEELRLEVselEEEIEELQKELYALANEISRLEQQKqilrERLANLERQLEELEAQLEELESKLDELA 336
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   382 LEKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEEL 461
Cdd:TIGR02168  337 EELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRR 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   462 QRLQRHKGELE-KATSSALDLLKREKEIREQEFLSFQEEFQRRE--KESLKERRKLKSR----VEKLVAQVKSLLFTCES 534
Cdd:TIGR02168  417 ERLQQEIEELLkKLEEAELKELQAELEELEEELEELQEELERLEeaLEELREELEEAEQaldaAERELAQLQARLDSLER 496
                          250
                   ....*....|..
gi 568982046   535 ERAQTMALQRQV 546
Cdd:TIGR02168  497 LQENLEGFSEGV 508
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
306-528 6.61e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 49.68  E-value: 6.61e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 306 EPLEEDMALNEALQKLKQTNKKQELQIqdLHGKNlNLENRVQELQTKVTKQHVLVDIINKLKVNIEELINDKYNVILEKN 385
Cdd:PRK03918 179 ERLEKFIKRTENIEELIKEKEKELEEV--LREIN-EISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 386 DINKKLQDLQEASAHTKKHLQESKKDKESLQlQVKKIKVHYVRLQERYIAEIQQKNrsasqclEIEKTLSKKDEELQRLQ 465
Cdd:PRK03918 256 KLEEKIRELEERIEELKKEIEELEEKVKELK-ELKEKAEEYIKLSEFYEEYLDELR-------EIEKRLSRLEEEINGIE 327
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 466 RHKGELEKATSSALDLLKREKEIREQ--EFLSFQEEFQR-----REKESLKERRKLKSrVEKLVAQVKSL 528
Cdd:PRK03918 328 ERIKELEEKEERLEELKKKLKELEKRleELEERHELYEEakakkEELERLKKRLTGLT-PEKLEKELEEL 396
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
268-525 7.50e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 40.00  E-value: 7.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 268 GVDSLAITSpwspagIFKGS--GPQDNSLRPDREVSCEGLEPLEEDM-ALNEALQK-LKQTNKKQElqiQDLHGKNLNLE 343
Cdd:NF033838  20 GVASVVVAS------LFLGGvvHAEEVRGGNNPTVTSSGNESQKEHAkEVESHLEKiLSEIQKSLD---KRKHTQNVALN 90
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 344 NRVQELQTKVTkqHVLVDIINKLKVNIEELINDKYNVILEKndINKKLQDLQEASAHTKKHLQESKKDKESlqlQVKKIK 423
Cdd:NF033838  91 KKLSDIKTEYL--YELNVLKEKSEAELTSKTKKELDAAFEQ--FKKDTLEPGKKVAEATKKVEEAEKKAKD---QKEEDR 163
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 424 VHY----VRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELEKATSSALDLLKREKEIREQEFLSFQEE 499
Cdd:NF033838 164 RNYptntYKTLELEIAESDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADA 243
                        250       260
                 ....*....|....*....|....*....
gi 568982046 500 FQRREKESL---KERRKLKSRVEKLVAQV 525
Cdd:NF033838 244 KLKEAVEKNvatSEQDKPKRRAKRGVLGE 272
 
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
316-586 4.67e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 53.79  E-value: 4.67e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 316 EALQKLKQTnkKQELQ-IQDLHGknlNLENRVQEL--QTKVTKQHvlvdiiNKLKVNIEELinDKYNVILEKNDINKKLQ 392
Cdd:COG1196  176 EAERKLEAT--EENLErLEDILG---ELERQLEPLerQAEKAERY------RELKEELKEL--EAELLLLKLRELEAELE 242
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 393 DLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELE 472
Cdd:COG1196  243 ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELE 322
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 473 KATSSA---LDLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVKSLLftcESERAQTMALQRQVEEL 549
Cdd:COG1196  323 EELAELeeeLEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELE---ELAEELLEALRAAAELA 399
                        250       260       270
                 ....*....|....*....|....*....|....*..
gi 568982046 550 KLENLELRQLAAKREAQActpsfEITQSKEQLEEAVE 586
Cdd:COG1196  400 AQLEELEEAEEALLERLE-----RLEEELEELEEALA 431
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
233-528 7.73e-07

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 52.84  E-value: 7.73e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  233 AIYKAEEPEVSSKEVQNSGEISELSVSHQEEVTEDGVDSLAITSPWSPAGIFKGSGPQDNSLRPDREVSCEGLEPLEEDm 312
Cdd:pfam09731 149 KEAKDDAIQAVKAHTDSLKEASDTAEISREKATDSALQKAEALAEKLKEVINLAKQSEEEAAPPLLDAAPETPPKLPEH- 227
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  313 aLNEALQKLkQTNKKQELQIQDLHGknLNLENRVQELQTkvtkqhvLVDIINKLKVNIEE---LINDKYNVILEK----- 384
Cdd:pfam09731 228 -LDNVEEKV-EKAQSLAKLVDQYKE--LVASERIVFQQE-------LVSIFPDIIPVLKEdnlLSNDDLNSLIAHahrei 296
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  385 NDINKKLQDLQ-EASAHTKKHLQESKKDKESLQLQVKKikvhyvRLQERYIAEIQQKNRSASQclEIEKTLSKKDEELqr 463
Cdd:pfam09731 297 DQLSKKLAELKkREEKHIERALEKQKEELDKLAEELSA------RLEEVRAADEAQLRLEFER--EREEIRESYEEKL-- 366
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568982046  464 lqrhKGELEKATSSALDLLKREKEIREQEflsFQEEFQRREKESL-KERRKLKSRVEKLVAQVKSL 528
Cdd:pfam09731 367 ----RTELERQAEAHEEHLKDVLVEQEIE---LQREFLQDIKEKVeEERAGRLLKLNELLANLKGL 425
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
383-586 5.76e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.38  E-value: 5.76e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 383 EKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIkvhyvrlqERYIAEIQQKNRSASQCL-EIEKTLSKKDEEL 461
Cdd:COG4942   21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAAL--------ERRIAALARRIRALEQELaALEAELAELEKEI 92
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 462 QRLQRHKGELEKATSSALDLLKREKEIREQEFLSFQEEFQR--REKESLKE-RRKLKSRVEKLVAQVKSLLftcESERAQ 538
Cdd:COG4942   93 AELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDavRRLQYLKYlAPARREQAEELRADLAELA---ALRAEL 169
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*...
gi 568982046 539 TMALQRQVEELKLENLELRQLAAKREAQACTPSfEITQSKEQLEEAVE 586
Cdd:COG4942  170 EAERAELEALLAELEEERAALEALKAERQKLLA-RLEKELAELAAELA 216
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
306-528 6.61e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 49.68  E-value: 6.61e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 306 EPLEEDMALNEALQKLKQTNKKQELQIqdLHGKNlNLENRVQELQTKVTKQHVLVDIINKLKVNIEELINDKYNVILEKN 385
Cdd:PRK03918 179 ERLEKFIKRTENIEELIKEKEKELEEV--LREIN-EISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 386 DINKKLQDLQEASAHTKKHLQESKKDKESLQlQVKKIKVHYVRLQERYIAEIQQKNrsasqclEIEKTLSKKDEELQRLQ 465
Cdd:PRK03918 256 KLEEKIRELEERIEELKKEIEELEEKVKELK-ELKEKAEEYIKLSEFYEEYLDELR-------EIEKRLSRLEEEINGIE 327
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 466 RHKGELEKATSSALDLLKREKEIREQ--EFLSFQEEFQR-----REKESLKERRKLKSrVEKLVAQVKSL 528
Cdd:PRK03918 328 ERIKELEEKEERLEELKKKLKELEKRleELEERHELYEEakakkEELERLKKRLTGLT-PEKLEKELEEL 396
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
363-584 1.95e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.90  E-value: 1.95e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 363 INKLKVNIEELINDKYNVILEKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNR 442
Cdd:COG3883   18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYR 97
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 443 SASQCLEIEKTLSKKD--EELQRLQRhkgeLEKATSSALDLLKREKEIREQefLSFQEEFQRREKESLKERRK----LKS 516
Cdd:COG3883   98 SGGSVSYLDVLLGSESfsDFLDRLSA----LSKIADADADLLEELKADKAE--LEAKKAELEAKLAELEALKAeleaAKA 171
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568982046 517 RVEKLVAQVKSLLFTCESERAQtmALQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEA 584
Cdd:COG3883  172 ELEAQQAEQEALLAQLSAEEAA--AEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAA 237
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
385-691 2.01e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.51  E-value: 2.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   385 NDINKKLQDLQEASAHTKKhLQESKKDKESLQLQVkkIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRL 464
Cdd:TIGR02168  196 NELERQLKSLERQAEKAER-YKELKAELRELELAL--LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   465 QRHKGELEKATSSA---LDLLKREKEIREQEFLSFQEEFQRREKESLK---ERRKLKSRVEKLVAQVKSLLFTCESERAQ 538
Cdd:TIGR02168  273 RLEVSELEEEIEELqkeLYALANEISRLEQQKQILRERLANLERQLEEleaQLEELESKLDELAEELAELEEKLEELKEE 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   539 TMALQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEAVEPDITQETKGTHcnLFLNRSSCKENLELQPLKK 618
Cdd:TIGR02168  353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER--LEDRRERLQQEIEELLKKL 430
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568982046   619 TSPLASGIHSLLALRIGLLTcqDLATPDAELCQESKKANdimlQRLKDCQLKKKDLDKELLKHKNRIATLKEL 691
Cdd:TIGR02168  431 EEAELKELQAELEELEEELE--ELQEELERLEEALEELR----EELEEAEQALDAAERELAQLQARLDSLERL 497
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
303-584 3.20e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.74  E-value: 3.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   303 EGLEPLEEDMALNEALQKLKQTNKKQ-ELQIQDLHGKNLNLENRVQELQTKVtkqHVLVDIINKLKVNIEElindkynvi 381
Cdd:TIGR02168  239 EELEELQEELKEAEEELEELTAELQElEEKLEELRLEVSELEEEIEELQKEL---YALANEISRLEQQKQI--------- 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   382 lekndINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEEL 461
Cdd:TIGR02168  307 -----LRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   462 QRLqrhkgelekatSSALDLLKREKEireqeflsfQEEFQRREKESLKERrkLKSRVEKLVAQVKSLLftCESERAQTMA 541
Cdd:TIGR02168  382 ETL-----------RSKVAQLELQIA---------SLNNEIERLEARLER--LEDRRERLQQEIEELL--KKLEEAELKE 437
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 568982046   542 LQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEA 584
Cdd:TIGR02168  438 LQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAA 480
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
309-584 4.35e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 47.27  E-value: 4.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   309 EEDMALNEALQKLKQTNKKQELQIQDLHGKNLN-----------LENRVQELQTKVTKQHVLVDIINKLKVNIEELINDK 377
Cdd:pfam02463  166 RLKRKKKEALKKLIEETENLAELIIDLEELKLQelklkeqakkaLEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQEL 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   378 YNVILEKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQER-YIAEIQQKNRSASQCLEIEKTLSK 456
Cdd:pfam02463  246 LRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSeLLKLERRKVDDEEKLKESEKEKKK 325
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   457 KDEELQRLQRHKGELEKATSSAldLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVKSLLFTCESER 536
Cdd:pfam02463  326 AEKELKKEKEEIEELEKELKEL--EIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEE 403
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 568982046   537 AQTMALQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEA 584
Cdd:pfam02463  404 EKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKE 451
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
303-586 2.02e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.06  E-value: 2.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   303 EGLEPLEEDmaLNEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVTKQhvlVDIINKLKVNIEELINDKYNVIL 382
Cdd:TIGR02169  684 EGLKRELSS--LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKL---KERLEELEEDLSSLEQEIENVKS 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   383 EKNDINKKLQDLQEASAHTKKHLQESKKDK-----ESLQLQVKKIKVHYVRLQERyIAEIQQKNRSASQCLEIEKTLSKK 457
Cdd:TIGR02169  759 ELKELEARIEELEEDLHKLEEALNDLEARLshsriPEIQAELSKLEEEVSRIEAR-LREIEQKLNRLTLEKEYLEKEIQE 837
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   458 DEELQRLQRHKGELEKATSSALDLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVKSLLFTCESERA 537
Cdd:TIGR02169  838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 568982046   538 QTMALQRQVEELKLENLE-LRQLAAKREAQACTPSFE-ITQSKEQLEEAVE 586
Cdd:TIGR02169  918 RLSELKAKLEALEEELSEiEDPKGEDEEIPEEELSLEdVQAELQRVEEEIR 968
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
315-521 2.46e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha-helical structures) and low complexity rich in D, E, N, Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.63  E-value: 2.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  315 NEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKvtkqhvlvdiINKLKVNIEELINDKYNVILEKNDINKKLQDL 394
Cdd:TIGR04523 334 NKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNE----------IEKLKKENQSYKQEIKNLESQINDLESKIQNQ 403
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  395 QEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQEryiaEIQQ-KNRSASQCLEIEKTLSKKDEELQRLQRHKGELEK 473
Cdd:TIGR04523 404 EKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNS----EIKDlTNQDSVKELIIKNLDNTRESLETQLKVLSRSINK 479
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 568982046  474 ATSSaLDLLKREKEIREQEFLSFQEEFQRRE---KESLKERRKLKSRVEKL 521
Cdd:TIGR04523 480 IKQN-LEQKQKELKSKEKELKKLNEEKKELEekvKDLTKKISSLKEKIEKL 529
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
313-526 7.16e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.83  E-value: 7.16e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 313 ALNEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVTKqhvLVDIINKLKVNIEELINDKYNVILEKNDINKKLQ 392
Cdd:COG4942   24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAA---LARRIRALEQELAALEAELAELEKEIAELRAELE 100
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 393 DLQEASAHTKKHLQE-SKKDKESLQLQVKKIKVHYVRLQ-ERYIAEIQQKNRSA--SQCLEIEKTLSKKDEELQRLQRHK 468
Cdd:COG4942  101 AQKEELAELLRALYRlGRQPPLALLLSPEDFLDAVRRLQyLKYLAPARREQAEElrADLAELAALRAELEAERAELEALL 180
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 568982046 469 GELEKATsSALDLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQVK 526
Cdd:COG4942  181 AELEEER-AALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAA 237
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
405-566 1.42e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 42.03  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  405 LQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQR--------LQRHKGELEKATS 476
Cdd:pfam17380 355 QEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRkiqqqkveMEQIRAEQEEARQ 434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  477 SALDLLKREKE-----IREQEF--------LSFQEEFQRREKESLKERRKLKSRVEKlvaQVKSLLFTCESERAQTMALQ 543
Cdd:pfam17380 435 REVRRLEEERAremerVRLEEQerqqqverLRQQEEERKRKKLELEKEKRDRKRAEE---QRRKILEKELEERKQAMIEE 511
                         170       180
                  ....*....|....*....|...
gi 568982046  544 RQVEELKLENLELRQLAAKREAQ 566
Cdd:pfam17380 512 ERKRKLLEKEMEERQKAIYEEER 534
PRK11637 PRK11637
AmiB activator; Provisional
385-545 1.58e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 41.60  E-value: 1.58e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 385 NDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKikvhyvrlQERYIAEIQQKNRSASQCLE-IEKTLSKKDEELQR 463
Cdd:PRK11637  43 SDNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKK--------QEEAISQASRKLRETQNTLNqLNKQIDELNASIAK 114
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 464 LQRHKGELEKATSSALDLLKREKEIREQEFLSFQEEFQRREK-----ESLKERRK-----LKSRVEKLVAQvKSLLFTCE 533
Cdd:PRK11637 115 LEQQQAAQERLLAAQLDAAFRQGEHTGLQLILSGEESQRGERilayfGYLNQARQetiaeLKQTREELAAQ-KAELEEKQ 193
                        170
                 ....*....|..
gi 568982046 534 SERAQTMALQRQ 545
Cdd:PRK11637 194 SQQKTLLYEQQA 205
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
405-586 1.87e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.98  E-value: 1.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   405 LQESKKDKESLQ-LQVKKIKVH-YVRLQERYIAEIQQKNrsasqcleIEKTLSKKDEELQRLQRHKGELEKATSSALDLL 482
Cdd:TIGR02169  203 LRREREKAERYQaLLKEKREYEgYELLKEKEALERQKEA--------IERQLASLEEELEKLTEEISELEKRLEEIEQLL 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   483 ----KREKEIREQEFLSFQE-----------------EFQRREKESLKERRKLKSRVEKLVAQVKSLLFTCESERAQTMA 541
Cdd:TIGR02169  275 eelnKKIKDLGEEEQLRVKEkigeleaeiaslersiaEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDK 354
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 568982046   542 LQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEEAVE 586
Cdd:TIGR02169  355 LTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR 399
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
313-528 2.31e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 41.58  E-value: 2.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   313 ALNEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVTKQHVLVDIINKLKVNIEELINDKYNVILEKNDINKKLQ 392
Cdd:TIGR02168  706 ELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   393 DLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRL---QRHKG 469
Cdd:TIGR02168  786 ELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLaaeIEELE 865
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568982046   470 ELEKATSSALDLLKREKEIREQEFLSFQEEFQ---RREKESLKERRKLKSRVEKLVAQVKSL 528
Cdd:TIGR02168  866 ELIEELESELEALLNERASLEEALALLRSELEelsEELRELESKRSELRRELEELREKLAQL 927
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
313-472 2.58e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.59  E-value: 2.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   313 ALNEALQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKvtkqhvlvdiINKLKVNIEELINDKYNVILEKNDINKKLQ 392
Cdd:TIGR02169  354 KLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREK----------LEKLKREINELKRELDRLQEELQRLSEELA 423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   393 DLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELE 472
Cdd:TIGR02169  424 DLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASE 503
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
303-528 2.94e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.20  E-value: 2.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 303 EGLEPLEEDmaLNEALQKLKQTNKKQElQIQDLHGKNLNLENRVQELQTKvtkqHVLVDIINKLKVNIEELinDKYNVIL 382
Cdd:PRK03918 314 KRLSRLEEE--INGIEERIKELEEKEE-RLEELKKKLKELEKRLEELEER----HELYEEAKAKKEELERL--KKRLTGL 384
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 383 EKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQE---------RYIAEIQQKNRSASQCLE---I 450
Cdd:PRK03918 385 TPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKakgkcpvcgRELTEEHRKELLEEYTAElkrI 464
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 451 EKTLSKKDEELQRLQRHKGELEKATSSALDLLKrEKEIREQeFLSFQEEFQRREKESL----KERRKLKSRVEKLVAQVK 526
Cdd:PRK03918 465 EKELKEIEEKERKLRKELRELEKVLKKESELIK-LKELAEQ-LKELEEKLKKYNLEELekkaEEYEKLKEKLIKLKGEIK 542

                 ..
gi 568982046 527 SL 528
Cdd:PRK03918 543 SL 544
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
413-803 4.03e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 40.72  E-value: 4.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   413 ESLQLQVKKIKVHYVRLQERYIAE--IQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELEKATSSALDLLKREKEIRE 490
Cdd:TIGR00618  219 ERKQVLEKELKHLREALQQTQQSHayLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPLAAH 298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   491 QEFLSfQEEFQRRE-----KESLKERRKLKSRVEKLVAQVKSLLFTCESER---AQTMALQRQVEELKLENLELRQlaAK 562
Cdd:TIGR00618  299 IKAVT-QIEQQAQRihtelQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQtlhSQEIHIRDAHEVATSIREISCQ--QH 375
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   563 REAQACTPSFEITQSKEQLEEAVEPDITQETKGTHCNLFLNRSSCKENLELQPLKKTSPLASGIHSLLALRIG-LLTCQD 641
Cdd:TIGR00618  376 TLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITcTAQCEK 455
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   642 LATPDAELCQESKKANDIMLQRLKDCQLKKKDLDKELLKHKNRIATL-KELIASEKALQAHTIEITKLGG---LLESKED 717
Cdd:TIGR00618  456 LEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEpCPLCGSCIHPNPARQDIDNPGPltrRMQRGEQ 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046   718 HYSRLIEENDKYRRHVGSLINKVTSYEEIIKCADQRLEISHSQIAHLEErnrhLEDLIRMPREKAKRPRPRLDNHPKSLT 797
Cdd:TIGR00618  536 TYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKE----DIPNLQNITVRLQDLTEKLSEAEDMLA 611

                   ....*.
gi 568982046   798 LISHLE 803
Cdd:TIGR00618  612 CEQHAL 617
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
340-526 6.65e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 39.84  E-value: 6.65e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 340 LNLENRVQELQTKVTKQH----VLVDIINKLKVN--IEELINDkynvileknDINKKLQDLQEASAHTKKHLQESKKDKE 413
Cdd:COG2433  346 DAYKNKFERVEKKVPPDVdrdeVKARVIRGLSIEeaLEELIEK---------ELPEEEPEAEREKEHEERELTEEEEEIR 416
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 414 SLQLQVKKIKVHYVRLQ----------ERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELEKATSSALDLLK 483
Cdd:COG2433  417 RLEEQVERLEAEVEELEaeleekderiERLERELSEARSEERREIRKDREISRLDREIERLERELEEERERIEELKRKLE 496
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....
gi 568982046 484 REKEIREQEFlsfqeefqRREKESLKERRKL-KSRVEKLVAQVK 526
Cdd:COG2433  497 RLKELWKLEH--------SGELVPVKVVEKFtKEAIRRLEEEYG 532
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
268-525 7.50e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 40.00  E-value: 7.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 268 GVDSLAITSpwspagIFKGS--GPQDNSLRPDREVSCEGLEPLEEDM-ALNEALQK-LKQTNKKQElqiQDLHGKNLNLE 343
Cdd:NF033838  20 GVASVVVAS------LFLGGvvHAEEVRGGNNPTVTSSGNESQKEHAkEVESHLEKiLSEIQKSLD---KRKHTQNVALN 90
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 344 NRVQELQTKVTkqHVLVDIINKLKVNIEELINDKYNVILEKndINKKLQDLQEASAHTKKHLQESKKDKESlqlQVKKIK 423
Cdd:NF033838  91 KKLSDIKTEYL--YELNVLKEKSEAELTSKTKKELDAAFEQ--FKKDTLEPGKKVAEATKKVEEAEKKAKD---QKEEDR 163
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 424 VHY----VRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQRLQRHKGELEKATSSALDLLKREKEIREQEFLSFQEE 499
Cdd:NF033838 164 RNYptntYKTLELEIAESDVEVKKAELELVKEEAKEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADA 243
                        250       260
                 ....*....|....*....|....*....
gi 568982046 500 FQRREKESL---KERRKLKSRVEKLVAQV 525
Cdd:NF033838 244 KLKEAVEKNvatSEQDKPKRRAKRGVLGE 272
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
383-525 8.11e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 38.75  E-value: 8.11e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 383 EKNDINKKLQDLQEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERyIAEIQQKNRSAS-----QCLEIEKTLSKK 457
Cdd:COG1579   25 RLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEAR-IKKYEEQLGNVRnnkeyEALQKEIESLKR 103
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046 458 DEEL--QRLQRHKGELEKATSsALDLLKREKEIREQEFLSFQEEFQRREKESLKERRKLKSRVEKLVAQV 525
Cdd:COG1579  104 RISDleDEILELMERIEELEE-ELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAKI 172
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
318-745 8.19e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 39.62  E-value: 8.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  318 LQKLKQTNKKQELQIQDLHGKNLNLENRVQELQTKVT-KQHVLVDIINKLKVNIEELINDK---YNVILEKNDINKKLQD 393
Cdd:TIGR04523 206 LKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINeKTTEISNTQTQLNQLKDEQNKIKkqlSEKQKELEQNNKKIKE 285
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  394 L-------------------QEASAHTKKHLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTL 454
Cdd:TIGR04523 286 LekqlnqlkseisdlnnqkeQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQREL 365
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  455 SKKDEELQRLQR----HKGELEKATSSALDLlkrEKEIREQEFLSFQEEFQRREKESLKErrklksrveKLVAQVKSLLF 530
Cdd:TIGR04523 366 EEKQNEIEKLKKenqsYKQEIKNLESQINDL---ESKIQNQEKLNQQKDEQIKKLQQEKE---------LLEKEIERLKE 433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  531 TCESERAQTMALQRQVEELKLENLELRQLAAKREAQACTPSFEITQSKEQLEeavepDITQETKgTHCNLFLNRSSCKEN 610
Cdd:TIGR04523 434 TIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLE-----QKQKELK-SKEKELKKLNEEKKE 507
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  611 LElqplKKTSPLASGIHSLLalriglLTCQDLATPDAELCQESKKANDIMLQrlKDCQLKKKDLDKELLKHKNRIATLKE 690
Cdd:TIGR04523 508 LE----EKVKDLTKKISSLK------EKIEKLESEKKEKESKISDLEDELNK--DDFELKKENLEKEIDEKNKEIEELKQ 575
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568982046  691 LIASEKALQahtieiTKLGGLLESKEDHYSRLIEENDKYRRHVGSLINKVTSYEE 745
Cdd:TIGR04523 576 TQKSLKKKQ------EEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKK 624
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
309-531 8.69e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 39.62  E-value: 8.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  309 EEDMALNEALQKLKQTNKKQELQIQDLHGK----NLNLENRVQELQTKVTKqhvlvdiINKLKVNIEELINDKYNVILEK 384
Cdd:TIGR04523 447 NQDSVKELIIKNLDNTRESLETQLKVLSRSinkiKQNLEQKQKELKSKEKE-------LKKLNEEKKELEEKVKDLTKKI 519
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  385 NDINKKLQDLQeasahtkkhLQESKKDKESLQLQVKKIKVHYVRLQERYIAEIQQKNRSASQCLEIEKTLSKKDEELQrl 464
Cdd:TIGR04523 520 SSLKEKIEKLE---------SEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQ-- 588
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568982046  465 qrhkgELEKATSSALDLLKREKEIREQEFLSFQEEFQRREKESLK---ERRKLKSRVEKLVAQVKSLLFT 531
Cdd:TIGR04523 589 -----ELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKlssIIKNIKSKKNKLKQEVKQIKET 653
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
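
Each domain hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters list an E-value threshold of 0.01. The following is a minimal sketch, not part of the NCBI output, of how such a summary line could be parsed from copied text and compared against that threshold. The names HitSummary, parse_summary, and passes_threshold are hypothetical, and treating the threshold as inclusive is an assumption rather than documented CD-Search behaviour.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Fields taken from one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern for the summary-line layout as it appears in this page's text.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[0-9.]+)\s*"
    r"E-value:\s*(?P<evalue>[0-9.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse a single summary line; raise ValueError if it does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


def passes_threshold(hit: HitSummary, threshold: float = 0.01) -> bool:
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit.e_value <= threshold


if __name__ == "__main__":
    line = ("Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  "
            "Bit Score: 40.00  E-value: 7.50e-03")
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Applied to the summary lines on this page, every listed hit has an E-value below 0.01, which is consistent with the threshold shown in the search parameters.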
