|
Name |
Accession |
Description |
Interval |
E-value |
| SH3_10 |
pfam17902 |
SH3 domain; This entry represents an SH3 domain. |
404-468 |
1.59e-26 |
|
SH3 domain; This entry represents an SH3 domain.
Pssm-ID: 407754 Cd Length: 65 Bit Score: 103.88 E-value: 1.59e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 404 PLPQRRNPPQQPLHVDSICDWDSGEVQLLQGERYKLVDNTDPHAWVVQGPGGETKRAPAACFCIP 468
Cdd:pfam17902 1 PLKQRRSPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
930-1473 |
1.13e-17 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 90.00 E-value: 1.13e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 930 AQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRA 1009
Cdd:COG1196 242 EELEAELEELEAELEELEAELAELEAELEELRLELEELELE--LEEAQAEEYELLAELARLEQDIARLEERRRELEERLE 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1010 GLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVErdpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKRE 1089
Cdd:COG1196 320 ELEEELAELEEELEELEEELEELEEELEEAEEELEE----AEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAA 395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1090 VDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEpKVIVKEVKKVEQDPG 1169
Cdd:COG1196 396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE-ALLELLAELLEEAAL 474
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQ---ERVHEIFQVDPETEQEI-TRLKAKLQEMAGK 1245
Cdd:COG1196 475 LEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRglaGAVAVLIGVEAAYEAALeAALAAALQNIVVE 554
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1246 RSGVEKEVEKLLPDLEVLRA---QKPTVEYKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDE 1322
Cdd:COG1196 555 DDEVAAAAIEYLKAAKAGRAtflPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEA 634
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1323 WRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKpEEKVVVQEVVVTQK 1402
Cdd:COG1196 635 ALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEAL-LAEEEEERELAEAE 713
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1403 DPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSkklAVERELRQLTLRIQELEK 1473
Cdd:COG1196 714 EERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE---ELERELERLEREIEALGP 781
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
934-1667 |
1.94e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 79.72 E-value: 1.94e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 934 RESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQA 1013
Cdd:TIGR02168 260 AELQELEEKLEELRLEVSELEEEIEELQKELYALANE--ISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1014 DLEVAAQKVVQLESKRKTMQPHLLTKEVTQVERDPGLDSQAAQLRIQ---IQQLRGEDAVISARLEGLKKELLALEKREV 1090
Cdd:TIGR02168 338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLrskVAQLELQIASLNNEIERLEARLERLEDRRE 417
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1091 DVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDA-ARRKQAEEAVAKLQARIEDLERAISSVEPKvivkevkkveqdpg 1169
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAELKELQAELEELEEELEELqEELERLEEALEELREELEEAEQALDAAERE-------------- 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSlLEEERTKNATLARELSdlhskySVVEKQRPKVQLQERVHEIFQVDPETEQEI-TRLKAKLQEMAGKR-S 1247
Cdd:TIGR02168 484 LAQLQARLDS-LERLQENLEGFSEGVK------ALLKNQSGLSGILGVLSELISVDEGYEAAIeAALGGRLQAVVVENlN 556
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1248 GVEKEVEKLLPD---------LEVLRAQKPTVEYKEVTQEV-------VRHERSPEVLRE-----------IDRLKAQLN 1300
Cdd:TIGR02168 557 AAKKAIAFLKQNelgrvtflpLDSIKGTEIQGNDREILKNIegflgvaKDLVKFDPKLRKalsyllggvlvVDDLDNALE 636
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1301 ELVNSHGRsqEQLIRLQGERDEWRRERAKVETKTVSKEVVRhekdpvlEKEAERLRQEVREAAQKRRAAEDAVYELQSKr 1380
Cdd:TIGR02168 637 LAKKLRPG--YRIVTLDGDLVRPGGVITGGSAKTNSSILER-------RREIEELEEKIEELEEKIAELEKALAELRKE- 706
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1381 lLLERRKPEEKVVVQEVVVTQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLsfQEDRSKKLAVERE 1460
Cdd:TIGR02168 707 -LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERL--EEAEEELAEAEAE 783
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1461 LRQLTLRIQELEKRPPTVQEKI-IMEEVVKLEKDP--DLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKA-KS 1536
Cdd:TIGR02168 784 IEELEAQIEQLKEELKALREALdELRAELTLLNEEaaNLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEiEE 863
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1537 QEKTIYKEVIRVQKdrvLEDERARVWEMLNRERTARQAREEEARRLRERIDRAEtlgrtwsREESELQRARDQADQECGR 1616
Cdd:TIGR02168 864 LEELIEELESELEA---LLNERASLEEALALLRSELEELSEELRELESKRSELR-------RELEELREKLAQLELRLEG 933
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1617 LQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRgqeLSRLEAAI 1667
Cdd:TIGR02168 934 LEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRR---LKRLENKI 981
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
901-1684 |
1.45e-12 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 73.64 E-value: 1.45e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 901 EFARKMLEKKELSEDIRRTHDAKQGSEspaqAGRESEALKAqleEERKRVARVQHELEAQRSQllQLRTQRPLERLEE-K 979
Cdd:PTZ00121 1112 EEARKAEEAKKKAEDARKAEEARKAED----ARKAEEARKA---EDAKRVEIARKAEDARKAE--EARKAEDAKKAEAaR 1182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 980 EVVEFYRDPQLEgslsrvkaQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQphllTKEVTQVERDPGLDSQAAQLRI 1059
Cdd:PTZ00121 1183 KAEEVRKAEELR--------KAEDARKAEAARKAEEERKAEEARKAEDAKKAEA----VKKAEEAKKDAEEAKKAEEERN 1250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1060 QIQQLRGEDAVIsARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEdaarRKQAEEAVAKLQ 1139
Cdd:PTZ00121 1251 NEEIRKFEEARM-AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE----AKKADEAKKKAE 1325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1140 ARIEDLERAISSVEPKvivkevkkveqdpgllQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKvQLQERVH 1219
Cdd:PTZ00121 1326 EAKKKADAAKKKAEEA----------------KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD-AAKKKAE 1388
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1220 EIFQVDpeteqeitRLKAKLQEMAGKRSGVEKEVEkllpdlevlrAQKPTVEYKEVTQEVVRHE---RSPEVLREIDRLK 1296
Cdd:PTZ00121 1389 EKKKAD--------EAKKKAEEDKKKADELKKAAA----------AKKKADEAKKKAEEKKKADeakKKAEEAKKADEAK 1450
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1297 AQLNElvnshGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEKdpvlekEAERLRQEVREAAQKRRAAEDAvYEL 1376
Cdd:PTZ00121 1451 KKAEE-----AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE------EAKKKADEAKKAAEAKKKADEA-KKA 1518
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1377 QSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLA 1456
Cdd:PTZ00121 1519 EEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVM 1598
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1457 VERElRQLTLRIQELEKRPptvQEKIIMEEVVKLEkdpDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKS 1536
Cdd:PTZ00121 1599 KLYE-EEKKMKAEEAKKAE---EAKIKAEELKKAE---EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1537 QEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRlreridRAETLGRTWSREESELQRARDQADQEcGR 1616
Cdd:PTZ00121 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK------KAEELKKAEEENKIKAEEAKKEAEED-KK 1744
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1617 LQQELRALErqkqqqtlqlqEESKLLSQKTESERQKAAQRGQELSRLEAAILREKDQI--YEKERTLRDL 1684
Cdd:PTZ00121 1745 KAEEAKKDE-----------EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKrrMEVDKKIKDI 1803
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
939-1710 |
1.91e-10 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 66.53 E-value: 1.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 939 LKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVA 1018
Cdd:pfam02463 167 LKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1019 AQKVVQLESKRKTMQPHllTKEVTQVERDPGLDSQaaqlriQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVV 1098
Cdd:pfam02463 247 RDEQEEIESSKQEIEKE--EEKLAQVLKENKEEEK------EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1099 --KEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDPGLLQESSR 1176
Cdd:pfam02463 319 seKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELE 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1177 LRSLLEEERTKNATLAR----ELSDLHSKYSVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQemagkrsgvEKE 1252
Cdd:pfam02463 399 LKSEEEKEAQLLLELARqledLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELE---------LKK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1253 VEKLLPDLEVLraqkptveyKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVET 1332
Cdd:pfam02463 470 SEDLLKETQLV---------KLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVEN 540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1333 KTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHS- 1411
Cdd:pfam02463 541 YKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDk 620
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1412 -RLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKL 1490
Cdd:pfam02463 621 rAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLE 700
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1491 EKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKEvIRVQKDRVLEDERARVWEMLNRERT 1570
Cdd:pfam02463 701 IKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEK-SRLKKEEKEEEKSELSLKEKELAEE 779
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1571 ARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKtESER 1650
Cdd:pfam02463 780 REKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEE-ELER 858
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1651 QKAAQRGQELSRLEAAILREKDQIYEKERTLrDLHAKVSREELSQETQTRETNLSTKISI 1710
Cdd:pfam02463 859 LEEEITKEELLQELLLKEEELEEQKLKDELE-SKEEKEKEEKKELEEESQKLNLLEEKEN 917
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1348-1701 |
3.96e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 65.34 E-value: 3.96e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1348 LEKEAERLRQEvREAAQKRRAAEDAVYELQSKRLLLERRKpeekvvvqevvvtqkdpkLREEHSRLSGSLDEEVGRRRQL 1427
Cdd:COG1196 198 LERQLEPLERQ-AEKAERYRELKEELKELEAELLLLKLRE------------------LEAELEELEAELEELEAELEEL 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1428 ELEVQQLRAGVEEQegllsfqedRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEvvklEKDPDLEKSTEALRWDL 1507
Cdd:COG1196 259 EAELAELEAELEEL---------RLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEEL 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1508 DQEKTQVTELNRECKNLQVQIDVLQKAKSQEKtiykEVIRVQKDRVLEDERARvwemLNRERTARQAREEEARRLRERID 1587
Cdd:COG1196 326 AELEEELEELEEELEELEEELEEAEEELEEAE----AELAEAEEALLEAEAEL----AEAEEELEELAEELLEALRAAAE 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1588 RAEtlgrtwsrEESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRGQELSRLEAAI 1667
Cdd:COG1196 398 LAA--------QLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELL 469
|
330 340 350
....*....|....*....|....*....|....
gi 1002819366 1668 LREKDQIYEKERTLRDLHAKVSREELSQETQTRE 1701
Cdd:COG1196 470 EEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1880-1918 |
5.73e-10 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 56.18 E-value: 5.73e-10
10 20 30
....*....|....*....|....*....|....*....
gi 1002819366 1880 LLEAQAATGGIVDLLSRERYSVHKAMERGLIENTSTQRL 1918
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1916-1953 |
8.42e-08 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 49.79 E-value: 8.42e-08
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1916 QRLLNAQKAFTGIEDPVTKKRLSVGEAVQKGWMPRESV 1953
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
228-415 |
9.81e-08 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 54.76 E-value: 9.81e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 228 LYTHLQGCTRQLSALAEQQRRILQQDwsDLMADPAGVRREYEHFK--QHELLSQEQSVNQLEDDGERMVELRHPAVGPIQ 305
Cdd:cd00176 1 KLQQFLRDADELEAWLSEKEELLSST--DYGDDLESVEALLKKHEalEAELAAHEERVEALNELGEQLIEEGHPDAEEIQ 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 306 AHQEALKMEWQNFLNLCICQETQLQHVEDYRRFQEEADSVSQTLAKLNSNLDakySPAPGGPPGAPTELLQQLEAEEKRL 385
Cdd:cd00176 79 ERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALA---SEDLGKDLESVEELLKKHKELEEEL 155
|
170 180 190
....*....|....*....|....*....|
gi 1002819366 386 avtERATGDLQRRSRDVAPLPQRRNPPQQP 415
Cdd:cd00176 156 ---EAHEPRLKSLNELAEELLEEGHPDADE 182
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1957-1994 |
1.43e-07 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 49.25 E-value: 1.43e-07
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1957 LQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELAQLL 1994
Cdd:pfam00681 2 LEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1878-1911 |
6.52e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 47.48 E-value: 6.52e-07
10 20 30
....*....|....*....|....*....|....
gi 1002819366 1878 QKLLEAQAATGGIVDLLSRERYSVHKAMERGLIE 1911
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLID 34
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
651-1154 |
8.82e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.09 E-value: 8.82e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 651 EKAKAALDLERQIQDADRVIRGFEATLVQEApipAEPGALQERVSELQRQRRELLEQqtcVLRLHRALKASEHACAALQN 730
Cdd:COG1196 243 ELEAELEELEAELEELEAELAELEAELEELR---LELEELELELEEAQAEEYELLAE---LARLEQDIARLEERRRELEE 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 731 NFQEFCQDLPRQQRQVRALTDRYHAVGDQLDLREKVVQDAALTYQQFKN-----------CKDNLSSWLEHLPRSQVRPS 799
Cdd:COG1196 317 RLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEalleaeaelaeAEEELEELAEELLEALRAAA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 800 DGPSQIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQE 879
Cdd:COG1196 397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 880 KNLAKAYTEVAAAQQQLLQQLEFARKM---------LEKKELSEDIRRTHDAKQGSESPAQAGRESEALKAQLEEERKRV 950
Cdd:COG1196 477 AALAELLEELAEAAARLLLLLEAEADYegflegvkaALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDD 556
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 951 ARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSR-VKAQVEEEGKRRAGLQADLEVAAQKVVQLESKR 1029
Cdd:COG1196 557 EVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDlVASDLREADARYYVLGDTLLGRTLVAARLEAAL 636
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1030 KTMQPHLLTKEVTQVERDPGLDSQA-----------AQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVV 1098
Cdd:COG1196 637 RRAVTLAGRLREVTLEGEGGSAGGSltggsrrellaALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEER 716
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 1099 KEVVKVEKNLEMVKAAQALRLQME---------EDAARRKQAEEAVAKLQARIEDLERAISSVEP 1154
Cdd:COG1196 717 LEEELEEEALEEQLEAEREELLEElleeeelleEEALEELPEPPDLEELERELERLEREIEALGP 781
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
237-330 |
8.69e-05 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 43.47 E-value: 8.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 237 RQLSALAEQQRRILQQDwsDLMADPAGVRREYEHFK--QHELLSQEQSVNQLEDDGERMVELRHPAVGPIQAHQEALKME 314
Cdd:smart00150 8 DELEAWLEEKEQLLASE--DLGKDLESVEALLKKHEafEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLEELNER 85
|
90
....*....|....*.
gi 1002819366 315 WQNFLNLCICQETQLQ 330
Cdd:smart00150 86 WEELKELAEERRQKLE 101
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1710-1738 |
1.39e-04 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 40.77 E-value: 1.39e-04
10 20
....*....|....*....|....*....
gi 1002819366 1710 ILEPETGKDMSPYEAYKRGIIDRGQYLQL 1738
Cdd:pfam00681 11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1840-1877 |
2.70e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 2.70e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1840 LGLGDDSFPIAGIYDTTTDNKCSIKTAVAKNMLDPITG 1877
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1954-1991 |
6.74e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.00 E-value: 6.74e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1954 LPHLQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELA 1991
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1350-1740 |
1.77e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 43.57 E-value: 1.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1350 KEAERLRQEVREAAQKRR-AAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHS----RLSGSLDEEVGRR 1424
Cdd:pfam15921 88 KDLQRRLNESNELHEKQKfYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTvhelEAAKCLKEDMLED 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1425 RQLELEvqQLRAGVEEQEGLL--------SFQEDRSKKL----------------AVERELRQLTLRIQELEKRPPTVQE 1480
Cdd:pfam15921 168 SNTQIE--QLRKMMLSHEGVLqeirsilvDFEEASGKKIyehdsmstmhfrslgsAISKILRELDTEISYLKGRIFPVED 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1481 KIimeEVVKLEKDPDLE-------KSTEALRWDLDQEKTQVTE----LNRECKNLQVQIDVLQKAKSQEKTIYKevirvq 1549
Cdd:pfam15921 246 QL---EALKSESQNKIElllqqhqDRIEQLISEHEVEITGLTEkassARSQANSIQSQLEIIQEQARNQNSMYM------ 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1550 kdRVLEDERARVWEMLNRERTARQAREeearrlreriDRAETLGRTWSREESELQRAR---DQADQECG----RLQQELR 1622
Cdd:pfam15921 317 --RQLSDLESTVSQLRSELREAKRMYE----------DKIEELEKQLVLANSELTEARterDQFSQESGnlddQLQKLLA 384
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1623 ALERQKQQQTLQLQEESKLLSQKTESE------RQKAAQRGQELSRLEAAILREKDQIY-EKERTLRDLHAK-VSREELS 1694
Cdd:pfam15921 385 DLHKREKELSLEKEQNKRLWDRDTGNSitidhlRRELDDRNMEVQRLEALLKAMKSECQgQMERQMAAIQGKnESLEKVS 464
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1002819366 1695 QETQTRETNLSTKISILEPETGKDMSpYEAYKRGIIDRGQYLQLQE 1740
Cdd:pfam15921 465 SLTAQLESTKEMLRKVVEELTAKKMT-LESSERTVSDLTASLQEKE 509
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1407-1729 |
2.33e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.13 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1407 REEHSRLSGSLDEEVGRRRQLElEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRqltLRIQELEKRPPTVQEKIIMEE 1486
Cdd:TIGR02169 156 RKIIDEIAGVAEFDRKKEKALE-ELEEVEENIERLDLIIDEKRQQLERLRREREKA---ERYQALLKEKREYEGYELLKE 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1487 VVKLEKD-PDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDvlQKAKSQEKTIYKEVIRVQKD-RVLEDERARVWEM 1564
Cdd:TIGR02169 232 KEALERQkEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLE--ELNKKIKDLGEEEQLRVKEKiGELEAEIASLERS 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1565 LNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERqkqqQTLQLQEESKLLSQ 1644
Cdd:TIGR02169 310 IAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA----ELEEVDKEFAETRD 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1645 KTESERQKAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSR-EELSQETQTRETNLSTKISILEPE---TGKDMS 1720
Cdd:TIGR02169 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGiEAKINELEEEKEDKALEIKKQEWKleqLAADLS 465
|
....*....
gi 1002819366 1721 PYEAYKRGI 1729
Cdd:TIGR02169 466 KYEQELYDL 474
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1710-1735 |
5.32e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 36.31 E-value: 5.32e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SH3_10 |
pfam17902 |
SH3 domain; This entry represents an SH3 domain. |
404-468 |
1.59e-26 |
|
SH3 domain; This entry represents an SH3 domain.
Pssm-ID: 407754 Cd Length: 65 Bit Score: 103.88 E-value: 1.59e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 404 PLPQRRNPPQQPLHVDSICDWDSGEVQLLQGERYKLVDNTDPHAWVVQGPGGETKRAPAACFCIP 468
Cdd:pfam17902 1 PLKQRRSPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
930-1473 |
1.13e-17 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 90.00 E-value: 1.13e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 930 AQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRA 1009
Cdd:COG1196 242 EELEAELEELEAELEELEAELAELEAELEELRLELEELELE--LEEAQAEEYELLAELARLEQDIARLEERRRELEERLE 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1010 GLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVErdpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKRE 1089
Cdd:COG1196 320 ELEEELAELEEELEELEEELEELEEELEEAEEELEE----AEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAA 395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1090 VDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEpKVIVKEVKKVEQDPG 1169
Cdd:COG1196 396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE-ALLELLAELLEEAAL 474
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQ---ERVHEIFQVDPETEQEI-TRLKAKLQEMAGK 1245
Cdd:COG1196 475 LEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRglaGAVAVLIGVEAAYEAALeAALAAALQNIVVE 554
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1246 RSGVEKEVEKLLPDLEVLRA---QKPTVEYKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDE 1322
Cdd:COG1196 555 DDEVAAAAIEYLKAAKAGRAtflPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEA 634
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1323 WRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKpEEKVVVQEVVVTQK 1402
Cdd:COG1196 635 ALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEAL-LAEEEEERELAEAE 713
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1403 DPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSkklAVERELRQLTLRIQELEK 1473
Cdd:COG1196 714 EERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE---ELERELERLEREIEALGP 781
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
937-1520 |
1.37e-16 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 86.53 E-value: 1.37e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 937 EALKaQLEEERKRVARVQ---HELEAQRSQL----------LQLRtqrplERLEEKEVVE-FYRDPQLEGSLSRVKAQVE 1002
Cdd:COG1196 176 EAER-KLEATEENLERLEdilGELERQLEPLerqaekaeryRELK-----EELKELEAELlLLKLRELEAELEELEAELE 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1003 EEGKRRAGLQADLEVAAQKVVQLESKRKTMQphLLTKEVTQVERDpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKEL 1082
Cdd:COG1196 250 ELEAELEELEAELAELEAELEELRLELEELE--LELEEAQAEEYE--LLAELARLEQDIARLEERRRELEERLEELEEEL 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1083 LALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVK 1162
Cdd:COG1196 326 AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEL 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1163 KVEQDpGLLQESSRLRSLLEEERTKNATLARELSDLHSKYS-----VVEKQRPKVQLQERVHEIFQVDPETEQEITRLKA 1237
Cdd:COG1196 406 EEAEE-ALLERLERLEEELEELEEALAELEEEEEEEEEALEeaaeeEAELEEEEEALLELLAELLEEAALLEAALAELLE 484
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1238 KLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRHERSPEVLRE--IDRLKAQLNELVNSHGRSQEQLIR 1315
Cdd:COG1196 485 ELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAalEAALAAALQNIVVEDDEVAAAAIE 564
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1316 LQGERDEWRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQ 1395
Cdd:COG1196 565 YLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAG 644
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1396 EVVVTQKD-----PKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERE-LRQLTLRIQ 1469
Cdd:COG1196 645 RLREVTLEgeggsAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEeRLEEELEEE 724
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1470 ELEKRPPTVQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRE 1520
Cdd:COG1196 725 ALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLERE 775
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
934-1667 |
1.94e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 79.72 E-value: 1.94e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 934 RESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQA 1013
Cdd:TIGR02168 260 AELQELEEKLEELRLEVSELEEEIEELQKELYALANE--ISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAE 337
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1014 DLEVAAQKVVQLESKRKTMQPHLLTKEVTQVERDPGLDSQAAQLRIQ---IQQLRGEDAVISARLEGLKKELLALEKREV 1090
Cdd:TIGR02168 338 ELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLrskVAQLELQIASLNNEIERLEARLERLEDRRE 417
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1091 DVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDA-ARRKQAEEAVAKLQARIEDLERAISSVEPKvivkevkkveqdpg 1169
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAELKELQAELEELEEELEELqEELERLEEALEELREELEEAEQALDAAERE-------------- 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSlLEEERTKNATLARELSdlhskySVVEKQRPKVQLQERVHEIFQVDPETEQEI-TRLKAKLQEMAGKR-S 1247
Cdd:TIGR02168 484 LAQLQARLDS-LERLQENLEGFSEGVK------ALLKNQSGLSGILGVLSELISVDEGYEAAIeAALGGRLQAVVVENlN 556
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1248 GVEKEVEKLLPD---------LEVLRAQKPTVEYKEVTQEV-------VRHERSPEVLRE-----------IDRLKAQLN 1300
Cdd:TIGR02168 557 AAKKAIAFLKQNelgrvtflpLDSIKGTEIQGNDREILKNIegflgvaKDLVKFDPKLRKalsyllggvlvVDDLDNALE 636
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1301 ELVNSHGRsqEQLIRLQGERDEWRRERAKVETKTVSKEVVRhekdpvlEKEAERLRQEVREAAQKRRAAEDAVYELQSKr 1380
Cdd:TIGR02168 637 LAKKLRPG--YRIVTLDGDLVRPGGVITGGSAKTNSSILER-------RREIEELEEKIEELEEKIAELEKALAELRKE- 706
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1381 lLLERRKPEEKVVVQEVVVTQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLsfQEDRSKKLAVERE 1460
Cdd:TIGR02168 707 -LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERL--EEAEEELAEAEAE 783
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1461 LRQLTLRIQELEKRPPTVQEKI-IMEEVVKLEKDP--DLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKA-KS 1536
Cdd:TIGR02168 784 IEELEAQIEQLKEELKALREALdELRAELTLLNEEaaNLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEiEE 863
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1537 QEKTIYKEVIRVQKdrvLEDERARVWEMLNRERTARQAREEEARRLRERIDRAEtlgrtwsREESELQRARDQADQECGR 1616
Cdd:TIGR02168 864 LEELIEELESELEA---LLNERASLEEALALLRSELEELSEELRELESKRSELR-------RELEELREKLAQLELRLEG 933
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1617 LQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRgqeLSRLEAAI 1667
Cdd:TIGR02168 934 LEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRR---LKRLENKI 981
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
930-1724 |
2.10e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 76.26 E-value: 2.10e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 930 AQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRP-----LERLEEKEVVEFY----RDPQLEGSLSRVKAQ 1000
Cdd:TIGR02169 166 AEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREkaeryQALLKEKREYEGYellkEKEALERQKEAIERQ 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1001 VEEEGKRRAGLQADLEVAAQKVVQLESKRKTmqphlLTKEVtqverDPGLDSQAAQLRIQIQQLRGEDAVISARLEglkk 1080
Cdd:TIGR02169 246 LASLEEELEKLTEEISELEKRLEEIEQLLEE-----LNKKI-----KDLGEEEQLRVKEKIGELEAEIASLERSIA---- 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1081 ellalekrevdvkekvvvkevvkveknlemvkaaqALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKE 1160
Cdd:TIGR02169 312 -----------------------------------EKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1161 VKKVEqdpgLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVV-----EKQRPKVQLQERVHEIFQVDPETEQEITRL 1235
Cdd:TIGR02169 357 EEYAE----LKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLkreinELKRELDRLQEELQRLSEELADLNAAIAGI 432
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1236 KAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEYK----EVTQEVVRHERSpEVLREIDRLKAQLNELVNS--HGRS 1309
Cdd:TIGR02169 433 EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQElydlKEEYDRVEKELS-KLQRELAEAEAQARASEERvrGGRA 511
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1310 QEQLIR--------LQGERDEWRRERAK-VETKTVSK-EVVRHEKDPVLEKEAERLRQEvreaaQKRRAAEDAVYELQSK 1379
Cdd:TIGR02169 512 VEEVLKasiqgvhgTVAQLGSVGERYATaIEVAAGNRlNNVVVEDDAVAKEAIELLKRR-----KAGRATFLPLNKMRDE 586
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1380 RLLLERRKPEEKVVVQEVVVTQkDPKLREEHSRLSGSL----DEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKL 1455
Cdd:TIGR02169 587 RRDLSILSEDGVIGFAVDLVEF-DPKYEPAFKYVFGDTlvveDIEAARRLMGKYRMVTLEGELFEKSGAMTGGSRAPRGG 665
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1456 AV-----ERELRQLTLRIQELEKrpptvQEKIIMEEVVKLEKD--------PDLEKSTEALRWDLDQEKTQVTELNRECK 1522
Cdd:TIGR02169 666 ILfsrsePAELQRLRERLEGLKR-----ELSSLQSELRRIENRldelsqelSDASRKIGEIEKEIEQLEQEEEKLKERLE 740
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1523 NLQVQIDVLQKAKSQEKTIYKEVIRV--QKDRVLEDERARVWEMLNRERTAR-QAREEEARRLRERIDRAETLGRTWSRE 1599
Cdd:TIGR02169 741 ELEEDLSSLEQEIENVKSELKELEARieELEEDLHKLEEALNDLEARLSHSRiPEIQAELSKLEEEVSRIEARLREIEQK 820
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1600 ESELQRARDQADQECGRLQQELRALE-RQKQQQTLQLQEESKLLSQKTESERQKAAQR--GQELSRLEAAILREKDQIYE 1676
Cdd:TIGR02169 821 LNRLTLEKEYLEKEIQELQEQRIDLKeQIKSIEKEIENLNGKKEELEEELEELEAALRdlESRLGDLKKERDELEAQLRE 900
|
810 820 830 840
....*....|....*....|....*....|....*....|....*....
gi 1002819366 1677 KERTLRDLHAKVSREELSQ-ETQTRETNLSTKISILEPETGKDMSPYEA 1724
Cdd:TIGR02169 901 LERKIEELEAQIEKKRKRLsELKAKLEALEEELSEIEDPKGEDEEIPEE 949
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1178-1704 |
8.87e-13 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.82 E-value: 8.87e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1178 RSLLEEERTKNATL-ARELSDLHSKYSVVEKQRPKVQLQERvheifqvdpETEQEITRLKAKLQEMAGKRSGVEKEVEKL 1256
Cdd:COG1196 216 RELKEELKELEAELlLLKLRELEAELEELEAELEELEAELE---------ELEAELAELEAELEELRLELEELELELEEA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1257 LPDLEVLRAQKPTVEykevTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVS 1336
Cdd:COG1196 287 QAEEYELLAELARLE----QDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1337 KEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKP-EEKVVVQEVVVTQKDPKLREEHSRLSG 1415
Cdd:COG1196 363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLErLERLEEELEELEEALAELEEEEEEEEE 442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1416 SLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKLEKDPD 1495
Cdd:COG1196 443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGL 522
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1496 LEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDvlQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAR 1575
Cdd:COG1196 523 AGAVAVLIGVEAAYEAALEAALAAALQNIVVEDD--EVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAA 600
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1576 EEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQ 1655
Cdd:COG1196 601 VDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAE 680
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 1002819366 1656 RGQELSRLEAAILREKDQIYEKERTLRDLHAKVSREELSQETQTRETNL 1704
Cdd:COG1196 681 LEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQ 729
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
968-1712 |
1.30e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.55 E-value: 1.30e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 968 RTQRPLERLEEkeVVEfyrdpQLEGSLSRVKAQVEEEGK----RRAGLQADLEVAAQKVVQLESKRKTMQpHLLTKEVTQ 1043
Cdd:TIGR02168 183 RTRENLDRLED--ILN-----ELERQLKSLERQAEKAERykelKAELRELELALLVLRLEELREELEELQ-EELKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1044 VERdpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLAL--EKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQM 1121
Cdd:TIGR02168 255 LEE---LTAELQELEEKLEELRLEVSELEEEIEELQKELYALanEISRLEQQKQILRERLANLERQLEELEAQLEELESK 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1122 -EEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLHs 1200
Cdd:TIGR02168 332 lDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLE- 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1201 kysvvekqrpkvQLQERVHEIFQVDPETEQEITRlkAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPTVEykevtqevv 1280
Cdd:TIGR02168 411 ------------RLEDRRERLQQEIEELLKKLEE--AELKELQAELEELEEELEELQEELERLEEALEELR--------- 467
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1281 rhERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKT----VSKEVVRHEKDPVLEKEA---E 1353
Cdd:TIGR02168 468 --EELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSgilgVLSELISVDEGYEAAIEAalgG 545
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1354 RLRQEVREAAQkrrAAEDAVYELQSKRL-------------------LLERRKPEEKVVVQEVVVTQKDPKLRE------ 1408
Cdd:TIGR02168 546 RLQAVVVENLN---AAKKAIAFLKQNELgrvtflpldsikgteiqgnDREILKNIEGFLGVAKDLVKFDPKLRKalsyll 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1409 EHSRLSGSLDEEVGRRRQLE-------LEVQQLRAGveeqeGLLSFQEDR--SKKLAVERELRQLTLRIQELEkrpptVQ 1479
Cdd:TIGR02168 623 GGVLVVDDLDNALELAKKLRpgyrivtLDGDLVRPG-----GVITGGSAKtnSSILERRREIEELEEKIEELE-----EK 692
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1480 EKIIMEEVVklekdpDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQidvLQKAKSQEKTIYKEVIRVQKDRV-LEDER 1558
Cdd:TIGR02168 693 IAELEKALA------ELRKELEELEEELEQLRKELEELSRQISALRKD---LARLEAEVEQLEERIAQLSKELTeLEAEI 763
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1559 ARVWEMLNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEE 1638
Cdd:TIGR02168 764 EELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDL 843
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1639 SKLLSQKTESERQKAAQRGQ-----------------ELSRLEAAILREKDQIYEKERTLRDLHAKVSR-EELSQETQTR 1700
Cdd:TIGR02168 844 EEQIEELSEDIESLAAEIEEleelieeleseleallnERASLEEALALLRSELEELSEELRELESKRSElRRELEELREK 923
|
810
....*....|..
gi 1002819366 1701 ETNLSTKISILE 1712
Cdd:TIGR02168 924 LAQLELRLEGLE 935
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
901-1684 |
1.45e-12 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 73.64 E-value: 1.45e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 901 EFARKMLEKKELSEDIRRTHDAKQGSEspaqAGRESEALKAqleEERKRVARVQHELEAQRSQllQLRTQRPLERLEE-K 979
Cdd:PTZ00121 1112 EEARKAEEAKKKAEDARKAEEARKAED----ARKAEEARKA---EDAKRVEIARKAEDARKAE--EARKAEDAKKAEAaR 1182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 980 EVVEFYRDPQLEgslsrvkaQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQphllTKEVTQVERDPGLDSQAAQLRI 1059
Cdd:PTZ00121 1183 KAEEVRKAEELR--------KAEDARKAEAARKAEEERKAEEARKAEDAKKAEA----VKKAEEAKKDAEEAKKAEEERN 1250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1060 QIQQLRGEDAVIsARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEdaarRKQAEEAVAKLQ 1139
Cdd:PTZ00121 1251 NEEIRKFEEARM-AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE----AKKADEAKKKAE 1325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1140 ARIEDLERAISSVEPKvivkevkkveqdpgllQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKvQLQERVH 1219
Cdd:PTZ00121 1326 EAKKKADAAKKKAEEA----------------KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD-AAKKKAE 1388
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1220 EIFQVDpeteqeitRLKAKLQEMAGKRSGVEKEVEkllpdlevlrAQKPTVEYKEVTQEVVRHE---RSPEVLREIDRLK 1296
Cdd:PTZ00121 1389 EKKKAD--------EAKKKAEEDKKKADELKKAAA----------AKKKADEAKKKAEEKKKADeakKKAEEAKKADEAK 1450
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1297 AQLNElvnshGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEKdpvlekEAERLRQEVREAAQKRRAAEDAvYEL 1376
Cdd:PTZ00121 1451 KKAEE-----AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE------EAKKKADEAKKAAEAKKKADEA-KKA 1518
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1377 QSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLA 1456
Cdd:PTZ00121 1519 EEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVM 1598
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1457 VERElRQLTLRIQELEKRPptvQEKIIMEEVVKLEkdpDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKS 1536
Cdd:PTZ00121 1599 KLYE-EEKKMKAEEAKKAE---EAKIKAEELKKAE---EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1537 QEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRlreridRAETLGRTWSREESELQRARDQADQEcGR 1616
Cdd:PTZ00121 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKK------KAEELKKAEEENKIKAEEAKKEAEED-KK 1744
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1617 LQQELRALErqkqqqtlqlqEESKLLSQKTESERQKAAQRGQELSRLEAAILREKDQI--YEKERTLRDL 1684
Cdd:PTZ00121 1745 KAEEAKKDE-----------EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKrrMEVDKKIKDI 1803
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
934-1595 |
8.15e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 70.74 E-value: 8.15e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 934 RESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPlerleekevvefyrdpQLEGSLSRVKAQVEEEGKRRAGLQA 1013
Cdd:COG1196 225 LEAELLLLKLRELEAELEELEAELEELEAELEELEAELA----------------ELEAELEELRLELEELELELEEAQA 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1014 DLEVAAQKVVQLESKRKTmqphlltkevtQVERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVK 1093
Cdd:COG1196 289 EEYELLAELARLEQDIAR-----------LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAE 357
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1094 EKVVVKEVVKveknLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSvepkvivkevkkveqdpgLLQE 1173
Cdd:COG1196 358 AELAEAEEAL----LEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEA------------------LLER 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1174 SSRLRSLLEEERTKNATLARELSDLHSKYS-----VVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSG 1248
Cdd:COG1196 416 LERLEEELEELEEALAELEEEEEEEEEALEeaaeeEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLL 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1249 VEKEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRHERSPEVLRE--IDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRE 1326
Cdd:COG1196 496 LLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAalEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAT 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1327 RAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKD--- 1403
Cdd:COG1196 576 FLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEgeg 655
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1404 --PKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERE-LRQLTLRIQELEKRPPTVQE 1480
Cdd:COG1196 656 gsAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEeRLEEELEEEALEEQLEAERE 735
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1481 KIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECKNL-QVQIDVLQKAKSQEKTiyKEVIRVQKDrVLEDERA 1559
Cdd:COG1196 736 ELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEALgPVNLLAIEEYEELEER--YDFLSEQRE-DLEEARE 812
|
650 660 670
....*....|....*....|....*....|....*.
gi 1002819366 1560 RvwemlnrertarqareeearrLRERIDRAETLGRT 1595
Cdd:COG1196 813 T---------------------LEEAIEEIDRETRE 827
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
939-1710 |
1.91e-10 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 66.53 E-value: 1.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 939 LKAQLEEERKRVARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVA 1018
Cdd:pfam02463 167 LKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1019 AQKVVQLESKRKTMQPHllTKEVTQVERDPGLDSQaaqlriQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVV 1098
Cdd:pfam02463 247 RDEQEEIESSKQEIEKE--EEKLAQVLKENKEEEK------EKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1099 --KEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDPGLLQESSR 1176
Cdd:pfam02463 319 seKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELE 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1177 LRSLLEEERTKNATLAR----ELSDLHSKYSVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQemagkrsgvEKE 1252
Cdd:pfam02463 399 LKSEEEKEAQLLLELARqledLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELE---------LKK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1253 VEKLLPDLEVLraqkptveyKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVET 1332
Cdd:pfam02463 470 SEDLLKETQLV---------KLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVEN 540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1333 KTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHS- 1411
Cdd:pfam02463 541 YKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDk 620
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1412 -RLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKL 1490
Cdd:pfam02463 621 rAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLE 700
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1491 EKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKEvIRVQKDRVLEDERARVWEMLNRERT 1570
Cdd:pfam02463 701 IKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEK-SRLKKEEKEEEKSELSLKEKELAEE 779
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1571 ARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKtESER 1650
Cdd:pfam02463 780 REKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEE-ELER 858
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1651 QKAAQRGQELSRLEAAILREKDQIYEKERTLrDLHAKVSREELSQETQTRETNLSTKISI 1710
Cdd:pfam02463 859 LEEEITKEELLQELLLKEEELEEQKLKDELE-SKEEKEKEEKKELEEESQKLNLLEEKEN 917
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1348-1701 |
3.96e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 65.34 E-value: 3.96e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1348 LEKEAERLRQEvREAAQKRRAAEDAVYELQSKRLLLERRKpeekvvvqevvvtqkdpkLREEHSRLSGSLDEEVGRRRQL 1427
Cdd:COG1196 198 LERQLEPLERQ-AEKAERYRELKEELKELEAELLLLKLRE------------------LEAELEELEAELEELEAELEEL 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1428 ELEVQQLRAGVEEQegllsfqedRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEvvklEKDPDLEKSTEALRWDL 1507
Cdd:COG1196 259 EAELAELEAELEEL---------RLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEEL 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1508 DQEKTQVTELNRECKNLQVQIDVLQKAKSQEKtiykEVIRVQKDRVLEDERARvwemLNRERTARQAREEEARRLRERID 1587
Cdd:COG1196 326 AELEEELEELEEELEELEEELEEAEEELEEAE----AELAEAEEALLEAEAEL----AEAEEELEELAEELLEALRAAAE 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1588 RAEtlgrtwsrEESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRGQELSRLEAAI 1667
Cdd:COG1196 398 LAA--------QLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELL 469
|
330 340 350
....*....|....*....|....*....|....
gi 1002819366 1668 LREKDQIYEKERTLRDLHAKVSREELSQETQTRE 1701
Cdd:COG1196 470 EEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1880-1918 |
5.73e-10 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 56.18 E-value: 5.73e-10
10 20 30
....*....|....*....|....*....|....*....
gi 1002819366 1880 LLEAQAATGGIVDLLSRERYSVHKAMERGLIENTSTQRL 1918
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
804-1361 |
8.41e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 64.19 E-value: 8.41e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 804 QIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQEKNLA 883
Cdd:COG1196 240 ELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLE 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 884 KAYTEVAAAQQQLLQQLEfarKMLEKKELSEDIRRTHDAKQGSESpAQAGRESEALKAQLEEERKRVARVQHELEAQRSQ 963
Cdd:COG1196 320 ELEEELAELEEELEELEE---ELEELEEELEEAEEELEEAEAELA-EAEEALLEAEAELAEAEEELEELAEELLEALRAA 395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 964 LLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQ 1043
Cdd:COG1196 396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1044 VERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEE 1123
Cdd:COG1196 476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVED 555
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1124 DAARRKQAEEAVAKLQARIEDLERAissvepkviVKEVKKVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLHSKYS 1203
Cdd:COG1196 556 DEVAAAAIEYLKAAKAGRATFLPLD---------KIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRT 626
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1204 VVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKptvEYKEVTQEVVRHE 1283
Cdd:COG1196 627 LVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAE---EELELEEALLAEE 703
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002819366 1284 rspEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVRE 1361
Cdd:COG1196 704 ---EEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1060-1567 |
6.44e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 61.23 E-value: 6.44e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1060 QIQQLRGEDAVISARLEGLKKELLALEKREVDVKekvvvkevvkveknlEMVKAAQALRLQMEEDAARRKQAEEAVAKLQ 1139
Cdd:PRK03918 208 EINEISSELPELREELEKLEKEVKELEELKEEIE---------------ELEKELESLEGSKRKLEEKIRELEERIEELK 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1140 ARIEDLE---RAISSVEPKVIVKEVKKVEQDPgLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQE 1216
Cdd:PRK03918 273 KEIEELEekvKELKELKEKAEEYIKLSEFYEE-YLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELE 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1217 RVHEIFQVDPETEQEITRLKAKLQEMAGKRSGveKEVEKLLPDLEVLRAQKPTVEykevtqevvrherspevlREIDRLK 1296
Cdd:PRK03918 352 KRLEELEERHELYEEAKAKKEELERLKKRLTG--LTPEKLEKELEELEKAKEEIE------------------EEISKIT 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1297 AQLNELVNSHGRSQEQLIRLQGerdewrrerAKVETKTVSKEVVRHEKDPVLEK---EAERLRQEVREAAQKRRAAEDAV 1373
Cdd:PRK03918 412 ARIGELKKEIKELKKAIEELKK---------AKGKCPVCGRELTEEHRKELLEEytaELKRIEKELKEIEEKERKLRKEL 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1374 YELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLR--------EEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLL 1445
Cdd:PRK03918 483 RELEKVLKKESELIKLKELAEQLKELEEKLKKYNleelekkaEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1446 SFQEDRSKKLA-VERELR--------QLTLRIQELEKrpptvqekiIMEEVVKLEKDP----DLEKSTEALRWDLDQEKT 1512
Cdd:PRK03918 563 KKLDELEEELAeLLKELEelgfesveELEERLKELEP---------FYNEYLELKDAEkeleREEKELKKLEEELDKAFE 633
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 1513 QVTELNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNR 1567
Cdd:PRK03918 634 ELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
930-1302 |
2.17e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.69 E-value: 2.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 930 AQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVvefyrdpQLEGSLSRVKAQVEEEGKRRA 1009
Cdd:TIGR02168 680 EELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKE--LEELSRQIS-------ALRKDLARLEAEVEQLEERIA 750
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1010 GLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVErdpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKRE 1089
Cdd:TIGR02168 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE----LEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL 826
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1090 VdvkekvvvkevvkveknlEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEpkvivkevkkvEQDPG 1169
Cdd:TIGR02168 827 E------------------SLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELE-----------SELEA 877
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSLLEEERTKNATLARELSDLHSKysVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGkrsgv 1249
Cdd:TIGR02168 878 LLNERASLEEALALLRSELEELSEELRELESK--RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYS----- 950
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1002819366 1250 ekevekllpdlevlraqkptVEYKEVTQEVVRHERSPEVLR-EIDRLKAQLNEL 1302
Cdd:TIGR02168 951 --------------------LTLEEAEALENKIEDDEEEARrRLKRLENKIKEL 984
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
581-1361 |
2.64e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.30 E-value: 2.64e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 581 LEDLEGRIHSHEGTAQRLQSLGTEKETAQKECEAFLStrpvgpaalQLPVALNSVKNKFSDVQvlcSLYGEKAKAALDLE 660
Cdd:TIGR02168 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLE---------ELRLEVSELEEEIEELQ---KELYALANEISRLE 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 661 RQIQDADRVIRGFEATLVQeapipaepgaLQERVSELQRQRRELLEQqtcvlrlhraLKASEHACAALQNNFQEFCQDLP 740
Cdd:TIGR02168 302 QQKQILRERLANLERQLEE----------LEAQLEELESKLDELAEE----------LAELEEKLEELKEELESLEAELE 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 741 RQQRQVRALTDRYHAVGDQLdlrEKVVQDAALTYQQFKNCKDNLSSWLEHLPRSQVRpsdgpsqiayklqaQKRLTQEIQ 820
Cdd:TIGR02168 362 ELEAELEELESRLEELEEQL---ETLRSKVAQLELQIASLNNEIERLEARLERLEDR--------------RERLQQEIE 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 821 SrerdratasHLSQALQAALQDYELQADTyrcsLEPTLAVSAPKRPRVAPLQESIQAQEKNLAKAYTEVAAAQQQLLQQL 900
Cdd:TIGR02168 425 E---------LLKKLEEAELKELQAELEE----LEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARL 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 901 EFARKMLEKKE-LSEDIRRThdAKQGSESPAQAGRESEALKAQLEEER---------------KRVARVQHELEAQRSQL 964
Cdd:TIGR02168 492 DSLERLQENLEgFSEGVKAL--LKNQSGLSGILGVLSELISVDEGYEAaieaalggrlqavvvENLNAAKKAIAFLKQNE 569
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 965 LQLRTQRPLERLE--EKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQA---------DLEVAAQKVVQLESKRK--T 1031
Cdd:TIGR02168 570 LGRVTFLPLDSIKgtEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYllggvlvvdDLDNALELAKKLRPGYRivT 649
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1032 MQPHLLTKEVTQVERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMV 1111
Cdd:TIGR02168 650 LDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS 729
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1112 ---KAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTKN 1188
Cdd:TIGR02168 730 alrKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1189 ATLARELSDLHSKYSVVEkqrpkvQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKP 1268
Cdd:TIGR02168 810 AELTLLNEEAANLRERLE------SLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1269 TVEYKEVTQEVVRHERSP---EVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWR---RERAKVETKTVSKEVVRH 1342
Cdd:TIGR02168 884 SLEEALALLRSELEELSEelrELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQerlSEEYSLTLEEAEALENKI 963
|
810 820
....*....|....*....|
gi 1002819366 1343 EKDP-VLEKEAERLRQEVRE 1361
Cdd:TIGR02168 964 EDDEeEARRRLKRLENKIKE 983
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1916-1953 |
8.42e-08 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 49.79 E-value: 8.42e-08
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1916 QRLLNAQKAFTGIEDPVTKKRLSVGEAVQKGWMPRESV 1953
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
228-415 |
9.81e-08 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 54.76 E-value: 9.81e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 228 LYTHLQGCTRQLSALAEQQRRILQQDwsDLMADPAGVRREYEHFK--QHELLSQEQSVNQLEDDGERMVELRHPAVGPIQ 305
Cdd:cd00176 1 KLQQFLRDADELEAWLSEKEELLSST--DYGDDLESVEALLKKHEalEAELAAHEERVEALNELGEQLIEEGHPDAEEIQ 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 306 AHQEALKMEWQNFLNLCICQETQLQHVEDYRRFQEEADSVSQTLAKLNSNLDakySPAPGGPPGAPTELLQQLEAEEKRL 385
Cdd:cd00176 79 ERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALA---SEDLGKDLESVEELLKKHKELEEEL 155
|
170 180 190
....*....|....*....|....*....|
gi 1002819366 386 avtERATGDLQRRSRDVAPLPQRRNPPQQP 415
Cdd:cd00176 156 ---EAHEPRLKSLNELAEELLEEGHPDADE 182
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1957-1994 |
1.43e-07 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 49.25 E-value: 1.43e-07
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1957 LQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELAQLL 1994
Cdd:pfam00681 2 LEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1170-1433 |
1.47e-07 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 56.67 E-value: 1.47e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1170 LLQESSRLRSLLEEERTKNATLARELSDLHSKYSVV---EKQRPKVQLQERVHEIFQVdpeTEQEITRLKAKLQEMAGKR 1246
Cdd:pfam17380 308 KAREVERRRKLEEAEKARQAEMDRQAAIYAEQERMAmerERELERIRQEERKRELERI---RQEEIAMEISRMRELERLQ 384
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1247 SGVEKEVEKLLPDLEVLRAQK-------PTVEYKEVTQEVVRHERSPEVLREIDRLKAQ----LNELVNSHGRSQEQLIR 1315
Cdd:pfam17380 385 MERQQKNERVRQELEAARKVKileeerqRKIQQQKVEMEQIRAEQEEARQREVRRLEEErareMERVRLEEQERQQQVER 464
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1316 LQGERDEWRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAE-------DAVYELQSKRLLLERRKP 1388
Cdd:pfam17380 465 LRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRKLLEkemeerqKAIYEEERRREAEEERRK 544
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1002819366 1389 EEKVVVQEVVVTQKDpKLREEHSRLSGSLDEEVGRRRQLELEVQQ 1433
Cdd:pfam17380 545 QQEMEERRRIQEQMR-KATEERSRLEAMEREREMMRQIVESEKAR 588
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
936-1492 |
2.58e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 56.23 E-value: 2.58e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 936 SEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQ-----RPLERLEE-KEVVEFYR--DPQLEGSLSRVKAQVEEEGKR 1007
Cdd:PRK03918 188 TENIEELIKEKEKELEEVLREINEISSELPELREElekleKEVKELEElKEEIEELEkeLESLEGSKRKLEEKIRELEER 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1008 RAGLQADLEVAAQKVVQLESKRKTMQPHL----LTKEVTQVERDpgLDSQAAQLRIQIQQLRGEDAVIS---ARLEGLKK 1080
Cdd:PRK03918 268 IEELKKEIEELEEKVKELKELKEKAEEYIklseFYEEYLDELRE--IEKRLSRLEEEINGIEERIKELEekeERLEELKK 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1081 ELLALEKREVDVKEKVVVkevvkveknLEMVKAaqalrLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKE 1160
Cdd:PRK03918 346 KLKELEKRLEELEERHEL---------YEEAKA-----KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKIT 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1161 VKKVeqdpGLLQESSRLRSLLEE---ERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKA 1237
Cdd:PRK03918 412 ARIG----ELKKEIKELKKAIEElkkAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEK 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1238 KLQ---EMAGKRSGVE--KEVEKLLPDLEVLRAQKPTVEYKEVtqevvrHERSPEVLREIDRLKAQLNELVNSHGRSQEq 1312
Cdd:PRK03918 488 VLKkesELIKLKELAEqlKELEEKLKKYNLEELEKKAEEYEKL------KEKLIKLKGEIKSLKKELEKLEELKKKLAE- 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1313 lirLQGERDEWRRERAKVETKTvskevvrHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKrllLERRKpeekv 1392
Cdd:PRK03918 561 ---LEKKLDELEEELAELLKEL-------EELGFESVEELEERLKELEPFYNEYLELKDAEKELERE---EKELK----- 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1393 vvqevvvtqkdpKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEgllsFQEDRSKKLAVERELRQLTLRIQELE 1472
Cdd:PRK03918 623 ------------KLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE----YEELREEYLELSRELAGLRAELEELE 686
|
570 580
....*....|....*....|..
gi 1002819366 1473 KRPPTVQEKI--IMEEVVKLEK 1492
Cdd:PRK03918 687 KRREEIKKTLekLKEELEEREK 708
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1109-1701 |
5.98e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 55.15 E-value: 5.98e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1109 EMVKAAQALRLQMEEDAARRKQAEEAVAKLQAR-IEDLERAIssvepKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTK 1187
Cdd:PTZ00121 1144 EARKAEDAKRVEIARKAEDARKAEEARKAEDAKkAEAARKAE-----EVRKAEELRKAEDARKAEAARKAEEERKAEEAR 1218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1188 NATLARELSDLHSkysvVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQK 1267
Cdd:PTZ00121 1219 KAEDAKKAEAVKK----AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADE 1294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1268 PtveykevtqevvrheRSPEVLREIDRLKAQLNElvnshGRSQEQLiRLQGERDEWRRERAKVETKTVSKevvrheKDPV 1347
Cdd:PTZ00121 1295 A---------------KKAEEKKKADEAKKKAEE-----AKKADEA-KKKAEEAKKKADAAKKKAEEAKK------AAEA 1347
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1348 LEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEhsrlsgsldeeVGRRRQL 1427
Cdd:PTZ00121 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE-----------LKKAAAA 1416
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1428 ELEVQQLRAGVEEQEgllsfQEDRSKKLAVERElrqltlRIQELEKRpptVQEKIIMEEVVKleKDPDLEKSTEALRwdL 1507
Cdd:PTZ00121 1417 KKKADEAKKKAEEKK-----KADEAKKKAEEAK------KADEAKKK---AEEAKKAEEAKK--KAEEAKKADEAKK--K 1478
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1508 DQEKTQVTELNRECKNLQVQIDVLQKaKSQEKTIYKEVIRVQKDRVLEDERARvwemlNRERTARQAREEEARRLRERID 1587
Cdd:PTZ00121 1479 AEEAKKADEAKKKAEEAKKKADEAKK-AAEAKKKADEAKKAEEAKKADEAKKA-----EEAKKADEAKKAEEKKKADELK 1552
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1588 RAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLL----SQKTESERQKAAQ-------- 1655
Cdd:PTZ00121 1553 KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMkaeeAKKAEEAKIKAEElkkaeeek 1632
|
570 580 590 600
....*....|....*....|....*....|....*....|....*..
gi 1002819366 1656 -RGQELSRLEAAILREKDQIYEKERTLRDLHAKVSREELSQETQTRE 1701
Cdd:PTZ00121 1633 kKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1878-1911 |
6.52e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 47.48 E-value: 6.52e-07
10 20 30
....*....|....*....|....*....|....
gi 1002819366 1878 QKLLEAQAATGGIVDLLSRERYSVHKAMERGLIE 1911
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLID 34
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
872-1424 |
1.02e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 54.38 E-value: 1.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 872 QESIQAQEKNLAKAYTEVAAAQQQLLQQLEFARKMLEKKElSEDIRRTHDAKQGSES-PAQAGRESEALKAQLEEERK-R 949
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK-ADELKKAEEKKKADEAkKAEEKKKADEAKKKAEEAKKaD 1318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 950 VARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSRVKA--QVEEEGKRRAglqADLEVAAQKVVQLES 1027
Cdd:PTZ00121 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAaeKKKEEAKKKA---DAAKKKAEEKKKADE 1395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1028 KRKTMQPHllTKEVTQVERDPGLDSQAAQLRIQIQQLRGED-----AVISARLEGLKKELLALEKREVDVKEKVVVKEVV 1102
Cdd:PTZ00121 1396 AKKKAEED--KKKADELKKAAAAKKKADEAKKKAEEKKKADeakkkAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1103 KVEKNLEMVKAAQALRLQMEEdaaRRKQAEEAVAKLQA--RIEDLERAISSVEPKVIVKEVKKVEQDpgllqessRLRSL 1180
Cdd:PTZ00121 1474 EAKKKAEEAKKADEAKKKAEE---AKKKADEAKKAAEAkkKADEAKKAEEAKKADEAKKAEEAKKAD--------EAKKA 1542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1181 LEEERTKNATLARELSDLHSKYSVVEKQRP---KVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLL 1257
Cdd:PTZ00121 1543 EEKKKADELKKAEELKKAEEKKKAEEAKKAeedKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKA 1622
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1258 PDLEVLRAQKPTVEY--KEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRlqGERDEWRRERA---KVET 1332
Cdd:PTZ00121 1623 EELKKAEEEKKKVEQlkKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKK--AEEDEKKAAEAlkkEAEE 1700
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1333 KTVSKEVVRHEKDPVleKEAERLRQEvrEAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHSR 1412
Cdd:PTZ00121 1701 AKKAEELKKKEAEEK--KKAEELKKA--EEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKE 1776
|
570
....*....|..
gi 1002819366 1413 LSGSLDEEVGRR 1424
Cdd:PTZ00121 1777 KEAVIEEELDEE 1788
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1213-1688 |
1.44e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 53.53 E-value: 1.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1213 QLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEV---EKLLPDLEVLRAQKPTVEYKEVTQEvvrhERSPEVL 1289
Cdd:PRK03918 190 NIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVkelEELKEEIEELEKELESLEGSKRKLE----EKIRELE 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1290 REIDRLKAQLNELVNSHGRSQE------QLIRLQGERDEWRRERAKVEtKTVSKEVVRHEKDPVLEKEAERLRQEVREAA 1363
Cdd:PRK03918 266 ERIEELKKEIEELEEKVKELKElkekaeEYIKLSEFYEEYLDELREIE-KRLSRLEEEINGIEERIKELEEKEERLEELK 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1364 QKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDP---------KLREEHSRLSGSLDEEVGRRRQLELEVQQL 1434
Cdd:PRK03918 345 KKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTpeklekeleELEKAKEEIEEEISKITARIGELKKEIKEL 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1435 RAGVEEQE---------GLLSFQEDRSKKLA--------VERELRQLTLRIQELEKRPPTVQEKIIME-EVVKLEKDPD- 1495
Cdd:PRK03918 425 KKAIEELKkakgkcpvcGRELTEEHRKELLEeytaelkrIEKELKEIEEKERKLRKELRELEKVLKKEsELIKLKELAEq 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1496 ---LEKSTEALRW-DLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKD-RVLEDERARVwemlnrERT 1570
Cdd:PRK03918 505 lkeLEEKLKKYNLeELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKlDELEEELAEL------LKE 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1571 ARQAREEEARRLRERIDRAETLGRTWSR---EESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTE 1647
Cdd:PRK03918 579 LEELGFESVEELEERLKELEPFYNEYLElkdAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSE 658
|
490 500 510 520
....*....|....*....|....*....|....*....|....*
gi 1002819366 1648 SERQKAAQR----GQELSRLEAAILREKDQIYEKERTLRDLHAKV 1688
Cdd:PRK03918 659 EEYEELREEylelSRELAGLRAELEELEKRREEIKKTLEKLKEEL 703
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1175-1702 |
1.56e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 53.58 E-value: 1.56e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1175 SRLRSLLEEERTKNATLARELSDLHSKYSVV--EKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKE 1252
Cdd:pfam15921 299 SQLEIIQEQARNQNSMYMRQLSDLESTVSQLrsELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQ 378
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1253 VEKLLPDLEvlRAQKPTVEYKEVTQEVVRHERSPEVlrEIDRLKAQLNELVNSHGRSQEQLIRLQGE-RDEWRRERAKVE 1331
Cdd:pfam15921 379 LQKLLADLH--KREKELSLEKEQNKRLWDRDTGNSI--TIDHLRRELDDRNMEVQRLEALLKAMKSEcQGQMERQMAAIQ 454
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1332 TKTVSKEVVRhEKDPVLEKEAERLRQevreaaqkrraaedAVYELQSKRLLLERRKpeekvvvqeVVVTQKDPKLREEHS 1411
Cdd:pfam15921 455 GKNESLEKVS-SLTAQLESTKEMLRK--------------VVEELTAKKMTLESSE---------RTVSDLTASLQEKER 510
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1412 RLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQ-EDRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKL 1490
Cdd:pfam15921 511 AIEATNAEITKLRSRVDLKLQELQHLKNEGDHLRNVQtECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQV 590
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1491 EKdPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDrvLEDERArvwEMLNRERT 1570
Cdd:pfam15921 591 EK-AQLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLVNAGSERLRAVKD--IKQERD---QLLNEVKT 664
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1571 ARQAREEEArrlreriDRAETLGRTWSREESELQRARDQADQECGRLQQElraLERQKQQQTLQLQEESKLLSQKTESER 1650
Cdd:pfam15921 665 SRNELNSLS-------EDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSE---LEQTRNTLKSMEGSDGHAMKVAMGMQK 734
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 1651 QKAAQRGQelsrLEAaiLREKDQIYEKERTLRDLHAKVSREE---LSQETQTRET 1702
Cdd:pfam15921 735 QITAKRGQ----IDA--LQSKIQFLEEAMTNANKEKHFLKEEknkLSQELSTVAT 783
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1212-1539 |
3.02e-06 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 52.43 E-value: 3.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1212 VQLQERVHEIFQVDPETEQEITRLKAKLQEMAgkrsgveKEVEKLLPDLEVLRAQKPTVEyKEVTQEVVRHERSPEVLRE 1291
Cdd:pfam17380 278 VQHQKAVSERQQQEKFEKMEQERLRQEKEEKA-------REVERRRKLEEAEKARQAEMD-RQAAIYAEQERMAMERERE 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1292 IDRLKAQLNELVNSHGRSQE---------QLIRLQGERDEwRRERakvetktVSKEVVRHEKDPVLEKEAERLRQEVREA 1362
Cdd:pfam17380 350 LERIRQEERKRELERIRQEEiameisrmrELERLQMERQQ-KNER-------VRQELEAARKVKILEEERQRKIQQQKVE 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1363 AQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREehsrlsgslDEEVGRRRQLELEVQQL-RAGVEEQ 1441
Cdd:pfam17380 422 MEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVERLRQ---------QEEERKRKKLELEKEKRdRKRAEEQ 492
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1442 EGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEKiimEEVVKLEKDPDLEKSTEAlRWDLDQEKTQVTELNREC 1521
Cdd:pfam17380 493 RRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEE---ERRREAEEERRKQQEMEE-RRRIQEQMRKATEERSRL 568
|
330
....*....|....*...
gi 1002819366 1522 KNLQVQIDVLQKAKSQEK 1539
Cdd:pfam17380 569 EAMEREREMMRQIVESEK 586
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
651-1473 |
3.55e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.38 E-value: 3.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 651 EKAKAALDLERQIQDADRVIRGFEATLVQeapIPAEPGALQERVSELQRQRRELLEqqtcvlrlhRALKASEHACAALQN 730
Cdd:TIGR02169 227 ELLKEKEALERQKEAIERQLASLEEELEK---LTEEISELEKRLEEIEQLLEELNK---------KIKDLGEEEQLRVKE 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 731 NFQEFCQDLPRQQRQVRALTDRyhavgdqldlrekvVQDAALTYQQFKNCKDNLSSWLEHLPRS----QVRPSDGPSQIA 806
Cdd:TIGR02169 295 KIGELEAEIASLERSIAEKERE--------------LEDAEERLAKLEAEIDKLLAEIEELEREieeeRKRRDKLTEEYA 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 807 YKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQEKNLAKAY 886
Cdd:TIGR02169 361 ELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELE 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 887 TEVaaaqqqllqqlEFARKMLEKKElsedirrtHDAKQGSESPAQAGRESEALKA---QLEEERKRVARVQHELEAQRSQ 963
Cdd:TIGR02169 441 EEK-----------EDKALEIKKQE--------WKLEQLAADLSKYEQELYDLKEeydRVEKELSKLQRELAEAEAQARA 501
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 964 LLQlrtqrplERLEEKEVVEFyrdpqLEGSLSRVKAQVEEEGKRRAGLQADLEVA---------------AQKVVQLESK 1028
Cdd:TIGR02169 502 SEE-------RVRGGRAVEEV-----LKASIQGVHGTVAQLGSVGERYATAIEVAagnrlnnvvveddavAKEAIELLKR 569
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1029 RK----TMQPhlLTKeVTQVERDPGLDSQAAQLRIQIQQLRGED----AVISARLEGLKKELLALEKREVDVKEKVVVKE 1100
Cdd:TIGR02169 570 RKagraTFLP--LNK-MRDERRDLSILSEDGVIGFAVDLVEFDPkyepAFKYVFGDTLVVEDIEAARRLMGKYRMVTLEG 646
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1101 VVKVEKNLeMVKAAQALRlqmeEDAARRKQAEEAVAKLQARIEDLERAISSvepkvivkevkkveqdpgLLQESSRLRSL 1180
Cdd:TIGR02169 647 ELFEKSGA-MTGGSRAPR----GGILFSRSEPAELQRLRERLEGLKRELSS------------------LQSELRRIENR 703
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1181 LEEERTKNATLARELSDLHSKYSVVEKQRPKV-----QLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEK 1255
Cdd:TIGR02169 704 LDELSQELSDASRKIGEIEKEIEQLEQEEEKLkerleELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1256 LLPDLEVLRAQKPTVEYKEVTQEVVRHErspEVLREIDR-------LKAQLNELVNShgrSQEQLIRLQGERDEWRRERA 1328
Cdd:TIGR02169 784 LEARLSHSRIPEIQAELSKLEEEVSRIE---ARLREIEQklnrltlEKEYLEKEIQE---LQEQRIDLKEQIKSIEKEIE 857
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1329 -----KVETKTVSKEVVRHEKDpvLEKEAERLRQEVREAAQKRRAAEDAVYELQS----KRLLLERRKPEEKVVVQEVVV 1399
Cdd:TIGR02169 858 nlngkKEELEEELEELEAALRD--LESRLGDLKKERDELEAQLRELERKIEELEAqiekKRKRLSELKAKLEALEEELSE 935
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1002819366 1400 TQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQL-----RAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEK 1473
Cdd:TIGR02169 936 IEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALepvnmLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEK 1014
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
688-1317 |
3.87e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 52.28 E-value: 3.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 688 GALQERVSELQRQRRELLEQQTCVLRLHRALKASEHACAALQNNFQEFcQDLPRQQRQVRALTDRYHAVGDQLDLREKVV 767
Cdd:TIGR00618 263 KQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQ-RIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 768 QDAALTYQQFKNCKDNlsswlehlprsqVRPSDGPSQIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALqdyeLQA 847
Cdd:TIGR00618 342 EQRRLLQTLHSQEIHI------------RDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKEL----DIL 405
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 848 DTYRCSLEPTLAVSAPKRPRVAPLQESIQAQEKNLAKAYTEVAAAQQQLLQQLEFARKMLEKkeLSEDIRRTHDAKQGSE 927
Cdd:TIGR00618 406 QREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQS--LKEREQQLQTKEQIHL 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 928 SPAQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLR-TQRPLERLEekevvefYRDPQLEGSLSRVKAQVEEEGK 1006
Cdd:TIGR00618 484 QETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGpLTRRMQRGE-------QTYAQLETSEEDVYHQLTSERK 556
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1007 RRAGLQADLEVAAQKVVQLESKRKTMQPHLltkEVTQVERDPGLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALE 1086
Cdd:TIGR00618 557 QRASLKEQMQEIQQSFSILTQCDNRSKEDI---PNLQNITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLH 633
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1087 KREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEq 1166
Cdd:TIGR00618 634 LQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETH- 712
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1167 dpglLQESSRLRSLLEE-ERTKNATLARELSDLHSKYSVVEKQRPKvQLQERVHEIFQVDPETEQEITRLkAKLQEMAGK 1245
Cdd:TIGR00618 713 ----IEEYDREFNEIENaSSSLGSDLAAREDALNQSLKELMHQART-VLKARTEAHFNNNEEVTAALQTG-AELSHLAAE 786
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1002819366 1246 RSGVEKEVEKLLPDLEVLRAQ-----KPTVEYKEVTQEVVRHERSpEVLREIDRLKAQLNELVNSHGRSQEQLIRLQ 1317
Cdd:TIGR00618 787 IQFFNRLREEDTHLLKTLEAEigqeiPSDEDILNLQCETLVQEEE-QFLSRLEEKSATLGEITHQLLKYEECSKQLA 862
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1131-1711 |
4.01e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 52.28 E-value: 4.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1131 AEEAVAKLQARIEDLERAISSVEPKVIVKEVKKV--EQDPGLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQ 1208
Cdd:TIGR00618 210 TPCMPDTYHERKQVLEKELKHLREALQQTQQSHAylTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRA 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1209 RPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGV------EKEVEKLLPDLEVLRAQKPTVEyKEVTQEVVRH 1282
Cdd:TIGR00618 290 RKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRaahvkqQSSIEEQRRLLQTLHSQEIHIR-DAHEVATSIR 368
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1283 ERSPEVLREIDRLKAQ------LNELVNSHGRSQEQLIRLQG-------ERDEWRRERAKVETKTVSKEVVRHEKDPVLE 1349
Cdd:TIGR00618 369 EISCQQHTLTQHIHTLqqqkttLTQKLQSLCKELDILQREQAtidtrtsAFRDLQGQLAHAKKQQELQQRYAELCAAAIT 448
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1350 KEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREE-------HSRLSGSLDEEVG 1422
Cdd:TIGR00618 449 CTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGscihpnpARQDIDNPGPLTR 528
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1423 RRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAverelRQLTLRIQELEKRPPTVQE-KIIMEEVVKLEKD--PDLEKS 1499
Cdd:TIGR00618 529 RMQRGEQTYAQLETSEEDVYHQLTSERKQRASLK-----EQMQEIQQSFSILTQCDNRsKEDIPNLQNITVRlqDLTEKL 603
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1500 TEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEK----------TIYKEVIRVQKDRVLEDErARVWEMLNRER 1569
Cdd:TIGR00618 604 SEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALkltalhalqlTLTQERVREHALSIRVLP-KELLASRQLAL 682
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1570 TARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESE 1649
Cdd:TIGR00618 683 QKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEA 762
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1002819366 1650 RQKAA-------QRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSRE--------ELSQET-QTRETNLSTKISIL 1711
Cdd:TIGR00618 763 HFNNNeevtaalQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEipsdedilNLQCETlVQEEEQFLSRLEEK 840
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
901-1444 |
6.78e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.22 E-value: 6.78e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 901 EFARKMLEKKELSEDIRRTHDAKQGSESPAQagrESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQRP-----LER 975
Cdd:PRK03918 201 ELEEVLREINEISSELPELREELEKLEKEVK---ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEelkkeIEE 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 976 LEEK---------------EVVEFYRdpQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPhlLTKE 1040
Cdd:PRK03918 278 LEEKvkelkelkekaeeyiKLSEFYE--EYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKE--LEKR 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1041 VTQVERDPGLDSQAAQLRIQIQQLRGEDAVISarLEGLKKELLALEKREVDVKEKVVVKEVVKVeknlEMVKAAQALRLQ 1120
Cdd:PRK03918 354 LEELEERHELYEEAKAKKEELERLKKRLTGLT--PEKLEKELEELEKAKEEIEEEISKITARIG----ELKKEIKELKKA 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1121 MEE-DAARRK-----------QAEEAVAKLQARIEDLERAISSVEPKVIVkevkkveqdpgLLQESSRLRSLLEEERT-- 1186
Cdd:PRK03918 428 IEElKKAKGKcpvcgrelteeHRKELLEEYTAELKRIEKELKEIEEKERK-----------LRKELRELEKVLKKESEli 496
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1187 KNATLARELSDLHSKYSVVEKQRPKvqlqervheifqvdpETEQEITRLKAKLQEMAGKRSGVEKEVEKllpdLEVLRAQ 1266
Cdd:PRK03918 497 KLKELAEQLKELEEKLKKYNLEELE---------------KKAEEYEKLKEKLIKLKGEIKSLKKELEK----LEELKKK 557
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1267 KPTVEYKEVTQEvvrhERSPEVLREIDRLKAQLNELVNSHGRSQEQL----IRLQGERDEWRRERAKVEtKTVSKEVVRH 1342
Cdd:PRK03918 558 LAELEKKLDELE----EELAELLKELEELGFESVEELEERLKELEPFyneyLELKDAEKELEREEKELK-KLEEELDKAF 632
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1343 EKDPVLEKEAERLRQEVREAaqKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVvtqkdpKLREEHSRLSGSLDEEVG 1422
Cdd:PRK03918 633 EELAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLELSRELAGLRAELEELE------KRREEIKKTLEKLKEELE 704
|
570 580
....*....|....*....|..
gi 1002819366 1423 RRRQLELEVQQLRAGVEEQEGL 1444
Cdd:PRK03918 705 EREKAKKELEKLEKALERVEEL 726
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
651-1154 |
8.82e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.09 E-value: 8.82e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 651 EKAKAALDLERQIQDADRVIRGFEATLVQEApipAEPGALQERVSELQRQRRELLEQqtcVLRLHRALKASEHACAALQN 730
Cdd:COG1196 243 ELEAELEELEAELEELEAELAELEAELEELR---LELEELELELEEAQAEEYELLAE---LARLEQDIARLEERRRELEE 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 731 NFQEFCQDLPRQQRQVRALTDRYHAVGDQLDLREKVVQDAALTYQQFKN-----------CKDNLSSWLEHLPRSQVRPS 799
Cdd:COG1196 317 RLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEalleaeaelaeAEEELEELAEELLEALRAAA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 800 DGPSQIAYKLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQE 879
Cdd:COG1196 397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 880 KNLAKAYTEVAAAQQQLLQQLEFARKM---------LEKKELSEDIRRTHDAKQGSESPAQAGRESEALKAQLEEERKRV 950
Cdd:COG1196 477 AALAELLEELAEAAARLLLLLEAEADYegflegvkaALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDD 556
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 951 ARVQHELEAQRSQLLQLRTQRPLERLEEKEVVEFYRDPQLEGSLSR-VKAQVEEEGKRRAGLQADLEVAAQKVVQLESKR 1029
Cdd:COG1196 557 EVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDlVASDLREADARYYVLGDTLLGRTLVAARLEAAL 636
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1030 KTMQPHLLTKEVTQVERDPGLDSQA-----------AQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVV 1098
Cdd:COG1196 637 RRAVTLAGRLREVTLEGEGGSAGGSltggsrrellaALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEER 716
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 1099 KEVVKVEKNLEMVKAAQALRLQME---------EDAARRKQAEEAVAKLQARIEDLERAISSVEP 1154
Cdd:COG1196 717 LEEELEEEALEEQLEAEREELLEElleeeelleEEALEELPEPPDLEELERELERLEREIEALGP 781
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1175-1387 |
2.59e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 2.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1175 SRLRSLLEEERTKNATL------ARELSDLHSKYSVVEKQRPKVQLQERVHEIFQVdpetEQEITRLKAKLQEMAGKRSG 1248
Cdd:COG4913 238 ERAHEALEDAREQIELLepirelAERYAAARERLAELEYLRAALRLWFAQRRLELL----EAELEELRAELARLEAELER 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1249 VEKEVEKLLPDLEVLRAQKPTVEYKEVTQevvrherspeVLREIDRLKAQLNELVNSHGRSQEQLIRLQ----GERDEWR 1324
Cdd:COG4913 314 LEARLDALREELDELEAQIRGNGGDRLEQ----------LEREIERLERELEERERRRARLEALLAALGlplpASAEEFA 383
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002819366 1325 RERAKVetktvskevvrHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRK 1387
Cdd:COG4913 384 ALRAEA-----------AALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRK 435
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
700-1082 |
3.19e-05 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 49.18 E-value: 3.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 700 QRRELLEQqtcVLRLHRALKASEHACAALQNNFQEFCQDLPRQQRQVRALTDRYHAVGDQLDLrekvVQDAALTYQQFKN 779
Cdd:COG3096 279 ERRELSER---ALELRRELFGARRQLAEEQYRLVEMARELEELSARESDLEQDYQAASDHLNL----VQTALRQQEKIER 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 780 CKDNLSSwlehlprsqvrpsdgpsqIAYKLQAQKRLTQEIQSRERDRATASHLSQA----LQAALQDYE----------- 844
Cdd:COG3096 352 YQEDLEE------------------LTERLEEQEEVVEEAAEQLAEAEARLEAAEEevdsLKSQLADYQqaldvqqtrai 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 845 -----LQA--DTYRCSLEPTLAVSApkrprVAPLQESIQAQEKNLakayTEVAAAQQQLLQQLEFARKMLEKK-ELSEDI 916
Cdd:COG3096 414 qyqqaVQAleKARALCGLPDLTPEN-----AEDYLAAFRAKEQQA----TEEVLELEQKLSVADAARRQFEKAyELVCKI 484
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 917 ------RRTHDA-----KQGSESPAQAGREsEALKAQLEEERKRVARvQHELEAQRSQLLQlRTQRPLERLEEKEVVEfy 985
Cdd:COG3096 485 ageverSQAWQTarellRRYRSQQALAQRL-QQLRAQLAELEQRLRQ-QQNAERLLEEFCQ-RIGQQLDAAEELEELL-- 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 986 rdPQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESK----------------------------RKTMQpHLL 1037
Cdd:COG3096 560 --AELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARapawlaaqdalerlreqsgealadsqevTAAMQ-QLL 636
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1002819366 1038 TKEV-TQVERDPgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKEL 1082
Cdd:COG3096 637 EREReATVERDE-LAARKQALESQIERLSQPGGAEDPRLLALAERL 681
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
156-330 |
7.08e-05 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 46.28 E-value: 7.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 156 LEQKQKQVCAGQYGPGMAELEQQIAEHNILQKEIDAY----------GQQLRSLVGPDAATIR-------SQYRDLLKAA 218
Cdd:cd00176 16 LSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHeervealnelGEQLIEEGHPDAEEIQerleelnQRWEELRELA 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 219 SWRGQSLG---SLYTHLQGCTRQLSALAEQQRRILQQDWSDlmaDPAGVRREYEHFK--QHELLSQEQSVNQLEDDGERM 293
Cdd:cd00176 96 EERRQRLEealDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKelEEELEAHEPRLKSLNELAEEL 172
|
170 180 190
....*....|....*....|....*....|....*...
gi 1002819366 294 VELRHPAVGP-IQAHQEALKMEWQNFLNLCICQETQLQ 330
Cdd:cd00176 173 LEEGHPDADEeIEEKLEELNERWEELLELAEERQKKLE 210
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1055-1552 |
7.82e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 7.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1055 AQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVVKveknlEMVKAAQALRLQMEEDAARRKQAEEA 1134
Cdd:PRK03918 241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELK-----EKAEEYIKLSEFYEEYLDELREIEKR 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1135 VAKLQARIEDLERAISSVEPKVIVKEVKKVEQDpGLLQESSRLrslleEERTKNATLARELSDLHSKYSVVEKQRPKVQL 1214
Cdd:PRK03918 316 LSRLEEEINGIEERIKELEEKEERLEELKKKLK-ELEKRLEEL-----EERHELYEEAKAKKEELERLKKRLTGLTPEKL 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1215 QERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKllpdLEVLRAQKPTVEyKEVTQEvVRHERSPEVLREIDR 1294
Cdd:PRK03918 390 EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEE----LKKAKGKCPVCG-RELTEE-HRKELLEEYTAELKR 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1295 LKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEKDPVLEK------EAERLRQEVREAAQKRRA 1368
Cdd:PRK03918 464 IEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEElekkaeEYEKLKEKLIKLKGEIKS 543
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1369 AEDAV---YELQSKRLLLERRKPEEKVVVQEVVVtqkdpKLREEHSRLSGSLDEEVGR-----RRQLELE--VQQLRAGV 1438
Cdd:PRK03918 544 LKKELeklEELKKKLAELEKKLDELEEELAELLK-----ELEELGFESVEELEERLKElepfyNEYLELKdaEKELEREE 618
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1439 EEQEGLLSFQEDRSKKLA-VERELRQLTLRIQELEKRPPTVQEKIIMEEVVKLEKDpdleksTEALRWDLDQEKTQVTEL 1517
Cdd:PRK03918 619 KELKKLEEELDKAFEELAeTEKRLEELRKELEELEKKYSEEEYEELREEYLELSRE------LAGLRAELEELEKRREEI 692
|
490 500 510
....*....|....*....|....*....|....*
gi 1002819366 1518 NRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDR 1552
Cdd:PRK03918 693 KKTLEKLKEELEEREKAKKELEKLEKALERVEELR 727
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1286-1474 |
8.46e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.99 E-value: 8.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1286 PEVLREIDRLKAQLNELVNSHGRS---QEQLIRLQGERDEWRRERAKVETKTVSKEVV-------RHEKDPVLEKEAERL 1355
Cdd:COG4913 221 PDTFEAADALVEHFDDLERAHEALedaREQIELLEPIRELAERYAAARERLAELEYLRaalrlwfAQRRLELLEAELEEL 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1356 RQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDpKLREEHSRLSGSLDEEVGRRRQLEL------ 1429
Cdd:COG4913 301 RAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQLEREIE-RLERELEERERRRARLEALLAALGLplpasa 379
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1002819366 1430 --------EVQQLRAGVEEQEGLLS--FQEDRSKKLAVERELRQLTLRIQELEKR 1474
Cdd:COG4913 380 eefaalraEAAALLEALEEELEALEeaLAEAEAALRDLRRELRELEAEIASLERR 434
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
237-330 |
8.69e-05 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 43.47 E-value: 8.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 237 RQLSALAEQQRRILQQDwsDLMADPAGVRREYEHFK--QHELLSQEQSVNQLEDDGERMVELRHPAVGPIQAHQEALKME 314
Cdd:smart00150 8 DELEAWLEEKEQLLASE--DLGKDLESVEALLKKHEafEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLEELNER 85
|
90
....*....|....*.
gi 1002819366 315 WQNFLNLCICQETQLQ 330
Cdd:smart00150 86 WEELKELAEERRQKLE 101
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
817-1571 |
1.04e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 47.76 E-value: 1.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 817 QEIQSRERDRATASHLsQALQAALQDYEL-----QADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQEKNLAKAYTEVAA 891
Cdd:TIGR02169 198 QQLERLRREREKAERY-QALLKEKREYEGyellkEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEE 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 892 AQQQLLQQLE-----FARKMLEKKELSEDIRRTHDAKQGSESPAQAGR------------ESEALKAQLEEERKRVARVQ 954
Cdd:TIGR02169 277 LNKKIKDLGEeeqlrVKEKIGELEAEIASLERSIAEKERELEDAEERLakleaeidkllaEIEELEREIEEERKRRDKLT 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 955 HELEAQRSQLLQLRTQ-------------RPLERLEEKEVVEFYRDPqLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQK 1021
Cdd:TIGR02169 357 EEYAELKEELEDLRAEleevdkefaetrdELKDYREKLEKLKREINE-LKRELDRLQEELQRLSEELADLNAAIAGIEAK 435
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1022 VVQLESKRKTMQPHLLTKEvTQVERdpgLDSQAAQLRIQIQQLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEV 1101
Cdd:TIGR02169 436 INELEEEKEDKALEIKKQE-WKLEQ---LAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRA 511
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1102 VKVEKN------------LEMVKAAQAL--------RLQM-----EEDAAR------RKQAEEA----VAKLQARIEDLE 1146
Cdd:TIGR02169 512 VEEVLKasiqgvhgtvaqLGSVGERYATaievaagnRLNNvvvedDAVAKEaiellkRRKAGRAtflpLNKMRDERRDLS 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1147 R---------AISSVE-PKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTknATLARELSD-----------LHSKYSVV 1205
Cdd:TIGR02169 592 IlsedgvigfAVDLVEfDPKYEPAFKYVFGDTLVVEDIEAARRLMGKYRM--VTLEGELFEksgamtggsraPRGGILFS 669
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1206 EKQRPKVQ-LQERVHE-------IFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQkptVEYKEVTQ 1277
Cdd:TIGR02169 670 RSEPAELQrLRERLEGlkrelssLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKER---LEELEEDL 746
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1278 EVVRHERSpEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRhekdpvLEKEAERLRQ 1357
Cdd:TIGR02169 747 SSLEQEIE-NVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSR------IEARLREIEQ 819
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1358 EVREAAQKRRAAEDAVYELQSKRLLLERRKPEekvvvqevvvtqkdpkLREEHSRLSGSLDEEVGRRRQLELEVQQLRag 1437
Cdd:TIGR02169 820 KLNRLTLEKEYLEKEIQELQEQRIDLKEQIKS----------------IEKEIENLNGKKEELEEELEELEAALRDLE-- 881
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1438 vEEQEGLLSFQEDRSKKL-AVERELRQLTLRIQELEKRPPTVQEK--IIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQV 1514
Cdd:TIGR02169 882 -SRLGDLKKERDELEAQLrELERKIEELEAQIEKKRKRLSELKAKleALEEELSEIEDPKGEDEEIPEEELSLEDVQAEL 960
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|....*..
gi 1002819366 1515 TELNRECKNLQvqiDVLQKAKSQektiYKEVIRVQKDrvLEDERARvwemLNRERTA 1571
Cdd:TIGR02169 961 QRVEEEIRALE---PVNMLAIQE----YEEVLKRLDE--LKEKRAK----LEEERKA 1004
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
928-1153 |
1.33e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 928 SPAQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQrpLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKR 1007
Cdd:COG4942 14 AAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQ--LAALERRIAALARRIRALEQELAALEAELAELEKE 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1008 RAGLQADLEVAAQKVVQL------ESKRKTMQPHLLTKEVTQVERDPG-LDSQAAQLRIQIQQLRGEdaviSARLEGLKK 1080
Cdd:COG4942 92 IAELRAELEAQKEELAELlralyrLGRQPPLALLLSPEDFLDAVRRLQyLKYLAPARREQAEELRAD----LAELAALRA 167
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1002819366 1081 ELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVE 1153
Cdd:COG4942 168 ELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
994-1385 |
1.37e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.07 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 994 LSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQverdpgldsQAAQLRIQIQQLRGEDAVISA 1073
Cdd:COG4717 76 LEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLL---------QLLPLYQELEALEAELAELPE 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1074 RLEGLKKELLA-------LEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLE 1146
Cdd:COG4717 147 RLEELEERLEElreleeeLEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELE 226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1147 RAISSVEPKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLH------SKYSVVEKQRPKVQLQERVHE 1220
Cdd:COG4717 227 EELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLflvlglLALLFLLLAREKASLGKEAEE 306
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1221 IFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEvlraqkptvEYKEVTQEVVRHERSpevlREIDRLKAQLN 1300
Cdd:COG4717 307 LQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIE---------ELQELLREAEELEEE----LQLEELEQEIA 373
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1301 ELVNSHG-RSQEQLIRLQGERDEWRRERAKVET--------KTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAED 1371
Cdd:COG4717 374 ALLAEAGvEDEEELRAALEQAEEYQELKEELEEleeqleelLGELEELLEALDEEELEEELEELEEELEELEEELEELRE 453
|
410
....*....|....
gi 1002819366 1372 AVYELQSKRLLLER 1385
Cdd:COG4717 454 ELAELEAELEQLEE 467
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1710-1738 |
1.39e-04 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 40.77 E-value: 1.39e-04
10 20
....*....|....*....|....*....
gi 1002819366 1710 ILEPETGKDMSPYEAYKRGIIDRGQYLQL 1738
Cdd:pfam00681 11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1840-1877 |
2.70e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 2.70e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1840 LGLGDDSFPIAGIYDTTTDNKCSIKTAVAKNMLDPITG 1877
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
903-1387 |
4.59e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.14 E-value: 4.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 903 ARKMLEKKELSEDIRRTHDAKQGSespaQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRtqrplERLEEKEVV 982
Cdd:COG4717 54 EADELFKPQGRKPELNLKELKELE----EELKEAEEKEEEYAELQEELEELEEELEELEAELEELR-----EELEKLEKL 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 983 EFYRDPQLEgsLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVERDPGLDSQAAQLRIQIQ 1062
Cdd:COG4717 125 LQLLPLYQE--LEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELE 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1063 QLRGEDAVISARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQAri 1142
Cdd:COG4717 203 ELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLF-- 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1143 edLERAISSVEPKVIVKEVKKVEQDPGLLQESSRLRSlLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIF 1222
Cdd:COG4717 281 --LVLGLLALLFLLLAREKASLGKEAEELQALPALEE-LEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAE 357
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1223 QVDPE-TEQEITRLKAKLQEMAGKRSgvEKEVEKLLPDLEvlRAQKPTVEYKEVTQEVVRHERSPEVLREI---DRLKAQ 1298
Cdd:COG4717 358 ELEEElQLEELEQEIAALLAEAGVED--EEELRAALEQAE--EYQELKEELEELEEQLEELLGELEELLEAldeEELEEE 433
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1299 LNELvnshgrsQEQLIRLQGERDEWRRERAKVETKTvsKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQS 1378
Cdd:COG4717 434 LEEL-------EEELEELEEELEELREELAELEAEL--EQLEEDGELAELLQELEELKAELRELAEEWAALKLALELLEE 504
|
....*....
gi 1002819366 1379 KRLLLERRK 1387
Cdd:COG4717 505 AREEYREER 513
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1406-1613 |
5.19e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 45.29 E-value: 5.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1406 LREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEkiime 1485
Cdd:COG4913 615 LEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERLDASSDDLAA----- 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1486 evvklekdpdLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEML 1565
Cdd:COG4913 690 ----------LEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAAL 759
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 1002819366 1566 NRERTARQAREEEarrlreriDRAETLGRTWSREESELQRARDQADQE 1613
Cdd:COG4913 760 GDAVERELRENLE--------ERIDALRARLNRAEEELERAMRAFNRE 799
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
808-1081 |
6.58e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.13 E-value: 6.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 808 KLQAQKRLTQEIQSRERDRATASHLSQALQAALQDYELQADTYRCSLEPTLAVSAPKRPRVAPLQESIQAQEKNLAKAYT 887
Cdd:PTZ00121 1491 KAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAK 1570
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 888 EVAAAQQQLLQQLEFARKM-------------LEKKELSEDIRRTHDAKQGSESPAQAGRE---SEALKAQLEEERKRVA 951
Cdd:PTZ00121 1571 KAEEDKNMALRKAEEAKKAeearieevmklyeEEKKMKAEEAKKAEEAKIKAEELKKAEEEkkkVEQLKKKEAEEKKKAE 1650
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 952 RVQHELEAQRSQLLQLRTQ-----------RPLERLEEKEVVEFYRDPQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQ 1020
Cdd:PTZ00121 1651 ELKKAEEENKIKAAEEAKKaeedkkkaeeaKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1002819366 1021 KVVQLESKRKTMQphlltKEVTQVERDPGLDSQAAQLRIQIQQLRGE-----DAVISarlEGLKKE 1081
Cdd:PTZ00121 1731 KAEEAKKEAEEDK-----KKAEEAKKDEEEKKKIAHLKKEEEKKAEEirkekEAVIE---EELDEE 1788
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1071-1712 |
6.64e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 44.96 E-value: 6.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1071 ISARLEGLKKELLALEKREVDVKEKVVVKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAIS 1150
Cdd:pfam02463 188 LIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKL 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1151 SvepKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQrpKVQLQERVHEIFQVDPETEQ 1230
Cdd:pfam02463 268 A---QVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE--KKKAEKELKKEKEEIEELEK 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1231 EITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKpTVEYKEVTQEVVRHErspEVLREIDRLKAQLNELvnsHGRSQ 1310
Cdd:pfam02463 343 ELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKK-KLESERLSSAAKLKE---EELELKSEEEKEAQLL---LELAR 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1311 EQLIRLQGERDEWRRERAK----VETKTVSKEVVRHEKDPVLEKEAERLRQEVREAaqKRRAAEDAVYELQSKRLLLERR 1386
Cdd:pfam02463 416 QLEDLLKEEKKEELEILEEeeesIELKQGKLTEEKEELEKQELKLLKDELELKKSE--DLLKETQLVKLQEQLELLLSRQ 493
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1387 KPEEKVVVQEVVVTQKDPKLREEHSRLSGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTL 1466
Cdd:pfam02463 494 KLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTEL 573
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1467 RIQELEKRPPTVQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQEKTIYKeVI 1546
Cdd:pfam02463 574 PLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRK-GV 652
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1547 RVQKDRVLEDERARVWEMLNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALER 1626
Cdd:pfam02463 653 SLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQD 732
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1627 QKQQQTLQLQEESKLLSQKTESERQKAAQRGQELSRLEaaiLREKDQIYEKERTLRDLHAKVSREELSQETQTRETNLST 1706
Cdd:pfam02463 733 KINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELS---LKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEE 809
|
....*.
gi 1002819366 1707 KISILE 1712
Cdd:pfam02463 810 LKEEAE 815
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1115-1328 |
6.66e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.37 E-value: 6.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1115 QALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVKEVKKVEQDPGLLQESSRLRSLLEEERTKNATLARE 1194
Cdd:COG4942 30 EQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAEL 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1195 LSDL-----HSKYSVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKPT 1269
Cdd:COG4942 110 LRALyrlgrQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAA 189
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1002819366 1270 VEykevTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERA 1328
Cdd:COG4942 190 LE----ALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTP 244
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1954-1991 |
6.74e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.00 E-value: 6.74e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1002819366 1954 LPHLQVQHLTGGLIDPKRTGRIPIQQALLSGMISEELA 1991
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1125-1388 |
6.96e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.37 E-value: 6.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1125 AARRKQAEEAVAKLQARIEDLERAISSVEpkvivkevkkveqdpgllQESSRLRSLLEEERTKNATLARELSDLHSKYSV 1204
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALK------------------KEEKALLKQLAALERRIAALARRIRALEQELAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1205 VEKQRPKVQLQERvheifqvdpETEQEITRLKAKLQEMAGK--RSGVEKEVEKLLPDLEVLRAQKPTVEYKEVTqevvrh 1282
Cdd:COG4942 81 LEAELAELEKEIA---------ELRAELEAQKEELAELLRAlyRLGRQPPLALLLSPEDFLDAVRRLQYLKYLA------ 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1283 ersPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEkdpvLEKEAERLRQEVREA 1362
Cdd:COG4942 146 ---PARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLAR----LEKELAELAAELAEL 218
|
250 260
....*....|....*....|....*.
gi 1002819366 1363 AQKRRAAEDAVYELQSKRLLLERRKP 1388
Cdd:COG4942 219 QQEAEELEALIARLEAEAAAAAERTP 244
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1452-1679 |
8.13e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 43.98 E-value: 8.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1452 SKKLAVERELRQLTLRIQELEKRpptvQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVL 1531
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKE----LAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1532 QKAKSQEKTIYKEVIRVQKDRvledERARVWEMLNRERTARQAREEEARRLRERIDRAEtLGRTWSREESELQRARDQAD 1611
Cdd:COG4942 96 RAELEAQKEELAELLRALYRL----GRQPPLALLLSPEDFLDAVRRLQYLKYLAPARRE-QAEELRADLAELAALRAELE 170
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1002819366 1612 QECGRLQQELRALERQKQQQTLQLQEESKL---LSQKTESERQKAAQRGQELSRLEAAILREKDQIYEKER 1679
Cdd:COG4942 171 AERAELEALLAELEEERAALEALKAERQKLlarLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1227-1679 |
1.34e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.60 E-value: 1.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1227 ETEQEITRLKAKLQEMAGKRSGVEKEVEKLLPDLEVLRAQKptvEYKEVTQEVVRHERspevlrEIDRLKAQLNELVNSH 1306
Cdd:COG4717 85 EKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLL---QLLPLYQELEALEA------ELAELPERLEELEERL 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1307 GRSQEQLIRLQGERDEWRRERAKVETKTVSKEVVRHEKDPVLEKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERR 1386
Cdd:COG4717 156 EELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENE 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1387 KPEEkvvvqevvvtQKDPKLREEHSRLSG--------SLDEEVGRRRQLELEVQQLRAGVeeqeGLLSFQEDRSKKLAVE 1458
Cdd:COG4717 236 LEAA----------ALEERLKEARLLLLIaaallallGLGGSLLSLILTIAGVLFLVLGL----LALLFLLLAREKASLG 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1459 RELRQLtlriqELEKRPPTVQEKIIMEEVVKLEKDPDLEKStEALRWDLDQEKTQVTELNRECKNLQVQIDVLQKAKSQ- 1537
Cdd:COG4717 302 KEAEEL-----QALPALEELEEEELEELLAALGLPPDLSPE-ELLELLDRIEELQELLREAEELEEELQLEELEQEIAAl 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1538 --------EKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRLRERIDRAETlgrtwSREESELQRARDQ 1609
Cdd:COG4717 376 laeagvedEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEEL-----EEELEELEEELEE 450
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1610 ADQECGRLQQELRALERQKQQQTLQLQEESKllsqktESERQKAAQRGQELsRLEAAILREKDQIYEKER 1679
Cdd:COG4717 451 LREELAELEAELEQLEEDGELAELLQELEEL------KAELRELAEEWAAL-KLALELLEEAREEYREER 513
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1350-1740 |
1.77e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 43.57 E-value: 1.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1350 KEAERLRQEVREAAQKRR-AAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHS----RLSGSLDEEVGRR 1424
Cdd:pfam15921 88 KDLQRRLNESNELHEKQKfYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTvhelEAAKCLKEDMLED 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1425 RQLELEvqQLRAGVEEQEGLL--------SFQEDRSKKL----------------AVERELRQLTLRIQELEKRPPTVQE 1480
Cdd:pfam15921 168 SNTQIE--QLRKMMLSHEGVLqeirsilvDFEEASGKKIyehdsmstmhfrslgsAISKILRELDTEISYLKGRIFPVED 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1481 KIimeEVVKLEKDPDLE-------KSTEALRWDLDQEKTQVTE----LNRECKNLQVQIDVLQKAKSQEKTIYKevirvq 1549
Cdd:pfam15921 246 QL---EALKSESQNKIElllqqhqDRIEQLISEHEVEITGLTEkassARSQANSIQSQLEIIQEQARNQNSMYM------ 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1550 kdRVLEDERARVWEMLNRERTARQAREeearrlreriDRAETLGRTWSREESELQRAR---DQADQECG----RLQQELR 1622
Cdd:pfam15921 317 --RQLSDLESTVSQLRSELREAKRMYE----------DKIEELEKQLVLANSELTEARterDQFSQESGnlddQLQKLLA 384
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1623 ALERQKQQQTLQLQEESKLLSQKTESE------RQKAAQRGQELSRLEAAILREKDQIY-EKERTLRDLHAK-VSREELS 1694
Cdd:pfam15921 385 DLHKREKELSLEKEQNKRLWDRDTGNSitidhlRRELDDRNMEVQRLEALLKAMKSECQgQMERQMAAIQGKnESLEKVS 464
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1002819366 1695 QETQTRETNLSTKISILEPETGKDMSpYEAYKRGIIDRGQYLQLQE 1740
Cdd:pfam15921 465 SLTAQLESTKEMLRKVVEELTAKKMT-LESSERTVSDLTASLQEKE 509
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1516-1712 |
1.94e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.39 E-value: 1.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1516 ELNRECKNLQVQIDVLQKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRLRERIDRAETLGRT 1595
Cdd:COG1196 217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1596 WSREESELQRARDQADQECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRgQELSRLEAAILREKDQIY 1675
Cdd:COG1196 297 LARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAE-AELAEAEEALLEAEAELA 375
|
170 180 190
....*....|....*....|....*....|....*...
gi 1002819366 1676 EKERTLRDLHAKVSREELSQ-ETQTRETNLSTKISILE 1712
Cdd:COG1196 376 EAEEELEELAEELLEALRAAaELAAQLEELEEAEEALL 413
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
904-1080 |
2.31e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.98 E-value: 2.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 904 RKMLEKKELSEDIRRTHDAKQGSESPAQAGRESEALKAQLEEERKRVARVQHELEAQRSQLLQLRTQR---PLERLEeke 980
Cdd:COG4913 265 AAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIrgnGGDRLE--- 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 981 vvefyrdpQLEGSLSRVKAQVEEEGKRRAGLQADLEVAAQKVVQLESKRKTMQPHLLTKEVTQVERDPGLDSQAAQLRIQ 1060
Cdd:COG4913 342 --------QLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAA 413
|
170 180
....*....|....*....|
gi 1002819366 1061 IQQLRGEDAVISARLEGLKK 1080
Cdd:COG4913 414 LRDLRRELRELEAEIASLER 433
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1407-1729 |
2.33e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.13 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1407 REEHSRLSGSLDEEVGRRRQLElEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRqltLRIQELEKRPPTVQEKIIMEE 1486
Cdd:TIGR02169 156 RKIIDEIAGVAEFDRKKEKALE-ELEEVEENIERLDLIIDEKRQQLERLRREREKA---ERYQALLKEKREYEGYELLKE 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1487 VVKLEKD-PDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDvlQKAKSQEKTIYKEVIRVQKD-RVLEDERARVWEM 1564
Cdd:TIGR02169 232 KEALERQkEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLE--ELNKKIKDLGEEEQLRVKEKiGELEAEIASLERS 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1565 LNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQADQECGRLQQELRALERqkqqQTLQLQEESKLLSQ 1644
Cdd:TIGR02169 310 IAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRA----ELEEVDKEFAETRD 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1645 KTESERQKAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAKVSR-EELSQETQTRETNLSTKISILEPE---TGKDMS 1720
Cdd:TIGR02169 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGiEAKINELEEEKEDKALEIKKQEWKleqLAADLS 465
|
....*....
gi 1002819366 1721 PYEAYKRGI 1729
Cdd:TIGR02169 466 KYEQELYDL 474
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1171-1533 |
2.71e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.83 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1171 LQESSRLRSLLEEERTKNATLARELSDLHSKYSVVEKQRPKVQLQERVHEIFQVDPETEQEITRLKAKLQEMAGKRSGVE 1250
Cdd:COG4717 73 LKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEELE 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1251 KEVEKLLPDLEVLRAQKPTVEYKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGERDEWRRERAKV 1330
Cdd:COG4717 153 ERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1331 ETKTVSKEVVRHEKDP-------------------------------------------VLEKEAERLRQEVREAaqKRR 1367
Cdd:COG4717 233 ENELEAAALEERLKEArlllliaaallallglggsllsliltiagvlflvlgllallflLLAREKASLGKEAEEL--QAL 310
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1368 AAEDAVYELQSKRLLLERRKPEEKVVVQEVVVTQKDPKLREEHSRLSgSLDEEVgRRRQLELEVQQL--RAGVEEQEGLL 1445
Cdd:COG4717 311 PALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAE-ELEEEL-QLEELEQEIAALlaEAGVEDEEELR 388
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1446 SFQEDRSKKLAVERELRQLTLRIQELEKRpptvqekiiMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQ 1525
Cdd:COG4717 389 AALEQAEEYQELKEELEELEEQLEELLGE---------LEELLEALDEEELEEELEELEEELEELEEELEELREELAELE 459
|
....*...
gi 1002819366 1526 VQIDVLQK 1533
Cdd:COG4717 460 AELEQLEE 467
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1292-1705 |
2.93e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.45 E-value: 2.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1292 IDRLKAQLNELVNSHGRsqeQLIRLQGERDEWRRERAKVETKtvskevvrhekdpvlEKEAERLRQEVREAAQKRRAAED 1371
Cdd:COG4717 48 LERLEKEADELFKPQGR---KPELNLKELKELEEELKEAEEK---------------EEEYAELQEELEELEEELEELEA 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1372 AVYELQSKRLLLERRKPEEKVVVQEVVVTQKdpkLREEHSRLsGSLDEEVGRRRQLELEVQQLRAGVEEQEGLLSfQEDR 1451
Cdd:COG4717 110 ELEELREELEKLEKLLQLLPLYQELEALEAE---LAELPERL-EELEERLEELRELEEELEELEAELAELQEELE-ELLE 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1452 SKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKLEKDPDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQIDVL 1531
Cdd:COG4717 185 QLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGL 264
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1532 QKAKSQEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARDQAD 1611
Cdd:COG4717 265 GGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLD 344
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1612 --QECGRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQ--KAAQRGQELSRLEAAILREKDQIYEKERTLRDLHAK 1687
Cdd:COG4717 345 riEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEElrAALEQAEEYQELKEELEELEEQLEELLGELEELLEA 424
|
410
....*....|....*...
gi 1002819366 1688 VSREELSQETQTRETNLS 1705
Cdd:COG4717 425 LDEEELEEELEELEEELE 442
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
939-1708 |
3.20e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 42.85 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 939 LKAQLEEERKRVARVQHELEAQRSQLLQLRTQR--PLERLEEKE--VVEFYRDPQLE-GSLSRVKAQVEEEGKRRAGLQA 1013
Cdd:pfam01576 227 LQAQIAELRAQLAKKEEELQAALARLEEETAQKnnALKKIRELEaqISELQEDLESErAARNKAEKQRRDLGEELEALKT 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1014 DLE-----VAAQKvvQLESKRKTmqphlltkEVTQVERdpGLDSQAAQLRIQIQQLRGEDAV----ISARLEGLKKELLA 1084
Cdd:pfam01576 307 ELEdtldtTAAQQ--ELRSKREQ--------EVTELKK--ALEEETRSHEAQLQEMRQKHTQaleeLTEQLEQAKRNKAN 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1085 LEKrevdvkekvvvKEVVKVEKNLEMVKAAQALRLQMEEDAARRKQAEEAVAKLQARIEDLERAISSVEPKVIVkevkkv 1164
Cdd:pfam01576 375 LEK-----------AKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSK------ 437
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1165 eqdpgLLQESSRLRSLLEEERTKNATLARELSDLHSKYSVV-----EKQRPKVQLQERVHEIfqvdpetEQEITRLKAKL 1239
Cdd:pfam01576 438 -----LQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTqellqEETRQKLNLSTRLRQL-------EDERNSLQEQL 505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1240 QEMAGKRSGVEKEVEKLLPDLEVLRAQkptVEYKEVTQEVVRHERSpEVLREIDRLKAQLNELVNSHGRSQEQLIRLQGE 1319
Cdd:pfam01576 506 EEEEEAKRNVERQLSTLQAQLSDMKKK---LEEDAGTLEALEEGKK-RLQRELEALTQQLEEKAAAYDKLEKTKNRLQQE 581
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1320 RDEWRRERAKVETKTVSKEVVRHEKDPVLEKEaerlRQEVREAAQKRRAAEDAVYELQSKRLLLERRKPEEKVVVQEVVV 1399
Cdd:pfam01576 582 LDDLLVDLDHQRQLVSNLEKKQKKFDQMLAEE----KAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELER 657
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1400 TQKdpKLREEHSRLSGSLDEeVGR--------RRQLELEVQQLRAGVEEQEGLLSFQEDRSKKLAV---------EREL- 1461
Cdd:pfam01576 658 TNK--QLRAEMEDLVSSKDD-VGKnvhelersKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVnmqalkaqfERDLq 734
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1462 ----------RQLTLRIQELEKR--PPTVQEKIIMEEVVKLEKD-PDLEKSTEALRWDLDQEKTQVTELNRECKNLQVQI 1528
Cdd:pfam01576 735 ardeqgeekrRQLVKQVRELEAEleDERKQRAQAVAAKKKLELDlKELEAQIDAANKGREEAVKQLKKLQAQMKDLQREL 814
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1529 DVLQKAKSQEKTIYKEVIRvqKDRVLEDERARVWEMLNRERTARQAREEEARRLRERIDRAETLGRTWSREESELQRARD 1608
Cdd:pfam01576 815 EEARASRDEILAQSKESEK--KLKNLEAELLQLQEDLAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIA 892
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1609 QADQECGRLQQELRAL-ERQKQQQTLQLQEESKL-----LSQKTESERQKAAQRGQELSrleAAILREKDQIYEKER-TL 1681
Cdd:pfam01576 893 QLEEELEEEQSNTELLnDRLRKSTLQVEQLTTELaaersTSQKSESARQQLERQNKELK---AKLQEMEGTVKSKFKsSI 969
|
810 820
....*....|....*....|....*..
gi 1002819366 1682 RDLHAKVSREELSQETQTRETNLSTKI 1708
Cdd:pfam01576 970 AALEAKIAQLEEQLEQESRERQAANKL 996
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1349-1750 |
4.54e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 42.27 E-value: 4.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1349 EKEAERLRQEVREAAQKRRAAEDAVYELQSKRLLLERRKpeekvVVQEVVVTQKDPKLREEHSRLsgsLDEEVGRRRQLE 1428
Cdd:pfam02463 169 RKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAK-----KALEYYQLKEKLELEEEYLLY---LDYLKLNEERID 240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1429 LEVQQLRAGVEEQEGLLSFQEDRSKKLAVERELRQLTLRIQELEKRPPTVQEKIIMEEVVKLEKdpdLEKSTEALRWDLD 1508
Cdd:pfam02463 241 LLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLK---LERRKVDDEEKLK 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1509 QEKTQVTELNRECKNLQVQIDVLQKAKS--QEKTIYKEVIRVQKDRVLEDERARVWEMLNRERTARQAREEEARRLRERI 1586
Cdd:pfam02463 318 ESEKEKKKAEKELKKEKEEIEELEKELKelEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1587 DRAETLGRTWSREESELQRARDQADqecgRLQQELRALERQKQQQTLQLQEESKLLSQKTESERQKAAQRGQELSRLEAA 1666
Cdd:pfam02463 398 ELKSEEEKEAQLLLELARQLEDLLK----EEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDL 473
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1667 ILREKDQIYEKERTLRDLHAKVsREELSQETQTRETNLSTKISILEPETGKDMSPYEAYKRGIIDRGQYLQLQELECDWE 1746
Cdd:pfam02463 474 LKETQLVKLQEQLELLLSRQKL-EERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVE 552
|
....
gi 1002819366 1747 EVTT 1750
Cdd:pfam02463 553 VSAT 556
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1710-1735 |
5.32e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 36.31 E-value: 5.32e-03
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1180-1719 |
6.02e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 41.75 E-value: 6.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1180 LLEEERTKNATLARELSDLHSKYsvVEKQRPKVQLQERVHEI-----FQVDPETEQeitrLKAKLQEMAGKRSGVEKEVE 1254
Cdd:pfam12128 245 KLQQEFNTLESAELRLSHLHFGY--KSDETLIASRQEERQETsaelnQLLRTLDDQ----WKEKRDELNGELSAADAAVA 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1255 KLLPDLEVLRAQKPTVEYKEVTQEVVRHERSPEVLREIDRLKAQLNELVNSHGRSQEqlirlqgerdEWRRERAKVETKT 1334
Cdd:pfam12128 319 KDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTA----------KYNRRRSKIKEQN 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1335 VSKEVVRHEKdpvlekeaerlRQEVREAAQKRRAAEDAVYELQSKRLllerrkpeekvvvqEVVVTQKDPKLREEHSRLS 1414
Cdd:pfam12128 389 NRDIAGIKDK-----------LAKIREARDRQLAVAEDDLQALESEL--------------REQLEAGKLEFNEEEYRLK 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1415 GSLDEEVGRRRQL---ELEVQQLRAGVEEQEGLLSFQEDRSKK-LAVERELRQL-TLRIQELEKR---PPTVQEKIIMEE 1486
Cdd:pfam12128 444 SRLGELKLRLNQAtatPELLLQLENFDERIERAREEQEAANAEvERLQSELRQArKRRDQASEALrqaSRRLEERQSALD 523
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1487 VVKLEKDPDLEK-----STEALRWDLDQEKTQVTELNRECknlQVQIDVLQKAKSQEKTIYKEVIRVQKDRV-------- 1553
Cdd:pfam12128 524 ELELQLFPQAGTllhflRKEAPDWEQSIGKVISPELLHRT---DLDPEVWDGSVGGELNLYGVKLDLKRIDVpewaasee 600
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1554 -LEDERARVWEMLNRERTARQAREEEARRLRERIDRAetlgrtwSREESELQRARDQADQECGRLQQELRALERQKQQQT 1632
Cdd:pfam12128 601 eLRERLDKAEEALQSAREKQAAAEEQLVQANGELEKA-------SREETFARTALKNARLDLRRLFDEKQSEKDKKNKAL 673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1002819366 1633 LQLQEeskllsQKTESERQKAAQRGQELSRLEAAILREKDQIYEkERTLRDLHAKVSREE-------LSQETQTRETNLS 1705
Cdd:pfam12128 674 AERKD------SANERLNSLEAQLKQLDKKHQAWLEEQKEQKRE-ARTEKQAYWQVVEGAldaqlalLKAAIAARRSGAK 746
|
570
....*....|....
gi 1002819366 1706 TKISILEPETGKDM 1719
Cdd:pfam12128 747 AELKALETWYKRDL 760
|
|
|