|
Name |
Accession |
Description |
Interval |
E-value |
| Spectrin_like |
pfam18373 |
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ... |
548-625 |
7.01e-36 |
|
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs) pfam00435. Spectrin repeats (SRs) consist of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6 which has a divergent structure relative to the other SRs. SR6 shows significant deviations in helices A and B where they are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Due to these differences with other spectrin repeats, this region is termed spectrin-like repeat.
Pssm-ID: 465730 Cd Length: 78 Bit Score: 131.57 E-value: 7.01e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 548 VSWHYCMIDIEKIRAMTIAKLKTMRQEDYMKTIADLELHYQEFIRNSQGSEMFGDDDKRKIQSQFTDAQKHYQTLVIQ 625
Cdd:pfam18373 1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
|
|
| SH3_10 |
pfam17902 |
SH3 domain; This entry represents an SH3 domain. |
447-513 |
1.44e-26 |
|
SH3 domain; This entry represents an SH3 domain.
Pssm-ID: 407754 Cd Length: 65 Bit Score: 104.27 E-value: 1.44e-26
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 975830145 447 QLKPRNpdYRSNKPIILRALCDYKQDQKIVHKGDECILKDNNERSKWYVTGPGGVDMLVPSVGLIIP 513
Cdd:pfam17902 1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1018-1501 |
6.50e-17 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 87.38 E-value: 6.50e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1018 SEMLKSLEDLKLKNTKIEVLEEEL-RLARDANSENCNKNKFL------DQNLQKYQAECSQFKAKLASLEELKRQAELDG 1090
Cdd:TIGR04523 103 SDLSKINSEIKNDKEQKNKLEVELnKLEKQKKENKKNIDKFLteikkkEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1091 KSAKQNLDKCYGQ----------IKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLES 1160
Cdd:TIGR04523 183 LNIQKNIDKIKNKllklelllsnLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQ 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1161 EKAIK---EKEYEIERLRVLLqeegtrkreyeNELAKVRNHYNEEMSNLRNKYETEINitkTTIKEISMQKEDDSKNLRN 1237
Cdd:TIGR04523 263 NKIKKqlsEKQKELEQNNKKI-----------KELEKQLNQLKSEISDLNNQKEQDWN---KELKSELKNQEKKLEEIQN 328
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1238 QLDRLSRENRDLKDEIvrlndSILQATEQRRRAEENALQQkacgsEIMQKKQHLEIELKQvmqqrsedNARHKQSLEEAA 1317
Cdd:TIGR04523 329 QISQNNKIISQLNEQI-----SQLKKELTNSESENSEKQR-----ELEEKQNEIEKLKKE--------NQSYKQEIKNLE 390
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1318 KTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAK 1397
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQL 470
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1398 STLEAETR-VKQRLECEKQQIQ---NDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEErcrRK 1473
Cdd:TIGR04523 471 KVLSRSINkIKQNLEQKQKELKskeKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLED---EL 547
|
490 500
....*....|....*....|....*...
gi 975830145 1474 LEDSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:TIGR04523 548 NKDDFELKKENLEKEIDEKNKEIEELKQ 575
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1104-1438 |
2.28e-16 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 86.26 E-value: 2.28e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1104 IKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRvLLQEEGT 1183
Cdd:TIGR02168 679 IEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA-QLSKELT 757
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1184 RKREYENELAKVRNHYNEEMSNLR----------NKYETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEI 1253
Cdd:TIGR02168 758 ELEAEIEELEERLEEAEEELAEAEaeieeleaqiEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1254 VRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRsEDNARHKQSLEEAAKTIQDKNKEIERLKAE 1333
Cdd:TIGR02168 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLE-EALALLRSELEELSEELRELESKRSELRRE 916
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1334 FQEEAKRRWEYENELSKASNRIQESKNQCT-------QVVQERESLLV-KIKVLEQDKARLQRLEDELNRAKSTLEAETR 1405
Cdd:TIGR02168 917 LEELREKLAQLELRLEGLEVRIDNLQERLSeeysltlEEAEALENKIEdDEEEARRRLKRLENKIKELGPVNLAAIEEYE 996
|
330 340 350
....*....|....*....|....*....|....
gi 975830145 1406 -VKQRLECEKQQIqNDLNQWKTQYsrkEEAIRKI 1438
Cdd:TIGR02168 997 eLKERYDFLTAQK-EDLTEAKETL---EEAIEEI 1026
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1002-1499 |
2.49e-16 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 85.89 E-value: 2.49e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEE 1081
Cdd:PRK03918 180 RLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1082 LKRQAELDGKSAKQNLDKCYGQIKELNE---------KITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKEN 1152
Cdd:PRK03918 260 KIRELEERIEELKKEIEELEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEER 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1153 LGWQKLESEK------AIKEKEYEIERLRVLLQE-EGTRKREYENELAKVRNHYnEEMSNLRNKYETEINITKTTIKEIS 1225
Cdd:PRK03918 340 LEELKKKLKElekrleELEERHELYEEAKAKKEElERLKKRLTGLTPEKLEKEL-EELEKAKEEIEEEISKITARIGELK 418
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1226 MQKEDDSKNL----------------------RNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGS- 1282
Cdd:PRK03918 419 KEIKELKKAIeelkkakgkcpvcgrelteehrKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKl 498
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1283 -EIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAA-------KTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNR 1354
Cdd:PRK03918 499 kELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLiklkgeiKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1355 IQESKNQCTQVVQER----ESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSr 1430
Cdd:PRK03918 579 LEELGFESVEELEERlkelEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS- 657
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 975830145 1431 kEEAIRKIESEREksereknSLRSEIERLQAEIKRIEERcRRKLEDSTRETQSQLEtERSRYQREIDKL 1499
Cdd:PRK03918 658 -EEEYEELREEYL-------ELSRELAGLRAELEELEKR-REEIKKTLEKLKEELE-EREKAKKELEKL 716
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1128-1462 |
2.67e-15 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 82.68 E-value: 2.67e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1128 VEDRFDQQKNDYDQLQK-ARQCEKenlgWQKLESEKAIKEKEYEIERLRVLLQEegtrKREYENELAKVRNHYNEEMSNL 1206
Cdd:COG1196 191 LEDILGELERQLEPLERqAEKAER----YRELKEELKELEAELLLLKLRELEAE----LEELEAELEELEAELEELEAEL 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1207 RNKyETEINITKTTIKEismqkeddsknLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKAcgsEIMQ 1286
Cdd:COG1196 263 AEL-EAELEELRLELEE-----------LELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEE---ELAE 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1287 KKQHLEiELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVV 1366
Cdd:COG1196 328 LEEELE-ELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELE 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1367 QERESLLVKIKVLEQDKARLQ----RLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESER 1442
Cdd:COG1196 407 EAEEALLERLERLEEELEELEealaELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEEL 486
|
330 340
....*....|....*....|
gi 975830145 1443 EKSEREKNSLRSEIERLQAE 1462
Cdd:COG1196 487 AEAAARLLLLLEAEADYEGF 506
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1235-1502 |
3.33e-15 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 82.29 E-value: 3.33e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1235 LRNQLDRLSRE------NRDLKDEIVRLNDSILQATEQRRRAEENALQQKAcgseimqkkQHLEIELKQVMQQRSEDNAR 1308
Cdd:COG1196 198 LERQLEPLERQaekaerYRELKEELKELEAELLLLKLRELEAELEELEAEL---------EELEAELEELEAELAELEAE 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1309 hkqsLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLvkiKVLEQDKARLQR 1388
Cdd:COG1196 269 ----LEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELE---EELEELEEELEE 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1389 LEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEE 1468
Cdd:COG1196 342 LEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEE 421
|
250 260 270
....*....|....*....|....*....|....
gi 975830145 1469 RCRRKLEDSTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG1196 422 ELEELEEALAELEEEEEEEEEALEEAAEEEAELE 455
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2207-2245 |
3.77e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 71.20 E-value: 3.77e-15
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 2207 LLEAQACTGGIIHPTTGQKLSLQDAVSQGVIDQDMATRL 2245
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1848-1886 |
7.75e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 70.05 E-value: 7.75e-15
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 1848 LLEAQAATGFIVDPVSNLRLPVEEAYKRGLVGIEFKEKL 1886
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1924-1962 |
1.38e-14 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 69.28 E-value: 1.38e-14
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 1924 LLEAQIATGGIIDPKESHRLPVDIAYKRGYFNEELSEIL 1962
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1166-1501 |
1.42e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 80.10 E-value: 1.42e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1166 EKEYEIERLRVLLQEEGTRKREYENELAKVRnhynEEMSNLRNKYEteinitktTIKEISMQKEDDSKNLRNQLDRLSRE 1245
Cdd:TIGR02168 674 ERRREIEELEEKIEELEEKIAELEKALAELR----KELEELEEELE--------QLRKELEELSRQISALRKDLARLEAE 741
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1246 NRDLKDEIVRLNDSILQATEQRRRAEENalqqkacgseiMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNK 1325
Cdd:TIGR02168 742 VEQLEERIAQLSKELTELEAEIEELEER-----------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRA 810
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1326 EIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLlvkIKVLEQDKARLQRLEDELNRAKSTLEAETR 1405
Cdd:TIGR02168 811 ELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESL---AAEIEELEELIEELESELEALLNERASLEE 887
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1406 VKQRLECEKQQIQNDLNQWktqysrkEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTR---ETQ 1482
Cdd:TIGR02168 888 ALALLRSELEELSEELREL-------ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEeaeALE 960
|
330
....*....|....*....
gi 975830145 1483 SQLETERSRYQREIDKLRQ 1501
Cdd:TIGR02168 961 NKIEDDEEEARRRLKRLEN 979
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1159-1502 |
4.20e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 78.57 E-value: 4.20e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1159 ESEKA---IKEKEYEIERLRVLLQEegtrKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEddskNL 1235
Cdd:TIGR02169 171 KKEKAleeLEEVEENIERLDLIIDE----KRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKE----AI 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1236 RNQLDRLSRENRDLKDEIVRLNDSILQAtEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNArhKQSLEE 1315
Cdd:TIGR02169 243 ERQLASLEEELEKLTEEISELEKRLEEI-EQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK--ERELED 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1316 AAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKV----LEQDKARLQRLED 1391
Cdd:TIGR02169 320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAEtrdeLKDYREKLEKLKR 399
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1392 ELNRAKSTLEAETRVKQRLECEKQQIQNDLnqwktqySRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCR 1471
Cdd:TIGR02169 400 EINELKRELDRLQEELQRLSEELADLNAAI-------AGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
|
330 340 350
....*....|....*....|....*....|.
gi 975830145 1472 RkledsTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:TIGR02169 473 D-----LKEEYDRVEKELSKLQRELAEAEAQ 498
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1074-1471 |
4.53e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 78.57 E-value: 4.53e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1074 AKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKsvedrFDQQKNDYDQLQKARqcEKENL 1153
Cdd:TIGR02169 163 AGVAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQA-----LLKEKREYEGYELLK--EKEAL 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1154 GWQKLESEKAIKEKEYEIERLRVLLQEegtRKREYEnELAKVRNHYNEEmsnLRNKYETEINITKTTIKEISMQKEDDSK 1233
Cdd:TIGR02169 236 ERQKEAIERQLASLEEELEKLTEEISE---LEKRLE-EIEQLLEELNKK---IKDLGEEEQLRVKEKIGELEAEIASLER 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1234 NLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEiELKQVMQQRSEDNARHKQSL 1313
Cdd:TIGR02169 309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE-DLRAELEEVDKEFAETRDEL 387
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1314 EEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQdkarlqrledel 1393
Cdd:TIGR02169 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEW------------ 455
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 1394 nrakstleaetrvkqrlecEKQQIQNDLNQWKTQYSRKEEAIRKIESEREksereknSLRSEIERLQAEIKRIEERCR 1471
Cdd:TIGR02169 456 -------------------KLEQLAADLSKYEQELYDLKEEYDRVEKELS-------KLQRELAEAEAQARASEERVR 507
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1102-1498 |
8.82e-14 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 77.09 E-value: 8.82e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1102 GQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKE---YEIERLRVLL 1178
Cdd:pfam17380 240 KESFNLAEDVTTMTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEKMEQERLRQEKEekaREVERRRKLE 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1179 QEEGTRKREYENELAKVRNHynEEMSNLRNKyeteinitkttikeismqkeddsknlrnQLDRLSRENRDLKDEIVRLND 1258
Cdd:pfam17380 320 EAEKARQAEMDRQAAIYAEQ--ERMAMERER----------------------------ELERIRQEERKRELERIRQEE 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1259 SILQATEQRrraeenalqqkacgseimqkkqhleiELKQVMQQRSEDNARHKQSLEEAAK-TIQDKNKEiERLKAEFQEE 1337
Cdd:pfam17380 370 IAMEISRMR--------------------------ELERLQMERQQKNERVRQELEAARKvKILEEERQ-RKIQQQKVEM 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1338 AKRRWEYENELSKASNRIQESKnqctqvvqERESLLVKIKVLE--QDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQ 1415
Cdd:pfam17380 423 EQIRAEQEEARQREVRRLEEER--------AREMERVRLEEQErqQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRR 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1416 QIQNDLNQWKTQYSRKEEAIRKIesereksereknsLRSEIERLQAEIkrIEERCRRKLEDStRETQSQLEtERSRYQRE 1495
Cdd:pfam17380 495 KILEKELEERKQAMIEEERKRKL-------------LEKEMEERQKAI--YEEERRREAEEE-RRKQQEME-ERRRIQEQ 557
|
...
gi 975830145 1496 IDK 1498
Cdd:pfam17380 558 MRK 560
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1208-1542 |
9.81e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 77.40 E-value: 9.81e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1208 NKYETEINITKTtikEISMQK-EDDSKNLRNQLDRLSREN------RDLKDEI--VRLNDSILQATEQRRRAEENALQQK 1278
Cdd:TIGR02168 173 RRKETERKLERT---RENLDRlEDILNELERQLKSLERQAekaeryKELKAELreLELALLVLRLEELREELEELQEELK 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1279 ACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQsLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQES 1358
Cdd:TIGR02168 250 EAEEELEELTAELQELEEKLEELRLEVSELEEE-IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEEL 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1359 KNQctqvvqeresllvkikvLEQDKARLQRLEDELNRAKSTLEAEtrvKQRLECEKQQIQNDLNQWKTQysrkEEAIRKI 1438
Cdd:TIGR02168 329 ESK-----------------LDELAEELAELEEKLEELKEELESL---EAELEELEAELEELESRLEEL----EEQLETL 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1439 ESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQrpygSHRETQTECEWTV 1518
Cdd:TIGR02168 385 RSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEE----ELEELQEELERLE 460
|
330 340
....*....|....*....|....
gi 975830145 1519 DTSKLVFDGLRKKVTAMQLYECQL 1542
Cdd:TIGR02168 461 EALEELREELEEAEQALDAAEREL 484
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1064-1501 |
1.02e-13 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 76.98 E-value: 1.02e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1064 KYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQ 1143
Cdd:TIGR04523 30 KQDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKIN 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1144 KARQCEKENLGWQKLESEKAIKEKEyEIERLRVLLQEEGTRKreyENELAKVRNHYN------EEMSNLRNKYETEINIT 1217
Cdd:TIGR04523 110 SEIKNDKEQKNKLEVELNKLEKQKK-ENKKNIDKFLTEIKKK---EKELEKLNNKYNdlkkqkEELENELNLLEKEKLNI 185
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1218 KTTIKEISMQ-------------KEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEEnalqqkacgsEI 1284
Cdd:TIGR04523 186 QKNIDKIKNKllklelllsnlkkKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQT----------QL 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1285 MQKKQHLEIELKQVMQQRSEdnarhkqsLEEAAKTIQDKNKEIERLKAEFQE---EAKRRW--EYENELSKASNRIQESK 1359
Cdd:TIGR04523 256 NQLKDEQNKIKKQLSEKQKE--------LEQNNKKIKELEKQLNQLKSEISDlnnQKEQDWnkELKSELKNQEKKLEEIQ 327
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1360 NQCTQVVQERESLLVKIKVLEQDKARL----QRLEDELNRAKSTLEAETRVKQ-------RLECEKQQIQNDLNQWKTQY 1428
Cdd:TIGR04523 328 NQISQNNKIISQLNEQISQLKKELTNSesenSEKQRELEEKQNEIEKLKKENQsykqeikNLESQINDLESKIQNQEKLN 407
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1429 SRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEER--CRRKLEDSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:TIGR04523 408 QQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQdsVKELIIKNLDNTRESLETQLKVLSRSINKIKQ 482
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1033-1499 |
2.80e-13 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 75.98 E-value: 2.80e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1033 KIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQ-----AELDGKSAK-----QNLDKCY- 1101
Cdd:pfam01576 132 KIKKLEEDILLLEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKheamiSDLEERLKKeekgrQELEKAKr 211
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1102 ---GQIKELNEKITRLTYEIEdekrrrksvEDRFDQQKNDyDQLQKArQCEKENLGWQKLESEKAIKEKEYEIERLrvll 1178
Cdd:pfam01576 212 kleGESTDLQEQIAELQAQIA---------ELRAQLAKKE-EELQAA-LARLEEETAQKNNALKKIRELEAQISEL---- 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1179 QEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTiKEISMQKEDDSKNLRNQLDRLSRENR-DLKDEIVRLN 1257
Cdd:pfam01576 277 QEDLESERAARNKAEKQRRDLGEELEALKTELEDTLDTTAAQ-QELRSKREQEVTELKKALEEETRSHEaQLQEMRQKHT 355
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1258 DSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQsLEEAAKTIQDKNKEIERLKAEFQEE 1337
Cdd:pfam01576 356 QALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKK-LEGQLQELQARLSESERQRAELAEK 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1338 AKRrweYENELSKASNRIQESKNQCTQVVQERESL---LVKIKVLEQDKAR--------LQRLEDELNRAKSTLEAETRV 1406
Cdd:pfam01576 435 LSK---LQSELESVSSLLNEAEGKNIKLSKDVSSLesqLQDTQELLQEETRqklnlstrLRQLEDERNSLQEQLEEEEEA 511
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1407 KQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIE---ERCRRKLEDSTRE--T 1481
Cdd:pfam01576 512 KRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEktkNRLQQELDDLLVDldH 591
|
490
....*....|....*...
gi 975830145 1482 QSQLETERSRYQREIDKL 1499
Cdd:pfam01576 592 QRQLVSNLEKKQKKFDQM 609
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1002-1588 |
5.45e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 75.18 E-value: 5.45e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKnkfldqnlQKYQAECSQFKAKLASL-- 1079
Cdd:PTZ00121 1302 KKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA--------AADEAEAAEEKAEAAEKkk 1373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1080 EELKRQAELDGKSA--KQNLDKCYGQIKELNEKITRLTYEIEDEKR----RRKSVEDR-FDQQKNDYDQLQKARQCEKEN 1152
Cdd:PTZ00121 1374 EEAKKKADAAKKKAeeKKKADEAKKKAEEDKKKADELKKAAAAKKKadeaKKKAEEKKkADEAKKKAEEAKKADEAKKKA 1453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1153 LGWQKLESEKAIKEKEYEIERLRVLLQEEgtRKREyenELAKVRNHYNEEMSNLRNKYETeinitKTTIKEISMQKEDDS 1232
Cdd:PTZ00121 1454 EEAKKAEEAKKKAEEAKKADEAKKKAEEA--KKAD---EAKKKAEEAKKKADEAKKAAEA-----KKKADEAKKAEEAKK 1523
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1233 KNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQS 1312
Cdd:PTZ00121 1524 ADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1313 lEEAAKTIQDKNKEIERLKAEfqeEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDE 1392
Cdd:PTZ00121 1604 -EKKMKAEEAKKAEEAKIKAE---ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEE 1679
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1393 LNRAKSTLEAETRVKQRLECEKQQIQnDLNQWKTQYSRKEEAIRKIESEREKSEREKNSlRSEIERLQAEIKRIEERCRR 1472
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAE-ELKKKEAEEKKKAEELKKAEEENKIKAEEAKK-EAEEDKKKAEEAKKDEEEKK 1757
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1473 KLEDSTRETQSQLETERSRYQREIDK-LRQRPYGSHRETQTECEWTVDTSKLVFDGLRKKVTAMQlyECQLIDKTTLDKL 1551
Cdd:PTZ00121 1758 KIAHLKKEEEKKAEEIRKEKEAVIEEeLDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVIN--DSKEMEDSAIKEV 1835
|
570 580 590 600
....*....|....*....|....*....|....*....|..
gi 975830145 1552 LKGKKSVEEVASEIQPFLRGAGSIAGASA-----SPKEKYSL 1588
Cdd:PTZ00121 1836 ADSKNMQLEEADAFEKHKFNKNNENGEDGnkeadFNKEKDLK 1877
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1681-1719 |
9.90e-13 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 64.27 E-value: 9.90e-13
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 1681 LLEAQIASGGVVDPVNSVFLPKDVALARGLIDRDLYRSL 1719
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1021-1345 |
1.51e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.43 E-value: 1.51e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1021 LKSLED--------LKLKnTKIEVLEEELRLARDansencnknKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKS 1092
Cdd:COG1196 202 LEPLERqaekaeryRELK-EELKELEAELLLLKL---------RELEAELEELEAELEELEAELEELEAELAELEAELEE 271
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1093 AKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRfdqqkndydqlQKARQCEKENLGWQKLESEKAIKEKEYEIE 1172
Cdd:COG1196 272 LRLELEELELELEEAQAEEYELLAELARLEQDIARLEER-----------RRELEERLEELEEELAELEEELEELEEELE 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1173 RLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRnkyetEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDE 1252
Cdd:COG1196 341 ELEEELEEAEEELEEAEAELAEAEEALLEAEAELA-----EAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLER 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1253 IVRLNDSILQATEQRRRAEENALQQKacgsEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKA 1332
Cdd:COG1196 416 LERLEEELEELEEALAELEEEEEEEE----EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAA 491
|
330
....*....|...
gi 975830145 1333 EFQEEAKRRWEYE 1345
Cdd:COG1196 492 RLLLLLEAEADYE 504
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1605-1643 |
2.12e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.12 E-value: 2.12e-12
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 1605 LLEAQAATGGIIDPHRNEKLTVDSAIARDLIDFDDRQQI 1643
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
285-450 |
2.99e-12 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 68.24 E-value: 2.99e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 285 WINDCEEEELLYDWSDKNTNIAQKQEAFSIRMSQLEVKEKELNKLKQESDQLVLNQHPASDKIEAYMDTLQTQWSWILQI 364
Cdd:cd00176 15 WLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWEELREL 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 365 TKCIDVHLKENAAYFQFFEEAQSTEAYLKGLQDSIRKKYPCDKNMPLQHLLEQIKELEKErekILEYKRQVQNLVNKSKK 444
Cdd:cd00176 95 AEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEE---LEAHEPRLKSLNELAEE 171
|
....*.
gi 975830145 445 IVQLKP 450
Cdd:cd00176 172 LLEEGH 177
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1231-1502 |
3.68e-12 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 72.41 E-value: 3.68e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1231 DSKNLRNQLDRLSRENRDLKDEIvrlndSILQATEQRRRAEENALQQKACGSEimqkKQHLEIELKQvmQQRSEDNARHK 1310
Cdd:TIGR02169 668 FSRSEPAELQRLRERLEGLKREL-----SSLQSELRRIENRLDELSQELSDAS----RKIGEIEKEI--EQLEQEEEKLK 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1311 QSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYE---NELSKASNRIQESKNQctQVVQERESLLVKIKvleqdkARLQ 1387
Cdd:TIGR02169 737 ERLEELEEDLSSLEQEIENVKSELKELEARIEELEedlHKLEEALNDLEARLSH--SRIPEIQAELSKLE------EEVS 808
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1388 RLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREksereknSLRSEIERLQAEIKRIE 1467
Cdd:TIGR02169 809 RIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-------ELEEELEELEAALRDLE 881
|
250 260 270
....*....|....*....|....*....|....*...
gi 975830145 1468 ERcRRKLEDSTRETQSQLETERSRYQR---EIDKLRQR 1502
Cdd:TIGR02169 882 SR-LGDLKKERDELEAQLRELERKIEEleaQIEKKRKR 918
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
793-1537 |
4.97e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 72.01 E-value: 4.97e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 793 EEETVCLDL--DKVEAYRCGLKKIKNDLNLKKSLLATMKTELQKAQQIHSQTsqqyplyDLDLGKFGEKVTQLTDRWQ-- 868
Cdd:TIGR02168 221 ELRELELALlvLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL-------RLEVSELEEEIEELQKELYal 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 869 -----RIDKQI---DFRLWDLEKQIKQLRNYRDNYQafcKWLYDAKRRQDSLESMKFGDSNTVMRFLNEQKNLHSEISGK 940
Cdd:TIGR02168 294 aneisRLEQQKqilRERLANLERQLEELEAQLEELE---SKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 941 RDKSEEVQKIAELCANSIKDYELQLASYTSGLETLlnipikRTMIQSpsgviLQEAADVHARYIELLTRSGDyyrfLSEM 1020
Cdd:TIGR02168 371 ESRLEELEEQLETLRSKVAQLELQIASLNNEIERL------EARLER-----LEDRRERLQQEIEELLKKLE----EAEL 435
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1021 LKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKC 1100
Cdd:TIGR02168 436 KELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQ 515
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1101 ygqiKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDY--DQLQKARQC----EKENLGWQKLESEKAIKEKEYEIERL 1174
Cdd:TIGR02168 516 ----SGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVvvENLNAAKKAiaflKQNELGRVTFLPLDSIKGTEIQGNDR 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1175 RVLLQEEGTRKreYENELAKVRNHYNEEMSNLRNKYETEINItkttikeismqkeDDSKNLRNQLDRLSReNRDLKDEIV 1254
Cdd:TIGR02168 592 EILKNIEGFLG--VAKDLVKFDPKLRKALSYLLGGVLVVDDL-------------DNALELAKKLRPGYR-IVTLDGDLV 655
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1255 RLNDSI--------LQATEQRRRAEENALQQKACGSEIMQkkqhLEIELKQVMQQRSEDNarhkQSLEEAAKTIQDKNKE 1326
Cdd:TIGR02168 656 RPGGVItggsaktnSSILERRREIEELEEKIEELEEKIAE----LEKALAELRKELEELE----EELEQLRKELEELSRQ 727
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1327 IERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQ----RLEDELNRAKSTLEA 1402
Cdd:TIGR02168 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEaqieQLKEELKALREALDE 807
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1403 ETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKsereknsLRSEIERLQAEIkrieERCRRKLEDSTRETQ 1482
Cdd:TIGR02168 808 LRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEE-------LSEDIESLAAEI----EELEELIEELESELE 876
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1483 SQLEtERSRYQREIDKLRQRpYGSHRETQTECEWTVDTSKLVFDGLRKKVTAMQL 1537
Cdd:TIGR02168 877 ALLN-ERASLEEALALLRSE-LEELSEELRELESKRSELRRELEELREKLAQLEL 929
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1017-1502 |
8.50e-12 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 70.84 E-value: 8.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 LSEMLKSLE-DLKLKNTKIEVLEEELRLARDANSEncnknkfLDQNLQKY---QAECSQFKAKLASLEELKRQAELDGKS 1092
Cdd:PRK02224 204 LHERLNGLEsELAELDEEIERYEEQREQARETRDE-------ADEVLEEHeerREELETLEAEIEDLRETIAETEREREE 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1093 AKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQcekenlgwqklESEKAIKEKEYEIE 1172
Cdd:PRK02224 277 LAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLE-----------ECRVAAQAHNEEAE 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1173 RLR---VLLQEEGTRKREYENELAKvrnhyneEMSNLRNKYETEinitKTTIKEISMQKEDDSKNLRN---QLDRLSREN 1246
Cdd:PRK02224 346 SLRedaDDLEERAEELREEAAELES-------ELEEAREAVEDR----REEIEELEEEIEELRERFGDapvDLGNAEDFL 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1247 RDLKDEIVRLNDSI--LQAT---EQRRRAEENALQQKA----CGSEI------------MQKKQHLEIELKQVMQQRSED 1305
Cdd:PRK02224 415 EELREERDELREREaeLEATlrtARERVEEAEALLEAGkcpeCGQPVegsphvetieedRERVEELEAELEDLEEEVEEV 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1306 NARHKQ--SLEEAAK---TIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRI----QESKNQCTQVVQERESLLVKI 1376
Cdd:PRK02224 495 EERLERaeDLVEAEDrieRLEERREDLEELIAERRETIEEKRERAEELRERAAELeaeaEEKREAAAEAEEEAEEAREEV 574
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1377 KVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLEC--EKQQIQNDLN---------------------------QWKTQ 1427
Cdd:PRK02224 575 AELNSKLAELKERIESLERIRTLLAAIADAEDEIERlrEKREALAELNderrerlaekrerkreleaefdearieEARED 654
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1428 YSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEE-RCRRKLEDSTRET----QSQLETERSRYQREIDKLRQR 1502
Cdd:PRK02224 655 KERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEElRERREALENRVEAlealYDEAEELESMYGDLRAELRQR 734
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1079-1498 |
1.39e-11 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 69.80 E-value: 1.39e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1079 LEELKRQAELDGKSAKQNLDKCYGQIKELNEKIT----------RLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQC 1148
Cdd:COG4717 48 LERLEKEADELFKPQGRKPELNLKELKELEEELKeaeekeeeyaELQEELEELEEELEELEAELEELREELEKLEKLLQL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1149 EKENLGWQKLESEKAIKEKEYE-IERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQ 1227
Cdd:COG4717 128 LPLYQELEALEAELAELPERLEeLEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQR 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1228 KEddskNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQ------------------------------ 1277
Cdd:COG4717 208 LA----ELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLiaaallallglggsllsliltiagvlflvl 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1278 --KACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRI 1355
Cdd:COG4717 284 glLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEEL 363
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1356 QESKNQctqvvQERESLLVKIKV--------LEQDKARLQRLEDELNRAKSTLEAETRvkqrlECEKQQIQNDLNQWKTQ 1427
Cdd:COG4717 364 QLEELE-----QEIAALLAEAGVedeeelraALEQAEEYQELKEELEELEEQLEELLG-----ELEELLEALDEEELEEE 433
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 975830145 1428 YSRKEEAIRKIESERekserekNSLRSEIERLQAEIKRIEErcrrklEDSTRETQSQLETERSRYQREIDK 1498
Cdd:COG4717 434 LEELEEELEELEEEL-------EELREELAELEAELEQLEE------DGELAELLQELEELKAELRELAEE 491
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
855-1466 |
1.41e-11 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 70.85 E-value: 1.41e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 855 KFGEKVTQLTDRWQRIDKQIDFRLWDLEKQIKQLRNYRDNYQAFCKWLYDAKRR---QDSLESMKFGDSNTVMRF----- 926
Cdd:TIGR01612 1108 KYADEINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEIKAQINDLEDVADKaisNDDPEEIEKKIENIVTKIdkkkn 1187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 927 -LNEQKNLHSEISG-KRDKS--EEVQKIAELCANSI------------KDYELQLASYTSGLETLLNIPIKRTMIQSPSG 990
Cdd:TIGR01612 1188 iYDEIKKLLNEIAEiEKDKTslEEVKGINLSYGKNLgklflekideekKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMG 1267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 991 VILQEAADVHARYIELlTRSGDYYRFLSEMLKSLEDLKLKNTKIEvleeelrlarDANSENCNKN---KFLDQNL---QK 1064
Cdd:TIGR01612 1268 IEMDIKAEMETFNISH-DDDKDHHIISKKHDENISDIREKSLKII----------EDFSEESDINdikKELQKNLldaQK 1336
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1065 YQAECSQFKAKLASLEELKRQaeldgksakQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNdydqlqk 1144
Cdd:TIGR01612 1337 HNSDINLYLNEIANIYNILKL---------NKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDIN------- 1400
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1145 ARQCEkenlgwQKLESEKAIKEKEYEIERLRVL----LQEEGTRKREYENElakvrNHYNEEMSNLRNKYETEINITKTT 1220
Cdd:TIGR01612 1401 LEECK------SKIESTLDDKDIDECIKKIKELknhiLSEESNIDTYFKNA-----DENNENVLLLFKNIEMADNKSQHI 1469
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1221 IKeisMQKEDDSKNLRNQLDRLsRENRDLKDEIVRLNDSILQATEQRRRAEEnalQQKACGSEIMQKkqHLEIELKQVMQ 1300
Cdd:TIGR01612 1470 LK---IKKDNATNDHDFNINEL-KEHIDKSKGCKDEADKNAKAIEKNKELFE---QYKKDVTELLNK--YSALAIKNKFA 1540
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1301 QRSEDNAR--------HKQSLEEAAKTIQdKNKEIERLKAEFQEEAKRRWE--------------YENELSKASNrIQES 1358
Cdd:TIGR01612 1541 KTKKDSEIiikeikdaHKKFILEAEKSEQ-KIKEIKKEKFRIEDDAAKNDKsnkaaidiqlslenFENKFLKISD-IKKK 1618
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1359 KNQCtqvVQERESLLVKIKVLEQD--KARLQRLEDELNRAKSTLEAETRVKQRLECEKQQ----------IQNDLNQWKT 1426
Cdd:TIGR01612 1619 INDC---LKETESIEKKISSFSIDsqDTELKENGDNLNSLQEFLESLKDQKKNIEDKKKEldeldseiekIEIDVDQHKK 1695
|
650 660 670 680
....*....|....*....|....*....|....*....|
gi 975830145 1427 QYSrkeeaIRKIESEREKSEREKNSLRSEIERLQAEIKRI 1466
Cdd:TIGR01612 1696 NYE-----IGIIEKIKEIAIANKEEIESIKELIEPTIENL 1730
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1002-1523 |
5.92e-11 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 68.56 E-value: 5.92e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDY--YRFLSEMLKSLEDLKLKNTKIEVLEEEL----RLARDANSENCNKNKFLDQNLQKYQA----ECSQ 1071
Cdd:TIGR02169 212 RYQALLKEKREYegYELLKEKEALERQKEAIERQLASLEEELekltEEISELEKRLEEIEQLLEELNKKIKDlgeeEQLR 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1072 FKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITR-------LTYEIEDEKRRRKSVEDRFDQQKNDYDQL-Q 1143
Cdd:TIGR02169 292 VKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKllaeieeLEREIEEERKRRDKLTEEYAELKEELEDLrA 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1144 KARQCEKENLGW-QKLESEK----AIKEKEYEIERLRVLLQEEGTRKR----EYENELAKVRNHYNEEMSNLRNKYEtEI 1214
Cdd:TIGR02169 372 ELEEVDKEFAETrDELKDYRekleKLKREINELKRELDRLQEELQRLSeelaDLNAAIAGIEAKINELEEEKEDKAL-EI 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1215 NITKTTIKEISMQKEDDSK---NLRNQLDRLSRENRDLKDEIVRLN---DSILQATEQRRRAEE---------------- 1272
Cdd:TIGR02169 451 KKQEWKLEQLAADLSKYEQelyDLKEEYDRVEKELSKLQRELAEAEaqaRASEERVRGGRAVEEvlkasiqgvhgtvaql 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1273 --------------------------------------------------NALQQKACGSEIMQKKQHLEIEL------- 1295
Cdd:TIGR02169 531 gsvgeryataievaagnrlnnvvveddavakeaiellkrrkagratflplNKMRDERRDLSILSEDGVIGFAVdlvefdp 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1296 ------KQVMQ------------------------------------------QRSEDNARHKQSLEEAAKTIQDKNKEI 1327
Cdd:TIGR02169 611 kyepafKYVFGdtlvvedieaarrlmgkyrmvtlegelfeksgamtggsraprGGILFSRSEPAELQRLRERLEGLKREL 690
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1328 ERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQER-------ESLLVKIKVLEQ----DKARLQRLEDELNRA 1396
Cdd:TIGR02169 691 SSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEeklkerlEELEEDLSSLEQeienVKSELKELEARIEEL 770
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1397 KSTL----EAETRVKQRLECEK-QQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIK----RIE 1467
Cdd:TIGR02169 771 EEDLhkleEALNDLEARLSHSRiPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIdlkeQIK 850
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 1468 ERCR---------RKLEDSTRETQSQLETERSRY---QREIDKLRQRpYGSHRETQTECEWTVDTSKL 1523
Cdd:TIGR02169 851 SIEKeienlngkkEELEEELEELEAALRDLESRLgdlKKERDELEAQ-LRELERKIEELEAQIEKKRK 917
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
928-1416 |
6.84e-11 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 67.82 E-value: 6.84e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 928 NEQKNLHSEISGKRDKSEEVQKIAELCANSIKDYELQLASYTSGLETLlNIPIKRTMIQSPSgviLQEAADVHARYIELL 1007
Cdd:pfam05483 254 NKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDI-KMSLQRSMSTQKA---LEEDLQIATKTICQL 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1008 TRSGDYYRFLSEMLKSLEDLKLKNTKIEV--LEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKA----KLASLEE 1081
Cdd:pfam05483 330 TEEKEAQMEELNKAKAAHSFVVTEFEATTcsLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKfknnKEVELEE 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1082 LKRQAELDGK--SAKQNLDKCYGQIKELNEKITRL-------TYEIEDEKRRRKSVEDRFDQQKNDYD-QLQKARQCEKE 1151
Cdd:pfam05483 410 LKKILAEDEKllDEKKQFEKIAEELKGKEQELIFLlqarekeIHDLEIQLTAIKTSEEHYLKEVEDLKtELEKEKLKNIE 489
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 ---NLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMsNLRNKYETEINITKTTIKEISMQK 1228
Cdd:pfam05483 490 ltaHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEM-NLRDELESVREEFIQKGDEVKCKL 568
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1229 EDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEE----NALQQKACGSEIMQ------KKQHLEIELKQV 1298
Cdd:pfam05483 569 DKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEElhqeNKALKKKGSAENKQlnayeiKVNKLELELASA 648
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1299 MQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEY----ENELSKASNRIQESKNQCTQVVQERESLLV 1374
Cdd:pfam05483 649 KQKFEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKLQKEIdkrcQHKIAEMVALMEKHKHQYDKIIEERDSELG 728
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|
gi 975830145 1375 KIKVLEQDKA--------RLQRLEDELNRAKSTLEAETRVKQRLECEKQQ 1416
Cdd:pfam05483 729 LYKNKEQEQSsakaaleiELSNIKAELLSLKKQLEIEKEEKEKLKMEAKE 778
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1118-1489 |
9.92e-11 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 67.40 E-value: 9.92e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1118 IEDEKRRRKSVED--RFDQQKNDYDQLQ------KARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYE 1189
Cdd:PRK03918 141 LESDESREKVVRQilGLDDYENAYKNLGevikeiKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELR 220
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1190 NELAKVRNHYnEEMSNLRNKYEtEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRR 1269
Cdd:PRK03918 221 EELEKLEKEV-KELEELKEEIE-ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKL 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1270 AEENalqqkacgSEIMQKKQHLEIELkqvmqqrsednarhkQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELS 1349
Cdd:PRK03918 299 SEFY--------EEYLDELREIEKRL---------------SRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLE 355
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1350 KASNRIQESkNQCTQVVQERESLLVKIKVLEQDKarLQRLEDELNRAKSTLEAETRvkqRLECEKQQIQNDLNQWKTQYS 1429
Cdd:PRK03918 356 ELEERHELY-EEAKAKKEELERLKKRLTGLTPEK--LEKELEELEKAKEEIEEEIS---KITARIGELKKEIKELKKAIE 429
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1430 RKEEAIRKI-----ESEREKSEREKNSLRSEIERLQAEIKRIEERcRRKLEDSTRETQSQLETER 1489
Cdd:PRK03918 430 ELKKAKGKCpvcgrELTEEHRKELLEEYTAELKRIEKELKEIEEK-ERKLRKELRELEKVLKKES 493
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
995-1502 |
1.79e-10 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 66.86 E-value: 1.79e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 995 EAADVHARYIELLtrsgDYYRFLSEMLKSLEDLKlknTKIEVLE--EELRLARDANSENCNKNKFLDQNLQKYQA----- 1067
Cdd:COG4913 219 EEPDTFEAADALV----EHFDDLERAHEALEDAR---EQIELLEpiRELAERYAAARERLAELEYLRAALRLWFAqrrle 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1068 ----ECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKEL-NEKITRLTYEIEDEKRRRKSVEDRFDQQKN----- 1137
Cdd:COG4913 292 lleaELEELRAELARLEAELERLEARLDALREELDELEAQIRGNgGDRLEQLEREIERLERELEERERRRARLEAllaal 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1138 ---------DYDQLQKARQCEKENLGwqklESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRN 1208
Cdd:COG4913 372 glplpasaeEFAALRAEAAALLEALE----EELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRD 447
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1209 KYETEINITKTTIK----EISMQKEDDS------KNLRNQ----------LDRLSR--ENRDLKDEIVRLNDSILQATEQ 1266
Cdd:COG4913 448 ALAEALGLDEAELPfvgeLIEVRPEEERwrgaieRVLGGFaltllvppehYAAALRwvNRLHLRGRLVYERVRTGLPDPE 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1267 RRRAEENALQQKacgseimqkkqhLEIE-------LKQVMQQRS--------EDNARHKQSL------------------ 1313
Cdd:COG4913 528 RPRLDPDSLAGK------------LDFKphpfrawLEAELGRRFdyvcvdspEELRRHPRAItragqvkgngtrhekddr 595
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1314 ----------EEAAKTIQDKNKEIERLKAEFQEEAKRRweyeNELSKASNRIQESKNQCTQVVQ-------------ERE 1370
Cdd:COG4913 596 rrirsryvlgFDNRAKLAALEAELAELEEELAEAEERL----EALEAELDALQERREALQRLAEyswdeidvasaerEIA 671
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1371 SLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSerekn 1450
Cdd:COG4913 672 ELEAELERLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLE----- 746
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1451 sLRSEIERLQAEIkrIEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG4913 747 -LRALLEERFAAA--LGDAVERELRENLEERIDALRARLNRAEEELERAMRA 795
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1017-1565 |
2.76e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 65.86 E-value: 2.76e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 LSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAEL-------- 1088
Cdd:PRK03918 223 LEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEyiklsefy 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1089 -DGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQK-ARQCEKENLGWQKLESEKAiKE 1166
Cdd:PRK03918 303 eEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEErHELYEEAKAKKEELERLKK-RL 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1167 KEYEIERLRVLLQEEGTRKREYENELAKVRnhynEEMSNLRNkyetEINITKTTIKEISMQK-----------EDDSKNL 1235
Cdd:PRK03918 382 TGLTPEKLEKELEELEKAKEEIEEEISKIT----ARIGELKK----EIKELKKAIEELKKAKgkcpvcgreltEEHRKEL 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1236 rnqLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGS--EIMQKKQHLEIELKQVMQQRSEDNARHKQSL 1313
Cdd:PRK03918 454 ---LEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKlkELAEQLKELEEKLKKYNLEELEKKAEEYEKL 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1314 EEAAKTIQdknKEIERLKaefqEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLlvKIKVLEQDKARLQRLEDEL 1393
Cdd:PRK03918 531 KEKLIKLK---GEIKSLK----KELEKLEELKKKLAELEKKLDELEEELAELLKELEEL--GFESVEELEERLKELEPFY 601
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1394 NRAKSTLEAETRVKQRLECEKqqiqndlnqwktqysrkeeairkiesereKSEREKNSLRSEIERLQAEIKRIEERCRRK 1473
Cdd:PRK03918 602 NEYLELKDAEKELEREEKELK-----------------------------KLEEELDKAFEELAETEKRLEELRKELEEL 652
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1474 LEDSTRETQSQLETERSRYQREIDKLRQRPYG--SHREtqtECEWTVDTSKLVFDGLRKKVTAMQLYECQLIDKTTL-DK 1550
Cdd:PRK03918 653 EKKYSEEEYEELREEYLELSRELAGLRAELEEleKRRE---EIKKTLEKLKEELEEREKAKKELEKLEKALERVEELrEK 729
|
570 580
....*....|....*....|...
gi 975830145 1551 LLKGK--------KSVEEVASEI 1565
Cdd:PRK03918 730 VKKYKallkeralSKVGEIASEI 752
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
993-1415 |
3.00e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 65.56 E-value: 3.00e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 993 LQEAADVHARYIELLTRsgdyyrfLSEMLKSLEDLKLKNTKIEVLEEELRLARDAnSENCNKNKFLDQNLQKYQAECSQF 1072
Cdd:COG4717 80 LKEAEEKEEEYAELQEE-------LEELEEELEELEAELEELREELEKLEKLLQL-LPLYQELEALEAELAELPERLEEL 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1073 KAKLASLEELKRQAEldgkSAKQNLDKCYGQIKELNEKITRLTY-EIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKE 1151
Cdd:COG4717 152 EERLEELRELEEELE----ELEAELAELQEELEELLEQLSLATEeELQDLAEELEELQQRLAELEEELEEAQEELEELEE 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 NLgwQKLESEKAIKEKEYEIERLRVLLQEEGTR------------KREYENELAKVRNHYNEEMSNLRNKYETEINITKT 1219
Cdd:COG4717 228 EL--EQLENELEAAALEERLKEARLLLLIAAALlallglggsllsLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAE 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1220 TIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEEnaLQQKAcgseimqKKQHLEIELKQVM 1299
Cdd:COG4717 306 ELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEE--LEEEL-------QLEELEQEIAALL 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1300 QQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEY---------ENELSKASNRIQESKNQCTQVVQERE 1370
Cdd:COG4717 377 AEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELlealdeeelEEELEELEEELEELEEELEELREELA 456
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 1371 SLLVKIKVLEQD-------------KARLQRLEDELNRAKSTLEAETRVKQRLECEKQ 1415
Cdd:COG4717 457 ELEAELEQLEEDgelaellqeleelKAELRELAEEWAALKLALELLEEAREEYREERL 514
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1005-1368 |
3.08e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 65.85 E-value: 3.08e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1005 ELLTRSGDYYRFLSEMLKSLEDLKlknTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKR 1084
Cdd:TIGR02168 716 QLRKELEELSRQISALRKDLARLE---AEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIE 792
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1085 QAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDqqkndydqlqkarqcekenlgwqklESEKAI 1164
Cdd:TIGR02168 793 QLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE-------------------------DLEEQI 847
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1165 KEKEYEIERLRVLLQEEGTRKREYENELakvrnhynEEMSNLRNKYETEInitkttikeisMQKEDDSKNLRNQLDRLSR 1244
Cdd:TIGR02168 848 EELSEDIESLAAEIEELEELIEELESEL--------EALLNERASLEEAL-----------ALLRSELEELSEELRELES 908
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1245 ENRDLKDEIVRLNDSI--LQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMqqrSEDNARHKQ-SLEEAAKTIQ 1321
Cdd:TIGR02168 909 KRSELRRELEELREKLaqLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIED---DEEEARRRLkRLENKIKELG 985
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 975830145 1322 DKNKE-IErlkaEFQEEAKRRweyeNELSKASNRIQESKNQCTQVVQE 1368
Cdd:TIGR02168 986 PVNLAaIE----EYEELKERY----DFLTAQKEDLTEAKETLEEAIEE 1025
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1104-1509 |
3.39e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 65.86 E-value: 3.39e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1104 IKELNEKITRLTYEI---EDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGwqKLESE-KAIKEKEYEIERLRVLLQ 1179
Cdd:PRK03918 171 IKEIKRRIERLEKFIkrtENIEELIKEKEKELEEVLREINEISSELPELREELE--KLEKEvKELEELKEEIEELEKELE 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1180 EEGTRKREYE---NELAKVRNHYNEEMSNLRNKYE--TEINITKTTIKEISMQKEDDSKNLRN---QLDRLSRENRDLKD 1251
Cdd:PRK03918 249 SLEGSKRKLEekiRELEERIEELKKEIEELEEKVKelKELKEKAEEYIKLSEFYEEYLDELREiekRLSRLEEEINGIEE 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1252 EIVRLNDSILQATEQRRRAEE-----NALQQKA-CGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNK 1325
Cdd:PRK03918 329 RIKELEEKEERLEELKKKLKElekrlEELEERHeLYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1326 EIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQvvQERESLLVK----IKVLEQDKARLQRLEDELNRAKSTLE 1401
Cdd:PRK03918 409 KITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTE--EHRKELLEEytaeLKRIEKELKEIEEKERKLRKELRELE 486
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1402 AETRVKQRLECEKQ------QIQNDLNQW-KTQYSRKEEAIRKIesereksEREKNSLRSEIERLQAEIKRIEErcrrkL 1474
Cdd:PRK03918 487 KVLKKESELIKLKElaeqlkELEEKLKKYnLEELEKKAEEYEKL-------KEKLIKLKGEIKSLKKELEKLEE-----L 554
|
410 420 430
....*....|....*....|....*....|....*
gi 975830145 1475 EDSTRETQSQLETERSRYQREIDKLRQRPYGSHRE 1509
Cdd:PRK03918 555 KKKLAELEKKLDELEEELAELLKELEELGFESVEE 589
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2283-2321 |
4.19e-10 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 56.57 E-value: 4.19e-10
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 2283 FLEFQYLTGGLVDPEVHGRISTEEAIRKGFIDGRAAQRL 2321
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1032-1495 |
4.38e-10 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 65.38 E-value: 4.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1032 TKIEVLEEELRLARDANSENCNKNKFLDQNL------QKYQAECSQFKAKLASLEEL-----KRQAELDGKSAKQNLDKC 1100
Cdd:TIGR00618 219 ERKQVLEKELKHLREALQQTQQSHAYLTQKReaqeeqLKKQQLLKQLRARIEELRAQeavleETQERINRARKAAPLAAH 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1101 YGQIKELNEKITRLTYEIEDEKRRRKS--------VEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIE 1172
Cdd:TIGR00618 299 IKAVTQIEQQAQRIHTELQSKMRSRAKllmkraahVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLT 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1173 RLRVLLQEEGTRKREYENELAKVRNHYNEEMSN---------------LRNKYETEINITKTTIKEISMQKEDDSKNLRN 1237
Cdd:TIGR00618 379 QHIHTLQQQKTTLTQKLQSLCKELDILQREQATidtrtsafrdlqgqlAHAKKQQELQQRYAELCAAAITCTAQCEKLEK 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1238 QLDRLSRENRDLKDEIVRLNDSILQaTEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAA 1317
Cdd:TIGR00618 459 IHLQESAQSLKEREQQLQTKEQIHL-QETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTY 537
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1318 KTIQdknKEIERLKAEFQEEAKRRWEYENElskasnrIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAK 1397
Cdd:TIGR00618 538 AQLE---TSEEDVYHQLTSERKQRASLKEQ-------MQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAE 607
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1398 STLEAETRV----------KQRLECEKQQIQNDLNQWKTQ---------YSRKEEAIRKIESEREKSEREKNSLRSEIER 1458
Cdd:TIGR00618 608 DMLACEQHAllrklqpeqdLQDVRLHLQQCSQELALKLTAlhalqltltQERVREHALSIRVLPKELLASRQLALQKMQS 687
|
490 500 510
....*....|....*....|....*....|....*..
gi 975830145 1459 LQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQRE 1495
Cdd:TIGR00618 688 EKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIE 724
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2205-2242 |
4.75e-10 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 56.72 E-value: 4.75e-10
10 20 30
....*....|....*....|....*....|....*...
gi 975830145 2205 QRLLEAQACTGGIIHPTTGQKLSLQDAVSQGVIDQDMA 2242
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1062-1613 |
5.20e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 65.14 E-value: 5.20e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1062 LQKYQAECSQFKAKL----ASLEELKRQAELDGKSA-KQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFD--- 1133
Cdd:pfam15921 226 LRELDTEISYLKGRIfpveDQLEALKSESQNKIELLlQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEiiq 305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1134 QQKNDYDQLQKARQCEKEN----LGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYenelakvRNHYNEEMSNLRNK 1209
Cdd:pfam15921 306 EQARNQNSMYMRQLSDLEStvsqLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE-------RDQFSQESGNLDDQ 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1210 YETEINITKTTIKEISMQKEDDSK----NLRNQL--DRLSRENRDLKDEIVRLnDSILQA--TEQRRRAEEN--ALQQKa 1279
Cdd:pfam15921 379 LQKLLADLHKREKELSLEKEQNKRlwdrDTGNSItiDHLRRELDDRNMEVQRL-EALLKAmkSECQGQMERQmaAIQGK- 456
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1280 cgSEIMQKKQHLEIEL---KQVMQQRSEDNARHKQSLEEAAKTIQDknkeierLKAEFQEEAKRRWEYENELSKASNRIQ 1356
Cdd:pfam15921 457 --NESLEKVSSLTAQLestKEMLRKVVEELTAKKMTLESSERTVSD-------LTASLQEKERAIEATNAEITKLRSRVD 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1357 ESKNQCTQVVQERESL--------LVKIKVLEQDKArLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLN------ 1422
Cdd:pfam15921 528 LKLQELQHLKNEGDHLrnvqteceALKLQMAEKDKV-IEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINdrrlel 606
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1423 -QWKTQYSRKEEAIRKIESEREKSEREKNSL-RSEIERLQAeIKRIEERcRRKLEDSTRETQSQLETERSRYQ------- 1493
Cdd:pfam15921 607 qEFKILKDKKDAKIRELEARVSDLELEKVKLvNAGSERLRA-VKDIKQE-RDQLLNEVKTSRNELNSLSEDYEvlkrnfr 684
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1494 ---REIDKLRQRPYGSHRETQTECEWTVDTSKLV--FDGLRKKVT-AMQlyecqlidkttlDKLLKGKKSVEEVASEIQp 1567
Cdd:pfam15921 685 nksEEMETTTNKLKMQLKSAQSELEQTRNTLKSMegSDGHAMKVAmGMQ------------KQITAKRGQIDALQSKIQ- 751
|
570 580 590 600
....*....|....*....|....*....|....*....|....*.
gi 975830145 1568 FLRGAGSIAGasaspKEKYSLVEAKRKklISPESTVMLLEAQAATG 1613
Cdd:pfam15921 752 FLEEAMTNAN-----KEKHFLKEEKNK--LSQELSTVATEKNKMAG 790
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1083-1387 |
6.30e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 65.08 E-value: 6.30e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1083 KRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIED-EKRRRKSVE----------DRFDQQKNDYDQLQKarqcEKE 1151
Cdd:TIGR02168 174 RKETERKLERTRENLDRLEDILNELERQLKSLERQAEKaERYKELKAElrelelallvLRLEELREELEELQE----ELK 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 NLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRN----------HYNEEMSNLRNKYET--------- 1212
Cdd:TIGR02168 250 EAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANeisrleqqkqILRERLANLERQLEEleaqleele 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1213 -EINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHL 1291
Cdd:TIGR02168 330 sKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1292 EIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRweyENELSKASNRIQESKNQCTQVVQERES 1371
Cdd:TIGR02168 410 ERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERL---EEALEELREELEEAEQALDAAERELAQ 486
|
330
....*....|....*.
gi 975830145 1372 LLVKIKVLEQDKARLQ 1387
Cdd:TIGR02168 487 LQARLDSLERLQENLE 502
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1018-1549 |
7.39e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 64.78 E-value: 7.39e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1018 SEMLKSLEDLKLKntkievlEEELRLARDANSENCNKNKFLDQnlQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNL 1097
Cdd:PTZ00121 1298 AEEKKKADEAKKK-------AEEAKKADEAKKKAEEAKKKADA--AKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1098 DKCygQIKELNEKITRLTYEIEdEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKE---KEYEIERL 1174
Cdd:PTZ00121 1369 AEK--KKEEAKKKADAAKKKAE-EKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADeakKKAEEAKK 1445
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1175 RVLLQEEGTRKREYENelAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRdlKDEIV 1254
Cdd:PTZ00121 1446 ADEAKKKAEEAKKAEE--AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK--KAEEA 1521
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1255 RLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEF 1334
Cdd:PTZ00121 1522 KKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLY 1601
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1335 QEEAKRRWEyenELSKAsnriQESKNQCTQVVQERE-----SLLVKIKVLEQDKARLQRLEDELNRAKSTLEAetrvkqR 1409
Cdd:PTZ00121 1602 EEEKKMKAE---EAKKA----EEAKIKAEELKKAEEekkkvEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA------K 1668
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1410 LECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQaEIKRIEERCRRKLEDSTRETQSQLETER 1489
Cdd:PTZ00121 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE-ELKKAEEENKIKAEEAKKEAEEDKKKAE 1747
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1490 SRYQREIDKLRQRPYGSHRETQTEcEWTVDTSKLVFDGLRKKVTAMQLYecqlIDKTTLD 1549
Cdd:PTZ00121 1748 EAKKDEEEKKKIAHLKKEEEKKAE-EIRKEKEAVIEEELDEEDEKRRME----VDKKIKD 1802
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1222-1513 |
1.18e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 63.99 E-value: 1.18e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1222 KEISMQKEDDsKNLRNQLDRLSRENRDLKDEIVRLNDsiLQATEQRRRAEENalQQKACGSEIMQKKQHLEIELKQVMQQ 1301
Cdd:pfam17380 282 KAVSERQQQE-KFEKMEQERLRQEKEEKAREVERRRK--LEEAEKARQAEMD--RQAAIYAEQERMAMERERELERIRQE 356
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1302 -RSEDNARHKQslEEAAKTIQdKNKEIERLKAEFQEEAKR-RWEYEnelSKASNRIQESKNQCTQVVQERESLLVKIKVL 1379
Cdd:pfam17380 357 eRKRELERIRQ--EEIAMEIS-RMRELERLQMERQQKNERvRQELE---AARKVKILEEERQRKIQQQKVEMEQIRAEQE 430
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1380 EQDKARLQRLEDELNRakstlEAEtRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIE-R 1458
Cdd:pfam17380 431 EARQREVRRLEEERAR-----EME-RVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEeR 504
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1459 LQAEIKriEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQRPYGSHRETQTE 1513
Cdd:pfam17380 505 KQAMIE--EERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQ 557
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1028-1502 |
1.49e-09 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 63.25 E-value: 1.49e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1028 KLKNTKIEVLEEELRLARDANSENcnknKFLDQNLQKYQAECSQFKAKLASLEElkrqaELDGKSAKQNLDKCYGQIKEL 1107
Cdd:COG4717 67 ELNLKELKELEEELKEAEEKEEEY----AELQEELEELEEELEELEAELEELRE-----ELEKLEKLLQLLPLYQELEAL 137
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1108 NEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRvllqeegTRKRE 1187
Cdd:COG4717 138 EAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQ-------QRLAE 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1188 YENELAKVRnhynEEMSNLRNKYEteinitkttikeiSMQKEDDSKNLRNQLDRLSRenrdlkdeIVRLNDSILQATEQR 1267
Cdd:COG4717 211 LEEELEEAQ----EELEELEEELE-------------QLENELEAAALEERLKEARL--------LLLIAAALLALLGLG 265
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1268 RRAEENALQQK----------ACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEE 1337
Cdd:COG4717 266 GSLLSLILTIAgvlflvlgllALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDR 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1338 AKRRWEYENELSKASNRIQESKNQctqvvQERESLLVKIKV--------LEQDKARLQRLEDELNRAKSTLEAETRvkqr 1409
Cdd:COG4717 346 IEELQELLREAEELEEELQLEELE-----QEIAALLAEAGVedeeelraALEQAEEYQELKEELEELEEQLEELLG---- 416
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1410 lECEKQQIQNDLNQWKTQYSRKEEAIrkieserekserekNSLRSEIERLQAEIKRIEERcRRKLEDStrETQSQLETER 1489
Cdd:COG4717 417 -ELEELLEALDEEELEEELEELEEEL--------------EELEEELEELREELAELEAE-LEQLEED--GELAELLQEL 478
|
490
....*....|...
gi 975830145 1490 SRYQREIDKLRQR 1502
Cdd:COG4717 479 EELKAELRELAEE 491
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1028-1566 |
2.90e-09 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 62.68 E-value: 2.90e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1028 KLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKEL 1107
Cdd:pfam02463 212 YYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLL 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1108 NEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKA---RQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTR 1184
Cdd:pfam02463 292 AKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKElkkEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQL 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1185 KREYENELAKVRNHYNEEMSnLRNKYETEINITK---TTIKEISMQKEDDSKNLRNQLDRLSRENRDLKdEIVRLNDSIL 1261
Cdd:pfam02463 372 EEELLAKKKLESERLSSAAK-LKEEELELKSEEEkeaQLLLELARQLEDLLKEEKKEELEILEEEEESI-ELKQGKLTEE 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1262 QATEQRRRAEENALQQKAcgseimQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRR 1341
Cdd:pfam02463 450 KEELEKQELKLLKDELEL------KKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGR 523
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1342 WEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKarLQRLEDELNRAKSTLEAETRVKQrleCEKQQIQNDL 1421
Cdd:pfam02463 524 IISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQK--LVRALTELPLGARKLRLLIPKLK---LPLKSIAVLE 598
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1422 NQWKTQYSRKEEAIRKIesereKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:pfam02463 599 IDPILNLAQLDKATLEA-----DEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTK 673
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1502 RPYGShRETQTECEWTVDTSKLVFDGLRKKVTAMQLYECQLIDKTTLDKLLKGKKSVEEVASEIQ 1566
Cdd:pfam02463 674 ELLEI-QELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEE 737
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
715-1469 |
3.16e-09 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 62.83 E-value: 3.16e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 715 DSQAIAEVLNQLKDMLANFRGSEKYCY--LQNEVFGLFQKLENINGVTDGYLNSLCTvrallqAILQTEDMLKVYEARLT 792
Cdd:pfam15921 114 DLQTKLQEMQMERDAMADIRRRESQSQedLRNQLQNTVHELEAAKCLKEDMLEDSNT------QIEQLRKMMLSHEGVLQ 187
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 793 EEETVCLDLDKVEAyrcglKKIKNDLNLKKSLLATMKTELQKA-QQIHSQTS----QQYPLYDL-------DLGKFGEKV 860
Cdd:pfam15921 188 EIRSILVDFEEASG-----KKIYEHDSMSTMHFRSLGSAISKIlRELDTEISylkgRIFPVEDQlealkseSQNKIELLL 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 861 TQLTDRWQRIDKQIDFRLWDLEKQIKQLRNYRDNYQAFCKWLYDAKRRQDSLesmkfgdsntVMRFLNEqknLHSEISGK 940
Cdd:pfam15921 263 QQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSM----------YMRQLSD---LESTVSQL 329
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 941 RDKSEEVQKIAElcaNSIKDYELQLASYTSgleTLLNIPIKRTMIQSPSGVI---LQEA-ADVHARYIELLTRSGDYYRF 1016
Cdd:pfam15921 330 RSELREAKRMYE---DKIEELEKQLVLANS---ELTEARTERDQFSQESGNLddqLQKLlADLHKREKELSLEKEQNKRL 403
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 ----------LSEMLKSLEDlklKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQA 1086
Cdd:pfam15921 404 wdrdtgnsitIDHLRRELDD---RNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKV 480
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1087 ELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQ-------KNDYDQLQKArQCEKENLGWQKLE 1159
Cdd:pfam15921 481 VEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKlqelqhlKNEGDHLRNV-QTECEALKLQMAE 559
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1160 SEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVrnhyneemsnlrnkyETEINITKTTIKEISMQKEDDSKNLRNQL 1239
Cdd:pfam15921 560 KDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL---------------EKEINDRRLELQEFKILKDKKDAKIRELE 624
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1240 DRLSrenrDLKDEIVRLndsiLQATEQRRRAEEnalqqkacgsEIMQKKQHLeieLKQVMQQRSEDNARhKQSLEEAAKT 1319
Cdd:pfam15921 625 ARVS----DLELEKVKL----VNAGSERLRAVK----------DIKQERDQL---LNEVKTSRNELNSL-SEDYEVLKRN 682
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1320 IQDKNKEIERLKAEFQEEAKrrwEYENELSKASNRIQE---SKNQCTQV-VQERESLLVKIKVLEQDKARLQRLEDELNR 1395
Cdd:pfam15921 683 FRNKSEEMETTTNKLKMQLK---SAQSELEQTRNTLKSmegSDGHAMKVaMGMQKQITAKRGQIDALQSKIQFLEEAMTN 759
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1396 AkstleaeTRVKQRLECEKQQIQNDLNQWKTQysrkeeairkiesereksereKNSLRSEIERLQAEIKRIEER 1469
Cdd:pfam15921 760 A-------NKEKHFLKEEKNKLSQELSTVATE---------------------KNKMAGELEVLRSQERRLKEK 805
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
671-1377 |
9.07e-09 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 60.89 E-value: 9.07e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 671 ELQKIRRQIEhcEGRMTLKNLPLADQGSSHHITVKINELKSVQNDSQAIAEVLNQLKDMLA-NFRGSEKYCYLQNEVFGL 749
Cdd:pfam05483 107 KLQENRKIIE--AQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNATRHLCNLLKETCArSAEKTKKYEYEREETRQV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 750 FQKLENingvtdgylNSLCTVRALLQAILQTEDmlkvyeARLTEEETVCLDLDKVEAYRCGLKKIKNDLNLKKSLLATMK 829
Cdd:pfam05483 185 YMDLNN---------NIEKMILAFEELRVQAEN------ARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQI 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 830 TE-----------LQKAQQIHSQTSQQYPLYDLDLGKFGEKVTQLTDRWQRIDKQidfrlwdLEKQIKQLRNYRDNYQAF 898
Cdd:pfam05483 250 TEkenkmkdltflLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMS-------LQRSMSTQKALEEDLQIA 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 899 CKWLYDAKRRQDslesmkfgdsnTVMRFLNEQKNLHS----EISGKRDKSEEV----QKIAELCANSIKDYELQLASYTS 970
Cdd:pfam05483 323 TKTICQLTEEKE-----------AQMEELNKAKAAHSfvvtEFEATTCSLEELlrteQQRLEKNEDQLKIITMELQKKSS 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 971 GLETLLNIPIKRTM-------IQSPSGVILQEAADVHARYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEE-LR 1042
Cdd:pfam05483 392 ELEEMTKFKNNKEVeleelkkILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHyLK 471
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1043 LARDANSENCN---KNKFLDQNLQKYQAECSQFKAKLASLE-ELKRQAElDGKSAKQNLDKCYGQIKELNEKITRLTYEI 1118
Cdd:pfam05483 472 EVEDLKTELEKeklKNIELTAHCDKLLLENKELTQEASDMTlELKKHQE-DIINCKKQEERMLKQIENLEEKEMNLRDEL 550
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1119 EdekrrrkSVEDRFDQQKNDYD-QLQKARQcEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRN 1197
Cdd:pfam05483 551 E-------SVREEFIQKGDEVKcKLDKSEE-NARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKK 622
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1198 HYNEEMSNLrNKYETEINITKTTIKEISMQKEDDSKNLRNQLD--RLSRENrdlkdeivrlndsILQATEQRRRAEENAL 1275
Cdd:pfam05483 623 KGSAENKQL-NAYEIKVNKLELELASAKQKFEEIIDNYQKEIEdkKISEEK-------------LLEEVEKAKAIADEAV 688
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1276 QQKacgSEIMQKKQHLEIELKQVMQqrsednaRHKQSLEeaaKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRI 1355
Cdd:pfam05483 689 KLQ---KEIDKRCQHKIAEMVALME-------KHKHQYD---KIIEERDSELGLYKNKEQEQSSAKAALEIELSNIKAEL 755
|
730 740
....*....|....*....|..
gi 975830145 1356 QESKNQCTQVVQERESLLVKIK 1377
Cdd:pfam05483 756 LSLKKQLEIEKEEKEKLKMEAK 777
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1038-1497 |
1.02e-08 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 60.96 E-value: 1.02e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1038 EEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAEldgksakQNLDKCYGQIKELNEKITRLTYE 1117
Cdd:pfam01576 11 EEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAETELCAEAE-------EMRARLAARKQELEEILHELESR 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1118 IEDEkrrrksvEDRFDQQKNDYDQLQK-----ARQCEKENLGWQKLESEKAIKE-KEYEIERLRVLLQEEGTR----KRE 1187
Cdd:pfam01576 84 LEEE-------EERSQQLQNEKKKMQQhiqdlEEQLDEEEAARQKLQLEKVTTEaKIKKLEEDILLLEDQNSKlskeRKL 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1188 YENELAKVRNHYNEE------MSNLRNKYETEINITKTTIKEISMQKEDDSKNLRnqldRLSRENRDLKDEIVRLNDSIL 1261
Cdd:pfam01576 157 LEERISEFTSNLAEEeekaksLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKR----KLEGESTDLQEQIAELQAQIA 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1262 QATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRS---EDNARHKQSLEEAAKTIQDKNKEIERLKAEF---- 1334
Cdd:pfam01576 233 ELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISelqEDLESERAARNKAEKQRRDLGEELEALKTELedtl 312
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1335 -----QEEAKRRWEYE-NELSKA--------SNRIQESKNQCTQVVQERESLLVKIK----VLEQDKARLQRLEDELNRA 1396
Cdd:pfam01576 313 dttaaQQELRSKREQEvTELKKAleeetrshEAQLQEMRQKHTQALEELTEQLEQAKrnkaNLEKAKQALESENAELQAE 392
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1397 KSTL-----EAETRvKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIE---- 1467
Cdd:pfam01576 393 LRTLqqakqDSEHK-RKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLEsqlq 471
|
490 500 510
....*....|....*....|....*....|....*..
gi 975830145 1468 -------ERCRRKLEDSTRetQSQLETERSRYQREID 1497
Cdd:pfam01576 472 dtqellqEETRQKLNLSTR--LRQLEDERNSLQEQLE 506
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
949-1383 |
1.21e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.85 E-value: 1.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 949 KIAELCANSIKDYELQLASYTSGLETLLN-IPIKRTMIQSPSGVILQEAADVHAryiELLTRSGDYYRFLSEMLKSLEDL 1027
Cdd:TIGR02169 663 RGGILFSRSEPAELQRLRERLEGLKRELSsLQSELRRIENRLDELSQELSDASR---KIGEIEKEIEQLEQEEEKLKERL 739
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1028 KLKNTKIEVLEEELRLARDansencnKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDgksakqnldkcygqikel 1107
Cdd:TIGR02169 740 EELEEDLSSLEQEIENVKS-------ELKELEARIEELEEDLHKLEEALNDLEARLSHSRIP------------------ 794
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1108 neKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRvllqeegTRKRE 1187
Cdd:TIGR02169 795 --EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN-------GKKEE 865
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1188 YENELAKVRNHYNEemsnlrnkyeteinitkttikeismqkeddsknLRNQLDRLSRENRDLKDEIVRLNDSILQATEQR 1267
Cdd:TIGR02169 866 LEEELEELEAALRD---------------------------------LESRLGDLKKERDELEAQLRELERKIEELEAQI 912
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1268 RRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDnarhkQSLEEAAKTIQDKNKEIERL-----KA--EFQEEAKR 1340
Cdd:TIGR02169 913 EKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEE-----LSLEDVQAELQRVEEEIRALepvnmLAiqEYEEVLKR 987
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 975830145 1341 RweyeNELSKASNRIQEsknqctqvvqERESLLVKIKVLEQDK 1383
Cdd:TIGR02169 988 L----DELKEKRAKLEE----------ERKAILERIEEYEKKK 1016
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1105-1413 |
1.48e-08 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 60.14 E-value: 1.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1105 KELNEKITRLTYEIEDEKRRRKSVED--RFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRvllQEEg 1182
Cdd:pfam17380 282 KAVSERQQQEKFEKMEQERLRQEKEEkaREVERRRKLEEAEKARQAEMDRQAAIYAEQERMAMERERELERIR---QEE- 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1183 tRKREYEN--------ELAKVRN----HYNEEMSNLRNKYETE---------------INITKTTIKEISMQKED-DSKN 1234
Cdd:pfam17380 358 -RKRELERirqeeiamEISRMRElerlQMERQQKNERVRQELEaarkvkileeerqrkIQQQKVEMEQIRAEQEEaRQRE 436
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1235 LR-------NQLDRLSRENRDLKDEIVRLNDsilQATEQRRRAEENALQQKacgseimqKKQHLEIELKQVMQQRSEDNa 1307
Cdd:pfam17380 437 VRrleeeraREMERVRLEEQERQQQVERLRQ---QEEERKRKKLELEKEKR--------DRKRAEEQRRKILEKELEER- 504
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1308 RHKQSLEEAAKTIQDKNKEiERLKAEFQEEAKRRWEyenelskasnriQESKNQctQVVQERESLLVKIKVLEQDKARLQ 1387
Cdd:pfam17380 505 KQAMIEEERKRKLLEKEME-ERQKAIYEEERRREAE------------EERRKQ--QEMEERRRIQEQMRKATEERSRLE 569
|
330 340
....*....|....*....|....*.
gi 975830145 1388 RLEDELNRAKSTLEAEtrvKQRLECE 1413
Cdd:pfam17380 570 AMEREREMMRQIVESE---KARAEYE 592
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1105-1459 |
1.74e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.08 E-value: 1.74e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1105 KELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLG-----WQKLESEKAiKEKEyEIERLRVLLQ 1179
Cdd:TIGR02169 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGeiekeIEQLEQEEE-KLKE-RLEELEEDLS 747
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1180 EEGTRKREYENELAKVRNHYnEEMSNLRNKYETEINitkttikeismqkEDDSKNLRNQLDRLSRENRDLKDEIVRLnDS 1259
Cdd:TIGR02169 748 SLEQEIENVKSELKELEARI-EELEEDLHKLEEALN-------------DLEARLSHSRIPEIQAELSKLEEEVSRI-EA 812
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1260 ILQATEQ---RRRAEENALQQKAcgSEIMQKKQHLEIELKQVMQQRSEDNARhkqsLEEAAKTIQDKNKEIERLKAEFQE 1336
Cdd:TIGR02169 813 RLREIEQklnRLTLEKEYLEKEI--QELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAALRDLESRLGD 886
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1337 EAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQdkaRLQRLEDELNRAKS----TLEAETRVKQRLEC 1412
Cdd:TIGR02169 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEE---ELSEIEDPKGEDEEipeeELSLEDVQAELQRV 963
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 975830145 1413 EkQQIQ--NDLNQWKTQ-YSRKEEAIRKIESEREKSEREKNSLRSEIERL 1459
Cdd:TIGR02169 964 E-EEIRalEPVNMLAIQeYEEVLKRLDELKEKRAKLEEERKAILERIEEY 1012
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1303-1502 |
2.04e-08 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 59.01 E-value: 2.04e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1303 SEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQD 1382
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1383 KARLQRLEDELNRAKSTLEAETRVKQRLECEK-QQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQA 1461
Cdd:COG4942 99 LEAQKEELAELLRALYRLGRQPPLALLLSPEDfLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEA 178
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 975830145 1462 EIKRIEERCRR--KLEDSTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG4942 179 LLAELEEERAAleALKAERQKLLARLEKELAELAAELAELQQE 221
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1027-1502 |
2.14e-08 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 59.74 E-value: 2.14e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1027 LKLKNTKIEVLEEELRlaRDANSENCNKNKFLDQNLQKYqaecSQFKAKLASLEELKRQA-ELDGKSAKQNLDkcygqIK 1105
Cdd:pfam05483 217 LKEDHEKIQHLEEEYK--KEINDKEKQVSLLLIQITEKE----NKMKDLTFLLEESRDKAnQLEEKTKLQDEN-----LK 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1106 ELNEKITRLTYEIEDEK-------RRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLL 1178
Cdd:pfam05483 286 ELIEKKDHLTKELEDIKmslqrsmSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELL 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1179 QEEGTRKREYENELAKV------RNHYNEEMSNLRNKYETEINITKTTIKEisMQKEDDSKnlrNQLDRLSREnrdlkde 1252
Cdd:pfam05483 366 RTEQQRLEKNEDQLKIItmelqkKSSELEEMTKFKNNKEVELEELKKILAE--DEKLLDEK---KQFEKIAEE------- 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1253 ivrlndsiLQATEQrrraeenalqqkacgseimqkkqhleiELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKA 1332
Cdd:pfam05483 434 --------LKGKEQ---------------------------ELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKT 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1333 EFQEEAKRrweyENELSKASNRIQ-ESKnqctQVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTL-EAETRVKQRL 1410
Cdd:pfam05483 479 ELEKEKLK----NIELTAHCDKLLlENK----ELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLeEKEMNLRDEL 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1411 ECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREK-------NSLRSEIERLQAEIKRIEERCRRKLEDSTRETQ- 1482
Cdd:pfam05483 551 ESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMkilenkcNNLKKQIENKNKNIEELHQENKALKKKGSAENKq 630
|
490 500
....*....|....*....|.
gi 975830145 1483 -SQLETERSRYQREIDKLRQR 1502
Cdd:pfam05483 631 lNAYEIKVNKLELELASAKQK 651
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2281-2314 |
2.60e-08 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 51.71 E-value: 2.60e-08
10 20 30
....*....|....*....|....*....|....
gi 975830145 2281 QRFLEFQYLTGGLVDPEVHGRISTEEAIRKGFID 2314
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLID 34
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1062-1211 |
2.72e-08 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 56.86 E-value: 2.72e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1062 LQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKN--DY 1139
Cdd:COG1579 12 LQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNnkEY 91
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1140 DQLQKarqcEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYE 1211
Cdd:COG1579 92 EALQK----EIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELE 159
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1258-1502 |
3.16e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.30 E-value: 3.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1258 DSILQATEQRRRA--EENA--LQQKACGSEIMQKKQHLEIELKQVMQQRSEDNaRHKQSLEEAAKtiqdKNKEIERLKAE 1333
Cdd:TIGR02168 147 SEIIEAKPEERRAifEEAAgiSKYKERRKETERKLERTRENLDRLEDILNELE-RQLKSLERQAE----KAERYKELKAE 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1334 fqEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQD----KARLQRLEDELNRAKSTLEAETRVKQR 1409
Cdd:TIGR02168 222 --LRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKleelRLEVSELEEEIEELQKELYALANEISR 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1410 LECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRK------LEDSTRETQS 1483
Cdd:TIGR02168 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELeaeleeLESRLEELEE 379
|
250 260
....*....|....*....|..
gi 975830145 1484 QLETERSRY---QREIDKLRQR 1502
Cdd:TIGR02168 380 QLETLRSKVaqlELQIASLNNE 401
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1026-1498 |
4.28e-08 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 59.00 E-value: 4.28e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1026 DLKLKNTKIEVLEEELRLARDANSENCNKNKfldqnlQKYQAECSQFKAKLASLEELKRQAElDGKSAKQNLDKCYGQIK 1105
Cdd:PTZ00121 1082 DAKEDNRADEATEEAFGKAEEAKKTETGKAE------EARKAEEAKKKAEDARKAEEARKAE-DARKAEEARKAEDAKRV 1154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1106 ELNEKI--TRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGwQKLESEKAIKE-KEYEIERL--RVLLQE 1180
Cdd:PTZ00121 1155 EIARKAedARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAA-RKAEEERKAEEaRKAEDAKKaeAVKKAE 1233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1181 EGTRKREYENELAKVRNhyNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQldRLSRENRdlKDEIVRLNDSI 1260
Cdd:PTZ00121 1234 EAKKDAEEAKKAEEERN--NEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEK--KKADEAK--KAEEKKKADEA 1307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1261 LQATEQRRRAEEnaLQQKA----CGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEiERLKAefqE 1336
Cdd:PTZ00121 1308 KKKAEEAKKADE--AKKKAeeakKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEE-AKKKA---D 1381
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1337 EAKRRWEYENELSKASNRIQESKNQCTQVVQERESllvKIKVLE-QDKARLQRLEDELNRAKSTLEAETRVKQRLEcEKQ 1415
Cdd:PTZ00121 1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA---KKKADEaKKKAEEKKKADEAKKKAEEAKKADEAKKKAE-EAK 1457
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1416 QIQNDLNqwKTQYSRKEEAIRKiesereksereknslRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQRE 1495
Cdd:PTZ00121 1458 KAEEAKK--KAEEAKKADEAKK---------------KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
|
...
gi 975830145 1496 IDK 1498
Cdd:PTZ00121 1521 AKK 1523
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1156-1409 |
4.88e-08 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 57.85 E-value: 4.88e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1156 QKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEeMSNLRNKYETEINITKTTIKEismqkeddsknL 1235
Cdd:COG4942 21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAA-LARRIRALEQELAALEAELAE-----------L 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1236 RNQLDRLSRENRDLKDEIVRLndsiLQATEQRRRAEENALQQKACGSEIMQKKQHLeieLKQVMQQRsednarhkqslEE 1315
Cdd:COG4942 89 EKEIAELRAELEAQKEELAEL----LRALYRLGRQPPLALLLSPEDFLDAVRRLQY---LKYLAPAR-----------RE 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1316 AAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESllvKIKVLEQDKARLQRLEDELNR 1395
Cdd:COG4942 151 QAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEK---ELAELAAELAELQQEAEELEA 227
|
250
....*....|....
gi 975830145 1396 AKSTLEAETRVKQR 1409
Cdd:COG4942 228 LIARLEAEAAAAAE 241
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1230-1473 |
4.96e-08 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 57.85 E-value: 4.96e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1230 DDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARH 1309
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1310 KQSLEEAAKTIQD--KNKEIERLKAEFQ----EEAKRRWEYENELSKASnriqesKNQCTQVVQERESLLVKIKVLEQDK 1383
Cdd:COG4942 100 EAQKEELAELLRAlyRLGRQPPLALLLSpedfLDAVRRLQYLKYLAPAR------REQAEELRADLAELAALRAELEAER 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1384 ARLQRLEDELNRAKSTLEAETRvkqrlecEKQQIQNDLNQwktQYSRKEEAIRKiesereksereknsLRSEIERLQAEI 1463
Cdd:COG4942 174 AELEALLAELEEERAALEALKA-------ERQKLLARLEK---ELAELAAELAE--------------LQQEAEELEALI 229
|
250
....*....|
gi 975830145 1464 KRIEERCRRK 1473
Cdd:COG4942 230 ARLEAEAAAA 239
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
190-366 |
5.24e-08 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 55.91 E-value: 5.24e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 190 ECLGWMRQQRAEMDMVAWGVDLASVEQHINSHRGIHNSIGDYRWQLDKIKA---DLREKSAIY---------QLEEEYEN 257
Cdd:cd00176 11 ELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNElgeQLIEEGHPDaeeiqerleELNQRWEE 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 258 LLKASFERMDHL---RQLQNIIQATSREIMWINDCEEEELLYDWSDKNTNIAQKQEAFSIRMSQLEVKEKELNKLKQESD 334
Cdd:cd00176 91 LRELAEERRQRLeeaLDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSLNELAE 170
|
170 180 190
....*....|....*....|....*....|...
gi 975830145 335 QLVLNQHPAS-DKIEAYMDTLQTQWSWILQITK 366
Cdd:cd00176 171 ELLEEGHPDAdEEIEEKLEELNERWEELLELAE 203
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1017-1494 |
1.23e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 57.49 E-value: 1.23e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 LSEMLKSLEDLKlkNTKIEVLEEELRLARDANSENCNknkfLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQN 1096
Cdd:pfam01576 487 LSTRLRQLEDER--NSLQEQLEEEEEAKRNVERQLST----LQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQ 560
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1097 LDKCYGQIKELNEKITRLTYEIEDekrrrksVEDRFDQQKNDYDQLQKarqceKENLGWQKLESEKAIKEKeYEIERLRV 1176
Cdd:pfam01576 561 LEEKAAAYDKLEKTKNRLQQELDD-------LLVDLDHQRQLVSNLEK-----KQKKFDQMLAEEKAISAR-YAEERDRA 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1177 llqEEGTRKREYEnelAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLrNQLDRLSR----ENRDLKDE 1252
Cdd:pfam01576 628 ---EAEAREKETR---ALSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNV-HELERSKRaleqQVEEMKTQ 700
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1253 IVRLNDSiLQATEQRR-RAEENALQQKACGSEIMQKKQHLEIE-----LKQVMQQRSE-DNARHKQSLEEAAKtiqdKNK 1325
Cdd:pfam01576 701 LEELEDE-LQATEDAKlRLEVNMQALKAQFERDLQARDEQGEEkrrqlVKQVRELEAElEDERKQRAQAVAAK----KKL 775
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1326 EIERLKAEFQEEA--KRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKvlEQDKaRLQRLEDELNRAKSTLEAE 1403
Cdd:pfam01576 776 ELDLKELEAQIDAanKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSK--ESEK-KLKNLEAELLQLQEDLAAS 852
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1404 TRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKsereknsLRSEIERLQAEIKRIEERcRRKLEDSTRETQS 1483
Cdd:pfam01576 853 ERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQ-------LEEELEEEQSNTELLNDR-LRKSTLQVEQLTT 924
|
490
....*....|.
gi 975830145 1484 QLETERSRYQR 1494
Cdd:pfam01576 925 ELAAERSTSQK 935
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1125-1500 |
1.42e-07 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 56.97 E-value: 1.42e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1125 RKSVEDRFDQQKNDYDQLqKARQCEKENLGWQKL--ESEKAIKEKEYEIERLrvllQEEGTRKREYENELAKVRNHYNEE 1202
Cdd:PRK02224 175 RLGVERVLSDQRGSLDQL-KAQIEEKEEKDLHERlnGLESELAELDEEIERY----EEQREQARETRDEADEVLEEHEER 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1203 MSNLrNKYETEINITKTTIKEISMQKEDDS---KNLRNQLDRLSRENRDLKDEiVRLNDSILQATEQRRraEENALQQKA 1279
Cdd:PRK02224 250 REEL-ETLEAEIEDLRETIAETEREREELAeevRDLRERLEELEEERDDLLAE-AGLDDADAEAVEARR--EELEDRDEE 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1280 CGSEIMQKKQHLEIELKQVMQQRS-----EDNARHKQS--------LEEAAKTIQDKNKEIERLKAEFQEEAKR------ 1340
Cdd:PRK02224 326 LRDRLEECRVAAQAHNEEAESLREdaddlEERAEELREeaaeleseLEEAREAVEDRREEIEELEEEIEELRERfgdapv 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1341 ----------------------RWEYENELSKASNRIQESKN--------QCTQ---------VVQERESLLVKIKV-LE 1380
Cdd:PRK02224 406 dlgnaedfleelreerdelrerEAELEATLRTARERVEEAEAlleagkcpECGQpvegsphveTIEEDRERVEELEAeLE 485
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1381 QDKARLQRLEDELNRAKSTLEAETRVkQRLECEKQQIQNDLNQWKTQYSRKEEAIrkieserekserekNSLRSEIERLQ 1460
Cdd:PRK02224 486 DLEEEVEEVEERLERAEDLVEAEDRI-ERLEERREDLEELIAERRETIEEKRERA--------------EELRERAAELE 550
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 975830145 1461 AEIKRIEERCRRKLE--DSTRETQSQLETERSRYQREIDKLR 1500
Cdd:PRK02224 551 AEAEEKREAAAEAEEeaEEAREEVAELNSKLAELKERIESLE 592
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1233-1485 |
1.43e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 57.23 E-value: 1.43e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1233 KNLRNqLDRLSREN----RDLKDEIVRLNDSILQATEQRRRAEEnALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNA- 1307
Cdd:COG4913 204 KPIGD-LDDFVREYmleePDTFEAADALVEHFDDLERAHEALED-AREQIELLEPIRELAERYAAARERLAELEYLRAAl 281
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1308 ---RHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQctqvvqERESLLVKIKVLEQDKA 1384
Cdd:COG4913 282 rlwFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGD------RLEQLEREIERLERELE 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1385 RLQRLEDELNRAKSTLEAET-RVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEI 1463
Cdd:COG4913 356 ERERRRARLEALLAALGLPLpASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRK 435
|
250 260
....*....|....*....|....*
gi 975830145 1464 KRIEER---CRRKLEDSTRETQSQL 1485
Cdd:COG4913 436 SNIPARllaLRDALAEALGLDEAEL 460
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1106-1512 |
1.54e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 57.29 E-value: 1.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1106 ELNEKITRLTYEIEDEKRRRKSVED---RFDQQKNDYDQL-QKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEE 1181
Cdd:TIGR00618 82 SLGTKIYRVHRTLRCTRSHRKTEQPeqlYLEQKKGRGRILaAKKSETEEVIHDLLKLDYKTFTRVVLLPQGEFAQFLKAK 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1182 GTRKREYENELAKVrNHYNEEMSNLRNKY---ETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLND 1258
Cdd:TIGR00618 162 SKEKKELLMNLFPL-DQYTQLALMEFAKKkslHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQ 240
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1259 SILQATEQRRRAEENALQQKACGsEIMQKKQHLEIELKQVMQQRSEDN-ARHKQSLEEAAKTIQDKNKEIERLKAEFQEE 1337
Cdd:TIGR00618 241 SHAYLTQKREAQEEQLKKQQLLK-QLRARIEELRAQEAVLEETQERINrARKAAPLAAHIKAVTQIEQQAQRIHTELQSK 319
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1338 akrrweyENELSKAsnriqesKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLE---AETRVKQRLECEK 1414
Cdd:TIGR00618 320 -------MRSRAKL-------LMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREiscQQHTLTQHIHTLQ 385
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1415 QQIQNDLNQWKTQySRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQ------LETE 1488
Cdd:TIGR00618 386 QQKTTLTQKLQSL-CKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEklekihLQES 464
|
410 420
....*....|....*....|....
gi 975830145 1489 RSRYQREIDKLRQRPYGSHRETQT 1512
Cdd:TIGR00618 465 AQSLKEREQQLQTKEQIHLQETRK 488
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1641-1678 |
1.70e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 49.40 E-value: 1.70e-07
10 20 30
....*....|....*....|....*....|....*...
gi 975830145 1641 QQIYAAEKAITGFDDPFSGKTVSVSEAIKKNLIDRETG 1678
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
993-1467 |
1.98e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.87 E-value: 1.98e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 993 LQEAADVHARYIELLTR-SGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQ 1071
Cdd:COG1196 269 LEELRLELEELELELEEaQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEE 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1072 FKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKE 1151
Cdd:COG1196 349 AEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEE 428
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 NLGwQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINItKTTIKEISMQKEDD 1231
Cdd:COG1196 429 ALA-ELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAAR-LLLLLEAEADYEGF 506
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1232 S-----KNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRR--RAEENALQQKACGSEIMQKKQH--LEIELKQVMQQR 1302
Cdd:COG1196 507 LegvkaALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALqnIVVEDDEVAAAAIEYLKAAKAGraTFLPLDKIRARA 586
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1303 SEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLV-------- 1374
Cdd:COG1196 587 ALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAggsltggs 666
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1375 KIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREK----- 1449
Cdd:COG1196 667 RRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELleeee 746
|
490 500 510
....*....|....*....|....*....|....
gi 975830145 1450 ----------------NSLRSEIERLQAEIKRIE 1467
Cdd:COG1196 747 lleeealeelpeppdlEELERELERLEREIEALG 780
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1605-1640 |
2.05e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 49.02 E-value: 2.05e-07
10 20 30
....*....|....*....|....*....|....*.
gi 975830145 1605 LLEAQAATGGIIDPHRNEKLTVDSAIARDLIDFDDR 1640
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
1134-1501 |
2.30e-07 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 56.12 E-value: 2.30e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1134 QQKNDY----DQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNK 1209
Cdd:COG5185 140 VEKLDEiadiEASYGEVETGIIKDIFGKLTQELNQNLKKLEIFGLTLGLLKGISELKKAEPSGTVNSIKESETGNLGSES 219
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1210 YETEINITKTTIKEI---SMQKEDDSKNLRNQLDRLSR---ENRDLKDEIVRlndsilQATEQRRRAEENALQQKacgSE 1283
Cdd:COG5185 220 TLLEKAKEIINIEEAlkgFQDPESELEDLAQTSDKLEKlveQNTDLRLEKLG------ENAESSKRLNENANNLI---KQ 290
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1284 IMQKKQHLEIELKQVmqqrseDNARHKQSLEEAAKTIQdKNKEIERLKAEFQEEAKRRW-EYENELSKASNRIQESKNQC 1362
Cdd:COG5185 291 FENTKEKIAEYTKSI------DIKKATESLEEQLAAAE-AEQELEESKRETETGIQNLTaEIEQGQESLTENLEAIKEEI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1363 TQVVQERESLLVKIKvLEQDKARLQRLEDEL--------NRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEA 1434
Cdd:COG5185 364 ENIVGEVELSKSSEE-LDSFKDTIESTKESLdeipqnqrGYAQEILATLEDTLKAADRQIEELQRQIEQATSSNEEVSKL 442
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 975830145 1435 IRKIESEREKSEREKNSLRSE---------IERLQAEIKRIEERcRRKLEDSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:COG5185 443 LNELISELNKVMREADEESQSrleeaydeiNRSVRSKKEDLNEE-LTQIESRVSTLKATLEKLRAKLERQLEGVRS 517
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2025-2053 |
2.31e-07 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 48.86 E-value: 2.31e-07
10 20
....*....|....*....|....*....
gi 975830145 2025 IVDPETNKEMSVQEAYKKGLIDYETFKEL 2053
Cdd:pfam00681 11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1848-1882 |
4.81e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 47.86 E-value: 4.81e-07
10 20 30
....*....|....*....|....*....|....*
gi 975830145 1848 LLEAQAATGFIVDPVSNLRLPVEEAYKRGLVGIEF 1882
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1103-1352 |
5.09e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 54.38 E-value: 5.09e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1103 QIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQcekenlgwqklESEKAIKEKEYEIERLRVLLQEEG 1182
Cdd:COG4942 28 ELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIR-----------ALEQELAALEAELAELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1183 TRKREYENELAKVrnhyneemsnLRNKYETEINitkTTIKEISMQkeDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQ 1262
Cdd:COG4942 97 AELEAQKEELAEL----------LRALYRLGRQ---PPLALLLSP--EDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1263 ATEQRRRAEENALQQKACGSEIMQKKQhleiELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRW 1342
Cdd:COG4942 162 LAALRAELEAERAELEALLAELEEERA----ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAA 237
|
250
....*....|
gi 975830145 1343 EYENELSKAS 1352
Cdd:COG4942 238 AAAERTPAAG 247
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1004-1500 |
5.38e-07 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 55.13 E-value: 5.38e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1004 IELLTRSGDYYRFLSEMLKSLEDLKlknTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELK 1083
Cdd:pfam05557 30 IELEKKASALKRQLDRESDRNQELQ---KRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLNEKESQLADAREVI 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1084 RqaeldgkSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKE-NLGWQKLESE- 1161
Cdd:pfam05557 107 S-------CLKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQNLEKQQSSLAEaEQRIKELEFEi 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1162 ----------KAIKEK-------EYEIERLRV-------------LLQEEgtrKREYENELAKVRNhYNEEMSNL---RN 1208
Cdd:pfam05557 180 qsqeqdseivKNSKSElaripelEKELERLREhnkhlnenienklLLKEE---VEDLKRKLEREEK-YREEAATLeleKE 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1209 KYETEINITKTTIKE--ISMQKEDDsknLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQ 1286
Cdd:pfam05557 256 KLEQELQSWVKLAQDtgLNLRSPED---LSRRIEQLQQREIVLKEENSSLTSSARQLEKARRELEQELAQYLKKIEDLNK 332
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1287 KKQHLEiELKQVMQQR----SEDNARHKQSLEE------AAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQ 1356
Cdd:pfam05557 333 KLKRHK-ALVRRLQRRvlllTKERDGYRAILESydkeltMSNYSPQLLERIEEAEDMTQKMQAHNEEMEAQLSVAEEELG 411
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1357 ESKNQCTQVvqERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAE-TRVKQR-----LECEKQQIQNDLNQWKTqysr 1430
Cdd:pfam05557 412 GYKQQAQTL--ERELQALRQQESLADPSYSKEEVDSLRRKLETLELErQRLREQkneleMELERRCLQGDYDPKKT---- 485
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1431 keeaiRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCrRKLEDSTRETQSQLETERSRYQREIDKLR 1500
Cdd:pfam05557 486 -----KVLHLSMNPAAEAYQQRKNQLEKLQAEIERLKRLL-KKLEDDLEQVLRLPETTSTMNFKEVLDLR 549
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
993-1315 |
6.53e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 54.92 E-value: 6.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 993 LQEAADVHARYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSEncnknkfldqnlqkyqaecsqf 1072
Cdd:COG4913 629 AEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERLDASSDD---------------------- 686
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1073 kakLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDY--DQLQKARQCEK 1150
Cdd:COG4913 687 ---LAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALleERFAAALGDAV 763
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1151 ENLGWQKLESE--KAIKEKEYEIERLRVLLQEegtRKREYENELAKV------RNHYNEEMSNLRN----KYETEIN--I 1216
Cdd:COG4913 764 ERELRENLEERidALRARLNRAEEELERAMRA---FNREWPAETADLdadlesLPEYLALLDRLEEdglpEYEERFKelL 840
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1217 TKTTIKEISmqkeddskNLRNQLDrlsRENRDLKDEIVRLNDSILQ-----------ATEQRRRAEENALQQ--KACGS- 1282
Cdd:COG4913 841 NENSIEFVA--------DLLSKLR---RAIREIKERIDPLNDSLKRipfgpgrylrlEARPRPDPEVREFRQelRAVTSg 909
|
330 340 350
....*....|....*....|....*....|....*..
gi 975830145 1283 ---EIMQKKQHLEIELKQVMQQ-RSEDNARHKQSLEE 1315
Cdd:COG4913 910 aslFDEELSEARFAALKRLIERlRSEEEESDRRWRAR 946
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1236-1525 |
6.82e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 54.92 E-value: 6.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1236 RNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQqkacgseimqkkqhleieLKQVMQQRSEDnarhkQSLEE 1315
Cdd:COG4913 609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQERREA------------------LQRLAEYSWDE-----IDVAS 665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1316 AAKTIQDKNKEIERLKAEfqeeakrrweyENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNR 1395
Cdd:COG4913 666 AEREIAELEAELERLDAS-----------SDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQD 734
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1396 AKSTLEAETRVKQRLECEKQ---------------QIQNDLNQWKTQYSRKEEAIRKIESE-----REKSEREKNSLRS- 1454
Cdd:COG4913 735 RLEAAEDLARLELRALLEERfaaalgdaverelreNLEERIDALRARLNRAEEELERAMRAfnrewPAETADLDADLESl 814
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1455 -EIERLQAEIKRI-----EERCRRKLEDSTRET----QSQLETERSRYQREIDK----LRQRPYGSHRETQTECEWTVDT 1520
Cdd:COG4913 815 pEYLALLDRLEEDglpeyEERFKELLNENSIEFvadlLSKLRRAIREIKERIDPlndsLKRIPFGPGRYLRLEARPRPDP 894
|
....*
gi 975830145 1521 SKLVF 1525
Cdd:COG4913 895 EVREF 899
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1304-1515 |
6.90e-07 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.07 E-value: 6.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1304 EDNARHKQSLEEAAKTIQDKNKEIERLKAE------FQEEAKRRWEYE-NELSKASNRIQESKNQCTQVVQEREsllvki 1376
Cdd:TIGR02169 177 EELEEVEENIERLDLIIDEKRQQLERLRRErekaerYQALLKEKREYEgYELLKEKEALERQKEAIERQLASLE------ 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1377 KVLEQDKARLQRLEDELNRAKSTL-EAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSE 1455
Cdd:TIGR02169 251 EELEKLTEEISELEKRLEEIEQLLeELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAE 330
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 975830145 1456 IERLQAEIKRIEERC------RRKLEDSTRETQSQLETERSRYQrEIDKLRQRPYGSHRETQTECE 1515
Cdd:TIGR02169 331 IDKLLAEIEELEREIeeerkrRDKLTEEYAELKEELEDLRAELE-EVDKEFAETRDELKDYREKLE 395
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
670-1423 |
7.88e-07 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 55.06 E-value: 7.88e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 670 MELQKIRRQIEHCEGRMTLKNLPLADQGSS-----HHITVKINELKSvQNDSQAIAEVLNQLKDMLANFRGSEKYCYlQN 744
Cdd:TIGR01612 1814 IEAKEFDRIINHFKKKLDHVNDKFTKEYSKinegfDDISKSIENVKN-STDENLLFDILNKTKDAYAGIIGKKYYSY-KD 1891
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 745 EVFGLFQK--------------------LENINGVTDGYLNSlctvrallqailQTEDMLKVYEARLTEEET-------- 796
Cdd:TIGR01612 1892 EAEKIFINisklansiniqiqnnsgidlFDNINIAILSSLDS------------EKEDTLKFIPSPEKEPEIytkirdsy 1959
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 797 -VCLDL-------------------------DKVEAYRcGLKKIKNDLNLKKS-LLATMKTELQKAQQIHSQT--SQQYP 847
Cdd:TIGR01612 1960 dTLLDIfkksqdlhkkeqdtlniifenqqlyEKIQASN-ELKDTLSDLKYKKEkILNDVKLLLHKFDELNKLScdSQNYD 2038
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 848 LYdLDLGKFgEKVTQLTDRWQRIDKQ--IDFRLWDLEKQ----IKQLRNYRDNYQAFCKWLYDAKRRQDSLESMKFGDSN 921
Cdd:TIGR01612 2039 TI-LELSKQ-DKIKEKIDNYEKEKEKfgIDFDVKAMEEKfdndIKDIEKFENNYKHSEKDNHDFSEEKDNIIQSKKKLKE 2116
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 922 TVMRFLNEQKNLHSEISGKRDKSEEVQKIAELCAnsikdyelqLASYTSGLETLlnipiKRTMIQspsgvilqeaadvHA 1001
Cdd:TIGR01612 2117 LTEAFNTEIKIIEDKIIEKNDLIDKLIEMRKECL---------LFSYATLVETL-----KSKVIN-------------HS 2169
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTR-SGDYYRFLSEMLKSLEDlklkntKIEVLEEELRLardansencnknkfldqnlqkyqaecSQFKAKLASLE 1080
Cdd:TIGR01612 2170 EFITSAAKfSKDFFEFIEDISDSLND------DIDALQIKYNL--------------------------NQTKKHMISIL 2217
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1081 ElkrqaelDGKSAKQNLDKcygQIKELNEKITRLT--YEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKE---NLGW 1155
Cdd:TIGR01612 2218 A-------DATKDHNNLIE---KEKEATKIINNLTelFTIDFNNADADILHNNKIQIIYFNSELHKSIESIKKlykKINA 2287
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1156 QKLESEKAIKEKEYEIER-LRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISmqkeDDSKN 1234
Cdd:TIGR01612 2288 FKLLNISHINEKYFDISKeFDNIIQLQKHKLTENLNDLKEIDQYISDKKNIFLHALNENTNFNFNALKEIY----DDIIN 2363
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1235 LRNQLDRLSRENRDLKDEIVRLNDSILQATEQrrraeenaLQQKACGSEIMQKKQHLeieLKQVMQQRSEDN-ARHKQSL 1313
Cdd:TIGR01612 2364 RENKADEIENINNKENENIMQYIDTITKLTEK--------IQDILIFVTTYENDNNI---IKQHIQDNDENDvSKIKDNL 2432
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1314 EEAAKTIQDKNKEIERLKAEFqeeakrrweYENelskasNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQ-RLED- 1391
Cdd:TIGR01612 2433 KKTIQSFQEILNKIDEIKAQF---------YGG------NNINNIIITISQNANDVKNHFSKDLTIENELIQIQkRLEDi 2497
|
810 820 830
....*....|....*....|....*....|....*.
gi 975830145 1392 ----ELNRAKSTLEAETRVKQRLECEKQQIQNDLNQ 1423
Cdd:TIGR01612 2498 knaaHEIRSEQITKYTNAIHNHIEEQFKKIENNSNK 2533
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1039-1331 |
1.03e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 54.41 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1039 EELRLARDansENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELD-------------GKSAKQNlDKcygqiK 1105
Cdd:pfam01576 815 EEARASRD---EILAQSKESEKKLKNLEAELLQLQEDLAASERARRQAQQErdeladeiasgasGKSALQD-EK-----R 885
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1106 ELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLgwQKLESekAIKEKEYEIERLRVLLQE-EGTr 1184
Cdd:pfam01576 886 RLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERSTS--QKSES--ARQQLERQNKELKAKLQEmEGT- 960
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1185 kreyenelakvrnhyneemsnLRNKYETEINITKTTIKEISMQKEDDSKNlRNQLDRLSRENRDlkdeivRLNDSILQAT 1264
Cdd:pfam01576 961 ---------------------VKSKFKSSIAALEAKIAQLEEQLEQESRE-RQAANKLVRRTEK------KLKEVLLQVE 1012
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 1265 EQRRRAEENALQQKACGSEIMQKKQHLEIELKQVmqQRSEDNARHKQ-SLEEAAKTIQDKNKEIERLK 1331
Cdd:pfam01576 1013 DERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEA--SRANAARRKLQrELDDATESNESMNREVSTLK 1078
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1808-1845 |
1.25e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 46.71 E-value: 1.25e-06
10 20 30
....*....|....*....|....*....|....*...
gi 975830145 1808 KDFLQGSSCIAGIYNETTKQKLGIYEAMKIGLVRPGTA 1845
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
824-1499 |
1.42e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 53.82 E-value: 1.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 824 LLATMKTELQKAQQIHSQTSQQyplydldlgkfgEKVTQLTDRWQRIDKQIDFRLWDLEKQIKQLRNYRDNYQAFCKWLY 903
Cdd:TIGR00618 227 ELKHLREALQQTQQSHAYLTQK------------REAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 904 DAKrRQDSLESMKFgDSNTVMRFLNEQKNLHSEISGKRdkseevqkiaelcANSIKDyELQLASYTSGLETLLNIPIKRT 983
Cdd:TIGR00618 295 LAA-HIKAVTQIEQ-QAQRIHTELQSKMRSRAKLLMKR-------------AAHVKQ-QSSIEEQRRLLQTLHSQEIHIR 358
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 984 MIQSPSGVILQEAADVHARYIELLTRSGDyYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQ 1063
Cdd:TIGR00618 359 DAHEVATSIREISCQQHTLTQHIHTLQQQ-KTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQ 437
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1064 KYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQ 1143
Cdd:TIGR00618 438 RYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPAR 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1144 KA---------RQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRnkyetei 1214
Cdd:TIGR00618 518 QDidnpgpltrRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ------- 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1215 NITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRR-----RAEENALQQK---ACGSEIMQ 1286
Cdd:TIGR00618 591 NITVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKltalhALQLTLTQERvreHALSIRVL 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1287 KKQHLEIELK--QVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQeeakrrwEYENELSKASNRIQ---ESKNQ 1361
Cdd:TIGR00618 671 PKELLASRQLalQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFN-------EIENASSSLGSDLAareDALNQ 743
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1362 CTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAK-STLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIES 1440
Cdd:TIGR00618 744 SLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAElSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQC 823
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1441 EREKSEREK-NSLRSEIERLQAEIKRIEErcrrKLEDsTRETQSQLETERSRYQREIDKL 1499
Cdd:TIGR00618 824 ETLVQEEEQfLSRLEEKSATLGEITHQLL----KYEE-CSKQLAQLTQEQAKIIQLSDKL 878
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1019-1287 |
1.61e-06 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 52.22 E-value: 1.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1019 EMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKfldQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLD 1098
Cdd:COG1340 5 ELSSSLEELEEKIEELREEIEELKEKRDELNEELKELA---EKRDELNAQVKELREEAQELREKRDELNEKVKELKEERD 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1099 KCYGQIKELNEKITRLTYEIEDEKRRRKSVedrfDQQKNDYDQLQKARQCEKENLGWQK------------LESEKAIKE 1166
Cdd:COG1340 82 ELNEKLNELREELDELRKELAELNKAGGSI----DKLRKEIERLEWRQQTEVLSPEEEKelvekikelekeLEKAKKALE 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1167 KEYEIERLRVLLQEEGTRKREYEN---ELAKVRNHYNEEMSNLRNKYET---EINITKTTIKEISmqkeDDSKNLRNQLD 1240
Cdd:COG1340 158 KNEKLKELRAELKELRKEAEEIHKkikELAEEAQELHEEMIELYKEADElrkEADELHKEIVEAQ----EKADELHEEII 233
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 975830145 1241 RLSRENRDLKDEIVRLNDsilQATEQRRRAEENALQQKAcgSEIMQK 1287
Cdd:COG1340 234 ELQKELRELRKELKKLRK---KQRALKREKEKEELEEKA--EEIFEK 275
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1058-1270 |
1.61e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.84 E-value: 1.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1058 LDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKN 1137
Cdd:COG4942 25 AEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKE 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1138 DYDQ----LQKARQCEKENLGWQKLESEKAIKEKEY----------EIERLRVLLQEEGTRKREYENELAKVRNHYnEEM 1203
Cdd:COG4942 105 ELAEllraLYRLGRQPPLALLLSPEDFLDAVRRLQYlkylaparreQAEELRADLAELAALRAELEAERAELEALL-AEL 183
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 975830145 1204 SNLRNKYETEINITKTTIKEISMQKeddsKNLRNQLDRLSRENRDLKDEIVRLndsILQATEQRRRA 1270
Cdd:COG4942 184 EEERAALEALKAERQKLLARLEKEL----AELAAELAELQQEAEELEALIARL---EAEAAAAAERT 243
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1160-1341 |
1.87e-06 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 53.24 E-value: 1.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1160 SEKAIKEKEYEIERLrvlLQEEgtrKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQL 1239
Cdd:PRK12704 29 AEAKIKEAEEEAKRI---LEEA---KKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKL 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1240 DRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHleiELKQVMQQRSEDNARHkqsleEAAKT 1319
Cdd:PRK12704 103 ELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAE---EAKEILLEKVEEEARH-----EAAVL 174
|
170 180
....*....|....*....|..
gi 975830145 1320 IqdknKEIErlkAEFQEEAKRR 1341
Cdd:PRK12704 175 I----KEIE---EEAKEEADKK 189
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
927-1502 |
2.27e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 53.43 E-value: 2.27e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 927 LNEQKNLHSEISGKRDKSEEVQKIAELCAN---SIKDYELQLASYTSGLETLLNIPIKRTMIQSPSGV--ILQEAADVHA 1001
Cdd:TIGR00618 235 LQQTQQSHAYLTQKREAQEEQLKKQQLLKQlraRIEELRAQEAVLEETQERINRARKAAPLAAHIKAVtqIEQQAQRIHT 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANS------ENCNKNKFLDQNLQKYQAECSQFKAK 1075
Cdd:TIGR00618 315 ELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEvatsirEISCQQHTLTQHIHTLQQQKTTLTQK 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1076 LASLEELKRQ-AELDGKSAKQNLDKcygqiKELNEKITRLTYEIEDEKRR----RKSVEDRFDQQKNDYDQLQKARQCEK 1150
Cdd:TIGR00618 395 LQSLCKELDIlQREQATIDTRTSAF-----RDLQGQLAHAKKQQELQQRYaelcAAAITCTAQCEKLEKIHLQESAQSLK 469
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1151 ENLgwQKLESEKAIKEKEYEIERLRVLLQEEgtrKREYENELAKVRNHYNEEMSNLrnkYETEINITKT-TIKEISMQKE 1229
Cdd:TIGR00618 470 ERE--QQLQTKEQIHLQETRKKAVVLARLLE---LQEEPCPLCGSCIHPNPARQDI---DNPGPLTRRMqRGEQTYAQLE 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1230 DDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRR--------------------AEENALQQKACGSEIMQKKQ 1289
Cdd:TIGR00618 542 TSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRskedipnlqnitvrlqdlteKLSEAEDMLACEQHALLRKL 621
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1290 HLEIELKQV---MQQRSEDNARHKQSLEEAAKTI-QDKNKEIERLKAEFQEEAKRRweYENELSKASNRIQesknQCTQV 1365
Cdd:TIGR00618 622 QPEQDLQDVrlhLQQCSQELALKLTALHALQLTLtQERVREHALSIRVLPKELLAS--RQLALQKMQSEKE----QLTYW 695
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1366 VQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAEtrVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKS 1445
Cdd:TIGR00618 696 KEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSD--LAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAA 773
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 975830145 1446 EREKNSL----------RSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:TIGR00618 774 LQTGAELshlaaeiqffNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEK 840
|
|
| Macoilin |
pfam09726 |
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 ... |
1250-1417 |
2.30e-06 |
|
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 trasnmembrane helices, followed by a C-terminal coiled-coil region. Macoilin is a highly conserved protein present in eukaryotes. Macoilin appears to be found in the ER and be involved in the function of neurons.
Pssm-ID: 462859 [Multi-domain] Cd Length: 670 Bit Score: 52.93 E-value: 2.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1250 KDEIVRLNDSI--LQATEQRRRAEENALQQK---------ACGSEIMQKKQH---LEIELKQVMQQRSEDnarhKQSLEE 1315
Cdd:pfam09726 394 PDALVRLEQDIkkLKAELQASRQTEQELRSQissltslerSLKSELGQLRQEndlLQTKLHNAVSAKQKD----KQTVQQ 469
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1316 AAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTqvvqerESLlvkikvleqdKARLQRLEDELNR 1395
Cdd:pfam09726 470 LEKRLKAEQEARASAEKQLAEEKKRKKEEEATAARAVALAAASRGECT------ESL----------KQRKRELESEIKK 533
|
170 180
....*....|....*....|....*.
gi 975830145 1396 akstLEAETRVKQR----LECEKQQI 1417
Cdd:pfam09726 534 ----LTHDIKLKEEqireLEIKVQEL 555
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1289-1502 |
2.87e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 52.85 E-value: 2.87e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1289 QHLEIELKQVMQQRSEDNARHKQSLEEA-----AKTIQDKNKEIERLKAEFQEEAKRRWEyenELSKASNRIQESKNQCT 1363
Cdd:COG4717 15 RDRTIEFSPGLNVIYGPNEAGKSTLLAFiramlLERLEKEADELFKPQGRKPELNLKELK---ELEEELKEAEEKEEEYA 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1364 QVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQrLECEKQQIQNDLNQWKTQYSRKEEAIRKIESERE 1443
Cdd:COG4717 92 ELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEA-LEAELAELPERLEELEERLEELRELEEELEELEA 170
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 975830145 1444 KSEREKNSLRSEIERLQAEIKRIEERCRRKLEDStRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG4717 171 ELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEE 228
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
713-1487 |
3.28e-06 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 52.74 E-value: 3.28e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 713 QNDSQAIAEVLNQLKDMLANfrGSEKYCYLQNEVFGLFQKLENINGVTDGYLNSLCTVRALLQAILQTEDMLKVYEARLT 792
Cdd:TIGR00606 404 EDEAKTAAQLCADLQSKERL--KQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQELR 481
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 793 EEETVCLDLDKVEAYRCGLKKIKNDLNLKKSLLATMKTELQKAQQIHSQTSQQYPLYDLDLGKfgekvtqlTDRWQRIDK 872
Cdd:TIGR00606 482 KAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDK--------MDKDEQIRK 553
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 873 QidfrLWDLEKQIKQLRNYRDNYQAFCKWLYDAKRRQDSLESmkfgdsntvmRFLNEQKNLHS-EISGKRDKSEEVQKIA 951
Cdd:TIGR00606 554 I----KSRHSDELTSLLGYFPNKKQLEDWLHSKSKEINQTRD----------RLAKLNKELASlEQNKNHINNELESKEE 619
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 952 ELCANSIKDYElqlASYTSGLETLLNIPIKRTMIQSPSGVILQEAADVHARYIELLT-----------RSGDYYRFLSEM 1020
Cdd:TIGR00606 620 QLSSYEDKLFD---VCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTdenqsccpvcqRVFQTEAELQEF 696
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1021 LKSLED-LKLKNTKIEVLEEELRlardansencNKNKFLDQNLQKYQAECSQFKAKLASLEELK---RQAELDGKSAKQN 1096
Cdd:TIGR00606 697 ISDLQSkLRLAPDKLKSTESELK----------KKEKRRDEMLGLAPGRQSIIDLKEKEIPELRnklQKVNRDIQRLKND 766
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1097 LDKCYGQIKELNEKItrltyEIEDEKRRRKSVEDRFDQQKND----YDQLQKARQCEKENLGWQKLESEKAIKEKEY--- 1169
Cdd:TIGR00606 767 IEEQETLLGTIMPEE-----ESAKVCLTDVTIMERFQMELKDverkIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELdtv 841
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1170 --EIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKyeteinitkttikeisMQKEDDSKNLRNQLDRLSRENR 1247
Cdd:TIGR00606 842 vsKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRR----------------QQFEEQLVELSTEVQSLIREIK 905
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1248 DLKDEIVRLNdsilQATEQRRRAEENALQQKACGSEIMQKK-QHLEIELKQVMQQRSEDNARHKQSLEEAAKtiqDKNKE 1326
Cdd:TIGR00606 906 DAKEQDSPLE----TFLEKDQQEKEELISSKETSNKKAQDKvNDIKEKVKNIHGYMKDIENKIQDGKDDYLK---QKETE 978
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1327 IERLKAEFQEEAKRRWEYENELSKASNRIQESKnqctqvvqERESLLvkikvleQDKARLQRLEDELNRAKSTLeaetrv 1406
Cdd:TIGR00606 979 LNTVNAQLEECEKHQEKINEDMRLMRQDIDTQK--------IQERWL-------QDNLTLRKRENELKEVEEEL------ 1037
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1407 KQRLeceKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKriEERCRRKlEDSTRETQSQLE 1486
Cdd:TIGR00606 1038 KQHL---KEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELR--EPQFRDA-EEKYREMMIVMR 1111
|
.
gi 975830145 1487 T 1487
Cdd:TIGR00606 1112 T 1112
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1922-1957 |
4.68e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 45.17 E-value: 4.68e-06
10 20 30
....*....|....*....|....*....|....*.
gi 975830145 1922 IRLLEAQIATGGIIDPKESHRLPVDIAYKRGYFNEE 1957
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
1058-1473 |
4.92e-06 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 52.21 E-value: 4.92e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1058 LDQNLQKY----------QAECSQFKAKLASLEELKRQA------ELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDE 1121
Cdd:PRK01156 317 IDAEINKYhaiikklsvlQKDYNDYIKKKSRYDDLNNQIlelegyEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEI 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1122 KRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGwqklESEKAIKEKEYEIERLRVLLQEE------GTRKREYENElaKV 1195
Cdd:PRK01156 397 LKIQEIDPDAIKKELNEINVKLQDISSKVSSLN----QRIRALRENLDELSRNMEMLNGQsvcpvcGTTLGEEKSN--HI 470
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1196 RNHYNEEMSNLrnkyETEINITKTTIKEISMQKEDDSKNL----RNQLDRLSRENRDLKD---EIVRLNDSILQATEQRR 1268
Cdd:PRK01156 471 INHYNEKKSRL----EEKIREIEIEVKDIDEKIVDLKKRKeyleSEEINKSINEYNKIESaraDLEDIKIKINELKDKHD 546
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1269 RAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQslEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENEL 1348
Cdd:PRK01156 547 KYEEIKNRYKSLKLEDLDSKRTSWLNALAVISLIDIETNRSRS--NEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREI 624
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1349 SKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNrakstleaetrvkqrlecekqQIQNDLNQWKTQY 1428
Cdd:PRK01156 625 ENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLK---------------------EITSRINDIEDNL 683
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 975830145 1429 SRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRK 1473
Cdd:PRK01156 684 KKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESM 728
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1281-1491 |
5.95e-06 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 51.06 E-value: 5.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1281 GSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKN 1360
Cdd:COG4372 1 GDRLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEE 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1361 QctqVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIrkies 1440
Cdd:COG4372 81 E---LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEEL----- 152
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 975830145 1441 erekserekNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSR 1491
Cdd:COG4372 153 ---------KELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANR 194
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2020-2050 |
6.28e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 44.78 E-value: 6.28e-06
10 20 30
....*....|....*....|....*....|.
gi 975830145 2020 KRRVVIVDPETNKEMSVQEAYKKGLIDYETF 2050
Cdd:smart00250 8 SAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
971-1407 |
7.14e-06 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 51.05 E-value: 7.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 971 GLETLLNIPiKRTMIQSPSGVILQEAADVH-ARYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANS 1049
Cdd:pfam07888 19 GTDMLLVVP-RAELLQNRLEECLQERAELLqAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHE 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1050 ENCNKNKfldqNLQKYQAECSQFKAKLASLEELKRQaeldgksakqnldkcygQIKELNEKITRLT---YEIEDEKRRRK 1126
Cdd:pfam07888 98 ELEEKYK----ELSASSEELSEEKDALLAQRAAHEA-----------------RIRELEEDIKTLTqrvLERETELERMK 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1127 svedrfDQQKNDYDQLqkarqcekenlgwqklesekaiKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNL 1206
Cdd:pfam07888 157 ------ERAKKAGAQR----------------------KEEEAERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQV 208
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1207 RNKYETeinITKTTIK-EISMQKEDDSKNLRNQLdRLSREnrdlkdeivRLNDSILQATEQRRRAEENALQQKACGSEIM 1285
Cdd:pfam07888 209 LQLQDT---ITTLTQKlTTAHRKEAENEALLEEL-RSLQE---------RLNASERKVEGLGEELSSMAAQRDRTQAELH 275
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1286 QKK---QHLEIELKQVMQQRSEDNAR---HKQSLEEAAKTIQDK----NKEIERLKAEFQEEAKRRWEYENELS--KASN 1353
Cdd:pfam07888 276 QARlqaAQLTLQLADASLALREGRARwaqERETLQQSAEADKDRieklSAELQRLEERLQEERMEREKLEVELGreKDCN 355
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1354 RIQESKNqCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVK 1407
Cdd:pfam07888 356 RVQLSES-RRELQELKASLRVAQKEKEQLQAEKQELLEYIRQLEQRLETVADAK 408
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1199-1502 |
8.42e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 51.19 E-value: 8.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1199 YNEEMSNLRNKYETEINITKTTIKEISMQKED-DSKNLRNQLDRLSRENRDLKDEIVRLNdsilqatEQRRRAEENALQQ 1277
Cdd:PRK02224 167 YRERASDARLGVERVLSDQRGSLDQLKAQIEEkEEKDLHERLNGLESELAELDEEIERYE-------EQREQARETRDEA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1278 KACGSEIMQKKQHLEiELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEE--------------AKRRWE 1343
Cdd:PRK02224 240 DEVLEEHEERREELE-TLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLlaeaglddadaeavEARREE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1344 YENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQ----RLEDELNRAKSTLeaETRVKQRLECEKQqiqn 1419
Cdd:PRK02224 319 LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELReeaaELESELEEAREAV--EDRREEIEELEEE---- 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1420 dLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEER---------------CRRKLEDS------- 1477
Cdd:PRK02224 393 -IEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERveeaealleagkcpeCGQPVEGSphvetie 471
|
330 340
....*....|....*....|....*.
gi 975830145 1478 -TRETQSQLETERSRYQREIDKLRQR 1502
Cdd:PRK02224 472 eDRERVEELEAELEDLEEEVEEVEER 497
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1680-1716 |
9.12e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 44.40 E-value: 9.12e-06
10 20 30
....*....|....*....|....*....|....*..
gi 975830145 1680 RLLEAQIASGGVVDPVNSVFLPKDVALARGLIDRDLY 1716
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PLEC |
smart00250 |
Plectin repeat; |
1885-1921 |
9.21e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 44.40 E-value: 9.21e-06
10 20 30
....*....|....*....|....*....|....*..
gi 975830145 1885 KLLSAERAVTGYNDPETGNIISLFQAMNKELIEKGHG 1921
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| MutS2 |
COG1193 |
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair]; |
1240-1390 |
1.01e-05 |
|
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
Pssm-ID: 440806 [Multi-domain] Cd Length: 784 Bit Score: 50.91 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1240 DRLSRENRDLKDEIVRLNDSILQATEQRRRAEEnaLQQkacgsEIMQKKQHLEIELKQVMQQRsedNARHKQSLEEAAKT 1319
Cdd:COG1193 507 ELLGEESIDVEKLIEELERERRELEEEREEAER--LRE-----ELEKLREELEEKLEELEEEK---EEILEKAREEAEEI 576
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 975830145 1320 IQDKNKEIERLKAEFQEEAKRrweyENELSKASNRIQESKnqctqvvQERESLLVKIKVLEQDKARLQRLE 1390
Cdd:COG1193 577 LREARKEAEELIRELREAQAE----EEELKEARKKLEELK-------QELEEKLEKPKKKAKPAKPPEELK 636
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1295-1502 |
1.29e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 50.79 E-value: 1.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1295 LKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEakrrweyeNELSKASNRIQESKNQCTQVVQERESLLV 1374
Cdd:COG3206 162 LEQNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRQK--------NGLVDLSEEAKLLLQQLSELESQLAEARA 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1375 KikvLEQDKARLQRLEDELNRAKSTLEA--ETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRkiesereksereknSL 1452
Cdd:COG3206 234 E---LAEAEARLAALRAQLGSGPDALPEllQSPVIQQLRAQLAELEAELAELSARYTPNHPDVI--------------AL 296
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 975830145 1453 RSEIERLQAEIKRIEERcrrkLEDSTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG3206 297 RAQIAALRAQLQQEAQR----ILASLEAELEALQAREASLQAQLAQLEAR 342
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
1810-1848 |
1.30e-05 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 43.86 E-value: 1.30e-05
10 20 30
....*....|....*....|....*....|....*....
gi 975830145 1810 FLQGSSCIAGIYNETTKQKLGIYEAMKIGLVRPGTALEL 1848
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1229-1502 |
2.11e-05 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 50.22 E-value: 2.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1229 EDDSKNLRNQLDRLSRENRDLKDEIVRLndsilqATEQRRRAEENAlqqkacgseimQKKQHLEIELKQVMQQRSEDNAR 1308
Cdd:pfam12128 247 QQEFNTLESAELRLSHLHFGYKSDETLI------ASRQEERQETSA-----------ELNQLLRTLDDQWKEKRDELNGE 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1309 hkqsLEEAAKTIQDKNKEIERLkaefqEEAKRRWEYENELSKASNRIQES--KNQCTQVVQERESLLVKIKVLEQDKARL 1386
Cdd:pfam12128 310 ----LSAADAAVAKDRSELEAL-----EDQHGAFLDADIETAAADQEQLPswQSELENLEERLKALTGKHQDVTAKYNRR 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1387 -----QRLEDELNRAKSTLEA--ETRVKQRLECEK--QQIQNDLNQWKTQYSR--KEEAIRKIESEREKSEREKNSLRSE 1455
Cdd:pfam12128 381 rskikEQNNRDIAGIKDKLAKirEARDRQLAVAEDdlQALESELREQLEAGKLefNEEEYRLKSRLGELKLRLNQATATP 460
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 975830145 1456 IERLQAEIKriEERCRRkledsTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:pfam12128 461 ELLLQLENF--DERIER-----AREEQEAANAEVERLQSELRQARKR 500
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1153-1393 |
2.99e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 2.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1153 LGWQkleSEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYnEEMSNLRNKYETEINI--TKTTIKEISMQKE- 1229
Cdd:COG4913 604 LGFD---NRAKLAALEAELAELEEELAEAEERLEALEAELDALQERR-EALQRLAEYSWDEIDVasAEREIAELEAELEr 679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1230 -----DDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEEN-----ALQQKACGSEIMQKKQHLEIELKQVM 1299
Cdd:COG4913 680 ldassDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEEldelqDRLEAAEDLARLELRALLEERFAAAL 759
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1300 QQRSEDNARHK--QSLEEAAKTIQDKNKEIERLKAEFqeeaKRRW---------------EYENELSKASN--------R 1354
Cdd:COG4913 760 GDAVERELRENleERIDALRARLNRAEEELERAMRAF----NREWpaetadldadleslpEYLALLDRLEEdglpeyeeR 835
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 975830145 1355 IQESKNQCTQvvQERESLLVKIKVLEQD-KARLQRLEDEL 1393
Cdd:COG4913 836 FKELLNENSI--EFVADLLSKLRRAIREiKERIDPLNDSL 873
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1025-1338 |
3.37e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 48.75 E-value: 3.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1025 EDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQI 1104
Cdd:COG4372 52 EELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQR 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1105 KELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLR---VLLQEE 1181
Cdd:COG4372 132 KQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELaeaEKLIES 211
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1182 GTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSIL 1261
Cdd:COG4372 212 LPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEA 291
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 975830145 1262 QATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEA 1338
Cdd:COG4372 292 ALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVA 368
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1077-1351 |
3.41e-05 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 49.10 E-value: 3.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1077 ASLEELKRQAELDGKSAKQNLDKCygqiKELNEKITRLTYEIEdEKRRRKSVEDRFDQQKNDYDQLQKARQCEKEnlgwq 1156
Cdd:pfam02029 63 AFLDRTAKREERRQKRLQEALERQ----KEFDPTIADEKESVA-ERKENNEEEENSSWEKEEKRDSRLGRYKEEE----- 132
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1157 KLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINIT--------KTTIKEISMQK 1228
Cdd:pfam02029 133 TEIREKEYQENKWSTEVRQAEEEGEEEEDKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYEskvfldqkRGHPEVKSQNG 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1229 EDDSKNLRNQLDRLSRENRDL---KDEIVRLNDSILQATEQRRRAEENALQQKacgSEIMQKKQHLEIELKQVMQQRSED 1305
Cdd:pfam02029 213 EEEVTKLKVTTKRRQGGLSQSqerEEEAEVFLEAEQKLEELRRRRQEKESEEF---EKLRQKQQEAELELEELKKKREER 289
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 975830145 1306 narhKQSLEEAAKtiQDKNKEIERLKAEfqEEAKRRWEYENELSKA 1351
Cdd:pfam02029 290 ----RKLLEEEEQ--RRKQEEAERKLRE--EEEKRRMKEEIERRRA 327
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
285-373 |
3.53e-05 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 44.63 E-value: 3.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 285 WINDCEEEELLYDWSDKNTNIAQKQEAFSIRMSQLEVKEKELNKLKQESDQLVLNQHPASDKIEAYMDTLQTQWSWILQI 364
Cdd:smart00150 13 WLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLEELNERWEELKEL 92
|
....*....
gi 975830145 365 TKCIDVHLK 373
Cdd:smart00150 93 AEERRQKLE 101
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1054-1417 |
5.20e-05 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 48.66 E-value: 5.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1054 KNKF--LDQNLQKYQAECSQFKAKLASL-----------EELKR------------QAELDG-----KSAKQNLDKCYGQ 1103
Cdd:pfam10174 288 KNKIdqLKQELSKKESELLALQTKLETLtnqnsdckqhiEVLKEsltakeqraailQTEVDAlrlrlEEKESFLNKKTKQ 367
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1104 IKELNEKITRLTYEIEDEKrrrksveDRFDQQKNDYDQLQKarqcEKENLGWQklesekaIKEKEYEIERLRVLLQEEGT 1183
Cdd:pfam10174 368 LQDLTEEKSTLAGEIRDLK-------DMLDVKERKINVLQK----KIENLQEQ-------LRDKDKQLAGLKERVKSLQT 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1184 RKREYENELAKVRNHYNEEmsnlrnkyeteinitKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQA 1263
Cdd:pfam10174 430 DSSNTDTALTTLEEALSEK---------------ERIIERLKEQREREDRERLEELESLKKENKDLKEKVSALQPELTEK 494
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1264 TEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHK--QSLEEAAKT---IQDKNKEIERLKAEFQEEA 1338
Cdd:pfam10174 495 ESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKkaHNAEEAVRTnpeINDRIRLLEQEVARYKEES 574
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1339 KRRW-----------EYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQD--KARLQRLEDELNRAKS------- 1398
Cdd:pfam10174 575 GKAQaeverllgilrEVENEKNDKDKKIAELESLTLRQMKEQNKKVANIKHGQQEmkKKGAQLLEEARRREDNladnsqq 654
|
410 420
....*....|....*....|....
gi 975830145 1399 -----TLEAETRVKQRLECEKQQI 1417
Cdd:pfam10174 655 lqleeLMGALEKTRQELDATKARL 678
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1349-1496 |
5.41e-05 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 48.24 E-value: 5.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1349 SKASNRIQESKNQCTQVVQERESllvKIKVLEQDKarLQRLEDELNRAKSTLEAETRVKQRlecEKQQIQNDLNQWKTQY 1428
Cdd:PRK12704 27 KIAEAKIKEAEEEAKRILEEAKK---EAEAIKKEA--LLEAKEEIHKLRNEFEKELRERRN---ELQKLEKRLLQKEENL 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1429 SRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDS---TRE---------TQSQLETERSRYQREI 1496
Cdd:PRK12704 99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERIsglTAEeakeillekVEEEARHEAAVLIKEI 178
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2168-2204 |
5.66e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 42.08 E-value: 5.66e-05
10 20 30
....*....|....*....|....*....|....*..
gi 975830145 2168 DTLEESSPIAAIFDTENLEKISITEGIERGIVDSITG 2204
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1227-1500 |
5.78e-05 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 48.35 E-value: 5.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1227 QKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLeielkqvMQQRSEDN 1306
Cdd:pfam07888 56 QREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDAL-------LAQRAAHE 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1307 ARHKQsLEEAAKTIqdknkeierlkaefqeeAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKikvLEQDKARL 1386
Cdd:pfam07888 129 ARIRE-LEEDIKTL-----------------TQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAK---LQQTEEEL 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1387 QRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKE---EAIRKIESEREKSEREKNSLRSEIE------ 1457
Cdd:pfam07888 188 RSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEallEELRSLQERLNASERKVEGLGEELSsmaaqr 267
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1458 -RLQAEIKRIEERCRR---KLEDST---RETQSQLETERSRYQR--EIDKLR 1500
Cdd:pfam07888 268 dRTQAELHQARLQAAQltlQLADASlalREGRARWAQERETLQQsaEADKDR 319
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1292-1502 |
6.24e-05 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 48.80 E-value: 6.24e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1292 EIELKQVMQQRSE----------DNARHKQSLEEAAKTIQDKNKEIERLKAEFQEE-AKRRWEYENELSKAsnriQESKN 1360
Cdd:COG3096 835 EAELAALRQRRSElerelaqhraQEQQLRQQLDQLKEQLQLLNKLLPQANLLADETlADRLEELREELDAA----QEAQA 910
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1361 ---QCTQVVQERESLLVkikVLEQD-------KARLQRLEDELNRAKSTLEAETRVKQRLE-----------CEKQ---- 1415
Cdd:COG3096 911 fiqQHGKALAQLEPLVA---VLQSDpeqfeqlQADYLQAKEQQRRLKQQIFALSEVVQRRPhfsyedavgllGENSdlne 987
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1416 QIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRS-------EIERLQAEIKRI--------EERC---RRKLED- 1476
Cdd:COG3096 988 KLRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSsrdakqqTLQELEQELEELgvqadaeaEERArirRDELHEe 1067
|
250 260
....*....|....*....|....*...
gi 975830145 1477 --STRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG3096 1068 lsQNRSRRSQLEKQLTRCEAEMDSLQKR 1095
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1251-1462 |
7.26e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 47.52 E-value: 7.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1251 DEIVRLNDSILQATEQRRRAEE--NALQQKAcgSEIMQKKQHLEIELKQVmqqrsednarhKQSLEEAAKTIQDKNKEIE 1328
Cdd:COG3883 16 PQIQAKQKELSELQAELEAAQAelDALQAEL--EELNEEYNELQAELEAL-----------QAEIDKLQAEIAEAEAEIE 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1329 RLKAEFQEEAK-------------------------RRWEYENELSKASNRIQEsknqctQVVQERESLLVKIKVLEQDK 1383
Cdd:COG3883 83 ERREELGERARalyrsggsvsyldvllgsesfsdflDRLSALSKIADADADLLE------ELKADKAELEAKKAELEAKL 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1384 ARLQRLEDELNRAKSTLEAE----TRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERL 1459
Cdd:COG3883 157 AELEALKAELEAAKAELEAQqaeqEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAA 236
|
...
gi 975830145 1460 QAE 1462
Cdd:COG3883 237 AAA 239
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
770-1277 |
7.66e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 48.14 E-value: 7.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 770 VRALLQAILQTEDMLKVYEARLTEEETVCLDLD----KVEAYRcGLKKIKNDLNLKKS----LLATMKTELQKAQQIHSQ 841
Cdd:PRK03918 254 KRKLEEKIRELEERIEELKKEIEELEEKVKELKelkeKAEEYI-KLSEFYEEYLDELReiekRLSRLEEEINGIEERIKE 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 842 TSQQyplyDLDLGKFGEKVTQLTDRWQRIDKqiDFRLWDLEKQIK-QLRNYRDNYQafCKWLYDAKRRQDSLESMKFGDS 920
Cdd:PRK03918 333 LEEK----EERLEELKKKLKELEKRLEELEE--RHELYEEAKAKKeELERLKKRLT--GLTPEKLEKELEELEKAKEEIE 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 921 NTVMRFLNEQKNLHSEISGKRDKSEEVQK---IAELCANSIKDYELQ--LASYTSGLETLLNIPIKRTMIQSPSGVILQE 995
Cdd:PRK03918 405 EEISKITARIGELKKEIKELKKAIEELKKakgKCPVCGRELTEEHRKelLEEYTAELKRIEKELKEIEEKERKLRKELRE 484
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 996 AADVHARYIELLTrsgdyYRFLSEMLKSLEDlKLKNTKIEVLEEELRLARDANSEncnknkfldqnLQKYQAECSQFKAK 1075
Cdd:PRK03918 485 LEKVLKKESELIK-----LKELAEQLKELEE-KLKKYNLEELEKKAEEYEKLKEK-----------LIKLKGEIKSLKKE 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1076 LASLEEL-KRQAELDGKsakqnLDKCYGQIKELNEKITRLTYEIEDEkrrrksVEDRFDQQKNDYDQLQKARQCEKEnlg 1154
Cdd:PRK03918 548 LEKLEELkKKLAELEKK-----LDELEEELAELLKELEELGFESVEE------LEERLKELEPFYNEYLELKDAEKE--- 613
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1155 wqKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYN-EEMSNLRNKY---ETEINITKTTIKEISMQKED 1230
Cdd:PRK03918 614 --LEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSeEEYEELREEYlelSRELAGLRAELEELEKRREE 691
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1231 DSKNLRNqLDRLSRENRDLKDEIVRLNDSILQATEQRRR-------AEENALQQ 1277
Cdd:PRK03918 692 IKKTLEK-LKEELEEREKAKKELEKLEKALERVEELREKvkkykalLKERALSK 744
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1023-1375 |
8.66e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 48.04 E-value: 8.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1023 SLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLasLEELKRQAELDGKSAKQNLDKCYG 1102
Cdd:pfam02463 669 SELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEEL--LADRVQEAQDKINEELKLLKQKID 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1103 QIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNdYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLR---VLLQ 1179
Cdd:pfam02463 747 EEEEEEEKSRLKKEEKEEEKSELSLKEKELAEERE-KTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEeeqLLIE 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1180 EEGTRKREYENELAK------VRNHYNEEMSNLRNKYETEinITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEI 1253
Cdd:pfam02463 826 QEEKIKEEELEELALelkeeqKLEKLAEEELERLEEEITK--EELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELE 903
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1254 VRLNDSILQATEQRRRAEENALQQkacgsEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAE 1333
Cdd:pfam02463 904 EESQKLNLLEEKENEIEERIKEEA-----EILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLM 978
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 975830145 1334 FQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVK 1375
Cdd:pfam02463 979 AIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLK 1020
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1219-1502 |
8.69e-05 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 46.83 E-value: 8.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1219 TTIKEISMQKEDdsknLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRraeenalqqkacgSEIMQKKQHLEiELKQV 1298
Cdd:COG1340 1 SKTDELSSSLEE----LEEKIEELREEIEELKEKRDELNEELKELAEKRD-------------ELNAQVKELRE-EAQEL 62
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1299 MQQRSEDN---ARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYEnELSKASNRIqESKNQCTQVVQERESLLV- 1374
Cdd:COG1340 63 REKRDELNekvKELKEERDELNEKLNELREELDELRKELAELNKAGGSID-KLRKEIERL-EWRQQTEVLSPEEEKELVe 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1375 KIKVLEQdkaRLQRLEDELNRAKSTLEAETRVKQrLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRS 1454
Cdd:COG1340 141 KIKELEK---ELEKAKKALEKNEKLKELRAELKE-LRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHK 216
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 975830145 1455 EIERLQAEIKRIeercrrkledstRETQSQLETERSRYQREIDKLRQR 1502
Cdd:COG1340 217 EIVEAQEKADEL------------HEEIIELQKELRELRKELKKLRKK 252
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1073-1204 |
9.18e-05 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 47.93 E-value: 9.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1073 KAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNdydQLQKARQCEKEn 1152
Cdd:COG2433 384 ELIEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERLEAEVEELEAELEEKDERIERLER---ELSEARSEERR- 459
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1153 lgwqKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMS 1204
Cdd:COG2433 460 ----EIRKDREISRLDREIERLERELEEERERIEELKRKLERLKELWKLEHS 507
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1313-1594 |
1.19e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 1.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1313 LEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDE 1392
Cdd:PRK03918 160 YENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEE 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1393 LNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKnSLRSEIERLQAEIKRIEERCRR 1472
Cdd:PRK03918 240 IEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYI-KLSEFYEEYLDELREIEKRLSR 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1473 kLEDSTRETQSQLEtERSRYQREIDKLRQRpygsHRETQTECEwTVDTSKLVFDGLRKKVTAMQLYECQLIDKTTLD--- 1549
Cdd:PRK03918 319 -LEEEINGIEERIK-ELEEKEERLEELKKK----LKELEKRLE-ELEERHELYEEAKAKKEELERLKKRLTGLTPEKlek 391
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 975830145 1550 KLLKGKKSVEEVASEIQPFLRGAGSIAGASASPKEKYS-LVEAKRK 1594
Cdd:PRK03918 392 ELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEeLKKAKGK 437
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1159-1415 |
1.27e-04 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 46.44 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1159 ESEKAIKEKeyeIERLRVLLQEEGTRKREYENELAKV---RNHYNEEMSNLRNKYETEINITKTTIKEIsmqkeddsKNL 1235
Cdd:COG1340 8 SSLEELEEK---IEELREEIEELKEKRDELNEELKELaekRDELNAQVKELREEAQELREKRDELNEKV--------KEL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1236 RNQLDRLSRENRDLKDEIVRLNDsiLQATEQRRRAEENALQQKACGSEIMQKKQHLEIEL-KQVMQQRSEdnarhKQSLE 1314
Cdd:COG1340 77 KEERDELNEKLNELREELDELRK--ELAELNKAGGSIDKLRKEIERLEWRQQTEVLSPEEeKELVEKIKE-----LEKEL 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1315 EAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKI------------------ 1376
Cdd:COG1340 150 EKAKKALEKNEKLKELRAELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEAdelhkeiveaqekadelh 229
|
250 260 270
....*....|....*....|....*....|....*....
gi 975830145 1377 KVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQ 1415
Cdd:COG1340 230 EEIIELQKELRELRKELKKLRKKQRALKREKEKEELEEK 268
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1160-1481 |
1.27e-04 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 47.32 E-value: 1.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1160 SEKAIKEKEYEIERLRVLLQEEGTRKREYEN-----ELAKVRNHYNEEMSNLRNKYETEIniTKTTIKEISMQKEDDSKN 1234
Cdd:NF033838 56 QKEHAKEVESHLEKILSEIQKSLDKRKHTQNvalnkKLSDIKTEYLYELNVLKEKSEAEL--TSKTKKELDAAFEQFKKD 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1235 LRNQLDRLSRENRDLKDEIVRLNDsilQATEQRRRAEENA---LQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQ 1311
Cdd:NF033838 134 TLEPGKKVAEATKKVEEAEKKAKD---QKEEDRRNYPTNTyktLELEIAESDVEVKKAELELVKEEAKEPRDEEKIKQAK 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1312 SLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYE-----------NELSKASNRIQ---------------ESKNQCTQV 1365
Cdd:NF033838 211 AKVESKKAEATRLEKIKTDREKAEEEAKRRADAKlkeaveknvatSEQDKPKRRAKrgvlgepatpdkkenDAKSSDSSV 290
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1366 VQE---RESLLVKIKVLE--------QDKARLQRLEDELNRAKST---LE---AETRVKQRlECEKQQIQNDLNQwktqy 1428
Cdd:NF033838 291 GEEtlpSPSLKPEKKVAEaekkveeaKKKAKDQKEEDRRNYPTNTyktLEleiAESDVKVK-EAELELVKEEAKE----- 364
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1429 SRKEEAIRKIESEREKSEREKNSLrseiERLQAEIKRIEERCRRK--LEDSTRET 1481
Cdd:NF033838 365 PRNEEKIKQAKAKVESKKAEATRL----EKIKTDRKKAEEEAKRKaaEEDKVKEK 415
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1289-1498 |
1.39e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 45.69 E-value: 1.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1289 QHLEIELKQVMQQRsednARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQE 1368
Cdd:COG1579 13 QELDSELDRLEHRL----KELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1369 RE--SLLvkiKVLEQDKARLQRLEDELnrakstLEAETRVKQrlecekqqiqndlnqwktqysrKEEAIRKieserekse 1446
Cdd:COG1579 89 KEyeALQ---KEIESLKRRISDLEDEI------LELMERIEE----------------------LEEELAE--------- 128
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1447 reknsLRSEIERLQAEIKRIEERCRRKLEDsTRETQSQLETERSRYQREIDK 1498
Cdd:COG1579 129 -----LEAELAELEAELEEKKAELDEELAE-LEAELEELEAEREELAAKIPP 174
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1270-1493 |
1.58e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 46.75 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1270 AEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARhkqsLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELS 1349
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEE----YNELQAELEALQAEIDKLQAEIAEAEAEIEERREELG 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1350 KASNRIQESKNQCTQVvqereSLLVKIKVLEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYS 1429
Cdd:COG3883 90 ERARALYRSGGSVSYL-----DVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKA 164
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1430 RKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQ 1493
Cdd:COG3883 165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAA 228
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
1126-1476 |
1.79e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 46.82 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1126 KSVEDRFDQQKNDYDQLQ---KARQCEKENLGWQKLESEKAIKEKEYEIERLRVllqeegtrkrEYENELAKvRNHYNEE 1202
Cdd:PRK01156 172 KDVIDMLRAEISNIDYLEeklKSSNLELENIKKQIADDEKSHSITLKEIERLSI----------EYNNAMDD-YNNLKSA 240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1203 MSNL------RNKYETEINITKTTIKEISMqKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQ 1276
Cdd:PRK01156 241 LNELssledmKNRYESEIKTAESDLSMELE-KNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDA 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1277 QKACGSEIMQKKQHLEIELKQ--VMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNR 1354
Cdd:PRK01156 320 EINKYHAIIKKLSVLQKDYNDyiKKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISEILKI 399
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1355 -------IQESKNQCTQVVQERESllvKIKVLEQDKARLQRLEDELNRAKSTLEAETRVK---QRLECEKqqIQNDLNQW 1424
Cdd:PRK01156 400 qeidpdaIKKELNEINVKLQDISS---KVSSLNQRIRALRENLDELSRNMEMLNGQSVCPvcgTTLGEEK--SNHIINHY 474
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1425 KTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLED 1476
Cdd:PRK01156 475 NEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINKSINEYNKIES 526
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1103-1496 |
1.80e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 47.14 E-value: 1.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1103 QIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDY----DQLQKARQCEKENLGWQKLESEKAIKEKEYEIERL---- 1174
Cdd:pfam12128 252 TLESAELRLSHLHFGYKSDETLIASRQEERQETSAELnqllRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALedqh 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1175 RVLLQEE-GTRKREYENeLAKVRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQLDRL--SRENRDLKD 1251
Cdd:pfam12128 332 GAFLDADiETAAADQEQ-LPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLakIREARDRQL 410
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1252 EIVRlndSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSED---NARHKQSLEEAAKTIQDK-NKEI 1327
Cdd:pfam12128 411 AVAE---DDLQALESELREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPElllQLENFDERIERAREEQEAaNAEV 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1328 ERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQE-------------------RESL-------------LVK 1375
Cdd:pfam12128 488 ERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQlfpqagtllhflrkeapdwEQSIgkvispellhrtdLDP 567
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1376 IKVLEQDKARL---------------------QRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEA 1434
Cdd:pfam12128 568 EVWDGSVGGELnlygvkldlkridvpewaaseEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTA 647
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 975830145 1435 IRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRRKLE--------------DSTRETQSQLETERSRYQREI 1496
Cdd:pfam12128 648 LKNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEaqlkqldkkhqawlEEQKEQKREARTEKQAYWQVV 723
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1018-1500 |
1.92e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 47.14 E-value: 1.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1018 SEMLKSLEdLKLKNTKIEVLEEELRLARdansencnknkfldqnLQKyqaecsQFKAKLASLEELKRQAEldgKSAKQNL 1097
Cdd:pfam12128 250 FNTLESAE-LRLSHLHFGYKSDETLIAS----------------RQE------ERQETSAELNQLLRTLD---DQWKEKR 303
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1098 DKCYGQIKELNEKITRLTYEIED-EKRRRKSVEDRFDQQKNDYDQLQKAR-QCEKENLGWQKLESEKAIKEKEYEIERLR 1175
Cdd:pfam12128 304 DELNGELSAADAAVAKDRSELEAlEDQHGAFLDADIETAAADQEQLPSWQsELENLEERLKALTGKHQDVTAKYNRRRSK 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1176 VLLQ--------EEGT--RKREYENELAKVRNHYNEEMSNLRNKYE---TEINITKTTIKE-ISMQK---------EDDS 1232
Cdd:pfam12128 384 IKEQnnrdiagiKDKLakIREARDRQLAVAEDDLQALESELREQLEagkLEFNEEEYRLKSrLGELKlrlnqatatPELL 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1233 KNLRNQLDRLSR----------ENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQK---KQHLEIE-LKQV 1298
Cdd:pfam12128 464 LQLENFDERIERareeqeaanaEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQlfpQAGTLLHfLRKE 543
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1299 MQQRSEDNAR-------HKQSL--EEAAKTIQDKN---------KEIERLKAEFQEEAKRRweyenELSKASNRIQESKn 1360
Cdd:pfam12128 544 APDWEQSIGKvispellHRTDLdpEVWDGSVGGELnlygvkldlKRIDVPEWAASEEELRE-----RLDKAEEALQSAR- 617
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1361 qctQVVQERESLLVKIKvleqdkARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQwktqysRKEEAIRKIES 1440
Cdd:pfam12128 618 ---EKQAAAEEQLVQAN------GELEKASREETFARTALKNARLDLRRLFDEKQSEKDKKNK------ALAERKDSANE 682
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1441 EREKSEREKNSLRSEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSRYQREIDKLR 1500
Cdd:pfam12128 683 RLNSLEAQLKQLDKKHQAWLEEQKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARR 742
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
1272-1470 |
1.99e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 46.55 E-value: 1.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1272 ENALQQKACGSEIMQKKQHLEIELKQVMQQRS---EDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENEL 1348
Cdd:PHA02562 178 ELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKkngENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAAL 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1349 SKASNRIQESKNQ-------------------CTQVVQERESLLVKIKvlEQDKARLQRLEDELNRAKstlEAETRVKQR 1409
Cdd:PHA02562 258 NKLNTAAAKIKSKieqfqkvikmyekggvcptCTQQISEGPDRITKIK--DKLKELQHSLEKLDTAID---ELEEIMDEF 332
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 975830145 1410 LECEK--QQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERC 1470
Cdd:PHA02562 333 NEQSKklLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTK 395
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1339-1501 |
2.09e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 46.05 E-value: 2.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1339 KRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDkarLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQ 1418
Cdd:COG4372 10 KARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREE---LEQAREELEQLEEELEQARSELEQLEEELEELN 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1419 NDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERC-RRKLEDSTRETQ-SQLETERSRYQREI 1496
Cdd:COG4372 87 EQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIaELQSEIAEREEElKELEEQLESLQEEL 166
|
....*
gi 975830145 1497 DKLRQ 1501
Cdd:COG4372 167 AALEQ 171
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1235-1461 |
2.23e-04 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 46.21 E-value: 2.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1235 LRNQLDRLsRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSE---------- 1304
Cdd:pfam19220 12 LGEMADRL-EDLRSLKADFSQLIEPIEAILRELPQAKSRLLELEALLAQERAAYGKLRRELAGLTRRLSAaegeleelva 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1305 DNARHKQSL-------EEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIK 1377
Cdd:pfam19220 91 RLAKLEAALreaeaakEELRIELRDKTAQAEALERQLAAETEQNRALEEENKALREEAQAAEKALQRAEGELATARERLA 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1378 VLEQDKARLQRLEDELNRAKSTLEAE-TRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEI 1456
Cdd:pfam19220 171 LLEQENRRLQALSEEQAAELAELTRRlAELETQLDATRARLRALEGQLAAEQAERERAEAQLEEAVEAHRAERASLRMKL 250
|
....*
gi 975830145 1457 ERLQA 1461
Cdd:pfam19220 251 EALTA 255
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1258-1515 |
2.29e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 46.58 E-value: 2.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1258 DSILQAT---EQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQsLEEAAKTIQDKNKEIERLK--- 1331
Cdd:TIGR00606 176 DEIFSATryiKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQ-LESSREIVKSYENELDPLKnrl 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1332 AEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLV----KIKVLEQDKARLQR-LEDELNRAKSTLEAETRV 1406
Cdd:TIGR00606 255 KEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQgtdeQLNDLYHNHQRTVReKERELVDCQRELEKLNKE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1407 KQRLECEKQQIQNDLNQWKTQYSRKEEAIRKieserEKSEREKNSLRSEIERLQ------AEIKRIEERCRRKLEDSTR- 1479
Cdd:TIGR00606 335 RRLLNQEKTELLVEQGRLQLQADRHQEHIRA-----RDSLIQSLATRLELDGFErgpfseRQIKNFHTLVIERQEDEAKt 409
|
250 260 270
....*....|....*....|....*....|....*...
gi 975830145 1480 --ETQSQLETERSRYQREIDKLRQRPYGSHRETQTECE 1515
Cdd:TIGR00606 410 aaQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKE 447
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
707-1281 |
2.30e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 46.82 E-value: 2.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 707 NELKSVQND----SQAIAEVLNQLKDMLANF-RGSEKYCYLQNEVFGLFQKLENINGVTD---GYLNSLCTVRALLQAIL 778
Cdd:PRK01156 190 EKLKSSNLEleniKKQIADDEKSHSITLKEIeRLSIEYNNAMDDYNNLKSALNELSSLEDmknRYESEIKTAESDLSMEL 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 779 QTEDMLKVYEARLTEEETvcldlDKVEAYR---CGLKKIKNDLNLKKSLLATMKTELQKAQQIHSQTSqqyplydlDLGK 855
Cdd:PRK01156 270 EKNNYYKELEERHMKIIN-----DPVYKNRnyiNDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLS--------VLQK 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 856 fgekvtqltDRWQRIDKQIdfRLWDLEKQIKQLRNYRDNYQAFCKwlydakrrqdSLESMKFGDSNtvmrFLNEQKNLHS 935
Cdd:PRK01156 337 ---------DYNDYIKKKS--RYDDLNNQILELEGYEMDYNSYLK----------SIESLKKKIEE----YSKNIERMSA 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 936 EISGKRDKS----EEVQKIAELCANSIKDYELQLASYTSGLETLLN--IPIKRTM-IQSPSGVILQEAADVHARYIELLT 1008
Cdd:PRK01156 392 FISEILKIQeidpDAIKKELNEINVKLQDISSKVSSLNQRIRALREnlDELSRNMeMLNGQSVCPVCGTTLGEEKSNHII 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1009 RsgDYYrflsemlkslEDLKLKNTKIEVLEEElrlARDANSENCNKNKFLD----QNLQKYQAECSQFKAKLASLEELK- 1083
Cdd:PRK01156 472 N--HYN----------EKKSRLEEKIREIEIE---VKDIDEKIVDLKKRKEylesEEINKSINEYNKIESARADLEDIKi 536
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1084 RQAELDGKSAKQNldkcygqikELNEKITRLTYEIEDEKR----RRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQKLE 1159
Cdd:PRK01156 537 KINELKDKHDKYE---------EIKNRYKSLKLEDLDSKRtswlNALAVISLIDIETNRSRSNEIKKQLNDLESRLQEIE 607
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1160 S---------EKAIKEKEYEIERLRVLLQEEGTRKREYEnELAKVRNHYNEEMSNLRnkyetEINITKTTIKEISMQKED 1230
Cdd:PRK01156 608 IgfpddksyiDKSIREIENEANNLNNKYNEIQENKILIE-KLRGKIDNYKKQIAEID-----SIIPDLKEITSRINDIED 681
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*...
gi 975830145 1231 DSKNLRNQLD-------RLSRENRDLKDEIVRLNDSIlqaTEQRRRAEENALQQKACG 1281
Cdd:PRK01156 682 NLKKSRKALDdakanraRLESTIEILRTRINELSDRI---NDINETLESMKKIKKAIG 736
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1013-1347 |
2.37e-04 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 46.54 E-value: 2.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1013 YYRFLSEMLKSLE----------DLKLKNTKIEVLEEELRLARDANSENCNKNKF-LDQNLQKYQAECSQFKAKLASLEE 1081
Cdd:NF033838 67 LEKILSEIQKSLDkrkhtqnvalNKKLSDIKTEYLYELNVLKEKSEAELTSKTKKeLDAAFEQFKKDTLEPGKKVAEATK 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1082 LKRQAELDGKSAKQNLDKCYgqikelnEKITRLTYEIEDEKRRRKSVEDRFDQQKndyDQLQKARQCEKENLGWQKLESE 1161
Cdd:NF033838 147 KVEEAEKKAKDQKEEDRRNY-------PTNTYKTLELEIAESDVEVKKAELELVK---EEAKEPRDEEKIKQAKAKVESK 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1162 KAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIKeismQKEDDSKNLRNQLDR 1241
Cdd:NF033838 217 KAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRGVLGEPATPD----KKENDAKSSDSSVGE 292
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1242 LSRENRDLKDEivrlnDSILQATEQRRRAEENALQQKA------------------CGSEIMQKKQHLEIELKQVMQQRS 1303
Cdd:NF033838 293 ETLPSPSLKPE-----KKVAEAEKKVEEAKKKAKDQKEedrrnyptntyktleleiAESDVKVKEAELELVKEEAKEPRN 367
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 975830145 1304 EDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENE 1347
Cdd:NF033838 368 EEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDK 411
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1248-1418 |
2.90e-04 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 46.02 E-value: 2.90e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1248 DLKDEivrlnDSILQATEQRRRAEenaLQQKAcgsEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEI 1327
Cdd:COG2268 178 DLEDE-----NNYLDALGRRKIAE---IIRDA---RIAEAEAERETEIAIAQANREAEEAELEQEREIETARIAEAEAEL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1328 ERLKAEFQEEAKRrweyENELSKASNRIQESKNQctQVVQeresllVKIKVLEQDKA-RLQRLEDElnRAKSTLEAEtrV 1406
Cdd:COG2268 247 AKKKAEERREAET----ARAEAEAAYEIAEANAE--REVQ------RQLEIAEREREiELQEKEAE--REEAELEAD--V 310
|
170
....*....|..
gi 975830145 1407 KQRLECEKQQIQ 1418
Cdd:COG2268 311 RKPAEAEKQAAE 322
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1025-1509 |
3.35e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 45.97 E-value: 3.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1025 EDLKLKNTKIEVLEEELRLARDANsencnknkfldQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNldkcygQI 1104
Cdd:pfam10174 67 EENQHLQLTIQALQDELRAQRDLN-----------QLLQQDFTTSPVDGEDKFSTPELTEENFRRLQSEHER------QA 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1105 KELneKITRLTYEiEDEKR---RRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQklESEKAIKEKEYEIERLRVLLQEe 1181
Cdd:pfam10174 130 KEL--FLLRKTLE-EMELRietQKQTLGARDESIKKLLEMLQSKGLPKKSGEEDW--ERTRRIAEAEMQLGHLEVLLDQ- 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1182 gtrkREYENelakvrNHYNEEMSNlRNKYETEINITKTTIKEISMQkedDSKnlrnqLDRLSRENRDLKDEIVRLNDSIL 1261
Cdd:pfam10174 204 ----KEKEN------IHLREELHR-RNQLQPDPAKTKALQTVIEMK---DTK-----ISSLERNIRDLEDEVQMLKTNGL 264
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1262 QATEQRrraEENALQQKACGSEIMQKKQHLEiELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRR 1341
Cdd:pfam10174 265 LHTEDR---EEEIKQMEVYKSHSKFMKNKID-QLKQELSKKESELLALQTKLETLTNQNSDCKQHIEVLKESLTAKEQRA 340
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1342 WEYENELSKASNRIqesknqctqvvQERESLLVKikvlEQDKarLQRLEDElnraKSTLEAETR-------VKQR-LECE 1413
Cdd:pfam10174 341 AILQTEVDALRLRL-----------EEKESFLNK----KTKQ--LQDLTEE----KSTLAGEIRdlkdmldVKERkINVL 399
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1414 KQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLqAEIKRIEERCRRKLEDSTRETQSQLETersrYQ 1493
Cdd:pfam10174 400 QKKIENLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEAL-SEKERIIERLKEQREREDRERLEELES----LK 474
|
490
....*....|....*.
gi 975830145 1494 REIDKLRQRPYGSHRE 1509
Cdd:pfam10174 475 KENKDLKEKVSALQPE 490
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1078-1332 |
3.78e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 46.06 E-value: 3.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1078 SLEELKRQAELDGKSAKQNLDKCYGQIKELNekitRLTYEIEDEKRRRKSVE------DRFDQQKNDYDQLQKARQceke 1151
Cdd:COG4913 208 DLDDFVREYMLEEPDTFEAADALVEHFDDLE----RAHEALEDAREQIELLEpirelaERYAAARERLAELEYLRA---- 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 nlGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEemsnLRNKYETeinitkttikeismQKEDD 1231
Cdd:COG4913 280 --ALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDE----LEAQIRG--------------NGGDR 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1232 SKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKAcgsEIMQKKQHLEIELKQVMQQRsednARHKQ 1311
Cdd:COG4913 340 LEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRA---EAAALLEALEEELEALEEAL----AEAEA 412
|
250 260
....*....|....*....|.
gi 975830145 1312 SLEEAAKTIQDKNKEIERLKA 1332
Cdd:COG4913 413 ALRDLRRELRELEAEIASLER 433
|
|
| FPP |
pfam05911 |
Filament-like plant protein, long coiled-coil; FPP is a family of long coiled-coil plant ... |
1272-1414 |
4.50e-04 |
|
Filament-like plant protein, long coiled-coil; FPP is a family of long coiled-coil plant proteins that are filament-like. It interacts with the nuclear envelope-associated protein, MAF1, the WPP family pfam13943.
Pssm-ID: 461778 [Multi-domain] Cd Length: 859 Bit Score: 45.82 E-value: 4.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1272 ENALQQKACGSEimqKKQHLEIELKQVMQQRSEDNARHKQSLEEAaktIQDKNKEIERLKAEFqeEAKRRwEYENELSKA 1351
Cdd:pfam05911 34 ESVTLQKLTAEE---RAAHLDGALKECMQQLRNVKEEQEQKIHDV---VLKKTKEWEKIKAEL--EAKLV-ETEQELLRA 104
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1352 SnriQESkNQCTQVVQERESLLVKIKvleqdkarlqrleDELNRAKSTLEAetrVKQRLE-CEK 1414
Cdd:pfam05911 105 A---AEN-DALSRSLQERENLLMKLS-------------EEKSQAEAEIEA---LKSRLEsCEK 148
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
1257-1502 |
4.91e-04 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 45.07 E-value: 4.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1257 NDSILQATEQRRRAEENALQQkacgseimQKKQHLEIeLKQVMQQRsEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQE 1336
Cdd:PRK11637 45 NRDQLKSIQQDIAAKEKSVRQ--------QQQQRASL-LAQLKKQE-EAISQASRKLRETQNTLNQLNKQIDELNASIAK 114
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1337 EAKRRWEYENELSK---ASNRiqESKNQCTQVV------QERESLLVKIKVLEQDK----ARLQRLEDELNRAKSTLEAE 1403
Cdd:PRK11637 115 LEQQQAAQERLLAAqldAAFR--QGEHTGLQLIlsgeesQRGERILAYFGYLNQARqetiAELKQTREELAAQKAELEEK 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1404 TRVKQRLECEKQQIQNDLNQwkTQYSRKEeairkieserekserEKNSLRSEIERLQ---AEIKRIEERCRRKLEDSTRE 1480
Cdd:PRK11637 193 QSQQKTLLYEQQAQQQKLEQ--ARNERKK---------------TLTGLESSLQKDQqqlSELRANESRLRDSIARAERE 255
|
250 260
....*....|....*....|..
gi 975830145 1481 TQSQLETErsryQREIDKLRQR 1502
Cdd:PRK11637 256 AKARAERE----AREAARVRDK 273
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1112-1499 |
4.98e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 45.48 E-value: 4.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1112 TRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKenlgwqkleseKAIKEKEYEIERLRVLLQEEGTRKREYENE 1191
Cdd:pfam05483 81 SKLYKEAEKIKKWKVSIEAELKQKENKLQENRKIIEAQR-----------KAIQELQFENEKVSLKLEEEIQENKDLIKE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1192 lakvrNHYNEEMSNLrnkyeteinitkttIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLndsILQATEQRRRAE 1271
Cdd:pfam05483 150 -----NNATRHLCNL--------------LKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKM---ILAFEELRVQAE 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1272 ENALQQKACGSEIMQKKQHLEIELKQ-----------VMQQRSEDNARHKQ---SLEEAaktiQDKNKEIERLKAEFQEE 1337
Cdd:pfam05483 208 NARLEMHFKLKEDHEKIQHLEEEYKKeindkekqvslLLIQITEKENKMKDltfLLEES----RDKANQLEEKTKLQDEN 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1338 AKRRWEYENELSKASNRIQES--KNQCTQVVQErESLLVKIKVLEQDKARLQRLEDELNRAKST-------LEAET-RVK 1407
Cdd:pfam05483 284 LKELIEKKDHLTKELEDIKMSlqRSMSTQKALE-EDLQIATKTICQLTEEKEAQMEELNKAKAAhsfvvteFEATTcSLE 362
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1408 QRLECEKQQIQNDLNQWK------TQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEercrrKLEDSTRET 1481
Cdd:pfam05483 363 ELLRTEQQRLEKNEDQLKiitmelQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFE-----KIAEELKGK 437
|
410
....*....|....*...
gi 975830145 1482 QSQLETERSRYQREIDKL 1499
Cdd:pfam05483 438 EQELIFLLQAREKEIHDL 455
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1262-1502 |
5.11e-04 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 45.72 E-value: 5.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1262 QATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNA--RHKQSLEEAAKTIQDKNKEIERLKAEFQEEAK 1339
Cdd:PRK04863 377 QQEENEARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAVQAleRAKQLCGLPDLTADNAEDWLEEFQAKEQEATE 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1340 RRWEYENELSKAsnriQESKNQCTQVVQ---------ERE-------SLLVKIKVLEQDKARLQRLEDELNRAKSTLEAE 1403
Cdd:PRK04863 457 ELLSLEQKLSVA----QAAHSQFEQAYQlvrkiagevSRSeawdvarELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQ 532
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1404 TRVKQRL-ECEKQQIQNDLNQWKTQ--YSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIE------------- 1467
Cdd:PRK04863 533 QRAERLLaEFCKRLGKNLDDEDELEqlQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAarapawlaaqdal 612
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1468 ----ERCRRKLEDS----------------TRETQSQLETERSRYQREIDKLRQR 1502
Cdd:PRK04863 613 arlrEQSGEEFEDSqdvteymqqllerereLTVERDELAARKQALDEEIERLSQP 667
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1012-1498 |
5.63e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 45.43 E-value: 5.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1012 DYYRFLSEMLKSleDLKLKNTKI-EVLEE-----ELRLARDANSENCNKNKFLDQNLQK----------YQAECSQFKAK 1075
Cdd:TIGR01612 448 DFDEFNKPIPKS--KLKALEKRFfEIFEEewgsyDIKKDIDENSKQDNTVKLILMRMKDfkdiidfmelYKPDEVPSKNI 525
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1076 LASLEELKRQAELdGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDrfdQQKNDYDQLQKarqCEKENLGW 1155
Cdd:TIGR01612 526 IGFDIDQNIKAKL-YKEIEAGLKESYELAKNWKKLIHEIKKELEEENEDSIHLEK---EIKDLFDKYLE---IDDEIIYI 598
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1156 QKLESE-----KAIKEKEYEIERlRVLLQEEGTRKREYENELAKVRNHYNEEmsNLRNKyeteiNITKTTIK-EISMQKE 1229
Cdd:TIGR01612 599 NKLKLElkekiKNISDKNEYIKK-AIDLKKIIENNNAYIDELAKISPYQVPE--HLKNK-----DKIYSTIKsELSKIYE 670
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1230 DDSKNLRNQLDRLSREN-----------RDLKDEIVRLNDSILQATEQRRRAEENALQQKA--CGSEIMQKKQHLEIELK 1296
Cdd:TIGR01612 671 DDIDALYNELSSIVKENaidntedkaklDDLKSKIDKEYDKIQNMETATVELHLSNIENKKneLLDIIVEIKKHIHGEIN 750
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1297 QVMQQRSEDnarHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKI 1376
Cdd:TIGR01612 751 KDLNKILED---FKNKEKELSNKINDYAKEKDELNKYKSKISEIKNHYNDQINIDNIKDEDAKQNYDKSKEYIKTISIKE 827
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1377 KVLEQDKARLQRLEDE-LNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAI--RKIESEREKSEREKNSLR 1453
Cdd:TIGR01612 828 DEIFKIINEMKFMKDDfLNKVDKFINFENNCKEKIDSEHEQFAELTNKIKAEISDDKLNDyeKKFNDSKSLINEINKSIE 907
|
490 500 510 520
....*....|....*....|....*....|....*....|....*
gi 975830145 1454 SEIERLQAeIKRIEERCrrKLEDSTRETQSQLETERSRYQREIDK 1498
Cdd:TIGR01612 908 EEYQNINT-LKKVDEYI--KICENTKESIEKFHNKQNILKEILNK 949
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
1124-1369 |
6.09e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 45.01 E-value: 6.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1124 RRKSVEDRFDQQK-NDYDQLQKAR----QCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKRE-YENELAKVRN 1197
Cdd:PHA02562 152 RRKLVEDLLDISVlSEMDKLNKDKirelNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNkYDELVEEAKT 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1198 HyneemsnlrnkyETEINITKTTIKEISMQKEDDS---KNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENA 1274
Cdd:PHA02562 232 I------------KAEIEELTDELLNLVMDIEDPSaalNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPD 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1275 LQqkacgSEIMQKKQHLEIELKQVMQQRSEDNARhKQSLEEAAKTIQDKNKEIE--------------RLKAEFQEEAKR 1340
Cdd:PHA02562 300 RI-----TKIKDKLKELQHSLEKLDTAIDELEEI-MDEFNEQSKKLLELKNKIStnkqslitlvdkakKVKAAIEELQAE 373
|
250 260
....*....|....*....|....*....
gi 975830145 1341 RWEYENELSKASNRIQESKNQCTQVVQER 1369
Cdd:PHA02562 374 FVDNAEELAKLQDELDKIVKTKSELVKEK 402
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
1226-1496 |
7.11e-04 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 45.02 E-value: 7.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1226 MQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKacgseimqkkQHLEIELKQVMQQRSED 1305
Cdd:pfam05667 222 WEEEWNSQGLASRLTPEEYRKRKRTKLLKRIAEQLRSAALAGTEATSGASRSA----------QDLAELLSSFSGSSTTD 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1306 NARHKQSL---EEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIK----V 1378
Cdd:pfam05667 292 TGLTKGSRfthTEKLQFTNEAPAATSSPPTKVETEEELQQQREEELEELQEQLEDLESSIQELEKEIKKLESSIKqveeE 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1379 LEQDKARLQRLEDELNRAKSTLE----AETRVKQrLECE----KQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKN 1450
Cdd:pfam05667 372 LEELKEQNEELEKQYKVKKKTLDllpdAEENIAK-LQALvdasAQRLVELAGQWEKHRVPLIEEYRALKEAKSNKEDESQ 450
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 975830145 1451 SLRSEIERLQAEIKRIEERCRRKlEDSTRETQSQLE-----TERSRYQREI 1496
Cdd:pfam05667 451 RKLEEIKELREKIKEVAEEAKQK-EELYKQLVAEYErlpkdVSRSAYTRRI 500
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1170-1360 |
7.93e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 43.37 E-value: 7.93e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1170 EIERLrVLLQEEGTRKREYENELAKVRNHYnEEMSNLRNKYETEINITKTTIKEISMQ---KEDDSKNLRNQLDRLS--- 1243
Cdd:COG1579 5 DLRAL-LDLQELDSELDRLEHRLKELPAEL-AELEDELAALEARLEAAKTELEDLEKEikrLELEIEEVEARIKKYEeql 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1244 ---RENRDLKDeivrLNDSILQATEQRRRAEENALqqkacgsEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTI 1320
Cdd:COG1579 83 gnvRNNKEYEA----LQKEIESLKRRISDLEDEIL-------ELMERIEELEEELAELEAELAELEAELEEKKAELDEEL 151
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 975830145 1321 QDKNKEIERLKAEFQEEAKrrwEYENELSKASNRIQESKN 1360
Cdd:COG1579 152 AELEAELEELEAEREELAA---KIPPELLALYERIRKRKN 188
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1060-1494 |
9.27e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 44.65 E-value: 9.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1060 QNLQKYQAECSQFKAKLASLEELKrqaelDGKSAKQNLDKCYGQIKELNEKitrltyEIEDEKRRRKSVEDRFDQQKNDY 1139
Cdd:TIGR00606 200 QKVQEHQMELKYLKQYKEKACEIR-----DQITSKEAQLESSREIVKSYEN------ELDPLKNRLKEIEHNLSKIMKLD 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1140 DQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVR------NHYNEEMSNLRNKYETE 1213
Cdd:TIGR00606 269 NEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQreleklNKERRLLNQEKTELLVE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1214 INITKTTIKEISMQ-KEDDSKNLRNQLD-RLSRENRDLKDEIVRLNDSILQATEQRRRAEE-NALQQKACGSEIMQKKQH 1290
Cdd:TIGR00606 349 QGRLQLQADRHQEHiRARDSLIQSLATRlELDGFERGPFSERQIKNFHTLVIERQEDEAKTaAQLCADLQSKERLKQEQA 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1291 LEIELKQVMQQRSEDNarHKQSLEEAAKTIQDKNKEIERLKAEfqeeAKRRWEYENELSKASNRIQES-KNQCTQVVQER 1369
Cdd:TIGR00606 429 DEIRDEKKGLGRTIEL--KKEILEKKQEELKFVIKELQQLEGS----SDRILELDQELRKAERELSKAeKNSLTETLKKE 502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1370 ESLLVKIKV-LEQDKARLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQ-------------WKTQYSR----- 1430
Cdd:TIGR00606 503 VKSLQNEKAdLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRhsdeltsllgyfpNKKQLEDwlhsk 582
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 975830145 1431 ------KEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEER----CRRKLEDSTRETQSQlETERSRYQR 1494
Cdd:TIGR00606 583 skeinqTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKlfdvCGSQDEESDLERLKE-EIEKSSKQR 655
|
|
| RNase_Y_N |
pfam12072 |
RNase Y N-terminal region; |
1310-1411 |
9.96e-04 |
|
RNase Y N-terminal region;
Pssm-ID: 463456 [Multi-domain] Cd Length: 201 Bit Score: 42.57 E-value: 9.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1310 KQSLEEAAKTIQDKNKEIE--------RLKAEFQEEAKRRweyENELSKASNRIQesknqctqvvQERESLLVKIKVLEQ 1381
Cdd:pfam12072 37 KRIIEEAKKEAETKKKEALleakeeihKLRAEAERELKER---RNELQRQERRLL----------QKEETLDRKDESLEK 103
|
90 100 110
....*....|....*....|....*....|....*
gi 975830145 1382 DKARLQRLEDELNRAKSTL-----EAETRVKQRLE 1411
Cdd:pfam12072 104 KEESLEKKEKELEAQQQQLeekeeELEELIEEQRQ 138
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
1004-1329 |
1.11e-03 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 44.18 E-value: 1.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1004 IELLTRSGDYYRFLSEMLKSLEDLKL--KNTKIEVLEE---ELRLARDANSENCNKNKFLDQNLQKYQAECSQFKA--KL 1076
Cdd:COG5185 245 LEDLAQTSDKLEKLVEQNTDLRLEKLgeNAESSKRLNEnanNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAaeAE 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1077 ASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKItrltyeieDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKENLGWQ 1156
Cdd:COG5185 325 QELEESKRETETGIQNLTAEIEQGQESLTENLEAI--------KEEIENIVGEVELSKSSEELDSFKDTIESTKESLDEI 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1157 KLESEKAIKEKEYEIERlrvllqeegtRKREYENELAKVR------NHYNEEMSNLRNKYETEINITKTTIKEISMQKED 1230
Cdd:COG5185 397 PQNQRGYAQEILATLED----------TLKAADRQIEELQrqieqaTSSNEEVSKLLNELISELNKVMREADEESQSRLE 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1231 DSKNLRNQldRLSRENRDLKDEIVRLNDSI--LQATEQRRRAEENALQQKacgseIMQKKQHLEIELKQVMQQRSEDNAR 1308
Cdd:COG5185 467 EAYDEINR--SVRSKKEDLNEELTQIESRVstLKATLEKLRAKLERQLEG-----VRSKLDQVAESLKDFMRARGYAHIL 539
|
330 340
....*....|....*....|.
gi 975830145 1309 HKQSLEEAAKTIQDKNKEIER 1329
Cdd:COG5185 540 ALENLIPASELIQASNAKTDG 560
|
|
| Mitofilin |
pfam09731 |
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ... |
1108-1499 |
1.28e-03 |
|
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.
Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 43.98 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1108 NEKITRLTYEIEDEKRRRKSvEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEyeiERLRVLLQEEGTRKRE 1187
Cdd:pfam09731 72 VSAVTGESKEPKEEKKQVKI-PRQSGVSSEVAEEEKEATKDAAEAKAQLPKSEQEKEKALE---EVLKEAISKAESATAV 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1188 YENELAKVRNHYNEEMSNLRNKYETEinitkttikeismqKEDDSKNLRNQLDRLSRENRDLKDEIVRLN-DSILQATEQ 1266
Cdd:pfam09731 148 AKEAKDDAIQAVKAHTDSLKEASDTA--------------EISREKATDSALQKAEALAEKLKEVINLAKqSEEEAAPPL 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1267 RRRAEENALQQKACGSEIMQKKQHLEIELKQVMQ--QRSEDNARhkQSLEEAAKTIQDKNKEIERLKAEFQEEakrrweY 1344
Cdd:pfam09731 214 LDAAPETPPKLPEHLDNVEEKVEKAQSLAKLVDQykELVASERI--VFQQELVSIFPDIIPVLKEDNLLSNDD------L 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1345 ENELSKASNRIQEsknqctqvvqeresllVKIKVLEQDKARLQRLEDELNRAKSTLE-AETRVKQRLEcekQQIQNDLNQ 1423
Cdd:pfam09731 286 NSLIAHAHREIDQ----------------LSKKLAELKKREEKHIERALEKQKEELDkLAEELSARLE---EVRAADEAQ 346
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1424 WKTQYSRKEEAIRKieserekseREKNSLRSEIERLQAEI-KRIEERCRRKLEDSTRETQ----SQLETERSRYQREIDK 1498
Cdd:pfam09731 347 LRLEFEREREEIRE---------SYEEKLRTELERQAEAHeEHLKDVLVEQEIELQREFLqdikEKVEEERAGRLLKLNE 417
|
.
gi 975830145 1499 L 1499
Cdd:pfam09731 418 L 418
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1002-1394 |
1.43e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.19 E-value: 1.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQFKAKLASL-- 1079
Cdd:TIGR00618 536 TYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACEqh 615
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1080 -EELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRfdQQKNDYDQLQKARQCEKENlgwqKL 1158
Cdd:TIGR00618 616 aLLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIR--VLPKELLASRQLALQKMQS----EK 689
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1159 ESEKAIKEkeyEIERLRVLLQEEGTRKREYENElakvRNHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQ 1238
Cdd:TIGR00618 690 EQLTYWKE---MLAQCQTLLRELETHIEEYDRE----FNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEA 762
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1239 LDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAK 1318
Cdd:TIGR00618 763 HFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSA 842
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 975830145 1319 TIQdknkEIERLKAEFQEEAKRRweyeNELSKASNRIqesknqcTQVVQERESLLVKIKVLEQDkaRLQRLEDELN 1394
Cdd:TIGR00618 843 TLG----EITHQLLKYEECSKQL----AQLTQEQAKI-------IQLSDKLNGINQIKIQFDGD--ALIKFLHEIT 901
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1262-1415 |
1.44e-03 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 43.64 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1262 QATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSlEEAAKTIQDKNKEIERLKAEFQEEAKRR 1341
Cdd:PRK09510 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA-EEAAKQAALKQKQAEEAAAKAAAAAKAK 148
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1342 WEYENE-LSKASNRIQESKnqctqvvqeresllvkiKVLEQDKARLQRLEDelnrAKSTLEAETRVKQRLECEKQ 1415
Cdd:PRK09510 149 AEAEAKrAAAAAKKAAAEA-----------------KKKAEAEAAKKAAAE----AKKKAEAEAAAKAAAEAKKK 202
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1311-1501 |
2.04e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 43.31 E-value: 2.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1311 QSLEEA-AKTIQDKNKEIERlkaefqEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQdkaRLQRL 1389
Cdd:COG2433 376 LSIEEAlEELIEKELPEEEP------EAEREKEHEERELTEEEEEIRRLEEQVERLEAEVEELEAELEEKDE---RIERL 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1390 EDELNRAKSTLEAETRvkqrlecekqqiqndlnqwktqysrKEEAIRKiesereksereknsLRSEIERLQAEIKRIEER 1469
Cdd:COG2433 447 ERELSEARSEERREIR-------------------------KDREISR--------------LDREIERLERELEEERER 487
|
170 180 190
....*....|....*....|....*....|..
gi 975830145 1470 cRRKLEDSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:COG2433 488 -IEELKRKLERLKELWKLEHSGELVPVKVVEK 518
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1025-1489 |
2.07e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 43.67 E-value: 2.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1025 EDLKLKNTKIEVLEEELRLARDANSEncnKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLD-KCYGQ 1103
Cdd:pfam12128 315 AAVAKDRSELEALEDQHGAFLDADIE---TAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKeQNNRD 391
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1104 IKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCE-----KENLGWQKLESEKAIKEKEY--------- 1169
Cdd:pfam12128 392 IAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFNEeeyrlKSRLGELKLRLNQATATPELllqlenfde 471
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1170 EIERLRVLlQEEGTRKRE-YENELAKVRNHYNEEMSNLRNKyETEINITKTTIKEISMQKEDDSKNLrnqLDRLSRENRD 1248
Cdd:pfam12128 472 RIERAREE-QEAANAEVErLQSELRQARKRRDQASEALRQA-SRRLEERQSALDELELQLFPQAGTL---LHFLRKEAPD 546
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1249 LKDEIVRLNDSIL--------QATEQRRRAEEN----ALQQKACG-SEIMQKKQHLEIELKQV---MQQRSEDNARHKQS 1312
Cdd:pfam12128 547 WEQSIGKVISPELlhrtdldpEVWDGSVGGELNlygvKLDLKRIDvPEWAASEEELRERLDKAeeaLQSAREKQAAAEEQ 626
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1313 LEEAAKTIQDKNKEIERLKAEFQ--EEAKRRW--EYENELSKASNRIQESKNQctqVVQERESLLVKIKVLEQD-KARLQ 1387
Cdd:pfam12128 627 LVQANGELEKASREETFARTALKnaRLDLRRLfdEKQSEKDKKNKALAERKDS---ANERLNSLEAQLKQLDKKhQAWLE 703
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1388 RLEDELNRAKSTLEA------------ETRVKQRLECEKQQIQNDLNQWKTQYSRK------------------EEAIRK 1437
Cdd:pfam12128 704 EQKEQKREARTEKQAywqvvegaldaqLALLKAAIAARRSGAKAELKALETWYKRDlaslgvdpdviaklkreiRTLERK 783
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 975830145 1438 IESEREKS---------------------EREKNSLRSEIERLQAEIKRIEERCRRKLedstretqSQLETER 1489
Cdd:pfam12128 784 IERIAVRRqevlryfdwyqetwlqrrprlATQLSNIERAISELQQQLARLIADTKLRR--------AKLEMER 848
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1207-1502 |
2.14e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 43.41 E-value: 2.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1207 RNKYETEINITKTTIKEISmQKEDDSKNLRNQLDRLSRENRDLKDEivRLNDSILQATEQRRRAEENALQQKACGSEIMQ 1286
Cdd:PRK04863 846 RVELERALADHESQEQQQR-SQLEQAKEGLSALNRLLPRLNLLADE--TLADRVEEIREQLDEAEEAKRFVQQHGNALAQ 922
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1287 KKQHLeielkQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQ-------EEAKRRWEYENELSkasnriqesk 1359
Cdd:PRK04863 923 LEPIV-----SVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQrrahfsyEDAAEMLAKNSDLN---------- 987
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1360 nqctqvvqeresllvkikvlEQDKARLQRLEDELNRAKstlEAETRVKQRLEcEKQQIQNDLnqwKTQYSRKEEAIRKIE 1439
Cdd:PRK04863 988 --------------------EKLRQRLEQAEQERTRAR---EQLRQAQAQLA-QYNQVLASL---KSSYDAKRQMLQELK 1040
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 975830145 1440 SEREKSEREKNSlrSEIERLQAEIKRIEERCRRkledsTRETQSQLETERSRYQREIDKLRQR 1502
Cdd:PRK04863 1041 QELQDLGVPADS--GAEERARARRDELHARLSA-----NRSRRNQLEKQLTFCEAEMDNLTKK 1096
|
|
| HMMR_N |
pfam15905 |
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ... |
1294-1499 |
2.15e-03 |
|
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.
Pssm-ID: 464932 [Multi-domain] Cd Length: 329 Bit Score: 42.88 E-value: 2.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1294 ELKQVMQQRSEDNARHKQsLEeaaktiqdknKEIERLKAEFQEEAKRRWEYENELSKASNRIQesknqctQVVQERESLL 1373
Cdd:pfam15905 60 ELKKKSQKNLKESKDQKE-LE----------KEIRALVQERGEQDKRLQALEEELEKVEAKLN-------AAVREKTSLS 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1374 VKIKVLEQ--------------------DKARLQRLEDELNRAKSTLEAETR---VKQR-LECEKQQIQNDLNQWKTQYS 1429
Cdd:pfam15905 122 ASVASLEKqlleltrvnellkakfsedgTQKKMSSLSMELMKLRNKLEAKMKevmAKQEgMEGKLQVTQKNLEHSKGKVA 201
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1430 RKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCR--RKLEDSTRETQSQLETERSRYQREIDKL 1499
Cdd:pfam15905 202 QLEEKLVSTEKEKIEEKSETEKLLEYITELSCVSEQVEKYKLdiAQLEELLKEKNDEIESLKQSLEEKEQEL 273
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
1002-1393 |
2.15e-03 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 43.53 E-value: 2.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1002 RYIELLTRSGDYYRFLSEMLKSLEDLKLKnTKIEVLEEELRLaRDANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEE 1081
Cdd:COG5022 772 KKIQVIQHGFRLRRLVDYELKWRLFIKLQ-PLLSLLGSRKEY-RSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQ 849
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1082 LKRQAELDGKSAKQNLDK-CYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCE---------KE 1151
Cdd:COG5022 850 KFGRSLKAKKRFSLLKKEtIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDlienlefktEL 929
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 NLGWQKLESEKAIKE---KEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNLrNKYETEINITKTTIKEISMQK 1228
Cdd:COG5022 930 IARLKKLLNNIDLEEgpsIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREG-NKANSELKNFKKELAELSKQY 1008
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1229 EDDSKNLRnQLDRLSRENRDLKDEIVRLNDsilQATEQRRRAEENALQqkacgSEIMQKKQHLEIELKQVMQQRSEDNAR 1308
Cdd:COG5022 1009 GALQESTK-QLKELPVEVAELQSASKIISS---ESTELSILKPLQKLK-----GLLLLENNQLQARYKALKLRRENSLLD 1079
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1309 HKQSleeaaKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKAsNRIQESKNQCTQvvQERESLLVKIKVLEQDKARLQR 1388
Cdd:COG5022 1080 DKQL-----YQLESTENLLKTINVKDLEVTNRNLVKPANVLQF-IVAQMIKLNLLQ--EISKFLSQLVNTLEPVFQKLSV 1151
|
....*
gi 975830145 1389 LEDEL 1393
Cdd:COG5022 1152 LQLEL 1156
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1242-1501 |
2.16e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.97 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1242 LSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKacgSEIMQKKQHLEiELKQVMQQRSEDNARHKQSLEEAAKTIQ 1321
Cdd:COG4372 29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLE---EELEQARSELE-QLEEELEELNEQLQAAQAELAQAQEELE 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1322 DKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKARLQRLEDELNRAKSTLE 1401
Cdd:COG4372 105 SLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1402 AETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRS------EIERLQAEIKRIEERCRRKLE 1475
Cdd:COG4372 185 LDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSAlldaleLEEDKEELLEEVILKEIEELE 264
|
250 260
....*....|....*....|....*.
gi 975830145 1476 DSTRETQSQLETERSRYQREIDKLRQ 1501
Cdd:COG4372 265 LAILVEKDTEEEELEIAALELEALEE 290
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
992-1476 |
2.22e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.39 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 992 ILQEAADVHARYIELLTRSGDYYRFLSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENCNKNKFLDQNLQKYQAECSQ 1071
Cdd:COG1196 360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1072 FKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQLQKARQCEKE 1151
Cdd:COG1196 440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1152 NLGWQKLeSEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHyneemsNLRNKYETEINITKTTIKEISMQKEDD 1231
Cdd:COG1196 520 RGLAGAV-AVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEY------LKAAKAGRATFLPLDKIRARAALAAAL 592
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1232 SKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALqqkacgsEIMQKKQHLEIELKQVMQQRSEDNARHKQ 1311
Cdd:COG1196 593 ARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAAL-------RRAVTLAGRLREVTLEGEGGSAGGSLTGG 665
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1312 SLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQctqvvQERESLLVKIKVLEQDKARLQRLED 1391
Cdd:COG1196 666 SRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEE-----RLEEELEEEALEEQLEAEREELLEE 740
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1392 ELNRAKSTLEAETRVKQRLEcekqqiqnDLNQWKTQYSRKEEAIRKI--------------ESEREKSEREKNSLRSEIE 1457
Cdd:COG1196 741 LLEEEELLEEEALEELPEPP--------DLEELERELERLEREIEALgpvnllaieeyeelEERYDFLSEQREDLEEARE 812
|
490
....*....|....*....
gi 975830145 1458 RLQAEIKRIEERCRRKLED 1476
Cdd:COG1196 813 TLEEAIEEIDRETRERFLE 831
|
|
| DUF4686 |
pfam15742 |
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ... |
1218-1491 |
2.32e-03 |
|
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.
Pssm-ID: 464838 [Multi-domain] Cd Length: 384 Bit Score: 42.74 E-value: 2.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1218 KTTIKEISMQKEDDSKN---LRNQLDRLSRENRDLKDEIVRLNdsilQATEQRRRAEENALQQKACGSEIMQKKQHLEIE 1294
Cdd:pfam15742 61 KAELKQAQQKLLDSTKMcssLTAEWKHCQQKIRELELEVLKQA----QSIKSQNSLQEKLAQEKSRVADAEEKILELQQK 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1295 LKQVMQQRSEDN-ARHKQSLEEAAKTIQDKnkeIERLKAEFQEEAKRRW---EYENELSKASNRIQESKNQCTQVVQERE 1370
Cdd:pfam15742 137 LEHAHKVCLTDTcILEKKQLEERIKEASEN---EAKLKQQYQEEQQKRKlldQNVNELQQQVRSLQDKEAQLEMTNSQQQ 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1371 SLlvkikvLEQDKARLQRLEDELNRAKSTLEAETRVKQRL-----ECEKQQ--IQNDLNQWKTQYSRKEEairkiesere 1443
Cdd:pfam15742 214 LR------IQQQEAQLKQLENEKRKSDEHLKSNQELSEKLsslqqEKEALQeeLQQVLKQLDVHVRKYNE---------- 277
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 975830145 1444 KSEREKNSLRSEIERLQAEIKRIEERCrRKLEDSTRETQSQLETERSR 1491
Cdd:pfam15742 278 KHHHHKAKLRRAKDRLVHEVEQRDERI-KQLENEIGILQQQSEKEKAF 324
|
|
| Golgin_A5 |
pfam09787 |
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ... |
1291-1466 |
2.35e-03 |
|
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.
Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 42.44 E-value: 2.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1291 LEIELKQVMQQRsednarhkqslEEAAKTIQDKNKEIERLKAEFQEEAKRrweYENELSKASNRIQESKNQCTQVVQERE 1370
Cdd:pfam09787 45 LTLELEELRQER-----------DLLREEIQKLRGQIQQLRTELQELEAQ---QQEEAESSREQLQELEEQLATERSARR 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1371 SLLVKikvLEQDKARLQRLEDELNRAKSTLEaeTRVKQRlECEKQQIQNDLNQwKTQYSRKE-----------EAIRKIE 1439
Cdd:pfam09787 111 EAEAE---LERLQEELRYLEEELRRSKATLQ--SRIKDR-EAEIEKLRNQLTS-KSQSSSSQselenrlhqltETLIQKQ 183
|
170 180
....*....|....*....|....*..
gi 975830145 1440 SEREKSEREKNSLRSEIERLQAEIKRI 1466
Cdd:pfam09787 184 TMLEALSTEKNSLVLQLERMEQQIKEL 210
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
1193-1392 |
2.44e-03 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 43.08 E-value: 2.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1193 AKVR--NHYNEEMSNLRNKYETEINITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRA 1270
Cdd:PHA02562 174 DKIRelNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDP 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1271 EE--NALQQKAC--GSEIMQ-KKQHLEIELKQV----MQQRSEDNARhkqsLEEAAKTIQDKNKEIERLKAEFQEEAKRr 1341
Cdd:PHA02562 254 SAalNKLNTAAAkiKSKIEQfQKVIKMYEKGGVcptcTQQISEGPDR----ITKIKDKLKELQHSLEKLDTAIDELEEI- 328
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 975830145 1342 weyENELSKASNRIQESKNQCTQVVQERESLLVKIKVLEqdkARLQRLEDE 1392
Cdd:PHA02562 329 ---MDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVK---AAIEELQAE 373
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
858-1271 |
2.44e-03 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 42.92 E-value: 2.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 858 EKVTQLTDRWQRIdkqIDFRLWDLEKQIKQLRNYRDNYQAFckwlyDAKRRQDSLESMkfgdsntVMRFLNEQKNLHSEI 937
Cdd:pfam06160 45 EKFEEWRKKWDDI---VTKSLPDIEELLFEAEELNDKYRFK-----KAKKALDEIEEL-------LDDIEEDIKQILEEL 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 938 SGKRDK----SEEVQKIAELCANSIKDYELQLASYTSGLETLLNIpikrtmiqspsgviLQEAADVHARYIELlTRSGDY 1013
Cdd:pfam06160 110 DELLESeeknREEVEELKDKYRELRKTLLANRFSYGPAIDELEKQ--------------LAEIEEEFSQFEEL-TESGDY 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1014 Y---RFLSEMLKSLEDLKLKNTKIEVLEEELrlardansencnKNKFLDQnLQKYQAECSQFKAKLASLEELKRQAELdg 1090
Cdd:pfam06160 175 LearEVLEKLEEETDALEELMEDIPPLYEEL------------KTELPDQ-LEELKEGYREMEEEGYALEHLNVDKEI-- 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1091 KSAKQNLDKCYGQIKEL------------NEKITRLtYEI-EDEKRRRKSVEdrfDQQKNDYDQLQKARqcekenlgwqk 1157
Cdd:pfam06160 240 QQLEEQLEENLALLENLeldeaeealeeiEERIDQL-YDLlEKEVDAKKYVE---KNLPEIEDYLEHAE----------- 304
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1158 lESEKAIKEkeyEIERLR---VLLQEEGTRKREYENELAKVRNHYNEEMSNLRNK---YeTEINITKTTIKEISMQKEDD 1231
Cdd:pfam06160 305 -EQNKELKE---ELERVQqsyTLNENELERVRGLEKQLEELEKRYDEIVERLEEKevaY-SELQEELEEILEQLEEIEEE 379
|
410 420 430 440
....*....|....*....|....*....|....*....|
gi 975830145 1232 SKNLRNQLDRLSRENRDLKDEIVRLNDSILqatEQRRRAE 1271
Cdd:pfam06160 380 QEEFKESLQSLRKDELEAREKLDEFKLELR---EIKRLVE 416
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2319-2356 |
2.67e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 37.46 E-value: 2.67e-03
10 20 30
....*....|....*....|....*....|....*...
gi 975830145 2319 QRLQDTSSYAKILTCPKTKLKISYKDAINRSMVEDITG 2356
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| Filament |
pfam00038 |
Intermediate filament protein; |
1171-1499 |
2.74e-03 |
|
Intermediate filament protein;
Pssm-ID: 459643 [Multi-domain] Cd Length: 313 Bit Score: 42.21 E-value: 2.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1171 IERLRVLLQEegtrKREYENELAKVRNHYNEEMSNLRNKYETEINitkttikeismqkeddskNLRNQLDRLSRENRDLK 1250
Cdd:pfam00038 17 IDKVRFLEQQ----NKLLETKISELRQKKGAEPSRLYSLYEKEIE------------------DLRRQLDTLTVERARLQ 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1251 DEIvrlnDSILQATEQRRRAEENALQqkacgseimqKKQHLEIELkqVMQQRSEDNAR-HKQSLEeaaKTIQDKNKEIER 1329
Cdd:pfam00038 75 LEL----DNLRLAAEDFRQKYEDELN----------LRTSAENDL--VGLRKDLDEATlARVDLE---AKIESLKEELAF 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1330 LKAEFQEEAKrrweyenELSKasnriqesKNQCTQVVqeresllvkikvLEQDKARLQRLEDELNRAKSTLEAETRvKQR 1409
Cdd:pfam00038 136 LKKNHEEEVR-------ELQA--------QVSDTQVN------------VEMDAARKLDLTSALAEIRAQYEEIAA-KNR 187
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1410 LECEKQqiqndlnqWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERCRrKLEDSTRETQSQLETER 1489
Cdd:pfam00038 188 EEAEEW--------YQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQKA-SLERQLAETEERYELQL 258
|
330
....*....|
gi 975830145 1490 SRYQREIDKL 1499
Cdd:pfam00038 259 ADYQELISEL 268
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1234-1434 |
2.89e-03 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 43.08 E-value: 2.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1234 NLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRR--RAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARH-- 1309
Cdd:COG3206 165 NLELRREEARKALEFLEEQLPELRKELEEAEAALEefRQKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLaa 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1310 -KQSLEEAAKTIQD--KNKEIERLKAEFQEEAKRRWEYENELSKASNRIQESKNQCTQVVQERESLLVKIKV-LEQDKAR 1385
Cdd:COG3206 245 lRAQLGSGPDALPEllQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILAsLEAELEA 324
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 975830145 1386 LQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQY----SRKEEA 1434
Cdd:COG3206 325 LQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYesllQRLEEA 377
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
1054-1402 |
3.20e-03 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 43.28 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1054 KNKFLDQNLQKYQAECSQFKAKLASLEelkrqAELDGKSAKQNLDKcyGQIKELNEKITRLTYEIEDEKRRRKSVEDRFD 1133
Cdd:PTZ00440 1036 KGKEIEEKVDQYISLLEKMKTKLSSFH-----FNIDIKKYKNPKIK--EEIKLLEEKVEALLKKIDENKNKLIEIKNKSH 1108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1134 QQKNDYDQlQKARQCEKENLGWQKLES-----EKAIKEKEyEIERLRVLLQEEGTRKREYENELAkvrNHYNEEMSNLRN 1208
Cdd:PTZ00440 1109 EHVVNADK-EKNKQTEHYNKKKKSLEKiykqmEKTLKELE-NMNLEDITLNEVNEIEIEYERILI---DHIVEQINNEAK 1183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1209 KYETEINITKTTIKEISMQKEDDSKNLRNQLDRLsrENRDLKDEIVRLNDSILQATEqrrraEENALQQKACGSEIMQKK 1288
Cdd:PTZ00440 1184 KSKTIMEEIESYKKDIDQVKKNMSKERNDHLTTF--EYNAYYDKATASYENIEELTT-----EAKGLKGEANRSTNVDEL 1256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1289 QHLEIELKQVMQQRSEDNarhkQSLEEAAKTIQDKNK-----EIERLKAEFQEEAKRRWEYEN----ELSKASNRIQESK 1359
Cdd:PTZ00440 1257 KEIKLQVFSYLQQVIKEN----NKMENALHEIKNMYEflisiDSEKILKEILNSTKKAEEFSNdakkELEKTDNLIKQVE 1332
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 975830145 1360 NQCTQVVQ---------ERESLLVKIKVLEQDKARLQRLEDELNRAKSTLEA 1402
Cdd:PTZ00440 1333 AKIEQAKEhknkiygslEDKQIDDEIKKIEQIKEEISNKRKEINKYLSNIKS 1384
|
|
| PLN03229 |
PLN03229 |
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional |
1017-1327 |
3.57e-03 |
|
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Pssm-ID: 178768 [Multi-domain] Cd Length: 762 Bit Score: 42.53 E-value: 3.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 LSEMLKSLE---DLKLKNTKIEV-LEEELRLARD----ANSENCNKNKFLDQNLQKYQAECSQFKAKLASLEELKRQAEL 1088
Cdd:PLN03229 460 LNEMIEKLKkeiDLEYTEAVIAMgLQERLENLREefskANSQDQLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLDM 539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1089 --DGKSAKQNLdkcygqikELNEKITRLTYEIEDekrRRKSVEDRFDQQkndydqlqkarqcekenlgwQKLESEKAI-- 1164
Cdd:PLN03229 540 lnEFSRAKALS--------EKKSKAEKLKAEINK---KFKEVMDRPEIK--------------------EKMEALKAEva 588
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1165 KEKEYEIERLRVLLQEEGTR-KREYENELAKVRNHYNEEMSNLR-NKYETEINITKTTIKE-ISMQKEDDSKNLRNQLDR 1241
Cdd:PLN03229 589 SSGASSGDELDDDLKEKVEKmKKEIELELAGVLKSMGLEVIGVTkKNKDTAEQTPPPNLQEkIESLNEEINKKIERVIRS 668
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1242 LSRENR--DLKDEIVRLNDSILQATEQRRRAEENALQQKACG----SEIMQKKQHLEIELKQVMQQRSEDNARHKQ-SLE 1314
Cdd:PLN03229 669 SDLKSKieLLKLEVAKASKTPDVTEKEKIEALEQQIKQKIAEalnsSELKEKFEELEAELAAARETAAESNGSLKNdDDK 748
|
330
....*....|...
gi 975830145 1315 EAAKTIQDKNKEI 1327
Cdd:PLN03229 749 EEDSKEDGSRVEV 761
|
|
| MscS_porin |
pfam12795 |
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ... |
1304-1491 |
4.38e-03 |
|
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
Pssm-ID: 432790 [Multi-domain] Cd Length: 238 Bit Score: 41.13 E-value: 4.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1304 EDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEY------ENELSKASNRIQESKNQCTQV---VQERESLLV 1374
Cdd:pfam12795 37 QRAAAYQKALDDAPAELRELRQELAALQAKAEAAPKEILASlsleelEQRLLQTSAQLQELQNQLAQLnsqLIELQTRPE 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1375 KI-KVLEQDKARLQRLEDELNRAKSTLEAETRVKQrlecEKQQIQNDLNQWKTQYSRKEEAIrkIESEREKSEREKNSLR 1453
Cdd:pfam12795 117 RAqQQLSEARQRLQQIRNRLNGPAPPGEPLSEAQR----WALQAELAALKAQIDMLEQELLS--NNNRQDLLKARRDLLT 190
|
170 180 190
....*....|....*....|....*....|....*...
gi 975830145 1454 SEIERLQAEIKRIEERCRRKLEDSTRETQSQLETERSR 1491
Cdd:pfam12795 191 LRIQRLEQQLQALQELLNEKRLQEAEQAVAQTEQLAEE 228
|
|
| Golgin_A5 |
pfam09787 |
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ... |
1062-1252 |
4.78e-03 |
|
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.
Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 41.67 E-value: 4.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1062 LQKYQAECSQFKAKLASLEElKRQAELDgksakqnldkcygqikELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDYDQ 1141
Cdd:pfam09787 63 IQKLRGQIQQLRTELQELEA-QQQEEAE----------------SSREQLQELEEQLATERSARREAEAELERLQEELRY 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1142 LqkarqceKENLGWQKLESEKAIKEKEYEIERLRVLL---QEEGTRKREYENELAKVrnhyneemsnlrnkyeTEINITK 1218
Cdd:pfam09787 126 L-------EEELRRSKATLQSRIKDREAEIEKLRNQLtskSQSSSSQSELENRLHQL----------------TETLIQK 182
|
170 180 190
....*....|....*....|....*....|....
gi 975830145 1219 TTIKEiSMQKEDDSknLRNQLDRLSRENRDLKDE 1252
Cdd:pfam09787 183 QTMLE-ALSTEKNS--LVLQLERMEQQIKELQGE 213
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1288-1502 |
5.66e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 42.21 E-value: 5.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1288 KQHLE--IELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRW------EYENELSKASNRIQESK 1359
Cdd:PRK11281 62 QQDLEqtLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLstlslrQLESRLAQTLDQLQNAQ 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1360 NQCTQVvqerESLLVKI--------KVLEQDKARLQRLEDELNR---AKSTLEAETRVKqrLECEKQQI--QNDLNQWKT 1426
Cdd:PRK11281 142 NDLAEY----NSQLVSLqtqperaqAALYANSQRLQQIRNLLKGgkvGGKALRPSQRVL--LQAEQALLnaQNDLQRKSL 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1427 Q---------YSRKEEAIRKIeserekserekNSLRSEIERLQAEI--KRIEERcrrklEDSTRETQSQLETERSR---- 1491
Cdd:PRK11281 216 EgntqlqdllQKQRDYLTARI-----------QRLEHQLQLLQEAInsKRLTLS-----EKTVQEAQSQDEAARIQanpl 279
|
250
....*....|....
gi 975830145 1492 YQREID---KLRQR 1502
Cdd:PRK11281 280 VAQELEinlQLSQR 293
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1233-1495 |
5.99e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 41.44 E-value: 5.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1233 KNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQS 1312
Cdd:pfam13868 31 KKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERI 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1313 LEEAAKTIQDKNKEIERLKAEFQE--EAKRRWEyenELSKASNRIQESKNQCTQVVQERESLLVKIKVLEQDKAR---LQ 1387
Cdd:pfam13868 111 QEEDQAEAEEKLEKQRQLREEIDEfnEEQAEWK---ELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKereIA 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1388 RLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQwktqysrKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIE 1467
Cdd:pfam13868 188 RLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKE-------REEAEKKARQRQELQQAREEQIELKERRLAEEAEREE 260
|
250 260
....*....|....*....|....*....
gi 975830145 1468 ERCRRKLED-STRETQSQLETERSRYQRE 1495
Cdd:pfam13868 261 EEFERMLRKqAEDEEIEQEEAEKRRMKRL 289
|
|
| RNase_Y_N |
pfam12072 |
RNase Y N-terminal region; |
1222-1361 |
6.22e-03 |
|
RNase Y N-terminal region;
Pssm-ID: 463456 [Multi-domain] Cd Length: 201 Bit Score: 40.25 E-value: 6.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1222 KEISMQKEDDSKNLRNQLDRlsrENRDLKDEIVRLNDSILQATEQRRRAEEnALQQKacgseimqkkqhlEIELKQVMQQ 1301
Cdd:pfam12072 52 KEALLEAKEEIHKLRAEAER---ELKERRNELQRQERRLLQKEETLDRKDE-SLEKK-------------EESLEKKEKE 114
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 975830145 1302 RSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRW--EYENELSK-ASNRIQESKNQ 1361
Cdd:pfam12072 115 LEAQQQQLEEKEEELEELIEEQRQELERISGLTSEEAKEILldEVEEELRHeAAVMIKEIEEE 177
|
|
| ERM_helical |
pfam20492 |
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ... |
1385-1489 |
6.32e-03 |
|
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.
Pssm-ID: 466641 [Multi-domain] Cd Length: 120 Bit Score: 38.75 E-value: 6.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1385 RLQRLEDELNRAKSTLEAETRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIK 1464
Cdd:pfam20492 14 RLKQYEEETKKAQEELEESEETAEELEEERRQAEEEAERLEQKRQEAEEEKERLEESAEMEAEEKEQLEAELAEAQEEIA 93
|
90 100
....*....|....*....|....*
gi 975830145 1465 RIEERCRRKlEDSTRETQSQLETER 1489
Cdd:pfam20492 94 RLEEEVERK-EEEARRLQEELEEAR 117
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1233-1332 |
6.81e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 41.77 E-value: 6.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1233 KNLRNQLDRLSRENRDLKDEIvrlndsilqaTEQRRRAEEnaLQQKacgSEIMQKKQHLEIELKQVMQQRSEDNARHKQS 1312
Cdd:COG2433 416 RRLEEQVERLEAEVEELEAEL----------EEKDERIER--LERE---LSEARSEERREIRKDREISRLDREIERLERE 480
|
90 100
....*....|....*....|
gi 975830145 1313 LEEAAKTIQDKNKEIERLKA 1332
Cdd:COG2433 481 LEEERERIEELKRKLERLKE 500
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
1017-1361 |
8.05e-03 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 41.36 E-value: 8.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1017 LSEMLKSLEDLKLKNTKIEVLEEELRLARDANSENC-NKNKFLDQNLQKYQAECSQFkaklaslEELKrqaeldgksAKQ 1095
Cdd:PRK04778 128 LQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFgPALDELEKQLENLEEEFSQF-------VELT---------ESG 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1096 NLDKCYGQIKELNEKITRLTYEIEDekrrrksVEDRFDQQKNDY-DQLQK----ARQCEKENLGWQKLESEKAIKEKEYE 1170
Cdd:PRK04778 192 DYVEAREILDQLEEELAALEQIMEE-------IPELLKELQTELpDQLQElkagYRELVEEGYHLDHLDIEKEIQDLKEQ 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1171 IERLRVLLQEEGTRKREYENElakvrnHYNEEMSNLRNKYETEINITKTTIKEIsmqkeddsKNLRNQLDRLSRENRDLK 1250
Cdd:PRK04778 265 IDENLALLEELDLDEAEEKNE------EIQERIDQLYDILEREVKARKYVEKNS--------DTLPDFLEHAKEQNKELK 330
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1251 DEIVRLNDS-ILQATE-QRRRAEENALQqkacgseiMQKKQHLEIelkqvmQQRSEDNARHKQSLEEAAKTIQDKNKEIE 1328
Cdd:PRK04778 331 EEIDRVKQSyTLNESElESVRQLEKQLE--------SLEKQYDEI------TERIAEQEIAYSELQEELEEILKQLEEIE 396
|
330 340 350
....*....|....*....|....*....|...
gi 975830145 1329 RLKAEFQEEAKRRWEYENElskASNRIQESKNQ 1361
Cdd:PRK04778 397 KEQEKLSEMLQGLRKDELE---AREKLERYRNK 426
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1261-1470 |
9.05e-03 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 40.98 E-value: 9.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1261 LQATEQRRRAEE--NALQQKAcgsEIMQKKQHLEIELKQVMQQRSEDNARHKQSlEEAAKTIQDKNKEIERLKAEFQEEA 1338
Cdd:TIGR02794 57 QQKKPAAKKEQErqKKLEQQA---EEAEKQRAAEQARQKELEQRAAAEKAAKQA-EQAAKQAEEKQKQAEEAKAKQAAEA 132
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1339 KRRWEYENELSKAsnriQESKNQCTQVVQERESLLVKIKVLE---------------QDKARLQRLEDELNRAKSTLEAE 1403
Cdd:TIGR02794 133 KAKAEAEAERKAK----EEAAKQAEEEAKAKAAAEAKKKAEEakkkaeaeakakaeaEAKAKAEEAKAKAEAAKAKAAAE 208
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 975830145 1404 TRVKQRLECEKQQIQNDLNQWKTQYSRKEEAIRKIESEREKSEREKNSLRSEIERLQAEIKRIEERC 1470
Cdd:TIGR02794 209 AAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDKYAAIIQQAIQQN 275
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1060-1339 |
9.24e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.04 E-value: 9.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1060 QNLQKYQAECSQFKAKLASLEELKRQAELDGKSAKQNLDKCYGQIKELNEKITRLTYEIEDEKRRRKSVEDRFDQQKNDY 1139
Cdd:COG4372 38 FELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEEL 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1140 DQLQKARQC---EKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRN-HYNEEMSNLRNKYETEIN 1215
Cdd:COG4372 118 EELQKERQDleqQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEaEAEQALDELLKEANRNAE 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1216 ITKTTIKEISMQKEDDSKNLRNQLDRLSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACGSEIMQKKQHLEIEL 1295
Cdd:COG4372 198 KEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEE 277
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 975830145 1296 KQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAK 1339
Cdd:COG4372 278 LEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAAL 321
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
825-1383 |
9.76e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 41.36 E-value: 9.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 825 LATMKTELQKAQQIHSQTSQQYPLYDLDLGKFGeKVTQLTDRW-QRIDKQIDFRLWDLEKQIKQLRNYRDNYQAFCKWLY 903
Cdd:pfam12128 339 IETAAADQEQLPSWQSELENLEERLKALTGKHQ-DVTAKYNRRrSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEDDL 417
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 904 DAkrrqdsLESmkfgdsntvmrFLNEQKNlhseiSGKRDKSEEvqkiAELCANSIKDYELQLASYTSGLETLLNIPIKRT 983
Cdd:pfam12128 418 QA------LES-----------ELREQLE-----AGKLEFNEE----EYRLKSRLGELKLRLNQATATPELLLQLENFDE 471
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 984 MIQSPSGVILQEAADVHARYIELLT----------RSGDYYRFLSEMLKSLEDLKLK-----NTKIEVLEEEL-----RL 1043
Cdd:pfam12128 472 RIERAREEQEAANAEVERLQSELRQarkrrdqaseALRQASRRLEERQSALDELELQlfpqaGTLLHFLRKEApdweqSI 551
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1044 ARDANSE---NCNKNKFLDQNLQKyqAECSQFKAKL-----------ASLEELKR---QAELDGKSAKQNLDKCYGQIKE 1106
Cdd:pfam12128 552 GKVISPEllhRTDLDPEVWDGSVG--GELNLYGVKLdlkridvpewaASEEELRErldKAEEALQSAREKQAAAEEQLVQ 629
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1107 LNEKITRLTYEIEDEKRRRKSVEDR----FDQQKNDYDQLQKARQCEKENLG--WQKLESEKAIKEKEyeierLRVLLQE 1180
Cdd:pfam12128 630 ANGELEKASREETFARTALKNARLDlrrlFDEKQSEKDKKNKALAERKDSANerLNSLEAQLKQLDKK-----HQAWLEE 704
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1181 EGTRKREYENELAKVRNHYNEEMSNLRNKYETEINITKTTIK-EISMQKEDDSKNLR------NQLDRLSRENRDLKDEI 1253
Cdd:pfam12128 705 QKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARRSGAKaELKALETWYKRDLAslgvdpDVIAKLKREIRTLERKI 784
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 975830145 1254 VRLNDSILQATEQRRRAEENALQQK----ACGSEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIER 1329
Cdd:pfam12128 785 ERIAVRRQEVLRYFDWYQETWLQRRprlaTQLSNIERAISELQQQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRG 864
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*
gi 975830145 1330 LKAEFQEEAKRRWEYENElsKASNRIQESKNQCTQVVQERESLLVKI-KVLEQDK 1383
Cdd:pfam12128 865 LRCEMSKLATLKEDANSE--QAQGSIGERLAQLEDLKLKRDYLSESVkKYVEHFK 917
|
|
|