NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2217316421|ref|XP_047293350|]
View 

microtubule cross-linking factor 1 isoform X10 [Homo sapiens]

Protein Classification

SOGA and DUF4482 domain-containing protein( domain architecture ID 13530637)

protein containing domains SMC_prok_B, SOGA, and DUF4482

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1266-1389 1.36e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


:

Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 181.42  E-value: 1.36e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421 1266 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1333
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421 1334 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1389
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 1.67e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 139.36  E-value: 1.67e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 3.78e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.20  E-value: 3.78e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 super family cl33720
large tegument protein UL36; Provisional
12-326 1.15e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 1.15e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 2217316421  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 1.62e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.72  E-value: 1.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1266-1389 1.36e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 181.42  E-value: 1.36e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421 1266 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1333
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421 1334 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1389
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 1.67e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 139.36  E-value: 1.67e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 3.78e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.20  E-value: 3.78e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-326 1.15e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 1.15e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 2217316421  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 1.62e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.72  E-value: 1.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
345-668 4.14e-10

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 63.77  E-value: 4.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQD 424
Cdd:COG4372     45 EELEQLREELEQAREELEQLEE--------ELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE-------LESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  425 LKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDS-- 502
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDEll 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  503 ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPST---REAELKLRLKLVEEEAN 579
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEellEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  580 ILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHN 659
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349

                   ....*....
gi 2217316421  660 RQLTHELSK 668
Cdd:COG4372    350 LLDNDVLEL 358
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
350-542 1.22e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 60.12  E-value: 1.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDsyleedvyqlqELRRELDRANKNCRILQYRLRKAEqKSLKVAETG------QVD--GELIRSL 421
Cdd:pfam05483  546 LRDELESVREEFIQKGD-----------EVKCKLDKSEENARSIEYEVLKKE-KQMKILENKcnnlkkQIEnkNKNIEEL 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQ---QMI-----EVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:pfam05483  614 HQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQkfeEIIdnyqkEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 2217316421  494 kktfnqddsADLRCQLQFAkEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:pfam05483  694 ---------IDKRCQHKIA-EMVALMEKHKHQYDKIIEERDSELGLYKN 732
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
350-670 3.38e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.92  E-value: 3.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLR---------KAEQKSL--KVAETGQVDG--- 415
Cdd:PRK03918   219 LREELEKLEKEVKE-----LEELKEEIEELEKELESLEGSKRKLEEKIReleerieelKKEIEELeeKVKELKELKEkae 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  416 ---ELIRSLEQDLKVAKDVSVRL---HHELKTVEEKRAKAEDENETLRqqmiEVEISKQALQNELERLKESSLK----RR 485
Cdd:PRK03918   294 eyiKLSEFYEEYLDELREIEKRLsrlEEEINGIEERIKELEEKEERLE----ELKKKLKELEKRLEELEERHELyeeaKA 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  486 STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREA 565
Cdd:PRK03918   370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEH 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  566 ELKLRLKLVEEEANILGRKIvELEVENRGLKAEMEDMRGQQEREgpgRDHAPSIPTSPFGDSLESSTEL--RRHLQFVEE 643
Cdd:PRK03918   450 RKELLEEYTAELKRIEKELK-EIEEKERKLRKELRELEKVLKKE---SELIKLKELAEQLKELEEKLKKynLEELEKKAE 525
                          330       340
                   ....*....|....*....|....*..
gi 2217316421  644 EAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   526 EYEKLKEKLIKLKGEIKSLKKELEKLE 552
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
13-319 3.36e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.44  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   13 PDAKLQPPGQHHRHHHLHPVAERrrLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLA 92
Cdd:COG5180    158 SDPILAKDPDGDSASTLPPPAEK--LDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   93 APGRLSRRSGgvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAA-AKGRKAKRGSRAPPA 171
Cdd:COG5180    236 PPSTSEARSR--PATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAEtARPIDVKGVASAPPA 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  172 RTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRG 248
Cdd:COG5180    314 TRPVRPPGGARdpgTPRPGQPTERPAGVPEAASDAGQPPSAYP----PAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNG 389
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  249 APTPSPaARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAedvrgRSPPERPVPGTPKEP 319
Cdd:COG5180    390 APQPGL-GRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGAGQGPKAD-----FVPGDAESVSGPAGL 454
BAR cd07307
The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects ...
376-587 8.67e-04

The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects membrane curvature; BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. Mutations in BAR containing proteins have been linked to diseases and their inactivation in cells leads to altered membrane dynamics. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysins and endophilins, among others. BAR domains are also frequently found alongside domains that determine lipid specificity, such as the Pleckstrin Homology (PH) and Phox Homology (PX) domains which are present in beta centaurins (ACAPs and ASAPs) and sorting nexins, respectively. A FES-CIP4 Homology (FCH) domain together with a coiled coil region is called the F-BAR domain and is present in Pombe/Cdc15 homology (PCH) family proteins, which include Fes/Fes tyrosine kinases, PACSIN or syndapin, CIP4-like proteins, and srGAPs, among others. The Inverse (I)-BAR or IRSp53/MIM homology Domain (IMD) is found in multi-domain proteins, such as IRSp53 and MIM, that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The I-BAR domain induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. BAR domains that also serve as protein interaction domains include those of arfaptin and OPHN1-like proteins, among others, which bind to Rac and Rho GAP domains, respectively.


Pssm-ID: 153271 [Multi-domain]  Cd Length: 194  Bit Score: 42.43  E-value: 8.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAEtgqvdgelirSLEqdlKVAKDVSVRLHHELKTVEEKRAKAEDENET 455
Cdd:cd07307      2 LDELEKLLKKLIKDTKKLLDSLKELPAAAEKLSE----------ALQ---ELGKELPDLSNTDLGEALEKFGKIQKELEE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  456 LRQQMIE------VEISKQALQNELERLKESSLKRRSTREMYkekktfnqdDSAdlrcqlqFAKEEAflMRKKMAKLGRE 529
Cdd:cd07307     69 FRDQLEQklenkvIEPLKEYLKKDLKEIKKRRKKLDKARLDY---------DAA-------REKLKK--LRKKKKDSSKL 130
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  530 KdELEQELQKYKSLYGDVdsplptgeaggppstrEAELKLRLKLVEEEANILGRKIVE 587
Cdd:cd07307    131 A-EAEEELQEAKEKYEEL----------------REELIEDLNKLEEKRKELFLSLLL 171
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
434-601 1.97e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.83  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  434 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 511
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  512 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 2217316421  592 NRGLKAEMED 601
Cdd:COG1579    165 REELAAKIPP 174
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
355-561 4.77e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 4.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  355 DELRAEMEEMRDSYLEEdvyQLQELRRELDRA---------NKNCRILQ---YRLRKAEQKSlKVAETGQVDGELIRSLE 422
Cdd:NF033838    53 NESQKEHAKEVESHLEK---ILSEIQKSLDKRkhtqnvalnKKLSDIKTeylYELNVLKEKS-EAELTSKTKKELDAAFE 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 QDLKVAKDVSVRLHHELKTVEEKRAKAEDENE-------TLRQQMIEVEISKQALQnelerLKESSLkrrstrEMYKEKK 495
Cdd:NF033838   129 QFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEedrrnypTNTYKTLELEIAESDVE-----VKKAEL------ELVKEEA 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217316421  496 TFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPS 561
Cdd:NF033838   198 KEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKR 263
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
56-329 7.97e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 7.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   56 PAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRS---------GGVP---GAKDKPPPGAGARAAGGA 123
Cdd:pfam17823  115 LAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAaiaaasaphAASPaprTAASSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  124 KAALGSRRAARVAPAEPLSRAGKPPGAepPSAAAKgrKAKRGSRAPPART----VGPPTPA--ARIPAVTLAVTSVAGSP 197
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGH--PAAGTA--LAAVGNSSPAAGTvtaaVGTVTPAalATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  198 ARCSRISHTDSSSdlSDCPSEPLSDEqrllPAASSDAES-GTGS--SDREP---PRGAPTPSPAARGAPPGSPEPPALLA 271
Cdd:pfam17823  271 NMGDPHARRLSPA--KHMPSDTMARN----PAAPMGAQAqGPIIqvSTDQPvhnTAGEPTPSPSNTTLEPNTPKSVASTN 344
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  272 APLAAGACPGGRSiPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVP 329
Cdd:pfam17823  345 LAVVTTTKAQAKE-PSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
1266-1389 1.36e-52

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 181.42  E-value: 1.36e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421 1266 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 1333
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421 1334 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1389
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
501-595 1.67e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 139.36  E-value: 1.67e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 580
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  581 LGRKIVELEVENRGL 595
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
630-722 3.78e-38

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 138.20  E-value: 3.78e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  630 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 707
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 2217316421  708 LSGKVLKLQHENHAL 722
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-326 1.15e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.75  E-value: 1.15e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFL----KDLHARPAAPGPAVP---------SSGRAPAPAAPRSP 78
Cdd:PHA03247  2563 APDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRApvddRGDPRGPAPPSPLPPdthapdpppPSPSPAANEPDPHP 2642
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   79 NLAGKAPPSPGSLAAPGRLSR----RSGGVPGAKDKPPpgagaraaggakaALGSRRAARvAPAEPLSRAGKPPGAEPPS 154
Cdd:PHA03247  2643 PPTVPPPERPRDDPAPGRVSRprraRRLGRAAQASSPP-------------QRPRRRAAR-PTVGSLTSLADPPPPPPTP 2708
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  155 AAAKgRKAKRGSRAPP----ARTVGPPTPAARIPAVTLAVTSVAGSPARCSRiSHTDSSSDLSDCPSEPLSDEQRLLPAA 230
Cdd:PHA03247  2709 EPAP-HALVSATPLPPgpaaARQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  231 SSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVS------GGFAGPGvaEDVRG 304
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPppslplGGSVAPG--GDVRR 2864
                          330       340
                   ....*....|....*....|....
gi 2217316421  305 RSPPERPV--PGTPKEPSLGEQSR 326
Cdd:PHA03247  2865 RPPSRSPAakPAAPARPPVRRLAR 2888
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-656 1.62e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.72  E-value: 1.62e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:TIGR02168  689 LEEKIAELEKALAELRKE-LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:TIGR02168  768 ERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELE 589
Cdd:TIGR02168  848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELR 914
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 656
Cdd:TIGR02168  915 RELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
7-316 1.94e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 66.35  E-value: 1.94e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    7 PAGGGAPDAklqPPGQHHRHHHLHPVAERRRLHRAPSPA-RPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PHA03307   122 PPASPPPSP---APDLSEMLRPVGSPGPPPAASPPAAGAsPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSR-RAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKR 164
Cdd:PHA03307   199 PAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSEsSGCGWGPENECPLPRPAPITLPTRIWEASGWNGP 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  165 GSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDRE 244
Cdd:PHA03307   279 SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPP 358
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  245 PPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGP-----GVAEDVRGRSPPERPVPGTP 316
Cdd:PHA03307   359 PADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGrprpsPLDAGAASGAFYARYPLLTP 435
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
345-668 4.14e-10

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 63.77  E-value: 4.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQD 424
Cdd:COG4372     45 EELEQLREELEQAREELEQLEE--------ELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE-------LESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  425 LKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDS-- 502
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDEll 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  503 ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPST---REAELKLRLKLVEEEAN 579
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEellEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  580 ILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHN 659
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349

                   ....*....
gi 2217316421  660 RQLTHELSK 668
Cdd:COG4372    350 LLDNDVLEL 358
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
351-708 4.17e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 65.47  E-value: 4.17e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  351 KDELDELRAEMEEMrDSYLEEDVYQLQELRRELDRANKNcRILQYRLRKAEQKSL----------KVAETGQVDgELIRS 420
Cdd:TIGR02169  176 LEELEEVEENIERL-DLIIDEKRQQLERLRREREKAERY-QALLKEKREYEGYELlkekealerqKEAIERQLA-SLEEE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  421 LEQDLKVAKDVSVRLHH---ELKTVEEK-RAKAEDENETLRQQM--IEVEISK-----QALQNELERLKESSLKRRSTRE 489
Cdd:TIGR02169  253 LEKLTEEISELEKRLEEieqLLEELNKKiKDLGEEEQLRVKEKIgeLEAEIASlersiAEKERELEDAEERLAKLEAEID 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  490 MYKEKKTfnqddsaDLRCQLQfakEEaflmRKKMAKLGREKDELEQELQKYKSLYGDVDsplptgeaggppsTREAELKL 569
Cdd:TIGR02169  333 KLLAEIE-------ELEREIE---EE----RKRRDKLTEEYAELKEELEDLRAELEEVD-------------KEFAETRD 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  570 RLKLVEEEANILGRKIVELEVENRGLKAEMedmrgqQEREGPGRDHAPSIPtspfgDSLESSTELrrhlqfvEEEAELLR 649
Cdd:TIGR02169  386 ELKDYREKLEKLKREINELKRELDRLQEEL------QRLSEELADLNAAIA-----GIEAKINEL-------EEEKEDKA 447
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2217316421  650 RSISEIEDHNRQLTHELSKFKFEpprepgwlgegASPGAGGGAPLQEELKSARLQISEL 708
Cdd:TIGR02169  448 LEIKKQEWKLEQLAADLSKYEQE-----------LYDLKEEYDRVEKELSKLQRELAEA 495
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-319 8.87e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 8.87e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   10 GGAPDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPG 89
Cdd:PHA03247  2639 DPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSA 2718
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   90 SLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAAR---VAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGS 166
Cdd:PHA03247  2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESL 2798
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  167 RAPPARTVGPPTPAARIPAVTLAVTSVAGSPArcsrishtdSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSsdREPP 246
Cdd:PHA03247  2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPP---------PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR--RRPP 2867
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217316421  247 RGAPTPSPAARGAPPGSPEPPALLAAPLAAGACP--GGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEP 319
Cdd:PHA03247  2868 SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPpdQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
352-775 3.69e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 3.69e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  352 DELDELRAEMEEmrdsyleedvyQLQELRRELDRANKNcRILQYRLRKAEQKSLKVAetgqvdgelIRSLEQDLKVAKDV 431
Cdd:COG1196    189 ERLEDILGELER-----------QLEPLERQAEKAERY-RELKEELKELEAELLLLK---------LRELEAELEELEAE 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  432 SVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQF 511
Cdd:COG1196    248 LEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  512 AKEEAFLMRKKMAKLGREKDELEQELQkykslygdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELE----------------------EAEAELAEAEEALLEAEAELAEAEEELEELA 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  592 NRGLKAEMEDMRGQQEREGPGRDHApsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKfkf 671
Cdd:COG1196    386 EELLEALRAAAELAAQLEELEEAEE------------ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE--- 450
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  672 epprepgwlgegaspgaggGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRApsprdsdAESDA 751
Cdd:COG1196    451 -------------------EAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------EADYE 504
                          410       420
                   ....*....|....*....|....
gi 2217316421  752 GKKESDGEESRLPQPKREGPVGGE 775
Cdd:COG1196    505 GFLEGVKAALLLAGLRGLAGAVAV 528
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
39-331 4.82e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 61.73  E-value: 4.82e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   39 HRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLS--RRSGGVPGAKDKPPPGAG 116
Cdd:PHA03307    77 TEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEmlRPVGSPGPPPAASPPAAG 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  117 ARAAGGAKAALGSRRAARVAP-----AEPLSRAGKPPGAEPPSAAAKGRKAKRGS----RAPPARTVGPPTPAARIPAvt 187
Cdd:PHA03307   157 ASPAAVASDAASSRQAALPLSspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSpisaSASSPAPAPGRSAADDAGA-- 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  188 lAVTSVAGSPARCSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPP 267
Cdd:PHA03307   235 -SSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPR 313
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  268 ALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVPAA 331
Cdd:PHA03307   314 ASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
350-725 8.00e-09

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 60.80  E-value: 8.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRD------SYLEEDVYQLQELRRELDRANKNCRILQYRLR--KAEQKSLKvaetGQVDGELIRSL 421
Cdd:TIGR04523  237 KQQEINEKTTEISNTQTqlnqlkDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNqlKSEISDLN----NQKEQDWNKEL 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERlKESSLKR-RSTREMYKEKKTFNQD 500
Cdd:TIGR04523  313 KSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEE-KQNEIEKlKKENQSYKQEIKNLES 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  501 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEaggppsTREAELKLRLKLVEEEANI 580
Cdd:TIGR04523  392 QINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLT------NQDSVKELIIKNLDNTRES 465
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  581 LGRKIVELEVENRGLKAEMEDMrgQQEREGPGRDHapSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNR 660
Cdd:TIGR04523  466 LETQLKVLSRSINKIKQNLEQK--QKELKSKEKEL--KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKIS 541
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  661 QLTHELSKFKFepprepgwlgegaspgagggaplqeELKSARL--QISELSGKVLKLQHENHALLSN 725
Cdd:TIGR04523  542 DLEDELNKDDF-------------------------ELKKENLekEIDEKNKEIEELKQTQKSLKKK 583
PHA03247 PHA03247
large tegument protein UL36; Provisional
8-330 8.49e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 8.49e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    8 AGGGAPDAKLQPPGQHHRHHHLHPVAERRRlHRAPSPARPflkdlharPAAPGPAVPSSGRapapaaprspnlAGKAPPS 87
Cdd:PHA03247  2728 ARQASPALPAAPAPPAVPAGPATPGGPARP-ARPPTTAGP--------PAPAPPAAPAAGP------------PRRLTRP 2786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   88 PGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPP--SAAAKGRKAKRG 165
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRP 2866
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  166 -SRAPPARTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSssdlsdcPSEPLSDEQRLLPAASSDAEsgtgss 241
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRrlaRPAVSRSTESFALPPDQPERPPQPQA-------PPPPQPQPQPPPPPQPQPPP------ 2933
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  242 drePPRGAPTPSPAARGAPPGSPEppallaaplAAGACPGGRSipsgvsgGFAGPGVAEDVRGRSPPERPVPGTPKEPSL 321
Cdd:PHA03247  2934 ---PPPPRPQPPLAPTTDPAGAGE---------PSGAVPQPWL-------GALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*....
gi 2217316421  322 GEQSRLVPA 330
Cdd:PHA03247  2995 PLTGHSLSR 3003
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
350-542 1.22e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 60.12  E-value: 1.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDsyleedvyqlqELRRELDRANKNCRILQYRLRKAEqKSLKVAETG------QVD--GELIRSL 421
Cdd:pfam05483  546 LRDELESVREEFIQKGD-----------EVKCKLDKSEENARSIEYEVLKKE-KQMKILENKcnnlkkQIEnkNKNIEEL 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQ---QMI-----EVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:pfam05483  614 HQENKALKKKGSAENKQLNAYEIKVNKLELELASAKQkfeEIIdnyqkEIEDKKISEEKLLEEVEKAKAIADEAVKLQKE 693
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 2217316421  494 kktfnqddsADLRCQLQFAkEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:pfam05483  694 ---------IDKRCQHKIA-EMVALMEKHKHQYDKIIEERDSELGLYKN 732
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
370-672 2.94e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 59.30  E-value: 2.94e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  370 EEDVYQLQELRRELDRANKNCRI-----LQYRLRKAEQKSLKVAETGQVDGELIRSLEQdLKVAKDVSVRLHHELktvEE 444
Cdd:TIGR02168  185 RENLDRLEDILNELERQLKSLERqaekaERYKELKAELRELELALLVLRLEELREELEE-LQEELKEAEEELEEL---TA 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  445 KRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRRstREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKK 522
Cdd:TIGR02168  261 ELQELEEKLEELRLEVSELEEEIEELQKELYALANeiSRLEQQ--KQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  523 MAKLGREKDELEQELQkykslygDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDM 602
Cdd:TIGR02168  339 LAELEEKLEELKEELE-------SLEAELEELEA------ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  603 RGQQER--------EGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFE 672
Cdd:TIGR02168  406 EARLERledrrerlQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERE 483
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
350-670 3.38e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.92  E-value: 3.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLR---------KAEQKSL--KVAETGQVDG--- 415
Cdd:PRK03918   219 LREELEKLEKEVKE-----LEELKEEIEELEKELESLEGSKRKLEEKIReleerieelKKEIEELeeKVKELKELKEkae 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  416 ---ELIRSLEQDLKVAKDVSVRL---HHELKTVEEKRAKAEDENETLRqqmiEVEISKQALQNELERLKESSLK----RR 485
Cdd:PRK03918   294 eyiKLSEFYEEYLDELREIEKRLsrlEEEINGIEERIKELEEKEERLE----ELKKKLKELEKRLEELEERHELyeeaKA 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  486 STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREA 565
Cdd:PRK03918   370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEH 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  566 ELKLRLKLVEEEANILGRKIvELEVENRGLKAEMEDMRGQQEREgpgRDHAPSIPTSPFGDSLESSTEL--RRHLQFVEE 643
Cdd:PRK03918   450 RKELLEEYTAELKRIEKELK-EIEEKERKLRKELRELEKVLKKE---SELIKLKELAEQLKELEEKLKKynLEELEKKAE 525
                          330       340
                   ....*....|....*....|....*..
gi 2217316421  644 EAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   526 EYEKLKEKLIKLKGEIKSLKKELEKLE 552
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
443-769 3.60e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 58.93  E-value: 3.60e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  443 EEKRAKAEDENETLRQQMIEVEISKQALQNELERLKesslKRRSTREMYKEKKTFNQDDSADLRC-QLQFAKEEAFLMRK 521
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLR----REREKAERYQALLKEKREYEGYELLkEKEALERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  522 KMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI----------------LGRKI 585
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIaslersiaekereledAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  586 VELEVENRGLKAEMEDMRGQQEREGPGRDHApsipTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHE 665
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKL----TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  666 LSKFKFEPPRepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRcdLAAHLGlrapsprDS 745
Cdd:TIGR02169  401 INELKRELDR----LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--LAADLS-------KY 467
                          330       340
                   ....*....|....*....|....
gi 2217316421  746 DAESDAGKKESDGEESRLPQPKRE 769
Cdd:TIGR02169  468 EQELYDLKEEYDRVEKELSKLQRE 491
PHA03247 PHA03247
large tegument protein UL36; Provisional
85-331 5.34e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.41  E-value: 5.34e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   85 PPSPGSLAAPGRLSRRSggVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPlsrAGKPPGAEPPSAAAKGRKAKR 164
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRS--VPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD---RGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  165 ---GSRAPPARTVGPPTPAARIPAvtlAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSS 241
Cdd:PHA03247  2626 pppPSPSPAANEPDPHPPPTVPPP---ERPRDDPAPGRVSRPRRARRLGR----AAQASSPPQRPRRRAARPTVGSLTSL 2698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  242 DREPPRGaPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSL 321
Cdd:PHA03247  2699 ADPPPPP-PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                          250
                   ....*....|
gi 2217316421  322 GEQSRLVPAA 331
Cdd:PHA03247  2778 GPPRRLTRPA 2787
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-609 6.17e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 58.02  E-value: 6.17e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAK 429
Cdd:COG1196    265 LEAELEELRLELEELELE-LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEE-------LAELEEELEELE 336
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:COG1196    337 EELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERL 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAFLMRKKMAKLGREKDELEQELQkykslygdvdsplptgeaggppstreaELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    417 ERLEEELEELEEALAELEEEEEEEEEALE---------------------------EAAEEEAELEEEEEALLELLAELL 469
                          250       260
                   ....*....|....*....|
gi 2217316421  590 VENRGLKAEMEDMRGQQERE 609
Cdd:COG1196    470 EEAALLEAALAELLEELAEA 489
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
7-264 7.79e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 57.55  E-value: 7.79e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    7 PAGGGAPDAKLQPP---GQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGK 83
Cdd:PRK07003   362 VTGGGAPGGGVPARvagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDD 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   84 APPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAK 163
Cdd:PRK07003   442 AADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  164 RGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCS--RIShtdsssdlsdcpseplSDEQRLLPAASSDAESGTGSS 241
Cdd:PRK07003   522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALDVLRNAgmRVS----------------SDRGARAAAAAKPAAAPAAAP 585
                          250       260
                   ....*....|....*....|....
gi 2217316421  242 DREPPRGA-PTPSPAARGAPPGSP 264
Cdd:PRK07003   586 KPAAPRVAvQVPTPRARAATGDAP 609
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
345-661 8.65e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.77  E-value: 8.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKV-AETGQVDGElIRSLEQ 423
Cdd:TIGR02169  695 SELRRIENRLDELSQELSDASRK-IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVkSELKELEAR-IEELEE 772
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  424 DLKVAK----DVSVRL-HHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFN 498
Cdd:TIGR02169  773 DLHKLEealnDLEARLsHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI 852
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  499 QDDSADLRCQL-----QFAKEEAFL--MRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEaggppsTREAELKLRL 571
Cdd:TIGR02169  853 EKEIENLNGKKeeleeELEELEAALrdLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKR------KRLSELKAKL 926
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  572 KLVEEEANILGRKIVELEVEN------RGLKAEMEDMRGQQEREGPGRDHAPsiptSPFGDSLESSTELRRHLQFVEEEA 645
Cdd:TIGR02169  927 EALEEELSEIEDPKGEDEEIPeeelslEDVQAELQRVEEEIRALEPVNMLAI----QEYEEVLKRLDELKEKRAKLEEER 1002
                          330
                   ....*....|....*.
gi 2217316421  646 ELLRRSISEIEDHNRQ 661
Cdd:TIGR02169 1003 KAILERIEEYEKKKRE 1018
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
133-383 1.48e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.92  E-value: 1.48e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  133 ARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSR-APPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSSD 211
Cdd:PRK07764   586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAApAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  212 LSDCPSEPLSdeqrllPAASSDAESGTGSSDRePPRGAPTPSPAARGAPPGSPEPPallaaplaagacPGGRSIPSGVSG 291
Cdd:PRK07764   666 GDGWPAKAGG------AAPAAPPPAPAPAAPA-APAGAAPAQPAPAPAATPPAGQA------------DDPAAQPPQAAQ 726
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  292 GFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVPAAEEEELLREMEELRSEndylkdELDELRAEMEEMRDSYLEE 371
Cdd:PRK07764   727 GASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE------EEEMAEDDAPSMDDEDRRD 800
                          250
                   ....*....|...
gi 2217316421  372 -DVYQLQELRREL 383
Cdd:PRK07764   801 aEEVAMELLEEEL 813
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
81-331 1.63e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.78  E-value: 1.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   81 AGKAPPS-PGSLAAPGrlsRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKG 159
Cdd:PRK07003   369 GGGVPARvAGAVPAPG---ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  160 RKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARcsrishtdsssdlSDCPSEPLSdeqrllPAASSDAESGTG 239
Cdd:PRK07003   446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAP-------------PDAAFEPAP------RAAAPSAATPAA 506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  240 SSDREPPRGAPTPSPAARGAPPgSPEPPALLAAPLAAGACPGGRSIP---------------SGVSGGFAGPGVAEDVRG 304
Cdd:PRK07003   507 VPDARAPAAASREDAPAAAAPP-APEARPPTPAAAAPAARAGGAAAAldvlrnagmrvssdrGARAAAAAKPAAAPAAAP 585
                          250       260       270
                   ....*....|....*....|....*....|
gi 2217316421  305 RSPPER---PVPgTPKEPSLGEQSRLVPAA 331
Cdd:PRK07003   586 KPAAPRvavQVP-TPRARAATGDAPPNGAA 614
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
350-662 2.06e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 56.31  E-value: 2.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLE--------EDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdGELIRSL 421
Cdd:COG4717     93 LQEELEELEEELEELEAELEElreeleklEKLLQLLPLYQELEALEAELAELPERLEELEERLEELREL----EEELEEL 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAK--------DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKE 493
Cdd:COG4717    169 EAELAELQeeleelleQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEA 248
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  494 K---------------------KTFNQDDSADLRCQLQFAkeEAFLMRKKMAKLGREKDEL----------EQELQKYKS 542
Cdd:COG4717    249 RlllliaaallallglggsllsLILTIAGVLFLVLGLLAL--LFLLLAREKASLGKEAEELqalpaleeleEEELEELLA 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  543 LYGdVDSPLPTGEAGGPPST-----------REAELKLRLKLVEEEANILG-----------RKIVELEVENRGLKAEME 600
Cdd:COG4717    327 ALG-LPPDLSPEELLELLDRieelqellreaEELEEELQLEELEQEIAALLaeagvedeeelRAALEQAEEYQELKEELE 405
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217316421  601 DMRGQQEREGPGRDHAPSIPTspfGDSLES-STELRRHLQFVEEEAELLRRSISEIEDHNRQL 662
Cdd:COG4717    406 ELEEQLEELLGELEELLEALD---EEELEEeLEELEEELEELEEELEELREELAELEAELEQL 465
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
39-317 3.00e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.95  E-value: 3.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   39 HRAPSPARPFLKDLHARPAA--PGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAP---GRLSRRSGGVPGAKDKP-P 112
Cdd:PHA03307   118 PPTPPPASPPPSPAPDLSEMlrPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPlssPEETARAPSSPPAEPPPsT 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  113 PGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRA-----PPARTVGPPTPAARIPAVT 187
Cdd:PHA03307   198 PPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENecplpRPAPITLPTRIWEASGWNG 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  188 LAVTSVAGSPARCSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARG--------- 258
Cdd:PHA03307   278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSrspspsrpp 357
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  259 --APPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPK 317
Cdd:PHA03307   358 ppADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLD 418
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
60-265 5.07e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 5.07e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   60 GPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAE 139
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  140 PLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARipavtlAVTSVAGSPARCSRISHTDSSSDLSDCPSEP 219
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAG------QADDPAAQPPQAAQGASAPSPAADDPVPLPP 743
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 2217316421  220 LSDEQRLLPAASSDAESGTGSSDREPPrgAPTPSPAARGAPPGSPE 265
Cdd:PRK07764   744 EPDDPPDPAGAPAQPPPPPAPAPAAAP--AAAPPPSPPSEEEEMAE 787
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
362-670 8.60e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 54.30  E-value: 8.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  362 EEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVD--GELIRSLEQDLKVakdvsvrlhHEL 439
Cdd:PRK03918   447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKelAEQLKELEEKLKK---------YNL 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  440 KTVEEKrakaEDENETLRQQMIEVEISKQALQNELERLKEssLKRRStREMYKEKKTfnqddsadlrcqlqfAKEEAFLM 519
Cdd:PRK03918   518 EELEKK----AEEYEKLKEKLIKLKGEIKSLKKELEKLEE--LKKKL-AELEKKLDE---------------LEEELAEL 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  520 RKKMAKLGREK-DELEQELQKYKSLYGDVDsplptgEAGGPPSTREAELKlRLKLVEEEANILGRKIVELEVENRGLKAE 598
Cdd:PRK03918   576 LKELEELGFESvEELEERLKELEPFYNEYL------ELKDAEKELEREEK-ELKKLEEELDKAFEELAETEKRLEELRKE 648
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217316421  599 MEDMR---GQQEREGPGRDHapsiptspfgdslessTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:PRK03918   649 LEELEkkySEEEYEELREEY----------------LELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-614 1.01e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 54.29  E-value: 1.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRD--SYLEEdvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKV 427
Cdd:TIGR02168  279 LEEEIEELQKELYALANeiSRLEQ---QKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELES 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  428 AKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRRSTRemykekktfNQDDSADL 505
Cdd:TIGR02168  356 LEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEArlERLEDRRER---------LQQEIEEL 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  506 RCQLQfakeeaflmRKKMAKLGREKDELEQELQKYKslygdvdsplptgeaggppsTREAELKLRLKLVEEEANILGRKI 585
Cdd:TIGR02168  427 LKKLE---------EAELKELQAELEELEEELEELQ--------------------EELERLEEALEELREELEEAEQAL 477
                          250       260       270
                   ....*....|....*....|....*....|
gi 2217316421  586 VELEVENRGLKAEMEDMRGQQER-EGPGRD 614
Cdd:TIGR02168  478 DAAERELAQLQARLDSLERLQENlEGFSEG 507
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
6-264 1.07e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 1.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    6 GPAGGGAPDAklqppgqhhrHHHLHPVAERRRLHRAPSPARPflkdlharPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PRK12323   366 GQSGGGAGPA----------TAAAAPVAQPAPAAAAPAAAAP--------APAAPPAAPAAAPAAAAAARAVAAAPARRS 427
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   86 PSPGSLAAP-GRLSRRSGGVPGAKDKPPPGAgaraaggakaalgsrrAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKR 164
Cdd:PRK12323   428 PAPEALAAArQASARGPGGAPAPAPAPAAAP----------------AAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP 491
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  165 GSRAPPARTVGPPTPAARIPAvtlavtsvagsparcsrishtDSSSDLSDCPSEPLSDEqrllpaaSSDAESGTGSSDRE 244
Cdd:PRK12323   492 ADDDPPPWEELPPEFASPAPA---------------------QPDAAPAGWVAESIPDP-------ATADPDDAFETLAP 543
                          250       260
                   ....*....|....*....|
gi 2217316421  245 PPRGAPTPSPAARGAPPGSP 264
Cdd:PRK12323   544 APAAAPAPRAAAATEPVVAP 563
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
352-542 1.32e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.77  E-value: 1.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  352 DELDELRAEMEEMRDSY--LEedvyQLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG4913    235 DDLERAHEALEDAREQIelLE----PIRELAERYAAARERLAELEYLRAALR------LWFAQRRLELLEAELEELRAEL 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DvsvRLHHELKTVEEKRAKAEDENETLRQQMIEVE-ISKQALQNELERLKESSLKRRSTREMYKE------------KKT 496
Cdd:COG4913    305 A---RLEAELERLEARLDALREELDELEAQIRGNGgDRLEQLEREIERLERELEERERRRARLEAllaalglplpasAEE 381
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2217316421  497 FN------QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:COG4913    382 FAalraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
353-542 1.36e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.91  E-value: 1.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  353 ELDELRAEMEEMRDsylEEDVYQ--LQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ-------VDGELIRSLEQ 423
Cdd:TIGR02168  268 KLEELRLEVSELEE---EIEELQkeLYALANEISRLEQQKQILRERLANLERQLEELEAQLEeleskldELAEELAELEE 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  424 DLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEV-----EISKQALQN---------ELERLKESSLKRRSTRE 489
Cdd:TIGR02168  345 KLEELKEELESLEAELEELEAELEELESRLEELEEQLETLrskvaQLELQIASLnneierleaRLERLEDRRERLQQEIE 424
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  490 MYKEKKTFN-----QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:TIGR02168  425 ELLKKLEEAelkelQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAER 482
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
346-484 1.51e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.53  E-value: 1.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  346 ENDYLKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDL 425
Cdd:TIGR02169  351 RRDKLTEEYAELKEELEDLRAELEEVDK-EFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAI 429
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  426 KVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKR 484
Cdd:TIGR02169  430 AGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKelSKLQR 490
PTZ00121 PTZ00121
MAEBL; Provisional
358-662 2.41e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.22  E-value: 2.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  358 RAEMEEMRDSYLEEDVYQLQELRREldranKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHH 437
Cdd:PTZ00121  1591 EARIEEVMKLYEEEKKMKAEEAKKA-----EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAE 1665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  438 ELKTVEEKRAKAEdenETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLqfAKEEAF 517
Cdd:PTZ00121  1666 EAKKAEEDKKKAE---EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE--AKKEAE 1740
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  518 LMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRG--- 594
Cdd:PTZ00121  1741 EDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGnlv 1820
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  595 --LKAEMEDMRGQQ--EREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFvEEEAELLRRSISEIE--DHNRQL 662
Cdd:PTZ00121  1821 inDSKEMEDSAIKEvaDSKNMQLEEADAFEKHKFNKNNENGEDGNKEADF-NKEKDLKEDDEEEIEeaDEIEKI 1893
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
54-313 2.51e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 52.93  E-value: 2.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   54 ARPAAPGPAVPSS-----GRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRSGGVPGAKD-------KPPPGAGARAAG 121
Cdd:PRK07003   361 AVTGGGAPGGGVParvagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATraeappaAPAPPATADRGD 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  122 GAKAALGSRRAARVAPAEPLSRAGkPPGAEPPSAAAKgrKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCS 201
Cdd:PRK07003   441 DAADGDAPVPAKANARASADSRCD-ERDAQPPADSGS--ASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  202 RishtdssSDLSDCPSEPLSDEQRLLPAASSDAESGTG---------------SSDREPPRGAPTPSPAARGAPPGSPEP 266
Cdd:PRK07003   518 R-------EDAPAAAAPPAPEARPPTPAAAAPAARAGGaaaaldvlrnagmrvSSDRGARAAAAAKPAAAPAAAPKPAAP 590
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 2217316421  267 PALLAAPLaagacPGGRSIPSGVSGGFAGPGV-AEDVRGRSPPERPVP 313
Cdd:PRK07003   591 RVAVQVPT-----PRARAATGDAPPNGAARAEqAAESRGAPPPWEDIP 633
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
354-559 3.09e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 52.61  E-value: 3.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  354 LDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGElIRSLEQDLKVAKDVSV 433
Cdd:COG4913    612 LAALEAELAE-----LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAERE-IAELEAELERLDASSD 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  434 rlhhELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAK 513
Cdd:COG4913    686 ----DLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGD 761
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  514 EEAFLMRKKM--------AKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGP 559
Cdd:COG4913    762 AVERELRENLeeridalrARLNRAEEELERAMRAFNREWPAETADLDADLESLP 815
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
350-608 3.53e-06

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 52.05  E-value: 3.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRdsyleedvYQLQELRRELDRANKNCRILQYRLRKAEQkslkvaetgqvdgeLIRSLEQDlkvak 429
Cdd:pfam05557  109 LKNELSELRRQIQRAE--------LELQSTNSELEELQERLDLLKAKASEAEQ--------------LRQNLEKQ----- 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 dvsvrlHHELKTVEEKRAKAEDENETLRQQMIEVEISKQAL------QNELERLKESSLKRRSTRE---MYKEKKtfnqd 500
Cdd:pfam05557  162 ------QSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELaripelEKELERLREHNKHLNENIEnklLLKEEV----- 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  501 dsADLRCQLQfaKEEAflMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPP----STREAELKLRLKLVEE 576
Cdd:pfam05557  231 --EDLKRKLE--REEK--YREEAATLELEKEKLEQELQSWVKLAQDTGLNLRSPEDLSRRieqlQQREIVLKEENSSLTS 304
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2217316421  577 EANILGRKIVELEVENRGLKAEMEDMRGQQER 608
Cdd:pfam05557  305 SARQLEKARRELEQELAQYLKKIEDLNKKLKR 336
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
350-485 3.95e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 52.22  E-value: 3.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRANKncrilqyRLRKAEQKSLKVAetgqvdgELIRSLEQDLKVAK 429
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNGGDRLEQLEREIERLER-------ELEERERRRARLE-------ALLAALGLPLPASA 379
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  430 DVSVRLHHELKtveEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR 485
Cdd:COG4913    380 EEFAALRAEAA---ALLEALEEELEALEEALAEAEAALRDLRRELRELEAeiASLERR 434
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-589 4.37e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 51.86  E-value: 4.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEmeemrdsyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG1196    321 LEEELAELEEE--------LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEAL 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKtfnqddsADLRCQL 509
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEE-------EALLELL 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDvdsplptgEAGGPPSTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    466 AELLEEAALLEAALAELLEELAEAAARLLLLLEAEAD--------YEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYE 537
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
350-831 1.07e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.71  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAK 429
Cdd:COG1196    314 LEERLEELEEELAELEEE-LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEAL 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQL 509
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  510 QFAKEEAflmRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELE 589
Cdd:COG1196    473 ALLEAAL---AELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQ 549
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  590 VENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSisEIEDHNRQLTHELSKF 669
Cdd:COG1196    550 NIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREA--DARYYVLGDTLLGRTL 627
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  670 KFEPPREPGWLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAES 749
Cdd:COG1196    628 VAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEER 707
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  750 DAGKKESDGEESRLPQPKREGPVGGESDSEEMFEktsgfgsgkpSEASEPCPTELLKAREDSEYLVTLKHEAQRLERTVE 829
Cdd:COG1196    708 ELAEAEEERLEEELEEEALEEQLEAEREELLEEL----------LEEEELLEEEALEELPEPPDLEELERELERLEREIE 777

                   ..
gi 2217316421  830 RL 831
Cdd:COG1196    778 AL 779
PTZ00121 PTZ00121
MAEBL; Provisional
351-609 1.41e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.52  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  351 KDELDELRAEMEEMRDSylEEDVYQLQELRRELDRANKNCRILQY--RLRKAEQK----SLKVAETGQVDGELIRSLE-- 422
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADEAKKAAEAKKKadEAKKAEEAkkadEAKKAEEAKKADEAKKAEEkk 1546
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 --QDLKVAKDVsvRLHHELKTVEEKRAKAEDENETLR--------------QQMIEVEISKQALQNELERLKESSLKRRS 486
Cdd:PTZ00121  1547 kaDELKKAEEL--KKAEEKKKAEEAKKAEEDKNMALRkaeeakkaeearieEVMKLYEEEKKMKAEEAKKAEEAKIKAEE 1624
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  487 TREMYKEKKTFNQ---DDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAggppSTR 563
Cdd:PTZ00121  1625 LKKAEEEKKKVEQlkkKEAEEKKKAEELKKAE----EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA----LKK 1696
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 2217316421  564 EAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:PTZ00121  1697 EAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEED 1742
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
338-668 2.56e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 49.58  E-value: 2.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  338 REMEELRSENDYLKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRankncRILQYRLRKAEQKSLKVAETGQVDG-E 416
Cdd:pfam02463  177 KLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEE-----YLLYLDYLKLNEERIDLLQELLRDEqE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  417 LIRSLEQDLKVAKDVSVRLHHEL-------KTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:pfam02463  252 EIESSKQEIEKEEEKLAQVLKENkeeekekKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELK 331
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  490 MYKEKKtfnqddsadlrcqLQFAKEEAFLMRKKmAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKL 569
Cdd:pfam02463  332 KEKEEI-------------EELEKELKELEIKR-EAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL 397
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  570 RLKLVEEeaNILGRKIVELEVENRGLKAEMEDMRGQQEREGpgrdhapsiptsPFGDSLESSTELRRHLQFVEEEAELLR 649
Cdd:pfam02463  398 ELKSEEE--KEAQLLLELARQLEDLLKEEKKEELEILEEEE------------ESIELKQGKLTEEKEELEKQELKLLKD 463
                          330
                   ....*....|....*....
gi 2217316421  650 RSISEIEDHNRQLTHELSK 668
Cdd:pfam02463  464 ELELKKSEDLLKETQLVKL 482
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
346-535 2.60e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 49.35  E-value: 2.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  346 ENDYLKDELDELRAE--MEEMRDSYLEEDVYQLQELRRELDRANKN--CRILQYRLRKAEQKSLKVAETgQVDGELIRSL 421
Cdd:pfam17380  390 KNERVRQELEAARKVkiLEEERQRKIQQQKVEMEQIRAEQEEARQRevRRLEEERAREMERVRLEEQER-QQQVERLRQQ 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKA-EDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQ- 499
Cdd:pfam17380  469 EEERKRKKLELEKEKRDRKRAEEQRRKIlEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEm 548
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2217316421  500 DDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQ 535
Cdd:pfam17380  549 EERRRIQEQMRKATEE----RSRLEAMEREREMMRQ 580
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
41-297 3.12e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 3.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   41 APSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPgsLAAPGRLSRRSGGVPGAKDKPPPGAGARAA 120
Cdd:PHA03307    48 AELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTL--APASPAREGSPTPPGPSSPDPPPPTPPPAS 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  121 GGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAV-TSVAGSPAR 199
Cdd:PHA03307   126 PPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPsTPPAAASPR 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  200 CSRISHTDSSSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPA--------ARGAPPGSPEPPALLA 271
Cdd:PHA03307   206 PPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitlptriWEASGWNGPSSRPGPA 285
                          250       260
                   ....*....|....*....|....*.
gi 2217316421  272 APLAAGACPGGRSIPSGVSGGFAGPG 297
Cdd:PHA03307   286 SSSSSPRERSPSPSPSSPGSGPAPSS 311
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-264 3.76e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 3.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    6 GPAGGGAPDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPFLKDLHARPAAPGP----AVPSSGRAPAPAAPRSPNLA 81
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsAQPTAPPPPPGPPPPSLPLG 2853
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   82 GKAPP------------SPGSLAAPGR----------LSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAE 139
Cdd:PHA03247  2854 GSVAPggdvrrrppsrsPAAKPAAPARppvrrlarpaVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPP 2933
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  140 PLSRAGKPPGAEPPSAAAKGRKAKR-----------GSRAPPARTVGPPTPAARIPAVTlaVTSVAGSPArcSRISHTDS 208
Cdd:PHA03247  2934 PPPPRPQPPLAPTTDPAGAGEPSGAvpqpwlgalvpGRVAVPRFRVPQPAPSREAPASS--TPPLTGHSL--SRVSSWAS 3009
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  209 SSDLS-DCPSEPLSDEQRLLPAASSDAESGTGSSDREPPR---GAPTPSPAARGAPPGSP 264
Cdd:PHA03247  3010 SLALHeETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERsdlEALDPLPPEPHDPFAHE 3069
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
352-591 4.16e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.22  E-value: 4.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  352 DELDELRAEMEEMRDsyleedvyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAKDv 431
Cdd:COG4942     20 DAAAEAEAELEQLQQ--------EIAELEKELAALKKEEKALLKQLAALERRIAALARR-------IRALEQELAALEA- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  432 svrlhhELKTVEEKRAKAEDENETLRQQMIEV--EISKQALQNELERL--KESSLKRRSTREMYKEKKTFNQDDSADLRC 507
Cdd:COG4942     84 ------ELAELEKEIAELRAELEAQKEELAELlrALYRLGRQPPLALLlsPEDFLDAVRRLQYLKYLAPARREQAEELRA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  508 QLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVE 587
Cdd:COG4942    158 DLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEK------ELAELAAELAELQQEAEELEALIAR 231

                   ....
gi 2217316421  588 LEVE 591
Cdd:COG4942    232 LEAE 235
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
375-539 5.24e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 48.21  E-value: 5.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  375 QLQELRRELDRANKNcriLQYRLRKAEQKSLKVAETGQVD----GELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAE 450
Cdd:pfam07111  482 ELEQLREERNRLDAE---LQLSAHLIQQEVGRAREQGEAErqqlSEVAQQLEQELQRAQESLASVGQQLEVARQGQQEST 558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  451 DENETLRQQMI-EVEISKQALQN---ELE-RLKE--SSLKRR---STREMYK---------EKKTFNQDDSADL-RCQLQ 510
Cdd:pfam07111  559 EEAASLRQELTqQQEIYGQALQEkvaEVEtRLREqlSDTKRRlneARREQAKavvslrqiqHRATQEKERNQELrRLQDE 638
                          170       180
                   ....*....|....*....|....*....
gi 2217316421  511 FAKEEAFLMRKKMAKLGREKDELEQELQK 539
Cdd:pfam07111  639 ARKEEGQRLARRVQELERDKNLMLATLQQ 667
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
350-551 5.33e-05

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 46.95  E-value: 5.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVY-------------QLQELRRELDRANKNCRILQYRLRKAEQKS------LKVAE- 409
Cdd:pfam00261    6 IKEELDEAEERLKEAMKKLEEAEKRaekaeaevaalnrRIQLLEEELERTEERLAEALEKLEEAEKAAdesergRKVLEn 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  410 TGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:pfam00261   86 RALKDEEKMEILEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLERAEERAELAESKIVELEEELKVVGNNLKSLEASEE 165
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217316421  490 MYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPL 551
Cdd:pfam00261  166 KASEREDKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELEAEKEKYKAISEEL 227
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
346-895 5.35e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.50  E-value: 5.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  346 ENDYLKDE----LDELRAEMEEMRD-SYLEEDVYQLQELRRelDRANKNCRILQYRLRKAEQKSLKVAE---TGQVDgEL 417
Cdd:TIGR00606  502 EVKSLQNEkadlDRKLRKLDQEMEQlNHHTTTRTQMEMLTK--DKMDKDEQIRKIKSRHSDELTSLLGYfpnKKQLE-DW 578
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  418 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVE------ISKQALQNELERLKESSLKRRSTREM- 490
Cdd:TIGR00606  579 LHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdklfdvCGSQDEESDLERLKEEIEKSSKQRAMl 658
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  491 ----------------------------YKEKKTFNQDdSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKS 542
Cdd:TIGR00606  659 agatavysqfitqltdenqsccpvcqrvFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQS 737
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  543 LYGDVDSPLP-TGEAGGPPSTREAELKLRLklvEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPT 621
Cdd:TIGR00606  738 IIDLKEKEIPeLRNKLQKVNRDIQRLKNDI---EEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAK 814
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  622 SPFGDSLESSTELRrhlQFVEEEAELLRRSISEIE-------DHNRQLTHELSKFKfepprepgwlgegaspgagggapl 694
Cdd:TIGR00606  815 LQGSDLDRTVQQVN---QEKQEKQHELDTVVSKIElnrkliqDQQEQIQHLKSKTN------------------------ 867
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  695 qeELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAESDAGKKE----SDGEESRLPQPKREg 770
Cdd:TIGR00606  868 --ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEelisSKETSNKKAQDKVN- 944
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  771 pvggesDSEEMFEKTSGFGSGKPSEASEPCPTELL-KAREDSEYLVTLKHEAQRLERTVERLITDTDSFlhdaglrggap 849
Cdd:TIGR00606  945 ------DIKEKVKNIHGYMKDIENKIQDGKDDYLKqKETELNTVNAQLEECEKHQEKINEDMRLMRQDI----------- 1007
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....*.
gi 2217316421  850 lpgpglqGEEEQGEGDQQEPQLLGTINAKMKAFKKELQAFLEQVNR 895
Cdd:TIGR00606 1008 -------DTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQ 1046
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
86-218 6.80e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.75  E-value: 6.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGagaraaggakaalgSRRAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRG 165
Cdd:PRK14959   373 PSGGGASAPSGSAAEGPASGGAATIPTPG--------------TQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDD 438
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2217316421  166 SRAPPARTVGPPTPAARIPavtlAVTSVAGSPARCSRISHTDSSSDLSDCPSE 218
Cdd:PRK14959   439 APPAPPRSGIPPRPAPRMP----EASPVPGAPDSVASASDAPPTLGDPSDTAE 487
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
55-261 7.33e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.95  E-value: 7.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   55 RPAAPGPAVPSSGRAPAPAAprspnlAGKAPPSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAAR 134
Cdd:PRK12323   359 RMLAFRPGQSGGGAGPATAA------AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  135 VAPAEPLSRAGKPPGAEPPSAAAkgrkakrgsrAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSS--SDL 212
Cdd:PRK12323   433 LAAARQASARGPGGAPAPAPAPA----------AAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPpwEEL 502
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217316421  213 SDCPSEPLSDEQRLLPA-ASSDAESGTGSSDREPPRGAPTPSPAARGAPP 261
Cdd:PRK12323   503 PPEFASPAPAQPDAAPAgWVAESIPDPATADPDDAFETLAPAPAAAPAPR 552
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
374-551 8.48e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.07  E-value: 8.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  374 YQLQELRRELDRANKNCRILQYRLRKAEQKslkvaetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDEN 453
Cdd:COG1579     10 LDLQELDSELDRLEHRLKELPAELAELEDE--------------LAALEARLEAAKTELEDLEKEIKRLELEIEEVEARI 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  454 ETLRQQMIEVEISK--QALQNELERLKesslKRRSTRE------MYK-EKKtfnQDDSADLRCQLQFAKEEaflMRKKMA 524
Cdd:COG1579     76 KKYEEQLGNVRNNKeyEALQKEIESLK----RRISDLEdeilelMERiEEL---EEELAELEAELAELEAE---LEEKKA 145
                          170       180       190
                   ....*....|....*....|....*....|.
gi 2217316421  525 KLGREKDELEQELQKYKS----LYGDVDSPL 551
Cdd:COG1579    146 ELDEELAELEAELEELEAereeLAAKIPPEL 176
PTZ00121 PTZ00121
MAEBL; Provisional
351-603 9.67e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 47.83  E-value: 9.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  351 KDELDELRaEMEEMRDSyleEDVYQLQELRRELDRANKNCRILQyRLRKAEQK------------------SLKVAETGQ 412
Cdd:PTZ00121  1545 KKKADELK-KAEELKKA---EEKKKAEEAKKAEEDKNMALRKAE-EAKKAEEArieevmklyeeekkmkaeEAKKAEEAK 1619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  413 VDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRaKAEDENETLRQQMI--EVEISKQALQ----NELERLKESSLKRRS 486
Cdd:PTZ00121  1620 IKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELK-KAEEENKIKAAEEAkkAEEDKKKAEEakkaEEDEKKAAEALKKEA 1698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  487 trEMYKEKKTFNQDDSADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLYGDvdsplpTGEaggppSTREAE 566
Cdd:PTZ00121  1699 --EEAKKAEELKKKEAEEKKKAEELKKAE----EENKIKAEEAKKEAEEDKKKAEEAKKD------EEE-----KKKIAH 1761
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 2217316421  567 LKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMR 603
Cdd:PTZ00121  1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
431-662 1.08e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 47.37  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  431 VSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQ 510
Cdd:TIGR02169  668 FSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  511 FAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-----DSPLPTGEAGG----------PPSTREAELKLRLKLVE 575
Cdd:TIGR02169  748 SLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELskleeevsriEARLREIEQKLNRLTLE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  576 EEanILGRKIVELEVENRGLKaEMEDMRGQQEREGPGRdhapsiptspFGDSLESSTELRRHLQFVEEEAELLRRSISEI 655
Cdd:TIGR02169  828 KE--YLEKEIQELQEQRIDLK-EQIKSIEKEIENLNGK----------KEELEEELEELEAALRDLESRLGDLKKERDEL 894

                   ....*..
gi 2217316421  656 EDHNRQL 662
Cdd:TIGR02169  895 EAQLREL 901
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
350-494 1.12e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 45.69  E-value: 1.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDG---ElIRSLEQDLK 426
Cdd:COG1579     29 LPAELAELEDELAALEARLEAAKT-ELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEAlqkE-IESLKRRIS 106
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  427 VAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVeisKQALQNELERLKESSLKRRSTREMYKEK 494
Cdd:COG1579    107 DLEDEILELMERIEELEEELAELEAELAELEAELEEK---KAELDEELAELEAELEELEAEREELAAK 171
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
350-609 1.26e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 46.06  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEM---RDSY---LEEDVYQLQELRRELDRANKNCRILqyrlrKAEQKSLKvAETGQVDGELirsleQ 423
Cdd:COG1340     27 LKEKRDELNEELKELaekRDELnaqVKELREEAQELREKRDELNEKVKEL-----KEERDELN-EKLNELREEL-----D 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  424 DLKVAKDVSVRLHHELKTVEEKRAKAEDENET------LRQQMIEvEISKqaLQNELERLKESSLKRRSTREMYKEKKTF 497
Cdd:COG1340     96 ELRKELAELNKAGGSIDKLRKEIERLEWRQQTevlspeEEKELVE-KIKE--LEKELEKAKKALEKNEKLKELRAELKEL 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  498 nQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppstreAELKLRLKLVEEE 577
Cdd:COG1340    173 -RKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEI--------------------VEAQEKADELHEE 231
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2217316421  578 ANILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:COG1340    232 IIELQKELRELRKELKKLRKKQRALKREKEKE 263
COG5022 COG5022
Myosin heavy chain [General function prediction only];
359-670 1.33e-04

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 47.38  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  359 AEMEEMRDSYLEEDVYQLQE------LRRELDRANKNCRILQ--------YRLRKAEQKSL-------------KVAETG 411
Cdd:COG5022    734 AALEDMRDAKLDNIATRIQRairgryLRRRYLQALKRIKKIQviqhgfrlRRLVDYELKWRlfiklqpllsllgSRKEYR 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  412 QVDgELIRSLEQDLKVAKDVSVRLHHELKTVEE-------------KRAKAEDENETLRQQMIEVEISK---QALQNELE 475
Cdd:COG5022    814 SYL-ACIIKLQKTIKREKKLRETEEVEFSLKAEvliqkfgrslkakKRFSLLKKETIYLQSAQRVELAErqlQELKIDVK 892
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  476 RLKESSLKRRSTREMYKEkktFNQDDSADLRCQLQFaKEEAFLMRKKMAKLGREKDELEQELQKYKSLygdvdsplptge 555
Cdd:COG5022    893 SISSLKLVNLELESEIIE---LKKSLSSDLIENLEF-KTELIARLKKLLNNIDLEEGPSIEYVKLPEL------------ 956
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  556 aggppstreaelklrLKLVEEEANiLGRKIVELEVENRGLKAEMEDMRGQQERegpgrdhapsipTSPFGDSLESSTELR 635
Cdd:COG5022    957 ---------------NKLHEVESK-LKETSEEYEDLLKKSTILVREGNKANSE------------LKNFKKELAELSKQY 1008
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 2217316421  636 RHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 670
Cdd:COG5022   1009 GALQESTKQLKELPVEVAELQSASKIISSESTELS 1043
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
398-656 1.48e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 46.43  E-value: 1.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  398 RKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERL 477
Cdd:COG4372      6 EKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  478 KEsSLKRRstremykekktfnQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeag 557
Cdd:COG4372     86 NE-QLQAA-------------QAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEI------ 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  558 gppSTREAELK-LRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRR 636
Cdd:COG4372    146 ---AEREEELKeLEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLE 222
                          250       260
                   ....*....|....*....|
gi 2217316421  637 HLQFVEEEAELLRRSISEIE 656
Cdd:COG4372    223 AKDSLEAKLGLALSALLDAL 242
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
345-657 1.53e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 47.09  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDEL-RAEME-EMRDSYLEEdvyQLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLE 422
Cdd:pfam01576  384 SENAELQAELRTLqQAKQDsEHKRKKLEG---QLQELQARLSESERQRAELAEKLSKLQ------SELESVSSLLNEAEG 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 QDLKVAKDVSvRLHHELKTV-----EEKRAK---------AEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR- 485
Cdd:pfam01576  455 KNIKLSKDVS-SLESQLQDTqellqEETRQKlnlstrlrqLEDERNSLQEQLEEEEEAKRNVERQLSTLQAqlSDMKKKl 533
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  486 ----STREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-------DSPLPTG 554
Cdd:pfam01576  534 eedaGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLekkqkkfDQMLAEE 613
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  555 EAGGPPSTRE---AELKLRLKlvEEEANILGR-------KIVELEVENRGLKAEMEDMRGQQEREGpgrdhapsiptspf 624
Cdd:pfam01576  614 KAISARYAEErdrAEAEAREK--ETRALSLARaleealeAKEELERTNKQLRAEMEDLVSSKDDVG-------------- 677
                          330       340       350
                   ....*....|....*....|....*....|...
gi 2217316421  625 gdslESSTELRRHLQFVEEEAELLRRSISEIED 657
Cdd:pfam01576  678 ----KNVHELERSKRALEQQVEEMKTQLEELED 706
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
82-326 1.70e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   82 GKAPPSPGSLAAPGRLSRRSGGvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAG-------KPPGAEPPS 154
Cdd:PHA03307    69 TGPPPGPGTEAPANESRSTPTW-SLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPApdlsemlRPVGSPGPP 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  155 AAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAG----SPARCSRISHTDSSSDLSDcPSEPLSDEQRLLPAA 230
Cdd:PHA03307   148 PAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSppaePPPSTPPAAASPRPPRRSS-PISASASSPAPAPGR 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  231 SSDAESGTGSSDREPPRGAPTPSpAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPgvaedvRGRSPPer 310
Cdd:PHA03307   227 SAADDAGASSSDSSSSESSGCGW-GPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSP------RERSPS-- 297
                          250
                   ....*....|....*.
gi 2217316421  311 PVPGTPKEPSLGEQSR 326
Cdd:PHA03307   298 PSPSSPGSGPAPSSPR 313
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
350-479 1.71e-04

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 46.39  E-value: 1.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYL--EEDVYQLQELRRELDRANKNCRILQYRLRKAEQ-------------KSLKVAETGQVD 414
Cdd:pfam06160  303 AEEQNKELKEELERVQQSYTlnENELERVRGLEKQLEELEKRYDEIVERLEEKEVayselqeeleeilEQLEEIEEEQEE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  415 G-ELIRSLEQDLKVAKDVSVRLHHELKTVE---EKR-------------AKAEDENETLRQQMIEV-----EISKQAL-- 470
Cdd:pfam06160  383 FkESLQSLRKDELEAREKLDEFKLELREIKrlvEKSnlpglpesyldyfFDVSDEIEDLADELNEVplnmdEVNRLLDea 462

                   ....*....
gi 2217316421  471 QNELERLKE 479
Cdd:pfam06160  463 QDDVDTLYE 471
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
345-660 1.75e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 46.57  E-value: 1.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDELRAEME---EMRDSyLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSL 421
Cdd:PRK02224   213 SELAELDEEIERYEEQREqarETRDE-ADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEEL 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKktfnqdd 501
Cdd:PRK02224   292 EEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREE------- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  502 SADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELqkykslygdvdsplptgeaGGPPSTREaELKLRLKLVEEEANIL 581
Cdd:PRK02224   365 AAELESELEEAREAVEDRREEIEELEEEIEELRERF-------------------GDAPVDLG-NAEDFLEELREERDEL 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  582 GRKIVELEVENRGLKAEMEDMRGQQErEGPGRDHAPSIPTSPFGDSLESSTELRRHLqfvEEEAELLRRSISEIED-HNR 660
Cdd:PRK02224   425 REREAELEATLRTARERVEEAEALLE-AGKCPECGQPVEGSPHVETIEEDRERVEEL---EAELEDLEEEVEEVEErLER 500
DUF4201 pfam13870
Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. ...
376-539 1.76e-04

Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. The function is not known.


Pssm-ID: 464008 [Multi-domain]  Cd Length: 177  Bit Score: 44.52  E-value: 1.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ----VD-----------GELIRSLEQDLKVAKDVSVRLHHELK 440
Cdd:pfam13870    1 MRAKRNELSKLRLELITLKHTLAKIQEKLEQKEELGEgltmIDflqlqienqalNEKIEERNKELKRLKLKVTNTVHALT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  441 TVEEKRAKAEDENETLRQQMIEveisKQALQNELErlkesslkrrstREMYKEKKTFNQ--DDSADLRCQL--------- 509
Cdd:pfam13870   81 HLKEKLHFLSAELSRLKKELRE----RQELLAKLR------------KELYRVKLERDKlrKQNKKLRQQGgllhvpall 144
                          170       180       190
                   ....*....|....*....|....*....|...
gi 2217316421  510 -QFAKEEAFL--MRKKMAKLGREKDELEQELQK 539
Cdd:pfam13870  145 hDYDKTKAEVeeKRKSVKKLRRKVKILEMRIKE 177
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
350-512 1.94e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.59  E-value: 1.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVA- 428
Cdd:TIGR02168  356 LEAELEELEAELEELESRLEELEE-QLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAe 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  429 -KDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRC 507
Cdd:TIGR02168  435 lKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKN 514

                   ....*
gi 2217316421  508 QLQFA 512
Cdd:TIGR02168  515 QSGLS 519
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
345-543 2.02e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 45.67  E-value: 2.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  345 SENDYLKDELDELRAEMEEMRDSYLE---------EDVYQLQELRRELDRANKNCRILQYRLRKAEQKSlkvaETGQVDG 415
Cdd:COG1340     57 EEAQELREKRDELNEKVKELKEERDElneklnelrEELDELRKELAELNKAGGSIDKLRKEIERLEWRQ----QTEVLSP 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  416 EL-------IRSLEQDLKVAKDvSVRLHHELKTVEEKRAKAEDENETLRQQMIEV--EISK-----QALQNELERLKESS 481
Cdd:COG1340    133 EEekelvekIKELEKELEKAKK-ALEKNEKLKELRAELKELRKEAEEIHKKIKELaeEAQElheemIELYKEADELRKEA 211
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  482 -------LKRRSTREMYKEKKTFNQDDSADLRCQL--QFAKEEAFLMRKKMAKLGREKDELEQELQKYKSL 543
Cdd:COG1340    212 delhkeiVEAQEKADELHEEIIELQKELRELRKELkkLRKKQRALKREKEKEELEEKAEEIFEKLKKGEKL 282
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
12-265 2.34e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 2.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   12 APDAKLQPPGQHHRHHHLHPVAERRRLHRAPSPARPflkdlhARPAAPGPAVPSSGRapapaAPRSPNLAGKAPPSPGSL 91
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAP------AAPAPAGAAAAPAEA-----SAAPAPGVAAPEHHPKHV 657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   92 AAPGRLSRRSGGVPGAKDKPPPgagaraaggakaalgsrrAARVAPAEPLSRAGKPPGAEPPSAAAKGRKAKRGSRAPPA 171
Cdd:PRK07764   658 AVPDASDGGDGWPAKAGGAAPA------------------APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  172 RTVGPPTPAARIPAVTLAVTSVAGSParcsrishtdsssDLSDCPSEPLSDEQrllPAASSDAESGTGSSDREPPRGAPT 251
Cdd:PRK07764   720 QPPQAAQGASAPSPAADDPVPLPPEP-------------DDPPDPAGAPAQPP---PPPAPAPAAAPAAAPPPSPPSEEE 783
                          250
                   ....*....|....
gi 2217316421  252 PSPAARGAPPGSPE 265
Cdd:PRK07764   784 EMAEDDAPSMDDED 797
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
349-544 2.48e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 46.16  E-value: 2.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  349 YLKDELDELRAEMEEMRDsYLEEdvyQLQELRRELDRANKncRILQYRlrkAEQKSLKVAETGQVDGELIRSLEQDLKVA 428
Cdd:COG3206    161 YLEQNLELRREEARKALE-FLEE---QLPELRKELEEAEA--ALEEFR---QKNGLVDLSEEAKLLLQQLSELESQLAEA 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  429 KDVSVRLHHELKTVEEKRAKAEDEN---------ETLRQQMIEVEISK--------------QALQNELERLKESSLKR- 484
Cdd:COG3206    232 RAELAEAEARLAALRAQLGSGPDALpellqspviQQLRAQLAELEAELaelsarytpnhpdvIALRAQIAALRAQLQQEa 311
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  485 RSTREMYKEKKTFNQDDSADLRCQLQFAKEEAflmrKKMAKLGREKDELEQELQKYKSLY 544
Cdd:COG3206    312 QRILASLEAELEALQAREASLQAQLAQLEARL----AELPELEAELRRLEREVEVARELY 367
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
131-264 2.54e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 2.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  131 RAARVAPAEPLSRAGKPPGAEPPSAAAkgrkAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDSSS 210
Cdd:PRK07764   377 RLERLERRLGVAGGAGAPAAAAPSAAA----AAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  211 DLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSP 264
Cdd:PRK07764   453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
144-264 2.61e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 2.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  144 AGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTL-AVTSVAGSPArcSRISHTDSSSDLSDCPSEPLSD 222
Cdd:PRK14951   363 AFKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPaAAASAPAAPP--AAAPPAPVAAPAAAAPAAAPAA 440
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2217316421  223 EQRLLPAASSDAESGTGSSDREPPRGAPTPSPA-ARGAPPGSP 264
Cdd:PRK14951   441 APAAVALAPAPPAQAAPETVAIPVRVAPEPAVAsAAPAPAAAP 483
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
13-319 3.36e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.44  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   13 PDAKLQPPGQHHRHHHLHPVAERrrLHRAPSPARPFLKDLHARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLA 92
Cdd:COG5180    158 SDPILAKDPDGDSASTLPPPAEK--LDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   93 APGRLSRRSGgvPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKPPGAEPPSAA-AKGRKAKRGSRAPPA 171
Cdd:COG5180    236 PPSTSEARSR--PATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAEtARPIDVKGVASAPPA 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  172 RTVGPPTPAAR---IPAVTLAVTSVAGSPARCSRISHTDSSSDlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRG 248
Cdd:COG5180    314 TRPVRPPGGARdpgTPRPGQPTERPAGVPEAASDAGQPPSAYP----PAEEAVPGKPLEQGAPRPGSSGGDGAPFQPPNG 389
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  249 APTPSPaARGAPPGSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAedvrgRSPPERPVPGTPKEP 319
Cdd:COG5180    390 APQPGL-GRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGAGQGPKAD-----FVPGDAESVSGPAGL 454
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
323-662 3.52e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.50  E-value: 3.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  323 EQSRLvpAAEEEELLREMEELRSendyLKDELDELRAEMEEMRDSYLEEDVYQLqELRRELDRankncriLQYRLRKAEQ 402
Cdd:pfam17380  297 EQERL--RQEKEEKAREVERRRK----LEEAEKARQAEMDRQAAIYAEQERMAM-ERERELER-------IRQEERKREL 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  403 KSLKVAETGqVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEI----SKQALQNELERLK 478
Cdd:pfam17380  363 ERIRQEEIA-MEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQiraeQEEARQREVRRLE 441
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  479 EsslKRRSTREMYKEKKTFNQDDSADLRcqlqfaKEEAFLMRKKMAKLGREKDELEQELQKYKSLygdvdsplptgeagg 558
Cdd:pfam17380  442 E---ERAREMERVRLEEQERQQQVERLR------QQEEERKRKKLELEKEKRDRKRAEEQRRKIL--------------- 497
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  559 ppstrEAELKLRLKLVEEEANilGRKIVELEVENRGLKAEMEDMRGQQEREgpGRDHAPSIPTSPFGDSLESSTELRRHL 638
Cdd:pfam17380  498 -----EKELEERKQAMIEEER--KRKLLEKEMEERQKAIYEEERRREAEEE--RRKQQEMEERRRIQEQMRKATEERSRL 568
                          330       340
                   ....*....|....*....|....
gi 2217316421  639 QFVEEEAELLRRsISEIEDHNRQL 662
Cdd:pfam17380  569 EAMEREREMMRQ-IVESEKARAEY 591
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
102-323 3.98e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 3.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  102 GGVPGAKDKPPPGAGARAAGGAKAalgSRRAARVAPAEPlSRAGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAA 181
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEA---ARPAAPAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  182 RIPAVTLAVTSVAGSParcsrishtdsssdlsdcPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPP 261
Cdd:PRK07764   665 GGDGWPAKAGGAAPAA------------------PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ 726
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217316421  262 GSPEPPALLAAPLAAGACPGGRSIPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGE 323
Cdd:PRK07764   727 GASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED 788
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
351-538 4.51e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.42  E-value: 4.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  351 KDELDELRAEMEEMRDSY------LEEDVYQLQELRRELDRANK-------NCRILQYRLRKAEQ--KSLKVAETGQ--- 412
Cdd:PRK02224   383 REEIEELEEEIEELRERFgdapvdLGNAEDFLEELREERDELREreaeleaTLRTARERVEEAEAllEAGKCPECGQpve 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  413 ----VDG-----ELIRSLEQDLKVAKDVSVRLHHELKTVEEKrAKAEDENETLRQQmieVEISKQALQNELERLKESSLK 483
Cdd:PRK02224   463 gsphVETieedrERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEER---REDLEELIAERRETIEEKRER 538
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2217316421  484 RRSTREMYKEKKTfnqdDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQ 538
Cdd:PRK02224   539 AEELRERAAELEA----EAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIE 589
PTZ00121 PTZ00121
MAEBL; Provisional
352-496 4.58e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.52  E-value: 4.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  352 DELDELRAEMEEMRDsylEEDVYQLQELRRELDRANKNCRILQyrlRKAEQKSlKVAETGQVDGELIRSLEQDLKVAKDV 431
Cdd:PTZ00121  1656 EEENKIKAAEEAKKA---EEDKKKAEEAKKAEEDEKKAAEALK---KEAEEAK-KAEELKKKEAEEKKKAEELKKAEEEN 1728
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217316421  432 SVRLHHELKTVEEKRAKAE----DENETLRQQMIEVEISKQALQNELER---LKESSLKRRSTREMYKEKKT 496
Cdd:PTZ00121  1729 KIKAEEAKKEAEEDKKKAEeakkDEEEKKKIAHLKKEEEKKAEEIRKEKeavIEEELDEEDEKRRMEVDKKI 1800
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
149-331 5.77e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 5.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  149 GAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRishtdsssdlsdcpsePLSDEQRLLP 228
Cdd:PRK12323   371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPA----------------RRSPAPEALA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  229 AASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPE------PPALLAAPLAAGAcPGGRSIP--SGVSGGFAGPGVAE 300
Cdd:PRK12323   435 AARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPvaaaaaAAPARAAPAAAPA-PADDDPPpwEELPPEFASPAPAQ 513
                          170       180       190
                   ....*....|....*....|....*....|....
gi 2217316421  301 DVRGRSPPER---PVPGTPKEPSLGEQSRLVPAA 331
Cdd:PRK12323   514 PDAAPAGWVAesiPDPATADPDDAFETLAPAPAA 547
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
130-313 6.70e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 6.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  130 RRAARVAPAEPLSRA-GKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIPAVTLAVTSVAGSPARCSRISHTDS 208
Cdd:PHA03307   742 RRARARASAWDITDAlFSNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKS 821
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  209 SSDLSDCPSEPLSDEQRLLPAASSDAESGTGSSDREPPRGAPTPSPAARGAPPGSPEPPALLAAPLAAGACPGGRSIPSG 288
Cdd:PHA03307   822 RSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAP 901
                          170       180
                   ....*....|....*....|....*
gi 2217316421  289 VSggfAGPGVAEDVRGRSPPERPVP 313
Cdd:PHA03307   902 AP---RPRPAPRVKLGPMPPGGPDP 923
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
348-516 6.92e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.91  E-value: 6.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  348 DYLKDELDELRAEMEEMRDSY-----LEEDVYQLQ----ELRRELDRANKNCRILQYRLRKAEQ-----KSLKVAETGQV 413
Cdd:COG4913    664 ASAEREIAELEAELERLDASSddlaaLEEQLEELEaeleELEEELDELKGEIGRLEKELEQAEEeldelQDRLEAAEDLA 743
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  414 DGELIRSLEQDL------KVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQ--------MIEVEISKQAL---QNELER 476
Cdd:COG4913    744 RLELRALLEERFaaalgdAVERELRENLEERIDALRARLNRAEEELERAMRAfnrewpaeTADLDADLESLpeyLALLDR 823
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 2217316421  477 LKESSLKRRsTREMYKEKKTFNQDDSADLRCQLQFAKEEA 516
Cdd:COG4913    824 LEEDGLPEY-EERFKELLNENSIEFVADLLSKLRRAIREI 862
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
416-627 8.14e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.98  E-value: 8.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  416 ELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLK-------------ESSL 482
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEaelaelekeiaelRAEL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  483 KRRST------REMYKEKKT------FNQDDSAD-------LRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSL 543
Cdd:COG4942    100 EAQKEelaellRALYRLGRQpplallLSPEDFLDavrrlqyLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  544 YGDVdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER-EGPGRDHAPSIPTS 622
Cdd:COG4942    180 LAEL-------------EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARlEAEAAAAAERTPAA 246

                   ....*
gi 2217316421  623 PFGDS 627
Cdd:COG4942    247 GFAAL 251
PRK01156 PRK01156
chromosome segregation protein; Provisional
348-714 8.44e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 44.51  E-value: 8.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  348 DYLKDELDELRAEMEEMrdSYLEEDVY----QLQELRRELDRANKNCRILqyrLRKAEQKSlkvAETGQVDGELI--RSL 421
Cdd:PRK01156   169 DKLKDVIDMLRAEISNI--DYLEEKLKssnlELENIKKQIADDEKSHSIT---LKEIERLS---IEYNNAMDDYNnlKSA 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKDVSVRLHHELKTVEEKRAKAEDENetlrqqmieVEISkqALQNELERLKESSL--KRRSTREMYKEKKTFNQ 499
Cdd:PRK01156   241 LNELSSLEDMKNRYESEIKTAESDLSMELEKN---------NYYK--ELEERHMKIINDPVykNRNYINDYFKYKNDIEN 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  500 ---------------DDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDS--------------- 549
Cdd:PRK01156   310 kkqilsnidaeinkyHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSYLKSIESlkkkieeysknierm 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  550 PLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGpGRDHAPSIPTSpFGDslE 629
Cdd:PRK01156   390 SAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLN-GQSVCPVCGTT-LGE--E 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  630 SSTELRRH----LQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREpgwlgegASPGAGGGAPLQEELKSARLQI 705
Cdd:PRK01156   466 KSNHIINHynekKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLESEEINK-------SINEYNKIESARADLEDIKIKI 538

                   ....*....
gi 2217316421  706 SELSGKVLK 714
Cdd:PRK01156   539 NELKDKHDK 547
BAR cd07307
The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects ...
376-587 8.67e-04

The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization module that binds membranes and detects membrane curvature; BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. Mutations in BAR containing proteins have been linked to diseases and their inactivation in cells leads to altered membrane dynamics. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysins and endophilins, among others. BAR domains are also frequently found alongside domains that determine lipid specificity, such as the Pleckstrin Homology (PH) and Phox Homology (PX) domains which are present in beta centaurins (ACAPs and ASAPs) and sorting nexins, respectively. A FES-CIP4 Homology (FCH) domain together with a coiled coil region is called the F-BAR domain and is present in Pombe/Cdc15 homology (PCH) family proteins, which include Fes/Fes tyrosine kinases, PACSIN or syndapin, CIP4-like proteins, and srGAPs, among others. The Inverse (I)-BAR or IRSp53/MIM homology Domain (IMD) is found in multi-domain proteins, such as IRSp53 and MIM, that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. The I-BAR domain induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. BAR domains that also serve as protein interaction domains include those of arfaptin and OPHN1-like proteins, among others, which bind to Rac and Rho GAP domains, respectively.


Pssm-ID: 153271 [Multi-domain]  Cd Length: 194  Bit Score: 42.43  E-value: 8.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  376 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAEtgqvdgelirSLEqdlKVAKDVSVRLHHELKTVEEKRAKAEDENET 455
Cdd:cd07307      2 LDELEKLLKKLIKDTKKLLDSLKELPAAAEKLSE----------ALQ---ELGKELPDLSNTDLGEALEKFGKIQKELEE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  456 LRQQMIE------VEISKQALQNELERLKESSLKRRSTREMYkekktfnqdDSAdlrcqlqFAKEEAflMRKKMAKLGRE 529
Cdd:cd07307     69 FRDQLEQklenkvIEPLKEYLKKDLKEIKKRRKKLDKARLDY---------DAA-------REKLKK--LRKKKKDSSKL 130
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  530 KdELEQELQKYKSLYGDVdsplptgeaggppstrEAELKLRLKLVEEEANILGRKIVE 587
Cdd:cd07307    131 A-EAEEELQEAKEKYEEL----------------REELIEDLNKLEEKRKELFLSLLL 171
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
338-672 9.41e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 44.28  E-value: 9.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  338 REMEELRSENDYLKDELDELRAEMEEMRDS---------YLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVA 408
Cdd:PRK03918   262 RELEERIEELKKEIEELEEKVKELKELKEKaeeyiklseFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE 341
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  409 ETgqvdGELIRSLEQDLKVAKDvSVRLHHELKTVEE-----KRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SS 481
Cdd:PRK03918   342 EL----KKKLKELEKRLEELEE-RHELYEEAKAKKEelerlKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITAriGE 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  482 LKRRSTRemykEKKTFNQDDSADLRCQLQFA----KEEAFLMRK---KMAKLGREKDELEQELQKYKSLYGDVDSPLptg 554
Cdd:PRK03918   417 LKKEIKE----LKKAIEELKKAKGKCPVCGRelteEHRKELLEEytaELKRIEKELKEIEEKERKLRKELRELEKVL--- 489
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  555 eAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVEN-RGLKAEMEDMRGQQERegpgrdhapsiptspFGDSLESSTE 633
Cdd:PRK03918   490 -KKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEyEKLKEKLIKLKGEIKS---------------LKKELEKLEE 553
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 2217316421  634 LrrhlqfvEEEAELLRRSISEIEDHNRQLTHELSKFKFE 672
Cdd:PRK03918   554 L-------KKKLAELEKKLDELEEELAELLKELEELGFE 585
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
344-671 9.84e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.24  E-value: 9.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  344 RSENDYLKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDranKNCRILQ--YRLRKAEQKSLKVAETGQvdGELIRSL 421
Cdd:TIGR04523  376 KKENQSYKQEIKNLESQIND-----LESKIQNQEKLNQQKD---EQIKKLQqeKELLEKEIERLKETIIKN--NSEIKDL 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  422 EQDLKVAKdvsvrlhhelKTVEEKRAKAEDENETLRQQMIEVEISKQALQN---ELERlKESSLKrrstreMYKEKKTfn 498
Cdd:TIGR04523  446 TNQDSVKE----------LIIKNLDNTRESLETQLKVLSRSINKIKQNLEQkqkELKS-KEKELK------KLNEEKK-- 506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  499 qddsaDLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELqkyKSLYGDVDsplptgeaggppstreaELKLRLK--LVEE 576
Cdd:TIGR04523  507 -----ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKI---SDLEDELN-----------------KDDFELKkeNLEK 561
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  577 EANILGRKIVELEVENRGLKA---EMEDMRGQQEREgpgrdhapsiptspfgdslesSTELRRHLQFVEEEAELLRRSIS 653
Cdd:TIGR04523  562 EIDEKNKEIEELKQTQKSLKKkqeEKQELIDQKEKE---------------------KKDLIKEIEEKEKKISSLEKELE 620
                          330
                   ....*....|....*...
gi 2217316421  654 EIEDHNRQLTHELSKFKF 671
Cdd:TIGR04523  621 KAKKENEKLSSIIKNIKS 638
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
392-769 1.14e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 44.28  E-value: 1.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  392 ILQYRLRK--AEQKslkvaetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENEtLRQQMIEVEIS--- 466
Cdd:TIGR02168  167 ISKYKERRkeTERK--------------LERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELRELELAllv 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  467 --KQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLY 544
Cdd:TIGR02168  232 lrLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  545 GDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEregpgrdhapsiptspf 624
Cdd:TIGR02168  312 ANLERQLEELEA------QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELE----------------- 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  625 gdslesstELRRHLQFVEEEAELLRRSISEiedhnrqlthelskfkfepprepgwlgegaspgagggapLQEELKSARLQ 704
Cdd:TIGR02168  369 --------ELESRLEELEEQLETLRSKVAQ---------------------------------------LELQIASLNNE 401
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217316421  705 ISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSdAESDAGKKESDGEESRLPQPKRE 769
Cdd:TIGR02168  402 IERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAEL-EELEEELEELQEELERLEEALEE 465
Rootletin pfam15035
Ciliary rootlet component, centrosome cohesion;
391-493 1.27e-03

Ciliary rootlet component, centrosome cohesion;


Pssm-ID: 464459 [Multi-domain]  Cd Length: 190  Bit Score: 41.95  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  391 RILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDL-----KVAKDVSVRLHHELKTVEEKRAKAED---ENETLRQQMIE 462
Cdd:pfam15035   24 KVLQYKKRCSELEQQLLEKTSELEKTELLLRKLTLeprlqRLEREHSADLEEALIRLEEERQRSESlsqVNSLLREQLEQ 103
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2217316421  463 VEISKQALQNELERL-------------KESSLKRRstREMYKE 493
Cdd:pfam15035  104 ASRANEALREDLQKLtndwerareeleqKESEWRKE--EEAFNE 145
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
499-728 1.29e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.21  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  499 QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVdsplptgeaggppSTREAELKLRLKLVEEEA 578
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL-------------ARRIRALEQELAALEAEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  579 NILGRKIVELEVENRGLKAEMEDMRGQQEREgpGRDHAPSIPTSP--FGDSLESSTELRRHLQFVEEEAELLRRSISEIE 656
Cdd:COG4942     86 AELEKEIAELRAELEAQKEELAELLRALYRL--GRQPPLALLLSPedFLDAVRRLQYLKYLAPARREQAEELRADLAELA 163
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217316421  657 DHNRQLTHELSKfkfepprepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQR 728
Cdd:COG4942    164 ALRAELEAERAE-----------LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
340-607 1.33e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.60  E-value: 1.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  340 MEELRSENDYLKDELDELRAEMEEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKsLKVAETGQVDGELIR 419
Cdd:COG4717    165 LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEE-LEQLENELEAAALEE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  420 SLEQDLKVAKDVSVRL------------------------------------HHELKTVEEKRAKAEDENETLRQQMIEV 463
Cdd:COG4717    244 RLKEARLLLLIAAALLallglggsllsliltiagvlflvlgllallflllarEKASLGKEAEELQALPALEELEEEELEE 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  464 EISKQALQNELERLKESSLKRRSTRemYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKM------AKLGREKDELEQEL 537
Cdd:COG4717    324 LLAALGLPPDLSPEELLELLDRIEE--LQELLREAEELEEELQLEELEQEIAALLAEAGVedeeelRAALEQAEEYQELK 401
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  538 QKYKSLYGDVDSPLPTGEAGGPPSTREaELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQE 607
Cdd:COG4717    402 EELEELEEQLEELLGELEELLEALDEE-ELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGE 470
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
392-609 1.33e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.60  E-value: 1.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  392 ILQYRLRKAEQKSLKV-AETGQVDGELIRSLEQDLKvakdvsvrlhhELKTVEEKRAKAEDENETLRQQMIEVEISKQAL 470
Cdd:COG4717     46 MLLERLEKEADELFKPqGRKPELNLKELKELEEELK-----------EAEEKEEEYAELQEELEELEEELEELEAELEEL 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  471 QNELERLKesslKRRSTREMYKEKKTFNQDDsADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLygdvdsp 550
Cdd:COG4717    115 REELEKLE----KLLQLLPLYQELEALEAEL-AELPERLEELEER----LEELRELEEELEELEAELAELQEE------- 178
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  551 LPTGEAGGPPSTREA--ELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQERE 609
Cdd:COG4717    179 LEELLEQLSLATEEElqDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAA 239
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
110-262 1.46e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  110 KPPPGAGaraaggakaalgsrrAARVAPAEPLSRAGKP-PGAEPPSAAAKGRKAKrgsRAPPARTVGPPTPAARI-PAVT 187
Cdd:PRK14951   365 KPAAAAE---------------AAAPAEKKTPARPEAAaPAAAPVAQAAAAPAPA---AAPAAAASAPAAPPAAApPAPV 426
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217316421  188 LAVTSVAGSPARcsriSHTDSSSDLSDCPSEPLSDEQRLLPAAssdAESGTGSSDREPPRGAPTPSPAARGAPPG 262
Cdd:PRK14951   427 AAPAAAAPAAAP----AAAPAAVALAPAPPAQAAPETVAIPVR---VAPEPAVASAAPAPAAAPAAARLTPTEEG 494
KASH_CCD pfam14662
Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of ...
418-598 1.50e-03

Coiled-coil region of CCDC155 or KASH; This coiled-coil region is found in the central part of KASH or Klarsicht/ANC-1/Syne/homology proteins. KASH are a meiosis-specific proteins that localize at telomeres and interact with SUN1, thus being implicated in meiotic chromosome dynamics and homolog pairing.


Pssm-ID: 405365 [Multi-domain]  Cd Length: 191  Bit Score: 41.70  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  418 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMievEISKQALQNELE--------RLKESSLKRRSTRE 489
Cdd:pfam14662   10 VEDLQANNQKLLQENSKLKATVETREETNAKLLEENLNLRKQA---KSQQQAVQKEKLleeeledlKLIVNSLEEARRSL 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  490 MYKEK-KTFNQDDSA----DLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDvdsplptgeaggppstRE 564
Cdd:pfam14662   87 LAQNKqLEKENQSLLqeieSLQEENKKNQAERDKLQKKKKELLKSKACLKEQLHSCEDLACN----------------RE 150
                          170       180       190
                   ....*....|....*....|....*....|....
gi 2217316421  565 AELKLRLKLVEEEAnilgRKIVELEVENRGLKAE 598
Cdd:pfam14662  151 TILIEKTTQIEELK----STVEEYSSIEEELRAE 180
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
351-619 1.97e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 43.50  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  351 KDELDELR---AEMEEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLkv 427
Cdd:TIGR00606  244 ENELDPLKnrlKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKEREL-- 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  428 akdvsVRLHHELKtveekraKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE------------------ 489
Cdd:TIGR00606  322 -----VDCQRELE-------KLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQslatrleldgfergpfse 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  490 -------------MYKEKKTFNQdDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEa 556
Cdd:TIGR00606  390 rqiknfhtlvierQEDEAKTAAQ-LCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLE- 467
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  557 GGPPSTREAELKLR-----LKLVEEEANILGRKIVELEVEN------RGLKAEMEDMRgQQEREGPGRDHAPSI 619
Cdd:TIGR00606  468 GSSDRILELDQELRkaereLSKAEKNSLTETLKKEVKSLQNekadldRKLRKLDQEME-QLNHHTTTRTQMEML 540
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
434-601 1.97e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.83  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  434 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 511
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  512 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 2217316421  592 NRGLKAEMED 601
Cdd:COG1579    165 REELAAKIPP 174
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
350-487 2.09e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 43.08  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKncrilqyRLRKAEQKSLKVAETgqvDGELIRSLEQDLKvak 429
Cdd:COG3206    268 LRAQLAELEAELAELSARYTPNHP-DVIALRAQIAALRA-------QLQQEAQRILASLEA---ELEALQAREASLQ--- 333
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217316421  430 dvsvrlhhelKTVEEKRAKAEDENETLRQQMI---EVEISKQALQNELERLKESSLKRRST 487
Cdd:COG3206    334 ----------AQLAQLEARLAELPELEAELRRlerEVEVARELYESLLQRLEEARLAEALT 384
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
347-600 2.12e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 43.29  E-value: 2.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  347 NDYLKDELDELRAEMEEMRDSY--LEEDVYQ----LQELRRELDRAN---KNCRILQYRLrKAEQKSLKVAETGQVD--- 414
Cdd:pfam12128  599 EEELRERLDKAEEALQSAREKQaaAEEQLVQangeLEKASREETFARtalKNARLDLRRL-FDEKQSEKDKKNKALAerk 677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  415 ---GELIRSLEQDLKVakdvsvrLHHELKTVEEKRAKAEDENETLRQQMIEVEISkqALQNELERLKESSLKRRSTREmy 491
Cdd:pfam12128  678 dsaNERLNSLEAQLKQ-------LDKKHQAWLEEQKEQKREARTEKQAYWQVVEG--ALDAQLALLKAAIAARRSGAK-- 746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  492 KEKKTFNQDDSADLRCQ-------LQFAKEEAFLMRKKMAKLGREKDELE----------QELQKYKSLYGDVDSPLPtg 554
Cdd:pfam12128  747 AELKALETWYKRDLASLgvdpdviAKLKREIRTLERKIERIAVRRQEVLRyfdwyqetwlQRRPRLATQLSNIERAIS-- 824
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 2217316421  555 EAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEME 600
Cdd:pfam12128  825 ELQQQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMS 870
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
31-156 2.53e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.78  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   31 PVAERRRLHRaPSPARPFLKDLHARPAAPGPAVPSSGrapapaaprspnlAGKAPPSPGSLAAPGRLSRRSGGVPGAKDK 110
Cdd:PRK14951   374 APAEKKTPAR-PEAAAPAAAPVAQAAAAPAPAAAPAA-------------AASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2217316421  111 PPPGAGARAAGGAKAALGSRRA--ARVAPAEPLSRAGKPPGAEPPSAA 156
Cdd:PRK14951   440 AAPAAVALAPAPPAQAAPETVAipVRVAPEPAVASAAPAPAAAPAAAR 487
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
12-185 2.59e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.94  E-value: 2.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   12 APDAKLQPPGQHHRHHHLHPV--AERRRLHRAPSPARPFLKDL----HARPAAPGPAVPSSGRAPAPAAPRSPNLAGKAP 85
Cdd:PRK12323   391 APAAAAPAPAAPPAAPAAAPAaaAAARAVAAAPARRSPAPEALaaarQASARGPGGAPAPAPAPAAAPAAAARPAAAGPR 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   86 PSPGSLAAPGRLSRRSGGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRagkPPGAEPPSAAAKGRKAKRG 165
Cdd:PRK12323   471 PVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPD---PATADPDDAFETLAPAPAA 547
                          170       180
                   ....*....|....*....|
gi 2217316421  166 SRAPPARTVGPPTPAARIPA 185
Cdd:PRK12323   548 APAPRAAAATEPVVAPRPPR 567
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-311 3.63e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 3.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421    7 PAGGGAPD------AKLQPPGQHHRHHHLHPVAERRRLHRAPsparpflkdLHARPAAPGPAVPSSGRAPAPAAPRSPNL 80
Cdd:PHA03247   206 PSGPGPAApadltaAALHLYGASETYLQDEPFVERRVVISHP---------LRGDIAAPAPPPVVGEGADRAPETARGAT 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   81 AGKAPPSPGSLAAPGRLSRRS-----GGVPGAKDKPPPGAGARAAGGAKAALGSRRAARVAPAEPLSRAGKP-------- 147
Cdd:PHA03247   277 GPPPPPEAAAPNGAAAPPDGVwgaalAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAMEVVSPLPRPRQHYPlgfpkrrr 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  148 PGAEPPSAA---AKGRKAKRgsRAPPARTVGPPTPAARIPavtlavtsVAGSPArcsrishtdssSDLSDCPSEPlsdeq 224
Cdd:PHA03247   357 PTWTPPSSLedlSAGRHHPK--RASLPTRKRRSARHAATP--------FARGPG-----------GDDQTRPAAP----- 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  225 rlLPAASSDAESGTGSSDREPPRGAPTPSP-AARGAPPGSPeppallaaplAAGACPGGRSIPSGVSGGFAGPGVAEDVR 303
Cdd:PHA03247   411 --VPASVPTPAPTPVPASAPPPPATPLPSAePGSDDGPAPP----------PERQPPAPATEPAPDDPDDATRKALDALR 478

                   ....*...
gi 2217316421  304 GRSPPERP 311
Cdd:PHA03247   479 ERRPPEPP 486
EzrA COG4477
Septation ring formation regulator EzrA [Cell cycle control, cell division, chromosome ...
350-541 3.88e-03

Septation ring formation regulator EzrA [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443574 [Multi-domain]  Cd Length: 567  Bit Score: 42.14  E-value: 3.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMR-DSY------LEEdvyQLQELRRELDRANKNcrILQYRLRKAEQKsLKVAETgQVDgELIRSLE 422
Cdd:COG4477    227 LPDQLEELKSGYREMKeQGYvlehlnIEK---EIEQLEEQLKEALEL--LEELDLDEAEEE-LEEIEE-EID-ELYDLLE 298
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 QDLKVAKDVsvrlHHELKTVEEKRAKAEDENETLRqqmIEVEISKQALQ---NELERLKE-----SSLKRR---STREMY 491
Cdd:COG4477    299 KEVEAKKYV----DKNQEELEEYLEHLKEQNRELK---EEIDRVQQSYRlneNELEKVRNlekqiEELEKRydeIDERIE 371
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2217316421  492 KEKKTFN--QDDSADLRCQLQFAKEEAFLMRKKMAKLgrEKDELE--QELQKYK 541
Cdd:COG4477    372 EEKVAYSelQEELEEIEEQLEEIEEEQEEFSEKLKSL--RKDELEarEKLDELK 423
PHA03378 PHA03378
EBNA-3B; Provisional
36-184 4.07e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.36  E-value: 4.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   36 RRLHRAPSPARPflkdlhaRPAAPGPAVPSSGRAPAPAAPrspnlAGKAPPSPGSLAAPGRLsRRSGGVPGAKdkPPPGA 115
Cdd:PHA03378   685 LPIQWAPGTMQP-------PPRAPTPMRPPAAPPGRAQRP-----AAATGRARPPAAAPGRA-RPPAAAPGRA--RPPAA 749
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217316421  116 GARAAGGAKAALGSRRAARVAPaeplsraGKPPGAEPPSAAAKGRKAKRGSRAPPARTVGPPTPAARIP 184
Cdd:PHA03378   750 APGRARPPAAAPGRARPPAAAP-------GAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP 811
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
355-561 4.77e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 41.92  E-value: 4.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  355 DELRAEMEEMRDSYLEEdvyQLQELRRELDRA---------NKNCRILQ---YRLRKAEQKSlKVAETGQVDGELIRSLE 422
Cdd:NF033838    53 NESQKEHAKEVESHLEK---ILSEIQKSLDKRkhtqnvalnKKLSDIKTeylYELNVLKEKS-EAELTSKTKKELDAAFE 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 QDLKVAKDVSVRLHHELKTVEEKRAKAEDENE-------TLRQQMIEVEISKQALQnelerLKESSLkrrstrEMYKEKK 495
Cdd:NF033838   129 QFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEedrrnypTNTYKTLELEIAESDVE-----VKKAEL------ELVKEEA 197
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217316421  496 TFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPS 561
Cdd:NF033838   198 KEPRDEEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKR 263
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
350-516 5.02e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 41.35  E-value: 5.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVyQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQdLKVAK 429
Cdd:COG3883     35 AQAELDALQAELEELNEEYNELQA-ELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLDV-LLGSE 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  430 DVS-------------------VRLHHELK-TVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTRE 489
Cdd:COG3883    113 SFSdfldrlsalskiadadadlLEELKADKaELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEA 192
                          170       180
                   ....*....|....*....|....*..
gi 2217316421  490 MYKEKKTFNQDDSADLRCQLQFAKEEA 516
Cdd:COG3883    193 AAEAQLAELEAELAAAEAAAAAAAAAA 219
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
350-542 6.30e-03

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 41.38  E-value: 6.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLE-EDVY--QLQELR---RELDRANkncrilqYRLrkaeqKSLKVAET-GQVDGELIRSLE 422
Cdd:pfam06160  184 LEEETDALEELMEDIPPLYEElKTELpdQLEELKegyREMEEEG-------YAL-----EHLNVDKEiQQLEEQLEENLA 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  423 Q----DLKVAKDVSVRLHHELKTV-----EEKRAKAE-DEN-ETLRQQMIEVEISKQALQNELERLKES----------- 480
Cdd:pfam06160  252 LlenlELDEAEEALEEIEERIDQLydlleKEVDAKKYvEKNlPEIEDYLEHAEEQNKELKEELERVQQSytlnenelerv 331
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217316421  481 --------SLKRR---STREMYKEKKTFN--QDDSADLRCQLQFAKEEAFLMRKKMAKLgrEKDELE--QELQKYKS 542
Cdd:pfam06160  332 rglekqleELEKRydeIVERLEEKEVAYSelQEELEEILEQLEEIEEEQEEFKESLQSL--RKDELEarEKLDEFKL 406
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
350-543 7.86e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.20  E-value: 7.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEMRDSYLEEDVYQLQELRR------ELDRANKNCRILQYRLRKAEQKslkvaetgqvdgelirsLEQ 423
Cdd:PRK03918   568 LEEELAELLKELEELGFESVEELEERLKELEPfyneylELKDAEKELEREEKELKKLEEE-----------------LDK 630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  424 DLKVAKDVSVRLHHELKTVEEKRAK-AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTfnqdds 502
Cdd:PRK03918   631 AFEELAETEKRLEELRKELEELEKKySEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELE------ 704
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 2217316421  503 adlrcQLQFAKEEAFLMRKKMAKLgrekDELEQELQKYKSL 543
Cdd:PRK03918   705 -----EREKAKKELEKLEKALERV----EELREKVKKYKAL 736
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
56-329 7.97e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 7.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421   56 PAAPGPAVPSSGRAPAPAAPRSPNLAGKAPPSPGSLAAPGRLSRRS---------GGVP---GAKDKPPPGAGARAAGGA 123
Cdd:pfam17823  115 LAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAaiaaasaphAASPaprTAASSTTAASSTTAASSA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  124 KAALGSRRAARVAPAEPLSRAGKPPGAepPSAAAKgrKAKRGSRAPPART----VGPPTPA--ARIPAVTLAVTSVAGSP 197
Cdd:pfam17823  195 PTTAASSAPATLTPARGISTAATATGH--PAAGTA--LAAVGNSSPAAGTvtaaVGTVTPAalATLAAAAGTVASAAGTI 270
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  198 ARCSRISHTDSSSdlSDCPSEPLSDEqrllPAASSDAES-GTGS--SDREP---PRGAPTPSPAARGAPPGSPEPPALLA 271
Cdd:pfam17823  271 NMGDPHARRLSPA--KHMPSDTMARN----PAAPMGAQAqGPIIqvSTDQPvhnTAGEPTPSPSNTTLEPNTPKSVASTN 344
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2217316421  272 APLAAGACPGGRSiPSGVSGGFAGPGVAEDVRGRSPPERPVPGTPKEPSLGEQSRLVP 329
Cdd:pfam17823  345 LAVVTTTKAQAKE-PSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
438-711 8.21e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 41.03  E-value: 8.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  438 ELKTVEEKRAKAEDENETL----RQQMIEVEISK-QALQNELERLKESSLKRRST-REMYKEKKTFNQDDSADLRCQLQF 511
Cdd:pfam07888    5 ELVTLEEESHGEEGGTDMLlvvpRAELLQNRLEEcLQERAELLQAQEAANRQREKeKERYKRDREQWERQRRELESRVAE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  512 AKEEaflmrkkmakLGREKDELEQELQKYKSLygdVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVE 591
Cdd:pfam07888   85 LKEE----------LRQSREKHEELEEKYKEL---SASSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETE 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  592 NRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTE---LRRHLQFVEEEAELLRRSISEIED-----HNRQLT 663
Cdd:pfam07888  152 LERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEfqeLRNSLAQRDTQVLQLQDTITTLTQklttaHRKEAE 231
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2217316421  664 HELSKFKFEPPRE--------PGWLGEGASPGAGGGAPLQEELKSARLQISELSGK 711
Cdd:pfam07888  232 NEALLEELRSLQErlnaserkVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQ 287
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
350-538 8.32e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.21  E-value: 8.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  350 LKDELDELRAEMEEmrdsyLEEDVYQLQELRRELDRANKNCR-ILQYRLRKAEQKSLKVAETGQVDGELIRSLEQ----- 423
Cdd:TIGR02169  327 LEAEIDKLLAEIEE-----LEREIEEERKRRDKLTEEYAELKeELEDLRAELEEVDKEFAETRDELKDYREKLEKlkrei 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  424 -DLKVAKDvsvRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERlKESSLKRrstremykekktfNQDDS 502
Cdd:TIGR02169  402 nELKRELD---RLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKK-QEWKLEQ-------------LAADL 464
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 2217316421  503 ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQ 538
Cdd:TIGR02169  465 SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQAR 500
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
418-539 8.43e-03

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 40.01  E-value: 8.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  418 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETL--RQQMIEVEI--SKQALQNELERLKESSLKRRSTREMYK- 492
Cdd:pfam00261    3 MQQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALnrRIQLLEEELerTEERLAEALEKLEEAEKAADESERGRKv 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2217316421  493 -EKKTFNQDDS-ADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQK 539
Cdd:pfam00261   83 lENRALKDEEKmEILEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLER 131
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
369-603 8.54e-03

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 40.67  E-value: 8.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  369 LEEDVYQLQELRRELDRAnKNCRILQYRLrkAEQKSLKVAETgqvdgelirslEQDLKVAKDVSVRLHHELKTVEEKRAK 448
Cdd:pfam13868    1 LRENSDELRELNSKLLAA-KCNKERDAQI--AEKKRIKAEEK-----------EEERRLDEMMEEERERALEEEEEKEEE 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  449 AEDENETLRQqmievEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSAD----------LRCQLQFAKEEAFL 518
Cdd:pfam13868   67 RKEERKRYRQ-----ELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQEEDQAEaeeklekqrqLREEIDEFNEEQAE 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  519 MRKKMAKLGREKDELEQELQKYKSLygdvdsplptgeaggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAE 598
Cdd:pfam13868  142 WKELEKEEEREEDERILEYLKEKAE-------------------REEEREAEREEIEEEKEREIARLRAQQEKAQDEKAE 202

                   ....*
gi 2217316421  599 MEDMR 603
Cdd:pfam13868  203 RDELR 207
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
443-662 9.39e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.05  E-value: 9.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  443 EEKRAKAEDENETLRQQMIEVEISKQALQNELERLKEsslkRRSTREMYKEKkTFNQDDSADLRcqlqfakeeaflmrkk 522
Cdd:COG4913    609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQE----RREALQRLAEY-SWDEIDVASAE---------------- 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  523 maklgREKDELEQELQKYKSLYGDVdsplptgeaggppstreAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDM 602
Cdd:COG4913    668 -----REIAELEAELERLDASSDDL-----------------AALEEQLEELEAELEELEEELDELKGEIGRLEKELEQA 725
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217316421  603 RGQQEREGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIEDHNRQL 662
Cdd:COG4913    726 EEELDELQDRLEAAEDLARLELRALLE---ERFAAALGDAVERELRENLEERIDALRARL 782
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH