NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034663586|ref|XP_016869701|]
View 

structural maintenance of chromosomes protein 2 isoform X2 [Homo sapiens]

Protein Classification

chromosome segregation protein SMC( domain architecture ID 12035156)

chromosome segregation protein SMC is an ATPase required for chromosome condensation and partitioning

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
2-1117 4.59e-136

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


:

Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 442.49  E-value: 4.59e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    2 HIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGISNLS-------------------------- 55
Cdd:pfam02463    1 YLKRIEIEGFKSYAKTVILP-FSPGFTAIVGPNGSGKSNILDAILFVLGERSAKslrserlsdlihsksgafvnsaevei 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   56 --------------------QVVIGGRNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPEILSMI 115
Cdd:pfam02463   80 tfdnedhelpidkeevsirrRVYRGGDSEYYINGKNVTKKEVAELLESQGISPEAYNFLVQGGKIEIIAMMKPERRLEIE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  116 EEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDT 195
Cdd:pfam02463  160 EEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  196 KVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKKNLA 275
Cdd:pfam02463  240 DLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  276 CEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEATLAGQMMAC 355
Cdd:pfam02463  320 EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  356 KNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQL 435
Cdd:pfam02463  400 KSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQL 479
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  436 SRDIGRLKETYEALLARFPNLRFAYKDPEKNWNRNCVKGLVASLISVKDTSATTALELVAGERLYNVVVDTEVTGKKLLE 515
Cdd:pfam02463  480 VKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADE 559
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  516 RGELKRRYTIIPLNKISARCIAPETLRVAQNLVGPDNVHVALSLVEYKPeLQKAMEFVFGTTFVCDNMDNAKKVAFDKRI 595
Cdd:pfam02463  560 VEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQL-DKATLEADEDDKRAKVVEGILKDTELTKLK 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  596 MTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEE 675
Cdd:pfam02463  639 ESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLE 718
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  676 ADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKA 755
Cdd:pfam02463  719 AEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKA 798
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  756 DASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQ 835
Cdd:pfam02463  799 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEE 878
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  836 DTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYDFKTNNPKEA 915
Cdd:pfam02463  879 LEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEE 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  916 GQRLQKLQEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKDFGSIF 995
Cdd:pfam02463  959 EERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFF 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  996 STLLPGANAMLAPPEGQTVLDGLEFKVALGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQ 1075
Cdd:pfam02463 1039 YLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEKTLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVS 1118
                         1130      1140      1150      1160
                   ....*....|....*....|....*....|....*....|...
gi 1034663586 1076 NIGQMLRTHFTHSQFIVVSLKEGMFNNANVLFKTKFV-DGVST 1117
Cdd:pfam02463 1119 RVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVeNGVST 1161
 
Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
2-1117 4.59e-136

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 442.49  E-value: 4.59e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    2 HIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGISNLS-------------------------- 55
Cdd:pfam02463    1 YLKRIEIEGFKSYAKTVILP-FSPGFTAIVGPNGSGKSNILDAILFVLGERSAKslrserlsdlihsksgafvnsaevei 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   56 --------------------QVVIGGRNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPEILSMI 115
Cdd:pfam02463   80 tfdnedhelpidkeevsirrRVYRGGDSEYYINGKNVTKKEVAELLESQGISPEAYNFLVQGGKIEIIAMMKPERRLEIE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  116 EEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDT 195
Cdd:pfam02463  160 EEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  196 KVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKKNLA 275
Cdd:pfam02463  240 DLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  276 CEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEATLAGQMMAC 355
Cdd:pfam02463  320 EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  356 KNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQL 435
Cdd:pfam02463  400 KSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQL 479
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  436 SRDIGRLKETYEALLARFPNLRFAYKDPEKNWNRNCVKGLVASLISVKDTSATTALELVAGERLYNVVVDTEVTGKKLLE 515
Cdd:pfam02463  480 VKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADE 559
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  516 RGELKRRYTIIPLNKISARCIAPETLRVAQNLVGPDNVHVALSLVEYKPeLQKAMEFVFGTTFVCDNMDNAKKVAFDKRI 595
Cdd:pfam02463  560 VEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQL-DKATLEADEDDKRAKVVEGILKDTELTKLK 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  596 MTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEE 675
Cdd:pfam02463  639 ESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLE 718
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  676 ADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKA 755
Cdd:pfam02463  719 AEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKA 798
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  756 DASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQ 835
Cdd:pfam02463  799 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEE 878
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  836 DTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYDFKTNNPKEA 915
Cdd:pfam02463  879 LEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEE 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  916 GQRLQKLQEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKDFGSIF 995
Cdd:pfam02463  959 EERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFF 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  996 STLLPGANAMLAPPEGQTVLDGLEFKVALGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQ 1075
Cdd:pfam02463 1039 YLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEKTLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVS 1118
                         1130      1140      1150      1160
                   ....*....|....*....|....*....|....*....|...
gi 1034663586 1076 NIGQMLRTHFTHSQFIVVSLKEGMFNNANVLFKTKFV-DGVST 1117
Cdd:pfam02463 1119 RVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVeNGVST 1161
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
3-1094 3.29e-109

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 369.39  E-value: 3.29e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG---------------ISN-------------- 53
Cdd:TIGR02168    2 LKKLELAGFKSFADPTTIN-FDKGITGIVGPNGCGKSNIVDAIRWVLGeqsakalrggkmedvIFNgsetrkplslaeve 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   54 --------------LSQVVI------GGRNKYLINGVNANNTRVQDLFCSVGLNVNNpHFLIMQGRITKVLNMKPPEILS 113
Cdd:TIGR02168   81 lvfdnsdgllpgadYSEISItrrlyrDGESEYFINGQPCRLKDIQDLFLDTGLGKRS-YSIIEQGKISEIIEAKPEERRA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  114 MIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEeITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAE 193
Cdd:TIGR02168  160 IFEEAAGISKYKERRKETERKLERTRENLDRLEDILNE-LERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELR 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  194 DTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGI--------------------LRSLEDAL 253
Cdd:TIGR02168  239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELyalaneisrleqqkqilrerLANLERQL 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  254 AEAQRVNTKSQSAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAV 333
Cdd:TIGR02168  319 EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASL 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  334 SAGLSSNEDGAEAtLAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEvkkmdsgyrKDQEALEAVKRLKEKLEA 413
Cdd:TIGR02168  399 NNEIERLEARLER-LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELE---------ELQEELERLEEALEELRE 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  414 EMKKlnyeenKEESLLEKRRQLSRDIGRLKETyEALLARFPNLRFAYKDPEKN-WNRNCVKGLVASLISVkDTSATTALE 492
Cdd:TIGR02168  469 ELEE------AEQALDAAERELAQLQARLDSL-ERLQENLEGFSEGVKALLKNqSGLSGILGVLSELISV-DEGYEAAIE 540
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  493 LVAGERLYNVVVDTEVTGKK---LLERGELKRRyTIIPLNKISARCIAPETLRVAQNlvGPDNVHVALSLVEYKPELQKA 569
Cdd:TIGR02168  541 AALGGRLQAVVVENLNAAKKaiaFLKQNELGRV-TFLPLDSIKGTEIQGNDREILKN--IEGFLGVAKDLVKFDPKLRKA 617
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  570 MEFVFGTTFVCDNMDNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALE 649
Cdd:TIGR02168  618 LSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELE 697
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  650 EELAGLKntaekyrqlKQQWEMKTEEADLLQTKLQQS-SYHKQQEELDALKKTIEESEETLKNT-KEIQRKAEEKYEVLE 727
Cdd:TIGR02168  698 KALAELR---------KELEELEEELEQLRKELEELSrQISALRKDLARLEAEVEQLEERIAQLsKELTELEAEIEELEE 768
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  728 NKMKNAE--AERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIE 805
Cdd:TIGR02168  769 RLEEAEEelAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  806 VMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKML 885
Cdd:TIGR02168  849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE 928
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  886 KDYDWINAER-HLFGQPNSAY-----------DFKTNNPKEAGQRLQKLQEMKEKLGrNVNMRAMNVLTEAEERYNDLMK 953
Cdd:TIGR02168  929 LRLEGLEVRIdNLQERLSEEYsltleeaealeNKIEDDEEEARRRLKRLENKIKELG-PVNLAAIEEYEELKERYDFLTA 1007
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  954 KKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKDFGSIFSTLLPGANAMLAPPEGQTVLD-GLEFKVALGNTWKENL 1032
Cdd:TIGR02168 1008 QKEDLTEAKETLEEAIEEIDREARERFKDTFDQVNENFQRVFPKLFGGGEAELRLTDPEDLLEaGIEIFAQPPGKKNQNL 1087
                         1130      1140      1150      1160      1170      1180
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034663586 1033 TELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVS 1094
Cdd:TIGR02168 1088 SLLSGGEKALTALALLFAIFKVKPAPFCILDEVDAPLDDANVERFANLLKEFSKNTQFIVIT 1149
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1-1107 2.58e-70

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 254.86  E-value: 2.58e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG---ISNL----------------------- 54
Cdd:COG1196      1 MRLKRLELAGFKSFADPTTIP-FEPGITAIVGPNGSGKSNIVDAIRWVLGeqsAKSLrggkmedvifagsssrkplgrae 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   55 ----------------SQVVI------GGRNKYLINGVNANNTRVQDLFCSVGLNVNNpHFLIMQGRITKVLNMKPPEIL 112
Cdd:COG1196     80 vsltfdnsdgtlpidyDEVTItrrlyrSGESEYYINGKPCRLKDIQDLFLDTGLGPES-YSIIGQGMIDRIIEAKPEERR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  113 SMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEeITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLA 192
Cdd:COG1196    159 AIIEEAAGISKYKERKEEAERKLEATEENLERLEDILGE-LERQLEPLERQAEKAERYRELKEELKELEAELLLLKLREL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  193 EDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRkDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKK 272
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE-LEEAQAEEYELLAELARLEQDIARLEERRRELEE 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  273 NLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDgAEATLAGQM 352
Cdd:COG1196    317 RLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAE-ELLEALRAA 395
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  353 MACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLN----YEENKEESL 428
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLellaELLEEAALL 475
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  429 LEKRRQLSRDIGRLKETYEALL---ARFPNLRFAYKDPEKNWNRNCVKGLVASLISVkDTSATTALELVAGERLYNVVVD 505
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLeaeADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV-EAAYEAALEAALAAALQNIVVE 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  506 TEVTGKKLLERGELKR--RYTIIPLNKISARciAPETLRVAQNLVGPDNVHVALSLVEYKPELQKAMEFVFGTTFVCDNM 583
Cdd:COG1196    555 DDEVAAAAIEYLKAAKagRATFLPLDKIRAR--AALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARL 632
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  584 DNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASiltkfqelkdvqdELRIKENELRALEEELAglkntaekyr 663
Cdd:COG1196    633 EAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLA-------------ALLEAEAELEELAERLA---------- 689
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  664 qlkqqwemkteeadllqtklqqssyhKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKD 743
Cdd:COG1196    690 --------------------------EEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLE 743
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  744 AQKKLDcaktkadasskkmkEKQQEVEAITLELEELKRehtsykqqleavneaiksyesqievmaaevaknkesvnkaqe 823
Cdd:COG1196    744 EEELLE--------------EEALEELPEPPDLEELER------------------------------------------ 767
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  824 evtkqkevitaqdtvikakyaevakhkeqnndsqlkikeldhniskhkreaedgaakvskmlkdydwinaerhlfgqpns 903
Cdd:COG1196        --------------------------------------------------------------------------------
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  904 aydfktnnpkeagqRLQKLQEMKEKLGrNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIA 983
Cdd:COG1196    768 --------------ELERLEREIEALG-PVNLLAIEEYEELEERYDFLSEQREDLEEARETLEEAIEEIDRETRERFLET 832
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  984 WQKVNKDFGSIFSTLLPGANAMLAPPEGQTVLD-GLEFKVALGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYIL 1062
Cdd:COG1196    833 FDAVNENFQELFPRLFGGGEAELLLTDPDDPLEtGIEIMAQPPGKKLQRLSLLSGGEKALTALALLFAIFRLNPSPFCVL 912
                         1130      1140      1150      1160
                   ....*....|....*....|....*....|....*....|....*
gi 1034663586 1063 DEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNANVLF 1107
Cdd:COG1196    913 DEVDAPLDDANVERFAELLKEMSEDTQFIVITHNKRTMEAADRLY 957
ABC_SMC2_euk cd03273
ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of ...
1024-1118 1.39e-63

ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213240 [Multi-domain]  Cd Length: 251  Bit Score: 216.40  E-value: 1.39e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1024 LGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNA 1103
Cdd:cd03273    156 MGGVWKESLTELSGGQRSLVALSLILALLLFKPAPMYILDEVDAALDLSHTQNIGRMIKTHFKGSQFIVVSLKEGMFNNA 235
                           90
                   ....*....|....*
gi 1034663586 1104 NVLFKTKFVDGVSTV 1118
Cdd:cd03273    236 NVLFRTRFVDGTSTV 250
SMC_hinge smart00968
SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC ...
472-589 2.15e-33

SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 214944 [Multi-domain]  Cd Length: 120  Bit Score: 125.04  E-value: 2.15e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   472 VKGLVASLISVKDtSATTALELVAGERLYNVVVDTEVTGKKLLERGELKR--RYTIIPLNKISARCIAPETLRvAQNLVG 549
Cdd:smart00968    3 VLGRVADLISVDP-KYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRlgRATFLPLDKIKPRSPAGSKLR-EALLPE 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|
gi 1034663586   550 PDNVHVALSLVEYKPELQKAMEFVFGTTFVCDNMDNAKKV 589
Cdd:smart00968   81 PGFVGPAIDLVEYDPELRPALEYLLGNTLVVDDLETARRL 120
PTZ00121 PTZ00121
MAEBL; Provisional
115-911 1.51e-16

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 85.58  E-value: 1.51e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  115 IEEAagTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQKVMREIEhlsrlyiayqfllaed 194
Cdd:PTZ00121  1214 AEEA--RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE---------------- 1275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  195 tKVRSAEELKEMQDKviKLQEELSENDKKIKAlnheiEELEKRKDKEtggilRSLEDALAEAQRVNTKSQSAfdlKKKnl 274
Cdd:PTZ00121  1276 -EARKADELKKAEEK--KKADEAKKAEEKKKA-----DEAKKKAEEA-----KKADEAKKKAEEAKKKADAA---KKK-- 1337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  275 aCEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHAlQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEAtlagqmma 354
Cdd:PTZ00121  1338 -AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK-EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA-------- 1407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  355 ckNDISKAQTEAKQAQmklkhaqqELKNKQAEVKKMDSGYRKDQE---ALEAVKRLKEKLEAEMKKLNYEENKEESLLEK 431
Cdd:PTZ00121  1408 --DELKKAAAAKKKAD--------EAKKKAEEKKKADEAKKKAEEakkADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  432 RRQLSRDIGRLKETYEALLARFPNLRFAYKDPEKnwnrncvkglvaslisvkdtsattALELVAGErlynvvvdtEVTGK 511
Cdd:PTZ00121  1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK------------------------ADEAKKAE---------EAKKA 1524
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  512 KLLERGELKRRytiiplnkisarciaPETLRVAQNLVGPDNVHVAlslveykPELQKAMEfvfgttfvCDNMDNAKKVAF 591
Cdd:PTZ00121  1525 DEAKKAEEAKK---------------ADEAKKAEEKKKADELKKA-------EELKKAEE--------KKKAEEAKKAEE 1574
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  592 DKRIMTRTVTLGGDV----FDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQ 667
Cdd:PTZ00121  1575 DKNMALRKAEEAKKAeearIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  668 QWEMKTEEADLLQTKLQQSSyhKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAE----AERERELK- 742
Cdd:PTZ00121  1655 AEEENKIKAAEEAKKAEEDK--KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEelkkAEEENKIKa 1732
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  743 -DAQKKLDCAKTKADASSKKMKEK----------QQEVEAITLELEELKREHTSYK--QQLEAVNEAIKSYESQIEVMAA 809
Cdd:PTZ00121  1733 eEAKKEAEEDKKKAEEAKKDEEEKkkiahlkkeeEKKAEEIRKEKEAVIEEELDEEdeKRRMEVDKKIKDIFDNFANIIE 1812
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  810 EVAKNKESVNKAQE-EVTKQKEVITAQDTVIKaKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKV--SKMLK 886
Cdd:PTZ00121  1813 GGKEGNLVINDSKEmEDSAIKEVADSKNMQLE-EADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIeeADEIE 1891
                          810       820
                   ....*....|....*....|....*
gi 1034663586  887 DYDWINAERHLfgqPNSAYDFKTNN 911
Cdd:PTZ00121  1892 KIDKDDIEREI---PNNNMAGKNND 1913
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
630-862 3.20e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.00  E-value: 3.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  630 ELKDVQDELRIKENELRALEEELAGLKNTaEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESEETL 709
Cdd:NF033838   175 ELEIAESDVEVKKAELELVKEEAKEPRDE-EKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNV 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  710 KNTKEIQRKAEEKYEVL---------ENKMKNAEA------------ERERELKDAQKKLDCAKTKADASSKKMKEKQQE 768
Cdd:NF033838   254 ATSEQDKPKRRAKRGVLgepatpdkkENDAKSSDSsvgeetlpspslKPEKKVAEAEKKVEEAKKKAKDQKEEDRRNYPT 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  769 VEAITLELEELKREHTSYKQQLEAVNEAIKSyesqievmaaevAKNKESVNKAQEEVTKQKEVITAQDTVI--KAKYAEV 846
Cdd:NF033838   334 NTYKTLELEIAESDVKVKEAELELVKEEAKE------------PRNEEKIKQAKAKVESKKAEATRLEKIKtdRKKAEEE 401
                          250
                   ....*....|....*.
gi 1034663586  847 AKHKEQNNDsqlKIKE 862
Cdd:NF033838   402 AKRKAAEED---KVKE 414
 
Name Accession Description Interval E-value
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
2-1117 4.59e-136

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 442.49  E-value: 4.59e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    2 HIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGISNLS-------------------------- 55
Cdd:pfam02463    1 YLKRIEIEGFKSYAKTVILP-FSPGFTAIVGPNGSGKSNILDAILFVLGERSAKslrserlsdlihsksgafvnsaevei 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   56 --------------------QVVIGGRNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPEILSMI 115
Cdd:pfam02463   80 tfdnedhelpidkeevsirrRVYRGGDSEYYINGKNVTKKEVAELLESQGISPEAYNFLVQGGKIEIIAMMKPERRLEIE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  116 EEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDT 195
Cdd:pfam02463  160 EEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  196 KVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKKNLA 275
Cdd:pfam02463  240 DLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  276 CEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEATLAGQMMAC 355
Cdd:pfam02463  320 EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELEL 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  356 KNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQL 435
Cdd:pfam02463  400 KSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQL 479
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  436 SRDIGRLKETYEALLARFPNLRFAYKDPEKNWNRNCVKGLVASLISVKDTSATTALELVAGERLYNVVVDTEVTGKKLLE 515
Cdd:pfam02463  480 VKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADE 559
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  516 RGELKRRYTIIPLNKISARCIAPETLRVAQNLVGPDNVHVALSLVEYKPeLQKAMEFVFGTTFVCDNMDNAKKVAFDKRI 595
Cdd:pfam02463  560 VEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQL-DKATLEADEDDKRAKVVEGILKDTELTKLK 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  596 MTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEE 675
Cdd:pfam02463  639 ESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLE 718
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  676 ADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKA 755
Cdd:pfam02463  719 AEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKA 798
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  756 DASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQ 835
Cdd:pfam02463  799 QEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEE 878
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  836 DTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYDFKTNNPKEA 915
Cdd:pfam02463  879 LEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEE 958
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  916 GQRLQKLQEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKDFGSIF 995
Cdd:pfam02463  959 EERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLELFVSINKGWNKVFF 1038
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  996 STLLPGANAMLAPPEGQTVLDGLEFKVALGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQ 1075
Cdd:pfam02463 1039 YLELGGSAELRLEDPDDPFSGGIEISARPPGKGVKNLDLLSGGEKTLVALALIFAIQKYKPAPFYLLDEIDAALDDQNVS 1118
                         1130      1140      1150      1160
                   ....*....|....*....|....*....|....*....|...
gi 1034663586 1076 NIGQMLRTHFTHSQFIVVSLKEGMFNNANVLFKTKFV-DGVST 1117
Cdd:pfam02463 1119 RVANLLKELSKNAQFIVISLREEMLEKADKLVGVTMVeNGVST 1161
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
3-1094 3.29e-109

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 369.39  E-value: 3.29e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG---------------ISN-------------- 53
Cdd:TIGR02168    2 LKKLELAGFKSFADPTTIN-FDKGITGIVGPNGCGKSNIVDAIRWVLGeqsakalrggkmedvIFNgsetrkplslaeve 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   54 --------------LSQVVI------GGRNKYLINGVNANNTRVQDLFCSVGLNVNNpHFLIMQGRITKVLNMKPPEILS 113
Cdd:TIGR02168   81 lvfdnsdgllpgadYSEISItrrlyrDGESEYFINGQPCRLKDIQDLFLDTGLGKRS-YSIIEQGKISEIIEAKPEERRA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  114 MIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEeITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAE 193
Cdd:TIGR02168  160 IFEEAAGISKYKERRKETERKLERTRENLDRLEDILNE-LERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELR 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  194 DTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGI--------------------LRSLEDAL 253
Cdd:TIGR02168  239 EELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELyalaneisrleqqkqilrerLANLERQL 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  254 AEAQRVNTKSQSAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAV 333
Cdd:TIGR02168  319 EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASL 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  334 SAGLSSNEDGAEAtLAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEvkkmdsgyrKDQEALEAVKRLKEKLEA 413
Cdd:TIGR02168  399 NNEIERLEARLER-LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELE---------ELQEELERLEEALEELRE 468
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  414 EMKKlnyeenKEESLLEKRRQLSRDIGRLKETyEALLARFPNLRFAYKDPEKN-WNRNCVKGLVASLISVkDTSATTALE 492
Cdd:TIGR02168  469 ELEE------AEQALDAAERELAQLQARLDSL-ERLQENLEGFSEGVKALLKNqSGLSGILGVLSELISV-DEGYEAAIE 540
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  493 LVAGERLYNVVVDTEVTGKK---LLERGELKRRyTIIPLNKISARCIAPETLRVAQNlvGPDNVHVALSLVEYKPELQKA 569
Cdd:TIGR02168  541 AALGGRLQAVVVENLNAAKKaiaFLKQNELGRV-TFLPLDSIKGTEIQGNDREILKN--IEGFLGVAKDLVKFDPKLRKA 617
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  570 MEFVFGTTFVCDNMDNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALE 649
Cdd:TIGR02168  618 LSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELE 697
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  650 EELAGLKntaekyrqlKQQWEMKTEEADLLQTKLQQS-SYHKQQEELDALKKTIEESEETLKNT-KEIQRKAEEKYEVLE 727
Cdd:TIGR02168  698 KALAELR---------KELEELEEELEQLRKELEELSrQISALRKDLARLEAEVEQLEERIAQLsKELTELEAEIEELEE 768
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  728 NKMKNAE--AERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIE 805
Cdd:TIGR02168  769 RLEEAEEelAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIE 848
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  806 VMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKML 885
Cdd:TIGR02168  849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE 928
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  886 KDYDWINAER-HLFGQPNSAY-----------DFKTNNPKEAGQRLQKLQEMKEKLGrNVNMRAMNVLTEAEERYNDLMK 953
Cdd:TIGR02168  929 LRLEGLEVRIdNLQERLSEEYsltleeaealeNKIEDDEEEARRRLKRLENKIKELG-PVNLAAIEEYEELKERYDFLTA 1007
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  954 KKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKDFGSIFSTLLPGANAMLAPPEGQTVLD-GLEFKVALGNTWKENL 1032
Cdd:TIGR02168 1008 QKEDLTEAKETLEEAIEEIDREARERFKDTFDQVNENFQRVFPKLFGGGEAELRLTDPEDLLEaGIEIFAQPPGKKNQNL 1087
                         1130      1140      1150      1160      1170      1180
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034663586 1033 TELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVS 1094
Cdd:TIGR02168 1088 SLLSGGEKALTALALLFAIFKVKPAPFCILDEVDAPLDDANVERFANLLKEFSKNTQFIVIT 1149
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2-1118 1.85e-103

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 353.22  E-value: 1.85e-103
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    2 HIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGIS--------NLSQVVIGGRNK--------- 64
Cdd:TIGR02169    1 YIERIELENFKSFGKKKVIP-FSKGFTVISGPNGSGKSNIGDAILFALGLSsskamraeRLSDLISNGKNGqsgneayvt 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   65 ------------------------------YLINGVNANNTRVQDLFCSVGLNVNNPHFlIMQGRITKVLNMKPPEILSM 114
Cdd:TIGR02169   80 vtfknddgkfpdelevvrrlkvtddgkysyYYLNGQRVRLSEIHDFLAAAGIYPEGYNV-VLQGDVTDFISMSPVERRKI 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  115 IEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITpTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAED 194
Cdd:TIGR02169  159 IDEIAGVAEFDRKKEKALEELEEVEENIERLDLIIDEKRQ-QLERLRREREKAERYQALLKEKREYEGYELLKEKEALER 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  195 TKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKKNL 274
Cdd:TIGR02169  238 QKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKEREL 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  275 -------ACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEAt 347
Cdd:TIGR02169  318 edaeerlAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEK- 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  348 LAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEA----VKRLKEKLEAEMKKLNYEEN 423
Cdd:TIGR02169  397 LKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKqewkLEQLAADLSKYEQELYDLKE 476
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  424 KEESLLEKRRQLSRDIGRLKETYEALLARFPNLRFAYKdpEKNWNRNCVKGLVASLISVKDTSATtALELVAGERLYNVV 503
Cdd:TIGR02169  477 EYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEE--VLKASIQGVHGTVAQLGSVGERYAT-AIEVAAGNRLNNVV 553
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  504 VDTEVTGKKLLERgeLKR----RYTIIPLNKISARCIAPETLRVaqnlvgPDNVHVALSLVEYKPELQKAMEFVFGTTFV 579
Cdd:TIGR02169  554 VEDDAVAKEAIEL--LKRrkagRATFLPLNKMRDERRDLSILSE------DGVIGFAVDLVEFDPKYEPAFKYVFGDTLV 625
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  580 CDNMDNAKkvafdkRIM--TRTVTLGGDVFDPHGTLSGGARSQAASILTKFQ---ELKDVQDELRIKENELRALEEELAG 654
Cdd:TIGR02169  626 VEDIEAAR------RLMgkYRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSepaELQRLRERLEGLKRELSSLQSELRR 699
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  655 LKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSyHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLE---NKMK 731
Cdd:TIGR02169  700 IENRLDELSQELSDASRKIGEIEKEIEQLEQEE-EKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEedlHKLE 778
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  732 NAEAERERELKDAQKKldcaktKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEavnEAIKSYESQIEvmaaEV 811
Cdd:TIGR02169  779 EALNDLEARLSHSRIP------EIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLE---KEIQELQEQRI----DL 845
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  812 AKNKESVNKAQEEVTKQKEVItaqDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWI 891
Cdd:TIGR02169  846 KEQIKSIEKEIENLNGKKEEL---EEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSEL 922
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  892 NAERHLFGQPNSAYDFKTNNPKE---AGQRLQKLQEMKEKLGRN------VNMRAMNVLTEAEERYNDLMKKKRIVENDK 962
Cdd:TIGR02169  923 KAKLEALEEELSEIEDPKGEDEEipeEELSLEDVQAELQRVEEEiralepVNMLAIQEYEEVLKRLDELKEKRAKLEEER 1002
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  963 SKILTTIEDLDQKKNQALNIAWQKVNKDFGSIFSTLLPGANAMLAPPEGQTVLDGLEFKVALGNTWKENLTELSGGQRSL 1042
Cdd:TIGR02169 1003 KAILERIEEYEKKKREVFMEAFEAINENFNEIFAELSGGTGELILENPDDPFAGGLELSAKPKGKPVQRLEAMSGGEKSL 1082
                         1130      1140      1150      1160      1170      1180      1190
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586 1043 VALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNA-NVLFKTKFVDGVSTV 1118
Cdd:TIGR02169 1083 TALSFIFAIQRYKPSPFYAFDEVDMFLDGVNVERVAKLIREKAGEAQFIVVSLRSPMIEYAdRAIGVTMRRNGESQV 1159
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1-1107 2.58e-70

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 254.86  E-value: 2.58e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG---ISNL----------------------- 54
Cdd:COG1196      1 MRLKRLELAGFKSFADPTTIP-FEPGITAIVGPNGSGKSNIVDAIRWVLGeqsAKSLrggkmedvifagsssrkplgrae 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   55 ----------------SQVVI------GGRNKYLINGVNANNTRVQDLFCSVGLNVNNpHFLIMQGRITKVLNMKPPEIL 112
Cdd:COG1196     80 vsltfdnsdgtlpidyDEVTItrrlyrSGESEYYINGKPCRLKDIQDLFLDTGLGPES-YSIIGQGMIDRIIEAKPEERR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  113 SMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEeITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLA 192
Cdd:COG1196    159 AIIEEAAGISKYKERKEEAERKLEATEENLERLEDILGE-LERQLEPLERQAEKAERYRELKEELKELEAELLLLKLREL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  193 EDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRkDKETGGILRSLEDALAEAQRVNTKSQSAFDLKKK 272
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE-LEEAQAEEYELLAELARLEQDIARLEERRRELEE 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  273 NLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDgAEATLAGQM 352
Cdd:COG1196    317 RLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAE-ELLEALRAA 395
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  353 MACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLN----YEENKEESL 428
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLellaELLEEAALL 475
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  429 LEKRRQLSRDIGRLKETYEALL---ARFPNLRFAYKDPEKNWNRNCVKGLVASLISVkDTSATTALELVAGERLYNVVVD 505
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLeaeADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV-EAAYEAALEAALAAALQNIVVE 554
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  506 TEVTGKKLLERGELKR--RYTIIPLNKISARciAPETLRVAQNLVGPDNVHVALSLVEYKPELQKAMEFVFGTTFVCDNM 583
Cdd:COG1196    555 DDEVAAAAIEYLKAAKagRATFLPLDKIRAR--AALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARL 632
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  584 DNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASiltkfqelkdvqdELRIKENELRALEEELAglkntaekyr 663
Cdd:COG1196    633 EAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLA-------------ALLEAEAELEELAERLA---------- 689
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  664 qlkqqwemkteeadllqtklqqssyhKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKD 743
Cdd:COG1196    690 --------------------------EEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLE 743
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  744 AQKKLDcaktkadasskkmkEKQQEVEAITLELEELKRehtsykqqleavneaiksyesqievmaaevaknkesvnkaqe 823
Cdd:COG1196    744 EEELLE--------------EEALEELPEPPDLEELER------------------------------------------ 767
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  824 evtkqkevitaqdtvikakyaevakhkeqnndsqlkikeldhniskhkreaedgaakvskmlkdydwinaerhlfgqpns 903
Cdd:COG1196        --------------------------------------------------------------------------------
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  904 aydfktnnpkeagqRLQKLQEMKEKLGrNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIA 983
Cdd:COG1196    768 --------------ELERLEREIEALG-PVNLLAIEEYEELEERYDFLSEQREDLEEARETLEEAIEEIDRETRERFLET 832
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  984 WQKVNKDFGSIFSTLLPGANAMLAPPEGQTVLD-GLEFKVALGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYIL 1062
Cdd:COG1196    833 FDAVNENFQELFPRLFGGGEAELLLTDPDDPLEtGIEIMAQPPGKKLQRLSLLSGGEKALTALALLFAIFRLNPSPFCVL 912
                         1130      1140      1150      1160
                   ....*....|....*....|....*....|....*....|....*
gi 1034663586 1063 DEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNANVLF 1107
Cdd:COG1196    913 DEVDAPLDDANVERFAELLKEMSEDTQFIVITHNKRTMEAADRLY 957
ABC_SMC2_euk cd03273
ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of ...
1024-1118 1.39e-63

ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213240 [Multi-domain]  Cd Length: 251  Bit Score: 216.40  E-value: 1.39e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1024 LGNTWKENLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNA 1103
Cdd:cd03273    156 MGGVWKESLTELSGGQRSLVALSLILALLLFKPAPMYILDEVDAALDLSHTQNIGRMIKTHFKGSQFIVVSLKEGMFNNA 235
                           90
                   ....*....|....*
gi 1034663586 1104 NVLFKTKFVDGVSTV 1118
Cdd:cd03273    236 NVLFRTRFVDGTSTV 250
ABC_SMC2_euk cd03273
ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of ...
1-122 3.91e-61

ATP-binding cassette domain of eukaryotic SMC2 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213240 [Multi-domain]  Cd Length: 251  Bit Score: 209.46  E-value: 3.91e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQRTEVNGFDPLFNAITGLNGSGKSNILDSICFLLGISNLS------------------------- 55
Cdd:cd03273      1 MHIKEIILDGFKSYATRTVISGFDPQFNAITGLNGSGKSNILDAICFVLGITNLStvrasnlqdliykrgqagitkasvt 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   56 -------------------------QVVIGGRNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPE 110
Cdd:cd03273     81 ivfdnsdksqspigfenypeitvtrQIVLGGTNKYLINGHRAQQQRVQDLFQSVQLNVNNPHFLIMQGRITKVLNMGGVW 160
                          170
                   ....*....|..
gi 1034663586  111 ILSMIEEAAGTR 122
Cdd:cd03273    161 KESLTELSGGQR 172
SMC_hinge smart00968
SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC ...
472-589 2.15e-33

SMC proteins Flexible Hinge Domain; This entry represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 214944 [Multi-domain]  Cd Length: 120  Bit Score: 125.04  E-value: 2.15e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   472 VKGLVASLISVKDtSATTALELVAGERLYNVVVDTEVTGKKLLERGELKR--RYTIIPLNKISARCIAPETLRvAQNLVG 549
Cdd:smart00968    3 VLGRVADLISVDP-KYETALEAALGGRLQAVVVDTEETAKKAIEFLKKNRlgRATFLPLDKIKPRSPAGSKLR-EALLPE 80
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|
gi 1034663586   550 PDNVHVALSLVEYKPELQKAMEFVFGTTFVCDNMDNAKKV 589
Cdd:smart00968   81 PGFVGPAIDLVEYDPELRPALEYLLGNTLVVDDLETARRL 120
ABC_SMC_head cd03239
The SMC head domain belongs to the ATP-binding cassette superfamily; The structural ...
1035-1117 4.46e-31

The SMC head domain belongs to the ATP-binding cassette superfamily; The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression.


Pssm-ID: 213206 [Multi-domain]  Cd Length: 178  Bit Score: 120.49  E-value: 4.46e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1035 LSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTH-SQFIVVSLKEGMFNNANVLFKTKFVD 1113
Cdd:cd03239     95 LSGGEKSLSALALIFALQEIKPSPFYVLDEIDAALDPTNRRRVSDMIKEMAKHtSQFIVITLKKEMFENADKLIGVLFVH 174

                   ....
gi 1034663586 1114 GVST 1117
Cdd:cd03239    175 GVST 178
ABC_SMC_barmotin cd03278
ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a ...
1031-1107 1.62e-23

ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213245 [Multi-domain]  Cd Length: 197  Bit Score: 99.46  E-value: 1.62e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586 1031 NLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNANVLF 1107
Cdd:cd03278    110 RLSLLSGGEKALTALALLFAIFRVRPSPFCVLDEVDAALDDANVERFARLLKEFSKETQFIVITHRKGTMEAADRLY 186
SMC_hinge pfam06470
SMC proteins Flexible Hinge Domain; This family represents the hinge region of the SMC ...
472-590 3.95e-23

SMC proteins Flexible Hinge Domain; This family represents the hinge region of the SMC (Structural Maintenance of Chromosomes) family of proteins. The hinge region is responsible for formation of the DNA interacting dimer. It is also possible that the precise structure of it is an essential determinant of the specificity of the DNA-protein interaction.


Pssm-ID: 461926 [Multi-domain]  Cd Length: 116  Bit Score: 95.41  E-value: 3.95e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  472 VKGLVASLISVKDTsATTALELVAGERLYNVVVDTEVTGKKLLErgELKR----RYTIIPLNKISARCIAPETLRVAqnl 547
Cdd:pfam06470    4 VLGRLADLIEVDEG-YEKAVEAALGGRLQAVVVDDEDDAKRAIE--FLKKnklgRATFLPLDRLKPRPRRPGADLKG--- 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1034663586  548 vgpdNVHVALSLVEYKPELQKAMEFVFGTTFVCDNMDNAKKVA 590
Cdd:pfam06470   78 ----GAGPLLDLVEYDDEYRKALRYLLGNTLVVDDLDEALELA 116
ABC_SMC_barmotin cd03278
ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a ...
3-107 4.23e-21

ATP-binding cassette domain of barmotin, a member of the SMC protein family; Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213245 [Multi-domain]  Cd Length: 197  Bit Score: 92.53  E-value: 4.23e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGISNLSQVViGGRNKYLINgvNANNTRVQDLFC 82
Cdd:cd03278      1 LKKLELKGFKSFADKTTIP-FPPGLTAIVGPNGSGKSNIIDAIRWVLGEQSAKSLR-GEKMSDVIF--AGSETRKPANFA 76
                           90       100
                   ....*....|....*....|....*...
gi 1034663586   83 SVGL---NVNNPHFLIMQGRITKVLNMK 107
Cdd:cd03278     77 EVTLtfdNSDGRYSIISQGDVSEIIEAP 104
ABC_SMC_head cd03239
The SMC head domain belongs to the ATP-binding cassette superfamily; The structural ...
3-104 4.24e-21

The SMC head domain belongs to the ATP-binding cassette superfamily; The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression.


Pssm-ID: 213206 [Multi-domain]  Cd Length: 178  Bit Score: 91.99  E-value: 4.24e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNGFDPlFNAITGLNGSGKSNILDSICFLLGISNLSQvvigGRNKYLINGVNANNTRVQDlfC 82
Cdd:cd03239      1 IKQITLKNFKSYRDETVVGGSNS-FNAIVGPNGSGKSNIVDAICFVLGGKAAKL----RRGSLLFLAGGGVKAGINS--A 73
                           90       100
                   ....*....|....*....|..
gi 1034663586   83 SVGLNVNNPHFLIMQGRITKVL 104
Cdd:cd03239     74 SVEITFDKSYFLVLQGKVEQIL 95
ABC_Class2 cd03227
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ...
1032-1117 5.69e-19

ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.


Pssm-ID: 213194 [Multi-domain]  Cd Length: 162  Bit Score: 85.10  E-value: 5.69e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1032 LTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTH-SQFIVVSLKEGMFNNANVLFKTK 1110
Cdd:cd03227     75 RLQLSGGEKELSALALILALASLKPRPLYILDEIDRGLDPRDGQALAEAILEHLVKgAQVIVITHLPELAELADKLIHIK 154

                   ....*...
gi 1034663586 1111 FV-DGVST 1117
Cdd:cd03227    155 KViTGVYK 162
ABC_SMC3_euk cd03272
ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of ...
3-114 1.56e-18

ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213239 [Multi-domain]  Cd Length: 243  Bit Score: 86.16  E-value: 1.56e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNGFDPLFNAITGLNGSGKSNILDSICFLL--------------------GISNLSQVV---- 58
Cdd:cd03272      1 IKQVIIQGFKSYKDQTVIEPFSPKHNVVVGRNGSGKSNFFAAIRFVLsdeythlreeqrqallhegsGPSVMSAYVeiif 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586   59 --------IGG------------RNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPEILSM 114
Cdd:cd03272     81 dnsdnrfpIDKeevrlrrtiglkKDEYFLDKKNVTKNDVMNLLESAGFSRSNPYYIVPQGKINSLTNMKQDEQQEM 156
ABC_SMC3_euk cd03272
ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of ...
1031-1118 4.03e-18

ATP-binding cassette domain of eukaryotic SMC3 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213239 [Multi-domain]  Cd Length: 243  Bit Score: 85.00  E-value: 4.03e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1031 NLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNANVLFKTK 1110
Cdd:cd03272    155 EMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDAALDAQYRTAVANMIKELSDGAQFITTTFRPELLEVADKFYGVK 234

                   ....*...
gi 1034663586 1111 FVDGVSTV 1118
Cdd:cd03272    235 FRNKVSTI 242
PTZ00121 PTZ00121
MAEBL; Provisional
115-911 1.51e-16

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 85.58  E-value: 1.51e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  115 IEEAagTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQKVMREIEhlsrlyiayqfllaed 194
Cdd:PTZ00121  1214 AEEA--RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE---------------- 1275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  195 tKVRSAEELKEMQDKviKLQEELSENDKKIKAlnheiEELEKRKDKEtggilRSLEDALAEAQRVNTKSQSAfdlKKKnl 274
Cdd:PTZ00121  1276 -EARKADELKKAEEK--KKADEAKKAEEKKKA-----DEAKKKAEEA-----KKADEAKKKAEEAKKKADAA---KKK-- 1337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  275 aCEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHAlQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEAtlagqmma 354
Cdd:PTZ00121  1338 -AEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK-EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKA-------- 1407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  355 ckNDISKAQTEAKQAQmklkhaqqELKNKQAEVKKMDSGYRKDQE---ALEAVKRLKEKLEAEMKKLNYEENKEESLLEK 431
Cdd:PTZ00121  1408 --DELKKAAAAKKKAD--------EAKKKAEEKKKADEAKKKAEEakkADEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  432 RRQLSRDIGRLKETYEALLARFPNLRFAYKDPEKnwnrncvkglvaslisvkdtsattALELVAGErlynvvvdtEVTGK 511
Cdd:PTZ00121  1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKK------------------------ADEAKKAE---------EAKKA 1524
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  512 KLLERGELKRRytiiplnkisarciaPETLRVAQNLVGPDNVHVAlslveykPELQKAMEfvfgttfvCDNMDNAKKVAF 591
Cdd:PTZ00121  1525 DEAKKAEEAKK---------------ADEAKKAEEKKKADELKKA-------EELKKAEE--------KKKAEEAKKAEE 1574
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  592 DKRIMTRTVTLGGDV----FDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQ 667
Cdd:PTZ00121  1575 DKNMALRKAEEAKKAeearIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKK 1654
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  668 QWEMKTEEADLLQTKLQQSSyhKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAE----AERERELK- 742
Cdd:PTZ00121  1655 AEEENKIKAAEEAKKAEEDK--KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEelkkAEEENKIKa 1732
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  743 -DAQKKLDCAKTKADASSKKMKEK----------QQEVEAITLELEELKREHTSYK--QQLEAVNEAIKSYESQIEVMAA 809
Cdd:PTZ00121  1733 eEAKKEAEEDKKKAEEAKKDEEEKkkiahlkkeeEKKAEEIRKEKEAVIEEELDEEdeKRRMEVDKKIKDIFDNFANIIE 1812
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  810 EVAKNKESVNKAQE-EVTKQKEVITAQDTVIKaKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKV--SKMLK 886
Cdd:PTZ00121  1813 GGKEGNLVINDSKEmEDSAIKEVADSKNMQLE-EADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIeeADEIE 1891
                          810       820
                   ....*....|....*....|....*
gi 1034663586  887 DYDWINAERHLfgqPNSAYDFKTNN 911
Cdd:PTZ00121  1892 KIDKDDIEREI---PNNNMAGKNND 1913
ABC_SMC1_euk cd03275
ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of ...
1035-1106 2.97e-16

ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213242 [Multi-domain]  Cd Length: 247  Bit Score: 79.54  E-value: 2.97e-16
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586 1035 LSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFT-HSQFIVVSLKEGMFNNANVL 1106
Cdd:cd03275    156 LSGGEKTMAALALLFAIHSYQPAPFFVLDEVDAALDNTNVGKVASYIREQAGpNFQFIVISLKEEFFSKADAL 228
SbcC COG0419
DNA repair exonuclease SbcCD ATPase subunit [Replication, recombination and repair];
2-153 1.02e-15

DNA repair exonuclease SbcCD ATPase subunit [Replication, recombination and repair];


Pssm-ID: 440188 [Multi-domain]  Cd Length: 204  Bit Score: 76.97  E-value: 1.02e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    2 HIKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGisnlSQVVIGGRNKYLINGVNANNTRVqdlf 81
Cdd:COG0419      1 KLLRLRLENFRSYRDTETID-FDDGLNLIVGPNGAGKSTILEAIRYALY----GKARSRSKLRSDLINVGSEEASV---- 71
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586   82 cSVGLNVNNPHFLI--MQGRITKVLNMKPPEILSMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTI--LEEEI 153
Cdd:COG0419     72 -ELEFEHGGKRYRIerRQGEFAEFLEAKPSERKEALKRLLGLEIYEELKERLKELEEALESALEELAELqkLKQEI 146
ABC_SMC4_euk cd03274
ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of ...
1028-1106 2.45e-15

ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213241 [Multi-domain]  Cd Length: 212  Bit Score: 76.18  E-value: 2.45e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586 1028 WKeNLTELSGGQRSLVALSLILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVVSLKEGMFNNANVL 1106
Cdd:cd03274    122 WK-NISNLSGGEKTLSSLALVFALHHYKPTPLYVMDEIDAALDFRNVSIVANYIKERTKNAQFIVISLRNNMFELADRL 199
ABC_SMC4_euk cd03274
ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of ...
3-106 1.76e-13

ATP-binding cassette domain of eukaryotic SMC4 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213241 [Multi-domain]  Cd Length: 212  Bit Score: 70.79  E-value: 1.76e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNGFDPLFNAITGLNGSGKSNILDSICFLLGI-------SNLSQVVIGGRNKYLIN----GVN 71
Cdd:cd03274      3 ITKLVLENFKSYAGEQVIGPFHKSFSAIVGPNGSGKSNVIDSMLFVFGFraskmrqKKLSDLIHNSAGHPNLDscsvEVH 82
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1034663586   72 ANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNM 106
Cdd:cd03274     83 FQEIIDKPLLKSKGIDLDHNRFLILQGEVEQIAQM 117
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
148-897 3.97e-12

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 70.71  E-value: 3.97e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  148 ILEE-EITPTIQKLKEERSSYLEYQKVM----REIEHLSRLYIAYQFLLAEDTKVRSAEELKEM------QDKVIKLQEE 216
Cdd:COG4913    217 MLEEpDTFEAADALVEHFDDLERAHEALedarEQIELLEPIRELAERYAAARERLAELEYLRAAlrlwfaQRRLELLEAE 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  217 LSENDKKIKALNHEIEELEKRKDkETGGILRSLEDALAEA--QRVNtksqsafDLKKknlaceESKRKEleknmvedsKT 294
Cdd:COG4913    297 LEELRAELARLEAELERLEARLD-ALREELDELEAQIRGNggDRLE-------QLER------EIERLE---------RE 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  295 LAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEAtlagqmmackndiskAQTEAKQAQMKLK 374
Cdd:COG4913    354 LEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEE---------------ALAEAEAALRDLR 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  375 HAQQELKNKQAEVKKMDSGYRKDQEALeavkrlkekleaemkklnyeenkeesllekRRQLSRDIGrLKETyeallarfp 454
Cdd:COG4913    419 RELRELEAEIASLERRKSNIPARLLAL------------------------------RDALAEALG-LDEA--------- 458
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  455 NLRFaykdpeknwnrncvkglVASLISVKDTSA--TTALELVAGERLYNVVVDTEVTGK--KLLERGELKRRytiIPLNK 530
Cdd:COG4913    459 ELPF-----------------VGELIEVRPEEErwRGAIERVLGGFALTLLVPPEHYAAalRWVNRLHLRGR---LVYER 518
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  531 ISARCIAPETLRVAQN-LVGpdnvhvalsLVEYKP-ELQKAMEFVFGTTFV---CDNMDNAKKVafdkrimTRTVTLGGD 605
Cdd:COG4913    519 VRTGLPDPERPRLDPDsLAG---------KLDFKPhPFRAWLEAELGRRFDyvcVDSPEELRRH-------PRAITRAGQ 582
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  606 VFDPHGTLSGGARSQAAS-----------ILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEmktE 674
Cdd:COG4913    583 VKGNGTRHEKDDRRRIRSryvlgfdnrakLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSW---D 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  675 EADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERelkdAQKKLDCAKTK 754
Cdd:COG4913    660 EIDVASAEREIAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQ----AEEELDELQDR 735
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  755 ADASSKKmkekqqEVEAITLELEELKREHtsykQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQ-KEVIT 833
Cdd:COG4913    736 LEAAEDL------ARLELRALLEERFAAA----LGDAVERELRENLEERIDALRARLNRAEEELERAMRAFNREwPAETA 805
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  834 AQDTVIKA--KYAEV---------AKHKEQ-----NNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWiNAERHL 897
Cdd:COG4913    806 DLDADLESlpEYLALldrleedglPEYEERfkellNENSIEFVADLLSKLRRAIREIKERIDPLNDSLKRIPF-GPGRYL 884
ABC_ATPase cd00267
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large ...
996-1094 9.48e-12

ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213179 [Multi-domain]  Cd Length: 157  Bit Score: 64.19  E-value: 9.48e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  996 STLLPGANAMLAPPEGQTVLDGLEFKVALGNTWKEN---LTELSGGQRSLVALSLILsmlLFKPaPIYILDEVDAALDLS 1072
Cdd:cd00267     39 STLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRigyVPQLSGGQRQRVALARAL---LLNP-DLLLLDEPTSGLDPA 114
                           90       100
                   ....*....|....*....|...
gi 1034663586 1073 HTQNIGQMLRTHF-THSQFIVVS 1094
Cdd:cd00267    115 SRERLLELLRELAeEGRTVIIVT 137
ABC_Class2 cd03227
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ...
5-50 2.82e-11

ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.


Pssm-ID: 213194 [Multi-domain]  Cd Length: 162  Bit Score: 63.15  E-value: 2.82e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586    5 SIILEGFKSYAQRTEVNGFDPLFNAITGLNGSGKSNILDSICFLLG 50
Cdd:cd03227      1 KIVLGRFPSYFVPNDVTFGEGSLTIITGPNGSGKSTILDAIGLALG 46
ABC_SMC1_euk cd03275
ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of ...
3-109 6.65e-11

ATP-binding cassette domain of eukaryotic SMC1 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213242 [Multi-domain]  Cd Length: 247  Bit Score: 63.75  E-value: 6.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQRTEVNGFDPlFNAITGLNGSGKSNILDSICFLLGI--SNL-------------------------- 54
Cdd:cd03275      1 LKRLELENFKSYKGRHVIGPFDR-FTCIIGPNGSGKSNLMDAISFVLGEksSHLrsknlkdliyrarvgkpdsnsayvta 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586   55 ------------SQVVIGGRNKYLINGVNANNTRVQDLFCSVGLNVNNPHFLIMQGRITKVLNMKPP 109
Cdd:cd03275     80 vyedddgeektfRRIITGGSSSYRINGKVVSLKEYNEELEKINILVKARNFLVFQGDVESIASKNPP 146
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
618-862 8.71e-11

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 66.24  E-value: 8.71e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  618 RSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSsyhkqQEELDA 697
Cdd:PRK03918   196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIREL-----EERIEE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  698 LKKTIEESEETLKNTKEIQRKAEE---------KYEVLENKMKNAEAERERELKDAQKKLdcaktkadassKKMKEKQQE 768
Cdd:PRK03918   271 LKKEIEELEEKVKELKELKEKAEEyiklsefyeEYLDELREIEKRLSRLEEEINGIEERI-----------KELEEKEER 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  769 VEAITLELEELKREHTSYKQQLEAVNEAiKSYESQIEVMAAEVA-KNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVA 847
Cdd:PRK03918   340 LEELKKKLKELEKRLEELEERHELYEEA-KAKKEELERLKKRLTgLTPEKLEKELEELEKAKEEIEEEISKITARIGELK 418
                          250
                   ....*....|....*
gi 1034663586  848 KHKEQNNDSQLKIKE 862
Cdd:PRK03918   419 KEIKELKKAIEELKK 433
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
207-459 8.96e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 65.17  E-value: 8.96e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  207 QDKVIKLQEELSENDKKIKALNHEIEELEKRKdKETGGILRSLEDALAEAQRVNTKSQSAFDLKKKNLAceeskrkELEK 286
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEE-KALLKQLAALERRIAALARRIRALEQELAALEAELA-------ELEK 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  287 NMVEDSKTLAAKEKEVKKItdgLHALQEASNKDAEALaaaqqhfnavsagLSSNEDGAEATLAGQMMACKNDISKAQTEA 366
Cdd:COG4942     91 EIAELRAELEAQKEELAEL---LRALYRLGRQPPLAL-------------LLSPEDFLDAVRRLQYLKYLAPARREQAEE 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  367 -KQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEM-KKLNYEENKEESLLEKRRQLSRDIGRLKE 444
Cdd:COG4942    155 lRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLeKELAELAAELAELQQEAEELEALIARLEA 234
                          250
                   ....*....|....*
gi 1034663586  445 TYEALLARFPNLRFA 459
Cdd:COG4942    235 EAAAAAERTPAAGFA 249
PTZ00121 PTZ00121
MAEBL; Provisional
192-887 1.79e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 1.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  192 AEDtkVRSAEELKEMQDkVIKLQEELSENDKKIKALNHEIEELEKRKdketggILRSLEDAL-AEAQRVNTKSQSAFDLK 270
Cdd:PTZ00121  1124 AED--ARKAEEARKAED-ARKAEEARKAEDAKRVEIARKAEDARKAE------EARKAEDAKkAEAARKAEEVRKAEELR 1194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  271 KknlaCEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAvsaGLSSNEDGAEATLAG 350
Cdd:PTZ00121  1195 K----AEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNE---EIRKFEEARMAHFAR 1267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  351 QMMACKNDISKAQTEAKQAQMKLKHAQ----------QELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLE-------A 413
Cdd:PTZ00121  1268 RQAAIKAEEARKADELKKAEEKKKADEakkaeekkkaDEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEeakkaaeA 1347
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  414 EMKKLNYEENKEESLLEKRRQLSRDIGRLKETYEALLARFPNLRFAYKDPEKnwnrncvkglvaslisvKDTSATTALEL 493
Cdd:PTZ00121  1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK-----------------AEEDKKKADEL 1410
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  494 VAGERLYNVVVDTEVTGKKLLERGELKRRytiiplnkisarciAPETLRVAQNLVGPDNVHVALSLVEYKPELQKAmefv 573
Cdd:PTZ00121  1411 KKAAAAKKKADEAKKKAEEKKKADEAKKK--------------AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKA---- 1472
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  574 fgttfvcdnmDNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELA 653
Cdd:PTZ00121  1473 ----------DEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKA 1542
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  654 GLKNTAEKYR---QLKQQWEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKNTKEIqrKAEEKYEVLENKM 730
Cdd:PTZ00121  1543 EEKKKADELKkaeELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKM--KAEEAKKAEEAKI 1620
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  731 KNAEAERERELKdaqKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMA-- 808
Cdd:PTZ00121  1621 KAEELKKAEEEK---KKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKke 1697
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  809 AEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNND--SQLKIKELDHNISKHKREAEDGAAKVSKMLK 886
Cdd:PTZ00121  1698 AEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKkaEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEK 1777

                   .
gi 1034663586  887 D 887
Cdd:PTZ00121  1778 E 1778
PTZ00121 PTZ00121
MAEBL; Provisional
617-980 2.22e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 2.22e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKtEEADLLQTKLQQssyhkqQEELD 696
Cdd:PTZ00121  1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAK-KKADEAKKKAEE------KKKAD 1434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALKKTIEESeetlKNTKEIQRKAEEKYEVLENKMKnaeAERERELKDAQKKLDCAKtKADASSKKMKEKQQEVEAITLEL 776
Cdd:PTZ00121  1435 EAKKKAEEA----KKADEAKKKAEEAKKAEEAKKK---AEEAKKADEAKKKAEEAK-KADEAKKKAEEAKKKADEAKKAA 1506
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  777 EELKREHTSYKQQLEAVNEAIKSYEsqiEVMAAEVAKNKESVNKAqEEVTKQKEVITAQDTvikaKYAEVAKHKEQNNDS 856
Cdd:PTZ00121  1507 EAKKKADEAKKAEEAKKADEAKKAE---EAKKADEAKKAEEKKKA-DELKKAEELKKAEEK----KKAEEAKKAEEDKNM 1578
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  857 QLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHlfgqpnsayDFKTNNPKEAGQRLQKLQEMKEKLGRNVNmR 936
Cdd:PTZ00121  1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA---------KIKAEELKKAEEEKKKVEQLKKKEAEEKK-K 1648
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586  937 AMNVLTEAEERYNDLMKKKRIVENDKSKI--LTTIEDLDQKKNQAL 980
Cdd:PTZ00121  1649 AEELKKAEEENKIKAAEEAKKAEEDKKKAeeAKKAEEDEKKAAEAL 1694
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
618-975 1.52e-09

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 62.15  E-value: 1.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  618 RSQAASILTKFQELKDvqdELRIKENELRALEEELAGLKNTAEKYRQ----LKQQWEMKTEEADLLQTklqqssyhkqqe 693
Cdd:pfam10174  281 KSHSKFMKNKIDQLKQ---ELSKKESELLALQTKLETLTNQNSDCKQhievLKESLTAKEQRAAILQT------------ 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  694 ELDALKKTIEESEETL-KNTKEIQRKAEE--------------------KYEVLENKMKN-AEAERERE-----LKDAQK 746
Cdd:pfam10174  346 EVDALRLRLEEKESFLnKKTKQLQDLTEEkstlageirdlkdmldvkerKINVLQKKIENlQEQLRDKDkqlagLKERVK 425
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  747 KLDCAKTKADASSKKMKEKQQEVEAITLEL-EELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEV 825
Cdd:pfam10174  426 SLQTDSSNTDTALTTLEEALSEKERIIERLkEQREREDRERLEELESLKKENKDLKEKVSALQPELTEKESSLIDLKEHA 505
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  826 TKQKEVITAQDTVIKAKYAEVAKHKEQ--------------------NNDSQLKIKELDHNISKHKREAEDGAAKVSKML 885
Cdd:pfam10174  506 SSLASSGLKKDSKLKSLEIAVEQKKEEcsklenqlkkahnaeeavrtNPEINDRIRLLEQEVARYKEESGKAQAEVERLL 585
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  886 ------------KDYDWINAE----RHLFGQPNSAYDFKTNNPKEagqRLQKLQEMKEKLGRNVNMRAMNVLTEAEERYN 949
Cdd:pfam10174  586 gilrevenekndKDKKIAELEsltlRQMKEQNKKVANIKHGQQEM---KKKGAQLLEEARRREDNLADNSQQLQLEELMG 662
                          410       420
                   ....*....|....*....|....*.
gi 1034663586  950 DLMKKKRIVENDKSKILTTIEDLDQK 975
Cdd:pfam10174  663 ALEKTRQELDATKARLSSTQQSLAEK 688
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
198-863 1.73e-09

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 62.44  E-value: 1.73e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  198 RSAEELK-EMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGgILRSLEDALAEAQRVNTKSQSAFD----LKKK 272
Cdd:pfam15921  138 QSQEDLRnQLQNTVHELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEG-VLQEIRSILVDFEEASGKKIYEHDsmstMHFR 216
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  273 NLACEESK-RKELEKNmvedsktLAAKEKEVKKITDGLHALQEASNKDAEALAaaQQHFNAVSAGLSSNEDgaeatlagQ 351
Cdd:pfam15921  217 SLGSAISKiLRELDTE-------ISYLKGRIFPVEDQLEALKSESQNKIELLL--QQHQDRIEQLISEHEV--------E 279
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  352 MMACKNDISKAQTEAKQAQMKLKHAQQELKNKQA----EVKKMDSGYRKDQEALEAVKRLKE-KLEAEMKKLNYEENKEE 426
Cdd:pfam15921  280 ITGLTEKASSARSQANSIQSQLEIIQEQARNQNSmymrQLSDLESTVSQLRSELREAKRMYEdKIEELEKQLVLANSELT 359
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  427 SLLEKRRQLSRDIGRLKETYEALLA----RFPNLRFAYKDPEKNWNRNC-------------------VKGLVASLISVK 483
Cdd:pfam15921  360 EARTERDQFSQESGNLDDQLQKLLAdlhkREKELSLEKEQNKRLWDRDTgnsitidhlrrelddrnmeVQRLEALLKAMK 439
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  484 D----------------------TSATTALELVAGERLYNVVvdTEVTGKKL-LERGELKrrytiipLNKISARCIAPET 540
Cdd:pfam15921  440 SecqgqmerqmaaiqgkneslekVSSLTAQLESTKEMLRKVV--EELTAKKMtLESSERT-------VSDLTASLQEKER 510
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  541 LRVAQNL-VGPDNVHVALSLVEYKpELQKAMEFVFGTTFVCD----NMDNAKKVAfdkRIMTRTVTLGGDVFDPHGTLSG 615
Cdd:pfam15921  511 AIEATNAeITKLRSRVDLKLQELQ-HLKNEGDHLRNVQTECEalklQMAEKDKVI---EILRQQIENMTQLVGQHGRTAG 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  616 GARSQAASI----------LTKFQELKDVQDElRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLL-QTKLQ 684
Cdd:pfam15921  587 AMQVEKAQLekeindrrleLQEFKILKDKKDA-KIRELEARVSDLELEKVKLVNAGSERLRAVKDIKQERDQLLnEVKTS 665
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  685 QSSYHKQQEELDALKKTI----EESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDA---QKKLDCAKTKADA 757
Cdd:pfam15921  666 RNELNSLSEDYEVLKRNFrnksEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAmgmQKQITAKRGQIDA 745
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  758 SSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDT 837
Cdd:pfam15921  746 LQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFAECQD 825
                          730       740
                   ....*....|....*....|....*.
gi 1034663586  838 VIKAKYAEVAKHKEQNNdsqLKIKEL 863
Cdd:pfam15921  826 IIQRQEQESVRLKLQHT---LDVKEL 848
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
627-789 1.92e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 61.71  E-value: 1.92e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  627 KFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQ-QSSYHKQQEELDALKKTIEES 705
Cdd:COG4717     79 ELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEAlEAELAELPERLEELEERLEEL 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  706 EETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTS 785
Cdd:COG4717    159 RELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEA 238

                   ....
gi 1034663586  786 YKQQ 789
Cdd:COG4717    239 AALE 242
PTZ00121 PTZ00121
MAEBL; Provisional
110-875 8.50e-09

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 60.15  E-value: 8.50e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  110 EILSMIEEAagtRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERSSY-LEYQKVMREIEHLSRLYIAYQ 188
Cdd:PTZ00121  1095 EAFGKAEEA---KKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEdAKRVEIARKAEDARKAEEARK 1171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  189 fllAEDTK----------VRSAEELKEMQDKVIKLQEELSENDKKIkalnheiEELEKRKDKETGGILRSLEDALAEAqr 258
Cdd:PTZ00121  1172 ---AEDAKkaeaarkaeeVRKAEELRKAEDARKAEAARKAEEERKA-------EEARKAEDAKKAEAVKKAEEAKKDA-- 1239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  259 vntksqsafdlkkknlacEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEasnKDAEALAAAQQHFNAVSAGLS 338
Cdd:PTZ00121  1240 ------------------EEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA---RKADELKKAEEKKKADEAKKA 1298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  339 SNEDGAEatlagqmmacknDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKL 418
Cdd:PTZ00121  1299 EEKKKAD------------EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  419 NYEENKEESLLEKRRQLSRDIGRLKETYEAllarfpnLRFAYKDPEKnwnrncvkglvASLISVKDTSATTALELVAGER 498
Cdd:PTZ00121  1367 EAAEKKKEEAKKKADAAKKKAEEKKKADEA-------KKKAEEDKKK-----------ADELKKAAAAKKKADEAKKKAE 1428
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  499 LYNVVVDTEVTGKKLLERGELKRRytiiplnkisarciaPETLRVAQNLV-GPDNVHVALSLVEYKPELQKAMEFVFGTT 577
Cdd:PTZ00121  1429 EKKKADEAKKKAEEAKKADEAKKK---------------AEEAKKAEEAKkKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  578 FVCDNMDNAKKVAFDKRIMTRTVTlgGDVFDPHGTLSGGARSQAASILTKFQELKDVqDELRiKENELRALEEelaglKN 657
Cdd:PTZ00121  1494 EAKKKADEAKKAAEAKKKADEAKK--AEEAKKADEAKKAEEAKKADEAKKAEEKKKA-DELK-KAEELKKAEE-----KK 1564
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  658 TAEKYRQLKQQWEMKTEEADLLQT--KLQQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLEnKMKNAEA 735
Cdd:PTZ00121  1565 KAEEAKKAEEDKNMALRKAEEAKKaeEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE-QLKKKEA 1643
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  736 ERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAiksyeSQIEVMAAEVAKNK 815
Cdd:PTZ00121  1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKA-----EELKKKEAEEKKKA 1718
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  816 ESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAE 875
Cdd:PTZ00121  1719 EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKE 1778
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1-832 2.13e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.54  E-value: 2.13e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQrTEVNgFDPLFNAITGLNGSGKSNILDSI---------CFLLGISNLSQVVIGGRNKYLINGVN 71
Cdd:PRK03918     1 MKIEELKIKNFRSHKS-SVVE-FDDGINLIIGQNGSGKSSILEAIlvglywghgSKPKGLKKDDFTRIGGSGTEIELKFE 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   72 ANNTRvqdlfCSVGLNVNNPHFLIMQGRITKVLNMKPPEILSMIEE---------AAGTRMYEYKKIaaqktIEKKEAKL 142
Cdd:PRK03918    79 KNGRK-----YRIVRSFNRGESYLKYLDGSEVLEEGDSSVREWVERlipyhvflnAIYIRQGEIDAI-----LESDESRE 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  143 KEIKTIL--------EEEITPTIQKLKEERSSYLEYQKVMREIEHLSRlyiayqflLAEDTKVRSAEELKEMQDKVIKLQ 214
Cdd:PRK03918   149 KVVRQILglddyenaYKNLGEVIKEIKRRIERLEKFIKRTENIEELIK--------EKEKELEEVLREINEISSELPELR 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  215 EEL---SENDKKIKALNHEIEELEKRKDKETGGiLRSLEDALAEAQrvntksqsafdlkkKNLACEESKRKELEKNmVED 291
Cdd:PRK03918   221 EELeklEKEVKELEELKEEIEELEKELESLEGS-KRKLEEKIRELE--------------ERIEELKKEIEELEEK-VKE 284
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  292 SKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDgaeatlagqMMACKNDISKAQTEAKQAQM 371
Cdd:PRK03918   285 LKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEE---------KEERLEELKKKLKELEKRLE 355
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  372 KLKHAQQELKNKQAEVKKMDsGYRKDQEALEAVKRLKEKLEAEMKKLNYEEnKEESLLEKRRQLSRDIGRLKETYEALla 451
Cdd:PRK03918   356 ELEERHELYEEAKAKKEELE-RLKKRLTGLTPEKLEKELEELEKAKEEIEE-EISKITARIGELKKEIKELKKAIEEL-- 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  452 rfpnlrfaykdpeKNWNRNCvkglvaslisvkdtsattalelvagerlynvvvdtEVTGKKLLE--RGELKRRYTiIPLN 529
Cdd:PRK03918   432 -------------KKAKGKC-----------------------------------PVCGRELTEehRKELLEEYT-AELK 462
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  530 KISA---------RCIAPETLRVAQNLVGPDNVHVALSLVEYKPELQKAMEFVfgttfvcdNMDNAKKVAFDKRIMTRtv 600
Cdd:PRK03918   463 RIEKelkeieekeRKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKY--------NLEELEKKAEEYEKLKE-- 532
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  601 tlggdvfdphgtLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLkntaekyrqLKQQWEMKTEEADLLQ 680
Cdd:PRK03918   533 ------------KLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAEL---------LKELEELGFESVEELE 591
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  681 TKLQQ-SSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKnaeaERERELKDAQKKLDcaktkadass 759
Cdd:PRK03918   592 ERLKElEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE----ELRKELEELEKKYS---------- 657
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586  760 kkmkekQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNK------ESVNKAQEEVTKQKEVI 832
Cdd:PRK03918   658 ------EEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREkakkelEKLEKALERVEELREKV 730
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1-863 4.13e-08

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 57.67  E-value: 4.13e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYaQRTEVNGFDPL--FNAITGLNGSGKSNILDSICFLL--GISNLSQVVI----------------- 59
Cdd:TIGR00618    1 MKPLRLTLKNFGSY-KGTHTIDFTALgpIFLICGKTGAGKTTLLDAITYALygKLPRRSEVIRslnslyaapseaafael 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   60 -----------------------------------GGRNKYLINGVNANNTRVQDLfcsvgLNVNNPHF----LIMQGRI 100
Cdd:TIGR00618   80 efslgtkiyrvhrtlrctrshrkteqpeqlyleqkKGRGRILAAKKSETEEVIHDL-----LKLDYKTFtrvvLLPQGEF 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  101 TKVLNMKPPEILSMIEEAAGTRMYE---------YKKIAAQKTIEKKEAKL------------KEIKTILEEEITPTIQK 159
Cdd:TIGR00618  155 AQFLKAKSKEKKELLMNLFPLDQYTqlalmefakKKSLHGKAELLTLRSQLltlctpcmpdtyHERKQVLEKELKHLREA 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  160 LKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHeIEELEKRKD 239
Cdd:TIGR00618  235 LQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQ-IEQQAQRIH 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  240 KETGGILRSLEDALAEAQRVnTKSQSAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEKEvKKITDGLHALQEASNKD 319
Cdd:TIGR00618  314 TELQSKMRSRAKLLMKRAAH-VKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQ-HTLTQHIHTLQQQKTTL 391
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  320 AEALAAAQQHFNAVS--AGLSSNEDGAEATLAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKD 397
Cdd:TIGR00618  392 TQKLQSLCKELDILQreQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKER 471
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  398 QEALEAVKRLKEKlEAEMKKLnyEENKEESLLEKRRQLSrdiGRLKETYEAllarfpnlrfaykdpeknwnrncvkgLVA 477
Cdd:TIGR00618  472 EQQLQTKEQIHLQ-ETRKKAV--VLARLLELQEEPCPLC---GSCIHPNPA--------------------------RQD 519
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  478 SLISVKDTSATTALE------LVAGERLYNVVVDTEVTGKKLLERGELKRRYTIIPLNKISA-RCIAPETLRVAQNLvgp 550
Cdd:TIGR00618  520 IDNPGPLTRRMQRGEqtyaqlETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRsKEDIPNLQNITVRL--- 596
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  551 dnvhvalslveyKPELQKamefvfgttfvcdNMDNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQE 630
Cdd:TIGR00618  597 ------------QDLTEK-------------LSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHAL 651
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  631 LKDVQDE------LRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQ---TKLQQSS--YHKQQEELDALK 699
Cdd:TIGR00618  652 QLTLTQErvrehaLSIRVLPKELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLReleTHIEEYDreFNEIENASSSLG 731
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  700 KTIEESEETLKNT-KEIQRKAEEKYEVLENKMKNAEaerERELKDAQKKLDCAKTKADasskkMKEKQQEVEAITLELEE 778
Cdd:TIGR00618  732 SDLAAREDALNQSlKELMHQARTVLKARTEAHFNNN---EEVTAALQTGAELSHLAAE-----IQFFNRLREEDTHLLKT 803
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  779 LKREHTSYkqqleavneaIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQdtviKAKYAEVAKHKEQNNDSQL 858
Cdd:TIGR00618  804 LEAEIGQE----------IPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQ----LLKYEECSKQLAQLTQEQA 869

                   ....*
gi 1034663586  859 KIKEL 863
Cdd:TIGR00618  870 KIIQL 874
COG3950 COG3950
Predicted ATP-binding protein involved in virulence [General function prediction only];
1-181 4.22e-08

Predicted ATP-binding protein involved in virulence [General function prediction only];


Pssm-ID: 443150 [Multi-domain]  Cd Length: 276  Bit Score: 55.77  E-value: 4.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQRtEVN-GFDPLFNAITGLNGSGKSNILDSICFLLGISNLSQVVIGGRNKYLINGVNANNTRV-- 77
Cdd:COG3950      1 MRIKSLTIENFRGFEDL-EIDfDNPPRLTVLVGENGSGKTTLLEAIALALSGLLSRLDDVKFRKLLIRNGEFGDSAKLil 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   78 ----QDLFCSVGLNVNNPHFLIMQGRITKVLNMKPPEIlSMIEeaAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILeEEI 153
Cdd:COG3950     80 yygtSRLLLDGPLKKLERLKEEYFSRLDGYDSLLDEDS-NLRE--FLEWLREYLEDLENKLSDELDEKLEAVREAL-NKL 155
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1034663586  154 TPTIQKLKEERSSYLEYQKVMR----EIEHLS 181
Cdd:COG3950    156 LPDFKDIRIDRDPGRLVILDKNgeelPLNQLS 187
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
639-810 7.92e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 56.70  E-value: 7.92e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  639 RIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKL----QQSSYHKQQEELDALKKTIEESEETLKNTKE 714
Cdd:COG4717     67 ELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELeelrEELEKLEKLLQLLPLYQELEALEAELAELPE 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  715 IQRKAEEKYEVLENKMkNAEAERERELKDAQKKLDCAKTKADASSKK-MKEKQQEVEAITLELEELKREHTSYKQQLEAV 793
Cdd:COG4717    147 RLEELEERLEELRELE-EELEELEAELAELQEELEELLEQLSLATEEeLQDLAEELEELQQRLAELEEELEEAQEELEEL 225
                          170
                   ....*....|....*..
gi 1034663586  794 NEAIKSYESQIEVMAAE 810
Cdd:COG4717    226 EEELEQLENELEAAALE 242
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
694-848 7.98e-08

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 54.55  E-value: 7.98e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  694 ELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAER---ERELKDAQKKLDCAKTKADASSKkmkekQQEVE 770
Cdd:COG1579     18 ELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIkrlELEIEEVEARIKKYEEQLGNVRN-----NKEYE 92
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034663586  771 AITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAK 848
Cdd:COG1579     93 ALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAA 170
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
619-1092 1.14e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 55.93  E-value: 1.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  619 SQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSYH---KQQEEL 695
Cdd:COG4717    122 EKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEelqDLAEEL 201
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  696 DALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKK---------------------------- 747
Cdd:COG4717    202 EELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLiaaallallglggsllsliltiagvlfl 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  748 ----LDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQE 823
Cdd:COG4717    282 vlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEE 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  824 EV------TKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHL 897
Cdd:COG4717    362 ELqleeleQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEEL 441
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  898 fgqpnsaydfktnnpKEAGQRLQKL-QEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKK 976
Cdd:COG4717    442 ---------------EELEEELEELrEELAELEAELEQLEEDGELAELLQELEELKAELRELAEEWAALKLALELLEEAR 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  977 NQALNIAWQKVNKDFGSIFSTLLPGANAMLAPPEgqtvldGLEFKVALGNTWKENLTELSGGQRSLVALSL---ILSMLL 1053
Cdd:COG4717    507 EEYREERLPPVLERASEYFSRLTDGRYRLIRIDE------DLSLKVDTEDGRTRPVEELSRGTREQLYLALrlaLAELLA 580
                          490       500       510
                   ....*....|....*....|....*....|....*....
gi 1034663586 1054 FKPAPIyILDEVDAALDLSHTQNIGQMLRTHFTHSQFIV 1092
Cdd:COG4717    581 GEPLPL-ILDDAFVNFDDERLRAALELLAELAKGRQVIY 618
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1-449 1.85e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 55.43  E-value: 1.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQrTEVnGFDPLFNAITGLNGSGKSNILDSICFLLGIS-----NLSQVVIGGRN------------ 63
Cdd:PRK02224     1 MRFDRVRLENFKCYAD-ADL-RLEDGVTVIHGVNGSGKSSLLEACFFALYGSkalddTLDDVITIGAEeaeielwfehag 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   64 -KYLI------NGVNANN---------------TRVQDLFCSVgLNVNNPHFL----IMQGRITKVLNMKPPEILSMIEE 117
Cdd:PRK02224    79 gEYHIerrvrlSGDRATTakcvletpegtidgaRDVREEVTEL-LRMDAEAFVncayVRQGEVNKLINATPSDRQDMIDD 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  118 AAGT-RMYEYKKIAA--------------------QKTIEKKEAK------------LKEIKTILE--EEITPTIQKLKE 162
Cdd:PRK02224   158 LLQLgKLEEYRERASdarlgvervlsdqrgsldqlKAQIEEKEEKdlherlngleseLAELDEEIEryEEQREQARETRD 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  163 ERSSYLE-YQKVMREIEHLSRLYIAYQFLLAEDTKVRS--AEELKEMQDKVIKLQEELSE-------NDKKIKALNHEIE 232
Cdd:PRK02224   238 EADEVLEeHEERREELETLEAEIEDLRETIAETEREREelAEEVRDLRERLEELEEERDDllaeaglDDADAEAVEARRE 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  233 ELEKRKDketgGILRSLEDALAEAQRVNTKSQS----AFDLKKKNLACEEsKRKELEKNMVEDSKTLAAKEKEVKKITDG 308
Cdd:PRK02224   318 ELEDRDE----ELRDRLEECRVAAQAHNEEAESlredADDLEERAEELRE-EAAELESELEEAREAVEDRREEIEELEEE 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  309 LHALQEASNKDAEALAAAQQHfnavSAGLSSNEDGA---EATLAGQMMACKNDISKAqtEAKQAQMKLKHAQQELKnkqa 385
Cdd:PRK02224   393 IEELRERFGDAPVDLGNAEDF----LEELREERDELrerEAELEATLRTARERVEEA--EALLEAGKCPECGQPVE---- 462
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  386 evkkmDSGyrkDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQLS---RDIGRLKETYEAL 449
Cdd:PRK02224   463 -----GSP---HVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVeaeDRIERLEERREDL 521
RecF COG1195
Recombinational DNA repair ATPase RecF [Replication, recombination and repair];
2-48 1.90e-07

Recombinational DNA repair ATPase RecF [Replication, recombination and repair];


Pssm-ID: 440808 [Multi-domain]  Cd Length: 352  Bit Score: 54.39  E-value: 1.90e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1034663586    2 HIKSIILEGFKSYAQrTEVNgFDPLFNAITGLNGSGKSNILDSICFL 48
Cdd:COG1195      1 RLKRLSLTNFRNYES-LELE-FSPGINVLVGPNGQGKTNLLEAIYLL 45
recF PRK00064
recombination protein F; Reviewed
1-48 2.63e-07

recombination protein F; Reviewed


Pssm-ID: 234608 [Multi-domain]  Cd Length: 361  Bit Score: 54.01  E-value: 2.63e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1034663586    1 MHIKSIILEGFKSYAqRTEVNgFDPLFNAITGLNGSGKSNILDSICFL 48
Cdd:PRK00064     1 MYLTRLSLTDFRNYE-ELDLE-LSPGVNVLVGENGQGKTNLLEAIYLL 46
PTZ00121 PTZ00121
MAEBL; Provisional
627-976 2.73e-07

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 55.15  E-value: 2.73e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  627 KFQELKDVQDELRIKENELRALEEelagLKNTAEKYRQlKQQWEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESE 706
Cdd:PTZ00121  1399 KAEEDKKKADELKKAAAAKKKADE----AKKKAEEKKK-ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  707 ETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADA-SSKKMKEKQQEVEAITLE---------- 775
Cdd:PTZ00121  1474 EAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAdEAKKAEEAKKADEAKKAEekkkadelkk 1553
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  776 ------------LEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNK----ESVNKAQEEVTKQKEVITAQDTVI 839
Cdd:PTZ00121  1554 aeelkkaeekkkAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKkmkaEEAKKAEEAKIKAEELKKAEEEKK 1633
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  840 KAKYAEVAKHKEQNNDSQLKIKELDHNISKH--KREAEDGAAKVSKMLKDYDwinAERHlfgqpnsaydfKTNNPKEAGQ 917
Cdd:PTZ00121  1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAeeAKKAEEDKKKAEEAKKAEE---DEKK-----------AAEALKKEAE 1699
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586  918 RLQKLQEMKEKLGRNVNmRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKK 976
Cdd:PTZ00121  1700 EAKKAEELKKKEAEEKK-KAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
614-981 2.95e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 55.12  E-value: 2.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  614 SGGARSQAASILTkfqELKDVQDELRIKEN----ELRALEEELAGLKNtaeKYRQLKQQWEMKTEE---------ADLLQ 680
Cdd:pfam15921  287 ASSARSQANSIQS---QLEIIQEQARNQNSmymrQLSDLESTVSQLRS---ELREAKRMYEDKIEElekqlvlanSELTE 360
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  681 TKLQQSSYHKQQEELD-ALKKTIEESEetlKNTKEIQRKAEEKYEVLENKMKNAeaererelkdaqkkldcakTKADASS 759
Cdd:pfam15921  361 ARTERDQFSQESGNLDdQLQKLLADLH---KREKELSLEKEQNKRLWDRDTGNS-------------------ITIDHLR 418
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  760 KKMKEKQQEVEAITLELEELKRE-HTSYKQQLEAVNEAIKSYEsQIEVMAAEVAKNKESVNKAQEEVTKQKEVITA---- 834
Cdd:pfam15921  419 RELDDRNMEVQRLEALLKAMKSEcQGQMERQMAAIQGKNESLE-KVSSLTAQLESTKEMLRKVVEELTAKKMTLESsert 497
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  835 ----------QDTVIKAKYAEVAKHKEQNNdsqLKIKELDH--NISKHKR--EAEDGAAKVSKMLKD------YDWINAE 894
Cdd:pfam15921  498 vsdltaslqeKERAIEATNAEITKLRSRVD---LKLQELQHlkNEGDHLRnvQTECEALKLQMAEKDkvieilRQQIENM 574
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  895 RHLFGQPNSAYDF----KTNNPKEAGQRLQKLQEMK-EKLGRNVNMRamnvltEAEERYNDLMKKKRIVENDKSKILTTI 969
Cdd:pfam15921  575 TQLVGQHGRTAGAmqveKAQLEKEINDRRLELQEFKiLKDKKDAKIR------ELEARVSDLELEKVKLVNAGSERLRAV 648
                          410
                   ....*....|..
gi 1034663586  970 EDLDQKKNQALN 981
Cdd:pfam15921  649 KDIKQERDQLLN 660
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
631-876 3.23e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 54.66  E-value: 3.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  631 LKDVQDELRIKENELR-ALEEE---LAGLKNTAEKYR----QLKQQWEMKTEEADLLQTKLQ--QSSYHKQQEELDALKK 700
Cdd:PRK02224   312 VEARREELEDRDEELRdRLEECrvaAQAHNEEAESLRedadDLEERAEELREEAAELESELEeaREAVEDRREEIEELEE 391
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  701 TIEESEETLKNTKEIQRKAEEKYEVLE---NKMKNAEAERERELKDAQKKLDCAKTKADASskKMKEKQQEVEaitlele 777
Cdd:PRK02224   392 EIEELRERFGDAPVDLGNAEDFLEELReerDELREREAELEATLRTARERVEEAEALLEAG--KCPECGQPVE------- 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  778 elKREHTsykqqleavnEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTV----IKAKYAE--VAKHKE 851
Cdd:PRK02224   463 --GSPHV----------ETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIerleERREDLEelIAERRE 530
                          250       260
                   ....*....|....*....|....*
gi 1034663586  852 QNNDSQLKIKELDHNISKHKREAED 876
Cdd:PRK02224   531 TIEEKRERAEELRERAAELEAEAEE 555
COG3950 COG3950
Predicted ATP-binding protein involved in virulence [General function prediction only];
936-1094 3.40e-07

Predicted ATP-binding protein involved in virulence [General function prediction only];


Pssm-ID: 443150 [Multi-domain]  Cd Length: 276  Bit Score: 53.08  E-value: 3.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  936 RAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIAWQKVNKdfgsIFSTLLPGANAMLAPPEGQTVL 1015
Cdd:COG3950     97 RLKEEYFSRLDGYDSLLDEDSNLREFLEWLREYLEDLENKLSDELDEKLEAVRE----ALNKLLPDFKDIRIDRDPGRLV 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1016 ----DGLEFkvalgntwkeNLTELSGGQRSLVAL--SLILSMLLFKPAP--------IYILDEVDAALDLSHTQNIGQML 1081
Cdd:COG3950    173 ildkNGEEL----------PLNQLSDGERSLLALvgDLARRLAELNPALenplegegIVLIDEIDLHLHPKWQRRILPDL 242
                          170
                   ....*....|...
gi 1034663586 1082 RTHFTHSQFIVVS 1094
Cdd:COG3950    243 RKIFPNIQFIVTT 255
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
628-979 3.44e-07

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 54.31  E-value: 3.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  628 FQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEmkteeadllqtklqqsSYHKQQEELDALKKTIEESEE 707
Cdd:pfam05622   96 VLELQHRNEELTSLAEEAQALKDEMDILRESSDKVKKLEATVE----------------TYKKKLEDLGDLRRQVKLLEE 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  708 tlKNTKEIQRKAEekyevLENKMKNAEAER------ERELKDAQKKLDCAKTKADA---SSKKMKEK----QQEVEAITL 774
Cdd:pfam05622  160 --RNAEYMQRTLQ-----LEEELKKANALRgqletyKRQVQELHGKLSEESKKADKlefEYKKLEEKlealQKEKERLII 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  775 E-------LEELKREHTSYKQQLEAVNEAIKSYESqIEVMAAEV--AKNKESVNKAQEEvtkQKEVITAQDTVIKAKYAE 845
Cdd:pfam05622  233 ErdtlretNEELRCAQLQQAELSQADALLSPSSDP-GDNLAAEImpAEIREKLIRLQHE---NKMLRLGQEGSYRERLTE 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  846 VAKH-----------KEQNNDSQLKIKELDHNISKHKRE-------AEDGAAKVSKMLKDYDWINaERHLFGQPNSAYdF 907
Cdd:pfam05622  309 LQQLledanrrknelETQNRLANQRILELQQQVEELQKAlqeqgskAEDSSLLKQKLEEHLEKLH-EAQSELQKKKEQ-I 386
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  908 KTNNPKEAGQRLQKLQEMKEKL-GRNVNMRAMnvlteaEERYndlmkkKRIVENDKSKILTtiedLDQKKNQA 979
Cdd:pfam05622  387 EELEPKQDSNLAQKIDELQEALrKKDEDMKAM------EERY------KKYVEKAKSVIKT----LDPKQNPA 443
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
630-847 3.68e-07

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 53.68  E-value: 3.68e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  630 ELKDVQDELRIKENELRALEEELAGLKNTAEkyrQLKQQWEMKTEEADLLQTKLQQSsyhkqQEELDALKKTIEESEETL 709
Cdd:COG3883     17 QIQAKQKELSELQAELEAAQAELDALQAELE---ELNEEYNELQAELEALQAEIDKL-----QAEIAEAEAEIEERREEL 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  710 KNTKEIQRKAEEKYEVLE------------------NKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEA 771
Cdd:COG3883     89 GERARALYRSGGSVSYLDvllgsesfsdfldrlsalSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEA 168
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  772 ITLELEELKREHTSYKQQLEAvneAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVA 847
Cdd:COG3883    169 AKAELEAQQAEQEALLAQLSA---EEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 241
YbjD COG3593
Predicted ATP-dependent endonuclease of the OLD family, contains P-loop ATPase and TOPRIM ...
1-57 3.75e-07

Predicted ATP-dependent endonuclease of the OLD family, contains P-loop ATPase and TOPRIM domains [Replication, recombination and repair];


Pssm-ID: 442812 [Multi-domain]  Cd Length: 359  Bit Score: 53.47  E-value: 3.75e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586    1 MHIKSIILEGFKSYaQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGISNLSQV 57
Cdd:COG3593      1 MKLEKIKIKNFRSI-KDLSIE-LSDDLTVLVGENNSGKSSILEALRLLLGPSSSRKF 55
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
625-894 4.60e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.26  E-value: 4.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  625 LTKFQELKDVQDELRIKENELRALEEELAGLKNtaeKYRQLKQQWEMKTEEADLLQTKLQ--QSSYHKQQEELDALKKTI 702
Cdd:TIGR04523  359 SEKQRELEEKQNEIEKLKKENQSYKQEIKNLES---QINDLESKIQNQEKLNQQKDEQIKklQQEKELLEKEIERLKETI 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  703 EESEETLKNTKEIQRKAEEKYEVLENKMKnaeaERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKRE 782
Cdd:TIGR04523  436 IKNNSEIKDLTNQDSVKELIIKNLDNTRE----SLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEK 511
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  783 HTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQ--KEVITAQDTVIKAKYAEVAKHKEQNNDSQLKI 860
Cdd:TIGR04523  512 VKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKEnlEKEIDEKNKEIEELKQTQKSLKKKQEEKQELI 591
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1034663586  861 KELDHNISKHKREAEDGAAKVSKMLKDYDWINAE 894
Cdd:TIGR04523  592 DQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKE 625
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
617-842 6.63e-07

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 53.69  E-value: 6.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKFQE-LKDVQDELRIKENELRALEEELAGLKNTAekyrqlKQQWEMKTEEADLLQTKLQQ------SSYH 689
Cdd:pfam12128  270 DETLIASRQEERQEtSAELNQLLRTLDDQWKEKRDELNGELSAA------DAAVAKDRSELEALEDQHGAfldadiETAA 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  690 KQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKnaeAERERELKDAQKKLDcaktkadaSSKKMKEKQQEV 769
Cdd:pfam12128  344 ADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIK---EQNNRDIAGIKDKLA--------KIREARDRQLAV 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  770 EAITLE-LEELKREHtsYKQQLEAVNEAIKSYESQIE--------VMAAE-----VAKNKESVNKAQEEVTK-QKEVITA 834
Cdd:pfam12128  413 AEDDLQaLESELREQ--LEAGKLEFNEEEYRLKSRLGelklrlnqATATPelllqLENFDERIERAREEQEAaNAEVERL 490

                   ....*...
gi 1034663586  835 QDTVIKAK 842
Cdd:pfam12128  491 QSELRQAR 498
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
642-820 7.31e-07

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 53.49  E-value: 7.31e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  642 ENELRALEEELAgLKNTAEKYRQLKQQWEMKTeeadlLQTKLQ--QSSYHKQQEELDALKKTIEESEEtlkNTKEIQRKA 719
Cdd:pfam05667  309 TNEAPAATSSPP-TKVETEEELQQQREEELEE-----LQEQLEdlESSIQELEKEIKKLESSIKQVEE---ELEELKEQN 379
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  720 EEKYEVLENKMKNAEaererELKDAQKKLDCAKTKADASSKKMKEKQQEVEA----ITLELEELKREHTSYKQQLEAVNE 795
Cdd:pfam05667  380 EELEKQYKVKKKTLD-----LLPDAEENIAKLQALVDASAQRLVELAGQWEKhrvpLIEEYRALKEAKSNKEDESQRKLE 454
                          170       180
                   ....*....|....*....|....*
gi 1034663586  796 AIKSYESQIEVMAAEvAKNKESVNK 820
Cdd:pfam05667  455 EIKELREKIKEVAEE-AKQKEELYK 478
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
659-855 9.00e-07

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 52.50  E-value: 9.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  659 AEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESEETLKnTKEIQRKAEEKYEVLENKMKNAEAERE 738
Cdd:PRK09510    61 VEQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLA-AQEQKKQAEEAAKQAALKQKQAEEAAA 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  739 RELKDAQKKLDCAKTKADASSKKMKE---KQQEVEAITLELEELKREHTSYKQQlEAVNEAIKSYESQIEVMAAEVAKNK 815
Cdd:PRK09510   140 KAAAAAKAKAEAEAKRAAAAAKKAAAeakKKAEAEAAKKAAAEAKKKAEAEAAA-KAAAEAKKKAEAEAKKKAAAEAKKK 218
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 1034663586  816 ESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNND 855
Cdd:PRK09510   219 AAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
ABC_Rad50 cd03240
ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ...
3-49 9.66e-07

ATP-binding cassette domain of Rad50; The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence.


Pssm-ID: 213207 [Multi-domain]  Cd Length: 204  Bit Score: 50.68  E-value: 9.66e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1034663586    3 IKSIILEGFKSYAQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLL 49
Cdd:cd03240      1 IDKLSIRNIRSFHERSEIE-FFSPLTLIVGQNGAGKTTIIEALKYAL 46
ABC_SMC5_euk cd03277
ATP-binding cassette domain of eukaryotic SMC5 proteins; The structural maintenance of ...
3-112 9.82e-07

ATP-binding cassette domain of eukaryotic SMC5 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213244 [Multi-domain]  Cd Length: 213  Bit Score: 51.06  E-value: 9.82e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSIILEGFKSYAQrTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG--ISNLsqvvigGRNK----YLINGVNANNTR 76
Cdd:cd03277      3 IVRIKLENFVTYDE-TEFR-PGPSLNMIIGPNGSGKSSIVCAICLGLGgkPKLL------GRAKkvgeFVKRGCDEGTIE 74
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1034663586   77 VQdLFCSVG-LNVNNP-HFLiMQGRITKVLNMKPPEIL 112
Cdd:cd03277     75 IE-LYGNPGnIQVDNLcQFL-PQDRVGEFAKLSPIELL 110
COG4637 COG4637
Predicted ATPase [General function prediction only];
3-48 1.02e-06

Predicted ATPase [General function prediction only];


Pssm-ID: 443675 [Multi-domain]  Cd Length: 371  Bit Score: 52.24  E-value: 1.02e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586    3 IKSIILEGFKSYaQRTEVNgFDPLfNAITGLNGSGKSNILDSICFL 48
Cdd:COG4637      2 ITRIRIKNFKSL-RDLELP-LGPL-TVLIGANGSGKSNLLDALRFL 44
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
629-771 1.16e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 51.08  E-value: 1.16e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKntaEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQ----QEELDALKKTIEE 704
Cdd:COG1579     31 AELAELEDELAALEARLEAAKTELEDLE---KEIKRLELEIEEVEARIKKYEEQLGNVRNNKEyealQKEIESLKRRISD 107
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  705 SEETLKNTKEIQRKAEEKYEVLENKMknaeAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEA 771
Cdd:COG1579    108 LEDEILELMERIEELEEELAELEAEL----AELEAELEEKKAELDEELAELEAELEELEAEREELAA 170
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
613-861 1.48e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 51.69  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  613 LSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLkntAEKYRQLKQQWEMKTEEADLLQTKLQQssyhkQQ 692
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKAL---LKQLAALERRIAALARRIRALEQELAA-----LE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  693 EELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSK---KMKEKQQEV 769
Cdd:COG4942     83 AELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREqaeELRADLAEL 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  770 EAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKH 849
Cdd:COG4942    163 AALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
                          250
                   ....*....|..
gi 1034663586  850 KEQNNDSQLKIK 861
Cdd:COG4942    243 TPAAGFAALKGK 254
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
684-882 2.31e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 51.30  E-value: 2.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  684 QQSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAER---ERELKDAQKKLDCAKTKADASSK 760
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIralEQELAALEAELAELEKEIAELRA 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  761 KMKEKQQEVEAITLELEELKREHT-------------------------SYKQQLEAVNEAIKSYESQIEVMAAEVAKNK 815
Cdd:COG4942     98 ELEAQKEELAELLRALYRLGRQPPlalllspedfldavrrlqylkylapARREQAEELRADLAELAALRAELEAERAELE 177
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  816 ESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVS 882
Cdd:COG4942    178 ALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTP 244
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
372-884 3.05e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 51.60  E-value: 3.05e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  372 KLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQLSRDIGRLKETYEALLA 451
Cdd:PRK03918   194 LIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKK 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  452 RFPNLRFAYKD-PEKNWNRNCVKGLVASLISVKDTSATTALELVAGERLYNVVV--------DTEVTGKKLLERGELKRR 522
Cdd:PRK03918   274 EIEELEEKVKElKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEerikeleeKEERLEELKKKLKELEKR 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  523 YTIIP-----LNKISARCIAPETLRVAQNLVGPDNVHVALSLVE-YKPELQKAMEFVFGTTFVCDNMDNAKKVAFD--KR 594
Cdd:PRK03918   354 LEELEerhelYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEkAKEEIEEEISKITARIGELKKEIKELKKAIEelKK 433
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  595 IMTRTVTLGGDVFDPH-GTLSGGARSQAASILTKFQELKDVQDELRIKENELRAL---EEELAGLKNTAEKYRQLK---- 666
Cdd:PRK03918   434 AKGKCPVCGRELTEEHrKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVlkkESELIKLKELAEQLKELEeklk 513
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  667 ----QQWEMKTEEADLLQTKL-----QQSSYHKQQEELDALKKTIEESEETLKNT--------KEIQRKAEEKYEVLENK 729
Cdd:PRK03918   514 kynlEELEKKAEEYEKLKEKLiklkgEIKSLKKELEKLEELKKKLAELEKKLDELeeelaellKELEELGFESVEELEER 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  730 MKNAE---------AERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTsyKQQLEAVNEAIKSY 800
Cdd:PRK03918   594 LKELEpfyneylelKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYS--EEEYEELREEYLEL 671
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  801 ESQIEVMAAEVAKNKESVNKAQEEVTKQKEVitaqdtviKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGA-A 879
Cdd:PRK03918   672 SRELAGLRAELEELEKRREEIKKTLEKLKEE--------LEEREKAKKELEKLEKALERVEELREKVKKYKALLKERAlS 743

                   ....*
gi 1034663586  880 KVSKM 884
Cdd:PRK03918   744 KVGEI 748
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
615-842 4.78e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 50.81  E-value: 4.78e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  615 GGARSQAASILTKFQE-LKDVQDELRIKENE-----LRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTklqqsSY 688
Cdd:PRK02224   172 SDARLGVERVLSDQRGsLDQLKAQIEEKEEKdlherLNGLESELAELDEEIERYEEQREQARETRDEADEVLE-----EH 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  689 HKQQEELDALKKTIEESEETLkntkeiqRKAEEKYEVLENKMKNAEAERErELKDAQKKL--DCAKTKADAsskkmkekq 766
Cdd:PRK02224   247 EERREELETLEAEIEDLRETI-------AETEREREELAEEVRDLRERLE-ELEEERDDLlaEAGLDDADA--------- 309
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  767 qevEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEV-TKQKEVITAQDTVIKAK 842
Cdd:PRK02224   310 ---EAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAaELESELEEAREAVEDRR 383
Borrelia_P83 pfam05262
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
624-803 7.07e-06

Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.


Pssm-ID: 114011 [Multi-domain]  Cd Length: 489  Bit Score: 50.00  E-value: 7.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  624 ILTKFQELKDVQDELRIKENELRALEEELAglkntAEKYRQLKQQWEMKTEEADLLQTKLQQSsyhkqQEELDALKKTIE 703
Cdd:pfam05262  182 VVEALREDNEKGVNFRRDMTDLKERESQED-----AKRAQQLKEELDKKQIDADKAQQKADFA-----QDNADKQRDEVR 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  704 ESEETLKNTKEIQR--KAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKR 781
Cdd:pfam05262  252 QKQQEAKNLPKPADtsSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKRE 331
                          170       180
                   ....*....|....*....|..
gi 1034663586  782 EHTSYKQQLEAVNEAIKSYESQ 803
Cdd:pfam05262  332 PVAEDLQKTKPQVEAQPTSLNE 353
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
620-884 7.24e-06

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 50.12  E-value: 7.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  620 QAASILTKFQeLKDVQDELRIKENELRaleEELAGLKNtaeKYRQLKQQWEMKTEEADLLQTKLQQssYHKQQEELD-AL 698
Cdd:pfam05557    1 RAELIESKAR-LSQLQNEKKQMELEHK---RARIELEK---KASALKRQLDRESDRNQELQKRIRL--LEKREAEAEeAL 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  699 KKTIEESEETLKNTKEIQRKAEEKYEVLEnkmknaeaererelkDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEE 778
Cdd:pfam05557   72 REQAELNRLKKKYLEALNKKLNEKESQLA---------------DAREVISCLKNELSELRRQIQRAELELQSTNSELEE 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  779 LKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVikAKYAEVAKHKEQNNDSQL 858
Cdd:pfam05557  137 LQERLDLLKAKASEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSEL--ARIPELEKELERLREHNK 214
                          250       260
                   ....*....|....*....|....*.
gi 1034663586  859 KIKELDHNISKHKREAEDGAAKVSKM 884
Cdd:pfam05557  215 HLNENIENKLLLKEEVEDLKRKLERE 240
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
628-830 8.52e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 49.14  E-value: 8.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  628 FQELKDVQDELRIKENELRALEEELAGLKNT----AEKYRQLKQQWEMKTEEADLLQTKL-----QQSSYHKQQEELDAL 698
Cdd:COG1340     35 NEELKELAEKRDELNAQVKELREEAQELREKrdelNEKVKELKEERDELNEKLNELREELdelrkELAELNKAGGSIDKL 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  699 KKTIEESEETLKN---TKEIQRKAEEKYEVLENKMKNAEAERE--RELKDAQKKLDCAKTKADASSKKMKEKQQEVEAIT 773
Cdd:COG1340    115 RKEIERLEWRQQTevlSPEEEKELVEKIKELEKELEKAKKALEknEKLKELRAELKELRKEAEEIHKKIKELAEEAQELH 194
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  774 LELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKE 830
Cdd:COG1340    195 EEMIELYKEADELRKEADELHKEIVEAQEKADELHEEIIELQKELRELRKELKKLRK 251
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
629-830 8.81e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 50.34  E-value: 8.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWemktEEADLLQTKLQQssyhkqqeELDALKKTI------ 702
Cdd:COG3096    903 DAAQEAQAFIQQHGKALAQLEPLVAVLQSDPEQFEQLQADY----LQAKEQQRRLKQ--------QIFALSEVVqrrphf 970
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  703 --EESEETLKNTKEIQrkaeekyEVLENKMKNAEAERER---ELKDAQKKLDCA-------KTKADASSKKMKEKQQEVE 770
Cdd:COG3096    971 syEDAVGLLGENSDLN-------EKLRARLEQAEEARREareQLRQAQAQYSQYnqvlaslKSSRDAKQQTLQELEQELE 1043
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034663586  771 AITL----ELEELKREHTS-YKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKE 830
Cdd:COG3096   1044 ELGVqadaEAEERARIRRDeLHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLRKAERDYKQERE 1108
ABC_cobalt_CbiO_domain1 cd03225
First domain of the ATP-binding cassette component of cobalt transport system; Domain I of the ...
1003-1083 1.05e-05

First domain of the ATP-binding cassette component of cobalt transport system; Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.


Pssm-ID: 213192 [Multi-domain]  Cd Length: 211  Bit Score: 47.85  E-value: 1.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1003 NAMLAPPEGQTVLDGLEFKVALGNTWKENLTELSGGQRSLVALSLILSMllfKPaPIYILDEVDAALDLSHTQNIGQMLR 1082
Cdd:cd03225    103 NLGLPEEEIEERVEEALELVGLEGLRDRSPFTLSGGQKQRVAIAGVLAM---DP-DILLLDEPTAGLDPAGRRELLELLK 178

                   .
gi 1034663586 1083 T 1083
Cdd:cd03225    179 K 179
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
372-829 1.18e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 49.38  E-value: 1.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  372 KLKHAQQELKNKQAEVKKmdsgYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEE------SLLEKRRQLSRDIGRLKET 445
Cdd:COG4717     72 ELKELEEELKEAEEKEEE----YAELQEELEELEEELEELEAELEELREELEKLEkllqllPLYQELEALEAELAELPER 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  446 YEALLARFPNLRFaykdpeknwnrncvkgLVASLISVKDTSATTALELVAGERLYNVVVDTEVTgKKLLERGELKRRYTI 525
Cdd:COG4717    148 LEELEERLEELRE----------------LEEELEELEAELAELQEELEELLEQLSLATEEELQ-DLAEELEELQQRLAE 210
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  526 IpLNKISARCIAPETLRvaQNLVGPDNVHVALSLVEykpELQKAMEFVFGTTFVCDnmdnakKVAFDKRIMTRTVTLGGD 605
Cdd:COG4717    211 L-EEELEEAQEELEELE--EELEQLENELEAAALEE---RLKEARLLLLIAAALLA------LLGLGGSLLSLILTIAGV 278
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  606 VFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENELRALEEELAGL--------KNTAEKYRQLKQQWEMKTEEAD 677
Cdd:COG4717    279 LFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALglppdlspEELLELLDRIEELQELLREAEE 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  678 lLQTKLQQSSYHKQQEELdaLKKTIEESEETLK---NTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQK-----KLD 749
Cdd:COG4717    359 -LEEELQLEELEQEIAAL--LAEAGVEDEEELRaalEQAEEYQELKEELEELEEQLEELLGELEELLEALDEeeleeELE 435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  750 CAKTKADASSKKMKEKQQEVEAITLELEELKREHTsykqqLEAVNEAIKSYESQIEVMAAEVAKNK---ESVNKAQEEVT 826
Cdd:COG4717    436 ELEEELEELEEELEELREELAELEAELEQLEEDGE-----LAELLQELEELKAELRELAEEWAALKlalELLEEAREEYR 510

                   ...
gi 1034663586  827 KQK 829
Cdd:COG4717    511 EER 513
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1-457 1.23e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 49.38  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYAQRTEVngFDPLFNAITGLNGSGKSNILDSICFLLgisnLSQVVIGGRNKYLINGVNA--NNTRVQ 78
Cdd:COG4717      1 MKIKELEIYGFGKFRDRTIE--FSPGLNVIYGPNEAGKSTLLAFIRAML----LERLEKEADELFKPQGRKPelNLKELK 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   79 DLfcsvglnvnnphflimQGRItKVLNMKPPEILSMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTI---------L 149
Cdd:COG4717     75 EL----------------EEEL-KEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLlplyqeleaL 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  150 EEEITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAE---DTKVRSAEELKEMQDKVIKLQEELSENDKKIKA 226
Cdd:COG4717    138 EAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEElleQLSLATEEELQDLAEELEELQQRLAELEEELEE 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  227 LNHEIEELEKRKDK-ETGGILRSLEDALAEAQRV------------NTKSQSAFDLKKKNLAC-----------EESKRK 282
Cdd:COG4717    218 AQEELEELEEELEQlENELEAAALEERLKEARLLlliaaallallgLGGSLLSLILTIAGVLFlvlgllallflLLAREK 297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  283 ELEKNMVEDSKTLAAKE----KEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEdgAEATLAGQMMACKND 358
Cdd:COG4717    298 ASLGKEAEELQALPALEeleeEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELE--EELQLEELEQEIAAL 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  359 ISKAQTEAKQA-QMKLKHAQ--QELKNKQAEVKKMDSGYRKDQEALEAV---KRLKEKLEAEMKKLNYEENKEESLLEKR 432
Cdd:COG4717    376 LAEAGVEDEEElRAALEQAEeyQELKEELEELEEQLEELLGELEELLEAldeEELEEELEELEEELEELEEELEELREEL 455
                          490       500
                   ....*....|....*....|....*..
gi 1034663586  433 RQLSRDIGRLKE--TYEALLARFPNLR 457
Cdd:COG4717    456 AELEAELEQLEEdgELAELLQELEELK 482
Macoilin pfam09726
Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 ...
629-814 1.28e-05

Macoilin family; The Macoilin proteins has an N-terminal portion that is composed of 5 trasnmembrane helices, followed by a C-terminal coiled-coil region. Macoilin is a highly conserved protein present in eukaryotes. Macoilin appears to be found in the ER and be involved in the function of neurons.


Pssm-ID: 462859 [Multi-domain]  Cd Length: 670  Bit Score: 49.47  E-value: 1.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDEL---RIKENELRALEEELAGLKntaekyRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEeldalKKTIEES 705
Cdd:pfam09726  402 QDIKKLKAELqasRQTEQELRSQISSLTSLE------RSLKSELGQLRQENDLLQTKLHNAVSAKQKD-----KQTVQQL 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  706 EETLKNTKEiQRKAEEKYEVLENKMKnaeaeRERELKDAQKKLDCAKTKAD-ASSKKMKEKQQEVEAITLEL-----EEL 779
Cdd:pfam09726  471 EKRLKAEQE-ARASAEKQLAEEKKRK-----KEEEATAARAVALAAASRGEcTESLKQRKRELESEIKKLTHdiklkEEQ 544
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1034663586  780 KREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKN 814
Cdd:pfam09726  545 IRELEIKVQELRKYKESEKDTEVLMSALSAMQDKN 579
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
630-805 1.49e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.61  E-value: 1.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  630 ELKDVQDELRIKENELRALEEELAGLKNTAEKYRQlkqqwEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESEETL 709
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEA-----RLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNV 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  710 KNTKEiqrkaeekYEVLENKMKNAEAER---ERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREhtsY 786
Cdd:COG1579     86 RNNKE--------YEALQKEIESLKRRIsdlEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAE---L 154
                          170
                   ....*....|....*....
gi 1034663586  787 KQQLEAVNEAIKSYESQIE 805
Cdd:COG1579    155 EAELEELEAEREELAAKIP 173
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
685-978 1.68e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 49.29  E-value: 1.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  685 QSSYHKQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEaERERELKDAQKKLDcaktkadasskKMKE 764
Cdd:PRK03918   161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREIN-EISSELPELREELE-----------KLEK 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  765 KQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVnkaqEEVTKQKEVitaqdtviKAKYA 844
Cdd:PRK03918   229 EVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV----KELKELKEK--------AEEYI 296
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  845 EVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERhlfgqpnSAYDFKTNNPKEAGQRLQKLQE 924
Cdd:PRK03918   297 KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL-------KELEKRLEELEERHELYEEAKA 369
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034663586  925 MKEKLGRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQ 978
Cdd:PRK03918   370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKE 423
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
125-976 1.82e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 49.27  E-value: 1.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  125 EYKKIAAQKTIEKKEAKLKEIKTILE--EEITPTIQKLKEE----RSSYLEYQKVMREIEH-LSRLYIAYQFLLAEDTKV 197
Cdd:TIGR00606  230 EAQLESSREIVKSYENELDPLKNRLKeiEHNLSKIMKLDNEikalKSRKKQMEKDNSELELkMEKVFQGTDEQLNDLYHN 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  198 RSAEeLKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALA-EAQRVNTKSQSAFDLKKKNLAC 276
Cdd:TIGR00606  310 HQRT-VREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRArDSLIQSLATRLELDGFERGPFS 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  277 E--------------ESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNED 342
Cdd:TIGR00606  389 ErqiknfhtlvierqEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEG 468
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  343 GAEatlagqmmacknDISKAQTEAKQAQMKLKHAQqelKNKQAEVKKMDSGYRKDQEAleAVKRLKEKLEAEMKKLNYEE 422
Cdd:TIGR00606  469 SSD------------RILELDQELRKAERELSKAE---KNSLTETLKKEVKSLQNEKA--DLDRKLRKLDQEMEQLNHHT 531
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  423 NKEESLL-------EKRRQLSRDIGRLKETYEALLARFPNLR-----FAYKDPEKNWNRNCVKGLVASLISV-------- 482
Cdd:TIGR00606  532 TTRTQMEmltkdkmDKDEQIRKIKSRHSDELTSLLGYFPNKKqledwLHSKSKEINQTRDRLAKLNKELASLeqnknhin 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  483 KDTSATTALELVAGERLYNVV--VDTEVTGKKLLERGElKRRYTIIPLNKISARCIAPETLRVAQNlvgPDNVHVALSLV 560
Cdd:TIGR00606  612 NELESKEEQLSSYEDKLFDVCgsQDEESDLERLKEEIE-KSSKQRAMLAGATAVYSQFITQLTDEN---QSCCPVCQRVF 687
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  561 EYKPELQKamefvfgttfVCDNMDNAKKVAFDKriMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQdelri 640
Cdd:TIGR00606  688 QTEAELQE----------FISDLQSKLRLAPDK--LKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELR----- 750
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  641 keNELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLqqSSYHKQQEELDALKKTIEE------SEETLKNTKE 714
Cdd:TIGR00606  751 --NKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDV--TIMERFQMELKDVERKIAQqaaklqGSDLDRTVQQ 826
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  715 IQRKAEEKYEVLENKMKNAE------AERERELKDAQKKLDCAKTKADASSKKMKEKQQ---EVEAITLELEELKREHTS 785
Cdd:TIGR00606  827 VNQEKQEKQHELDTVVSKIElnrkliQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQfeeQLVELSTEVQSLIREIKD 906
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  786 YKQQLEAVNEAIKSYESQIEVMaaeVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKE-QNNDSQLKIKELD 864
Cdd:TIGR00606  907 AKEQDSPLETFLEKDQQEKEEL---ISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDdYLKQKETELNTVN 983
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  865 HNISKHKREAEDGAAKVSKMLKDYDwINAERHLFGQPNSAYDFKTNnpkeagqrlqKLQEMKEKLGRNVNMRAMNVLTEA 944
Cdd:TIGR00606  984 AQLEECEKHQEKINEDMRLMRQDID-TQKIQERWLQDNLTLRKREN----------ELKEVEEELKQHLKEMGQMQVLQM 1052
                          890       900       910
                   ....*....|....*....|....*....|..
gi 1034663586  945 EERYNDLMKKKRIVENDKSKILTTIEDLDQKK 976
Cdd:TIGR00606 1053 KQEHQKLEENIDLIKRNHVLALGRQKGYEKEI 1084
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
726-889 1.98e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 1.98e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  726 LENKMKNAEAERE---RELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAV--NEAIKSY 800
Cdd:COG1579     15 LDSELDRLEHRLKelpAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrnNKEYEAL 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  801 ESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDsqlKIKELDHNISKHKREAEDGAAK 880
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDE---ELAELEAELEELEAEREELAAK 171
                          170
                   ....*....|
gi 1034663586  881 VSK-MLKDYD 889
Cdd:COG1579    172 IPPeLLALYE 181
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
692-828 2.08e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 2.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  692 QEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAE-----RERELKDAQKKLDCAKTKADASSKKMKEKQ 766
Cdd:COG1579     37 EDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgnvrNNKEYEALQKEIESLKRRISDLEDEILELM 116
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034663586  767 QEveaitleLEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQ 828
Cdd:COG1579    117 ER-------IEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAK 171
PRK11281 PRK11281
mechanosensitive channel MscK;
670-866 2.40e-05

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 48.75  E-value: 2.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  670 EMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEES-EETLKNTKEIQR-KAEEKYEVLE--NKMKNAE-----AERERE 740
Cdd:PRK11281    57 EDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQApAKLRQAQAELEAlKDDNDEETREtlSTLSLRQlesrlAQTLDQ 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  741 LKDAQKKLDCA-------KTKADASSKKMKEKQQEVEAITLELEELKRE---------------------HTSYKQQLEA 792
Cdd:PRK11281   137 LQNAQNDLAEYnsqlvslQTQPERAQAALYANSQRLQQIRNLLKGGKVGgkalrpsqrvllqaeqallnaQNDLQRKSLE 216
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034663586  793 VNEAIKS-YESQIEVMAAEVAKNKESVNKAQEEVTkQKEVITAQDTVIKAKYAEVAKHKEQNNdsqLKIKELDHN 866
Cdd:PRK11281   217 GNTQLQDlLQKQRDYLTARIQRLEHQLQLLQEAIN-SKRLTLSEKTVQEAQSQDEAARIQANP---LVAQELEIN 287
ABC_Iron-Siderophores_B12_Hemin cd03214
ATP-binding component of iron-siderophores, vitamin B12 and hemin transporters and related ...
1031-1093 2.52e-05

ATP-binding component of iron-siderophores, vitamin B12 and hemin transporters and related proteins; ABC transporters, involved in the uptake of siderophores, heme, and vitamin B12, are widely conserved in bacteria and archaea. Only very few species lack representatives of the siderophore family transporters. The E. coli BtuCD protein is an ABC transporter mediating vitamin B12 uptake. The two ATP-binding cassettes (BtuD) are in close contact with each other, as are the two membrane-spanning subunits (BtuC); this arrangement is distinct from that observed for the E. coli lipid flippase MsbA. The BtuC subunits provide 20 transmembrane helices grouped around a translocation pathway that is closed to the cytoplasm by a gate region, whereas the dimer arrangement of the BtuD subunits resembles the ATP-bound form of the Rad50 DNA repair enzyme. A prominent cytoplasmic loop of BtuC forms the contact region with the ATP-binding cassette and represent a conserved motif among the ABC transporters.


Pssm-ID: 213181 [Multi-domain]  Cd Length: 180  Bit Score: 46.27  E-value: 2.52e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586 1031 NLTELSGGQRSLValslILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVV 1093
Cdd:cd03214     94 PFNELSGGERQRV----LLARALAQEPPILLLDEPTSHLDIAHQIELLELLRRLARERGKTVV 152
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
97-439 2.54e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 48.58  E-value: 2.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   97 QGRITKVLNMKPPEILSMIEEAAGTRMyEYKKIAAQKTIEKKEAKLKEIKTILE-EEITPTIQKLKEE-RSSYLEYQKVM 174
Cdd:pfam15921  266 QDRIEQLISEHEVEITGLTEKASSARS-QANSIQSQLEIIQEQARNQNSMYMRQlSDLESTVSQLRSElREAKRMYEDKI 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  175 REIEHlsrlyiayQFLLA--EDTKVRS-----AEELKEMQDKvikLQEELSENDKKIKALNHEIEELEKRKDKETGG--- 244
Cdd:pfam15921  345 EELEK--------QLVLAnsELTEARTerdqfSQESGNLDDQ---LQKLLADLHKREKELSLEKEQNKRLWDRDTGNsit 413
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  245 ---ILRSLEDALAEAQRVNT-----KSQSAFDLKKKnLACEESKRKELEKnmvedSKTLAAKEKEVKKItdgLHALQEAS 316
Cdd:pfam15921  414 idhLRRELDDRNMEVQRLEAllkamKSECQGQMERQ-MAAIQGKNESLEK-----VSSLTAQLESTKEM---LRKVVEEL 484
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  317 NKDAEALAAAQQHFNAVSAGLSSNEDGAEATLAgqmmacknDISKAQTEAKQAQMKLKHAQQE---LKNKQAEVKKMDSG 393
Cdd:pfam15921  485 TAKKMTLESSERTVSDLTASLQEKERAIEATNA--------EITKLRSRVDLKLQELQHLKNEgdhLRNVQTECEALKLQ 556
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586  394 YRKDQEALEAvkrLKEKLEAEMKKLNYEENKEESLLEKRRQLSRDI 439
Cdd:pfam15921  557 MAEKDKVIEI---LRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEI 599
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
582-976 2.75e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 48.56  E-value: 2.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  582 NMDNAKKVAFDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELrikenelraLEEELAGLKNTAEK 661
Cdd:pfam05483  308 SMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEEL---------LRTEQQRLEKNEDQ 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  662 YRQLKQQWEMKTEEADLLqTKLQQSsyhkQQEELDALKKTIEESEETLKNTKEIQRKAEEkYEVLENKMKNAEAEREREL 741
Cdd:pfam05483  379 LKIITMELQKKSSELEEM-TKFKNN----KEVELEELKKILAEDEKLLDEKKQFEKIAEE-LKGKEQELIFLLQAREKEI 452
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  742 KDAQKKLDCAKTKADASSKKMKEKQQEVE--------------AITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVM 807
Cdd:pfam05483  453 HDLEIQLTAIKTSEEHYLKEVEDLKTELEkeklknieltahcdKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERM 532
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  808 AAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLK----IKELDHNISKHKREAEDGAAKVSK 883
Cdd:pfam05483  533 LKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKkekqMKILENKCNNLKKQIENKNKNIEE 612
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  884 MLKDYDWINAERHLFGQPNSAYDFKTNNPK-EAGQRLQKLQEMKEKLGRNVNMRAMnvlteAEERYNDLMKKKRIVENDK 962
Cdd:pfam05483  613 LHQENKALKKKGSAENKQLNAYEIKVNKLElELASAKQKFEEIIDNYQKEIEDKKI-----SEEKLLEEVEKAKAIADEA 687
                          410
                   ....*....|....
gi 1034663586  963 SKILTTIEDLDQKK 976
Cdd:pfam05483  688 VKLQKEIDKRCQHK 701
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
634-842 4.15e-05

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 47.86  E-value: 4.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  634 VQDELRIKENELRALEEELAGLKNTAE----KYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEEldalkktieeseetl 709
Cdd:pfam01576  880 LQDEKRRLEARIAQLEEELEEEQSNTEllndRLRKSTLQVEQLTTELAAERSTSQKSESARQQLE--------------- 944
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  710 KNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQ 789
Cdd:pfam01576  945 RQNKELKAKLQEMEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQ 1024
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  790 LEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQ---EEVTKQKEVITAQDTVIKAK 842
Cdd:pfam01576 1025 AEKGNSRMKQLKRQLEEAEEEASRANAARRKLQrelDDATESNESMNREVSTLKSK 1080
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
618-832 4.23e-05

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 46.83  E-value: 4.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  618 RSQAASILTKFQELKDVQDELRIKENELRaleEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEelda 697
Cdd:COG1340     63 REKRDELNEKVKELKEERDELNEKLNELR---EELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEVLSPEEE---- 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  698 lKKTIEESEEtLKNTKEIQRKAEEKYEVLENKMKNAEAERErELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELE 777
Cdd:COG1340    136 -KELVEKIKE-LEKELEKAKKALEKNEKLKELRAELKELRK-EAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEAD 212
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  778 ELKREHTSYKQQLEAVNEAIKSYESQI-----EVMAAEVAKNKESVNKAQEEVTKQKEVI 832
Cdd:COG1340    213 ELHKEIVEAQEKADELHEEIIELQKELrelrkELKKLRKKQRALKREKEKEELEEKAEEI 272
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
618-1118 4.27e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 4.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  618 RSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQqwemKTEEADLLQTKLQQSSYHKQQEELDA 697
Cdd:PRK03918   320 EEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKA----KKEELERLKKRLTGLTPEKLEKELEE 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  698 LKKTIEESEETLKNTKEIQRKAEEKYEVLE---NKMKNAE-------------------AERERELKDAQKKLdcakTKA 755
Cdd:PRK03918   396 LEKAKEEIEEEISKITARIGELKKEIKELKkaiEELKKAKgkcpvcgrelteehrkellEEYTAELKRIEKEL----KEI 471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  756 DASSKKMKEKQQEVEAITLELEELKREHTSYKQqLEAVNEAIKSYE-SQIEVMAAEVAKNKESVNKAQ------------ 822
Cdd:PRK03918   472 EEKERKLRKELRELEKVLKKESELIKLKELAEQ-LKELEEKLKKYNlEELEKKAEEYEKLKEKLIKLKgeikslkkelek 550
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  823 -EEVTKQKEVITAQDTVIKAKYAEVAK-----------------------HKEQNN--DSQLKIKELDHNISKHKREAED 876
Cdd:PRK03918   551 lEELKKKLAELEKKLDELEEELAELLKeleelgfesveeleerlkelepfYNEYLElkDAEKELEREEKELKKLEEELDK 630
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  877 GAAKVSKMLKDYDWINAERHLFGQPNSAYDF--KTNNPKEAGQRLQKLQEMKEKLGRNVNmRAMNVLTEAEERYNDLMKK 954
Cdd:PRK03918   631 AFEELAETEKRLEELRKELEELEKKYSEEEYeeLREEYLELSRELAGLRAELEELEKRRE-EIKKTLEKLKEELEEREKA 709
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  955 KRIVENDKsKILTTIEDLDQK----KNQALNIAWQKVNKDFGSIFSTLLPGANAmlappegQTVLDGLEFKVALGNTW-- 1028
Cdd:PRK03918   710 KKELEKLE-KALERVEELREKvkkyKALLKERALSKVGEIASEIFEELTEGKYS-------GVRVKAEENKVKLFVVYqg 781
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1029 -KENLTELSGGQRSLVALS--LILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTH-SQFIVVSLKEGMFNNAN 1104
Cdd:PRK03918   782 kERPLTFLSGGERIALGLAfrLALSLYLAGNIPLLILDEPTPFLDEERRRKLVDIMERYLRKiPQVIIVSHDEELKDAAD 861
                          570
                   ....*....|....
gi 1034663586 1105 VLFKTKFVDGVSTV 1118
Cdd:PRK03918   862 YVIRVSLEGGVSKV 875
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
637-876 4.28e-05

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 48.02  E-value: 4.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  637 ELRIKENELRALEEELAGLKN--TAEKYRQ--LKQQWEMKTEEADLLQTKLQQSSYH--------KQQEELDALKKTIEE 704
Cdd:COG3096    279 ERRELSERALELRRELFGARRqlAEEQYRLveMARELEELSARESDLEQDYQAASDHlnlvqtalRQQEKIERYQEDLEE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  705 SEETLKNTKEIQRKAEEKYEVLENKMKNAEAERER---ELKDAQKKLDCAKTKADA---SSKKMKEKQQEVEAITLELEE 778
Cdd:COG3096    359 LTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSlksQLADYQQALDVQQTRAIQyqqAVQALEKARALCGLPDLTPEN 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  779 LKREHTSYKQQLEAVNEAIKSYESQIEVmaAEVAKNK-----ESVNKAQEEVTKQ------KEVIT---------AQDTV 838
Cdd:COG3096    439 AEDYLAAFRAKEQQATEEVLELEQKLSV--ADAARRQfekayELVCKIAGEVERSqawqtaRELLRryrsqqalaQRLQQ 516
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1034663586  839 IKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAED 876
Cdd:COG3096    517 LRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAAEE 554
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
629-983 4.63e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.81  E-value: 4.63e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGlknTAEKYRQLKQQWEMKTEEADLLQTKLQQSS--YHKQQEELDALKKTIEESE 706
Cdd:pfam15921  419 RELDDRNMEVQRLEALLKAMKSECQG---QMERQMAAIQGKNESLEKVSSLTAQLESTKemLRKVVEELTAKKMTLESSE 495
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  707 ETLKntkEIQRKAEEKYEVLEnkMKNAEAERERELKDAQ-KKLDCAKTKADasskKMKEKQQEVEAITLELEELKREHTS 785
Cdd:pfam15921  496 RTVS---DLTASLQEKERAIE--ATNAEITKLRSRVDLKlQELQHLKNEGD----HLRNVQTECEALKLQMAEKDKVIEI 566
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  786 YKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTvikakyaevakhkeqnndsqlKIKELDH 865
Cdd:pfam15921  567 LRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDA---------------------KIRELEA 625
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  866 NISKHKREAEDGAAKVSKMLKDYDWINAERhlfgqpnsayDFKTNNPKEAGQRLQKLQEMKEKLGRNVNmramNVLTEAE 945
Cdd:pfam15921  626 RVSDLELEKVKLVNAGSERLRAVKDIKQER----------DQLLNEVKTSRNELNSLSEDYEVLKRNFR----NKSEEME 691
                          330       340       350
                   ....*....|....*....|....*....|....*...
gi 1034663586  946 ERYNDLMKKKRIVENDKSKILTTIEDLDQKKNQALNIA 983
Cdd:pfam15921  692 TTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVA 729
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
692-967 5.11e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 47.66  E-value: 5.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  692 QEELDA-LKKTIEESEETLKNTKEIQRKAEEKYEVLENKmKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVE 770
Cdd:TIGR00618  151 QGEFAQfLKAKSKEKKELLMNLFPLDQYTQLALMEFAKK-KSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELK 229
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  771 AITLELEELKREHTSYKQQLEAVNEAiksyeSQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTviKAKYAEVAKHK 850
Cdd:TIGR00618  230 HLREALQQTQQSHAYLTQKREAQEEQ-----LKKQQLLKQLRARIEELRAQEAVLEETQERINRARK--AAPLAAHIKAV 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  851 EQNNDsqlKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYD-FKTNNPKEAGQRLQKLQEMKEkl 929
Cdd:TIGR00618  303 TQIEQ---QAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIhIRDAHEVATSIREISCQQHTL-- 377
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1034663586  930 gRNVNMRAMNVLTEAEERYNDLMKKKRIVENDKSKILT 967
Cdd:TIGR00618  378 -TQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDT 414
AAA_23 pfam13476
AAA domain;
6-216 5.48e-05

AAA domain;


Pssm-ID: 463890 [Multi-domain]  Cd Length: 190  Bit Score: 45.18  E-value: 5.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    6 IILEGFKSYaQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLGiSNLSQVVIGGRNKYLINGVNANNTRVQDLFCSVG 85
Cdd:pfam13476    1 LTIENFRSF-RDQTID-FSKGLTLITGPNGSGKTTILDAIKLALY-GKTSRLKRKSGGGFVKGDIRIGLEGKGKAYVEIT 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   86 LNVNNPhflimqgritkvlnmKPPEILSMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKLKEERS 165
Cdd:pfam13476   78 FENNDG---------------RYTYAIERSRELSKKKGKTKKKEILEILEIDELQQFISELLKSDKIILPLLVFLGQERE 142
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  166 SYLEYQKVMREIEHLSRlyiAYQFLLAEDTKVRSAEELKEMQDKVIKLQEE 216
Cdd:pfam13476  143 EEFERKEKKERLEELEK---ALEEKEDEKKLLEKLLQLKEKKKELEELKEE 190
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
131-258 5.53e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 47.32  E-value: 5.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  131 AQKTIEKKEAKLKEIKT-----ILEEEITPTIQKLKEERSSYLEYQKVMREIEH--------LSRLYIAYQFLLAEDTKV 197
Cdd:COG3206    187 LRKELEEAEAALEEFRQknglvDLSEEAKLLLQQLSELESQLAEARAELAEAEArlaalraqLGSGPDALPELLQSPVIQ 266
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  198 RSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQR 258
Cdd:COG3206    267 QLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEAELEALQA 327
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
627-977 5.99e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 47.32  E-value: 5.99e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  627 KFQELKDVQDELRIKENELRALEEELAGLKNTAEK--YRQLKQQWEMKTEEADLLQTKLQQS--SYHKQQEELDALKKTI 702
Cdd:TIGR04523  272 KQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQdwNKELKSELKNQEKKLEEIQNQISQNnkIISQLNEQISQLKKEL 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  703 EESEetlKNTKEIQRKAEEKYEVLEnKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKRE 782
Cdd:TIGR04523  352 TNSE---SENSEKQRELEEKQNEIE-KLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKE 427
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  783 HTSYKQQLEAVNEAIKSYES---------------------QIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKA 841
Cdd:TIGR04523  428 IERLKETIIKNNSEIKDLTNqdsvkeliiknldntresletQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKE 507
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  842 KYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYDFKTNNPKEAGQRLQK 921
Cdd:TIGR04523  508 LEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEK 587
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  922 LQEMKEKLGRNVNMRamNVLTEAEERYNDLMKKKRIVENDKSKILTTIEDLDQKKN 977
Cdd:TIGR04523  588 QELIDQKEKEKKDLI--KEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKN 641
46 PHA02562
endonuclease subunit; Provisional
661-895 6.05e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 46.93  E-value: 6.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  661 KYRQLKQQWEMKTEEADLLQTKLQqsSYHKQQEELDAlkktieeseetlKNTKEIQRKaEEKYEVLENKMKNAEAERErE 740
Cdd:PHA02562   175 KIRELNQQIQTLDMKIDHIQQQIK--TYNKNIEEQRK------------KNGENIARK-QNKYDELVEEAKTIKAEIE-E 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  741 LKDaqKKLDCAKTKADASSKkMKEKQQEVEAITLELEELKREHTSY---------KQQLEAVNEAIKSYESQIevmaAEV 811
Cdd:PHA02562   239 LTD--ELLNLVMDIEDPSAA-LNKLNTAAAKIKSKIEQFQKVIKMYekggvcptcTQQISEGPDRITKIKDKL----KEL 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  812 AKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWI 891
Cdd:PHA02562   312 QHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKI 391

                   ....
gi 1034663586  892 NAER 895
Cdd:PHA02562   392 VKTK 395
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
629-873 6.83e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 47.02  E-value: 6.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDEL----RIKENELRALEEELAGLKNTAEKYrqLKQQWEMKTE--EADLLQTKLQQSS------YHKQQEELD 696
Cdd:pfam05483  432 EELKGKEQELifllQAREKEIHDLEIQLTAIKTSEEHY--LKEVEDLKTEleKEKLKNIELTAHCdkllleNKELTQEAS 509
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKM---------KEKQ- 766
Cdd:pfam05483  510 DMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENArsieyevlkKEKQm 589
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  767 -----------QEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNK----ESVNKAQEEVTKQK-- 829
Cdd:pfam05483  590 kilenkcnnlkKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELELASAKqkfeEIIDNYQKEIEDKKis 669
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 1034663586  830 ------EVITAQDTVIKAkyaeVAKHKEQNNDSQLKIKELDHNISKHKRE 873
Cdd:pfam05483  670 eeklleEVEKAKAIADEA----VKLQKEIDKRCQHKIAEMVALMEKHKHQ 715
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
213-453 6.90e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 46.93  E-value: 6.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  213 LQEELSENDKKIKALNHEIEELEKRkdketggiLRSLEDALAEAQRvntksqsafdlkKKNLACEESKRKELEKNMVEDS 292
Cdd:COG3206    166 LELRREEARKALEFLEEQLPELRKE--------LEEAEAALEEFRQ------------KNGLVDLSEEAKLLLQQLSELE 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  293 KTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQhfNAVSAGLSSNEDGAEATLAgqmmacknDISKAQTEAKQAQMK 372
Cdd:COG3206    226 SQLAEARAELAEAEARLAALRAQLGSGPDALPELLQ--SPVIQQLRAQLAELEAELA--------ELSARYTPNHPDVIA 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  373 LKHAQQELKNK-QAEVKKMDSGYRKDQEALEA----VKRLKEKLEAEMKKLNyeenkeeSLLEKRRQLSRDIGRLKETYE 447
Cdd:COG3206    296 LRAQIAALRAQlQQEAQRILASLEAELEALQAreasLQAQLAQLEARLAELP-------ELEAELRRLEREVEVARELYE 368

                   ....*.
gi 1034663586  448 ALLARF 453
Cdd:COG3206    369 SLLQRL 374
DUF4200 pfam13863
Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil ...
690-791 7.67e-05

Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil domain of unknwon function.


Pssm-ID: 464003 [Multi-domain]  Cd Length: 119  Bit Score: 43.33  E-value: 7.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  690 KQQEELDALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEV 769
Cdd:pfam13863    3 EKKREMFLVQLALDAKREEIERLEELLKQREEELEKKEQELKEDLIKFDKFLKENDAKRRRALKKAEEETKLKKEKEKEI 82
                           90       100
                   ....*....|....*....|..
gi 1034663586  770 EAITLELEELKREHTSYKQQLE 791
Cdd:pfam13863   83 KKLTAQIEELKSEISKLEEKLE 104
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
648-771 8.10e-05

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 46.74  E-value: 8.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  648 LEEELAGLkntAEKYRQLKQQW---EMKTEEADLLQTKLQQSsYHKQQEELDALKKTIEES-EETLKNTKEiqRKAEEKY 723
Cdd:PRK00409   518 LNELIASL---EELERELEQKAeeaEALLKEAEKLKEELEEK-KEKLQEEEDKLLEEAEKEaQQAIKEAKK--EADEIIK 591
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1034663586  724 EVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEA 771
Cdd:PRK00409   592 ELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
693-897 9.32e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.68  E-value: 9.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  693 EELDALKKTIEESEETLKNTKEIQRKAEEkyevLENKMKNAEAERErELKDAQKKLDCAKTKADASsKKMKEKQQEVEAI 772
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELEE----LEEELEELEAELE-ELREELEKLEKLLQLLPLY-QELEALEAELAEL 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  773 TLELEELKREHTSYKQQLEavneaiksyesQIEVMAAEVAKNKESVNKAQEEVTKQKEvitaqdtvikakyAEVAKHKEQ 852
Cdd:COG4717    145 PERLEELEERLEELRELEE-----------ELEELEAELAELQEELEELLEQLSLATE-------------EELQDLAEE 200
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1034663586  853 NNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHL 897
Cdd:COG4717    201 LEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERL 245
mukB PRK04863
chromosome partition protein MukB;
629-835 1.00e-04

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 46.87  E-value: 1.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEElagLKNTAEKYRQLKQQW---------------EMKTEEADLLQTKLQQSSYHKQ-- 691
Cdd:PRK04863   837 AELRQLNRRRVELERALADHESQ---EQQQRSQLEQAKEGLsalnrllprlnlladETLADRVEEIREQLDEAEEAKRfv 913
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  692 --------------------QEELDALKKTIEESEETLKNTKE--------IQRKAEEKY--------------EVLENK 729
Cdd:PRK04863   914 qqhgnalaqlepivsvlqsdPEQFEQLKQDYQQAQQTQRDAKQqafaltevVQRRAHFSYedaaemlaknsdlnEKLRQR 993
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  730 MKNAEAERER---ELKDAQKKLD-------CAKTKADASSKKMKEKQQEVEAITL----ELEELKREHTS-YKQQLEAVN 794
Cdd:PRK04863   994 LEQAEQERTRareQLRQAQAQLAqynqvlaSLKSSYDAKRQMLQELKQELQDLGVpadsGAEERARARRDeLHARLSANR 1073
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1034663586  795 EAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQ 835
Cdd:PRK04863  1074 SRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQVVNA 1114
AAA_21 pfam13304
AAA domain, putative AbiEii toxin, Type IV TA system; Several members are annotated as being ...
27-251 1.01e-04

AAA domain, putative AbiEii toxin, Type IV TA system; Several members are annotated as being of the abortive phage resistance system, in which case the family would be acting as the toxin for a type IV toxin-antitoxin resistance system.


Pssm-ID: 433102 [Multi-domain]  Cd Length: 303  Bit Score: 45.84  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   27 FNAITGLNGSGKSNILDSICFLLGISNL---SQVVIGGRNKY--LINGVNANNTRVQDLFCSVGLNVNNPHFLIMqgriT 101
Cdd:pfam13304    1 INVLIGPNGSGKSNLLEALRFLADFDALvigLTDERSRNGGIggIPSLLNGIDPKEPIEFEISEFLEDGVRYRYG----L 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  102 KVLNMKPPEILSMIEEAAGTRMYEYKKIAaqKTIEKKEAKLKEIKTILEEEItpTIQKLKEERSSYLEYQKVMREIEHLS 181
Cdd:pfam13304   77 DLEREDVEEKLSSKPTLLEKRLLLREDSE--EREPKFPPEAEELRLGLDVEE--RIELSLSELSDLISGLLLLSIISPLS 152
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034663586  182 RLYIAYQFLLAEDTKVRSAEELKEMQDKVIKLQEELS----ENDKKIKALNHEIEELEKRKDKETGGILRSLED 251
Cdd:pfam13304  153 FLLLLDEGLLLEDWAVLDLAADLALFPDLKELLQRLVrglkLADLNLSDLGEGIEKSLLVDDRLRERGLILLEN 226
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
192-396 1.30e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  192 AEDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDkETGGILRSLEDALAE----AQRVNTKSQSAF 267
Cdd:COG4942     46 LKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIA-ELRAELEAQKEELAEllraLYRLGRQPPLAL 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  268 DLKKKNLAcEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEAT 347
Cdd:COG4942    125 LLSPEDFL-DAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLAR 203
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1034663586  348 LAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRK 396
Cdd:COG4942    204 LEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
619-862 1.42e-04

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 45.29  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  619 SQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNT-AEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQ--EEL 695
Cdd:COG1340      8 SSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDElNAQVKELREEAQELREKRDELNEKVKELKEERDElnEKL 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  696 DALKKTIEESEETLKNTKEIQR---KAEEKYEVLENKMKNA----EAERE--RELKDAQKKLDCAKtKADASSKKMKEKQ 766
Cdd:COG1340     88 NELREELDELRKELAELNKAGGsidKLRKEIERLEWRQQTEvlspEEEKElvEKIKELEKELEKAK-KALEKNEKLKELR 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  767 QEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEV 846
Cdd:COG1340    167 AELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADELHEEIIELQKELRELRKEL 246
                          250
                   ....*....|....*.
gi 1034663586  847 AKHKEQNNDSQLKIKE 862
Cdd:COG1340    247 KKLRKKQRALKREKEK 262
COG1106 COG1106
ATPase/GTPase, AAA15 family [General function prediction only];
2-55 1.53e-04

ATPase/GTPase, AAA15 family [General function prediction only];


Pssm-ID: 440723 [Multi-domain]  Cd Length: 330  Bit Score: 45.42  E-value: 1.53e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586    2 HIKSIILEGFKSYAQRTEVN----GFDPL-FNAITGLNGSGKSNILDSICFLLGISNLS 55
Cdd:COG1106      1 MLISFSIENFRSFKDELTLSmvasGLRLLrVNLIYGANASGKSNLLEALYFLRNLVLNS 59
ABCC_MRP_domain2 cd03244
ATP-binding cassette domain 2 of multidrug resistance-associated protein; The ABC subfamily C ...
1012-1093 1.77e-04

ATP-binding cassette domain 2 of multidrug resistance-associated protein; The ABC subfamily C is also known as MRP (multidrug resistance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.


Pssm-ID: 213211 [Multi-domain]  Cd Length: 221  Bit Score: 44.02  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1012 QTVLDGLEFKVALGNtwkENLtelSGGQRSLVALSLILsmllFKPAPIYILDEVDAALDLSHTQNIGQMLRTHFTHSQFI 1091
Cdd:cd03244    123 ESLPGGLDTVVEEGG---ENL---SVGQRQLLCLARAL----LRKSKILVLDEATASVDPETDALIQKTIREAFKDCTVL 192

                   ..
gi 1034663586 1092 VV 1093
Cdd:cd03244    193 TI 194
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
629-806 2.09e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.39  E-value: 2.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLlQTKLQQ--SSYHKQQEELDALKKTIEESE 706
Cdd:COG3206    226 SQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLRAQLAEL-EAELAElsARYTPNHPDVIALRAQIAALR 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  707 EtlkntkEIQRKAEEKYEVLENKMKNAEAeRERELKDAQKKLDcaktkadASSKKMKEKQQEVEAITLELEELKREHTSY 786
Cdd:COG3206    305 A------QLQQEAQRILASLEAELEALQA-REASLQAQLAQLE-------ARLAELPELEAELRRLEREVEVARELYESL 370
                          170       180
                   ....*....|....*....|
gi 1034663586  787 KQQLEAVNEAIKSYESQIEV 806
Cdd:COG3206    371 LQRLEEARLAEALTVGNVRV 390
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
629-830 2.22e-04

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 44.25  E-value: 2.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEA-DLLQTKLQQ-SSYHKQQEELDALKKTIEES- 705
Cdd:pfam00261    8 EELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEELERTeERLAEALEKlEEAEKAADESERGRKVLENRa 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  706 ----------EETLKNTKEIQRKAEEKYEVLENKMKNAEAERERelkdaqkkldcAKTKADASSKKMKEKQQEVEAIT-- 773
Cdd:pfam00261   88 lkdeekmeilEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLER-----------AEERAELAESKIVELEEELKVVGnn 156
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  774 ---LELEELK---REhTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKE 830
Cdd:pfam00261  157 lksLEASEEKaseRE-DKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELEAEKE 218
ABC_RNaseL_inhibitor_domain2 cd03237
The ATP-binding cassette domain 2 of RNase L inhibitor; The ABC ATPase, RNase L inhibitor (RLI) ...
1031-1070 2.75e-04

The ATP-binding cassette domain 2 of RNase L inhibitor; The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology.


Pssm-ID: 213204 [Multi-domain]  Cd Length: 246  Bit Score: 43.94  E-value: 2.75e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1034663586 1031 NLTELSGGQRSLVALSLILSmllfKPAPIYILDEVDAALD 1070
Cdd:cd03237    112 EVPELSGGELQRVAIAACLS----KDADIYLLDEPSAYLD 147
ABCC_MRP_Like cd03228
ATP-binding cassette domain of multidrug resistance protein-like transporters; The MRP ...
1029-1094 3.11e-04

ATP-binding cassette domain of multidrug resistance protein-like transporters; The MRP (Multidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213195 [Multi-domain]  Cd Length: 171  Bit Score: 42.76  E-value: 3.11e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586 1029 KENLteLSGGQRSLVALSLilsMLLfKPAPIYILDEVDAALDlSHTQN-IGQMLRTHFTHSQFIVVS 1094
Cdd:cd03228     93 RENI--LSGGQRQRIAIAR---ALL-RDPPILILDEATSALD-PETEAlILEALRALAKGKTVIVIA 152
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
630-862 3.20e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.00  E-value: 3.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  630 ELKDVQDELRIKENELRALEEELAGLKNTaEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEELDALKKTIEESEETL 709
Cdd:NF033838   175 ELEIAESDVEVKKAELELVKEEAKEPRDE-EKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNV 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  710 KNTKEIQRKAEEKYEVL---------ENKMKNAEA------------ERERELKDAQKKLDCAKTKADASSKKMKEKQQE 768
Cdd:NF033838   254 ATSEQDKPKRRAKRGVLgepatpdkkENDAKSSDSsvgeetlpspslKPEKKVAEAEKKVEEAKKKAKDQKEEDRRNYPT 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  769 VEAITLELEELKREHTSYKQQLEAVNEAIKSyesqievmaaevAKNKESVNKAQEEVTKQKEVITAQDTVI--KAKYAEV 846
Cdd:NF033838   334 NTYKTLELEIAESDVKVKEAELELVKEEAKE------------PRNEEKIKQAKAKVESKKAEATRLEKIKtdRKKAEEE 401
                          250
                   ....*....|....*.
gi 1034663586  847 AKHKEQNNDsqlKIKE 862
Cdd:NF033838   402 AKRKAAEED---KVKE 414
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
121-457 3.36e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.05  E-value: 3.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  121 TRMYEYKKIAAQKTIEKKEAKLKEI---KTILEEEITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDTKv 197
Cdd:PRK03918   372 EELERLKKRLTGLTPEKLEKELEELekaKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHR- 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  198 rsAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEK--------RKDKETGGILRSLEDALAE--AQRVNTKSQSAF 267
Cdd:PRK03918   451 --KELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKvlkkeselIKLKELAEQLKELEEKLKKynLEELEKKAEEYE 528
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  268 DLKKK---------NLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLS 338
Cdd:PRK03918   529 KLKEKliklkgeikSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELK 608
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  339 SNEDGAEATLAgQMMACKNDISKAQTE------------AKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKR 406
Cdd:PRK03918   609 DAEKELEREEK-ELKKLEEELDKAFEElaetekrleelrKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEK 687
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  407 LKEKLEAEMKKLnyEENKEEslLEKRRQLSRDIGRLKETYEALLARFPNLR 457
Cdd:PRK03918   688 RREEIKKTLEKL--KEELEE--REKAKKELEKLEKALERVEELREKVKKYK 734
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
658-886 3.37e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 45.04  E-value: 3.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  658 TAEKYRQLKQQWEMKTEEADLLQTKLQQSSYhkqQEELDALKKTIEESEETLKNTKEIQRKAEEKyevlenKMKNAEAER 737
Cdd:PTZ00108  1100 TKEKVEKLNAELEKKEKELEKLKNTTPKDMW---LEDLDKFEEALEEQEEVEEKEIAKEQRLKSK------TKGKASKLR 1170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  738 ERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEaiksyESQIEVMAAEVAKNKES 817
Cdd:PTZ00108  1171 KPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDD-----EEQKTKPKKSSVKRLKS 1245
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  818 VNKAQEEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHK-REAEDGAAKVSKMLK 886
Cdd:PTZ00108  1246 KKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGsKPSSPTKKKVKKRLE 1315
ABC_RecN cd03241
ATP-binding cassette domain of RecN; RecN ATPase involved in DNA repair; similar to ABC ...
23-70 3.71e-04

ATP-binding cassette domain of RecN; RecN ATPase involved in DNA repair; similar to ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213208 [Multi-domain]  Cd Length: 276  Bit Score: 43.73  E-value: 3.71e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1034663586   23 FDPLFNAITGLNGSGKSNILDSICFLLGISNLSQVVIGGRNKYLINGV 70
Cdd:cd03241     19 FEEGLTVLTGETGAGKSILLDALSLLLGGRASADLIRSGAEKAVVEGV 66
PRK12704 PRK12704
phosphodiesterase; Provisional
639-835 4.01e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.38  E-value: 4.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  639 RIKENELRALEEELAGL----KNTAEKYRQLKQQwEMKtEEADLLQTKLQQSsYHKQQEELDALKKTIEESEETLKNTKE 714
Cdd:PRK12704    27 KIAEAKIKEAEEEAKRIleeaKKEAEAIKKEALL-EAK-EEIHKLRNEFEKE-LRERRNELQKLEKRLLQKEENLDRKLE 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  715 IQRKAEEKyevLENKMKNAEaERERELKDAQKKLDcaktkadassKKMKEKQQEVEAITleleELKREhtsykqqlEAVN 794
Cdd:PRK12704   104 LLEKREEE---LEKKEKELE-QKQQELEKKEEELE----------ELIEEQLQELERIS----GLTAE--------EAKE 157
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1034663586  795 EAIKSYESQIEVMAAEVAKNKEsvNKAQEEVTKQKEVITAQ 835
Cdd:PRK12704   158 ILLEKVEEEARHEAAVLIKEIE--EEAKEEADKKAKEILAQ 196
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
94-418 4.39e-04

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 44.41  E-value: 4.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   94 LIMQGRITKVLNMKPPEILSMIEEAAGT--------RMYEYKKiAAQKTIEKKEAKLKEIKTILEEE---ITPTIQKLKE 162
Cdd:PRK10246   152 LLSQGQFAAFLNAKPKERAELLEELTGTeiygqisaMVFEQHK-SARTELEKLQAQASGVALLTPEQvqsLTASLQVLTD 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  163 ERSSYLEYQKVM-REIEHLSRLYIAYQFLLAEDTKVRSA-EELKEMQDKVIKLqeELSENDKKIKALNHEIEELEKRkdk 240
Cdd:PRK10246   231 EEKQLLTAQQQQqQSLNWLTRLDELQQEASRRQQALQQAlAAEEKAQPQLAAL--SLAQPARQLRPHWERIQEQSAA--- 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  241 etggilrsLEDALAEAQRVNTKSQSAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEK------EVKkitdGLHALQE 314
Cdd:PRK10246   306 --------LAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQSLNTWLAEHDRfrqwnnELA----GWRAQFS 373
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  315 ASNKDAEALAAAQQHFNAVSAGLS---------SNEDGAEA--------TLAGQMMACKNDISKAQTEAKQAQMKLKHAQ 377
Cdd:PRK10246   374 QQTSDREQLRQWQQQLTHAEQKLNalpaitltlTADEVAAAlaqhaeqrPLRQRLVALHGQIVPQQKRLAQLQVAIQNVT 453
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1034663586  378 QELKNKQAEVKKMDSGYRKDQEALEAVKRLKEkLEAEMKKL 418
Cdd:PRK10246   454 QEQTQRNAALNEMRQRYKEKTQQLADVKTICE-QEARIKDL 493
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
627-884 4.50e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 44.34  E-value: 4.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  627 KFQELKdvQDELRiKENELRALEEELAGLKNTAEKYRQLK---------QQWEMKTE-EADLLQTKLQQSSYHKQQEELD 696
Cdd:pfam17380  292 KFEKME--QERLR-QEKEEKAREVERRRKLEEAEKARQAEmdrqaaiyaEQERMAMErERELERIRQEERKRELERIRQE 368
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALKKTIEESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLEL 776
Cdd:pfam17380  369 EIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREM 448
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  777 EELKREHTSYKQQLeavnEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTK--QKEVITAQDTVIKAKYAEVAKHKEQnN 854
Cdd:pfam17380  449 ERVRLEEQERQQQV----ERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKilEKELEERKQAMIEEERKRKLLEKEM-E 523
                          250       260       270
                   ....*....|....*....|....*....|
gi 1034663586  855 DSQLKIKEldhniSKHKREAEDGAAKVSKM 884
Cdd:pfam17380  524 ERQKAIYE-----EERRREAEEERRKQQEM 548
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
151-876 4.88e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 44.44  E-value: 4.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  151 EEITPTIQKLKEERSSYLEyqkVMREIEHLSRLYIAYQFLLAEDTKVRSA------EEL----KEMQDKVIKLQEELSEN 220
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLES---AELRLSHLHFGYKSDETLIASRQEERQEtsaelnQLLrtldDQWKEKRDELNGELSAA 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  221 DKKIKALNHEIEELEKRKdketggilRSLEDALAEaqrvntksQSAFDLKKknlacEESKRKELEkNMVEDSKTLAAKEK 300
Cdd:pfam12128  314 DAAVAKDRSELEALEDQH--------GAFLDADIE--------TAAADQEQ-----LPSWQSELE-NLEERLKALTGKHQ 371
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  301 EVKKITDGLHALQEASNKDaeALAAAQQHFNAVSAGLSSNEDGAEATLAGQMMACKNDISKAQTEAKQAQMKLKHAQQEL 380
Cdd:pfam12128  372 DVTAKYNRRRSKIKEQNNR--DIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFNEEEYRLKSRLGEL 449
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  381 KNKQAEV----------KKMDSGYRKDQEALEAVKRLKEKLEAEMKKLN-YEENKEESLLEKRRQLSRDIGRLKETYEAL 449
Cdd:pfam12128  450 KLRLNQAtatpelllqlENFDERIERAREEQEAANAEVERLQSELRQARkRRDQASEALRQASRRLEERQSALDELELQL 529
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  450 LARFPNLRFAYKDPEKNWNRNCVKGLVASLISVKDTSATTALELVAGER-LYNVVVD---TEVTGKKLLERgELKRRyti 525
Cdd:pfam12128  530 FPQAGTLLHFLRKEAPDWEQSIGKVISPELLHRTDLDPEVWDGSVGGELnLYGVKLDlkrIDVPEWAASEE-ELRER--- 605
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  526 iplnkisaRCIAPETLRVAQNLVGPDNVHVALSLVEYKpELQKAMEfvFGTTFVCDNMDNAKKVAFDKRIMTRTVTlggd 605
Cdd:pfam12128  606 --------LDKAEEALQSAREKQAAAEEQLVQANGELE-KASREET--FARTALKNARLDLRRLFDEKQSEKDKKN---- 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  606 vfdphgtlsggaRSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQ---WEMKTEEADLLQTK 682
Cdd:pfam12128  671 ------------KALAERKDSANERLNSLEAQLKQLDKKHQAWLEEQKEQKREARTEKQAYWQvveGALDAQLALLKAAI 738
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  683 LQQSSYHKQQeeLDALKKtieeseetlKNTKEIQRKAEEKYEVlenkmknaeAERERELKDAQKKL-DCAKTKADASSKK 761
Cdd:pfam12128  739 AARRSGAKAE--LKALET---------WYKRDLASLGVDPDVI---------AKLKREIRTLERKIeRIAVRRQEVLRYF 798
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  762 --MKEK-QQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEvmaaevaKNKESVNKAQEEVTkqkEVITAQDTV 838
Cdd:pfam12128  799 dwYQETwLQRRPRLATQLSNIERAISELQQQLARLIADTKLRRAKLE-------MERKASEKQQVRLS---ENLRGLRCE 868
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|..
gi 1034663586  839 IKaKYAEVAKHKEqNNDSQLKIKE----LDHNISKHKREAED 876
Cdd:pfam12128  869 MS-KLATLKEDAN-SEQAQGSIGErlaqLEDLKLKRDYLSES 908
fecE PRK11231
Fe(3+) dicitrate ABC transporter ATP-binding protein FecE;
1032-1082 4.91e-04

Fe(3+) dicitrate ABC transporter ATP-binding protein FecE;


Pssm-ID: 183044 [Multi-domain]  Cd Length: 255  Bit Score: 43.08  E-value: 4.91e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034663586 1032 LTELSGGQRSLValslILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLR 1082
Cdd:PRK11231   136 LTDLSGGQRQRA----FLAMVLAQDTPVVLLDEPTTYLDINHQVELMRLMR 182
EcfA2 COG1122
Energy-coupling factor transporter ATP-binding protein EcfA2 [Inorganic ion transport and ...
1034-1094 5.22e-04

Energy-coupling factor transporter ATP-binding protein EcfA2 [Inorganic ion transport and metabolism, General function prediction only];


Pssm-ID: 440739 [Multi-domain]  Cd Length: 230  Bit Score: 42.70  E-value: 5.22e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034663586 1034 ELSGGQRSLVALSLILSMllfKPaPIYILDEVDAALDLSHTQNIGQMLRT-HFTHSQFIVVS 1094
Cdd:COG1122    134 ELSGGQKQRVAIAGVLAM---EP-EVLVLDEPTAGLDPRGRRELLELLKRlNKEGKTVIIVT 191
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
735-895 5.30e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.67  E-value: 5.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  735 AERERELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKN 814
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYRS 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  815 KESVNKAQ------------EEVTKQKEVITAQDTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVS 882
Cdd:COG3883     99 GGSVSYLDvllgsesfsdflDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQA 178
                          170
                   ....*....|...
gi 1034663586  883 KMLKDYDWINAER 895
Cdd:COG3883    179 EQEALLAQLSAEE 191
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
132-342 5.42e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.14  E-value: 5.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  132 QKTIEKKEAKLKEIKTILEE--EITPTIQKLKEERSSYLEYQKVMREIEHLSRLYIAYQFLLAE----DTKVRSAE---- 201
Cdd:COG4913    606 FDNRAKLAALEAELAELEEElaEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREiaelEAELERLDassd 685
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  202 ELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRkdketggilrsLEDALAEAQRVNTKSQSAFDLKKKNLaceeskR 281
Cdd:COG4913    686 DLAALEEQLEELEAELEELEEELDELKGEIGRLEKE-----------LEQAEEELDELQDRLEAAEDLARLEL------R 748
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586  282 KELEKnMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNA--------VSAGLSSNED 342
Cdd:COG4913    749 ALLEE-RFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRAFNRewpaetadLDADLESLPE 816
RloC COG4694
Wobble nucleotide-excising tRNase [Translation, ribosomal structure and biogenesis];
3-453 5.76e-04

Wobble nucleotide-excising tRNase [Translation, ribosomal structure and biogenesis];


Pssm-ID: 443729 [Multi-domain]  Cd Length: 692  Bit Score: 43.96  E-value: 5.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    3 IKSII-LEGFKSYAQRTEVNGFDPLfNAITGLNGSGK---SNILDSICfllgiSNLSQVVIGGRNKYLINGVNANNTR-- 76
Cdd:COG4694      2 ITKIKkLKNVGAFKDFGWLAFFKKL-NLIYGENGSGKstlSRILRSLE-----LGDTSSEVIAEFEIEAGGSAPNPSVrv 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   77 -----VQDLFCSVGlnvNNPHFLIMQGRITKVLNmkppEILSMIEEAAGtrmYEYKKIAAQKTIEKKEAKLKEIKTILEE 151
Cdd:COG4694     76 fnrdfVEENLRSGE---EIKGIFTLGEENIELEE----EIEELEKEIED---LKKELDKLEKELKEAKKALEKLLEDLAK 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  152 EITPTIQKLKeeRSSYLEYQKvmreiEHLSRLYIAYQFLLAEDTKvRSAEELKEMQDKVIKLQEELSENDKKIKALNHEI 231
Cdd:COG4694    146 SIKDDLKKLF--ASSGRNYRK-----ANLEKKLSALKSSSEDELK-EKLKLLKEEEPEPIAPITPLPDLKALLSEAETLL 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  232 EELEKRKDKEtggILRSLEDALAEAQRVntksQSAFDLKKKN-----LACE----ESKRKELEKNMVEDSKTLAAK-EKE 301
Cdd:COG4694    218 EKSAVSSAIE---ELAALIQNPGNSDWV----EQGLAYHKEEeddtcPFCQqelaAERIEALEAYFDDEYEKLLAAlKDL 290
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  302 VKKITDGLHALQEASN-----KDAEALAAAQQHFNAVSAGLSSNE--------------DGAEATLAGQMMACKNDISKA 362
Cdd:COG4694    291 LEELESAINALSALLLeilrtLLPSAKEDLKAALEALNALLETLLaaleekianpstsiDLDDQELLDELNDLIAALNAL 370
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  363 QTEAKQAQMKLKHAQQELKNKQAEVKKmdsgyrkdQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKRRQLSR----- 437
Cdd:COG4694    371 IEEHNAKIANLKAEKEEARKKLEAHEL--------AELKEDLSRYKAEVEELIEELKTIKALKKALEDLKTEISEleael 442
                          490
                   ....*....|....*..
gi 1034663586  438 -DIGRLKETYEALLARF 453
Cdd:COG4694    443 sSVDEAADEINEELKAL 459
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
629-812 6.36e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.67  E-value: 6.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNT-AEKYRQLKQQWE---MKTEEADLLQTKLQQSSYHKQQEELDALKKTIEE 704
Cdd:COG3883     51 EEYNELQAELEALQAEIDKLQAEIAEAEAEiEERREELGERARalyRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADA 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  705 SEETLKNTKEIQRKAEEKYEVLENKMKNAEAERErELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHT 784
Cdd:COG3883    131 DADLLEELKADKAELEAKKAELEAKLAELEALKA-ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAE 209
                          170       180
                   ....*....|....*....|....*...
gi 1034663586  785 SYKQQLEAVNEAIKSYESQIEVMAAEVA 812
Cdd:COG3883    210 AAAAAAAAAAAAAAAAAAAAAAAAAAAA 237
46 PHA02562
endonuclease subunit; Provisional
632-850 6.42e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 43.85  E-value: 6.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  632 KDVQDELRIKENELRALEEELAGLKNTAEkyRQLKQQWEMKTEEADLLQTKLQQSSYHKQQ-----EELDALKKTIEESE 706
Cdd:PHA02562   177 RELNQQIQTLDMKIDHIQQQIKTYNKNIE--EQRKKNGENIARKQNKYDELVEEAKTIKAEieeltDELLNLVMDIEDPS 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  707 ETLK--NTKEIQRKAE-----------EKYEVLENKMKNAEAERER------ELKDAQKKLDCAKTKADasskKMKEKQQ 767
Cdd:PHA02562   255 AALNklNTAAAKIKSKieqfqkvikmyEKGGVCPTCTQQISEGPDRitkikdKLKELQHSLEKLDTAID----ELEEIMD 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  768 EVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKqkevitaqdtvIKAKYAEVA 847
Cdd:PHA02562   331 EFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDK-----------IVKTKSELV 399

                   ...
gi 1034663586  848 KHK 850
Cdd:PHA02562   400 KEK 402
ABC_sbcCD cd03279
ATP-binding cassette domain of sbcCD; SbcCD and other Mre11/Rad50 (MR) complexes are ...
8-49 6.58e-04

ATP-binding cassette domain of sbcCD; SbcCD and other Mre11/Rad50 (MR) complexes are implicated in the metabolism of DNA ends. They cleave ends sealed by hairpin structures and are thought to play a role in removing protein bound to DNA termini.


Pssm-ID: 213246 [Multi-domain]  Cd Length: 213  Bit Score: 42.26  E-value: 6.58e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586    8 LEGFKSYAQRTEVN----GFDPLFnAITGLNGSGKSNILDSICFLL 49
Cdd:cd03279      8 LKNFGPFREEQVIDftglDNNGLF-LICGPTGAGKSTILDAITYAL 52
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
591-824 6.83e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 43.13  E-value: 6.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  591 FDKRIMTRTVTLGGDVFDPHGTLSGGARSQAASILTKFQELKDVQDELRIKENeLRALEEELAGLKNTAEKYRQlkqqwe 670
Cdd:cd22656     63 FKDDTYPSIVSLAGDIYNYAQNAGGTIDSYYAEILELIDDLADATDDEELEEA-KKTIKALLDDLLKEAKKYQD------ 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  671 mkteEADLLQTKLQ--QSSYHKQQEELDALKKTI------EESEETLKNTKEIQRKAEEKYEVLENKMKNAEAERERELK 742
Cdd:cd22656    136 ----KAAKVVDKLTdfENQTEKDQTALETLEKALkdlltdEGGAIARKEIKDLQKELEKLNEEYAAKLKAKIDELKALIA 211
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  743 DAQKKLDcAKTKADASSKKMKEK----QQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEvmAAEVAKNKesV 818
Cdd:cd22656    212 DDEAKLA-AALRLIADLTAADTDldnlLALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDDISKIP--AAILAKLE--L 286

                   ....*.
gi 1034663586  819 NKAQEE 824
Cdd:cd22656    287 EKAIEK 292
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
130-328 6.85e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.21  E-value: 6.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  130 AAQKTIEKKEAKLKEIKtileEEITPTIQKLKEERSSYLEYQKVMREIEhlsrlyiayqfllaedtkvrsaEELKEMQDK 209
Cdd:COG4942     17 AQADAAAEAEAELEQLQ----QEIAELEKELAALKKEEKALLKQLAALE----------------------RRIAALARR 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  210 VIKLQEELSENDKKIKALNHEIEELEKRKDKETGGI-------------------------------------------- 245
Cdd:COG4942     71 IRALEQELAALEAELAELEKEIAELRAELEAQKEELaellralyrlgrqpplalllspedfldavrrlqylkylaparre 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  246 -LRSLEDALAEAQRVNTKSQSAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALA 324
Cdd:COG4942    151 qAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIA 230

                   ....
gi 1034663586  325 AAQQ 328
Cdd:COG4942    231 RLEA 234
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
617-895 7.68e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.80  E-value: 7.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKfQELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQqwemKTEEADLLQTKLQQSsyHKQQEELD 696
Cdd:TIGR00618  201 LRSQLLTLCTP-CMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKRE----AQEEQLKKQQLLKQL--RARIEELR 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALKKTIEESEETL-------------KNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKKMK 763
Cdd:TIGR00618  274 AQEAVLEETQERInrarkaaplaahiKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQ 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  764 EK--QQEVEAITLELEELKREHT------SYKQQLEAVNEAIKSYESQIEVMAAEVAknKESVNKAQEEVTKQKEVITAQ 835
Cdd:TIGR00618  354 EIhiRDAHEVATSIREISCQQHTltqhihTLQQQKTTLTQKLQSLCKELDILQREQA--TIDTRTSAFRDLQGQLAHAKK 431
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  836 DTVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHK-REAEDGAAKVSKMLKDYDWINAER 895
Cdd:TIGR00618  432 QQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSlKEREQQLQTKEQIHLQETRKKAVV 492
PRK12704 PRK12704
phosphodiesterase; Provisional
627-762 8.02e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.23  E-value: 8.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  627 KFQELK-DVQDELRIKENELRALEEELaglkntAEKYRQLKQQWE-MKTEEADLLQtklQQSSYHKQQEELDALKKTIEE 704
Cdd:PRK12704    65 EIHKLRnEFEKELRERRNELQKLEKRL------LQKEENLDRKLElLEKREEELEK---KEKELEQKQQELEKKEEELEE 135
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  705 SEETLKntKEIQR-----KAEEKYEVLENKMKNAEAERERELKDAQKKldcAKTKADASSKKM 762
Cdd:PRK12704   136 LIEEQL--QELERisgltAEEAKEILLEKVEEEARHEAAVLIKEIEEE---AKEEADKKAKEI 193
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
192-417 8.03e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.28  E-value: 8.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  192 AEDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRkdketggiLRSLEDALAEAQRVNTKSQSAFDLKK 271
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAE--------LEALQAEIDKLQAEIAEAEAEIEERR 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  272 KNLACE-----ESKRKELEKNMVEDSKTLAAkekevkkITDGLHALQEASNKDAEALAAAQQhfnavsaglssnedgAEA 346
Cdd:COG3883     86 EELGERaralyRSGGSVSYLDVLLGSESFSD-------FLDRLSALSKIADADADLLEELKA---------------DKA 143
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  347 TLAGQmmacKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKK 417
Cdd:COG3883    144 ELEAK----KAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEA 210
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
615-830 8.33e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 43.49  E-value: 8.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  615 GGARSQAASILTKFQELKDVQDELRikeNELRALEEELAGLKNTAEKYR----QLKQQWEMKTEEADLLQTKLQQSSYHK 690
Cdd:PRK02224   338 QAHNEEAESLREDADDLEERAEELR---EEAAELESELEEAREAVEDRReeieELEEEIEELRERFGDAPVDLGNAEDFL 414
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  691 Q--QEELDALKKTIEESEETLKNTKEIQRKAE----------------------------EKYEVLENKMKNAEAERER- 739
Cdd:PRK02224   415 EelREERDELREREAELEATLRTARERVEEAEalleagkcpecgqpvegsphvetieedrERVEELEAELEDLEEEVEEv 494
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  740 --------ELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEV 811
Cdd:PRK02224   495 eerleraeDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV 574
                          250
                   ....*....|....*....
gi 1034663586  812 AknkeSVNKAQEEVTKQKE 830
Cdd:PRK02224   575 A----ELNSKLAELKERIE 589
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
125-412 8.99e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 43.51  E-value: 8.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  125 EYKKIAAQ-KTIEKKEAKLKEIKTILEEEIT--PTIQKLKEERSSYLEYQKVMREI--EHLSRLYIAYQFLLAEDTKVRS 199
Cdd:PRK03918   460 ELKRIEKElKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEKLKKYnlEELEKKAEEYEKLKEKLIKLKG 539
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  200 --------AEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETGGILRSLEDALAEAQRVNTKsqsafdlkK 271
Cdd:PRK03918   540 eikslkkeLEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDA--------E 611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  272 KNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQeaSNKDAEALAAAQQHFNAVSAGLSSNEDGAEAtlagq 351
Cdd:PRK03918   612 KELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELE--KKYSEEEYEELREEYLELSRELAGLRAELEE----- 684
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  352 mmackndISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLE 412
Cdd:PRK03918   685 -------LEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKALLK 738
mukB PRK04863
chromosome partition protein MukB;
629-814 1.05e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 43.41  E-value: 1.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNtAEKyrQLKQQWEMKTEEADLLQTKL-QQSSYHKQQEELDALKKTIEESEE 707
Cdd:PRK04863   293 RELYTSRRQLAAEQYRLVEMARELAELNE-AES--DLEQDYQAASDHLNLVQTALrQQEKIERYQADLEELEERLEEQNE 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  708 TLKNTKEIQRKAEEKYEVLENKMKNAEAererELKDAQKKLDCAKTKADA---SSKKMKEKQQEVEAITLELEELKREHT 784
Cdd:PRK04863   370 VVEEADEQQEENEARAEAAEEEVDELKS----QLADYQQALDVQQTRAIQyqqAVQALERAKQLCGLPDLTADNAEDWLE 445
                          170       180       190
                   ....*....|....*....|....*....|
gi 1034663586  785 SYKQQLEAVNEAIKSYESQIEVmaAEVAKN 814
Cdd:PRK04863   446 EFQAKEQEATEELLSLEQKLSV--AQAAHS 473
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
201-370 1.09e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.89  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  201 EELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKETggilRSLEDALAEAQR-----------VNTKSQSAFDL 269
Cdd:COG3883     44 AELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERR----EELGERARALYRsggsvsyldvlLGSESFSDFLD 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  270 KKKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEATLA 349
Cdd:COG3883    120 RLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLA 199
                          170       180
                   ....*....|....*....|.
gi 1034663586  350 GQMMACKNDISKAQTEAKQAQ 370
Cdd:COG3883    200 ELEAELAAAEAAAAAAAAAAA 220
PRK12704 PRK12704
phosphodiesterase; Provisional
725-845 1.09e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 42.84  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  725 VLENKMKNAEAERERELKDAQKKLDCAKtkadasSKKMKEKQQEVEAITLELEelkREHTSYKQQLEAVNEAIKSYESQI 804
Cdd:PRK12704    28 IAEAKIKEAEEEAKRILEEAKKEAEAIK------KEALLEAKEEIHKLRNEFE---KELRERRNELQKLEKRLLQKEENL 98
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1034663586  805 EVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAE 845
Cdd:PRK12704    99 DRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEE 139
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
125-300 1.15e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 43.19  E-value: 1.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  125 EYKKIAAQKTIE----KKEAKLKEIKTILEEEITPTIQKLKEERSSYLEYQ----KVMREIEHLSRLYI-------AYQF 189
Cdd:pfam17380  338 EQERMAMEREREleriRQEERKRELERIRQEEIAMEISRMRELERLQMERQqkneRVRQELEAARKVKIleeerqrKIQQ 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  190 LLAEDTKVRSAEElKEMQDKVIKLQEELSENDKKIK----ALNHEIEEL----EKRKDKETGGILRSLEDALAEAQRVNT 261
Cdd:pfam17380  418 QKVEMEQIRAEQE-EARQREVRRLEEERAREMERVRleeqERQQQVERLrqqeEERKRKKLELEKEKRDRKRAEEQRRKI 496
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1034663586  262 KSQSaFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEK 300
Cdd:pfam17380  497 LEKE-LEERKQAMIEEERKRKLLEKEMEERQKAIYEEER 534
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
617-823 1.17e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 43.08  E-value: 1.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKF--QELKDVQDELRIKENELRALEEElAGLKNTAEKYRQLKQQweMKTEEADLLQTKLQQSSYHKQQEE 694
Cdd:COG3206    168 LRREEARKALEFleEQLPELRKELEEAEAALEEFRQK-NGLVDLSEEAKLLLQQ--LSELESQLAEARAELAEAEARLAA 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  695 LDALKKTIEESEETLKNTKEIQRkaeekyevLENKMKNAEAERER----------ELKDAQKKLdcaktkADASSKKMKE 764
Cdd:COG3206    245 LRAQLGSGPDALPELLQSPVIQQ--------LRAQLAELEAELAElsarytpnhpDVIALRAQI------AALRAQLQQE 310
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  765 KQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSY---ESQIEVMAAEVAKNKES----VNKAQE 823
Cdd:COG3206    311 AQRILASLEAELEALQAREASLQAQLAQLEARLAELpelEAELRRLEREVEVARELyeslLQRLEE 376
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
757-881 1.18e-03

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 42.89  E-value: 1.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  757 ASSKKMKEKQQEVEAITLELEELKREhtsYKQQLEAVNEAIKSYESQIEVMAaevaknKESVNKAQEEVtkqKEVITAQD 836
Cdd:PRK00409   527 ELERELEQKAEEAEALLKEAEKLKEE---LEEKKEKLQEEEDKLLEEAEKEA------QQAIKEAKKEA---DEIIKELR 594
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1034663586  837 TVIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKV 881
Cdd:PRK00409   595 QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
674-956 1.21e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 43.29  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  674 EEADLLQTKLQ-QSSYHKQQEE--------LDALKKTIEESEETLKNTKEIQRKaeekyevLENKMKNAEAERERELKDA 744
Cdd:pfam12128  604 ERLDKAEEALQsAREKQAAAEEqlvqangeLEKASREETFARTALKNARLDLRR-------LFDEKQSEKDKKNKALAER 676
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  745 QKKLDCAKTKADASSKKMKEKQQEVEAitlELEELKREHTSYKQQ-LEAVNEAIKSYESQIevmaaevaknkesvnkaqe 823
Cdd:pfam12128  677 KDSANERLNSLEAQLKQLDKKHQAWLE---EQKEQKREARTEKQAyWQVVEGALDAQLALL------------------- 734
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  824 evtkqKEVITAQDTVIKAKyaevAKHKEQNNDSQLKIKELD-HNISKHKREAEDGAAKVSKMLKD----YDWINAERHLF 898
Cdd:pfam12128  735 -----KAAIAARRSGAKAE----LKALETWYKRDLASLGVDpDVIAKLKREIRTLERKIERIAVRrqevLRYFDWYQETW 805
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034663586  899 GQPNSAYDFKTNNPKEAGQRLQ---KLQEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKKR 956
Cdd:pfam12128  806 LQRRPRLATQLSNIERAISELQqqlARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLR 866
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
624-859 1.29e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 43.09  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  624 ILTKFQELKDVQDELRIKENELRALEEELAGLKNTAE--KYRQLKQQWEMKTEEADLL----QTKLQQSSYHKQQEELDA 697
Cdd:TIGR04523   49 LKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKdlNDKLKKNKDKINKLNSDLSkinsEIKNDKEQKNKLEVELNK 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  698 LKKTIEESEETL-KNTKEIQRKAEE------KYEVLENKMKNAEAER---ERELKDAQKKLDCAKTKADASS------KK 761
Cdd:TIGR04523  129 LEKQKKENKKNIdKFLTEIKKKEKEleklnnKYNDLKKQKEELENELnllEKEKLNIQKNIDKIKNKLLKLElllsnlKK 208
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  762 MKEKQQEVEAITLELEE----LKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVI---TA 834
Cdd:TIGR04523  209 KIQKNKSLESQISELKKqnnqLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIkelEK 288
                          250       260
                   ....*....|....*....|....*
gi 1034663586  835 QDTVIKAKYAEVAKHKEQNNDSQLK 859
Cdd:TIGR04523  289 QLNQLKSEISDLNNQKEQDWNKELK 313
MsbA_lipidA TIGR02203
lipid A export permease/ATP-binding protein MsbA; This family consists of a single polypeptide ...
1016-1070 1.43e-03

lipid A export permease/ATP-binding protein MsbA; This family consists of a single polypeptide chain transporter in the ATP-binding cassette (ABC) transporter family, MsbA, which exports lipid A. It may also act in multidrug resistance. Lipid A, a part of lipopolysaccharide, is found in the outer leaflet of the outer membrane of most Gram-negative bacteria. Members of this family are restricted to the Proteobacteria (although lipid A is more broadly distributed) and often are clustered with lipid A biosynthesis genes. [Cell envelope, Biosynthesis and degradation of surface polysaccharides and lipopolysaccharides, Transport and binding proteins, Other]


Pssm-ID: 131258 [Multi-domain]  Cd Length: 571  Bit Score: 42.78  E-value: 1.43e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034663586 1016 DGLEFKVAlgntwkENLTELSGGQRSLVALSlilsMLLFKPAPIYILDEVDAALD 1070
Cdd:TIGR02203  457 LGLDTPIG------ENGVLLSGGQRQRLAIA----RALLKDAPILILDEATSALD 501
CcmA COG4133
ABC-type transport system involved in cytochrome c biogenesis, ATPase component ...
1035-1084 1.45e-03

ABC-type transport system involved in cytochrome c biogenesis, ATPase component [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443308 [Multi-domain]  Cd Length: 206  Bit Score: 41.31  E-value: 1.45e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1035 LSGGQRSLVALSLilsmLLFKPAPIYILDEVDAALDLSHTQNIGQMLRTH 1084
Cdd:COG4133    132 LSAGQKRRVALAR----LLLSPAPLWLLDEPFTALDAAGVALLAELIAAH 177
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
753-895 1.46e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  753 TKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKEsvnkAQEEVTKQKEvi 832
Cdd:COG1579     17 SELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEE----QLGNVRNNKE-- 90
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  833 taqdtvIKAKYAEVAKHKEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAER 895
Cdd:COG1579     91 ------YEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAEL 147
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
744-820 1.55e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.12  E-value: 1.55e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  744 AQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNK 820
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGE 90
COG5022 COG5022
Myosin heavy chain [General function prediction only];
198-449 1.59e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 42.76  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  198 RSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEK-----RKDKETGGILRSLEDA--LAEAQRVNTKSQSAFDLK 270
Cdd:COG5022    786 LVDYELKWRLFIKLQPLLSLLGSRKEYRSYLACIIKLQKtikreKKLRETEEVEFSLKAEvlIQKFGRSLKAKKRFSLLK 865
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  271 KKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHA-LQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAEATLA 349
Cdd:COG5022    866 KETIYLQSAQRVELAERQLQELKIDVKSISSLKLVNLELESeIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEG 945
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  350 GQMMACKNDISKAQTEAKQaqmKLKHAQQELKNkqaEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLL 429
Cdd:COG5022    946 PSIEYVKLPELNKLHEVES---KLKETSEEYED---LLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLK 1019
                          250       260
                   ....*....|....*....|...
gi 1034663586  430 EKRRQLSR---DIGRLKETYEAL 449
Cdd:COG5022   1020 ELPVEVAElqsASKIISSESTEL 1042
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
751-828 1.68e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.12  E-value: 1.68e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034663586  751 AKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQ 828
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
FepC COG1120
ABC-type cobalamin/Fe3+-siderophores transport system, ATPase component [Inorganic ion ...
1031-1082 1.73e-03

ABC-type cobalamin/Fe3+-siderophores transport system, ATPase component [Inorganic ion transport and metabolism, Coenzyme transport and metabolism];


Pssm-ID: 440737 [Multi-domain]  Cd Length: 254  Bit Score: 41.57  E-value: 1.73e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1034663586 1031 NLTELSGGQRSLVAlsliLSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLR 1082
Cdd:COG1120    134 PVDELSGGERQRVL----IARALAQEPPLLLLDEPTSHLDLAHQLEVLELLR 181
ABC_tran pfam00005
ABC transporter; ABC transporters for a large family of proteins responsible for translocation ...
1012-1064 1.74e-03

ABC transporter; ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporters are composed of two copies of this domain and two copies of a transmembrane domain pfam00664. These four domains may belong to a single polypeptide or belong in different polypeptide chains.


Pssm-ID: 394964 [Multi-domain]  Cd Length: 150  Bit Score: 40.32  E-value: 1.74e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034663586 1012 QTVLDGLEFKVALGNTWKENLTELSGGQRSLVAlsliLSMLLFKPAPIYILDE 1064
Cdd:pfam00005   99 EEALEKLGLGDLADRPVGERPGTLSGGQRQRVA----IARALLTKPKLLLLDE 147
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
646-937 1.92e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 41.81  E-value: 1.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  646 RALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSsyhkqQEELDALKKTIEESEETLKNTKEIQRKAEEKYEV 725
Cdd:COG4372     31 EQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQA-----RSELEQLEEELEELNEQLQAAQAELAQAQEELES 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  726 LENKMKNAEAERER---ELKDAQKKLDCAKTKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYES 802
Cdd:COG4372    106 LQEEAEELQEELEElqkERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQAL 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  803 QIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQDTVIKAKYAEVakhKEQNNDSQLKIKELDHNISKHKREAEDGAAKVS 882
Cdd:COG4372    186 DELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEA---KLGLALSALLDALELEEDKEELLEEVILKEIEE 262
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034663586  883 KMLKDYDWINAERHLFGQPNSAYDFKTNNPKEAGQRLQKLQEMKEKLGRNVNMRA 937
Cdd:COG4372    263 LELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDAL 317
AAA_15 pfam13175
AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the ...
1-302 2.02e-03

AAA ATPase domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily.


Pssm-ID: 433011 [Multi-domain]  Cd Length: 392  Bit Score: 41.82  E-value: 2.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586    1 MHIKSIILEGFKSYaQRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLgisnlsqvviggRNKYLINGVNANNTRVQDL 80
Cdd:pfam13175    1 MKIKSIIIKNFRCL-KDTEID-LDEDLTVLIGKNNSGKSSILEALDIFL------------NNKEKFFEDDFLVLYLKDV 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586   81 FCSVGLNVNNPHFLIMQGRItkVLNMKPPEILSMIEEAAGTRMYEYKKIAAQKTIEKKEAKLKEIKTILEEEITPTIQKL 160
Cdd:pfam13175   67 IKIDKEDLNIFENISFSIDI--EIDVEFLLILFGYLEIKKKYLCLASKGKAKEYEKTLHPKGANKADLLLELKISDLKKY 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  161 KEERSSYLEYQKVMREIEHLSRLYIAYQFLLAEDTKVRSAEEL----KEMQDKVIKLQEELSENDKKIKALNHEIEELEK 236
Cdd:pfam13175  145 LKQFKIYIYNNYYLDEKKNVFDKKSKYELPSLKEEFLNSEKEEikvdKEDLKKLINELEKSINYHENVLENLQIKKLLIS 224
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  237 RKDKETGGILRSLEDALAEAQRVNTKSQSAFDLKkknlacEESKRKELEKNMVEDSKTLAAKEKEV 302
Cdd:pfam13175  225 ADRNASDEDSEKINSLLGALKQRIFEEALQEELE------LTEKLKETQNKLKEIDKTLAEELKNI 284
ABC_RecF cd03242
ATP-binding cassette domain of RecF; RecF is a recombinational DNA repair ATPase that ...
3-48 2.07e-03

ATP-binding cassette domain of RecF; RecF is a recombinational DNA repair ATPase that maintains replication in the presence of DNA damage. When replication is prematurely disrupted by DNA damage, several recF pathway gene products play critical roles processing the arrested replication fork, allowing it to resume and complete its task. This CD represents the nucleotide binding domain of RecF. RecF belongs to a large superfamily of ABC transporters involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases with a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213209 [Multi-domain]  Cd Length: 270  Bit Score: 41.51  E-value: 2.07e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034663586    3 IKSIILEGFKSYAqRTEVNgFDPLFNAITGLNGSGKSNILDSICFL 48
Cdd:cd03242      1 LKSLELRNFRNYA-ELELE-FEPGVTVLVGENAQGKTNLLEAISLL 44
ABC_Metallic_Cations cd03235
ATP-binding cassette domain of the metal-type transporters; This family includes transporters ...
1014-1082 2.27e-03

ATP-binding cassette domain of the metal-type transporters; This family includes transporters involved in the uptake of various metallic cations such as iron, manganese, and zinc. The ATPases of this group of transporters are very similar to members of iron-siderophore uptake family suggesting that they share a common ancestor. The best characterized metal-type ABC transporters are the YfeABCD system of Y. pestis, the SitABCD system of Salmonella enterica serovar Typhimurium, and the SitABCD transporter of Shigella flexneri. Moreover other uncharacterized homologs of these metal-type transporters are mainly found in pathogens like Haemophilus or enteroinvasive E. coli isolates.


Pssm-ID: 213202 [Multi-domain]  Cd Length: 213  Bit Score: 40.59  E-value: 2.27e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586 1014 VLDGLEFkVALGNTWKENLTELSGGQRSLValslILSMLLFKPAPIYILDEVDAALDLSHTQNIGQMLR 1082
Cdd:cd03235    113 VDEALER-VGLSELADRQIGELSGGQQQRV----LLARALVQDPDLLLLDEPFAGVDPKTQEDIYELLR 176
PRK01156 PRK01156
chromosome segregation protein; Provisional
631-852 2.29e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 42.20  E-value: 2.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  631 LKDVQDELRIKENEL----RALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSyhKQQEELDALKKTIEESE 706
Cdd:PRK01156   185 IDYLEEKLKSSNLELenikKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELS--SLEDMKNRYESEIKTAE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  707 ETLKNTKEIQRK---AEEKYEVLENKMKNAEAERERE----------LKDAQKKLDCAKTKADASSKKMKE--------- 764
Cdd:PRK01156   263 SDLSMELEKNNYykeLEERHMKIINDPVYKNRNYINDyfkykndienKKQILSNIDAEINKYHAIIKKLSVlqkdyndyi 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  765 -KQQEVEAITLELEELKREHTSYK---QQLEAVNEAIKSYESQIEVMAAEVAK-------NKESVNKAQEEVTKQKEVIT 833
Cdd:PRK01156   343 kKKSRYDDLNNQILELEGYEMDYNsylKSIESLKKKIEEYSKNIERMSAFISEilkiqeiDPDAIKKELNEINVKLQDIS 422
                          250
                   ....*....|....*....
gi 1034663586  834 AQDTVIKAKYAEVAKHKEQ 852
Cdd:PRK01156   423 SKVSSLNQRIRALRENLDE 441
ABC_SMC6_euk cd03276
ATP-binding cassette domain of eukaryotic SM6 proteins; The structural maintenance of ...
3-50 2.37e-03

ATP-binding cassette domain of eukaryotic SM6 proteins; The structural maintenance of chromosomes (SMC) proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).


Pssm-ID: 213243 [Multi-domain]  Cd Length: 198  Bit Score: 40.66  E-value: 2.37e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1034663586    3 IKSIILEGFKSYAqRTEVNgFDPLFNAITGLNGSGKSNILDSICFLLG 50
Cdd:cd03276      1 IESITLKNFMCHR-HLQIE-FGPRVNFIVGNNGSGKSAILTALTIGLG 46
PRK10247 PRK10247
putative ABC transporter ATP-binding protein YbbL; Provisional
1009-1093 2.55e-03

putative ABC transporter ATP-binding protein YbbL; Provisional


Pssm-ID: 182331 [Multi-domain]  Cd Length: 225  Bit Score: 40.85  E-value: 2.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1009 PEGQTVLDGLE-FKVALgNTWKENLTELSGGQRSLVALsliLSMLLFKPApIYILDEVDAALDLSHTQNIGQMLrTHFTH 1087
Cdd:PRK10247   112 PDPAIFLDDLErFALPD-TILTKNIAELSGGEKQRISL---IRNLQFMPK-VLLLDEITSALDESNKHNVNEII-HRYVR 185

                   ....*.
gi 1034663586 1088 SQFIVV 1093
Cdd:PRK10247   186 EQNIAV 191
YnjD COG4136
ABC-type uncharacterized transport system YnjBCD, ATPase component [General function ...
1034-1093 2.62e-03

ABC-type uncharacterized transport system YnjBCD, ATPase component [General function prediction only];


Pssm-ID: 443311 [Multi-domain]  Cd Length: 211  Bit Score: 40.54  E-value: 2.62e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586 1034 ELSGGQRSLVALsliLSMLLFKPAPIyILDEVDAALDLSHTQNIGQMLRTHFTHSQFIVV 1093
Cdd:COG4136    133 TLSGGQRARVAL---LRALLAEPRAL-LLDEPFSKLDAALRAQFREFVFEQIRQRGIPAL 188
PTZ00121 PTZ00121
MAEBL; Provisional
617-955 2.83e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.05  E-value: 2.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKFQELKDVQDELRIKENELRALEEELaGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEELD 696
Cdd:PTZ00121  1062 AKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAF-GKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDAR 1140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALK--KTIEESEETLKNTK-------EIQRKAEEKYEVlENKMKNAEAERERELKDAQ--KKLDCAKTKAD---ASSKKM 762
Cdd:PTZ00121  1141 KAEeaRKAEDAKRVEIARKaedarkaEEARKAEDAKKA-EAARKAEEVRKAEELRKAEdaRKAEAARKAEEerkAEEARK 1219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  763 KEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYES--------QIEVMAAEVAKNKESVNKAQE-----EVTKQK 829
Cdd:PTZ00121  1220 AEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEarmahfarRQAAIKAEEARKADELKKAEEkkkadEAKKAE 1299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  830 EVITAQDTVIKAKYAEVAKH-KEQNNDSQLKIKELDHNISKHKREAEDGAAKVSKMLKDYDWINAERHLFGQPNSAYDFK 908
Cdd:PTZ00121  1300 EKKKADEAKKKAEEAKKADEaKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK 1379
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1034663586  909 TNNPKEAGQRLQKLQEMKEKLGRNVNMRAMNVLTEAEERYNDLMKKK 955
Cdd:PTZ00121  1380 ADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKK 1426
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
629-761 2.92e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 41.37  E-value: 2.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAglkNTAEKYRQLKQQWEMKTEEAdllqtKLQQSSYHKQQEELDALKKTIEESEET 708
Cdd:TIGR02794   80 AEKQRAAEQARQKELEQRAAAEKAA---KQAEQAAKQAEEKQKQAEEA-----KAKQAAEAKAKAEAEAERKAKEEAAKQ 151
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  709 LKNTKEIQRKAEEKYEVLENKMKNAEAERERELKDAQKKLDCAKTKADASSKK 761
Cdd:TIGR02794  152 AEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAK 204
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
189-431 3.00e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 41.35  E-value: 3.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  189 FLLAEDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKDKetggilrsLEDALAEAQRvntksqsafd 268
Cdd:COG3883      4 LALAAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNE--------LQAELEALQA---------- 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  269 lkkknlaceesKRKELEKNMVEDSKTLAAKEKEVKKItdgLHALQEASNK--------DAEALAAAQQHFNAVSAGLSSN 340
Cdd:COG3883     66 -----------EIDKLQAEIAEAEAEIEERREELGER---ARALYRSGGSvsyldvllGSESFSDFLDRLSALSKIADAD 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  341 EDgaeatLAGQMMACKNDISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNY 420
Cdd:COG3883    132 AD-----LLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELA 206
                          250
                   ....*....|.
gi 1034663586  421 EENKEESLLEK 431
Cdd:COG3883    207 AAEAAAAAAAA 217
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
629-832 3.23e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 41.54  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  629 QELKDVQDELRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQE------ELDALKKTI 702
Cdd:TIGR04523  426 KEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKElkskekELKKLNEEK 505
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  703 EESEETLKNTKEIQRKAEEKYEVLENKMKNAEAE-------------------RERELKDAQKKLDCAKTKADASSKKMK 763
Cdd:TIGR04523  506 KELEEKVKDLTKKISSLKEKIEKLESEKKEKESKisdledelnkddfelkkenLEKEIDEKNKEIEELKQTQKSLKKKQE 585
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586  764 EKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVI 832
Cdd:TIGR04523  586 EKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETI 654
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
358-432 3.50e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.97  E-value: 3.50e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034663586  358 DISKAQTEAKQAQMKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEKLEAEMKKLNYEENKEESLLEKR 432
Cdd:COG3883     17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
ABC_ATPase cd00267
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large ...
27-63 4.25e-03

ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213179 [Multi-domain]  Cd Length: 157  Bit Score: 39.15  E-value: 4.25e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1034663586   27 FNAITGLNGSGKSNILDSICFLLGISNlSQVVIGGRN 63
Cdd:cd00267     27 IVALVGPNGSGKSTLLRAIAGLLKPTS-GEILIDGKD 62
RecN COG0497
DNA repair ATPase RecN [Replication, recombination and repair];
642-792 4.31e-03

DNA repair ATPase RecN [Replication, recombination and repair];


Pssm-ID: 440263 [Multi-domain]  Cd Length: 555  Bit Score: 41.21  E-value: 4.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  642 ENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTklqqssyhkQQEELDALK------KTIEESEETLKNTKEI 715
Cdd:COG0497    154 EELLEEYREAYRAWRALKKELEELRADEAERARELDLLRF---------QLEELEAAAlqpgeeEELEEERRRLSNAEKL 224
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034663586  716 QRKAEEKYEVLENKMKNAEAererELKDAQKKLDcaktKADASSKKMKEKQQEVEAITLELEELKREHTSYKQQLEA 792
Cdd:COG0497    225 REALQEALEALSGGEGGALD----LLGQALRALE----RLAEYDPSLAELAERLESALIELEEAASELRRYLDSLEF 293
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
127-375 4.91e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 41.26  E-value: 4.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  127 KKIAAQKTIEKKEAKLKEIKtILEEEITPTIQKLkEERSSYLEYQKVmreiehlsrlyiayQFLLAEDTKVRSAEELKEM 206
Cdd:pfam15921  591 EKAQLEKEINDRRLELQEFK-ILKDKKDAKIREL-EARVSDLELEKV--------------KLVNAGSERLRAVKDIKQE 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  207 QDKVIKlQEELSENDKKIKALNHEI---------EELEKRKDK-------------ETGGILRSLEDALAEAQRVNTKSQ 264
Cdd:pfam15921  655 RDQLLN-EVKTSRNELNSLSEDYEVlkrnfrnksEEMETTTNKlkmqlksaqseleQTRNTLKSMEGSDGHAMKVAMGMQ 733
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  265 SAFDLKKKNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDA---EALAAAQQHFNAVSAGLSSNE 341
Cdd:pfam15921  734 KQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAgelEVLRSQERRLKEKVANMEVAL 813
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1034663586  342 DGAEAtlagQMMACKNDISKAQTEAkqAQMKLKH 375
Cdd:pfam15921  814 DKASL----QFAECQDIIQRQEQES--VRLKLQH 841
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
639-835 5.22e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 40.60  E-value: 5.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  639 RIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTEEADLLQTKLQQSSYHKQQEE--LDALKKTIEESEETLKNTKEIQ 716
Cdd:TIGR02794   54 RIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQaaKQAEEKQKQAEEAKAKQAAEAK 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  717 RKAEEKYEVLENKMKNAEAERERELK---DAQKKLDCAKTKADASSKKMKEKQQEVEAitlelEELKREHTSYKQQLEAv 793
Cdd:TIGR02794  134 AKAEAEAERKAKEEAAKQAEEEAKAKaaaEAKKKAEEAKKKAEAEAKAKAEAEAKAKA-----EEAKAKAEAAKAKAAA- 207
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1034663586  794 nEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITAQ 835
Cdd:TIGR02794  208 -EAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEK 248
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
201-451 6.12e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 40.77  E-value: 6.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  201 EELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKR---KDKETGGILRSLEDALAEAQRVNTKSQSAFD------LKK 271
Cdd:TIGR04523  314 SELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKEltnSESENSEKQRELEEKQNEIEKLKKENQSYKQeiknleSQI 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  272 KNLACEESKRKELEKNMVEDSKTLAAKEKEVKKITDGLHALQEASNKDAEALAAAQQHFNAVSAGLSSNEDGAE---ATL 348
Cdd:TIGR04523  394 NDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLEtqlKVL 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  349 AGQMMACKNDISKAQTEAKQAQ---MKLKHAQQELKNKQAEVKKMDSGYRKDQEALEAVKRLKEK----LEAEMKKLNYE 421
Cdd:TIGR04523  474 SRSINKIKQNLEQKQKELKSKEkelKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESkisdLEDELNKDDFE 553
                          250       260       270
                   ....*....|....*....|....*....|
gi 1034663586  422 ENKEEsLLEKRRQLSRDIGRLKETYEALLA 451
Cdd:TIGR04523  554 LKKEN-LEKEIDEKNKEIEELKQTQKSLKK 582
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
691-850 6.84e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 40.21  E-value: 6.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  691 QQEELDALKKTIEEseetlknTKEIQRKAEEKyevleNKMKNAEAERERELKDAQKKLDCAKTKADASSKKMKEKQQEVE 770
Cdd:TIGR02794   56 QQQKKPAAKKEQER-------QKKLEQQAEEA-----EKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEE 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  771 AITLELEELKREHTSYKQQlEAVNEAIKSYESQIEVMAAEVAKNK--ESVNKAQEEVTKQKEVITAQDTVIKAKYAEVAK 848
Cdd:TIGR02794  124 AKAKQAAEAKAKAEAEAER-KAKEEAAKQAEEEAKAKAAAEAKKKaeEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAK 202

                   ..
gi 1034663586  849 HK 850
Cdd:TIGR02794  203 AK 204
PRK01156 PRK01156
chromosome segregation protein; Provisional
617-834 7.32e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 40.66  E-value: 7.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  617 ARSQAASILTKFQELKDVQDELRIKENELRALEEELAGLKNTaekyrqlkqQWEMKTEEADLLQTKLQQSSYHKQQEELD 696
Cdd:PRK01156   527 ARADLEDIKIKINELKDKHDKYEEIKNRYKSLKLEDLDSKRT---------SWLNALAVISLIDIETNRSRSNEIKKQLN 597
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  697 ALKKTIEESEETLKNTKEIQ----RKAEEKYEVLENKMKNAEaERERELKDAQKKLDCAKTKadasSKKMKEKQQEVEAI 772
Cdd:PRK01156   598 DLESRLQEIEIGFPDDKSYIdksiREIENEANNLNNKYNEIQ-ENKILIEKLRGKIDNYKKQ----IAEIDSIIPDLKEI 672
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034663586  773 TLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKESVNKAQEEVTKQKEVITA 834
Cdd:PRK01156   673 TSRINDIEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKA 734
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
646-805 8.12e-03

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 39.12  E-value: 8.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  646 RALEEELAGLKNTAEKYRQLKQQWEMKTEEadlLQTKLQQssyhkQQEELDALKKTIEESE---ETLKNTKEIQRKAEEK 722
Cdd:pfam13851   29 KSLKEEIAELKKKEERNEKLMSEIQQENKR---LTEPLQK-----AQEEVEELRKQLENYEkdkQSLKNLKARLKVLEKE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  723 -------YEVLENKMKNAEAER-------ERELKDAQKKLDCaktKADASSKKMKEKQQEVEAITLELEELKR----EHT 784
Cdd:pfam13851  101 lkdlkweHEVLEQRFEKVERERdelydkfEAAIQDVQQKTGL---KNLLLEKKLQALGETLEKKEAQLNEVLAaanlDPD 177
                          170       180
                   ....*....|....*....|.
gi 1034663586  785 SYKQQLEAVNEAIKSYESQIE 805
Cdd:pfam13851  178 ALQAVTEKLEDVLESKNQLIK 198
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
206-427 8.93e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 40.19  E-value: 8.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  206 MQDKVIKLQEELSENDKKIKALNHEIEELEKrKDKETGGILRSLEDALAEAQRVntksqsafdlkkknlacEESKRKELE 285
Cdd:pfam10174  399 LQKKIENLQEQLRDKDKQLAGLKERVKSLQT-DSSNTDTALTTLEEALSEKERI-----------------IERLKEQRE 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  286 KNMVEDSKTLAAKEKEVKKITDGLHALQ-EASNKDAEALAAAQQHFNAVSAGLssNEDGAEATLAGQMMACKNDISKAQT 364
Cdd:pfam10174  461 REDRERLEELESLKKENKDLKEKVSALQpELTEKESSLIDLKEHASSLASSGL--KKDSKLKSLEIAVEQKKEECSKLEN 538
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034663586  365 EAKQAQM-------------KLKHAQQELKNKQAEVKKMDSGYRKDQEAL---EAVKRLKEKLEAEMKKLNYEENKEES 427
Cdd:pfam10174  539 QLKKAHNaeeavrtnpeindRIRLLEQEVARYKEESGKAQAEVERLLGILrevENEKNDKDKKIAELESLTLRQMKEQN 617
COG5022 COG5022
Myosin heavy chain [General function prediction only];
638-816 9.20e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 40.45  E-value: 9.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  638 LRIKENELRALEEELAGLKNTAEKYRQLKQQWEMKTE-EADLLQTKLQQSSYHK---------------QQEELDALKKT 701
Cdd:COG5022    805 LLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSlKAEVLIQKFGRSLKAKkrfsllkketiylqsAQRVELAERQL 884
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  702 IEESEE--TLKNTKEIQRKAEEKY----------EVLENKMKNAEAERERELK---DAQKKLDCAKTKADASSK------ 760
Cdd:COG5022    885 QELKIDvkSISSLKLVNLELESEIielkkslssdLIENLEFKTELIARLKKLLnniDLEEGPSIEYVKLPELNKlheves 964
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034663586  761 KMKEKQQEVEAITLELEELKREHTSYKQQLEAVNEAIKSYESQIEVMAAEVAKNKE 816
Cdd:COG5022    965 KLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKE 1020
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
191-320 9.48e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 40.23  E-value: 9.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034663586  191 LAEDTKVRSAEELKEMQDKVIKLQEELSENDKKIKALNHEIEELEKRKdketggilRSLEDALAEAQRVNTKSQSAFDLK 270
Cdd:COG2433    382 LEELIEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERLEAEV--------EELEAELEEKDERIERLERELSEA 453
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034663586  271 KKNLACEESKRKELEK--NMVED-SKTLAAKEKEVKKITDGLHALQEASNKDA 320
Cdd:COG2433    454 RSEERREIRKDREISRldREIERlERELEEERERIEELKRKLERLKELWKLEH 506
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH