NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2004286192|pdb|6X6S|HY]
View 

Chain HY, Cag pathogenicity island protein (Cag7)

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
VirB10 cd16429
VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains ...
1692-1888 2.01e-53

VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains VirB10, a component of the type IV secretion system (T4SS), including homologs TrbI, TraF, TrwE and TraL. T4S system is employed by pathogenic bacteria to export virulence DNAs and/or proteins directly from the bacterial cytoplasm into the host cell. It forms a large multiprotein complex consisting of 12 proteins termed VirB1-11 and VirD4. VirB10, interacts with VirB7 and VirB9, forming the membrane-spanning 'core complex' (CC), around which all other components assemble. The CC is inserted in both, the outer and inner membranes, playing a fundamental role as a scaffold for the rest of the T4SS components and actively participating in T4S substrate transfer through the bacterial envelope via conformational changes regulating channel opening and closing. TrwE in R33 plasmid has been shown to be anchored to the inner membrane and its C-terminal is necessary for conjugation; the transmembrane domains of TrwB and TrwE are involved in TrwB-TrwE interactions. TraF protein of the RP4 plasmid is involved in circularization of pilin subunits of P-type pili. In gonococcal genetic island (GGI) of Neisseria gonorrhoeae, T4SS encodes TrbI and circularization occurs via a covalent intermediate between the C terminus of putative pilin protein TraA and TrbI.


:

Pssm-ID: 319755  Cd Length: 180  Bit Score: 185.40  E-value: 2.01e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1692 LAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGVII 1770
Cdd:cd16429    1 LPAGTVIPAVLETAINSDLPGQVRAQVTRDVYDSTGRVLLIPKGSRLIGEYNSgVAQG---QSRVFVVWTRLITPDGVSI 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1771 PLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdkliglgkgrSERTPEFNYALGQAINGSMQS 1850
Cdd:cd16429   78 DLDSMPGADALGRAGLKGDVDNHFWERFGGALLLSLISDGAQAA---------------SSSASNSNSAGQAAGSNTGQS 142
                        170       180       190
                 ....*....|....*....|....*....|....*...
6X6S_HY     1851 SAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFS 1888
Cdd:cd16429  143 LGQVASQILERNLNIPPTITIRQGTRINIFVARDLDFS 180
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
9-73 7.78e-28

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


:

Pssm-ID: 405299  Cd Length: 65  Bit Score: 107.80  E-value: 7.78e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY          9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSE 73
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
139-203 7.10e-26

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


:

Pssm-ID: 405299  Cd Length: 65  Bit Score: 102.41  E-value: 7.10e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY        139 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSE 203
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
269-325 1.65e-21

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


:

Pssm-ID: 405299  Cd Length: 65  Bit Score: 89.70  E-value: 1.65e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 325
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 57
PTZ00121 super family cl31754
MAEBL; Provisional
846-1348 5.07e-14

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 78.26  E-value: 5.07e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       846 KTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKteaEKKECEKLLTPEARKLLEE*AKESVKAYLDcvsqaknE 925
Cdd:PTZ00121 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE---EARKADELKKAEEKKKADEAKKAEEKKKAD-------E 1306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       926 AEKKECEKLLTLESKKKLEEAKKSVKayldcvsqaktEAEKKECEKLLTPEAKKLLEQQALDCLKNA--KTEADKKRCVK 1003
Cdd:PTZ00121 1307 AKKKAEEAKKADEAKKKAEEAKKKAD-----------AAKKKAEEAKKAAEAAKAEAEAAADEAEAAeeKAEAAEKKKEE 1375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1004 DLPKDLQKKVLAKESLKAykDCVSKARNEKEKKECEKLLTPEAKKLLEEAKKSVKAyldcvSQAKTEAEKKECEKLLTPE 1083
Cdd:PTZ00121 1376 AKKKADAAKKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE-----KKKADEAKKKAEEAKKADE 1448
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1084 ARKLLEEAKESVKAYKDCVSKAR-NEKEKKECEKLLTPEAKKLLEQqvldclknAKTEADKKRCVKDLPKDLQKKVLAKE 1162
Cdd:PTZ00121 1449 AKKKAEEAKKAEEAKKKAEEAKKaDEAKKKAEEAKKADEAKKKAEE--------AKKKADEAKKAAEAKKKADEAKKAEE 1520
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1163 SVKAyldcvSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK----DCLSQARNEEERRACEKLLTPEARKLLEQEVK 1238
Cdd:PTZ00121 1521 AKKA-----DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKaeekKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1239 KSIKAYLDCVSRARNEKEKKECEKLLTPEARKFL-AKQVLNCLEKAGNEEERKAclKNLPKDLQENILAKESLKAY---- 1313
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEeEKKKVEQLKKKEAEEKKKA--EELKKAEEENKIKAAEEAKKaeed 1673
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
6X6S_HY      1314 KDCLSQARNEEE----------RRACEKLLTPEARKLLEQEVKKS 1348
Cdd:PTZ00121 1674 KKKAEEAKKAEEdekkaaealkKEAEEAKKAEELKKKEAEEKKKA 1718
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
622-653 4.51e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


:

Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 39.45  E-value: 4.51e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        622 AYKDCVSKARNEKEKQECEKLLTPEARKKLEQ 653
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
553-584 6.62e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


:

Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 38.68  E-value: 6.62e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        553 EYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQ 584
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
SMC_N super family cl47134
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
109-942 7.90e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


The actual alignment was detected with superfamily member pfam02463:

Pssm-ID: 481474 [Multi-domain]  Cd Length: 1161  Bit Score: 44.58  E-value: 7.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        109 LDEETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSpQDLSNEEATEANHFENLLKESKESSDhhldnpTETQTNF 188
Cdd:pfam02463  200 LKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI-DLLQELLRDEQEEIESSKQEIEKEEE------KLAQVLK 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        189 DGDKSEETQTQMDSEGNETSessngSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDD 268
Cdd:pfam02463  273 ENKEEEKEKKLQEEELKLLA-----KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKEL 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH-----HLDNPTETQTNFDGDKSEEITDDSNDQEIIKGSKKKY 343
Cdd:pfam02463  348 EIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERlssaaKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKE 427
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        344 IIGGIVVAVLIVIILFSRsifhyfmpLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTL 423
Cdd:pfam02463  428 ELEILEEEEESIELKQGK--------LTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERS 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        424 YNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSL 503
Cdd:pfam02463  500 QKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARK 579
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        504 LNQQKVQVALDCLKNAKtdEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEaEKNKCLKGLSKEAIERLK 583
Cdd:pfam02463  580 LRLLIPKLKLPLKSIAV--LEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLK-ESAKAKESGLRKGVSLEE 656
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        584 QQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKQECEKLLTPEARKKLEQQVLDCLKNAK 663
Cdd:pfam02463  657 GLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINE 736
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        664 TDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKEcekllTPEAKKLLEEEAKESVKAYLDCVSQAKTEAE 743
Cdd:pfam02463  737 ELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEER-----EKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        744 KKECEKLLTPEAKKKLEEAKKSVKAYLdcvsrARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTDKERKKCLKDLPK 823
Cdd:pfam02463  812 EEAELLEEEQLLIEQEEKIKEEELEEL-----ALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKD 886
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        824 DLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKL 903
Cdd:pfam02463  887 ELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLL 966
                          810       820       830
                   ....*....|....*....|....*....|....*....
6X6S_HY        904 LEE*AKESVKAYLDCVSQAKNEAEKKECEKLLTLESKKK 942
Cdd:pfam02463  967 LAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
 
Name Accession Description Interval E-value
VirB10 cd16429
VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains ...
1692-1888 2.01e-53

VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains VirB10, a component of the type IV secretion system (T4SS), including homologs TrbI, TraF, TrwE and TraL. T4S system is employed by pathogenic bacteria to export virulence DNAs and/or proteins directly from the bacterial cytoplasm into the host cell. It forms a large multiprotein complex consisting of 12 proteins termed VirB1-11 and VirD4. VirB10, interacts with VirB7 and VirB9, forming the membrane-spanning 'core complex' (CC), around which all other components assemble. The CC is inserted in both, the outer and inner membranes, playing a fundamental role as a scaffold for the rest of the T4SS components and actively participating in T4S substrate transfer through the bacterial envelope via conformational changes regulating channel opening and closing. TrwE in R33 plasmid has been shown to be anchored to the inner membrane and its C-terminal is necessary for conjugation; the transmembrane domains of TrwB and TrwE are involved in TrwB-TrwE interactions. TraF protein of the RP4 plasmid is involved in circularization of pilin subunits of P-type pili. In gonococcal genetic island (GGI) of Neisseria gonorrhoeae, T4SS encodes TrbI and circularization occurs via a covalent intermediate between the C terminus of putative pilin protein TraA and TrbI.


Pssm-ID: 319755  Cd Length: 180  Bit Score: 185.40  E-value: 2.01e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1692 LAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGVII 1770
Cdd:cd16429    1 LPAGTVIPAVLETAINSDLPGQVRAQVTRDVYDSTGRVLLIPKGSRLIGEYNSgVAQG---QSRVFVVWTRLITPDGVSI 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1771 PLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdkliglgkgrSERTPEFNYALGQAINGSMQS 1850
Cdd:cd16429   78 DLDSMPGADALGRAGLKGDVDNHFWERFGGALLLSLISDGAQAA---------------SSSASNSNSAGQAAGSNTGQS 142
                        170       180       190
                 ....*....|....*....|....*....|....*...
6X6S_HY     1851 SAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFS 1888
Cdd:cd16429  143 LGQVASQILERNLNIPPTITIRQGTRINIFVARDLDFS 180
T4SS_VirB10 NF038091
type IV secretion system protein VirB10; Members of this family are VirB10, an outer ...
1686-1893 1.49e-49

type IV secretion system protein VirB10; Members of this family are VirB10, an outer membrane-associated protein from the apparatus of protein type IV secretion systems (T4SS). The model attempts to exclude related TraI proteins of conjugal transfer systems as well as the ComB10 protein of a DNA-translocating competence protein of Helicobacter pylori. Because the N-terminal regions of VirB10 proteins are highly variable, the model


Pssm-ID: 468345  Cd Length: 197  Bit Score: 175.06  E-value: 1.49e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1686 SDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAIT 1764
Cdd:NF038091    5 GDRDLLLTQGTMIPCVLETAIDSDLPGMVRCVVTRDVYSANGRVVLIERGSRVIGEYQSgLSQG---QARVFVLWTRIET 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1765 PDGVIIPLANAqAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdkligLGKGRSERTPEFNYalGQAI 1844
Cdd:NF038091   82 PDGVIVNLDSP-GTDELGRSGLPGYVDNHFWERFGGAILLSLIGDAGQYA----------ANKGQNGNGSGNGN--SVNF 148
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
6X6S_HY      1845 NGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDV 1893
Cdd:NF038091  149 ENTSQAASDMASEALENTINIPPTLYKNQGDRINIFVARDLDFSGVYEL 197
VirB10 COG2948
Type IV secretory pathway, VirB10 component [Intracellular trafficking, secretion, and ...
1687-1891 1.55e-43

Type IV secretory pathway, VirB10 component [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442191 [Multi-domain]  Cd Length: 309  Bit Score: 161.66  E-value: 1.55e-43
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1687 DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITP 1765
Cdd:COG2948  123 DSPYLLPAGTVIPAVLETGINSDLPGQITAQVTEDVYSATGRVLLIPQGSRLIGEYDSgVSFG---QSRVLVVWTRLITP 199
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1766 DGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQtapiialdkligLGKGRSERTpefnyaLGQAI- 1844
Cdd:COG2948  200 DGVSIDLDSMPGTDALGRAGLEGQVDNHWWERFGAALLLSLLGDGAE------------LAASSDEGD------LSTAIr 261
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
6X6S_HY     1845 NGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVY 1891
Cdd:COG2948  262 ENTAQSLSQAAQQILERNLNIPPTLTIRQGTRVRVIVARDLDFSSPY 308
TrbI pfam03743
Bacterial conjugation TrbI-like protein; Although not essential for conjugation, the TrbI ...
1691-1886 5.68e-42

Bacterial conjugation TrbI-like protein; Although not essential for conjugation, the TrbI protein greatly increase the conjugational efficiency.


Pssm-ID: 427480  Cd Length: 187  Bit Score: 152.81  E-value: 5.68e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       1691 VLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWN-MNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGV 1768
Cdd:pfam03743    1 YLPAGTVIPAVLITGINSDLPGQVTAQVTEPVYDsATGRALLIPKGSRLIGEYGSgVAYG---QERVLVVWTRLICPDGV 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       1769 IIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdklIGLGKGRSERTPEFNYALGQAINGSM 1848
Cdd:pfam03743   78 SIDLDSPGGVDALGRAGLRGRVDNHFGQRFGAALLLSLLGGGGELA--------NAASSNSTSTSTGNDNLLGAVGSGAG 149
                          170       180       190
                   ....*....|....*....|....*....|....*...
6X6S_HY       1849 QSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDID 1886
Cdd:pfam03743  150 QALSQLAQQILERALNIPPTITVRQGTRVRIIVNRDLD 187
VirB10_subf NF041422
VirB10/TraB/TrbI family type IV secretion system protein;
1687-1895 4.23e-30

VirB10/TraB/TrbI family type IV secretion system protein;


Pssm-ID: 469313  Cd Length: 404  Bit Score: 125.13  E-value: 4.23e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1687 DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS--VKGGtpiMTRLMIVFTKAIT 1764
Cdd:NF041422  210 DPDLYIPENTYIPCSLDRRFVSDVAGKISCTISEDVYSASGHVKLIPKGTTARGVYKTgtLKHG---QGRMFIIWTELRT 286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1765 PD--GVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPiialdkliGLGKGRSERTpefNYAlgq 1842
Cdd:NF041422  287 PEppYLDIPLVDTQAAGQLGEAGIDGWIDTHFWERFGNALMLSTVQDVAAAAS--------GSAPGKDRNT---DYT--- 352
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
6X6S_HY      1843 aiNGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKI 1895
Cdd:NF041422  353 --ENSRQATAEMAKTALENSINIPPTLYKNQGDIIGLITGQDIDFSSIYQLRM 403
PRK13855 PRK13855
type IV secretion system protein VirB10; Provisional
1688-1900 3.05e-29

type IV secretion system protein VirB10; Provisional


Pssm-ID: 172376  Cd Length: 376  Bit Score: 121.93  E-value: 3.05e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1688 PTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQsvKGGTPIMTRLMIVFTKAITPDG 1767
Cdd:PRK13855  178 PDFMVTQGTIIPCILQTAIDTNLAGYVKCVLPQDIRGTTNNVVLLDRGTTVVGEIQ--RGLQQGDARVFVLWDRAETPDH 255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1768 VIIPLAnAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIAldkliglgkGRSERTPEFNyalgqaingS 1847
Cdd:PRK13855  256 AMISLS-SPGADELGRSGLPGTVDNHFWQRFSGAMLLSVVQGAFQAASTYA---------GSSGGGTSFN---------S 316
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
6X6S_HY      1848 MQSS-AQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSV 1900
Cdd:PRK13855  317 FQNNgEQTADTALKATINIPPTLKKNQGDTVSIFVARDLDFSGIYQLRMTGGAT 370
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
9-73 7.78e-28

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 107.80  E-value: 7.78e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY          9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSE 73
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
139-203 7.10e-26

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 102.41  E-value: 7.10e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY        139 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSE 203
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
269-325 1.65e-21

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 89.70  E-value: 1.65e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 325
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 57
PTZ00121 PTZ00121
MAEBL; Provisional
846-1348 5.07e-14

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 78.26  E-value: 5.07e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       846 KTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKteaEKKECEKLLTPEARKLLEE*AKESVKAYLDcvsqaknE 925
Cdd:PTZ00121 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE---EARKADELKKAEEKKKADEAKKAEEKKKAD-------E 1306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       926 AEKKECEKLLTLESKKKLEEAKKSVKayldcvsqaktEAEKKECEKLLTPEAKKLLEQQALDCLKNA--KTEADKKRCVK 1003
Cdd:PTZ00121 1307 AKKKAEEAKKADEAKKKAEEAKKKAD-----------AAKKKAEEAKKAAEAAKAEAEAAADEAEAAeeKAEAAEKKKEE 1375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1004 DLPKDLQKKVLAKESLKAykDCVSKARNEKEKKECEKLLTPEAKKLLEEAKKSVKAyldcvSQAKTEAEKKECEKLLTPE 1083
Cdd:PTZ00121 1376 AKKKADAAKKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE-----KKKADEAKKKAEEAKKADE 1448
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1084 ARKLLEEAKESVKAYKDCVSKAR-NEKEKKECEKLLTPEAKKLLEQqvldclknAKTEADKKRCVKDLPKDLQKKVLAKE 1162
Cdd:PTZ00121 1449 AKKKAEEAKKAEEAKKKAEEAKKaDEAKKKAEEAKKADEAKKKAEE--------AKKKADEAKKAAEAKKKADEAKKAEE 1520
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1163 SVKAyldcvSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK----DCLSQARNEEERRACEKLLTPEARKLLEQEVK 1238
Cdd:PTZ00121 1521 AKKA-----DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKaeekKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1239 KSIKAYLDCVSRARNEKEKKECEKLLTPEARKFL-AKQVLNCLEKAGNEEERKAclKNLPKDLQENILAKESLKAY---- 1313
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEeEKKKVEQLKKKEAEEKKKA--EELKKAEEENKIKAAEEAKKaeed 1673
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
6X6S_HY      1314 KDCLSQARNEEE----------RRACEKLLTPEARKLLEQEVKKS 1348
Cdd:PTZ00121 1674 KKKAEEAKKAEEdekkaaealkKEAEEAKKAEELKKKEAEEKKKA 1718
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
17-331 4.13e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 55.30  E-value: 4.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        17 DSPQDLSNEEATEANhfenllkESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADklfkkarkl 96
Cdd:NF033609  568 DSGSDSSNSDSGSDS-------GSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA--------- 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        97 vdnkkpftQQKNLDEETQELNEEDDQENNEYQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH 176
Cdd:NF033609  632 --------SDSDSASDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       177 HLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADklfKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNE 256
Cdd:NF033609  703 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 779
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY       257 YQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSN 331
Cdd:NF033609  780 SDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 853
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-214 2.59e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 2.59e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY         9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADK 88
Cdd:NF033609  665 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        89 LfKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDDETSkKTQQHSPQDLSNEEATEANHFENLLK 168
Cdd:NF033609  745 D-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDS 822
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
6X6S_HY       169 ESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGS 214
Cdd:NF033609  823 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGS 868
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-266 3.12e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 3.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY         9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADk 88
Cdd:NF033609  659 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 737
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        89 lfkkarKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLK 168
Cdd:NF033609  738 ------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 810
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       169 ESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNE 248
Cdd:NF033609  811 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEA 890
                         250       260
                  ....*....|....*....|.
6X6S_HY       249 EDDQE---NNEYQEETQTDLI 266
Cdd:NF033609  891 KDSKEplpDTGSEDEANTSLI 911
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
1204-1235 2.04e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 40.22  E-value: 2.04e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY       1204 AYKDCLSQARNEEERRACEKLLTPEARKLLEQ 1235
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
790-970 2.64e-04

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 45.22  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        790 TPEAKKLLEQQALDCLKNAKTDKERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA---RKLL 866
Cdd:TIGR02794   66 EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAeaeRKAK 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        867 EEAKKsvKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEE*AKESVKAYldcVSQAKNEAEKKECEKlltlESKKKLEEA 946
Cdd:TIGR02794  146 EEAAK--QAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAE---EAKAKAEAAKAKAAA----EAAAKAEAE 216
                          170       180
                   ....*....|....*....|....
6X6S_HY        947 KKsvkayldcvSQAKTEAEKKECE 970
Cdd:TIGR02794  217 AA---------AAAAAEAERKADE 231
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
622-653 4.51e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 39.45  E-value: 4.51e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        622 AYKDCVSKARNEKEKQECEKLLTPEARKKLEQ 653
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
553-584 6.62e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 38.68  E-value: 6.62e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        553 EYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQ 584
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
109-942 7.90e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 44.58  E-value: 7.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        109 LDEETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSpQDLSNEEATEANHFENLLKESKESSDhhldnpTETQTNF 188
Cdd:pfam02463  200 LKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI-DLLQELLRDEQEEIESSKQEIEKEEE------KLAQVLK 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        189 DGDKSEETQTQMDSEGNETSessngSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDD 268
Cdd:pfam02463  273 ENKEEEKEKKLQEEELKLLA-----KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKEL 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH-----HLDNPTETQTNFDGDKSEEITDDSNDQEIIKGSKKKY 343
Cdd:pfam02463  348 EIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERlssaaKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKE 427
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        344 IIGGIVVAVLIVIILFSRsifhyfmpLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTL 423
Cdd:pfam02463  428 ELEILEEEEESIELKQGK--------LTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERS 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        424 YNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSL 503
Cdd:pfam02463  500 QKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARK 579
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        504 LNQQKVQVALDCLKNAKtdEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEaEKNKCLKGLSKEAIERLK 583
Cdd:pfam02463  580 LRLLIPKLKLPLKSIAV--LEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLK-ESAKAKESGLRKGVSLEE 656
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        584 QQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKQECEKLLTPEARKKLEQQVLDCLKNAK 663
Cdd:pfam02463  657 GLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINE 736
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        664 TDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKEcekllTPEAKKLLEEEAKESVKAYLDCVSQAKTEAE 743
Cdd:pfam02463  737 ELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEER-----EKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        744 KKECEKLLTPEAKKKLEEAKKSVKAYLdcvsrARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTDKERKKCLKDLPK 823
Cdd:pfam02463  812 EEAELLEEEQLLIEQEEKIKEEELEEL-----ALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKD 886
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        824 DLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKL 903
Cdd:pfam02463  887 ELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLL 966
                          810       820       830
                   ....*....|....*....|....*....|....*....
6X6S_HY        904 LEE*AKESVKAYLDCVSQAKNEAEKKECEKLLTLESKKK 942
Cdd:pfam02463  967 LAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
 
Name Accession Description Interval E-value
VirB10 cd16429
VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains ...
1692-1888 2.01e-53

VirB10 forms part of core complex in Type IV secretion system (T4SS); This family contains VirB10, a component of the type IV secretion system (T4SS), including homologs TrbI, TraF, TrwE and TraL. T4S system is employed by pathogenic bacteria to export virulence DNAs and/or proteins directly from the bacterial cytoplasm into the host cell. It forms a large multiprotein complex consisting of 12 proteins termed VirB1-11 and VirD4. VirB10, interacts with VirB7 and VirB9, forming the membrane-spanning 'core complex' (CC), around which all other components assemble. The CC is inserted in both, the outer and inner membranes, playing a fundamental role as a scaffold for the rest of the T4SS components and actively participating in T4S substrate transfer through the bacterial envelope via conformational changes regulating channel opening and closing. TrwE in R33 plasmid has been shown to be anchored to the inner membrane and its C-terminal is necessary for conjugation; the transmembrane domains of TrwB and TrwE are involved in TrwB-TrwE interactions. TraF protein of the RP4 plasmid is involved in circularization of pilin subunits of P-type pili. In gonococcal genetic island (GGI) of Neisseria gonorrhoeae, T4SS encodes TrbI and circularization occurs via a covalent intermediate between the C terminus of putative pilin protein TraA and TrbI.


Pssm-ID: 319755  Cd Length: 180  Bit Score: 185.40  E-value: 2.01e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1692 LAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGVII 1770
Cdd:cd16429    1 LPAGTVIPAVLETAINSDLPGQVRAQVTRDVYDSTGRVLLIPKGSRLIGEYNSgVAQG---QSRVFVVWTRLITPDGVSI 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1771 PLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdkliglgkgrSERTPEFNYALGQAINGSMQS 1850
Cdd:cd16429   78 DLDSMPGADALGRAGLKGDVDNHFWERFGGALLLSLISDGAQAA---------------SSSASNSNSAGQAAGSNTGQS 142
                        170       180       190
                 ....*....|....*....|....*....|....*...
6X6S_HY     1851 SAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFS 1888
Cdd:cd16429  143 LGQVASQILERNLNIPPTITIRQGTRINIFVARDLDFS 180
T4SS_VirB10 NF038091
type IV secretion system protein VirB10; Members of this family are VirB10, an outer ...
1686-1893 1.49e-49

type IV secretion system protein VirB10; Members of this family are VirB10, an outer membrane-associated protein from the apparatus of protein type IV secretion systems (T4SS). The model attempts to exclude related TraI proteins of conjugal transfer systems as well as the ComB10 protein of a DNA-translocating competence protein of Helicobacter pylori. Because the N-terminal regions of VirB10 proteins are highly variable, the model


Pssm-ID: 468345  Cd Length: 197  Bit Score: 175.06  E-value: 1.49e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1686 SDPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAIT 1764
Cdd:NF038091    5 GDRDLLLTQGTMIPCVLETAIDSDLPGMVRCVVTRDVYSANGRVVLIERGSRVIGEYQSgLSQG---QARVFVLWTRIET 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1765 PDGVIIPLANAqAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdkligLGKGRSERTPEFNYalGQAI 1844
Cdd:NF038091   82 PDGVIVNLDSP-GTDELGRSGLPGYVDNHFWERFGGAILLSLIGDAGQYA----------ANKGQNGNGSGNGN--SVNF 148
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
6X6S_HY      1845 NGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDV 1893
Cdd:NF038091  149 ENTSQAASDMASEALENTINIPPTLYKNQGDRINIFVARDLDFSGVYEL 197
VirB10 COG2948
Type IV secretory pathway, VirB10 component [Intracellular trafficking, secretion, and ...
1687-1891 1.55e-43

Type IV secretory pathway, VirB10 component [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442191 [Multi-domain]  Cd Length: 309  Bit Score: 161.66  E-value: 1.55e-43
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1687 DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITP 1765
Cdd:COG2948  123 DSPYLLPAGTVIPAVLETGINSDLPGQITAQVTEDVYSATGRVLLIPQGSRLIGEYDSgVSFG---QSRVLVVWTRLITP 199
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1766 DGVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQtapiialdkligLGKGRSERTpefnyaLGQAI- 1844
Cdd:COG2948  200 DGVSIDLDSMPGTDALGRAGLEGQVDNHWWERFGAALLLSLLGDGAE------------LAASSDEGD------LSTAIr 261
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
6X6S_HY     1845 NGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVY 1891
Cdd:COG2948  262 ENTAQSLSQAAQQILERNLNIPPTLTIRQGTRVRVIVARDLDFSSPY 308
TrbI pfam03743
Bacterial conjugation TrbI-like protein; Although not essential for conjugation, the TrbI ...
1691-1886 5.68e-42

Bacterial conjugation TrbI-like protein; Although not essential for conjugation, the TrbI protein greatly increase the conjugational efficiency.


Pssm-ID: 427480  Cd Length: 187  Bit Score: 152.81  E-value: 5.68e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       1691 VLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWN-MNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGV 1768
Cdd:pfam03743    1 YLPAGTVIPAVLITGINSDLPGQVTAQVTEPVYDsATGRALLIPKGSRLIGEYGSgVAYG---QERVLVVWTRLICPDGV 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       1769 IIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTApiialdklIGLGKGRSERTPEFNYALGQAINGSM 1848
Cdd:pfam03743   78 SIDLDSPGGVDALGRAGLRGRVDNHFGQRFGAALLLSLLGGGGELA--------NAASSNSTSTSTGNDNLLGAVGSGAG 149
                          170       180       190
                   ....*....|....*....|....*....|....*...
6X6S_HY       1849 QSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDID 1886
Cdd:pfam03743  150 QALSQLAQQILERALNIPPTITVRQGTRVRIIVNRDLD 187
VirB10_like cd16426
VirB10 and similar proteins form part of core complex in Type IV secretion system (T4SS); This ...
1692-1888 1.28e-35

VirB10 and similar proteins form part of core complex in Type IV secretion system (T4SS); This family contains VirB10, a component of the type IV secretion system (T4SS) and its homologs, including TraB, TraF, IcmE, and similar proteins. T4S system is employed by pathogenic bacteria to export virulence DNAs and/or proteins directly from the bacterial cytoplasm into the host cell. It forms a large multiprotein complex consisting of 12 proteins termed VirB1-11 and VirD4. VirB10 interacts with VirB7 and VirB9, forming the membrane-spanning 'core complex' (CC), around which all other components assemble. The CC is inserted in both the outer and inner membranes, playing a fundamental role as a scaffold for the rest of the T4SS components and actively participating in T4S substrate transfer through the bacterial envelope via conformational changes regulating channel opening and closing. In Gram-negative bacterial pathogen Helicobacter pylori, an important aetiological agent of gastroduodenal disease in humans, the comB3 gene encodes protein with best homologies to TraS/TraB from the Pseudomonas aeruginosa conjugative plasmid RP1 and TrbI of plasmid RP4 and VirB10 from the Ti plasmid of Agrobacterium tumefaciens, as well as DotG/IcmE of Legionella pneumophila.


Pssm-ID: 319754 [Multi-domain]  Cd Length: 151  Bit Score: 133.05  E-value: 1.28e-35
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1692 LAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQsvkGGTPIMTRLMIVFTKAITPDGVIIP 1771
Cdd:cd16426    1 LLPGTIIPAVLLTGIDSDLPGPVLARVTRDVYDPNGGRLLIPKGSRLIGEYY---GDLSSQERVLVRWTRLICPDGVSID 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1772 LANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVnsflqtapiialdkliglgkgrsertpefnyalgqaingsmqsS 1851
Cdd:cd16426   78 LDSVAGADALGRAGLRGDVDNHFGERFGGALLASLL-------------------------------------------S 114
                        170       180       190
                 ....*....|....*....|....*....|....*..
6X6S_HY     1852 AQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFS 1888
Cdd:cd16426  115 GQLAQQYLQRAENIPPTITVNAGTRVNIVVARDVDLP 151
VirB10_subf NF041422
VirB10/TraB/TrbI family type IV secretion system protein;
1687-1895 4.23e-30

VirB10/TraB/TrbI family type IV secretion system protein;


Pssm-ID: 469313  Cd Length: 404  Bit Score: 125.13  E-value: 4.23e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1687 DPTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQS--VKGGtpiMTRLMIVFTKAIT 1764
Cdd:NF041422  210 DPDLYIPENTYIPCSLDRRFVSDVAGKISCTISEDVYSASGHVKLIPKGTTARGVYKTgtLKHG---QGRMFIIWTELRT 286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1765 PD--GVIIPLANAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPiialdkliGLGKGRSERTpefNYAlgq 1842
Cdd:NF041422  287 PEppYLDIPLVDTQAAGQLGEAGIDGWIDTHFWERFGNALMLSTVQDVAAAAS--------GSAPGKDRNT---DYT--- 352
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
6X6S_HY      1843 aiNGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKI 1895
Cdd:NF041422  353 --ENSRQATAEMAKTALENSINIPPTLYKNQGDIIGLITGQDIDFSSIYQLRM 403
PRK13855 PRK13855
type IV secretion system protein VirB10; Provisional
1688-1900 3.05e-29

type IV secretion system protein VirB10; Provisional


Pssm-ID: 172376  Cd Length: 376  Bit Score: 121.93  E-value: 3.05e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1688 PTFVLAQYTPIEITLTSKVDATLTGIVSGVVAKDVWNMNGTMILLDKGTKVYGNYQsvKGGTPIMTRLMIVFTKAITPDG 1767
Cdd:PRK13855  178 PDFMVTQGTIIPCILQTAIDTNLAGYVKCVLPQDIRGTTNNVVLLDRGTTVVGEIQ--RGLQQGDARVFVLWDRAETPDH 255
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1768 VIIPLAnAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVVNSFLQTAPIIAldkliglgkGRSERTPEFNyalgqaingS 1847
Cdd:PRK13855  256 AMISLS-SPGADELGRSGLPGTVDNHFWQRFSGAMLLSVVQGAFQAASTYA---------GSSGGGTSFN---------S 316
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
6X6S_HY      1848 MQSS-AQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVYDVKITNKSV 1900
Cdd:PRK13855  317 FQNNgEQTADTALKATINIPPTLKKNQGDTVSIFVARDLDFSGIYQLRMTGGAT 370
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
9-73 7.78e-28

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 107.80  E-value: 7.78e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY          9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSE 73
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
139-203 7.10e-26

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 102.41  E-value: 7.10e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY        139 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSE 203
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSNDQ 65
CagY_I pfam14585
CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the ...
269-325 1.65e-21

CagY type 1 repeat; This repeat is found at the N-terminus of the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells.


Pssm-ID: 405299  Cd Length: 65  Bit Score: 89.70  E-value: 1.65e-21
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 325
Cdd:pfam14585    1 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEE 57
PTZ00121 PTZ00121
MAEBL; Provisional
846-1348 5.07e-14

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 78.26  E-value: 5.07e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       846 KTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKteaEKKECEKLLTPEARKLLEE*AKESVKAYLDcvsqaknE 925
Cdd:PTZ00121 1237 KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAE---EARKADELKKAEEKKKADEAKKAEEKKKAD-------E 1306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       926 AEKKECEKLLTLESKKKLEEAKKSVKayldcvsqaktEAEKKECEKLLTPEAKKLLEQQALDCLKNA--KTEADKKRCVK 1003
Cdd:PTZ00121 1307 AKKKAEEAKKADEAKKKAEEAKKKAD-----------AAKKKAEEAKKAAEAAKAEAEAAADEAEAAeeKAEAAEKKKEE 1375
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1004 DLPKDLQKKVLAKESLKAykDCVSKARNEKEKKECEKLLTPEAKKLLEEAKKSVKAyldcvSQAKTEAEKKECEKLLTPE 1083
Cdd:PTZ00121 1376 AKKKADAAKKKAEEKKKA--DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE-----KKKADEAKKKAEEAKKADE 1448
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1084 ARKLLEEAKESVKAYKDCVSKAR-NEKEKKECEKLLTPEAKKLLEQqvldclknAKTEADKKRCVKDLPKDLQKKVLAKE 1162
Cdd:PTZ00121 1449 AKKKAEEAKKAEEAKKKAEEAKKaDEAKKKAEEAKKADEAKKKAEE--------AKKKADEAKKAAEAKKKADEAKKAEE 1520
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1163 SVKAyldcvSRARNEKEKKECEKLLTPEAKKLLEEAKESLKAYK----DCLSQARNEEERRACEKLLTPEARKLLEQEVK 1238
Cdd:PTZ00121 1521 AKKA-----DEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKaeekKKAEEAKKAEEDKNMALRKAEEAKKAEEARIE 1595
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1239 KSIKAYLDCVSRARNEKEKKECEKLLTPEARKFL-AKQVLNCLEKAGNEEERKAclKNLPKDLQENILAKESLKAY---- 1313
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEeEKKKVEQLKKKEAEEKKKA--EELKKAEEENKIKAAEEAKKaeed 1673
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*
6X6S_HY      1314 KDCLSQARNEEE----------RRACEKLLTPEARKLLEQEVKKS 1348
Cdd:PTZ00121 1674 KKKAEEAKKAEEdekkaaealkKEAEEAKKAEELKKKEAEEKKKA 1718
PTZ00121 PTZ00121
MAEBL; Provisional
386-1157 1.75e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 76.72  E-value: 1.75e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       386 NDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTLYNYLNIAEIEDKNPLRAFYECIsngGNYEECLKLIKDKKLQD 465
Cdd:PTZ00121 1038 NDDVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAF---GKAEEAKKTETGKAEEA 1114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       466 QMKKtlEAYNDCIKNAKTEEERiKCLDLIKDENLKKSLlNQQKVQVALDCLKNAKTDEERN-ECLKLINDPEIREKFRKE 544
Cdd:PTZ00121 1115 RKAE--EAKKKAEDARKAEEAR-KAEDARKAEEARKAE-DAKRVEIARKAEDARKAEEARKaEDAKKAEAARKAEEVRKA 1190
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       545 LELQKELQ----EYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQQALDCLKNAKTDEERNEclknipQDLQKELLADMSV 620
Cdd:PTZ00121 1191 EELRKAEDarkaEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNN------EEIRKFEEARMAH 1264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       621 KAYKDCVSKArneKEKQECEKLLTPEARKKLEQqvldclknAKTDEERKKcLKDLPKDLQSDILAKESLKAYKDCVSQAK 700
Cdd:PTZ00121 1265 FARRQAAIKA---EEARKADELKKAEEKKKADE--------AKKAEEKKK-ADEAKKKAEEAKKADEAKKKAEEAKKKAD 1332
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       701 tEAEKKECEKLLTPEAKKLLEEEAKESVKAYLDCVSQA--KTEAEKKECEKLltpEAKKKLEEAKKSVKAYLDCVSRARN 778
Cdd:PTZ00121 1333 -AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAekKKEEAKKKADAA---KKKAEEKKKADEAKKKAEEDKKKAD 1408
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       779 EKEKKECEKLLTPEAKKLLEQQALdcLKNAKTDKERKKCLKDLPKDLQKKVLAKESVKAyldcvsqaKTEAEKKECEKLL 858
Cdd:PTZ00121 1409 ELKKAAAAKKKADEAKKKAEEKKK--ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK--------AEEAKKADEAKKK 1478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       859 TPEARKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEE*AKESVKAYLDCVSQA---KNEAEKKECEKLL 935
Cdd:PTZ00121 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAeekKKADELKKAEELK 1558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       936 TLESKKKLEEAKKSVKAYLDCVSQAKteaEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDLQKKVLA 1015
Cdd:PTZ00121 1559 KAEEKKKAEEAKKAEEDKNMALRKAE---EAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1016 KESLKAYKDCVSKARNEKEKKECEKLLTPEAKKLLEEAKKSVKayldcvSQAKTEAEKKECEKLLTPEARKlLEEAKESV 1095
Cdd:PTZ00121 1636 EQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE------EAKKAEEDEKKAAEALKKEAEE-AKKAEELK 1708
                         730       740       750       760       770       780
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
6X6S_HY      1096 KAYKDCVSKARNEKEKKECEKLLTPEAKKLLEQQVLDClKNAKTEADKKRCVKDLPKDLQKK 1157
Cdd:PTZ00121 1709 KKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA-EEAKKDEEEKKKIAHLKKEEEKK 1769
PRK13881 PRK13881
conjugal transfer protein TrbI; Provisional
1701-1891 1.25e-08

conjugal transfer protein TrbI; Provisional


Pssm-ID: 237541  Cd Length: 472  Bit Score: 59.79  E-value: 1.25e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1701 TLTSKVDATLTGIVSGVVAKDVWNM-NGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMIVFTKAITPDGVIIPLANAQAA 1778
Cdd:PRK13881  288 TLISGINSDLPGQIMAQVSQNVYDTaTGKHLLIPQGSRLVGSYSSdVAYG---QARVLVAWQRIVFPDGKAMDIGAMPGA 364
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1779 GMLGEAGVDGYVNNHFMKRIGFAVIASVVnsflqTAPiialdklIGLGKGRSERTPEFNY-----ALGQAINgsmQSSAQ 1853
Cdd:PRK13881  365 DSAGYAGFRDQVNNHYVRIFGSAFLMSGV-----TAG-------ITYSQDQNQNNNGYGQptassALSEALG---QQLGQ 429
                         170       180       190
                  ....*....|....*....|....*....|....*...
6X6S_HY      1854 MSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVY 1891
Cdd:PRK13881  430 VTAQLIAKNLNIAPTLEIRPGYRFNVIVTKDMTFSKPY 467
IcmE cd16431
DotG/IcmE is a homolog of VirB10 which forms part of core complex in Type IV secretion system; ...
1734-1885 2.02e-08

DotG/IcmE is a homolog of VirB10 which forms part of core complex in Type IV secretion system; This family contains DotG/IcmE (VirB10 homolog) and a component of the type IV secretion system (T4SS), and similar proteins. The Dot/Icm system is a T4SS found in the pathogens Legionella and Coxiella and the conjugative apparatus of IncI plasmids; T4SS is employed by pathogenic bacteria to export virulence DNAs and/or proteins directly from the bacterial cytoplasm into the host cell. Similar to T4SS VirB/D components, the Legionella Dot/Icm secretion apparatus contains a critical five-protein sub-assembly that forms the membrane-spanning 'core-complex' (CC), around which all other components assemble. This transmembrane connection is mediated by protein dimer pairs consisting of two inner membrane proteins, DotF and DotG, each independently associating with DotH/DotC/DotD in the outer membrane.


Pssm-ID: 319757  Cd Length: 179  Bit Score: 56.05  E-value: 2.02e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY     1734 KGTKVYGNYQSVKggtpimTRLMIVFTKAITPDGVIIPLaNAQAAGMLGEAGVDGYVNNHFMKRIGFAVIASVV----NS 1809
Cdd:cd16431   34 KGARLIGSFERSG------DGVVITFTTMVLPGGKSYPI-DAYALDDRTLSAVASDVDNHYLSRYGLPAAAAFLqglgQA 106
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
6X6S_HY     1810 FLQTAPIIAldkLIGLGKGRSER-TPEFNYALGQAINGSMQssaQMSnQILGQLMNIPPSFYKNEGDSIKILTMDDI 1885
Cdd:cd16431  107 YQQAGTTTT---STSGGTVTVTTgTPDGKQAAGAAAGGVGQ---QLG-QVLAQDANRPPTVTVPAGTPIGILFMSDV 176
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
17-331 4.13e-07

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 55.30  E-value: 4.13e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        17 DSPQDLSNEEATEANhfenllkESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADklfkkarkl 96
Cdd:NF033609  568 DSGSDSSNSDSGSDS-------GSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA--------- 631
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        97 vdnkkpftQQKNLDEETQELNEEDDQENNEYQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH 176
Cdd:NF033609  632 --------SDSDSASDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 702
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       177 HLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADklfKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNE 256
Cdd:NF033609  703 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 779
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
6X6S_HY       257 YQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTNFDGDKSEEITDDSN 331
Cdd:NF033609  780 SDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 853
PRK13831 PRK13831
conjugal transfer protein TrbI; Provisional
1688-1891 4.24e-07

conjugal transfer protein TrbI; Provisional


Pssm-ID: 172358 [Multi-domain]  Cd Length: 432  Bit Score: 54.74  E-value: 4.24e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1688 PTFVLAQYTPIEITLTSKVDATL-TGIVSGV-------VAKDVWN-MNGTMILLDKGTKVYGNYQS-VKGGtpiMTRLMI 1757
Cdd:PRK13831  233 PNRVVPQQSPYELKRGSVIPATLiTGINSDLpgritaqVSQNVYDsATGHRLLIPQGTKLFGRYDSkVSFG---QSRVLV 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY      1758 VFTKAITPDGviiplANAQAAGMlgeAGVD--GY------VNNHFMKRIGFAVIasvvnsflqtapiIALdklIGLGKGR 1829
Cdd:PRK13831  310 VWTDIIFPNG-----STLQIGGM---AGTDaeGYggfkdkVDRHYLRTFGSAIL-------------IAL---IGTGIDM 365
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
6X6S_HY      1830 SerTPEFNYALGQ------AINGSMQSSAQMSNQILGQLMNIPPSFYKNEGDSIKILTMDDIDFSGVY 1891
Cdd:PRK13831  366 A--VPESSTLATQdtasdaARRNFAETFGRVAERTISKNLNVQPTLEIRPGYKFNVLVDQDIVFPGAY 431
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-214 2.59e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 2.59e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY         9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADK 88
Cdd:NF033609  665 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 744
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        89 LfKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDDETSkKTQQHSPQDLSNEEATEANHFENLLK 168
Cdd:NF033609  745 D-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDS 822
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
6X6S_HY       169 ESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGS 214
Cdd:NF033609  823 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGS 868
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
9-266 3.12e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.60  E-value: 3.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY         9 ETSKKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSNGSLADk 88
Cdd:NF033609  659 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD- 737
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        89 lfkkarKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDlIDDETSKKTQQHSPQDLSNEEATEANHFENLLK 168
Cdd:NF033609  738 ------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 810
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       169 ESKESSDHHLDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNE 248
Cdd:NF033609  811 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEA 890
                         250       260
                  ....*....|....*....|.
6X6S_HY       249 EDDQE---NNEYQEETQTDLI 266
Cdd:NF033609  891 KDSKEplpDTGSEDEANTSLI 911
PRK08581 PRK08581
amidase domain-containing protein;
21-347 4.12e-06

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 51.71  E-value: 4.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        21 DLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGNETSESSN--GSLADKLFKKARKLVD 98
Cdd:PRK08581   29 DPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNiiDFIYKNLPQTNINQLL 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        99 NKKPFTQQKNLDEETQEL--NEEDDQENNEYQEETQTDLIDDETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH 176
Cdd:PRK08581  109 TKNKYDDNYSLTTLIQNLfnLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSN 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       177 hlDNPTETQTNFDGDKSEETQTQMDSeGNETSESSNGSLADKLfkkarklvdnkkpftqqknLDEETQELNEEDDQENNE 256
Cdd:PRK08581  189 --KQPNSPKPTQPNQSNSQPASDDTA-NQKSSSKDNQSMSDSA-------------------LDSILDQYSEDAKKTQKD 246
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       257 YQEETQTDliddeTSKKTQQHSPQDLSNEE----ATEANHFENLLKESKESSDHHLDNPTETQTNfDGDKSEEITDDSND 332
Cdd:PRK08581  247 YASQSKKD-----KTETSNTKNPQLPTQDElkhkSKPAQSFENDVNQSNTRSTSLFETGPSLSNN-DDSGSFNVVDSKDT 320
                         330
                  ....*....|....*.
6X6S_HY       333 QEIIKG-SKKKYIIGG 347
Cdd:PRK08581  321 RQFIKSiAKDAHRIGQ 336
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
12-260 9.58e-06

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 50.94  E-value: 9.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        12 KKAQQDSPQDLSNEEATEANHFENLLKESKESSDHHLDNPTETQTHFDGDKSEETQTQMDSEGN---ETSESSNGSLADK 88
Cdd:PTZ00341  903 KKAKKKDAKDLSGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDAEENVEEDAEENVEENveeNVEENVEENVEEN 982
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        89 LFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSPQdlsNEEATEANHFENlLK 168
Cdd:PTZ00341  983 VEENVEENVEENVEENVEENIEENVEENVEENIEENVEEYDEENVEEVEENVEEYDEENVEE---IEENAEENVEEN-IE 1058
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       169 ESKESSDHhlDNPTETQTNFDGDKSEETQTQMDSEGNETSESSNGSLADKLFKKArklvdnkkpftqQKNLDEETQELNE 248
Cdd:PTZ00341 1059 ENIEEYDE--ENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEENAEENA------------EENAEENAEEYDD 1124
                         250
                  ....*....|..
6X6S_HY       249 EDDQENNEYQEE 260
Cdd:PTZ00341 1125 ENPEEHNEEYDE 1136
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
807-1009 1.64e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 46.34  E-value: 1.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       807 NAKTDKERKKCLKDLPKDLQKKVLA-KESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKAyldcVSQAKT 885
Cdd:PRK09510   74 AKRAEEQRKKKEQQQAEELQQKQAAeQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAA----AAKAKA 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY       886 EAEKKECE---KLLTPEARKLLEE*AKESVKAyldcvsQAKNEAEKKECEKLLTLESKKKLEEAKKSVKAyldcVSQAKT 962
Cdd:PRK09510  150 EAEAKRAAaaaKKAAAEAKKKAEAEAAKKAAA------EAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAA----EAKKKA 219
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
6X6S_HY       963 EAEKKECEKLLTPEAKKLLEQQALDCLKNAKTEADKKRCVKDLPKDL 1009
Cdd:PRK09510  220 AAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLFGGL 266
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
1204-1235 2.04e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 40.22  E-value: 2.04e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY       1204 AYKDCLSQARNEEERRACEKLLTPEARKLLEQ 1235
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
1312-1343 2.04e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 40.22  E-value: 2.04e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY       1312 AYKDCLSQARNEEERRACEKLLTPEARKLLEQ 1343
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
790-970 2.64e-04

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 45.22  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        790 TPEAKKLLEQQALDCLKNAKTDKERKKCLKDLPKDLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEA---RKLL 866
Cdd:TIGR02794   66 EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAeaeRKAK 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        867 EEAKKsvKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEE*AKESVKAYldcVSQAKNEAEKKECEKlltlESKKKLEEA 946
Cdd:TIGR02794  146 EEAAK--QAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAE---EAKAKAEAAKAKAAA----EAAAKAEAE 216
                          170       180
                   ....*....|....*....|....
6X6S_HY        947 KKsvkayldcvSQAKTEAEKKECE 970
Cdd:TIGR02794  217 AA---------AAAAAEAERKADE 231
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
622-653 4.51e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 39.45  E-value: 4.51e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        622 AYKDCVSKARNEKEKQECEKLLTPEARKKLEQ 653
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
553-584 6.62e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 38.68  E-value: 6.62e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        553 EYKDCIKNAKTEAEKNKCLKGLSKEAIERLKQ 584
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
109-942 7.90e-04

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 44.58  E-value: 7.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        109 LDEETQELNEEDDQENNEYQEETQTDLIDDETSKKTQQHSpQDLSNEEATEANHFENLLKESKESSDhhldnpTETQTNF 188
Cdd:pfam02463  200 LKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERI-DLLQELLRDEQEEIESSKQEIEKEEE------KLAQVLK 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        189 DGDKSEETQTQMDSEGNETSessngSLADKLFKKARKLVDNKKPFTQQKNLDEETQELNEEDDQENNEYQEETQTDLIDD 268
Cdd:pfam02463  273 ENKEEEKEKKLQEEELKLLA-----KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKEL 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        269 ETSKKTQQHSPQDLSNEEATEANHFENLLKESKESSDH-----HLDNPTETQTNFDGDKSEEITDDSNDQEIIKGSKKKY 343
Cdd:pfam02463  348 EIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERlssaaKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKE 427
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        344 IIGGIVVAVLIVIILFSRsifhyfmpLEDKSSRFSKDRNLYVNDEIQIRQEYNRLLKERNEKGNMIDKNLFFNDDPNRTL 423
Cdd:pfam02463  428 ELEILEEEEESIELKQGK--------LTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERS 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        424 YNYLNIAEIEDKNPLRAFYECISNGGNYEECLKLIKDKKLQDQMKKTLEAYNDCIKNAKTEEERIKCLDLIKDENLKKSL 503
Cdd:pfam02463  500 QKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARK 579
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        504 LNQQKVQVALDCLKNAKtdEERNECLKLINDPEIREKFRKELELQKELQEYKDCIKNAKTEaEKNKCLKGLSKEAIERLK 583
Cdd:pfam02463  580 LRLLIPKLKLPLKSIAV--LEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLK-ESAKAKESGLRKGVSLEE 656
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        584 QQALDCLKNAKTDEERNECLKNIPQDLQKELLADMSVKAYKDCVSKARNEKEKQECEKLLTPEARKKLEQQVLDCLKNAK 663
Cdd:pfam02463  657 GLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINE 736
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        664 TDEERKKCLKDLPKDLQSDILAKESLKAYKDCVSQAKTEAEKKEcekllTPEAKKLLEEEAKESVKAYLDCVSQAKTEAE 743
Cdd:pfam02463  737 ELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEER-----EKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        744 KKECEKLLTPEAKKKLEEAKKSVKAYLdcvsrARNEKEKKECEKLLTPEAKKLLEQQALDCLKNAKTDKERKKCLKDLPK 823
Cdd:pfam02463  812 EEAELLEEEQLLIEQEEKIKEEELEEL-----ALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKD 886
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
6X6S_HY        824 DLQKKVLAKESVKAYLDCVSQAKTEAEKKECEKLLTPEARKLLEEAKKSVKAYLDCVSQAKTEAEKKECEKLLTPEARKL 903
Cdd:pfam02463  887 ELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLL 966
                          810       820       830
                   ....*....|....*....|....*....|....*....
6X6S_HY        904 LEE*AKESVKAYLDCVSQAKNEAEKKECEKLLTLESKKK 942
Cdd:pfam02463  967 LAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
952-983 9.34e-04

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 38.30  E-value: 9.34e-04
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        952 AYLDCVSQAKTEAEKKECEKLLTPEAKKLLEQ 983
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
691-722 1.06e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 38.30  E-value: 1.06e-03
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        691 AYKDCVSQAKTEAEKKECEKLLTPEAKKLLEE 722
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
837-868 1.45e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 37.91  E-value: 1.45e-03
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        837 AYLDCVSQAKTEAEKKECEKLLTPEARKLLEE 868
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
875-906 1.45e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 37.91  E-value: 1.45e-03
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY        875 AYLDCVSQAKTEAEKKECEKLLTPEARKLLEE 906
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
1059-1090 1.45e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 37.91  E-value: 1.45e-03
                           10        20        30
                   ....*....|....*....|....*....|..
6X6S_HY       1059 AYLDCVSQAKTEAEKKECEKLLTPEARKLLEE 1090
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLLQ 32
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
585-615 1.57e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 37.91  E-value: 1.57e-03
                           10        20        30
                   ....*....|....*....|....*....|.
6X6S_HY        585 QALDCLKNAKTDEERNECLKNIPQDLQKELL 615
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLL 31
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
800-830 7.97e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 35.60  E-value: 7.97e-03
                           10        20        30
                   ....*....|....*....|....*....|.
6X6S_HY        800 QALDCLKNAKTDKERKKCLKDLPKDLQKKVL 830
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLL 31
CagY_M pfam07337
DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island ...
1275-1305 8.71e-03

DC-EC Repeat; This repeat is found in the CagY proteins - part of the CAG pathogenicity island - and involved in delivery of the protein CagA into host cells. It forms part of a surface needle structure, and this repeat may form an alpha-helical rod structure. A conserved -DC- and -EC- can be seen in regularly spaced in the alignment.


Pssm-ID: 284699 [Multi-domain]  Cd Length: 32  Bit Score: 35.60  E-value: 8.71e-03
                           10        20        30
                   ....*....|....*....|....*....|.
6X6S_HY       1275 QVLNCLEKAGNEEERKACLKNLPKDLQENIL 1305
Cdd:pfam07337    1 QYKDCLKNAKTEEERNECLKGLSDEARKLLL 31
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH