Conserved domains on  [gi|553746391|ref|WP_023079880|]

pilus ancillary protein 1 [Streptococcus pyogenes]

Protein Classification


List of domain hits

Name Accession Description Interval E-value
pilus_ancill_1 NF033396
pilus ancillary protein 1;
13-743 0e+00

pilus ancillary protein 1;



Pssm-ID: 380246 [Multi-domain]  Cd Length: 737  Bit Score: 1398.85  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  13 NNKRRQTTIGLLKVFLTFVALIGIVGFSIRAFGAEEQ----SVPNKQSSVQDYPWYGYDSYSKGYPDYSPLKTYHNLKVN 88
Cdd:NF033396   1 NRKPKQLTVTLVGVFLMFLTLVSSMRGAQSIFGEEKRieevSVPKIKSPDDDYPWYGYDSYDSSHPYYEPFKVAHDLKVN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  89 LEGSKDYQTYCFNLTKHFPSKLDSVKSQWYKKLEGNDQTFRNYASQIRNEQ-NISQKILDVLYNGYPNNANGLMNELEPL 167
Cdd:NF033396  81 LNGSKSYQVYCFNITSHYPSKKNSVSKQWFKRVDGTGDVFTSYAKTPRIEGeELNQKLLSVMYNAYPNNANGIMKGIEPL 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 168 NAIIVTQNAIWYYSDSAQINPDESFKTEAKSNGINDQQLGLMRKALKELIDPNLGSKYSNKTPSGYRLNVFESHDKTFQN 247
Cdd:NF033396 161 NAILVTQNAVWYYSDSSQINPDELFKSEAKSNKINDQQLGLMREALSELIDPNLGEKYSNKTPSGYRLNIFESHDKSFQN 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 248 LLSAEYVPDTPPKPGEEPPAKTEKTSVIIRKYAEGDYSKLLEGATLKLSQIEGSGFQEKDFQSNSLGETVELPNGTYTLT 327
Cdd:NF033396 241 LLSAEYVPDTPPKPGEEPPAKTEKTSVIIRKYAEGDYSKLLEGATLKLTQIEGSGFQEKIFQSNSSGETVELPNGTYTLT 320
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 328 ETLSPDGYKIAEPIKFRVENKKVFIVQKDGSQVENPNKEVAEPYSVEAYNDFMDEEVLS-GFTPYGKFYYAKNKDKSSQV 406
Cdd:NF033396 321 ETKSPDGYKIAEPIKFRVKNGKVFIVQKDGSQVENPNKEVAEPYSVEAYNDFSEDGYLSsGFRPYGKFYYAKNKDGSSQV 400
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 407 VYCFNADLHSPPDSYDSGETINPDTSTMKEVKYTHTAGSDLFKYALRPRDTNPEDFLKHIKKVIEKGYKKKGDSYNGLTE 486
Cdd:NF033396 401 VYCFNADLHSPPDSYDGGGTIDPDISTMKEVKYTHVAGSDLFKYALRPRDTNPEDFLKHIKKVIEKGYKKKGDSYNGLTE 480
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 487 TQFRAATQLAIYYFTDSADLKTLKTYNNGKGYHGFESMDEKTLAVTKELITYAQNGSAPQLTNLDFFVPNNSKYQSLIGT 566
Cdd:NF033396 481 TQFRAATQLAIYYFTDSADLETLKTYNNNKGYHGFEDMDEATLAVTKELIAYAQNDEAPQLTNLDFFVPNNSKYQSLIGT 560
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 567 EYHPDDLVDVIRMEDKKQEVIPVTHSLTVKKTVVGELGDKTKGFQFELELKDKTGQPIVNTLKTNNQDLVAKDGKYSFNL 646
Cdd:NF033396 561 EYHPDDLVDVIRMEDKKQEVIPVTHSLTVKKTVVGELGDKTKGFQFELELKDKTGQPIVNTLKTNNQDLVAKDGKYSFNL 640
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 647 KHGDTIRIEGLPTGYSYTLKETEAKDYIVTVDNKVSQEAQSVGKDITEDKKVTFENRKDLVPPTGLTTDGAIYLWLLLLV 726
Cdd:NF033396 641 KHGDTIRIEGLPTGYSYTLKETEAKDYIVTVDNKVSQEAQSTKASVTEDKTVTFENRKDLVPPTGLTTDGAIYLWLLLLV 720
                        730
                 ....*....|....*..
gi 553746391 727 PLGLLVWLIGRKGLKND 743
Cdd:NF033396 721 PLGLWVWLIGRKGLKND 737
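
Each hit on this page carries a one-line summary (Pssm-ID, hit type, model length, bit score, E-value). The sketch below shows one way to turn such a line into a record for downstream filtering. The helper name `parse_summary` and the regular expression are assumptions for illustration, written against the line layout seen here, and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern, written against the summary-line layout on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),  # "0e+00" parses as 0.0
    }


summary = parse_summary(
    "Pssm-ID: 380246 [Multi-domain]  Cd Length: 737  "
    "Bit Score: 1398.85  E-value: 0e+00"
)
print(summary)
```

An E-value printed as 0e+00 means the value fell below the displayed precision, so treating it as exactly zero is a simplification.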
 
fibronec_SfbI NF033395
fibronectin-binding protein PrtF1/SfbI; PrtF1/SfbI is a fibronectin-binding protein a ...
12-265 3.09e-99

fibronectin-binding protein PrtF1/SfbI; PrtF1/SfbI is a fibronectin-binding protein with a C-terminal LPXTG region that mediates processing by sortase and covalent attachment to the cell wall. Near the N-terminus is a TED domain, which includes a Cys residue that forms a covalent thioester bond.


Pssm-ID: 468012 [Multi-domain]  Cd Length: 555  Bit Score: 317.30  E-value: 3.09e-99
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  12 ANNKR-RQTTIGLLKVFLTFVALIGIVGFSIRAFGAEEQSVPNKQSSVQDYPWYGYDSYSKGYPdysplkTYHNLKVNLE 90
Cdd:NF033395  15 AHTKRkRRFAVTLVGVFFMLLACAGAIGFGQVAYAADEKTVPNFKSPNPEFPWYGYDAYRGAFL------RYHDLKVNLN 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  91 GSKDYQTYCFNLTKHFPSKLDSVKSQWYKKLEGNDQTFRNYASQIR-NEQNISQKILDVLYNGYPNNANGLMNELEPLNA 169
Cdd:NF033395  89 GSKEYQVYCFNLKKFEPRKETSSDKNWYKKLEGTAETFKKYAMNPRvGGEELEKNILSVMYNGYPNDGNGIMKGLEPLNA 168
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 170 IIVTQNAIWYYSDSAQINPDESFKTEAKSNGINDQQLGLMRKALKELIDPNLGSKYSNKTPSGYRLNVFESHDKTFQNLL 249
Cdd:NF033395 169 ILVTQNAVWYYSDSSPYDIETLWESEAKEGKISESQVTLMREALKKLIDPDLEETLVKKVPSNYKLNIFESSDKSYQNLL 248
                        250
                 ....*....|....*.
gi 553746391 250 SAEYVPDTPPKPGEEP 265
Cdd:NF033395 249 SAEYVPDDPPKPGDTS 264
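
The alignment rows can be compared column by column to get a rough percent identity between the query and the model consensus. The following is a minimal sketch, assuming that dashes mark gaps and that lowercase letters mark unaligned residues that should be skipped; the function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row, model_row):
    """Percent identity over aligned columns of two equal-length rows.

    Assumption: '-' is a gap and lowercase letters are unaligned residues;
    both are excluded from the denominator.
    """
    if len(query_row) != len(model_row):
        raise ValueError("rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Last block of the NF033395 alignment (query residues 250-265).
pid = percent_identity("SAEYVPDTPPKPGEEP", "SAEYVPDDPPKPGDTS")
print(pid)  # 12 of 16 aligned columns are identical
```

Concatenating all blocks of a hit before calling the function would give the identity over the whole interval rather than one block.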
surf_Nterm_1 NF012162
surface-anchored protein thioester-forming domain; This model describes a conserved region, ...
20-252 2.54e-57

surface-anchored protein thioester-forming domain; This model describes a conserved region, fairly rich in insertions and deletions, located just past the signal peptide region in long, variable, and typically highly repetitive and sortase-dependent surface proteins. Members are found in a broad range of taxa, including many strains of Streptococcus pneumoniae. A conserved Cys forms a thioester bond, often to a host protein for covalent attachment.


Pssm-ID: 467950 [Multi-domain]  Cd Length: 234  Bit Score: 194.99  E-value: 2.54e-57
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  20 TIGLLKVFLTFVALIGIVGFSIRAFGAEEQSVPNKQSSVQ-DYPWYGYDSYSKGYpdysplKTYHNLKV--NLEGSKDYQ 96
Cdd:NF012162   1 ILTLVVVFLMLLALAGSIIFGSLAYAADEKGFPNDAKGVSpEGKYYGYDKYGKLY------TTYHRLRVveNLEGSDDYQ 74
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  97 TYCFNLTKHFPSKLDSVKSQWYKKLEGNDQ-TFRNYA------SQIRNEQNISQKILDVLYNGYPNNANGLMNE--LEPL 167
Cdd:NF012162  75 AFCFNLKKKFPSYDDSSVKKWYKKLLGSDKeTFKKYArdprrdGTISNPNELWDKLRKVIYNGYPKDPTDIMGRsgLTPL 154
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 168 NAIIVTQNAIWYYSDSAQI-NPDESFKTEAKSNGINDQQLgLMRKALKELIDPNlgskysnktpSGYRLNVFES----HD 242
Cdd:NF012162 155 NFINVTQNAIWYYTDGSKVsKDDNSYEYEAQNSQSQEQLE-LMREALKKLIDPN----------SDFELRIYKPqdvgGQ 223
                        250
                 ....*....|
gi 553746391 243 KTFQNLLSAE 252
Cdd:NF012162 224 KGYQALLSGR 233
TQXA_dom TIGR03934
TQXA domain; This model describes a domain of about 40 residues with an invariant TQ dipeptide ...
144-185 3.07e-14

TQXA domain; This model describes a domain of about 40 residues with an invariant TQ dipeptide in an almost invariant TQxA[VI]W motif. This domain occurs in surface-expressed proteins of Gram-positive bacteria, many of which are anchored by LPXTG-containing sortase target domains. Numerous members of this family have domains pfam05738 (Cna protein B-type domain) and pfam08341 (fibronectin-binding protein signal sequence).


Pssm-ID: 274864 [Multi-domain]  Cd Length: 42  Bit Score: 67.27  E-value: 3.07e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 553746391  144 KILDVLYNGYPNNANGLMNELEPLNAIIVTQNAIWYYSDSAQ 185
Cdd:TIGR03934   1 KILWILANGYPNKSNGELGGLTEEEARAVTQLAIWYFTDGLD 42
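
The TQxA[VI]W motif named in the model description can be located in the query segment with a regular expression. This is a small sketch, assuming an ungapped query segment and its first residue number taken from the alignment row; `find_motif` is a hypothetical helper.

```python
import re

# Motif from the TIGR03934 description: T, Q, any residue, A, V or I, W.
TQXA_MOTIF = re.compile(r"TQ.A[VI]W")


def find_motif(segment, first_residue):
    """Return (residue_number, matched_text) for each motif occurrence.

    `segment` must be ungapped; `first_residue` is the 1-based position
    of its first character in the full protein.
    """
    return [
        (first_residue + m.start(), m.group())
        for m in TQXA_MOTIF.finditer(segment)
    ]


# Query row of the hit at interval 144-185.
motif_hits = find_motif("KILDVLYNGYPNNANGLMNELEPLNAIIVTQNAIWYYSDSAQ", 144)
print(motif_hits)  # [(173, 'TQNAIW')]
```

Because the description calls the motif "almost invariant", a strict pattern like this one can miss diverged copies; a looser pattern would be needed to catch those.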
TED pfam08341
Thioester domain; This domain is found near the N-terminus of a variety of bacterial surface ...
97-219 9.11e-11

Thioester domain; This domain is found near the N-terminus of a variety of bacterial surface proteins and pili. This domain contains an unusual covalent ester bond between a conserved cysteine and glutamine residue.


Pssm-ID: 462437 [Multi-domain]  Cd Length: 105  Bit Score: 59.17  E-value: 9.11e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391   97 TYCFNLTKHFPSKLDSVKSQWYKKLEGNDQTfrnyasqirneqnisQKILDVLYNGYPNNANGLMNE--LEPLNAIIVTQ 174
Cdd:pfam08341   2 AYCIEPGKGFPSGGDGTASSETRLTLYKDNA---------------DKINWILYNGYPNKSGLSEELggLTDDDAYAATQ 66
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 553746391  175 NAIWYYSDSAQINPDESFKTEAKSNgindqqlglMRKALKELIDP 219
Cdd:pfam08341  67 AAIWHFTDGVDGASDGDGGTERDDD---------VKKLYDYLIGN 102
surf_Nterm_1 NF012162
surface-anchored protein thioester-forming domain; This model describes a conserved region, ...
406-567 6.51e-05

surface-anchored protein thioester-forming domain; This model describes a conserved region, fairly rich in insertions and deletions, located just past the signal peptide region in long, variable, and typically highly repetitive and sortase-dependent surface proteins. Members are found in a broad range of taxa, including many strains of Streptococcus pneumoniae. A conserved Cys forms a thioester bond, often to a host protein for covalent attachment.


Pssm-ID: 467950 [Multi-domain]  Cd Length: 234  Bit Score: 45.14  E-value: 6.51e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 406 VVYCFNADLHSPP--DSYDSGEtinpdtstmkevkYTHTAGSD---LFKYALRPRDT----NPEDFLKHIKKVIEKGYKK 476
Cdd:NF012162  74 QAFCFNLKKKFPSydDSSVKKW-------------YKKLLGSDketFKKYARDPRRDgtisNPNELWDKLRKVIYNGYPK 140
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 477 KGD---SYNGLTETQFRAATQLAIYYFTDSA---DLKTLKTYNNGKGyhgfesmdEKTLAVtkELITYAQNGSAPQLTN- 549
Cdd:NF012162 141 DPTdimGRSGLTPLNFINVTQNAIWYYTDGSkvsKDDNSYEYEAQNS--------QSQEQL--ELMREALKKLIDPNSDf 210
                        170       180
                 ....*....|....*....|...
gi 553746391 550 -LDFFVP----NNSKYQSLIGTE 567
Cdd:NF012162 211 eLRIYKPqdvgGQKGYQALLSGR 233
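
Since each alignment row lists a start coordinate, a gapped sequence, and an end coordinate, the numbering can be sanity-checked: the start plus the number of residues (letters of either case, ignoring dashes) minus one should equal the end. The sketch below assumes the row layout shown on this page; `row_is_consistent` is a hypothetical helper.

```python
import re

# Hypothetical pattern for rows such as "gi 553746391 550 -LDFFVP... 567".
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def row_is_consistent(row):
    """Check that start + residue count - 1 equals the printed end."""
    match = ROW_RE.match(row.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {row!r}")
    # Count letters only; '-' characters are gaps and consume no residue.
    residues = sum(1 for ch in match.group("seq") if ch.isalpha())
    return int(match.group("start")) + residues - 1 == int(match.group("end"))


query_ok = row_is_consistent("gi 553746391 550 -LDFFVP----NNSKYQSLIGTE 567")
model_ok = row_is_consistent("Cdd:NF012162 211 eLRIYKPqdvgGQKGYQALLSGR 233")
print(query_ok, model_ok)
```

A row that fails this check usually points to text lost or shifted during extraction rather than to a problem in the alignment itself.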
ClfA COG4932
Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing ...
228-347 7.42e-05

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 46.12  E-value: 7.42e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 228 KTPSGYRLNvfeshDKTFQNLLSAEYVPDTPPKPGEEPPaktEKTSVIIRKYAEGDYSKLLEGATLKLSQIEGSGFQEKD 307
Cdd:COG4932  322 KAPAGYDLD-----GEAVKVTITAGQTTTVTVTNGNNEV---KTGSVTLTKVDADDGEAPLAGAEFTLTDADGTVVATIT 393
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 553746391 308 FQSNSLGETVELPNGTYTLTETLSPDGYKI-AEPIKFRVEN 347
Cdd:COG4932  394 TDADGTASFKGLAPGTYTLTETKAPEGYTLdSTPITVTVTD 434
 
TQXA_dom TIGR03934
TQXA domain; This model describes a domain of about 40 residues with an invariant TQ dipeptide ...
465-505 4.75e-11

TQXA domain; This model describes a domain of about 40 residues with an invariant TQ dipeptide in an almost invariant TQxA[VI]W motif. This domain occurs in surface-expressed proteins of Gram-positive bacteria, many of which are anchored by LPXTG-containing sortase target domains. Numerous members of this family have domains pfam05738 (Cna protein B-type domain) and pfam08341 (fibronectin-binding protein signal sequence).


Pssm-ID: 274864 [Multi-domain]  Cd Length: 42  Bit Score: 58.03  E-value: 4.75e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 553746391  465 HIKKVIEKGYKKKGDS-YNGLTETQFRAATQLAIYYFTDSAD 505
Cdd:TIGR03934   1 KILWILANGYPNKSNGeLGGLTEEEARAVTQLAIWYFTDGLD 42
TED pfam08341
Thioester domain; This domain is found near the N-terminus of a variety of bacterial surface ...
406-541 3.56e-07

Thioester domain; This domain is found near the N-terminus of a variety of bacterial surface proteins and pili. This domain contains an unusual covalent ester bond between a conserved cysteine and glutamine residue.


Pssm-ID: 462437 [Multi-domain]  Cd Length: 105  Bit Score: 49.16  E-value: 3.56e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391  406 VVYCFNADLHSPPDSYDSGETINpdtstmkevkythtagsdlfkyalrpRDTNPEDFLKHIKKVIEKGYKKKGDS---YN 482
Cdd:pfam08341   1 PAYCIEPGKGFPSGGDGTASSET--------------------------RLTLYKDNADKINWILYNGYPNKSGLseeLG 54
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 553746391  483 GLTETQFRAATQLAIYYFTDSADLKTLKTYNNgkgyhgfeSMDEKTLAVTKELITYAQN 541
Cdd:pfam08341  55 GLTDDDAYAATQAAIWHFTDGVDGASDGDGGT--------ERDDDVKKLYDYLIGNANK 105
SpaA pfam17802
Prealbumin-like fold domain; This entry contains a prealbumin-like domain from a wide variety ...
286-349 6.39e-05

Prealbumin-like fold domain; This entry contains a prealbumin-like domain from a wide variety of bacterial surface proteins. This entry corresponds to domain 1 and domain 3 of SpaA from Corynebacterium diphtheriae. Some members of this family contain an isopeptide bond.


Pssm-ID: 465513 [Multi-domain]  Cd Length: 72  Bit Score: 41.80  E-value: 6.39e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 553746391  286 KLLEGATLKLSQIEGSGFQEK--DFQSNSLGETV--ELPNGTYTLTETLSPDGYKI-AEPIKFRVENKK 349
Cdd:pfam17802   4 KPLAGAEFTLYDADGTVDGKVvgTLTTDEDGKATfdGLPPGTYTLKETKAPDGYVLdDTPIEFTVTEDG 72
ClfA COG4932
Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing ...
251-374 7.75e-05

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 46.12  E-value: 7.75e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 251 AEYVPDTPPKPGEEPPAKTEKTSVIIRKYAEGDYSKLLEGATLKLSQieGSGFQEKDFQSNSLGETV--ELPNGTYTLTE 328
Cdd:COG4932  143 TVTAAATDGVNDVDGNGASVTDSVTLKKVDDGDTGKPLPGATFTLYD--SDGTLVKTVTTDADGKYTftDLPPGTYTLTE 220
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 553746391 329 TLSPDGYKIAePIKFRVENKKVFIVQKDGSQVENPNKEVAEPYSVE 374
Cdd:COG4932  221 TKAPEGYVLD-TKDPTGATITVTVNAGGTVTVTLKNTPKYTKGSVT 265
ClfA COG4932
Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing ...
228-367 4.00e-03

Clumping factor A-related surface protein, MSCRAMM (microbial surface components recognizing adhesive matrix molecules) family, DEv-IgG fold [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443959 [Multi-domain]  Cd Length: 689  Bit Score: 40.73  E-value: 4.00e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 553746391 228 KTPSGYRLNVFESHDKTFQNLLSAEYVPDTPPKPGEEPPakteKTSVIIRKyAEGDYSKLLEGATLKLSQIEGSG-FQEK 306
Cdd:COG4932  222 KAPEGYVLDTKDPTGATITVTVNAGGTVTVTLKNTPKYT----KGSVTVTK-TDADTGEPLAGATFTLTDADGNTvVTTT 296
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 553746391 307 DFQSNSLGETV--ELPNGTYTLTETLSPDGYKIA-EPIKFRVENKKVFIVQKDGSQVENPNKEV 367
Cdd:COG4932  297 VTVTDADGSYTftDLPPGTYTVTETKAPAGYDLDgEAVKVTITAGQTTTVTVTNGNNEVKTGSV 360
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
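
The E-value threshold of 0.01 determines which hits are listed. The sketch below applies such a cutoff to a few of the hits reported on this page and sorts them by E-value. The tuple layout, the function name `filter_hits`, and the use of `<=` for the comparison are assumptions made for illustration.

```python
E_VALUE_THRESHOLD = 0.01  # from the search parameters on this page

# (name, accession, (start, end), e_value), copied from the hit list.
hits = [
    ("ClfA", "COG4932", (228, 367), 4.00e-03),
    ("TED", "pfam08341", (97, 219), 9.11e-11),
    ("pilus_ancill_1", "NF033396", (13, 743), 0.0),
    ("TQXA_dom", "TIGR03934", (144, 185), 3.07e-14),
    ("surf_Nterm_1", "NF012162", (20, 252), 2.54e-57),
    ("fibronec_SfbI", "NF033395", (12, 265), 3.09e-99),
]


def filter_hits(records, threshold):
    """Keep hits at or below the threshold, most significant first."""
    kept = [record for record in records if record[3] <= threshold]
    return sorted(kept, key=lambda record: record[3])


reported = filter_hits(hits, E_VALUE_THRESHOLD)
strict = filter_hits(hits, 1e-20)
for name, accession, (start, end), e_value in reported:
    print(f"{name:15s} {accession:10s} {start}-{end} {e_value:.2e}")
```

Tightening the cutoff, as in `strict`, is a simple way to keep only the hits that define the protein's overall architecture.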

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.