Conserved domains on  [gi|2217320564|ref|XP_047294569|]

C3 and PZP-like alpha-2-macroglobulin domain-containing protein 8 isoform X8 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
967-1255 7.25e-146

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha (2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males that is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZP and alpha (2)-M to the CD91 receptor, clearing them from circulation.



Pssm-ID: 239227  Cd Length: 292  Bit Score: 450.11  E-value: 7.25e-146
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  967 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDASGSMWLTAFVLKSFA 1046
Cdd:cd02897     12 PYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGSTWLTAFVLKSFA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1047 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeERGSTDKARHFL 1126
Cdd:cd02897     92 QARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLPS--ERPVVEKALSCL 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1127 ESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSLSnswdvdkgTFLSFSDRVSQSVVSAEVEMT 1206
Cdd:cd02897    170 EAALDSISDPYTLALAAYALTLAGSEKRPEALKKLDELAISEDGTKHWSRP--------PPSEEGPSYYWQAPSAEVEMT 241
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217320564 1207 AYALLTYTLLG--DVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1255
Cdd:cd02897    242 AYALLALLSAGgeDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY 292
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
576-667 8.27e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.



Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 8.27e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  576 TWIWHCLNISDpSGEGTLSVKVPDSITSWVGEAVALSTSQGLGIAEPSLLKTFKPFFVDFMLPALIIRGEQVKIPLSVYN 655
Cdd:pfam00207    1 TWLWDPVLVTD-NGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFN 79
                           90
                   ....*....|..
gi 2217320564  656 YMGTCAEVYMKL 667
Cdd:pfam00207   80 YLDKCLKVRVRL 91
Methyltransf_FA pfam12248
Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, ...
811-912 4.39e-32

Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. Farnesoic acid O-methyl transferase (FAMeT) is the enzyme that catalyzes the formation of methyl farnesoate (MF) from farnesoic acid (FA) in the biosynthetic pathway of juvenile hormone (JH).



Pssm-ID: 463505  Cd Length: 104  Bit Score: 121.21  E-value: 4.39e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  811 VALSS--GPQDTAGMIEIVLGGHQNTRSWISTSKMG--EPVASAHTAKILSWDEFRTFWISWR-GGLIQVGHGPEpsnES 885
Cdd:pfam12248    2 IALSSspYPYDSDPMYEIVIGGWGNTRSVIRRQKRGsaPDVVEVSTPGILSPDEPRMFWISWTdDGLISVGKGGE---EN 78
                           90       100
                   ....*....|....*....|....*..
gi 2217320564  886 VIVAWTLPRPPEVQFIGFSTgWGSMGE 912
Cdd:pfam12248   79 PFLQWSDPNPLPVNYIGFST-WGSTGE 104
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
281-452 3.70e-28

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is recognized as the Venus flytrap mechanism.



Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 111.29  E-value: 3.70e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  281 LQLQPPSHPLQVGEEAYFSVKSTC----PCNFTlYYEVAARGNIVLSGQQPAhttqqrskraapalekpirlthlsetep 356
Cdd:pfam07703    1 LHLSTDKTEYKPGETATVTVKSPFdgtvERDGF-TYLVLSKGQIVVVGRGGV---------------------------- 51
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  357 ppapeaevdvcVTSLHLAVTPSMVPLGRLLVFYVRE---NGEGVADSLQFAVETFFENQVSVTYSANETQPGEVVDLRIR 433
Cdd:pfam07703   52 -----------TTSFSLPVTAEMAPSARVVAYYVRVdlsKPEVVADSVWVDVDDTCENKLKVTLSAEKYRPGSTVELKVK 120
                          170
                   ....*....|....*....
gi 2217320564  434 AARGSCVCVAAVDKSVYLL 452
Cdd:pfam07703  121 ADPGAYVALAAVDKGVLLL 139
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1393-1486 9.31e-28

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.



Pssm-ID: 462226  Cd Length: 92  Bit Score: 108.43  E-value: 9.31e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1393 GSSNMAVLEVPLLSGFRADIESLEQLLLDkhMGMKRYE-VAGRRVLFYFDEIPSRcLTCVRFRALRECVVGRTSALPVSV 1471
Cdd:pfam07677    1 ESSNMAILEVGLPSGFVPDEEDLKKLGVD--PLIKRVEtVDDGKVILYLDKLSGE-PLCFSFRAEQTFPVANLKPAPVKV 77
                           90
                   ....*....|....*
gi 2217320564 1472 YDYYEPAFEATRFYN 1486
Cdd:pfam07677   78 YDYYEPERRATTFYS 92
MG4 super family cl39290
Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.
176-243 5.29e-10

Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.


The actual alignment was detected with superfamily member pfam17789:

Pssm-ID: 465507  Cd Length: 95  Bit Score: 57.65  E-value: 5.29e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217320564  176 KDTRKQFKPGLAYVGKVELSYPDGSPAEGVTVQIKA-ELTPKDNIYTSEvvsqRGLVGFEIPSIPTSAQ 243
Cdd:pfam17789    4 EKTPKYFKPGLPFSGQVLVVDPDGSPAPNVPVFIEAgNTEFNQNLTTDE----DGTAQFSINTPGNAAS 68
MG3 super family cl39292
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
49-115 2.18e-09

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


The actual alignment was detected with superfamily member pfam17791:

Pssm-ID: 465509  Cd Length: 83  Bit Score: 55.74  E-value: 2.18e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217320564   49 KYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGALMINmtvngVGYYSHEVGRPVLRTTK 115
Cdd:pfam17791    1 EYVLPKFEVKVEVPKFISVKDEEFQVTICAKYTYGKPVKGKAYVT-----LCLKDDSKRKCFESFSK 62
KAZAL smart00280
Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and ...
1538-1570 8.85e-09

Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and follistatin-like domains.



Pssm-ID: 197624  Cd Length: 46  Bit Score: 52.68  E-value: 8.85e-09
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2217320564  1538 CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ 1570
Cdd:smart00280    2 CPEACPREYDPVCGSDGVTYSNECHLCKAACES 34
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1466-1670 4.67e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 4.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1466 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1545
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1546 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1625
Cdd:PHA03247  2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2217320564 1626 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1670
Cdd:PHA03247  2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
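
Each hit above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". The following is a minimal Python sketch for pulling those fields out of the text, assuming the layout stays exactly as shown on this page. The function name `parse_summary` and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this page dump.
# The optional "[Multi-domain]" flag sits between the Pssm-ID and Cd Length.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


if __name__ == "__main__":
    # Summary line copied from the A2M (pfam00207) hit above.
    line = "Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 8.27e-34"
    print(parse_summary(line))
```

Sorting the parsed hits by `e_value` would reproduce the ordering used in the list above.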
 
Name Accession Description Interval E-value
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
946-1255 5.47e-131

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short, highly conserved region of proteinase-binding alpha-macroglobulins that contains the cysteine and the glutamine of a thiol-ester bond. This bond is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 410.54  E-value: 5.47e-131
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  946 ASIIGDVMGPTLNHLNNLLRL----PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDG 1021
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSllrlPYGCGEQNMVLFAPNVYVLRYLDKTNQLTKLIKSKAIDYLEQGYQRQLSYKHPDG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1022 SYSAFGERDasGSMWLTAFVLKSFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYV 1101
Cdd:pfam07678   81 SYSAFGHSP--GSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGEVSLTAYV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1102 VVALLETGTASE---EERGSTDKARHFLESAA-PLAMDPYSCALTTYALTLLRSPA-APEALRKLRSLAIMRDGVTHW-- 1174
Cdd:pfam07678  159 TIALLEALDINGllqRVHPSIRKALTYLEQAQlAGLTSPYTLAILAYALALAGSPEtREELLKSLDAMAREEGNSRYWer 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1175 -SLSNSWDVdkgtflsfsDRVSQSVVSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALA 1253
Cdd:pfam07678  239 dEKSDPQGV---------PEYPPQAPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGFSSTQDTVVALQALA 309

                   ..
gi 2217320564 1254 EY 1255
Cdd:pfam07678  310 EY 311
MG4 pfam17789
Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.
176-243 5.29e-10

Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.


Pssm-ID: 465507  Cd Length: 95  Bit Score: 57.65  E-value: 5.29e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217320564  176 KDTRKQFKPGLAYVGKVELSYPDGSPAEGVTVQIKA-ELTPKDNIYTSEvvsqRGLVGFEIPSIPTSAQ 243
Cdd:pfam17789    4 EKTPKYFKPGLPFSGQVLVVDPDGSPAPNVPVFIEAgNTEFNQNLTTDE----DGTAQFSINTPGNAAS 68
MG3 pfam17791
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
49-115 2.18e-09

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


Pssm-ID: 465509  Cd Length: 83  Bit Score: 55.74  E-value: 2.18e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217320564   49 KYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGALMINmtvngVGYYSHEVGRPVLRTTK 115
Cdd:pfam17791    1 EYVLPKFEVKVEVPKFISVKDEEFQVTICAKYTYGKPVKGKAYVT-----LCLKDDSKRKCFESFSK 62
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
880-1333 2.88e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 62.41  E-value: 2.88e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  880 EPSNESVIVAWTLP-RPPevqfigfstgwgSMGEFRIwRKMEVDESYSEAFTLGVPHGAIPGSERATASI----IGDVMG 954
Cdd:COG2373   1067 TGGGESDAREVELPvRPA------------NPLVTRA-TSGVLAPGESWTLPLDLPGGLRPGTGSLTLSLssspPLDLAG 1133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  955 PTLNHLNNllrlPFGCGEQNMIHFAPNVFVLKyLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGeRDASGS 1034
Cdd:COG2373   1134 LLRYLLRY----PYGCTEQTTSRALPLLYLSD-LAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWP-GGSESD 1207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1035 MWLTAFVLKSFAQARSF-IFVDPRELAAAKSWiiQQQQADGSflavgrvlnKDIQGGIHGTVPLTAYVVVALletgtaSE 1113
Cdd:COG2373   1208 PWLTAYATDFLLEAREAgYAVPDDALDRALDY--LRNYLRNP---------WEIEYDDAYRLAVRAYALYVL------AR 1270
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1114 EERGSTDKARHFLESAAPlAMDPYSCALttYALTLLRSPAAPealrklRSLAIMRDGVTHWSLSNSWDVDKGTFLSfsdr 1193
Cdd:COG2373   1271 AGKADLGDLRYLYDRRKD-ALSPLAKAQ--LAAALALLGDKA------RAEELLAAALARLRETGARDYWYGDYGS---- 1337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1194 vsqsvvsaEVEMTAYALLTYTLLGDVAAALP-VVKWLSQQRNAlGGFSSTQDTCVALQALAEYA-ILSYAGGINLTVSLA 1271
Cdd:COG2373   1338 --------PLRDQALALALLAELGPDAPLAPkLARWLAKALKS-GRWLSTQETAWALLALAAYArAAGASPDFTATLTLD 1408
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217320564 1272 STNLDYQETFELHRTNqkvlqTAAIPSLPTGLFVSAKGDGCCLMQIDVTYnVPDPVAKPAFQ 1333
Cdd:COG2373   1409 GKTLPLTGRGPLARVT-----LPAAELLAGPLTITNTGDGPLYYTLTLSG-YPAEGPPPAAS 1464
KAZAL_FS cd00104
Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit ...
1542-1582 2.67e-08

Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as trypsin, chymotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue-specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC (also known as BM-40 or osteonectin), the Gallus gallus Flik protein, and agrin, which has a long array of FS domains. The Kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.


Pssm-ID: 238052 [Multi-domain]  Cd Length: 41  Bit Score: 51.12  E-value: 2.67e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217320564 1542 CGAQGNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCC 1582
Cdd:cd00104      1 CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41
Kazal_2 pfam07648
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1535-1570 2.01e-07

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides.


Pssm-ID: 400135  Cd Length: 50  Bit Score: 49.03  E-value: 2.01e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2217320564 1535 RCGCDHDcgaQGNPVCGSDGVVYASACRLREAACRQ 1570
Cdd:pfam07648    3 NCQCPKT---EYEPVCGSDGVTYPSPCALCAAGCKL 35
PHA03247 PHA03247
large tegument protein UL36; Provisional
1466-1670 4.67e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 4.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1466 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1545
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1546 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1625
Cdd:PHA03247  2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2217320564 1626 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1670
Cdd:PHA03247  2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
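
The paired "gi" and "Cdd" rows above can also be compared programmatically. The sketch below computes percent identity over aligned columns and maps a motif such as GCGEQ back to a residue number in the query. It assumes that "-" and lowercase letters mark gapped or unaligned columns, which matches how the rows look on this page but is an assumption of this example. The function names `percent_identity` and `motif_position` are hypothetical.

```python
def percent_identity(query_row, model_row):
    """Percent identity over columns where both rows hold an aligned residue."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        # Assumption: '-' and lowercase letters are gapped or unaligned columns.
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


def motif_position(query_row, start, motif):
    """Return the 1-based residue number where motif begins, or None.

    start is the residue number printed at the left of the query row.
    Gap characters are dropped because they do not consume residue numbers;
    lowercase residues are kept because they do.
    """
    residues = query_row.replace("-", "").upper()
    index = residues.find(motif)
    return None if index < 0 else start + index


if __name__ == "__main__":
    # Rows copied from the KAZAL (smart00280) hit, query residues 1538-1570.
    kazal_query = "CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ"
    kazal_model = "CPEACPREYDPVCGSDGVTYSNECHLCKAACES"
    print(round(percent_identity(kazal_query, kazal_model), 1))

    # First query row of the A2M_2 (cd02897) hit, which starts at residue 967.
    a2m2_query = ("PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAF"
                  "GERDASGSMWLTAFVLKSFA")
    print(motif_position(a2m2_query, 967, "GCGEQ"))
```

Identity computed this way covers only the aligned span of a single hit, so it is not comparable to the bit score or E-value reported for that hit.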
 
Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
967-1255 7.25e-146

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 450.11  E-value: 7.25e-146
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  967 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDASGSMWLTAFVLKSFA 1046
Cdd:cd02897     12 PYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGSTWLTAFVLKSFA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1047 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeERGSTDKARHFL 1126
Cdd:cd02897     92 QARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLPS--ERPVVEKALSCL 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1127 ESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSLSnswdvdkgTFLSFSDRVSQSVVSAEVEMT 1206
Cdd:cd02897    170 EAALDSISDPYTLALAAYALTLAGSEKRPEALKKLDELAISEDGTKHWSRP--------PPSEEGPSYYWQAPSAEVEMT 241
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217320564 1207 AYALLTYTLLG--DVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1255
Cdd:cd02897    242 AYALLALLSAGgeDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY 292
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
946-1255 5.47e-131

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short highly conserved region of proteinase-binding alpha-macro-globulins contains the cysteine and a glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding, and mediates the covalent binding of the alpha-macro-globulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 410.54  E-value: 5.47e-131
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  946 ASIIGDVMGPTLNHLNNLLRL----PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDG 1021
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSllrlPYGCGEQNMVLFAPNVYVLRYLDKTNQLTKLIKSKAIDYLEQGYQRQLSYKHPDG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1022 SYSAFGERDasGSMWLTAFVLKSFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYV 1101
Cdd:pfam07678   81 SYSAFGHSP--GSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGEVSLTAYV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1102 VVALLETGTASE---EERGSTDKARHFLESAA-PLAMDPYSCALTTYALTLLRSPA-APEALRKLRSLAIMRDGVTHW-- 1174
Cdd:pfam07678  159 TIALLEALDINGllqRVHPSIRKALTYLEQAQlAGLTSPYTLAILAYALALAGSPEtREELLKSLDAMAREEGNSRYWer 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1175 -SLSNSWDVdkgtflsfsDRVSQSVVSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALA 1253
Cdd:pfam07678  239 dEKSDPQGV---------PEYPPQAPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGFSSTQDTVVALQALA 309

                   ..
gi 2217320564 1254 EY 1255
Cdd:pfam07678  310 EY 311
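Every hit on this page is introduced by a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally tagged "[Multi-domain]". The following is a rough sketch of how such a line could be turned into typed fields; the regular expression is inferred from the lines shown here and `parse_summary` is a hypothetical name, so treat it as an assumption rather than a documented format.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption, not a
# published specification of the CD-Search text output).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one 'Pssm-ID' summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


ted = parse_summary(
    "Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 410.54  E-value: 5.47e-131"
)
famet = parse_summary(
    "Pssm-ID: 463505  Cd Length: 104  Bit Score: 121.21  E-value: 4.39e-32"
)
```

Converting the E-value with `float` keeps very small values such as 5.47e-131 comparable and sortable, which string comparison would not.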
A2M_like cd02891
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ...
967-1255 2.49e-109

Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239221 [Multi-domain]  Cd Length: 282  Bit Score: 348.99  E-value: 2.49e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  967 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDaSGSMWLTAFVLKSFA 1046
Cdd:cd02891     12 PYGCGEQTMSRAAPNLYVLKYLDATGQLTPEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSD-SGSTWLTAYVVKFLS 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1047 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeeRGSTDKARHFL 1126
Cdd:cd02891     91 QARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKGGVDDSVSLTAYVLIALAEAGKAC---DASIEKALAYL 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1127 ESAAPLAMDPYSCALTTYALTLLR-SPAAPEALRKLRSLAIMRDGVTHWSLsnSWDVDKGTflsfsdrvsqsvvSAEVEM 1205
Cdd:cd02891    168 ETQLDGLLDPYALAILAYALALAGdSTRADEALKKLLEAAREKGGTAHWSL--SWPGDYGS-------------SLRVEA 232
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1206 TAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1255
Cdd:cd02891    233 TAYALLALLKLGDLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALAAY 282
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
967-1255 6.66e-94

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 306.12  E-value: 6.66e-94
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  967 PFGCGEQNMIHFAPNVFVLKYLQKTQQ---LSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDasGSMWLTAFVLK 1043
Cdd:cd02896     12 PTGCGEQTMIKLAPTVYALRYLDTTNQwekLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRP--SSTWLTAFVVK 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1044 SFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGT---VPLTAYVVVALLET----GTASEEER 1116
Cdd:cd02896     90 VFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSegdVSLTAFVLIALQEArsicPPEVQNLD 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1117 GSTDKARHFLESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSL--SNSWDVDKGTFLSfsdrv 1194
Cdd:cd02896    170 QSIRKAISYLENQLPNLQRPYALAITAYALALADSPLSHAANRKLLSLAKRDGNGWYWWTidSPYWPVPGPSAIT----- 244
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217320564 1195 sqsvvsaeVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1255
Cdd:cd02896    245 --------VETTAYALLALLKLGDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQALAEY 297
ISOPREN_C2_like cd00688
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
970-1255 3.98e-64

This group contains class II terpene cyclases, protein prenyltransferase beta subunits, two broadly specific proteinase inhibitors, alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP), and the C3, C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY); these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II), which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and is involved in the immobilization and entrapment of proteases. PZP is a pregnancy-associated protein. Alpha (2)-M and PZP are known to bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system.


Pssm-ID: 238362 [Multi-domain]  Cd Length: 300  Bit Score: 220.50  E-value: 3.98e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  970 CGEQNMIHFAPNVFVLKYLQKTqqlspEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDaSGSMWLTAFVLKSFAQAR 1049
Cdd:cd00688     23 CGEQTWSTAWPLLALLLLLAAT-----GIRDKADENIEKGIQRLLSYQLSDGGFSGWGGND-YPSLWLTAYALKALLLAG 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1050 SFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKdiQGGIHGTVPLTAYVVVALLETGTASEEErgSTDKARHFLESA 1129
Cdd:cd00688     97 DYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHR--IGGDESDVRLTAYALIALALLGKLDPDP--LIEKALDYLLSC 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1130 APLAM--------DPYSCALTTYALTLL---RSPAAPEALRKLRSLAIMRDGVTHWSLSNSWDVDkgtflsfsdrvsqsv 1198
Cdd:cd00688    173 QNYDGgfgpggesHGYGTACAAAALALLgdlDSPDAKKALRWLLSRQRPDGGWGEGRDRTNKLSD--------------- 237
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217320564 1199 vSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSS-------TQDTCVALQALAEY 1255
Cdd:cd00688    238 -SCYTEWAAYALLALGKLGDLEDAEKLVKWLLSQQNEDGGFSSkpgksydTQHTVFALLALSLY 300
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
576-667 8.27e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.


Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 8.27e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  576 TWIWHCLNISDpSGEGTLSVKVPDSITSWVGEAVALSTSQGLGIAEPSLLKTFKPFFVDFMLPALIIRGEQVKIPLSVYN 655
Cdd:pfam00207    1 TWLWDPVLVTD-NGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFN 79
                           90
                   ....*....|..
gi 2217320564  656 YMGTCAEVYMKL 667
Cdd:pfam00207   80 YLDKCLKVRVRL 91
Methyltransf_FA pfam12248
Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, ...
811-912 4.39e-32

Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. Farnesoic acid O-methyl transferase (FAMeT) is the enzyme that catalyzes the formation of methyl farnesoate (MF) from farnesoic acid (FA) in the biosynthetic pathway of juvenile hormone (JH).


Pssm-ID: 463505  Cd Length: 104  Bit Score: 121.21  E-value: 4.39e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  811 VALSS--GPQDTAGMIEIVLGGHQNTRSWISTSKMG--EPVASAHTAKILSWDEFRTFWISWR-GGLIQVGHGPEpsnES 885
Cdd:pfam12248    2 IALSSspYPYDSDPMYEIVIGGWGNTRSVIRRQKRGsaPDVVEVSTPGILSPDEPRMFWISWTdDGLISVGKGGE---EN 78
                           90       100
                   ....*....|....*....|....*..
gi 2217320564  886 VIVAWTLPRPPEVQFIGFSTgWGSMGE 912
Cdd:pfam12248   79 PFLQWSDPNPLPVNYIGFST-WGSTGE 104
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
281-452 3.70e-28

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is known as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 111.29  E-value: 3.70e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  281 LQLQPPSHPLQVGEEAYFSVKSTC----PCNFTlYYEVAARGNIVLSGQQPAhttqqrskraapalekpirlthlsetep 356
Cdd:pfam07703    1 LHLSTDKTEYKPGETATVTVKSPFdgtvERDGF-TYLVLSKGQIVVVGRGGV---------------------------- 51
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  357 ppapeaevdvcVTSLHLAVTPSMVPLGRLLVFYVRE---NGEGVADSLQFAVETFFENQVSVTYSANETQPGEVVDLRIR 433
Cdd:pfam07703   52 -----------TTSFSLPVTAEMAPSARVVAYYVRVdlsKPEVVADSVWVDVDDTCENKLKVTLSAEKYRPGSTVELKVK 120
                          170
                   ....*....|....*....
gi 2217320564  434 AARGSCVCVAAVDKSVYLL 452
Cdd:pfam07703  121 ADPGAYVALAAVDKGVLLL 139
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1393-1486 9.31e-28

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.


Pssm-ID: 462226  Cd Length: 92  Bit Score: 108.43  E-value: 9.31e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1393 GSSNMAVLEVPLLSGFRADIESLEQLLLDkhMGMKRYE-VAGRRVLFYFDEIPSRcLTCVRFRALRECVVGRTSALPVSV 1471
Cdd:pfam07677    1 ESSNMAILEVGLPSGFVPDEEDLKKLGVD--PLIKRVEtVDDGKVILYLDKLSGE-PLCFSFRAEQTFPVANLKPAPVKV 77
                           90
                   ....*....|....*
gi 2217320564 1472 YDYYEPAFEATRFYN 1486
Cdd:pfam07677   78 YDYYEPERRATTFYS 92
MG4 pfam17789
Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.
176-243 5.29e-10

Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.


Pssm-ID: 465507  Cd Length: 95  Bit Score: 57.65  E-value: 5.29e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217320564  176 KDTRKQFKPGLAYVGKVELSYPDGSPAEGVTVQIKA-ELTPKDNIYTSEvvsqRGLVGFEIPSIPTSAQ 243
Cdd:pfam17789    4 EKTPKYFKPGLPFSGQVLVVDPDGSPAPNVPVFIEAgNTEFNQNLTTDE----DGTAQFSINTPGNAAS 68
MG3 pfam17791
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
49-115 2.18e-09

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


Pssm-ID: 465509  Cd Length: 83  Bit Score: 55.74  E-value: 2.18e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217320564   49 KYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGALMINmtvngVGYYSHEVGRPVLRTTK 115
Cdd:pfam17791    1 EYVLPKFEVKVEVPKFISVKDEEFQVTICAKYTYGKPVKGKAYVT-----LCLKDDSKRKCFESFSK 62
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
880-1333 2.88e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 62.41  E-value: 2.88e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  880 EPSNESVIVAWTLP-RPPevqfigfstgwgSMGEFRIwRKMEVDESYSEAFTLGVPHGAIPGSERATASI----IGDVMG 954
Cdd:COG2373   1067 TGGGESDAREVELPvRPA------------NPLVTRA-TSGVLAPGESWTLPLDLPGGLRPGTGSLTLSLssspPLDLAG 1133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  955 PTLNHLNNllrlPFGCGEQNMIHFAPNVFVLKyLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGeRDASGS 1034
Cdd:COG2373   1134 LLRYLLRY----PYGCTEQTTSRALPLLYLSD-LAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWP-GGSESD 1207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1035 MWLTAFVLKSFAQARSF-IFVDPRELAAAKSWiiQQQQADGSflavgrvlnKDIQGGIHGTVPLTAYVVVALletgtaSE 1113
Cdd:COG2373   1208 PWLTAYATDFLLEAREAgYAVPDDALDRALDY--LRNYLRNP---------WEIEYDDAYRLAVRAYALYVL------AR 1270
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1114 EERGSTDKARHFLESAAPlAMDPYSCALttYALTLLRSPAAPealrklRSLAIMRDGVTHWSLSNSWDVDKGTFLSfsdr 1193
Cdd:COG2373   1271 AGKADLGDLRYLYDRRKD-ALSPLAKAQ--LAAALALLGDKA------RAEELLAAALARLRETGARDYWYGDYGS---- 1337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1194 vsqsvvsaEVEMTAYALLTYTLLGDVAAALP-VVKWLSQQRNAlGGFSSTQDTCVALQALAEYA-ILSYAGGINLTVSLA 1271
Cdd:COG2373   1338 --------PLRDQALALALLAELGPDAPLAPkLARWLAKALKS-GRWLSTQETAWALLALAAYArAAGASPDFTATLTLD 1408
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217320564 1272 STNLDYQETFELHRTNqkvlqTAAIPSLPTGLFVSAKGDGCCLMQIDVTYnVPDPVAKPAFQ 1333
Cdd:COG2373   1409 GKTLPLTGRGPLARVT-----LPAAELLAGPLTITNTGDGPLYYTLTLSG-YPAEGPPPAAS 1464
KAZAL smart00280
Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and ...
1538-1570 8.85e-09

Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and follistatin-like domains.


Pssm-ID: 197624  Cd Length: 46  Bit Score: 52.68  E-value: 8.85e-09
                            10        20        30
                    ....*....|....*....|....*....|...
gi 2217320564  1538 CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ 1570
Cdd:smart00280    2 CPEACPREYDPVCGSDGVTYSNECHLCKAACES 34
KAZAL_FS cd00104
Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit ...
1542-1582 2.67e-08

Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as trypsin, chymotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue-specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC (also known as BM-40 or osteonectin), the Gallus gallus Flik protein, and agrin, which has a long array of FS domains. The Kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.


Pssm-ID: 238052 [Multi-domain]  Cd Length: 41  Bit Score: 51.12  E-value: 2.67e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2217320564 1542 CGAQGNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCC 1582
Cdd:cd00104      1 CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41
Kazal_2 pfam07648
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1535-1570 2.01e-07

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides.


Pssm-ID: 400135  Cd Length: 50  Bit Score: 49.03  E-value: 2.01e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2217320564 1535 RCGCDHDcgaQGNPVCGSDGVVYASACRLREAACRQ 1570
Cdd:pfam07648    3 NCQCPKT---EYEPVCGSDGVTYPSPCALCAAGCKL 35
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
18-716 7.04e-06

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 51.23  E-value: 7.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564   18 SFPLSDQPVLGEWFIFVEMQGHA--YNKSFEVQKYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGAlminmT 95
Cdd:COG2373    433 SFPLPEDAPTGTWRLELYVDPKPalGSKSFRVEEFKPPRFKVDLTLDKEPLKPGDPVTVTVDARYLFGAPAAGL-----K 507
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564   96 VNGV--------------GYYSHEVGRPVLRTTKIL--------GSRDFDICVRDMIPADVPehfrGRVSIWAMVTSVDG 153
Cdd:COG2373    508 VEGEvtlrpartafpgypGYRFGDPDEEFEPEELDLgegtldadGKASLSLPLPDAPDAPGP----LRATVEASVFESGG 583
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  154 sQQVAFDDSTPVQRQ--LVDIRYSK----DTRKQFKPGLAYVGkvelsyPDGSPAEGVTVQIKAE--------LTPKDNI 219
Cdd:COG2373    584 -RPVTRSATVPVHPAdfYVGIRLPLfdgdPEGAPATFEVVAVD------PDGKPVAGKGLKVELYreewryvwYKSDDGG 656
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  220 YTSEVVSQRGLVG-FEIPSIPTSAQHVWLETK--------VMALNGKPVGAQYLPSYlSLGSWYSPSQCYLQLQPPSHPL 290
Cdd:COG2373    657 WRYESQEKEEPVAeGTLTTGADGPASLSLTPVewgryrleVKDPDGGLATSVRFYAG-GNASWGAERPDRLELSLDKESY 735
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  291 QVGEEAYFSVKStcPcnftlyyeVAARGNIVLSGQQPAHTtqqrskraapalekpiRLTHLSETEpppapeaevdvcvTS 370
Cdd:COG2373    736 KPGETAKLLIQS--P--------FAGRALVTVERDGVLET----------------QWVDVKGGG-------------TT 776
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  371 LHLAVTPSMVPLGRLLVFYVR--ENGEGVADSLQFAVETFF----ENQVSVTYSANET-QPGEVVDLRIRAA----RGSC 439
Cdd:COG2373    777 VEIPVTEDWAPNAYVSATLVRpgDSTANDMPARAYGVAPLPvdppARRLKVELTAPEKlRPGETLTVTVKVKgaagKAAE 856
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  440 VCVAAVDKSVYLLrSGFRlTPaqvfqeledyDVSDSFGvsredgpfwwagltaqRRRRSSVFPWpwgitkDS-GFAFTET 518
Cdd:COG2373    857 VTLAAVDEGILNL-TGYK-TP----------DPLDFFY----------------GKRALGVETR------DLyGRLIGAF 902
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  519 GLVVMTDRVslnhrqdGGLytdeavpafqphtGSLVAVAPSRhPPRTEkrkrtfFPETWIWHCLNISDPSGEGTLSVKVP 598
Cdd:COG2373    903 GGAAGALRS-------GGD-------------GALGRGGNPK-PPRKR------FKPVALFSGPVKTDADGKATVSFDLP 955
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564  599 DSITSWVGEAVALSTSQgLGIAEpsllKTF---KPFFVDFMLPALIIRGEQVKIPLSVYNYMGTCAEVYMKLSVPKGIQF 675
Cdd:COG2373    956 DFNGTLRVMAVAWSDDR-FGSAE----ATVtvrKPLVVRPSLPRFLAPGDRFELPVDVFNLTGKAGTVTVTLEASGGLTL 1030
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|.
gi 2217320564  676 VGhPGKRHVTkkmcVAPGEAEPIWVVLSFSDLGLNNITAKA 716
Cdd:COG2373   1031 EG-EATQTVT----LAAGGRATVRFPLKAPDAGDAKVTVTA 1066
PHA03247 PHA03247
large tegument protein UL36; Provisional
1466-1670 4.67e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 4.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1466 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1545
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1546 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1625
Cdd:PHA03247  2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2217320564 1626 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1670
Cdd:PHA03247  2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
KAZAL_SLC21 cd01330
The kazal-type serine protease inhibitor domain has been detected in an extracellular loop ...
1534-1561 1.12e-03

The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.


Pssm-ID: 238650 [Multi-domain]  Cd Length: 54  Bit Score: 38.44  E-value: 1.12e-03
                           10        20
                   ....*....|....*....|....*...
gi 2217320564 1534 ARCGCDhdcGAQGNPVCGSDGVVYASAC 1561
Cdd:cd01330      7 SNCSCS---ESAYSPVCGENGITYFSPC 31
CAL1 COG5029
Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, ...
1015-1217 2.61e-03

Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, Lipid transport and metabolism];


Pssm-ID: 444045 [Multi-domain]  Cd Length: 259  Bit Score: 41.62  E-value: 2.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1015 TYKRQDGSY-SAFGErdASGSMWLTAFVLKSfAQARSFIFVDPRELAAaksWIIQQQQADGSF-LAVGRVLNKDIqggih 1092
Cdd:COG5029     76 SLRVEDGGFaKAPEG--GAGSTYHTYLATLL-AELLGRPPPDPDRLVR---FLISQQNDDGGFeISPGRRSDTNP----- 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217320564 1093 gtvplTAYVVVALLETGTASEEERgsTDKARHFLESA------APLAMDPYSCALTTYALTLlrspaapeALRKLRSLAI 1166
Cdd:COG5029    145 -----TAAAIGALRALGALDDPIE--TKVIRFLRDVQspeggfAYNTRIGEADLLSTFTAIL--------TLYDLGAAPK 209
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2217320564 1167 MRDGVTHWSLSNswDVDKGtflSFSDRVSQSVvsAEVEMTAYALLTYTLLG 1217
Cdd:COG5029    210 LVDDLQAYILSL--QLPDG---GFEGAPWDGV--EDVEYTFYGVGALALLG 253
Kazal_1 pfam00050
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1539-1561 3.45e-03

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides. Alignment also includes a single domain from transporters in the OATP/PGT family.


Pssm-ID: 395004  Cd Length: 49  Bit Score: 36.88  E-value: 3.45e-03
                           10        20
                   ....*....|....*....|...
gi 2217320564 1539 DHDCGAQGNPVCGSDGVVYASAC 1561
Cdd:pfam00050    6 SGACPRIYDPVCGTDGKTYSNEC 28
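The paired rows in each alignment make it possible to estimate how closely the query matches a domain model. The sketch below counts identical residues over aligned columns. It assumes that lowercase letters and `-` mark unaligned or gapped positions, which is how the alignments on this page appear to be rendered; `identity` is a hypothetical helper name.

```python
from typing import Tuple


def identity(query: str, model: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two equal-length rows.

    Assumption: a column counts as aligned only when both characters are
    uppercase residues; lowercase letters and '-' are skipped.
    """
    if len(query) != len(model):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query, model):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Rows of the Kazal_1 (pfam00050) alignment, query residues 1539-1561.
identical, aligned = identity(
    "DHDCGAQGNPVCGSDGVVYASAC",
    "SGACPRIYDPVCGTDGKTYSNEC",
)
percent = 100.0 * identical / aligned
```

For this short Kazal_1 hit, 9 of the 23 aligned columns are identical (about 39%), with the conserved cysteines among the matches.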
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
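The E-value threshold of 0.01 listed above determines which hits are reported. As a hedged illustration, the sketch below applies the same kind of cutoff to a few hits copied from this page and sorts them from most to least significant. The `Hit` type and `filter_hits` function are hypothetical, and treating the threshold as inclusive is an assumption.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    start: int
    end: int
    e_value: float


# A subset of the hits listed on this page (name, interval, E-value).
HITS = [
    Hit("TED_complement", 946, 1255, 5.47e-131),
    Hit("A2M", 576, 667, 8.27e-34),
    Hit("KAZAL", 1538, 1570, 8.85e-09),
    Hit("PHA03247", 1466, 1670, 4.67e-04),
    Hit("Kazal_1", 1539, 1561, 3.45e-03),
]


def filter_hits(hits: List[Hit], threshold: float = 0.01) -> List[Hit]:
    """Keep hits at or below the threshold, most significant first."""
    kept = [hit for hit in hits if hit.e_value <= threshold]
    return sorted(kept, key=lambda hit: hit.e_value)


reported = filter_hits(HITS)        # page default: all five pass
strict = filter_hits(HITS, 1e-5)    # a stricter, illustrative cutoff
```

With the stricter illustrative cutoff, the weak PHA03247 and Kazal_1 matches drop out while the macroglobulin and KAZAL hits remain.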
