NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|4503625|ref|NP_000495|]
View 

coagulation factor X isoform 1 preproprotein [Homo sapiens]

Protein Classification

coagulation factor( domain architecture ID 10637862)

coagulation factor is a vitamin K-dependent protein S1 family serine peptidase, similar to human coagulation factor X that converts prothrombin to thrombin in the presence of factor Va, calcium and phospholipid during blood clotting

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
235-464 1.65e-95

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


:

Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 288.41  E-value: 1.65e-95
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  235 IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHN 312
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYssAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  313 RFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDwaeSTLMTQKTGIVSGFGRTHEKGRQSTRLKMLEVPYVDRNSCK- 391
Cdd:cd00190  81 NYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSG---YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKr 157
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4503625  392 -LSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDR 464
Cdd:cd00190 158 aYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
GLA smart00069
Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in ...
25-85 1.14e-27

Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.


:

Pssm-ID: 214503  Cd Length: 65  Bit Score: 105.08  E-value: 1.14e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 4503625      25 FIRREQANNILARVTRANSF-LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYK 85
Cdd:smart00069   4 FLSRQEANKVLRRQRRANAFlLEELRPGNLERECQEEICSLEEAREVFEDNEGTDEFYRRYY 65
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
129-164 5.39e-10

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


:

Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 54.56  E-value: 5.39e-10
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 4503625    129 CSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKAC 164
Cdd:pfam14670   1 CSVNNGGCSHLCLNTPGGYTCSCPEGYELQDDGRTC 36
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
86-122 1.14e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.71  E-value: 1.14e-08
                        10        20        30
                ....*....|....*....|....*....|....*...
gi 4503625   86 DGDQCET-SPCQNQGKCKDGLGEYTCTCLEGFEGKNCE 122
Cdd:cd00054   1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
235-464 1.65e-95

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 288.41  E-value: 1.65e-95
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  235 IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHN 312
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYssAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  313 RFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDwaeSTLMTQKTGIVSGFGRTHEKGRQSTRLKMLEVPYVDRNSCK- 391
Cdd:cd00190  81 NYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSG---YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKr 157
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4503625  392 -LSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDR 464
Cdd:cd00190 158 aYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
234-462 7.69e-93

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 281.49  E-value: 7.69e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625     234 RIVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGGEaVHEVEVVIKH 311
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRgsDPSNIRVRLGSHDLSSGEEGQ-VIKVSKVIIH 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625     312 NRFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDwaeSTLMTQKTGIVSGFGRTHE-KGRQSTRLKMLEVPYVDRNSC 390
Cdd:smart00020  80 PNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSN---YNVPAGTTCTVSGWGRTSEgAGSLPDTLQEVNVPIVSNATC 156
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4503625     391 K--LSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRfKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWI 462
Cdd:smart00020 157 RraYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN-DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
235-462 3.30e-81

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 251.21  E-value: 3.30e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625    235 IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLYQAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHNRF 314
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGKHFCGGSLISENWVLTAAHCVSGASDVKVVLGAHNIVLREGGEQKFDVEKIIVHPNY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625    315 TKETYDFDIAVLRLKTPITFRMNVAPACLPErdwAESTLMTQKTGIVSGFGRTHEKGRqSTRLKMLEVPYVDRNSCKLSS 394
Cdd:pfam00089  81 NPDTLDNDIALLKLESPVTLGDTVRPICLPD---ASSDLPVGTTCTVSGWGNTKTLGP-SDTLQEVTVPVVSRETCRSAY 156
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 4503625    395 SFIITQNMFCAGYDTKqeDACQGDSGGPHVTRFKdtyFVTGIVSWGEGCARKGKYGIYTKVTAFLKWI 462
Cdd:pfam00089 157 GGTVTDTMICAGAGGK--DACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
233-470 1.06e-61

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 202.19  E-value: 1.06e-61
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  233 TRIVGGQECKDGECPWQALLINEE--NEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGgeAVHEVEVV 308
Cdd:COG5640  29 PAIVGGTPATVGEYPWMVALQSSNgpSGQFCGGTLIAPRWVLTAAHCVDgdGPSDLRVVIGSTDLSTSGG--TVVKVARI 106
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  309 IKHNRFTKETYDFDIAVLRLKTPITfrmNVAPACLPErdwAESTLMTQKTGIVSGFGRTHE-KGRQSTRLKMLEVPYVDR 387
Cdd:COG5640 107 VVHPDYDPATPGNDIALLKLATPVP---GVAPAPLAT---SADAAAPGTPATVAGWGRTSEgPGSQSGTLRKADVPVVSD 180
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  388 NSCKLSSSFIiTQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDRSMK 467
Cdd:COG5640 181 ATCAAYGGFD-GGTMLCAGYPEGGKDACQGDSGGPLVVKDGGGWVLVGVVSWGGGPCAAGYPGVYTRVSAYRDWIKSTAG 259

                ...
gi 4503625  468 TRG 470
Cdd:COG5640 260 GLG 262
GLA smart00069
Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in ...
25-85 1.14e-27

Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.


Pssm-ID: 214503  Cd Length: 65  Bit Score: 105.08  E-value: 1.14e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 4503625      25 FIRREQANNILARVTRANSF-LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYK 85
Cdd:smart00069   4 FLSRQEANKVLRRQRRANAFlLEELRPGNLERECQEEICSLEEAREVFEDNEGTDEFYRRYY 65
Gla pfam00594
Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain; This domain is ...
45-85 8.56e-25

Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain; This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxylation to form gamma-carboxyglutamate (Gla).


Pssm-ID: 459861  Cd Length: 41  Bit Score: 96.45  E-value: 8.56e-25
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 4503625     45 LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYK 85
Cdd:pfam00594   1 LEELKPGNLERECYEEICSYEEAREIFEDDEKTMEFWKKYT 41
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
129-164 5.39e-10

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 54.56  E-value: 5.39e-10
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 4503625    129 CSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKAC 164
Cdd:pfam14670   1 CSVNNGGCSHLCLNTPGGYTCSCPEGYELQDDGRTC 36
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
86-122 1.14e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.71  E-value: 1.14e-08
                        10        20        30
                ....*....|....*....|....*....|....*...
gi 4503625   86 DGDQCET-SPCQNQGKCKDGLGEYTCTCLEGFEGKNCE 122
Cdd:cd00054   1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
86-122 4.43e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 46.09  E-value: 4.43e-07
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 4503625      86 DGDQCET-SPCQNQGKCKDGLGEYTCTCLEGFE-GKNCE 122
Cdd:smart00179   1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
90-120 2.67e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.91  E-value: 2.67e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 4503625     90 CETSPCQNQGKCKDGLGEYTCTCLEGFEGKN 120
Cdd:pfam00008   1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
113-163 2.29e-03

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 39.68  E-value: 2.29e-03
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|.
gi 4503625  113 LEGFEGKNCElfTRKLCSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKA 163
Cdd:cd01475 176 TKKFQGKICV--VPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
235-464 1.65e-95

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 288.41  E-value: 1.65e-95
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  235 IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHN 312
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHCVYssAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  313 RFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDwaeSTLMTQKTGIVSGFGRTHEKGRQSTRLKMLEVPYVDRNSCK- 391
Cdd:cd00190  81 NYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSG---YNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNAECKr 157
                       170       180       190       200       210       220       230
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4503625  392 -LSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDR 464
Cdd:cd00190 158 aYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
234-462 7.69e-93

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 281.49  E-value: 7.69e-93
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625     234 RIVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGGEaVHEVEVVIKH 311
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHCVRgsDPSNIRVRLGSHDLSSGEEGQ-VIKVSKVIIH 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625     312 NRFTKETYDFDIAVLRLKTPITFRMNVAPACLPERDwaeSTLMTQKTGIVSGFGRTHE-KGRQSTRLKMLEVPYVDRNSC 390
Cdd:smart00020  80 PNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSN---YNVPAGTTCTVSGWGRTSEgAGSLPDTLQEVNVPIVSNATC 156
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4503625     391 K--LSSSFIITQNMFCAGYDTKQEDACQGDSGGPHVTRfKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWI 462
Cdd:smart00020 157 RraYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVCN-DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
235-462 3.30e-81

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 251.21  E-value: 3.30e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625    235 IVGGQECKDGECPWQALLINEENEGFCGGTILSEFYILTAAHCLYQAKRFKVRVGDRNTEQEEGGEAVHEVEVVIKHNRF 314
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGKHFCGGSLISENWVLTAAHCVSGASDVKVVLGAHNIVLREGGEQKFDVEKIIVHPNY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625    315 TKETYDFDIAVLRLKTPITFRMNVAPACLPErdwAESTLMTQKTGIVSGFGRTHEKGRqSTRLKMLEVPYVDRNSCKLSS 394
Cdd:pfam00089  81 NPDTLDNDIALLKLESPVTLGDTVRPICLPD---ASSDLPVGTTCTVSGWGNTKTLGP-SDTLQEVTVPVVSRETCRSAY 156
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 4503625    395 SFIITQNMFCAGYDTKqeDACQGDSGGPHVTRFKdtyFVTGIVSWGEGCARKGKYGIYTKVTAFLKWI 462
Cdd:pfam00089 157 GGTVTDTMICAGAGGK--DACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
233-470 1.06e-61

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 202.19  E-value: 1.06e-61
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  233 TRIVGGQECKDGECPWQALLINEE--NEGFCGGTILSEFYILTAAHCLY--QAKRFKVRVGDRNTEQEEGgeAVHEVEVV 308
Cdd:COG5640  29 PAIVGGTPATVGEYPWMVALQSSNgpSGQFCGGTLIAPRWVLTAAHCVDgdGPSDLRVVIGSTDLSTSGG--TVVKVARI 106
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  309 IKHNRFTKETYDFDIAVLRLKTPITfrmNVAPACLPErdwAESTLMTQKTGIVSGFGRTHE-KGRQSTRLKMLEVPYVDR 387
Cdd:COG5640 107 VVHPDYDPATPGNDIALLKLATPVP---GVAPAPLAT---SADAAAPGTPATVAGWGRTSEgPGSQSGTLRKADVPVVSD 180
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  388 NSCKLSSSFIiTQNMFCAGYDTKQEDACQGDSGGPHVTRFKDTYFVTGIVSWGEGCARKGKYGIYTKVTAFLKWIDRSMK 467
Cdd:COG5640 181 ATCAAYGGFD-GGTMLCAGYPEGGKDACQGDSGGPLVVKDGGGWVLVGVVSWGGGPCAAGYPGVYTRVSAYRDWIKSTAG 259

                ...
gi 4503625  468 TRG 470
Cdd:COG5640 260 GLG 262
GLA smart00069
Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in ...
25-85 1.14e-27

Domain containing Gla (gamma-carboxyglutamate) residues; A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.


Pssm-ID: 214503  Cd Length: 65  Bit Score: 105.08  E-value: 1.14e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 4503625      25 FIRREQANNILARVTRANSF-LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYK 85
Cdd:smart00069   4 FLSRQEANKVLRRQRRANAFlLEELRPGNLERECQEEICSLEEAREVFEDNEGTDEFYRRYY 65
Gla pfam00594
Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain; This domain is ...
45-85 8.56e-25

Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain; This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxylation to form gamma-carboxyglutamate (Gla).


Pssm-ID: 459861  Cd Length: 41  Bit Score: 96.45  E-value: 8.56e-25
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 4503625     45 LEEMKKGHLERECMEETCSYEEAREVFEDSDKTNEFWNKYK 85
Cdd:pfam00594   1 LEELKPGNLERECYEEICSYEEAREIFEDDEKTMEFWKKYT 41
eMpr COG3591
V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, ...
252-464 6.34e-11

V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 442810 [Multi-domain]  Cd Length: 194  Bit Score: 61.62  E-value: 6.34e-11
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  252 LINEENEGFCGGTILSEFYILTAAHCLYQ------AKRFKVRVGDRNTEqeegGEAVHEVEVVIKHNRFTKETYDFDIAV 325
Cdd:COG3591   5 LETDGGGGVCTGTLIGPNLVLTAGHCVYDgagggwATNIVFVPGYNGGP----YGTATATRFRVPPGWVASGDAGYDYAL 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4503625  326 LRLKTPIT-----FRMNVAPACLPERDWAestlmtqktgiVSGFGRTHEKgrqstRLKMlevpyvdRNSCKLSSsfiITQ 400
Cdd:COG3591  81 LRLDEPLGdttgwLGLAFNDAPLAGEPVT-----------IIGYPGDRPK-----DLSL-------DCSGRVTG---VQG 134
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 4503625  401 NMFcaGYDTkqeDACQGDSGGPHVTRFKDTYFVTGIVSWG-EGCARKGKYGIYTKVTAFLKWIDR 464
Cdd:COG3591 135 NRL--SYDC---DTTGGSSGSPVLDDSDGGGRVVGVHSAGgADRANTGVRLTSAIVAALRAWASA 194
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
129-164 5.39e-10

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 54.56  E-value: 5.39e-10
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 4503625    129 CSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKAC 164
Cdd:pfam14670   1 CSVNNGGCSHLCLNTPGGYTCSCPEGYELQDDGRTC 36
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
86-122 1.14e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.71  E-value: 1.14e-08
                        10        20        30
                ....*....|....*....|....*....|....*...
gi 4503625   86 DGDQCET-SPCQNQGKCKDGLGEYTCTCLEGFEGKNCE 122
Cdd:cd00054   1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
86-122 4.43e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 46.09  E-value: 4.43e-07
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 4503625      86 DGDQCET-SPCQNQGKCKDGLGEYTCTCLEGFE-GKNCE 122
Cdd:smart00179   1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
90-120 2.67e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.91  E-value: 2.67e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 4503625     90 CETSPCQNQGKCKDGLGEYTCTCLEGFEGKN 120
Cdd:pfam00008   1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
90-122 3.06e-05

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 40.92  E-value: 3.06e-05
                        10        20        30
                ....*....|....*....|....*....|....*
gi 4503625   90 CETS-PCQNQGKCKDGLGEYTCTCLEGFEG-KNCE 122
Cdd:cd00053   2 CAASnPCSNGGTCVNTPGSYRCVCPPGYTGdRSCE 36
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
113-163 2.29e-03

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 39.68  E-value: 2.29e-03
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|.
gi 4503625  113 LEGFEGKNCElfTRKLCSLDNGDCDQFCHEEQNSVVCSCARGYTLADNGKA 163
Cdd:cd01475 176 TKKFQGKICV--VPDLCATLSHVCQQVCISTPGSYLCACTEGYALLEDNKT 224
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
95-116 2.30e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.39  E-value: 2.30e-03
                          10        20
                  ....*....|....*....|..
gi 4503625     95 CQNQGKCKDGLGEYTCTCLEGF 116
Cdd:pfam12661   1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF smart00181
Epidermal growth factor-like domain;
91-122 7.19e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 34.41  E-value: 7.19e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 4503625      91 ETSPCQNqGKCKDGLGEYTCTCLEGFEG-KNCE 122
Cdd:smart00181   4 SGGPCSN-GTCINTPGSYTCSCPPGYTGdKRCE 35
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH