NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034654849|ref|XP_016867439|]
View 

contactin-associated protein-like 2 isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
FA58C cd00057
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached ...
70-180 1.55e-33

Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.


:

Pssm-ID: 238014 [Multi-domain]  Cd Length: 143  Bit Score: 125.54  E-value: 1.55e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  70 GAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGR--YSSSDWVTQYRMLYSDTGRNWKPYHQDGNIWAFPGNINSDGVVR 147
Cdd:cd00057    33 SDNAWTPAVNDPPQWLQVDLGKTRRVTGIQTQGRkgGGSSEWVTSYKVQYSLDGETWTTYKDKGEEKVFTGNSDGSTPVT 112
                          90       100       110
                  ....*....|....*....|....*....|...
gi 1034654849 148 HELQHPIIARYVRIVPLDWNgeGRIGLRIEVYG 180
Cdd:cd00057   113 NDFPPPIVARYIRILPTTWN--GNISLRLELYG 143
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
216-345 2.53e-30

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


:

Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 115.59  E-value: 2.53e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 216 FKTSESEGVILHGEGQQGDYITLELKKAKLVLSLNLGSnqlGPIyghTSVMTGSLLDDHHWHSVVIERQGRSINLTLDRS 295
Cdd:pfam02210   1 FRTRQPNGLLLYAGGGGSDFLALELVNGRLVLRYDLGS---GPE---SLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQ 74
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1034654849 296 MQHFRTN-GEFDYLDLDYEITFGGIP-FSGKPSSSSRKNFKGCMESINYNGV 345
Cdd:pfam02210  75 TVVSSLPpGESLLLNLNGPLYLGGLPpLLLLPALPVRAGFVGCIRDVRVNGE 126
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
375-527 4.98e-25

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


:

Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 101.34  E-value: 4.98e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 375 PVFFNATSYLEVPGRLNQDL-FSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQtkmSQIDISSGSGL 453
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTrLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGS---GSLVLSSKTPL 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849 454 NDGQWHEVRFLAKENFAILTIDGDeaSAVRTNSP---LQVKTGEKYFFGGfLNQMNNSSHSVLQPSFQGCMQLIQVD 527
Cdd:cd00110    78 NDGQWHSVSVERNGRSVTLSVDGE--RVVESGSPggsALLNLDGPLYLGG-LPEDLKSPGLPVSPGFVGCIRDLKVN 151
GGGWT_bact super family cl49103
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
598-634 1.08e-09

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


The actual alignment was detected with superfamily member NF040941:

Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 54.11  E-value: 1.08e-09
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1034654849 598 SCEAYKHLGQTSN--YYWIDPDGSGPLGPLKVYCNMTED 634
Cdd:NF040941    1 SCWEILQAGPSAPsgVYWIDPDGMGGLAPFQVYCDMTTD 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
555-590 8.39e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.39e-07
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1034654849 555 IDRCVPNH-CEHGGKCSQTWDSFKCTCDEtGYSGATC 590
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPP-GYTGRNC 37
 
Name Accession Description Interval E-value
FA58C cd00057
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached ...
70-180 1.55e-33

Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.


Pssm-ID: 238014 [Multi-domain]  Cd Length: 143  Bit Score: 125.54  E-value: 1.55e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  70 GAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGR--YSSSDWVTQYRMLYSDTGRNWKPYHQDGNIWAFPGNINSDGVVR 147
Cdd:cd00057    33 SDNAWTPAVNDPPQWLQVDLGKTRRVTGIQTQGRkgGGSSEWVTSYKVQYSLDGETWTTYKDKGEEKVFTGNSDGSTPVT 112
                          90       100       110
                  ....*....|....*....|....*....|...
gi 1034654849 148 HELQHPIIARYVRIVPLDWNgeGRIGLRIEVYG 180
Cdd:cd00057   113 NDFPPPIVARYIRILPTTWN--GNISLRLELYG 143
F5_F8_type_C pfam00754
F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.
71-178 4.05e-32

F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.


Pssm-ID: 459925 [Multi-domain]  Cd Length: 127  Bit Score: 121.02  E-value: 4.05e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  71 AGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYS-SSDWVTQYRMLYSDTGRNWKPYHQDGniwaFPGNINSDGVVRHE 149
Cdd:pfam00754  23 NTAWSAWSGDDPQWIQVDLGKPKKITGVVTQGRQDgSNGYVTSYKIEYSLDGENWTTVKDEK----IPGNNDNNTPVTNT 98
                          90       100
                  ....*....|....*....|....*....
gi 1034654849 150 LQHPIIARYVRIVPLDWNGEGRIGLRIEV 178
Cdd:pfam00754  99 FDPPIKARYVRIVPTSWNGGNGIALRAEL 127
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
216-345 2.53e-30

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 115.59  E-value: 2.53e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 216 FKTSESEGVILHGEGQQGDYITLELKKAKLVLSLNLGSnqlGPIyghTSVMTGSLLDDHHWHSVVIERQGRSINLTLDRS 295
Cdd:pfam02210   1 FRTRQPNGLLLYAGGGGSDFLALELVNGRLVLRYDLGS---GPE---SLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQ 74
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1034654849 296 MQHFRTN-GEFDYLDLDYEITFGGIP-FSGKPSSSSRKNFKGCMESINYNGV 345
Cdd:pfam02210  75 TVVSSLPpGESLLLNLNGPLYLGGLPpLLLLPALPVRAGFVGCIRDVRVNGE 126
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
187-343 6.73e-30

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 115.59  E-value: 6.73e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 187 VINFDGHvvLPYRFRNKKMKTLKDVIALNFKTSESEGVILH-GEGQQGDYITLELKKAKLVLSLNLGSnqlgpiyGHTSV 265
Cdd:cd00110     1 GVSFSGS--SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYaGSQNGGDFLALELEDGRLVLRYDLGS-------GSLVL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 266 MTGSLLDDHHWHSVVIERQGRSINLTLD--RSMQHfRTNGEFDYLDLDYEITFGGIPFSGKPSSS-SRKNFKGCMESINY 342
Cdd:cd00110    72 SSKTPLNDGQWHSVSVERNGRSVTLSVDgeRVVES-GSPGGSALLNLDGPLYLGGLPEDLKSPGLpVSPGFVGCIRDLKV 150

                  .
gi 1034654849 343 N 343
Cdd:cd00110   151 N 151
FA58C smart00231
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached ...
34-181 1.38e-29

Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.


Pssm-ID: 214572  Cd Length: 139  Bit Score: 114.14  E-value: 1.38e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849   34 KCDEPLvsGLphVAFSSSSSISGSYSPGYAKINKrGGAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYSSSDWVTqY 113
Cdd:smart00231   1 PCNEPL--GL--ESDSQITASSSYWAAKIARLNG-GSDGGWCPAKNDLPPWIQVDLGRLRTVTGVITGRRHGNGDWVT-Y 74
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034654849  114 RMLYSDTGRNWKPYhQDGNIWAFPGNINSDGVVRHELQHPIIARYVRIVPLDWNgeGRIGLRIEVYGC 181
Cdd:smart00231  75 KLEYSDDGVNWTTY-KDGNSKVFPGNSDAGTVVLNDFPPPIVARYVRILPTGWN--GNIILRVELLGC 139
LamG smart00282
Laminin G domain;
212-345 1.95e-29

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 113.59  E-value: 1.95e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  212 IALNFKTSESEGVILH-GEGQQGDYITLELKKAKLVLSLNLGSNQLGPIYGHTSVmtgsllDDHHWHSVVIERQGRSINL 290
Cdd:smart00282   2 ISFSFRTTSPNGLLLYaGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPL------NDGQWHRVAVERNGRSVTL 75
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849  291 TLDRSM-QHFRTNGEFDYLDLDYEITFGGIPFSGKPSSS-SRKNFKGCMESINYNGV 345
Cdd:smart00282  76 SVDGGNrVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLpVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
375-527 4.98e-25

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 101.34  E-value: 4.98e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 375 PVFFNATSYLEVPGRLNQDL-FSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQtkmSQIDISSGSGL 453
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTrLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGS---GSLVLSSKTPL 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849 454 NDGQWHEVRFLAKENFAILTIDGDeaSAVRTNSP---LQVKTGEKYFFGGfLNQMNNSSHSVLQPSFQGCMQLIQVD 527
Cdd:cd00110    78 NDGQWHSVSVERNGRSVTLSVDGE--RVVESGSPggsALLNLDGPLYLGG-LPEDLKSPGLPVSPGFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
396-529 1.41e-24

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 99.72  E-value: 1.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  396 SVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQTKmsqIDISSGSG-LNDGQWHEVRFLAKENFAILTI 474
Cdd:smart00282   1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGP---ARLTSDPTpLNDGQWHRVAVERNGRSVTLSV 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034654849  475 DGDEASAVRTNSPLQV-KTGEKYFFGGFLNQMNNsSHSVLQPSFQGCMQLIQVDDQ 529
Cdd:smart00282  78 DGGNRVSGESPGGLTIlNLDGPLYLGGLPEDLKL-PPLPVTPGFRGCIRNLKVNGK 132
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
401-529 5.12e-23

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 94.79  E-value: 5.12e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 401 FRTWNPNGLLVFSHFADNlGNVEIDLTESKVGVHINITQTKMSQIdiSSGSGLNDGQWHEVRFLAKENFAILTIDGDEAS 480
Cdd:pfam02210   1 FRTRQPNGLLLYAGGGGS-DFLALELVNGRLVLRYDLGSGPESLL--SSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVV 77
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034654849 481 AVRTNS-PLQVKTGEKYFFGGfLNQMNNSSHSVLQPSFQGCMQLIQVDDQ 529
Cdd:pfam02210  78 SSLPPGeSLLLNLNGPLYLGG-LPPLLLLPALPVRAGFVGCIRDVRVNGE 126
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
598-634 1.08e-09

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 54.11  E-value: 1.08e-09
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1034654849 598 SCEAYKHLGQTSN--YYWIDPDGSGPLGPLKVYCNMTED 634
Cdd:NF040941    1 SCWEILQAGPSAPsgVYWIDPDGMGGLAPFQVYCDMTTD 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
555-590 8.39e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.39e-07
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1034654849 555 IDRCVPNH-CEHGGKCSQTWDSFKCTCDEtGYSGATC 590
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPP-GYTGRNC 37
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
558-589 1.25e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 42.37  E-value: 1.25e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034654849 558 CVPNHCEHGGKCSQTWDSFKCTCDEtGYSGAT 589
Cdd:pfam00008   1 CAPNPCSNGGTCVDTPGGYTCICPE-GYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
555-590 2.14e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 39.15  E-value: 2.14e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034654849  555 IDRCVPNH-CEHGGKCSQTWDSFKCTCDEtGYS-GATC 590
Cdd:smart00179   2 IDECASGNpCQNGGTCVNTVGSYRCECPP-GYTdGRNC 38
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
609-632 3.90e-03

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 39.63  E-value: 3.90e-03
                          10        20
                  ....*....|....*....|....
gi 1034654849 609 SNYYWIDPDGSGPLGPLKVYCNMT 632
Cdd:pfam01410  45 SGEYWIDPNQGCTRDAIKVFCNFE 68
 
Name Accession Description Interval E-value
FA58C cd00057
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached ...
70-180 1.55e-33

Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.


Pssm-ID: 238014 [Multi-domain]  Cd Length: 143  Bit Score: 125.54  E-value: 1.55e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  70 GAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGR--YSSSDWVTQYRMLYSDTGRNWKPYHQDGNIWAFPGNINSDGVVR 147
Cdd:cd00057    33 SDNAWTPAVNDPPQWLQVDLGKTRRVTGIQTQGRkgGGSSEWVTSYKVQYSLDGETWTTYKDKGEEKVFTGNSDGSTPVT 112
                          90       100       110
                  ....*....|....*....|....*....|...
gi 1034654849 148 HELQHPIIARYVRIVPLDWNgeGRIGLRIEVYG 180
Cdd:cd00057   113 NDFPPPIVARYIRILPTTWN--GNISLRLELYG 143
F5_F8_type_C pfam00754
F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.
71-178 4.05e-32

F5/8 type C domain; This domain is also known as the discoidin (DS) domain family.


Pssm-ID: 459925 [Multi-domain]  Cd Length: 127  Bit Score: 121.02  E-value: 4.05e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  71 AGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYS-SSDWVTQYRMLYSDTGRNWKPYHQDGniwaFPGNINSDGVVRHE 149
Cdd:pfam00754  23 NTAWSAWSGDDPQWIQVDLGKPKKITGVVTQGRQDgSNGYVTSYKIEYSLDGENWTTVKDEK----IPGNNDNNTPVTNT 98
                          90       100
                  ....*....|....*....|....*....
gi 1034654849 150 LQHPIIARYVRIVPLDWNGEGRIGLRIEV 178
Cdd:pfam00754  99 FDPPIKARYVRIVPTSWNGGNGIALRAEL 127
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
216-345 2.53e-30

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 115.59  E-value: 2.53e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 216 FKTSESEGVILHGEGQQGDYITLELKKAKLVLSLNLGSnqlGPIyghTSVMTGSLLDDHHWHSVVIERQGRSINLTLDRS 295
Cdd:pfam02210   1 FRTRQPNGLLLYAGGGGSDFLALELVNGRLVLRYDLGS---GPE---SLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQ 74
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1034654849 296 MQHFRTN-GEFDYLDLDYEITFGGIP-FSGKPSSSSRKNFKGCMESINYNGV 345
Cdd:pfam02210  75 TVVSSLPpGESLLLNLNGPLYLGGLPpLLLLPALPVRAGFVGCIRDVRVNGE 126
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
187-343 6.73e-30

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 115.59  E-value: 6.73e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 187 VINFDGHvvLPYRFRNKKMKTLKDVIALNFKTSESEGVILH-GEGQQGDYITLELKKAKLVLSLNLGSnqlgpiyGHTSV 265
Cdd:cd00110     1 GVSFSGS--SYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYaGSQNGGDFLALELEDGRLVLRYDLGS-------GSLVL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 266 MTGSLLDDHHWHSVVIERQGRSINLTLD--RSMQHfRTNGEFDYLDLDYEITFGGIPFSGKPSSS-SRKNFKGCMESINY 342
Cdd:cd00110    72 SSKTPLNDGQWHSVSVERNGRSVTLSVDgeRVVES-GSPGGSALLNLDGPLYLGGLPEDLKSPGLpVSPGFVGCIRDLKV 150

                  .
gi 1034654849 343 N 343
Cdd:cd00110   151 N 151
FA58C smart00231
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached ...
34-181 1.38e-29

Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.


Pssm-ID: 214572  Cd Length: 139  Bit Score: 114.14  E-value: 1.38e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849   34 KCDEPLvsGLphVAFSSSSSISGSYSPGYAKINKrGGAGGWSPSDSDHYQWLQVDFGNRKQISAIATQGRYSSSDWVTqY 113
Cdd:smart00231   1 PCNEPL--GL--ESDSQITASSSYWAAKIARLNG-GSDGGWCPAKNDLPPWIQVDLGRLRTVTGVITGRRHGNGDWVT-Y 74
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034654849  114 RMLYSDTGRNWKPYhQDGNIWAFPGNINSDGVVRHELQHPIIARYVRIVPLDWNgeGRIGLRIEVYGC 181
Cdd:smart00231  75 KLEYSDDGVNWTTY-KDGNSKVFPGNSDAGTVVLNDFPPPIVARYVRILPTGWN--GNIILRVELLGC 139
LamG smart00282
Laminin G domain;
212-345 1.95e-29

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 113.59  E-value: 1.95e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  212 IALNFKTSESEGVILH-GEGQQGDYITLELKKAKLVLSLNLGSNQLGPIYGHTSVmtgsllDDHHWHSVVIERQGRSINL 290
Cdd:smart00282   2 ISFSFRTTSPNGLLLYaGSKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPL------NDGQWHRVAVERNGRSVTL 75
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849  291 TLDRSM-QHFRTNGEFDYLDLDYEITFGGIPFSGKPSSS-SRKNFKGCMESINYNGV 345
Cdd:smart00282  76 SVDGGNrVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLpVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
375-527 4.98e-25

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 101.34  E-value: 4.98e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 375 PVFFNATSYLEVPGRLNQDL-FSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQtkmSQIDISSGSGL 453
Cdd:cd00110     1 GVSFSGSSYVRLPTLPAPRTrLSISFSFRTTSPNGLLLYAGSQNGGDFLALELEDGRLVLRYDLGS---GSLVLSSKTPL 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849 454 NDGQWHEVRFLAKENFAILTIDGDeaSAVRTNSP---LQVKTGEKYFFGGfLNQMNNSSHSVLQPSFQGCMQLIQVD 527
Cdd:cd00110    78 NDGQWHSVSVERNGRSVTLSVDGE--RVVESGSPggsALLNLDGPLYLGG-LPEDLKSPGLPVSPGFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
396-529 1.41e-24

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 99.72  E-value: 1.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849  396 SVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQTKmsqIDISSGSG-LNDGQWHEVRFLAKENFAILTI 474
Cdd:smart00282   1 SISFSFRTTSPNGLLLYAGSKGGGDYLALELRDGRLVLRYDLGSGP---ARLTSDPTpLNDGQWHRVAVERNGRSVTLSV 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034654849  475 DGDEASAVRTNSPLQV-KTGEKYFFGGFLNQMNNsSHSVLQPSFQGCMQLIQVDDQ 529
Cdd:smart00282  78 DGGNRVSGESPGGLTIlNLDGPLYLGGLPEDLKL-PPLPVTPGFRGCIRNLKVNGK 132
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
401-529 5.12e-23

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 94.79  E-value: 5.12e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 401 FRTWNPNGLLVFSHFADNlGNVEIDLTESKVGVHINITQTKMSQIdiSSGSGLNDGQWHEVRFLAKENFAILTIDGDEAS 480
Cdd:pfam02210   1 FRTRQPNGLLLYAGGGGS-DFLALELVNGRLVLRYDLGSGPESLL--SSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVV 77
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1034654849 481 AVRTNS-PLQVKTGEKYFFGGfLNQMNNSSHSVLQPSFQGCMQLIQVDDQ 529
Cdd:pfam02210  78 SSLPPGeSLLLNLNGPLYLGG-LPPLLLLPALPVRAGFVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
216-347 2.07e-18

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 81.98  E-value: 2.07e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 216 FKTSESEGVILHGEGQ-QGDYITLELKKAKLVLSLNLGSnqlgpiyGHTSVMTGSLLDDHHWHSVVIERQGRSINLTLD- 293
Cdd:pfam00054   1 FRTTEPSGLLLYNGTQtERDFLALELRDGRLEVSYDLGS-------GAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDg 73
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034654849 294 RSMQHFRTNGEFDY-LDLDYEITFGGIPFSGKPSS--SSRKNFKGCMESINYNGVNI 347
Cdd:pfam00054  74 EARPTGESPLGATTdLDVDGPLYVGGLPSLGVKKRrlAISPSFDGCIRDVIVNGKPL 130
Laminin_G_1 pfam00054
Laminin G domain;
401-532 6.73e-13

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 66.19  E-value: 6.73e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034654849 401 FRTWNPNGLLVFSHFADNLGNVEIDLTESKVGVHINITQTKMSqidISSGSGLNDGQWHEVRflAKENFAILTIDGDEAS 480
Cdd:pfam00054   1 FRTTEPSGLLLYNGTQTERDFLALELRDGRLEVSYDLGSGAAV---VRSGDKLNDGKWHSVE--LERNGRSGTLSVDGEA 75
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034654849 481 AVRTNSPL----QVKTGEKYFFGGFLNQMNNSSHSVLQPSFQGCMQLIQVDDQLVN 532
Cdd:pfam00054  76 RPTGESPLgattDLDVDGPLYVGGLPSLGVKKRRLAISPSFDGCIRDVIVNGKPLD 131
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
598-634 1.08e-09

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 54.11  E-value: 1.08e-09
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1034654849 598 SCEAYKHLGQTSN--YYWIDPDGSGPLGPLKVYCNMTED 634
Cdd:NF040941    1 SCWEILQAGPSAPsgVYWIDPDGMGGLAPFQVYCDMTTD 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
555-590 8.39e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.39e-07
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1034654849 555 IDRCVPNH-CEHGGKCSQTWDSFKCTCDEtGYSGATC 590
Cdd:cd00054     2 IDECASGNpCQNGGTCVNTVGSYRCSCPP-GYTGRNC 37
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
558-589 1.25e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 42.37  E-value: 1.25e-05
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034654849 558 CVPNHCEHGGKCSQTWDSFKCTCDEtGYSGAT 589
Cdd:pfam00008   1 CAPNPCSNGGTCVDTPGGYTCICPE-GYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
555-590 2.14e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 39.15  E-value: 2.14e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034654849  555 IDRCVPNH-CEHGGKCSQTWDSFKCTCDEtGYS-GATC 590
Cdd:smart00179   2 IDECASGNpCQNGGTCVNTVGSYRCECPP-GYTdGRNC 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
560-590 7.79e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.46  E-value: 7.79e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1034654849 560 PNHCEHGGKCSQTWDSFKCTCDEtGYSGA-TC 590
Cdd:cd00053     5 SNPCSNGGTCVNTPGSYRCVCPP-GYTGDrSC 35
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
609-632 3.90e-03

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 39.63  E-value: 3.90e-03
                          10        20
                  ....*....|....*....|....
gi 1034654849 609 SNYYWIDPDGSGPLGPLKVYCNMT 632
Cdd:pfam01410  45 SGEYWIDPNQGCTRDAIKVFCNFE 68
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH