Conserved domains on  [gi|145701025|ref|NP_001401|]

multiple epidermal growth factor-like domains protein 8 isoform 2 precursor [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
49-139 2.93e-24

Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 99.79  E-value: 2.93e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025   49 GNYSVNGNCEWLIEAPsPQHRILLDFLFLDTE----CTYDYLFVYDGDSPRGPLLASLSGSTRPPPIEASSGKMLLHLFS 124
Cdd:cd00041    20 NNYPNNLNCVWTIEAP-PGYRIRLTFEDFDLEsspnCSYDYLEIYDGPSTSSPLLGRFCGSTLPPPIISSGNSLTVRFRS 98
                          90
                  ....*....|....*
gi 145701025  125 DANYNLLGFNASFRF 139
Cdd:cd00041    99 DSSVTGRGFKATYSA 113
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]
227-531 1.99e-22

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 99.85  E-value: 1.99e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  227 ARIGAAGAFLSppGLLAVFGGQDLNNALGDLVLYNFSANTWESwdLSPAP-AARHSHVAVAWAGSLVLMGG---ELADGS 302
Cdd:COG3055    12 PRSEAAAALLD--GKVYVAGGLSGGSASNSFEVYDPATNTWSE--LAPLPgPPRHHAAAVAQDGKLYVFGGftgANPSST 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  303 LTNDVWAFSPlGRGHWELLAPPASSSsgppglaGHAAALVDDVWLYVSGGRtpHDLFSSGLFrFRLDSTSGGyWEQVIPA 382
Cdd:COG3055    88 PLNDVYVYDP-ATNTWTKLAPMPTPR-------GGATALLLDGKIYVVGGW--DDGGNVAWV-EVYDPATGT-WTQLAPL 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  383 ggrPPAATGHSMVFhAPSRALLVHGGhrpstARFSVRVNStelfhvdrhvWTTLKgrdglQGPRERAFHTASVLGNYMVV 462
Cdd:COG3055   156 ---PTPRDHLAAAV-LPDGKILVIGG-----RNGSGFSNT----------WTTLA-----PLPTARAGHAAAVLGGKILV 211
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 145701025  463 YGGNVHTHyqeekcyeDGIFFYHLGCHQWVSGAELappgtpegraapPSGRYSHVAAVLGGSVLLVAGG 531
Cdd:COG3055   212 FGGESGFS--------DEVEAYDPATNTWTALGEL------------PTPRHGHAAVLTDGKVYVIGGE 260
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]
1490-1791 9.62e-21

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 94.84  E-value: 9.62e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1490 EDGGPGPSPRSfHAAAYVPAGRgaMYLLGGLTAGGVTRDFWVLNLTTLQWrQEKAPqtveLPAVA-GHTLTARRGLSLLL 1568
Cdd:COG3055     4 SSLPDLPTPRS-EAAAALLDGK--VYVAGGLSGGSASNSFEVYDPATNTW-SELAP----LPGPPrHHAAAVAQDGKLYV 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1569 VGGYSPENGFNQQL---LEYQLATGTWVSGAqsgTPPTGLYGHSAVYHEatDSLYVFGGFRFHVELAapSPELYSLhcPD 1645
Cdd:COG3055    76 FGGFTGANPSSTPLndvYVYDPATNTWTKLA---PMPTPRGGATALLLD--GKIYVVGGWDDGGNVA--WVEVYDP--AT 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1646 RTWSLLAPSQGAKRDRMRNVRGSSRGLgqVPGeqpGSWGFREVRKkmalWAALAgtggfleeisPHlkePRPRLFHASAL 1725
Cdd:COG3055   147 GTWTQLAPLPTPRDHLAAAVLPDGKIL--VIG---GRNGSGFSNT----WTTLA----------PL---PTARAGHAAAV 204
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 145701025 1726 LGDTMVVLGGRSDpdeFSSDVLLYQVNCNAWllpdltrsaSVGPPMEESV-AHAVAAVGSRLYISGG 1791
Cdd:COG3055   205 LGGKILVFGGESG---FSDEVEAYDPATNTW---------TALGELPTPRhGHAAVLTDGKVYVIGG 259
EGF_Lam cd00055
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth, migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d), the first three of which resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable, ranging from 3 up to 22 copies.
1143-1192 2.64e-11

Pssm-ID: 238012  Cd Length: 50  Bit Score: 60.83  E-value: 2.64e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 145701025 1143 PCQCNGHGDpRRGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRAGGSC 1192
Cdd:cd00055     1 PCDCNGHGS-LSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGC 49
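The EGF_Lam description above mentions 4 disulfide bonds, which implies 8 cysteines in the domain. A minimal sketch for checking that the aligned query segment carries cysteines in the same columns as the model row follows. The helper names are hypothetical, and the reading of lowercase letters as unaligned residues and `-` as gaps is an assumption about this display format. Matching cysteine columns are consistent with the disulfide pattern but do not by themselves demonstrate bonding.

```python
def count_cysteines(aligned_row):
    """Count cysteines in one alignment row, ignoring case and gap characters."""
    return sum(1 for ch in aligned_row.upper() if ch == "C")


def cysteine_columns(aligned_row):
    """Return the 1-based alignment columns that hold a cysteine."""
    return [i for i, ch in enumerate(aligned_row, start=1) if ch.upper() == "C"]


# Rows copied from the EGF_Lam (cd00055) alignment, query residues 1143-1192.
query = "PCQCNGHGDpRRGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRAGGSC"
model = "PCDCNGHGS-LSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGGGC"

print("query cysteines:", count_cysteines(query))
print("model cysteines:", count_cysteines(model))
print("same columns:", cysteine_columns(query) == cysteine_columns(model))
```

The same two helpers can be pointed at any other alignment block in the listing by pasting in its two rows.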
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
1011-1045 1.86e-06

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.44  E-value: 1.86e-06
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 145701025  1011 CRLGLARCHPRATCLNTPLSYECHCQRGYQGDGIS 1045
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVT 35
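For a short hit such as EGF_3, it can help to see how many aligned positions are actually identical between the query and the model row. The sketch below counts identities over aligned columns only. The function name is hypothetical, and skipping `-` and lowercase characters is an assumption about how this display marks gaps and unaligned residues. This is a plain identity count, not the position-specific score that produced the reported bit score.

```python
def identity(query_row, model_row):
    """Return (matches, aligned_columns) for two equal-length alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    matches = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Skip gap columns and residues shown in lowercase (assumed unaligned).
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            matches += 1
    return matches, aligned


# Rows copied from the EGF_3 (pfam12947) alignment, query residues 1011-1045.
query = "CRLGLARCHPRATCLNTPLSYECHCQRGYQGDGIS"
model = "CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVT"

matches, aligned = identity(query, model)
print(f"{matches}/{aligned} identical ({100.0 * matches / aligned:.1f}%)")
```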
PSI pfam01437
Plexin repeat; A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas some members show 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
883-931 3.23e-03

Pssm-ID: 396154 [Multi-domain]  Cd Length: 52  Bit Score: 38.07  E-value: 3.23e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 145701025   883 CSQRLTCEDCL-ANSSQCAWCQSTHTCFLFAAYlaRYPHGGCRGWDDSVH 931
Cdd:pfam01437    2 CSQYTSCSSCLaARDPYCGWCSSEGRCVRRSAC--GAPEGNCEEWEQASS 49
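Each hit in this listing carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A minimal sketch for turning that line into a record is shown below. The regular expression and the `parse_summary` helper are hypothetical and are written only against the layout seen in this listing, not against any official NCBI output format.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_summary(
    "Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 99.79  E-value: 2.93e-24"
)

# Fraction of the CUB model covered by the aligned model range 20-113.
coverage = (113 - 20 + 1) / hit["cd_length"]
print(hit, round(coverage, 3))
```

Comparing the aligned model range with `cd_length` gives a rough sense of whether a hit spans the whole domain model or only part of it.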
 
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
49-137 9.19e-22

Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 92.07  E-value: 9.19e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025     49 GNYSVNGNCEWLIEAPsPQHRILLDFLFLDTE----CTYDYLFVYDGDSPRGPLLASLSGSTRPPPIEASSG-KMLLHLF 123
Cdd:smart00042   10 QSYPNNLDCVWTIRAP-PGYRIELQFTDFDLEssdnCEYDYVEIYDGPSASSPLLGRFCGSEAPPPVISSSSnSLTLTFV 88
                            90
                    ....*....|....
gi 145701025    124 SDANYNLLGFNASF 137
Cdd:smart00042   89 SDSSVQKRGFSARY 102
CUB pfam00431
CUB domain
30-137 3.96e-20

Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 87.74  E-value: 3.96e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025    30 CKGqrqVLREAPG--FVTDGAGNYSVNGNCEWLIEAPsPQHRILLDFLFLDTE----CTYDYLFVYDGDSPRGPLLASLS 103
Cdd:pfam00431    1 CGG---VLTDSSGsiSSPNYPNPYPPNKDCVWLIRAP-PGFRVKLTFQDFELEdhdeCGYDYVEIRDGPSASSPLLGRFC 76
                           90       100       110
                   ....*....|....*....|....*....|....
gi 145701025   104 GSTRPPPIEASSGKMLLHLFSDANYNLLGFNASF 137
Cdd:pfam00431   77 GSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
Laminin_EGF pfam00053
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
1144-1192 6.27e-09

Pssm-ID: 395007  Cd Length: 49  Bit Score: 53.90  E-value: 6.27e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 145701025  1144 CQCNGHGDPRrGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRA-GGSC 1192
Cdd:pfam00053    1 CDCNPHGSLS-DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDpPQGC 49
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain
1144-1189 1.85e-06

Pssm-ID: 214543  Cd Length: 46  Bit Score: 46.92  E-value: 1.85e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 145701025   1144 CQCNGHGDpRRGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRAG 1189
Cdd:smart00180    1 CDCDPGGS-ASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPG 45
EGF_CA smart00179
Calcium-binding EGF-like domain
1007-1040 1.00e-04

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 41.85  E-value: 1.00e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 145701025   1007 DVDECRLGlARCHPRATCLNTPLSYECHCQRGYQ 1040
Cdd:smart00179    1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33
Kelch_3 pfam13415
Galactose oxidase, central domain
1566-1613 1.18e-04

Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.89  E-value: 1.18e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 145701025  1566 LLLVGGYSPENG-FNQQLLEYQLATGTWvsgAQSGTPPTGLYGHSAVYH 1613
Cdd:pfam13415    4 LYIFGGLGFDGQtRLNDLYVYDLDTNTW---TQIGDLPPPRSGHSATYI 49
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
1007-1042 1.71e-04

Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.08  E-value: 1.71e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 145701025 1007 DVDECRLGlARCHPRATCLNTPLSYECHCQRGYQGD 1042
Cdd:cd00054     1 DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR 35
PLN02153 PLN02153
epithiospecifier protein
1713-1791 8.79e-04

Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 44.21  E-value: 8.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1713 KEPRPRLFHASALLGDTMVVLGGRSDPDE-FSSDVLLYQVNCNAWLLPdltrSASVGPPMEESVAHAVAAVGSRLYISGG 1791
Cdd:PLN02153   18 KGPGPRCSHGIAVVGDKLYSFGGELKPNEhIDKDLYVFDFNTHTWSIA----PANGDVPRISCLGVRMVAVGTKLYIFGG 93
Kelch_4 pfam13418
Galactose oxidase, central domain
227-276 9.44e-04

Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 39.13  E-value: 9.44e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 145701025   227 ARIGAAgAFLSPPGLLAVFGGQDLNN-ALGDLVLYNFSANTWESwdLSPAP 276
Cdd:pfam13418    1 PRAYHT-STSIPDDTIYLFGGEGEDGtLLSDLWVFDLSTNEWTR--LGSLP 48
PLN02153 PLN02153
epithiospecifier protein
337-465 6.06e-03

Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 41.51  E-value: 6.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  337 HAAALVDDVwLYVSGGR-TPHDLFSSGLFRFRLDSTSggyWeQVIPAGGRPPAATGHSMVFHAPSRALLVHGGhRPSTAR 415
Cdd:PLN02153   26 HGIAVVGDK-LYSFGGElKPNEHIDKDLYVFDFNTHT---W-SIAPANGDVPRISCLGVRMVAVGTKLYIFGG-RDEKRE 99
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 145701025  416 FSvrvnstELFHVD--RHVWTTLKGRDGLQGPRERAFHTASVLGNYMVVYGG 465
Cdd:PLN02153  100 FS------DFYSYDtvKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGG 145
 
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]
1473-1624 6.34e-14

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 74.81  E-value: 6.34e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1473 YRYSVSERRWTQMlagaedgGPGPSPRSFHAAAyVPAGRGamYLLGGLTAGGVTRDFWVLNLTTLQWRQeKAPQTVELPA 1552
Cdd:COG3055    93 YVYDPATNTWTKL-------APMPTPRGGATAL-LLDGKI--YVVGGWDDGGNVAWVEVYDPATGTWTQ-LAPLPTPRDH 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1553 VAGHTL---------------------------TARRGLS-------LLLVGGyspENGFNQQLLEYQLATGTWVsgaQS 1598
Cdd:COG3055   162 LAAAVLpdgkilviggrngsgfsntwttlaplpTARAGHAaavlggkILVFGG---ESGFSDEVEAYDPATNTWT---AL 235
                         170       180
                  ....*....|....*....|....*.
gi 145701025 1599 GTPPTGLYGHSAVYHEatDSLYVFGG 1624
Cdd:COG3055   236 GELPTPRHGHAAVLTD--GKVYVIGG 259
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]
211-313 1.59e-09

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 61.71  E-value: 1.59e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  211 GAGWWHNVSARDPAFSARIGAAGAFLSppGLLAVFGGQdlNNALGDLVLYNFSANTWESwdLSPAPAARHSHVAVAWAGS 290
Cdd:COG3055   180 GSGFSNTWTTLAPLPTARAGHAAAVLG--GKILVFGGE--SGFSDEVEAYDPATNTWTA--LGELPTPRHGHAAVLTDGK 253
                          90       100
                  ....*....|....*....|...
gi 145701025  291 LVLMGGELADGSLTNDVWAFSPL 313
Cdd:COG3055   254 VYVIGGETKPGVRTPLVTSAEVY 276
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]
223-426 1.32e-08

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 58.63  E-value: 1.32e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  223 PAFSARIGAAGAFLsPPGLLAVFGGQDLNNALGDLVLYNFSANTWESwdLSPAPAARHSH-VAVAWAGSLVLMGGelADG 301
Cdd:COG3055   106 APMPTPRGGATALL-LDGKIYVVGGWDDGGNVAWVEVYDPATGTWTQ--LAPLPTPRDHLaAAVLPDGKILVIGG--RNG 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  302 SLTNDVW---AFSPLGRghwellappassssgppglAGHAAALVDDVwLYVSGGRTPhdlFSSGLFRFrlDSTSGGyWEQ 378
Cdd:COG3055   181 SGFSNTWttlAPLPTAR-------------------AGHAAAVLGGK-ILVFGGESG---FSDEVEAY--DPATNT-WTA 234
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 145701025  379 V--IPAGGRPPAATGHSmvfhapSRALLVHGGHRPSTArfSVRVNSTELF 426
Cdd:COG3055   235 LgeLPTPRHGHAAVLTD------GKVYVIGGETKPGVR--TPLVTSAEVY 276
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
334-534 2.18e-08

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 58.24  E-value: 2.18e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  334 LAGHAAALVDDVwLYVSGGR---TPHDLFssglfrFRLDSTSGGyWEQVIPAggrPPAATGHSMVFhAPSRALLVHGGhR 410
Cdd:COG3055    13 RSEAAAALLDGK-VYVAGGLsggSASNSF------EVYDPATNT-WSELAPL---PGPPRHHAAAV-AQDGKLYVFGG-F 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  411 PSTARFSVRVNSTELFHVDRHVWTTLKgrdglQGPRERAFHTASVLGNYMVVYGG---NVHTHYQEEkcyedgiffYHLG 487
Cdd:COG3055    80 TGANPSSTPLNDVYVYDPATNTWTKLA-----PMPTPRGGATALLLDGKIYVVGGwddGGNVAWVEV---------YDPA 145
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 145701025  488 CHQWVSGAELappgtpegraapPSGRYSHVAAVLGGSVLLVAGGYSG 534
Cdd:COG3055   146 TGTWTQLAPL------------PTPRDHLAAAVLPDGKILVIGGRNG 180
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1144-1189 1.85e-06

Pssm-ID: 214543  Cd Length: 46  Bit Score: 46.92  E-value: 1.85e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*.
gi 145701025   1144 CQCNGHGDpRRGHCDNLSGLCFCQDHTEGAHCQLCSPGYYGDPRAG 1189
Cdd:smart00180    1 CDCDPGGS-ASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPG 45
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1011-1045 1.86e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.44  E-value: 1.86e-06
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 145701025  1011 CRLGLARCHPRATCLNTPLSYECHCQRGYQGDGIS 1045
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVT 35
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
1715-1799 2.38e-05

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 48.61  E-value: 2.38e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1715 PRPRLFHASALLGDTMVVLGGRSDpDEFSSDVLLYQVNCNAWL-LPDLtrsasvgpPMEESVAHAVAAVGSRLYISGGFG 1793
Cdd:COG3055    10 PTPRSEAAAALLDGKVYVAGGLSG-GSASNSFEVYDPATNTWSeLAPL--------PGPPRHHAAAVAQDGKLYVFGGFT 80

                  ....*.
gi 145701025 1794 GVALGR 1799
Cdd:COG3055    81 GANPSS 86
EGF_CA smart00179
Calcium-binding EGF-like domain;
1007-1040 1.00e-04

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 41.85  E-value: 1.00e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 145701025   1007 DVDECRLGlARCHPRATCLNTPLSYECHCQRGYQ 1040
Cdd:smart00179    1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPGYT 33
Kelch_3 pfam13415
Galactose oxidase, central domain;
1566-1613 1.18e-04

Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 41.89  E-value: 1.18e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 145701025  1566 LLLVGGYSPENG-FNQQLLEYQLATGTWvsgAQSGTPPTGLYGHSAVYH 1613
Cdd:pfam13415    4 LYIFGGLGFDGQtRLNDLYVYDLDTNTW---TQIGDLPPPRSGHSATYI 49
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1007-1042 1.71e-04

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.08  E-value: 1.71e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 145701025 1007 DVDECRLGlARCHPRATCLNTPLSYECHCQRGYQGD 1042
Cdd:cd00054     1 DIDECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGR 35
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
403-543 7.65e-04

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 43.99  E-value: 7.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  403 LLVHGGHRPSTArfsvrVNSTELFHVDRHVWTTLKGrdglqGPRERAFHTASVL-GNYMVVYGGNVHTHYQEEkcYEDGI 481
Cdd:COG3055    25 VYVAGGLSGGSA-----SNSFEVYDPATNTWSELAP-----LPGPPRHHAAAVAqDGKLYVFGGFTGANPSST--PLNDV 92
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 145701025  482 FFYHLGCHQWVSGAELappgtpegraapPSGRYSHVAAVLGGSVLLVAGGYSGRPRGDLMAY 543
Cdd:COG3055    93 YVYDPATNTWTKLAPM------------PTPRGGATALLLDGKIYVVGGWDDGGNVAWVEVY 142
PLN02153 PLN02153
epithiospecifier protein
1713-1791 8.79e-04

Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 44.21  E-value: 8.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1713 KEPRPRLFHASALLGDTMVVLGGRSDPDE-FSSDVLLYQVNCNAWLLPdltrSASVGPPMEESVAHAVAAVGSRLYISGG 1791
Cdd:PLN02153   18 KGPGPRCSHGIAVVGDKLYSFGGELKPNEhIDKDLYVFDFNTHTWSIA----PANGDVPRISCLGVRMVAVGTKLYIFGG 93
Kelch_4 pfam13418
Galactose oxidase, central domain;
227-276 9.44e-04

Pssm-ID: 433191 [Multi-domain]  Cd Length: 49  Bit Score: 39.13  E-value: 9.44e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 145701025   227 ARIGAAgAFLSPPGLLAVFGGQDLNN-ALGDLVLYNFSANTWESwdLSPAP 276
Cdd:pfam13418    1 PRAYHT-STSIPDDTIYLFGGEGEDGtLLSDLWVFDLSTNEWTR--LGSLP 48
EGF_CA pfam07645
Calcium-binding EGF domain;
1007-1038 2.37e-03

Pssm-ID: 429571  Cd Length: 32  Bit Score: 37.60  E-value: 2.37e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 145701025  1007 DVDECRLGLARCHPRATCLNTPLSYECHCQRG 1038
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
PSI pfam01437
Plexin repeat; A cysteine rich repeat found in several different extracellular receptors. The ...
883-931 3.23e-03

Plexin repeat; A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas some members show 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).


Pssm-ID: 396154 [Multi-domain]  Cd Length: 52  Bit Score: 38.07  E-value: 3.23e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 145701025   883 CSQRLTCEDCL-ANSSQCAWCQSTHTCFLFAAYlaRYPHGGCRGWDDSVH 931
Cdd:pfam01437    2 CSQYTSCSSCLaARDPYCGWCSSEGRCVRRSAC--GAPEGNCEEWEQASS 49
NanM COG3055
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
1715-1794 4.75e-03

Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  Bit Score: 41.68  E-value: 4.75e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025 1715 PRPRLFH-ASALLGDTMVVLGGRSDPD---EFSSDVLLYQVNCNAWllpdltrsaSVGPPMEESVAHAVAAV-GSRLYIS 1789
Cdd:COG3055    57 PGPPRHHaAAVAQDGKLYVFGGFTGANpssTPLNDVYVYDPATNTW---------TKLAPMPTPRGGATALLlDGKIYVV 127

                  ....*
gi 145701025 1790 GGFGG 1794
Cdd:COG3055   128 GGWDD 132
Kelch_3 pfam13415
Galactose oxidase, central domain;
240-286 4.97e-03

Pssm-ID: 433188 [Multi-domain]  Cd Length: 49  Bit Score: 37.27  E-value: 4.97e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 145701025   240 GLLAVFGGQDL--NNALGDLVLYNFSANTWESwdLSPAPAARHSHVAVA 286
Cdd:pfam13415    2 DKLYIFGGLGFdgQTRLNDLYVYDLDTNTWTQ--IGDLPPPRSGHSATY 48
PLN02153 PLN02153
epithiospecifier protein
337-465 6.06e-03

Pssm-ID: 177814 [Multi-domain]  Cd Length: 341  Bit Score: 41.51  E-value: 6.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145701025  337 HAAALVDDVwLYVSGGR-TPHDLFSSGLFRFRLDSTSggyWeQVIPAGGRPPAATGHSMVFHAPSRALLVHGGhRPSTAR 415
Cdd:PLN02153   26 HGIAVVGDK-LYSFGGElKPNEHIDKDLYVFDFNTHT---W-SIAPANGDVPRISCLGVRMVAVGTKLYIFGG-RDEKRE 99
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 145701025  416 FSvrvnstELFHVD--RHVWTTLKGRDGLQGPRERAFHTASVLGNYMVVYGG 465
Cdd:PLN02153  100 FS------DFYSYDtvKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGG 145
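Each hit in the list above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" tag. The following is a minimal Python sketch of how such a line could be read into a record. The function name `parse_summary`, the dictionary keys, and the regular expression are illustrative assumptions based only on the lines shown on this page; they are not part of any NCBI tool or API.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict for a 'Pssm-ID: ...' summary line, or None if no match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Example with a line copied from the hit list above.
record = parse_summary(
    "Pssm-ID: 442289 [Multi-domain]  Cd Length: 277  "
    "Bit Score: 61.71  E-value: 1.59e-09"
)
print(record)
```

Lines that do not match the pattern, such as description or alignment rows, yield `None`, so the function can be applied to every line of a saved page and the non-matches skipped.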
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
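The E-value threshold of 0.01 in the preset options explains why every hit listed on this page has an E-value below that value. As a rough illustration, the Python sketch below applies such a cutoff to a handful of hits copied from the list above. The `Hit` type and `filter_hits` function are hypothetical names introduced here, and treating the threshold as an inclusive upper bound is an assumption, not a statement about how CD-Search applies it internally.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first residue of the interval on the query
    end: int        # last residue of the interval on the query
    e_value: float


# A subset of the hits reported on this page (values copied from the list).
HITS = [
    Hit("NanM", "COG3055", 211, 313, 1.59e-09),
    Hit("Laminin_EGF", "pfam00053", 1144, 1192, 6.27e-09),
    Hit("EGF_Lam", "smart00180", 1144, 1189, 1.85e-06),
    Hit("EGF_CA", "cd00054", 1007, 1042, 1.71e-04),
    Hit("PSI", "pfam01437", 883, 931, 3.23e-03),
    Hit("PLN02153", "PLN02153", 337, 465, 6.06e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value does not exceed the threshold (assumed inclusive)."""
    return [hit for hit in hits if hit.e_value <= threshold]


# A stricter cutoff than the page default keeps only the strongest hits.
for hit in filter_hits(HITS, threshold=1e-5):
    print(f"{hit.name} {hit.accession} {hit.start}-{hit.end} {hit.e_value:.2e}")
```

With the page's default of 0.01 all six sample hits are retained; tightening the cutoff is a simple way to focus on the more confident domain assignments.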

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.