Conserved domains on  [gi|767974771|ref|XP_011536840|]

stabilin-2 isoform X3 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1009-1129 1.23e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 109.65  E-value: 1.23e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1009 IFNRWINNASLQPTLSAT-SNLTVLVPSQQATEDMDQDEKSFWLSQSN-IPALIKYHMLLGTYRVADLQTLSSsdmlATS 1086
Cdd:pfam02469    5 TFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKDKEqLKNLLKYHVVPGRLTSSDLKNGGT----LAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 767974771  1087 LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1129
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
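
The aligned row pairs in these hits can be compared column by column to estimate how closely a query segment matches the domain model. The sketch below is a minimal illustration, not part of the CD-Search output. It assumes that `-` marks a gap and that lowercase letters mark unaligned residues; the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical, compared) column counts for two alignment rows.

    Assumptions (not stated on the page itself): '-' is a gap and
    lowercase letters are unaligned residues; both are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    compared = 0
    for q, s in zip(query, subject):
        # Skip gap columns and columns flagged as unaligned (lowercase).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        compared += 1
        if q == s:
            identical += 1
    return identical, compared


if __name__ == "__main__":
    # Second alignment block of the first Fasciclin hit (residues 1087-1129).
    query_row = "LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP"
    model_row = "LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP"
    same, total = aligned_identity(query_row, model_row)
    print(f"{same}/{total} identical aligned columns")
```

Percent identity computed this way is only a rough guide; the bit score and E-value reported for each hit remain the primary measures of significance.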
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1617-1726 4.22e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.11  E-value: 4.22e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1617 QEHFVKDLVGP-GPFTVFAPLSAAFDE--EARVKDW-DKYGLMPQVLRYHVVAChQLLLENLKLISNATSLQGEPIVISV 1692
Cdd:pfam02469   12 AAGLVDTLNGSqGPFTVFAPTNEAFAKlpAGTLNFLlKDKEQLKNLLKYHVVPG-RLTSSDLKNGGTLATLQGSKLRVNV 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 767974771  1693 SQSTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1726
Cdd:pfam02469   91 TGGSVTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1752-1882 4.26e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.11  E-value: 4.26e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1752 GYIKFSNLIQDSGLLSVITDPiHTPVTLFWPTDQALHALPAEQQDFLFNqdNKDKLKEYLKFHVIRDaKVLAVDLPTSTA 1831
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNGS-QGPFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPG-RLTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767974771  1832 WKTLQGSELSVKCGAGRdigdLFLNGqtCRIVQRELLFDLGVAYGIDCLLI 1882
Cdd:pfam02469   78 LATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
526-654 2.89e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.94  E-value: 2.89e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   526 PRYSKFRSLLEETNLGHALDEDGvgGPYTIFVPNNEALNNMKDGTLDYLLSPegSRKLLELVRYHIVPfTQLEVATLIST 605
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQ--GPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767974771   606 PHIRSMANQLIQFNTTdNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 654
Cdd:pfam02469   76 GTLATLQGSKLRVNVT-GGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1148-1265 4.12e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 93.86  E-value: 4.12e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1148 PDYSIFRGYIIQYNLANAIEAADA-YTVFAPNNNAIENYIRE-----KKVLSLEEDVLRYHVVlEEKLLKNDLHNGMHRE 1221
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGtlnflLKDKEQLKNLLKYHVV-PGRLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 767974771  1222 TMLGFSyfLSFFLHNDQLYVNEAPINYTNVATDKGVIHGLGKVL 1265
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
382-505 3.87e-20

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 88.08  E-value: 3.87e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   382 GRLTSFISLLDKA-YAWPL-SKLGPFTVLLPTDKG---LKGFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDMFYT 456
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLnGSQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767974771   457 LTGKSGEIFNSDKDNQIKLklhggkkkVKIIQGDIIASNGLLHILDRAM 505
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNG--------ARVVQADIEATNGVIHVIDKVL 121
Link_Domain super family cl02612
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2198-2238 1.88e-16

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates in which other CSPGs substitute for aggrecan might contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high-affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain comprised of a link module flanked with N- and C-terminal extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) which contains link modules 3 and 4, which lack HA-binding activity. HAPLNs contain two contiguous link modules.


The actual alignment was detected with superfamily member cd03515:

Pssm-ID: 470631  Cd Length: 93  Bit Score: 76.35  E-value: 1.88e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 767974771 2198 GVFHLRSPLGQYKLTFDKAREACANEAATMATYNQLSYAQK 2238
Cdd:cd03515     1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQ 41
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1475-1511 3.67e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.67e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1475 CEISNGGCSAKADCKRTtPGRRVCTCKAGYTGDGIVC 1511
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
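
Each hit carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, optionally tagged `[Multi-domain]`. The following is a minimal sketch of how such a line could be parsed when post-processing a saved copy of this page. The regular expression and the function name `parse_summary` are illustrative assumptions based only on the lines shown here, not an NCBI-provided parser.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line: str) -> dict:
    """Parse one hit summary line into typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        # The tag group is None when no bracketed tag is present.
        "multi_domain": match["tag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.67e-07"
    print(parse_summary(line))
```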
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2127-2164 1.41e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.44  E-value: 1.41e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 767974771  2127 CADGlNGGCHEHATCKMTgPGKHKCECKSHYVGDGLNC 2164
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1559-1595 2.61e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.67  E-value: 2.61e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1559 CLTKNGGCSEFAICNHTGQvERTCTCKPNYIGDGFTC 1595
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
837-865 1.55e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.55e-05
                           10        20
                   ....*....|....*....|....*....
gi 767974771   837 CHIHATCEYSNGTASCICKAGYEGDGTLC 865
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1517-1553 6.08e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 6.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1517 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1553
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2086-2121 1.32e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.05  E-value: 1.32e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 767974771  2086 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGHSC 2121
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
326-361 4.38e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   326 CKSDN-PCHRNANCTTVAPGRTeCICQKGYVGDGLTC 361
Cdd:pfam12947    1 CSDNNgGCHPNATCTNTGGSFT-CTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
246-275 6.00e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 6.00e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767974771   246 CHPHAHCTYLgPNRHSCTCQEGYRGDGQVC 275
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
957-993 8.38e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   957 CLEQTGKCHPLASCQSTSSGVwSCVCQEGYEGDGFLC 993
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1437-1469 1.54e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.96  E-value: 1.54e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771  1437 NGTCHTSANClTNSDGTASCKCAAGFQGNGTIC 1469
Cdd:pfam12947    5 NGGCHPNATC-TNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
876-908 2.18e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.18e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771   876 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 908
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
920-951 5.02e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 5.02e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 767974771   920 GGCHDNASCLYVgPGQNECECKKGFRGNGIDC 951
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
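
The hits above are ordered by E-value, which makes the domain order along the protein hard to read. As a rough sketch, the intervals can be sorted by start position to recover the N- to C-terminal architecture. The tuples below are copied from the hit list above; the helper names `architecture` and `overlapping_pairs` are hypothetical, and the three hits reported at superfamily level are labelled simply `EGF_3` here.

```python
# (name, start, end, e_value), copied from the hit list above.
HITS = [
    ("Fasciclin", 1009, 1129, 1.23e-27),
    ("Fasciclin", 1617, 1726, 4.22e-27),
    ("Fasciclin", 1752, 1882, 4.26e-27),
    ("Fasciclin", 526, 654, 2.89e-23),
    ("Fasciclin", 1148, 1265, 4.12e-22),
    ("Fasciclin", 382, 505, 3.87e-20),
    ("Link_Domain", 2198, 2238, 1.88e-16),
    ("EGF_3", 1475, 1511, 3.67e-07),
    ("EGF_3", 2127, 2164, 1.41e-06),
    ("EGF_3", 1559, 1595, 2.61e-06),
    ("EGF_3", 837, 865, 1.55e-05),
    ("EGF_3", 1517, 1553, 6.08e-05),
    ("EGF_3", 2086, 2121, 1.32e-04),
    ("EGF_3", 326, 361, 4.38e-04),
    ("EGF_3", 246, 275, 6.00e-04),
    ("EGF_3", 957, 993, 8.38e-04),
    ("EGF_3", 1437, 1469, 1.54e-03),  # reported at superfamily level
    ("EGF_3", 876, 908, 2.18e-03),    # reported at superfamily level
    ("EGF_3", 920, 951, 5.02e-03),    # reported at superfamily level
]


def architecture(hits):
    """Return hits ordered from N- to C-terminus by start position."""
    return sorted(hits, key=lambda hit: hit[1])


def overlapping_pairs(hits):
    """Return adjacent hit pairs (after sorting) whose intervals overlap."""
    ordered = architecture(hits)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b[1] <= a[2]]


if __name__ == "__main__":
    for name, start, end, e_value in architecture(HITS):
        print(f"{start:>5}-{end:<5} {name:<12} {e_value:.2e}")
```

Sorting by start position is a presentation choice only; it does not change which hits are reported or how significant they are.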
 
Name Accession Description Interval E-value
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1009-1129 1.23e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 109.65  E-value: 1.23e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1009 IFNRWINNASLQPTLSAT-SNLTVLVPSQQATEDMDQDEKSFWLSQSN-IPALIKYHMLLGTYRVADLQTLSSsdmlATS 1086
Cdd:pfam02469    5 TFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKDKEqLKNLLKYHVVPGRLTSSDLKNGGT----LAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 767974771  1087 LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1129
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1617-1726 4.22e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.11  E-value: 4.22e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1617 QEHFVKDLVGP-GPFTVFAPLSAAFDE--EARVKDW-DKYGLMPQVLRYHVVAChQLLLENLKLISNATSLQGEPIVISV 1692
Cdd:pfam02469   12 AAGLVDTLNGSqGPFTVFAPTNEAFAKlpAGTLNFLlKDKEQLKNLLKYHVVPG-RLTSSDLKNGGTLATLQGSKLRVNV 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 767974771  1693 SQSTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1726
Cdd:pfam02469   91 TGGSVTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1752-1882 4.26e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.11  E-value: 4.26e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1752 GYIKFSNLIQDSGLLSVITDPiHTPVTLFWPTDQALHALPAEQQDFLFNqdNKDKLKEYLKFHVIRDaKVLAVDLPTSTA 1831
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNGS-QGPFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPG-RLTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767974771  1832 WKTLQGSELSVKCGAGRdigdLFLNGqtCRIVQRELLFDLGVAYGIDCLLI 1882
Cdd:pfam02469   78 LATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1621-1726 9.35e-24

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 99.98  E-value: 9.35e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771 1621 VKDLVGPGPFTVFAPLSAAFDE--EARVKDWDK---YGLMPQVLRYHVVAcHQLLLENLKLISNATSLQGEPIVISVSQS 1695
Cdd:COG2335    56 VDTLSGEGPFTVFAPTDAAFAAlpAGTLDALLKpenKATLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSGG 134
                          90       100       110
                  ....*....|....*....|....*....|.
gi 767974771 1696 TVYINNkAKIISSDIISTNGIVHIIDKLLSP 1726
Cdd:COG2335   135 GVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
526-654 2.89e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.94  E-value: 2.89e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   526 PRYSKFRSLLEETNLGHALDEDGvgGPYTIFVPNNEALNNMKDGTLDYLLSPegSRKLLELVRYHIVPfTQLEVATLIST 605
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQ--GPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767974771   606 PHIRSMANQLIQFNTTdNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 654
Cdd:pfam02469   76 GTLATLQGSKLRVNVT-GGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1631-1727 2.84e-22

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 93.20  E-value: 2.84e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   1631 TVFAPLSAAFDE-EARVKDWDKYgLMPQVLRYHVVAcHQLLLENLKLISNATSLQGEPIVISVSQ--STVYINNkAKIIS 1707
Cdd:smart00554    1 TVFAPTDEAFQKlPPDLNSLLAD-KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgsGTVTVNG-ARIVE 77
                            90       100
                    ....*....|....*....|
gi 767974771   1708 SDIISTNGIVHIIDKLLSPK 1727
Cdd:smart00554   78 ADIAATNGVVHVIDRVLLPP 97
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1148-1265 4.12e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 93.86  E-value: 4.12e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  1148 PDYSIFRGYIIQYNLANAIEAADA-YTVFAPNNNAIENYIRE-----KKVLSLEEDVLRYHVVlEEKLLKNDLHNGMHRE 1221
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGtlnflLKDKEQLKNLLKYHVV-PGRLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 767974771  1222 TMLGFSyfLSFFLHNDQLYVNEAPINYTNVATDKGVIHGLGKVL 1265
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
504-654 1.19e-21

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 1.19e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  504 AMDKLEPTFESNNEQTIMTMLQ--PRYSKFRSLLEETNLGHALDEDGvggPYTIFVPNNEALNNMKDGTLDYLLSPEGSR 581
Cdd:COG2335    17 ASSAAAEGAAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKA 93
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767974771  582 KLLELVRYHIVPfTQLEVATLISTPHIRSMANQLIQFnTTDNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 654
Cdd:COG2335    94 TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTV-TVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1000-1129 4.89e-21

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 91.89  E-value: 4.89e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771 1000 ELSFLSEAaifnrwINNASLQPTLSATSNLTVLVPSQQATEDMDQDEKSFWLSQSNIPAL---IKYHMLLGTYRVADLQT 1076
Cdd:COG2335    42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 767974771 1077 LSSsdmlATSLQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1129
Cdd:COG2335   116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
382-505 3.87e-20

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 88.08  E-value: 3.87e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   382 GRLTSFISLLDKA-YAWPL-SKLGPFTVLLPTDKG---LKGFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDMFYT 456
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLnGSQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 767974771   457 LTGKSGEIFNSDKDNQIKLklhggkkkVKIIQGDIIASNGLLHILDRAM 505
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNG--------ARVVQADIEATNGVIHVIDKVL 121
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1778-1884 3.55e-18

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 81.64  E-value: 3.55e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   1778 TLFWPTDQALHALPAEQQDFLfnqdnKDKLKEYLKFHVIRDaKVLAVDLPTSTAWKTLQGSELSVKCgaGRDIGDLFLNG 1857
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSKLRITR--SGGSGTVTVNG 72
                            90       100
                    ....*....|....*....|....*..
gi 767974771   1858 QtcRIVQRELLFDLGVAYGIDCLLIDP 1884
Cdd:smart00554   73 A--RIVEADIAATNGVVHVIDRVLLPP 97
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2198-2238 1.88e-16

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages, whereas stabilin-2 is not. In macrophages, stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high-affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 76.35  E-value: 1.88e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 767974771 2198 GVFHLRSPLGQYKLTFDKAREACANEAATMATYNQLSYAQK 2238
Cdd:cd03515     1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQ 41
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1742-1882 2.10e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 78.79  E-value: 2.10e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771 1742 QNLTTLATNNGyiKFSNL---IQDSGLLSVITDPihTPVTLFWPTDQALHALPAEQQDFLFNQDNKDKLKEYLKFHVIrD 1818
Cdd:COG2335    31 KNIVETAANNP--DFSTLvaaLKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGTLDALLKPENKATLTKILTYHVV-P 105
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 767974771 1819 AKVLAVDLPTSTAWKTLQGSELSVKcgagRDIGDLFLNGQTcrIVQRELLFDLGVAYGIDCLLI 1882
Cdd:COG2335   106 GKVTAADLKDGKTLTTLQGQTLTVT----VSGGGVTVNGAN--VITADIEASNGVIHVIDKVLL 163
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1030-1129 1.29e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 74.32  E-value: 1.29e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   1030 TVLVPSQQATEDMDQDEKSFWLSQsnIPALIKYHMLLGTYRVADLQtlssSDMLATSLQGN--FLHLAKVDGNITIEGAS 1107
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADK--LKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSklRITRSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 767974771   1108 IVDGDNAATNGVIHIINKVLVP 1129
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1138-1265 2.34e-15

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 75.71  E-value: 2.34e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771 1138 PNLLMRLEQMPDYSIFRGYIIQYNLANAIEAADAYTVFAPNNNAIENYIrEKKVLSLEED--------VLRYHVVlEEKL 1209
Cdd:COG2335    31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALP-AGTLDALLKPenkatltkILTYHVV-PGKV 108
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 767974771 1210 LKNDLHNGMHRETMLGFSyfLSFFLHNDQLYVNEAPINYTNVATDKGVIHGLGKVL 1265
Cdd:COG2335   109 TAADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
554-655 7.95e-12


Pssm-ID: 214719  Cd Length: 97  Bit Score: 63.54  E-value: 7.95e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771    554 TIFVPNNEALNNMKDGtLDYLLSPegsrKLLELVRYHIVPfTQLEVATLISTPHIRSMANQLIQFNTT-DNGQILANDVA 632
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSgGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 767974771    633 MEEIEITAKNGRIYTLTGVLIPP 655
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1173-1265 1.00e-11


Pssm-ID: 214719  Cd Length: 97  Bit Score: 63.15  E-value: 1.00e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771   1173 TVFAPNNNAIENYIREKKVLSLE--EDVLRYHVVlEEKLLKNDLHNGMHRETMLGFSYFLSFFLHNDQLYVNEAPINYTN 1250
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADklKNLLLYHVV-PGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTVNGARIVEAD 79
                            90
                    ....*....|....*
gi 767974771   1251 VATDKGVIHGLGKVL 1265
Cdd:smart00554   80 IAATNGVVHVIDRVL 94
Xlink pfam00193
Extracellular link domain;
2198-2238 1.79e-11


Pssm-ID: 459706  Cd Length: 92  Bit Score: 62.21  E-value: 1.79e-11
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 767974771  2198 GVFHLRSPlGQYKLTFDKAREACANEAATMATYNQLSYAQK 2238
Cdd:pfam00193    1 GVFHLESP-GRYKLTFQEAQAACAALGATLATPEQLYAAWK 40
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];
382-503 2.46e-10


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 61.08  E-value: 2.46e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771  382 GRLTSFISLLDKA-YAWPLSKLGPFTVLLPTD---KGLKGFNVNELLVD-NKAA-QYFVKLHIIAGQMNIEYMNNTDMFY 455
Cdd:COG2335    41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDaafAALPAGTLDALLKPeNKATlTKILTYHVVPGKVTAADLKDGKTLT 120
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 767974771  456 TLTGKSGEIFNSDKDNQIKLklhggkkkVKIIQGDIIASNGLLHILDR 503
Cdd:COG2335   121 TLQGQTLTVTVSGGGVTVNG--------ANVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
406-503 2.71e-08


Pssm-ID: 214719  Cd Length: 97  Bit Score: 53.52  E-value: 2.71e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767974771    406 TVLLPTDKGLKGFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDMFYTLTGKSGEIFNSDKDNQIklklhgGKKKVK 485
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTV------TVNGAR 74
                            90
                    ....*....|....*...
gi 767974771    486 IIQGDIIASNGLLHILDR 503
Cdd:smart00554   75 IVEADIAATNGVVHVIDR 92
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
1475-1511 3.67e-07


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.67e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1475 CEISNGGCSAKADCKRTtPGRRVCTCKAGYTGDGIVC 1511
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
2127-2164 1.41e-06


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.44  E-value: 1.41e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 767974771  2127 CADGlNGGCHEHATCKMTgPGKHKCECKSHYVGDGLNC 2164
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
LINK smart00445
Link (Hyaluronan-binding);
2198-2238 1.41e-06


Pssm-ID: 214667  Cd Length: 94  Bit Score: 48.49  E-value: 1.41e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 767974771   2198 GVFHLRsPLGQYKLTFDKAREACANEAATMATYNQLSYAQK 2238
Cdd:smart00445    3 GVFHVE-KNGRYKLTFAEAREACRAQGATLATVGQLYAAWQ 42
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
1559-1595 2.61e-06


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.67  E-value: 2.61e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1559 CLTKNGGCSEFAICNHTGQvERTCTCKPNYIGDGFTC 1595
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
837-865 1.55e-05


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.55e-05
                           10        20
                   ....*....|....*....|....*....
gi 767974771   837 CHIHATCEYSNGTASCICKAGYEGDGTLC 865
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
1517-1553 6.08e-05


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 6.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1517 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1553
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
2086-2121 1.32e-04


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.05  E-value: 1.32e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 767974771  2086 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGHSC 2121
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
326-361 4.38e-04


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   326 CKSDN-PCHRNANCTTVAPGRTeCICQKGYVGDGLTC 361
Cdd:pfam12947    1 CSDNNgGCHPNATCTNTGGSFT-CTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
246-275 6.00e-04


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 6.00e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767974771   246 CHPHAHCTYLgPNRHSCTCQEGYRGDGQVC 275
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
957-993 8.38e-04


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   957 CLEQTGKCHPLASCQSTSSGVwSCVCQEGYEGDGFLC 993
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
1437-1469 1.54e-03


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.96  E-value: 1.54e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771  1437 NGTCHTSANClTNSDGTASCKCAAGFQGNGTIC 1469
Cdd:pfam12947    5 NGGCHPNATC-TNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
876-908 2.18e-03


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.18e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771   876 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 908
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.
920-951 5.02e-03


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 5.02e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 767974771   920 GGCHDNASCLYVgPGQNECECKKGFRGNGIDC 951
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
 
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1559-1595 2.61e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 45.67  E-value: 2.61e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1559 CLTKNGGCSEFAICNHTGQvERTCTCKPNYIGDGFTC 1595
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
837-865 1.55e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.74  E-value: 1.55e-05
                           10        20
                   ....*....|....*....|....*....
gi 767974771   837 CHIHATCEYSNGTASCICKAGYEGDGTLC 865
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
Link_domain_CSPGs_modules_1_3 cd03517
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third ...
2199-2236 5.98e-05

Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.


Pssm-ID: 239594  Cd Length: 95  Bit Score: 43.94  E-value: 5.98e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 767974771 2199 VFHLRSPLGQYKLTFDKAREACANEAATMATYNQLSYA 2236
Cdd:cd03517     2 VFHYRDATARYALTFPRAQRACLDISAQIATPEQLLAA 39
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1517-1553 6.08e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 6.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771  1517 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1553
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2086-2121 1.32e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.05  E-value: 1.32e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 767974771  2086 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGHSC 2121
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
326-361 4.38e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   326 CKSDN-PCHRNANCTTVAPGRTeCICQKGYVGDGLTC 361
Cdd:pfam12947    1 CSDNNgGCHPNATCTNTGGSFT-CTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
246-275 6.00e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 6.00e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 767974771   246 CHPHAHCTYLgPNRHSCTCQEGYRGDGQVC 275
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
957-993 8.38e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 767974771   957 CLEQTGKCHPLASCQSTSSGVwSCVCQEGYEGDGFLC 993
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
Link_domain_HAPLN_module_1 cd03518
Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins ...
2199-2233 1.20e-03

Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239595  Cd Length: 95  Bit Score: 40.10  E-value: 1.20e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 767974771 2199 VFHLRSPLGQYKLTFDKAREACANEAATMATYNQL 2233
Cdd:cd03518     2 VFPYQPRLGRYNLNFHEAQQACEEQDATLASFEQL 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1437-1469 1.54e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.96  E-value: 1.54e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771  1437 NGTCHTSANClTNSDGTASCKCAAGFQGNGTIC 1469
Cdd:pfam12947    5 NGGCHPNATC-TNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
876-908 2.18e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 2.18e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 767974771   876 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 908
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
920-951 5.02e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 5.02e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 767974771   920 GGCHDNASCLYVgPGQNECECKKGFRGNGIDC 951
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
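The pairwise alignments in the hit list above can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical residues between a query row and its Cdd row. It assumes that dashes and lowercase letters mark positions that are not aligned to the domain model; the function name alignment_identity is hypothetical.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for two alignment rows.

    Assumption: '-' is a gap and lowercase letters are unaligned residues,
    so columns containing either are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")

    identities = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned residue: not a scored column
        aligned += 1
        if q == s:
            identities += 1
    return identities, aligned


if __name__ == "__main__":
    # Rows copied from the last EGF_3 hit (query interval 920-951).
    query_row = "GGCHDNASCLYVgPGQNECECKKGFRGNGIDC"
    cdd_row = "GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC"
    ident, cols = alignment_identity(query_row, cdd_row)
    print(f"{ident} identical residues over {cols} aligned columns")
```

Identity counted this way is only a rough descriptor; the bit score and E-value reported for each hit come from the position-specific scoring matrix, not from residue identity.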
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
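
The E-value threshold listed in the preset options can be applied to saved copies of this page. The sketch below, in Python, parses the per-hit summary lines (Pssm-ID, Cd Length, Bit Score, E-value) and keeps hits at or below a chosen cutoff. The regular expression is inferred from the line layout shown on this page, and the names Hit, parse_hits, and filter_hits are hypothetical; this is not an NCBI tool or API.

```python
import re
from dataclasses import dataclass
from typing import List

# Pattern inferred from summary lines such as:
# "Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 61.08  E-value: 2.46e-10"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.]+(?:[eE][+-]?\d+)?)"
)


@dataclass
class Hit:
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


def parse_hits(text: str) -> List[Hit]:
    """Extract every summary line found in the given page text."""
    return [
        Hit(
            pssm_id=int(m["pssm"]),
            multi_domain=m["multi"] is not None,
            cd_length=int(m["length"]),
            bit_score=float(m["bits"]),
            e_value=float(m["evalue"]),
        )
        for m in SUMMARY_RE.finditer(text)
    ]


def filter_hits(hits: List[Hit], threshold: float = 0.01) -> List[Hit]:
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [h for h in hits if h.e_value <= threshold]


# Three summary lines copied from the hit list on this page.
SAMPLE = """\
Pssm-ID: 459706  Cd Length: 92  Bit Score: 62.21  E-value: 1.79e-11
Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 61.08  E-value: 2.46e-10
Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 5.02e-03
"""

if __name__ == "__main__":
    for hit in filter_hits(parse_hits(SAMPLE), threshold=1e-5):
        print(hit)
```

Lowering the cutoff is a simple way to focus on the strongest hits, for example to separate the fasciclin and Link domain matches from the weaker EGF_3 repeats.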

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.