Conserved domains on  [gi|1677539436|ref|NP_001258875|]

slit homolog 3 protein isoform 1 precursor [Homo sapiens]

Protein Classification

Graphical summary


List of domain hits

Name Accession Description Interval E-value
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1195-1321 1.39e-31

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.



Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.22  E-value: 1.39e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436 1195 TDKDNGILLYKGD--NDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTPKSLGKL 1272
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1677539436 1273 QKQPAVGINSPLYLGGIPTStglsaLRQGTDRPLGGFHGCIHEVRINNE 1321
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPL-----LLLPALPVRAGFVGCIRDVRVNGE 126
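Each hit in this listing carries a summary line with a Pssm-ID, an optional `[Multi-domain]` flag, a Cd Length, a Bit Score and an E-value. A minimal Python sketch for extracting those fields follows. The regular expression and the function name `parse_summary` are illustrative assumptions derived only from the lines shown here, not an NCBI-defined format or API.

```python
import re

# Pattern inferred from the summary lines in this listing (an assumption,
# not an official format description).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


if __name__ == "__main__":
    example = ("Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  "
               "Bit Score: 120.22  E-value: 1.39e-31")
    print(parse_summary(example))
```

Returning `None` for non-matching lines lets the same function be applied to every line of the listing without pre-filtering.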
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
62-217 1.06e-27


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 117.73  E-value: 1.06e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   62 NAERLDLDRNNITRITKmDFAGLKNLRVLHLEDNQVSVIERgAFQDLKQLERLRLNKNKLQVLPELLFQSTpKLTRLDLS 141
Cdd:COG4886    114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEELGNLT-NLKELDLS 190
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  142 ENQIQGIPrKAFRGITDVKNLQLDNNHISCIEDgAFRALRDLEILTLNNNNISRIlvTSFNHMPKIRTLRLHSNHL 217
Cdd:COG4886    191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDL--PELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
588-900 1.35e-22


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.35e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  588 LTGNQLETVHGRVFRGLSGLKTLMLRSNligcvsnDTFAGLSSVRLLSLYDNRITTItPGAFTTLVSLSTINLlsnpfnc 667
Cdd:COG4886     79 LLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDL------- 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  668 nchlawlgkwlrkrrivSGNPrcqkpffLKEIPiqdVAIQDFTcdgneesscQLsprcpeqctcmeTVVRCSNKGLRALP 747
Cdd:COG4886    144 -----------------SNNQ-------LTDLP---EPLGNLT---------NL------------KSLDLSNNQLTDLP 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  748 R---GMPKdVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNyTFSNMSHLSTLILSYNRLRCIPvhAFNGLRS 824
Cdd:COG4886    176 EelgNLTN-LKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTN 251
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  825 LRVLTLHGNDISSVPEGSfnDLTSLSHLALGTNPLHcDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT 900
Cdd:COG4886    252 LEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLT-DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLL 324
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
326-666 1.88e-19


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 92.69  E-value: 1.88e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  326 AFTQYKKLKRIDISKNQISDIaPDAFQGLKSLTSLVLYGNKITEIVKGLFdglvslqllllnankinclrvntfqDLQNL 405
Cdd:COG4886    108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLG-------------------------NLTNL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  406 NLLSLYDNKLQTISKGLfAPLQsiqtlhlaqnpfvcdcHLKWLadYLQDNPIETSGARCSSPRRLankrisqikskkfrc 485
Cdd:COG4886    162 KSLDLSNNQLTDLPEEL-GNLT----------------NLKEL--DLSNNQITDLPEPLGNLTNL--------------- 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  486 sgsedyrsrfssecfmdlvcpekcrcegTIVDCSNQKLVRIPSHLPEY--VTDLRLNDNEVSVLEAtgiFKKLPNLRKIN 563
Cdd:COG4886    208 ----------------------------EELDLSGNQLTDLPEPLANLtnLETLDLSNNQLTDLPE---LGNLTNLEELD 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  564 LSNNKIKEVREGAfdGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITT 643
Cdd:COG4886    257 LSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLL 334
                          330       340
                   ....*....|....*....|...
gi 1677539436  644 ITPGAFTTLVSLSTINLLSNPFN 666
Cdd:COG4886    335 VTLTTLALSLSLLALLTLLLLLN 357
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1081-1117 1.81e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 1.81e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1081 DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE 1117
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
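The paired `gi` and `Cdd:` rows can be compared column by column. The sketch below counts identical residues over aligned columns. It assumes the usual convention for this kind of display, which the listing itself does not state: dashes mark gaps and lowercase letters mark residues that are not aligned to the domain model. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row, model_row):
    """Count (identical, aligned) columns between two alignment rows.

    A column counts as aligned only when both rows show an uppercase
    residue; gaps ('-') and lowercase (assumed unaligned) residues are
    skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the EGF_CA hit at residues 1081-1117.
    query = "DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE"
    model = "DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE"
    identical, aligned = aligned_identity(query, model)
    print(f"{identical}/{aligned} aligned columns identical")
```

For hits that span several display blocks, the row fragments would need to be concatenated in order before calling the function.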
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1038 5.99e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 5.99e-06
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1677539436 1005 DDCED-NDCENNATCVDGINNYVCICPPNYTGELC 1038
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1130-1160 3.93e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.



Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 3.93e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1130 CDQYECQNGAQCIVVQQEPTCRCPPGFAGPR 1160
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
963-1001 4.10e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.10e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1677539436  963 INTC-IQNPCQHGGTCHlsdSHKDGFSCSCPLGFEGQRCE 1001
Cdd:cd00054      2 IDECaSGNPCQNGGTCV---NTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1079 4.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.84e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1042 IDHCVpELNLCQHEAKCIPLDKGFSCECVPGYSGKLCE 1079
Cdd:cd00054      2 IDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRNT smart00013
Leucine rich repeat N-terminal domain;
33-64 8.72e-05


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 41.15  E-value: 8.72e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436    33 ACPTKCTCSAASVDCHGLGLRAVPRGIPRNAE 64
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-250 1.02e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 47.38  E-value: 1.02e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  188 LNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQR--RTVG-QFTLCMAPVHLRG 250
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvKVRQpEAALCAGPGALAG 67
GHB_like super family cl21545
Glycoprotein hormone beta chain homologues; This family of cystine-knot hormones includes the ...
1473-1528 3.56e-04

Glycoprotein hormone beta chain homologues; This family of cystine-knot hormones includes the beta chains of gonadotropins, thyrotropins, follitropins, choriogonadotropins and more. The members are reproductive hormones that consist of two glycosylated chains (alpha and beta), which form a tightly bound dimer.


The actual alignment was detected with superfamily member smart00041:

Pssm-ID: 473907  Cd Length: 82  Bit Score: 40.85  E-value: 3.56e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  1473 SCATASKVPIMECRGGCgpQCCQPTRSKRRKYVFQCTDGSSFVEEVERHLECGCLA 1528
Cdd:smart00041   26 KCGSASSYSIQDVQHSC--SCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCEP 79
LRRNT smart00013
Leucine rich repeat N-terminal domain;
280-311 1.54e-03


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 37.30  E-value: 1.54e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436   280 CPSPCTCSNNIVDCRGKGLMEIPANLPEGIVE 311
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1379-1407 3.33e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.



Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 3.33e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1677539436 1379 CLGHRCHH-GKCVATGTSYMCKCAEGYGGD 1407
Cdd:pfam00008    1 CAPNPCSNgGTCVDTPGGYTCICPEGYTGK 30
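The hits above are listed in order of increasing E-value, not by position. To read them in order along the protein, the intervals can be sorted by start residue. The sketch below hard-codes the names and intervals from the list above; the helper names `parse_interval` and `by_position` are assumptions made for illustration.

```python
# (name, interval) pairs copied from the hit list above, in listed order.
HITS = [
    ("Laminin_G_2", "1195-1321"),
    ("LRR", "62-217"),
    ("LRR", "588-900"),
    ("LRR", "326-666"),
    ("EGF_CA", "1081-1117"),
    ("EGF_CA", "1005-1038"),
    ("EGF", "1130-1160"),
    ("EGF_CA", "963-1001"),
    ("EGF_CA", "1042-1079"),
    ("LRRNT", "33-64"),
    ("PCC super family", "188-250"),
    ("GHB_like super family", "1473-1528"),
    ("LRRNT", "280-311"),
    ("EGF", "1379-1407"),
]


def parse_interval(text):
    """Split 'start-end' into a pair of integers."""
    start, end = text.split("-")
    return int(start), int(end)


def by_position(hits):
    """Return (name, start, end) tuples sorted by start residue."""
    rows = [(name, *parse_interval(interval)) for name, interval in hits]
    return sorted(rows, key=lambda row: row[1])


if __name__ == "__main__":
    for name, start, end in by_position(HITS):
        print(f"{start:>5}-{end:<5} {name} ({end - start + 1} residues)")
```

Sorting this way makes overlapping hits, such as the LRR hit at 62-217 and the PCC superfamily hit at 188-250, easier to spot.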
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1188-1321 3.63e-30



Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 116.67  E-value: 3.63e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  1188 NISLQVATDKDNGILLY---KGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKG 1264
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  1265 TPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTdrplgGFHGCIHEVRINNE 1321
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLPVTP-----GFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1166-1319 3.79e-30

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 117.13  E-value: 3.79e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436 1166 TVNFVGkDSYVELA-SAKVRPQANISLQVATDKDNGILLYKGD---NDPLALELYQGHVRLVYDsLSSPPTTVYSVETVN 1241
Cdd:cd00110      1 GVSFSG-SSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSqngGDFLALELEDGRLVLRYD-LGSGSLVLSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677539436 1242 DGQFHSVELVTLNQTLNLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTdrplgGFHGCIHEVRIN 1319
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSP-----GFVGCIRDLKVN 151
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
61-217 1.17e-15

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylating the centrosomal linker proteins and thus promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 77.52  E-value: 1.17e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   61 RNAERLDLDRNNITRITkmDFAGLKNLRVLHLEDNQVSVIErgAFQDLKQLERLRLNKNKLQVLPELLFQS------TPK 134
Cdd:cd21340     46 TNLTHLYLQNNQIEKIE--NLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFDPrslaalSNS 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  135 LTRLDLSenqiqgiprkafrgitdvknlqldNNHISCIEDgaFRALRDLEILTLNNNNISRI--LVTSFNHMPKIRTLRL 212
Cdd:cd21340    122 LRVLNIS------------------------GNNIDSLEP--LAPLRNLEQLDASNNQISDLeeLLDLLSSWPSLRELDL 175

                   ....*
gi 1677539436  213 HSNHL 217
Cdd:cd21340    176 TGNPV 180
LRR_8 pfam13855
Leucine rich repeat;
775-835 1.14e-14



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 69.86  E-value: 1.14e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  775 RHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDI 835
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
557-617 6.85e-13



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 64.85  E-value: 6.85e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  557 PNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLI 617
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
159-217 3.90e-12



Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.90e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  159 VKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHL 217
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
636-714 2.19e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.19e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  636 LYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPR---CQKPFFLKEIPIQDVAIQDFTCD 712
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81

                   ..
gi 1677539436  713 GN 714
Cdd:TIGR00864   82 EE 83
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
734-860 1.59e-09

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylating the centrosomal linker proteins and thus promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 59.80  E-value: 1.59e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  734 TVVRCSNKGLRALPR-GMPKDVTELYLEGNHLTAVPrELSALRHLTLIDLSNNSISMLTNytFSNMSHLSTLILSYNRLR 812
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKIE-NLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRIS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  813 CIpvhafNGL---------------------------------RSLRVLTLHGNDISSVpeGSFNDLTSLSHLALGTNPL 859
Cdd:cd21340     82 VV-----EGLenltnleelhienqrlppgekltfdprslaalsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQI 154

                   .
gi 1677539436  860 H 860
Cdd:cd21340    155 S 155
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
522-790 3.42e-09



Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 61.64  E-value: 3.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  522 KLVRIPSHLPEYVTDLRLNDNEVsvleatgifKKLP-----NLRKINLSNNKIKEVREGAFDgaaSVQELMLTGNQLETV 596
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNEL---------KSLPenlqgNIKTLYANSNQLTSIPATLPD---TIQEMELSINRITEL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  597 HGRVfrgLSGLKTLMLRSNLIGCVSNDTFAGLssvRLLSLYDNRITTItPGAFTTlvSLSTINLLSNPfncnchLAWLGK 676
Cdd:PRK15370   257 PERL---PSALQSLDLFHNKISCLPENLPEEL---RYLSVYDNSIRTL-PAHLPS--GITHLNVQSNS------LTALPE 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  677 WLrkrrivsgnprcqkPFFLKEIPIQDVAIqdfTCdgneesscqlsprCPEQCTCMETVVRCSNKGLRALPRGMPKDVTE 756
Cdd:PRK15370   322 TL--------------PPGLKTLEAGENAL---TS-------------LPASLPPELQVLDVSKNQITVLPETLPPTITT 371
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1677539436  757 LYLEGNHLTAVPRELSALrhLTLIDLSNNSISML 790
Cdd:PRK15370   372 LDVSRNALTNLPENLPAA--LQIMQASRNNLVRL 403
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1081-1117 1.81e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 1.81e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1081 DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE 1117
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
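
The cd00054 description above mentions six conserved core cysteines. As a rough check, the sketch below counts cysteines in the query row of this alignment and maps them to protein coordinates. The row string and interval are copied from the hit above; the helper names are hypothetical and not part of any NCBI tool.

```python
# Query row copied from the cd00054 alignment for residues 1081-1117.
QUERY_ROW = "DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE"
QUERY_START, QUERY_END = 1081, 1117


def ungapped(row: str) -> str:
    """Drop alignment gap characters, keeping every residue letter."""
    return row.replace("-", "")


def cysteine_positions(row: str, start: int) -> list[int]:
    """Return 1-based protein coordinates of each cysteine in an alignment row."""
    return [start + i for i, aa in enumerate(ungapped(row)) if aa.upper() == "C"]


positions = cysteine_positions(QUERY_ROW, QUERY_START)

# The ungapped row should span exactly the reported interval.
assert len(ungapped(QUERY_ROW)) == QUERY_END - QUERY_START + 1

print(len(positions), positions)
```

For this row the count is six, at residues 1085, 1090, 1096, 1105, 1107 and 1116, which matches the six-cysteine pattern named in the description. It does not show which cysteines pair into disulfide bridges.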
LRRCT smart00082
Leucine rich repeat C-terminal domain;
857-906 4.57e-07

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 47.81  E-value: 4.57e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1677539436   857 NPLHCDCSLRWLSEWVKAG--YKEPGIARCSSPEPMADRlLLTTPTHRFQCK 906
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGP-LLELLHSEFKCP 51
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
255-536 1.83e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.78  E-value: 1.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  255 DVQKKEYVCPAPHSEPPScNANSISCPSPCTCSNNIVDCRGK-GLMEIPANLPEGIVEIRLEQNSIKAIPAGAftqYKKL 333
Cdd:PRK15370   147 ELIWSEWVKEAPAKEAAN-REEAVQRMRDCLKNNKTELRLKIlGLTTIPACIPEQITTLILDNNELKSLPENL---QGNI 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  334 KRIDISKNQISDIA---PDAFQGLKsltslvLYGNKITEIVKGLfdgLVSLQLLLLNANKINCLRVNTFQDLQNlnlLSL 410
Cdd:PRK15370   223 KTLYANSNQLTSIPatlPDTIQEME------LSINRITELPERL---PSALQSLDLFHNKISCLPENLPEELRY---LSV 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  411 YDNKLQTISKGLfaPlQSIQTLHLAQN-----PFVCDCHLKWL-ADylqDNPIETSGArcSSPRRLANKRISQ----IKS 480
Cdd:PRK15370   291 YDNSIRTLPAHL--P-SGITHLNVQSNsltalPETLPPGLKTLeAG---ENALTSLPA--SLPPELQVLDVSKnqitVLP 362
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  481 KKFRCSGSEDYRSRfssECFMDLvcPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTD 536
Cdd:PRK15370   363 ETLPPTITTLDVSR---NALTNL--PENLPAALQIMQASRNNLVRLPESLPHFRGE 413
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
52-217 2.77e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.77e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   52 LRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHledNQVSVIERGAFQDLKQLERLRlnkNKLQVLPELLFQS 131
Cdd:PRK15370   232 LTSIPATLPDTIQEMELSINRITELPERLPSALQSLDLFH---NKISCLPENLPEELRYLSVYD---NSIRTLPAHLPSG 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  132 tpkLTRLDLSENQIQGIPRKAFRGItdvKNLQLDNNHISCIEDGAFRALRDLEIltlNNNnisRILVTSFNHMPKIRTLR 211
Cdd:PRK15370   306 ---ITHLNVQSNSLTALPETLPPGL---KTLEAGENALTSLPASLPPELQVLDV---SKN---QITVLPETLPPTITTLD 373

                   ....*.
gi 1677539436  212 LHSNHL 217
Cdd:PRK15370   374 VSRNAL 379
EGF_CA smart00179
Calcium-binding EGF-like domain;
1081-1117 2.97e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 2.97e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1677539436  1081 DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFS-GPFCE 1117
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1038 5.99e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 5.99e-06
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1677539436 1005 DDCED-NDCENNATCVDGINNYVCICPPNYTGELC 1038
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
410-473 2.19e-05

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 49.70  E-value: 2.19e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  410 LYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIET---SGARCSSPRRLANK 473
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1007-1036 3.49e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model omits the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for discrepancies between Swiss-Prot and Pfam domain start/end positions). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 3.49e-05
                           10        20        30
                   ....*....|....*....|....*....|
gi 1677539436 1007 CEDNDCENNATCVDGINNYVCICPPNYTGE 1036
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1130-1160 3.93e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model omits the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for discrepancies between Swiss-Prot and Pfam domain start/end positions). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 3.93e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1130 CDQYECQNGAQCIVVQQEPTCRCPPGFAGPR 1160
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
963-1001 4.10e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.10e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1677539436  963 INTC-IQNPCQHGGTCHlsdSHKDGFSCSCPLGFEGQRCE 1001
Cdd:cd00054      2 IDECaSGNPCQNGGTCV---NTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1085-1115 4.29e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model omits the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for discrepancies between Swiss-Prot and Pfam domain start/end positions). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 4.29e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1085 CVAHKCRHGAQCVDTINGYTCTCPQGFSGPF 1115
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1079 4.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.84e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1042 IDHCVpELNLCQHEAKCIPLDKGFSCECVPGYSGKLCE 1079
Cdd:cd00054      2 IDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRNT smart00013
Leucine rich repeat N-terminal domain;
33-64 8.72e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 41.15  E-value: 8.72e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436    33 ACPTKCTCSAASVDCHGLGLRAVPRGIPRNAE 64
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
33-60 8.94e-05

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 40.69  E-value: 8.94e-05
                           10        20
                   ....*....|....*....|....*...
gi 1677539436   33 ACPTKCTCSAASVDCHGLGLRAVPRGIP 60
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-250 1.02e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 47.38  E-value: 1.02e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  188 LNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQR--RTVG-QFTLCMAPVHLRG 250
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvKVRQpEAALCAGPGALAG 67
EGF_CA smart00179
Calcium-binding EGF-like domain;
1005-1034 1.97e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 40.31  E-value: 1.97e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1677539436  1005 DDCE-DNDCENNATCVDGINNYVCICPPNYT 1034
Cdd:smart00179    3 DECAsGNPCQNGGTCVNTVGSYRCECPPGYT 33
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-415 2.93e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 45.61  E-value: 2.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   55 VPR--GIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQ-VLPELLFqS 131
Cdd:PLN00113   204 IPRelGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-S 282
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  132 TPKLTRLDLSENQIQG-IPRKafrgITDVKNLQ---LDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKI 207
Cdd:PLN00113   283 LQKLISLDLSDNSLSGeIPEL----VIQLQNLEilhLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNL 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  208 RTLRLHSNHL-------YCDC-HLawlsdwlrqrrtvgqFTLCMAPVHLRGfnvaDVQKKEYVCPAPHSEPPSCNANSIS 279
Cdd:PLN00113   359 TVLDLSTNNLtgeipegLCSSgNL---------------FKLILFSNSLEG----EIPKSLGACRSLRRVRLQDNSFSGE 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  280 CPSPCT----------CSNNIVDCRGKGLMEIPAnlpegIVEIRLEQNSIKA-IPAgaFTQYKKLKRIDISKNQISDIAP 348
Cdd:PLN00113   420 LPSEFTklplvyfldiSNNNLQGRINSRKWDMPS-----LQMLSLARNKFFGgLPD--SFGSKRLENLDLSRNQFSGAVP 492
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  349 DAFQGLKSLTSLVLYGNKITEIVKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKL 415
Cdd:PLN00113   493 RKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
1473-1528 3.56e-04

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 40.85  E-value: 3.56e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  1473 SCATASKVPIMECRGGCgpQCCQPTRSKRRKYVFQCTDGSSFVEEVERHLECGCLA 1528
Cdd:smart00041   26 KCGSASSYSIQDVQHSC--SCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCEP 79
LRRCT smart00082
Leucine rich repeat C-terminal domain;
437-467 7.17e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.95  E-value: 7.17e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1677539436   437 NPFVCDCHLKWLADYLQDNPI--ETSGARCSSP 467
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
1042-1079 7.87e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 38.38  E-value: 7.87e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1677539436  1042 IDHCVpELNLCQHEAKCIPLDKGFSCECVPGYS-GKLCE 1079
Cdd:smart00179    2 IDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRNT smart00013
Leucine rich repeat N-terminal domain;
280-311 1.54e-03

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 37.30  E-value: 1.54e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436   280 CPSPCTCSNNIVDCRGKGLMEIPANLPEGIVE 311
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
280-306 1.88e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 37.22  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*..
gi 1677539436  280 CPSPCTCSNNIVDCRGKGLMEIPANLP 306
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1379-1407 3.33e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model omits the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for discrepancies between Swiss-Prot and Pfam domain start/end positions). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 3.33e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1677539436 1379 CLGHRCHH-GKCVATGTSYMCKCAEGYGGD 1407
Cdd:pfam00008    1 CAPNPCSNgGTCVDTPGGYTCICPEGYTGK 30
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
966-999 3.93e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model omits the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for discrepancies between Swiss-Prot and Pfam domain start/end positions). The family is hard to model because of the many similar but distinct sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.21  E-value: 3.93e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1677539436  966 CIQNPCQHGGTCHlsdSHKDGFSCSCPLGFEGQR 999
Cdd:pfam00008    1 CAPNPCSNGGTCV---DTPGGYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
969-1001 5.37e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.07  E-value: 5.37e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1677539436   969 NPCQHGGTCHlsdSHKDGFSCSCPLGFE-GQRCE 1001
Cdd:smart00179    9 NPCQNGGTCV---NTVGSYRCECPPGYTdGRNCE 39
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
133-155 6.15e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 6.15e-03
                            10        20
                    ....*....|....*....|...
gi 1677539436   133 PKLTRLDLSENQIQGIPRKAFRG 155
Cdd:smart00369    2 PNLRELDLSNNQLSSLPPGAFQG 24
 
LamG smart00282
Laminin G domain;
1188-1321 3.63e-30

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 116.67  E-value: 3.63e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  1188 NISLQVATDKDNGILLY---KGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKG 1264
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  1265 TPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTdrplgGFHGCIHEVRINNE 1321
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLPEDLKLPPLPVTP-----GFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1166-1319 3.79e-30

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 117.13  E-value: 3.79e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436 1166 TVNFVGkDSYVELA-SAKVRPQANISLQVATDKDNGILLYKGD---NDPLALELYQGHVRLVYDsLSSPPTTVYSVETVN 1241
Cdd:cd00110      1 GVSFSG-SSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSqngGDFLALELEDGRLVLRYD-LGSGSLVLSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677539436 1242 DGQFHSVELVTLNQTLNLVVDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTdrplgGFHGCIHEVRIN 1319
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLPEDLKSPGLPVSP-----GFVGCIRDLKVN 151
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
61-455 2.17e-24

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 107.71  E-value: 2.17e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQvsviergAFQDLKQLERLRLNKNKLQVLPELLFQSTpKLTRLDL 140
Cdd:COG4886     72 LLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELANLT-NLKELDL 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  141 SENQIQGIPrKAFRGITDVKNLQLDNNHISCIeDGAFRALRDLEILTLNNNNISRILvTSFNHMPKIRTLRLHSNHLYcd 220
Cdd:COG4886    144 SNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQLT-- 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  221 chlawlsdwlrqrrtvgqftlcmapvhlrgfnvadvqkkeyvcpaphseppscnansiscpspctcsnnivdcrgkglmE 300
Cdd:COG4886    219 -------------------------------------------------------------------------------D 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  301 IPANLPE--GIVEIRLEQNSIKAIPagAFTQYKKLKRIDISKNQISDIAPDAfqGLKSLTSLVLYGNKITEI-VKGLFDG 377
Cdd:COG4886    220 LPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLkLKELELL 295
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677539436  378 LVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDN 455
Cdd:COG4886    296 LGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLG 373
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
62-450 1.02e-23

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 105.79  E-value: 1.02e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   62 NAERLDLDRNNitritkmDFAGLKNLRVLHLEDNQVSVIERgAFQDLKQLERLRLNKNKLQVLPELLFQSTpKLTRLDLS 141
Cdd:COG4886     97 NLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPEPLGNLT-NLKSLDLS 167
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  142 ENQIQGIPrKAFRGITDVKNLQLDNNHISCIEDgAFRALRDLEILTLNNNNISRiLVTSFNHMPKIRTLRLHSNHLYcdc 221
Cdd:COG4886    168 NNQLTDLP-EELGNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQLTD-LPEPLANLTNLETLDLSNNQLT--- 241
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  222 HLAWLsdwlrqrrtvgqftlcmapvhlrgfnvadvqkkeyvcpaphseppscnansiscpspctcsnnivdcrgkglmei 301
Cdd:COG4886    242 DLPEL--------------------------------------------------------------------------- 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  302 pANLPEgIVEIRLEQNSIKAIPAGAftQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIvkGLFDGLVSL 381
Cdd:COG4886    247 -GNLTN-LEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLL--ELLILLLLL 320
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  382 QLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLAD 450
Cdd:COG4886    321 TTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTL 389
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
588-900 1.35e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.35e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  588 LTGNQLETVHGRVFRGLSGLKTLMLRSNligcvsnDTFAGLSSVRLLSLYDNRITTItPGAFTTLVSLSTINLlsnpfnc 667
Cdd:COG4886     79 LLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDL------- 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  668 nchlawlgkwlrkrrivSGNPrcqkpffLKEIPiqdVAIQDFTcdgneesscQLsprcpeqctcmeTVVRCSNKGLRALP 747
Cdd:COG4886    144 -----------------SNNQ-------LTDLP---EPLGNLT---------NL------------KSLDLSNNQLTDLP 175
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  748 R---GMPKdVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNyTFSNMSHLSTLILSYNRLRCIPvhAFNGLRS 824
Cdd:COG4886    176 EelgNLTN-LKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTN 251
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  825 LRVLTLHGNDISSVPEGSfnDLTSLSHLALGTNPLHcDCSLRWLSEWVKAGYKEPGIARCSSPEPMADRLLLTTPT 900
Cdd:COG4886    252 LEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLT-DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLL 324
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
535-837 1.50e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.50e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  535 TDLRLNDNEVSVLEATGIFKKLPNLRKINLSNNKikevregAFDGAASVQELMLTGNQLETVhGRVFRGLSGLKTLMLRS 614
Cdd:COG4886     74 LLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSN 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  615 NLIGCVSnDTFAGLSSVRLLSLYDNRITTItPGAFTTLVSLSTINLLSNPfncnchlawlgkwlrkrrivsgnprcqkpf 694
Cdd:COG4886    146 NQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQ------------------------------ 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  695 fLKEIPiqdvaiqdftcdgneessCQLSprcpeqctcmetvvrcsnkGLRALprgmpkdvTELYLEGNHLTAVPRELSAL 774
Cdd:COG4886    194 -ITDLP------------------EPLG-------------------NLTNL--------EELDLSGNQLTDLPEPLANL 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1677539436  775 RHLTLIDLSNNSISMLTNytFSNMSHLSTLILSYNRLRCIPVHAfnGLRSLRVLTLHGNDISS 837
Cdd:COG4886    228 TNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTD 286
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
633-859 1.28e-20

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 96.54  E-value: 1.28e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  633 LLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPRCQkpfFLKEIPIQDVAIQDFTCD 712
Cdd:COG4886      2 LLLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDL---LLSSLLLLLSLLLLLLLS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  713 GNEESSCQLSPRCPEQCTCMETVVRCSNKGLRALprgmpKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTN 792
Cdd:COG4886     79 LLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNL-----TNLESLDLSGNQLTDLPEELANLTNLKELDLSNNQLTDLPE 153
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  793 yTFSNMSHLSTLILSYNRLRCIPvHAFNGLRSLRVLTLHGNDISSVPEgSFNDLTSLSHLALGTNPL 859
Cdd:COG4886    154 -PLGNLTNLKSLDLSNNQLTDLP-EELGNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL 217
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
326-666 1.88e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 92.69  E-value: 1.88e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  326 AFTQYKKLKRIDISKNQISDIaPDAFQGLKSLTSLVLYGNKITEIVKGLFdglvslqllllnankinclrvntfqDLQNL 405
Cdd:COG4886    108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLG-------------------------NLTNL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  406 NLLSLYDNKLQTISKGLfAPLQsiqtlhlaqnpfvcdcHLKWLadYLQDNPIETSGARCSSPRRLankrisqikskkfrc 485
Cdd:COG4886    162 KSLDLSNNQLTDLPEEL-GNLT----------------NLKEL--DLSNNQITDLPEPLGNLTNL--------------- 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  486 sgsedyrsrfssecfmdlvcpekcrcegTIVDCSNQKLVRIPSHLPEY--VTDLRLNDNEVSVLEAtgiFKKLPNLRKIN 563
Cdd:COG4886    208 ----------------------------EELDLSGNQLTDLPEPLANLtnLETLDLSNNQLTDLPE---LGNLTNLEELD 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  564 LSNNKIKEVREGAfdGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITT 643
Cdd:COG4886    257 LSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLL 334
                          330       340
                   ....*....|....*....|...
gi 1677539436  644 ITPGAFTTLVSLSTINLLSNPFN 666
Cdd:COG4886    335 VTLTTLALSLSLLALLTLLLLLN 357
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
107-482 6.49e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 88.07  E-value: 6.49e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  107 DLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENqiqgiprKAFRGITDVKNLQLDNNHISCIEDgAFRALRDLEIL 186
Cdd:COG4886     70 SLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDLPE-ELANLTNLKEL 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  187 TLNNNNISRIlVTSFNHMPKIRTLRLHSNhlycdchlawlsdwlrqrrtvgqftlcmapvhlrgfnvadvqkkeyvcpap 266
Cdd:COG4886    142 DLSNNQLTDL-PEPLGNLTNLKSLDLSNN--------------------------------------------------- 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  267 hseppscnansiscpspctcsnnivdcrgkGLMEIP---ANLPEgIVEIRLEQNSIKAIPAgAFTQYKKLKRIDISKNQI 343
Cdd:COG4886    170 ------------------------------QLTDLPeelGNLTN-LKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  344 SDIaPDAFQGLKSLTSLVLYGNKITEIVKglfdglvslqllllnankinclrvntFQDLQNLNLLSLYDNKLQTISKglF 423
Cdd:COG4886    218 TDL-PEPLANLTNLETLDLSNNQLTDLPE--------------------------LGNLTNLEELDLSNNQLTDLPP--L 268
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  424 APLQSIQTLHLAQNPFVcDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKK 482
Cdd:COG4886    269 ANLTNLKTLDLSNNQLT-DLKLKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLL 326
Laminin_G_1 pfam00054
Laminin G domain;
1193-1324 8.01e-18

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 81.21  E-value: 8.01e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436 1193 VATDKDNGILLYKGDNDP---LALELYQGHVRLVYDsLSSPPTTVYSVETVNDGQFHSVELVTLNQTLNLVVDKGTP--- 1266
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYD-LGSGAAVVRSGDKLNDGKWHSVELERNGRSGTLSVDGEARptg 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436 1267 -KSLGKLQKQPAVGinsPLYLGGIPTSTglSALRQGTDRPlgGFHGCIHEVRINNELQD 1324
Cdd:pfam00054   80 eSPLGATTDLDVDG---PLYVGGLPSLG--VKKRRLAISP--SFDGCIRDVIVNGKPLD 131
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
61-217 1.17e-15

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 77.52  E-value: 1.17e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   61 RNAERLDLDRNNITRITkmDFAGLKNLRVLHLEDNQVSVIErgAFQDLKQLERLRLNKNKLQVLPELLFQS------TPK 134
Cdd:cd21340     46 TNLTHLYLQNNQIEKIE--NLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFDPrslaalSNS 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  135 LTRLDLSenqiqgiprkafrgitdvknlqldNNHISCIEDgaFRALRDLEILTLNNNNISRI--LVTSFNHMPKIRTLRL 212
Cdd:cd21340    122 LRVLNIS------------------------GNNIDSLEP--LAPLRNLEQLDASNNQISDLeeLLDLLSSWPSLRELDL 175

                   ....*
gi 1677539436  213 HSNHL 217
Cdd:cd21340    176 TGNPV 180
LRR_8 pfam13855
Leucine rich repeat;
775-835 1.14e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 69.86  E-value: 1.14e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  775 RHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDI 835
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
557-617 6.85e-13

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 64.85  E-value: 6.85e-13
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  557 PNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLI 617
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
582-641 1.18e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 64.08  E-value: 1.18e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  582 SVQELMLTGNQLETVHGRVFRGLSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRI 641
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
159-217 3.90e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.90e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  159 VKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHL 217
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
61-121 4.25e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 4.25e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436   61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKL 121
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
133-193 4.51e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 4.51e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  133 PKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNI 193
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
311-367 6.60e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.77  E-value: 6.60e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  311 EIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKI 367
Cdd:pfam13855    5 SLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
799-859 1.70e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.70e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  799 SHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPL 859
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
86-145 1.74e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.74e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   86 NLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQI 145
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
605-665 1.79e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.79e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677539436  605 SGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPF 665
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
110-169 1.86e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 60.62  E-value: 1.86e-11
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  110 QLERLRLNKNKLQVLPELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHI 169
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
636-714 2.19e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.19e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  636 LYDNRITTITPGAFTTLVSLSTINLLSNPFNCNCHLAWLGKWLRKRRIVSGNPR---CQKPFFLKEIPIQDVAIQDFTCD 712
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81

                   ..
gi 1677539436  713 GN 714
Cdd:TIGR00864   82 EE 83
LRR_8 pfam13855
Leucine rich repeat;
534-593 2.34e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 57.53  E-value: 2.34e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  534 VTDLRLNDNEVSVLEAtGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQL 593
Cdd:pfam13855    3 LRSLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
754-811 5.53e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 56.38  E-value: 5.53e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  754 VTELYLEGNHLTAVPRE-LSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNRL 811
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGaFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
734-860 1.59e-09

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 59.80  E-value: 1.59e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  734 TVVRCSNKGLRALPR-GMPKDVTELYLEGNHLTAVPrELSALRHLTLIDLSNNSISMLTNytFSNMSHLSTLILSYNRLR 812
Cdd:cd21340      5 THLYLNDKNITKIDNlSLCKNLKVLYLYDNKITKIE-NLEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLGGNRIS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  813 CIpvhafNGL---------------------------------RSLRVLTLHGNDISSVpeGSFNDLTSLSHLALGTNPL 859
Cdd:cd21340     82 VV-----EGLenltnleelhienqrlppgekltfdprslaalsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQI 154

                   .
gi 1677539436  860 H 860
Cdd:cd21340    155 S 155
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
830-912 3.39e-09

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 62.02  E-value: 3.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  830 LHGNDISSVPEGSFNDLTSLSHLALGTNPLHCDCSLRWLSEWVK---AGYKEPGIARCSSPEPMADRLLLTTPTHRFQCK 906
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEekgVKVRQPEAALCAGPGALAGQPLLGIPLLDSGCD 81

                   ....*.
gi 1677539436  907 VLWFCC 912
Cdd:TIGR00864   82 EEYVAC 87
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
522-790 3.42e-09

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 61.64  E-value: 3.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  522 KLVRIPSHLPEYVTDLRLNDNEVsvleatgifKKLP-----NLRKINLSNNKIKEVREGAFDgaaSVQELMLTGNQLETV 596
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNEL---------KSLPenlqgNIKTLYANSNQLTSIPATLPD---TIQEMELSINRITEL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  597 HGRVfrgLSGLKTLMLRSNLIGCVSNDTFAGLssvRLLSLYDNRITTItPGAFTTlvSLSTINLLSNPfncnchLAWLGK 676
Cdd:PRK15370   257 PERL---PSALQSLDLFHNKISCLPENLPEEL---RYLSVYDNSIRTL-PAHLPS--GITHLNVQSNS------LTALPE 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  677 WLrkrrivsgnprcqkPFFLKEIPIQDVAIqdfTCdgneesscqlsprCPEQCTCMETVVRCSNKGLRALPRGMPKDVTE 756
Cdd:PRK15370   322 TL--------------PPGLKTLEAGENAL---TS-------------LPASLPPELQVLDVSKNQITVLPETLPPTITT 371
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1677539436  757 LYLEGNHLTAVPRELSALrhLTLIDLSNNSISML 790
Cdd:PRK15370   372 LDVSRNALTNLPENLPAA--LQIMQASRNNLVRL 403
PLN03150 PLN03150
hypothetical protein; Provisional
767-860 8.80e-09

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 60.21  E-value: 8.80e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  767 VPRELSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLILSYNRLR-CIPvHAFNGLRSLRVLTLHGNDISS-VPEgsfn 844
Cdd:PLN03150   434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNgSIP-ESLGQLTSLRILNLNGNSLSGrVPA---- 508
                           90
                   ....*....|....*.
gi 1677539436  845 dltslshlALGTNPLH 860
Cdd:PLN03150   509 --------ALGGRLLH 516
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
697-859 1.13e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 60.10  E-value: 1.13e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  697 KEIPIQDVAIQDF-TCDGNEESSCQLS--------PRCPEQCTcmeTVVrCSNKGLRALPRGMPKDVTELYLEGNHLTAV 767
Cdd:PRK15370   160 KEAANREEAVQRMrDCLKNNKTELRLKilglttipACIPEQIT---TLI-LDNNELKSLPENLQGNIKTLYANSNQLTSI 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  768 PRELSALrhLTLIDLSNNSISMLTNYTfsnMSHLSTLILSYNRLRCIPVHAFNGLRSLRVltlHGNDISSVPEgsfNDLT 847
Cdd:PRK15370   236 PATLPDT--IQEMELSINRITELPERL---PSALQSLDLFHNKISCLPENLPEELRYLSV---YDNSIRTLPA---HLPS 304
                          170
                   ....*....|..
gi 1677539436  848 SLSHLALGTNPL 859
Cdd:PRK15370   305 GITHLNVQSNSL 316
LRR_8 pfam13855
Leucine rich repeat;
332-377 2.37e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 51.76  E-value: 2.37e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1677539436  332 KLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKITEIVKGLFDG 377
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSG 47
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
52-203 4.39e-08

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 57.25  E-value: 4.39e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   52 LRAVPRGIPR--NAERLDLDRNNITRITkmDFAGLKNLRVLHLEDNQVSVIerGAFQDLKQLERLRLNKNK-----LQVL 124
Cdd:COG4886    217 LTDLPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDL--PPLANLTNLKTLDLSNNQltdlkLKEL 292
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  125 PELLFQSTPKLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNH 203
Cdd:COG4886    293 ELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGL 371
LRR_8 pfam13855
Leucine rich repeat;
387-439 6.31e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.60  E-value: 6.31e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1677539436  387 NANKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPF 439
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
540-859 6.33e-08

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 57.55  E-value: 6.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  540 NDNEVSVLEATG---------IFKKLPNLRKINLSNNKIK-EVREGAFDGAASVQELMLTGNQLEtvhGRVFRG-LSGLK 608
Cdd:PLN00113    67 NSSRVVSIDLSGknisgkissAIFRLPYIQTINLSNNQLSgPIPDDIFTTSSSLRYLNLSNNNFT---GSIPRGsIPNLE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  609 TLMLRSNLI-GCVSNDtFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFNC----------NCHLAWLG-- 675
Cdd:PLN00113   144 TLDLSNNMLsGEIPND-IGSFSSLKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLVGqiprelgqmkSLKWIYLGyn 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  676 -------------KWLRKRRIVSGNPRCQKP-----------FFLKE------IPIQDVAIQDF-TCDGNEESscqLSPR 724
Cdd:PLN00113   223 nlsgeipyeigglTSLNHLDLVYNNLTGPIPsslgnlknlqyLFLYQnklsgpIPPSIFSLQKLiSLDLSDNS---LSGE 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  725 CPE---QCTCMETVVRCSNK-------GLRALPRgmpkdVTELYLEGNHLTA-VPRELSALRHLTLIDLSNNSISMLTNY 793
Cdd:PLN00113   300 IPElviQLQNLEILHLFSNNftgkipvALTSLPR-----LQVLQLWSNKFSGeIPKNLGKHNNLTVLDLSTNNLTGEIPE 374
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  794 TFSNMSHLSTLILSYNRLRCIPVHAFNGLRSLRVLTLHGNDISSVPEGSFNDLTSLSHLALGTNPL 859
Cdd:PLN00113   375 GLCSSGNLFKLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNL 440
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1081-1117 1.81e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.79  E-value: 1.81e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1081 DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE 1117
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
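
The EGF_CA description above mentions six conserved core cysteines forming three disulfide bridges. As a rough check, the sketch below counts the cysteines in the query row of this alignment (residues 1081-1117) and maps them to protein coordinates. The function name and the bridge pairing are illustrative assumptions: the pairing C1-C3, C2-C4, C5-C6 is the usual EGF-like arrangement and is not stated on this page.

```python
# Query segment 1081-1117 copied from the EGF_CA alignment row above
# ('-' marks an alignment gap and does not consume a residue number).
ALIGNED_QUERY = "DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFSGPFCE"
QUERY_START = 1081  # first residue number shown on the alignment row


def cysteine_positions(aligned, start):
    """Return the protein coordinates of every cysteine in an aligned row."""
    positions = []
    residue_number = start
    for symbol in aligned:
        if symbol == "-":
            continue  # skip gap columns without advancing the coordinate
        if symbol.upper() == "C":
            positions.append(residue_number)
        residue_number += 1
    return positions


cys = cysteine_positions(ALIGNED_QUERY, QUERY_START)

# Assumption (not from this page): usual EGF pairing C1-C3, C2-C4, C5-C6.
assumed_bridges = [(cys[0], cys[2]), (cys[1], cys[3]), (cys[4], cys[5])]

print("Cysteine positions:", cys)
print("Assumed disulfide pairs:", assumed_bridges)
```

Counting in the query row rather than the Cdd row keeps the coordinates in the numbering of the slit homolog 3 sequence itself.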
LRRCT smart00082
Leucine rich repeat C-terminal domain;
857-906 4.57e-07

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 47.81  E-value: 4.57e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1677539436   857 NPLHCDCSLRWLSEWVKAG--YKEPGIARCSSPEPMADRlLLTTPTHRFQCK 906
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGP-LLELLHSEFKCP 51
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
49-220 1.41e-06

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 51.97  E-value: 1.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   49 GLGLRAVPRGIPRNA--ERLDLDRNNITRITKMDFAGL---KNLRVLHLEDNQVSV----IERGAFQDLK-QLERLRLNK 118
Cdd:cd00116     67 PRGLQSLLQGLTKGCglQELDLSDNALGPDGCGVLESLlrsSSLQELKLNNNGLGDrglrLLAKGLKDLPpALEKLVLGR 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  119 NKLQVLP----ELLFQSTPKLTRLDLSENQI--QGIPR--KAFRGITDVKNLQLDNNHISCIED----GAFRALRDLEIL 186
Cdd:cd00116    147 NRLEGAScealAKALRANRDLKELNLANNGIgdAGIRAlaEGLKANCNLEVLDLNNNGLTDEGAsalaETLASLKSLEVL 226
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1677539436  187 TLNNNNIS----RILVTSFNHM-PKIRTLRLHSNHLYCD 220
Cdd:cd00116    227 NLGDNNLTdagaAALASALLSPnISLLTLSLSCNDITDD 265
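The LRR_RI description above gives the beta-strand consensus as LxxLxLxxN/CxL. As a rough illustration only, the sketch below scans query residues 62-141 (which lie inside this hit's 49-220 interval and are copied from an ungapped query row elsewhere on this page) for literal matches to that pattern. The regular expression is a strict reading of the consensus and the function name `find_lrr_motifs` is hypothetical. CDD itself scores hits with position-specific scoring matrices, not with a regular expression.

```python
import re

# Query residues 62-141 of gi 1677539436, taken from an ungapped
# alignment row on this page (80 residues).
QUERY_62_141 = (
    "NAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIE"
    "RGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLS"
)

# Strict reading of the LxxLxLxxN/CxL consensus. The lookahead lets
# overlapping occurrences be reported as well.
LRR_STRICT = re.compile(r"(?=(L..L.L..[NC].L))")


def find_lrr_motifs(segment, offset):
    """Return (first_residue, last_residue, text) for each strict match.

    `offset` is the residue number of segment[0].
    """
    hits = []
    for m in LRR_STRICT.finditer(segment):
        text = m.group(1)
        first = offset + m.start()
        hits.append((first, first + len(text) - 1, text))
    return hits


if __name__ == "__main__":
    for first, last, text in find_lrr_motifs(QUERY_62_141, offset=62):
        print(first, last, text)
```

Only one stretch of this segment satisfies the strict pattern, although the alignments on this page place several repeats there. Positions written as L in the consensus are often occupied by other hydrophobic residues such as I or V, which is one reason profile-based scoring is preferred over literal pattern matching.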
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
255-536 1.83e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.78  E-value: 1.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  255 DVQKKEYVCPAPHSEPPScNANSISCPSPCTCSNNIVDCRGK-GLMEIPANLPEGIVEIRLEQNSIKAIPAGAftqYKKL 333
Cdd:PRK15370   147 ELIWSEWVKEAPAKEAAN-REEAVQRMRDCLKNNKTELRLKIlGLTTIPACIPEQITTLILDNNELKSLPENL---QGNI 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  334 KRIDISKNQISDIA---PDAFQGLKsltslvLYGNKITEIVKGLfdgLVSLQLLLLNANKINCLRVNTFQDLQNlnlLSL 410
Cdd:PRK15370   223 KTLYANSNQLTSIPatlPDTIQEME------LSINRITELPERL---PSALQSLDLFHNKISCLPENLPEELRY---LSV 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  411 YDNKLQTISKGLfaPlQSIQTLHLAQN-----PFVCDCHLKWL-ADylqDNPIETSGArcSSPRRLANKRISQ----IKS 480
Cdd:PRK15370   291 YDNSIRTLPAHL--P-SGITHLNVQSNsltalPETLPPGLKTLeAG---ENALTSLPA--SLPPELQVLDVSKnqitVLP 362
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  481 KKFRCSGSEDYRSRfssECFMDLvcPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTD 536
Cdd:PRK15370   363 ETLPPTITTLDVSR---NALTNL--PENLPAALQIMQASRNNLVRLPESLPHFRGE 413
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
52-217 2.77e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.77e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   52 LRAVPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHledNQVSVIERGAFQDLKQLERLRlnkNKLQVLPELLFQS 131
Cdd:PRK15370   232 LTSIPATLPDTIQEMELSINRITELPERLPSALQSLDLFH---NKISCLPENLPEELRYLSVYD---NSIRTLPAHLPSG 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  132 tpkLTRLDLSENQIQGIPRKAFRGItdvKNLQLDNNHISCIEDGAFRALRDLEIltlNNNnisRILVTSFNHMPKIRTLR 211
Cdd:PRK15370   306 ---ITHLNVQSNSLTALPETLPPGL---KTLEAGENALTSLPASLPPELQVLDV---SKN---QITVLPETLPPTITTLD 373

                   ....*.
gi 1677539436  212 LHSNHL 217
Cdd:PRK15370   374 VSRNAL 379
EGF_CA smart00179
Calcium-binding EGF-like domain;
1081-1117 2.97e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 2.97e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1677539436  1081 DNDDCV-AHKCRHGAQCVDTINGYTCTCPQGFS-GPFCE 1117
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1005-1038 5.99e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.55  E-value: 5.99e-06
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1677539436 1005 DDCED-NDCENNATCVDGINNYVCICPPNYTGELC 1038
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
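The EGF_CA description above states that six conserved core cysteines form three disulfide bridges. A small sketch, assuming only the query row and start position shown in this alignment, can list where the cysteines fall in the query. The helper name `cysteine_positions` is illustrative. The code reports positions only and says nothing about which cysteines pair with each other.

```python
def cysteine_positions(aligned_row, start):
    """Return residue numbers of cysteines in one alignment row.

    `start` is the residue number of the first non-gap character.
    Gap characters ("-") do not consume a residue number.
    """
    positions = []
    residue = start
    for ch in aligned_row:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


if __name__ == "__main__":
    # Query row of the cd00054 hit covering residues 1005-1038.
    query_row = "DDCED-NDCENNATCVDGINNYVCICPPNYTGELC"
    print(cysteine_positions(query_row, start=1005))
```

For this hit the function returns six positions, in line with the six-cysteine core described for the domain.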
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
755-839 1.12e-05

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1), thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 48.24  E-value: 1.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  755 TELYLE------GNHLTAVPRELSALRH-LTLIDLSNNSISMLTNytFSNMSHLSTLILSYNRLRCIP--VHAFNGLRSL 825
Cdd:cd21340     93 EELHIEnqrlppGEKLTFDPRSLAALSNsLRVLNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEelLDLLSSWPSL 170
                           90
                   ....*....|....
gi 1677539436  826 RVLTLHGNDISSVP 839
Cdd:cd21340    171 RELDLTGNPVCKKP 184
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
410-473 2.19e-05

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 49.70  E-value: 2.19e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  410 LYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDCHLKWLADYLQDNPIET---SGARCSSPRRLANK 473
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
62-180 2.67e-05

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 49.08  E-value: 2.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   62 NAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVLPELLFQSTPKLTRLDLS 141
Cdd:PLN00113   476 RLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLS 555
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1677539436  142 ENQIQGIPRKAFRGITDVKNLQLDNNHI--SCIEDGAFRAL 180
Cdd:PLN00113   556 QNQLSGEIPKNLGNVESLVQVNISHNHLhgSLPSTGAFLAI 596
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
134-666 2.76e-05

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 49.08  E-value: 2.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  134 KLTRLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISC-IEDGAFRALRDLEILTLNNNNISRILVTSFnhMPKIRTLRL 212
Cdd:PLN00113    70 RVVSIDLSGKNISGKISSAIFRLPYIQTINLSNNQLSGpIPDDIFTTSSSLRYLNLSNNNFTGSIPRGS--IPNLETLDL 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  213 HSNHLYCDCHLawlsdwlrqrrTVGQFTlcmapvhlrGFNVADVQKKEYVCPAPHSEppscnANSISCPSPCTCSNNIVD 292
Cdd:PLN00113   148 SNNMLSGEIPN-----------DIGSFS---------SLKVLDLGGNVLVGKIPNSL-----TNLTSLEFLTLASNQLVG 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  293 crgkglmEIPANLPE--GIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKIT-E 369
Cdd:PLN00113   203 -------QIPRELGQmkSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgP 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  370 IVKGLFDglvslqllllnankinclrvntfqdLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCdchlkwla 449
Cdd:PLN00113   276 IPPSIFS-------------------------LQKLISLDLSDNSLSGEIPELVIQLQNLEILHLFSNNFTG-------- 322
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  450 dylqdnpiETSGARCSSPRRlankRISQIKSKKFRCSGSEDYRSRfSSECFMDLvcpEKCRCEGTIVD--CSNQKLVR-- 525
Cdd:PLN00113   323 --------KIPVALTSLPRL----QVLQLWSNKFSGEIPKNLGKH-NNLTVLDL---STNNLTGEIPEglCSSGNLFKli 386
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  526 -----IPSHLPEYVTD------LRLNDNEVSVlEATGIFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLE 594
Cdd:PLN00113   387 lfsnsLEGEIPKSLGAcrslrrVRLQDNSFSG-ELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARNKFF 465
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677539436  595 TVHGRVFRGlSGLKTLMLRSNLIGCVSNDTFAGLSSVRLLSLYDNRITTITPGAFTTLVSLSTINLLSNPFN 666
Cdd:PLN00113   466 GGLPDSFGS-KRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLS 536
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1007-1036 3.49e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 3.49e-05
                           10        20        30
                   ....*....|....*....|....*....|
gi 1677539436 1007 CEDNDCENNATCVDGINNYVCICPPNYTGE 1036
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1130-1160 3.93e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 3.93e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1130 CDQYECQNGAQCIVVQQEPTCRCPPGFAGPR 1160
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
963-1001 4.10e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.10e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1677539436  963 INTC-IQNPCQHGGTCHlsdSHKDGFSCSCPLGFEGQRCE 1001
Cdd:cd00054      2 IDECaSGNPCQNGGTCV---NTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1085-1115 4.29e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.98  E-value: 4.29e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1085 CVAHKCRHGAQCVDTINGYTCTCPQGFSGPF 1115
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
60-222 4.54e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.86  E-value: 4.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   60 PRNAERLDLDRNNIT-----RITKMdFAGLKNLRVLHLEDNQVSviERGA------FQDLKQLERLRLNKNKLQV----- 123
Cdd:COG5238    263 NTTVETLYLSGNQIGaegaiALAKA-LQGNTTLTSLDLSVNRIG--DEGAialaegLQGNKTLHTLNLAYNGIGAqgaia 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  124 LPELLfQSTPKLTRLDLSENQIQGIPRKAF----RGITDVKNLQLDNNHISciEDGAfRALRDLeiltLNNNnisrilvt 199
Cdd:COG5238    340 LAKAL-QENTTLHSLDLSDNQIGDEGAIALakylEGNTTLRELNLGKNNIG--KQGA-EALIDA----LQTN-------- 403
                          170       180
                   ....*....|....*....|...
gi 1677539436  200 sfnhmpKIRTLRLHSNHLYCDCH 222
Cdd:COG5238    404 ------RLHTLILDGNLIGAEAQ 420
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1079 4.84e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 4.84e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1677539436 1042 IDHCVpELNLCQHEAKCIPLDKGFSCECVPGYSGKLCE 1079
Cdd:cd00054      2 IDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRCT smart00082
Leucine rich repeat C-terminal domain;
663-712 7.68e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 41.65  E-value: 7.68e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1677539436   663 NPFNCNCHLAWLGKWLRKRRIVSG--NPRCQKPFFLKEiPIQDVAIQDFTCD 712
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLQDpvDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
33-64 8.72e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 41.15  E-value: 8.72e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436    33 ACPTKCTCSAASVDCHGLGLRAVPRGIPRNAE 64
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
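The two rows of each alignment can be compared column by column to see how closely the query follows the domain model's representative sequence. The sketch below is an illustration under simple assumptions: gaps are written as "-", lowercase letters are treated like uppercase, and only columns where both rows have a residue are counted. The function name `count_identities` is hypothetical, and this identity count is not the bit score reported by CDD.

```python
def count_identities(query_row, subject_row):
    """Return (identical_columns, aligned_columns) for two alignment rows.

    Columns containing a gap ("-") in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows of the smart00013 (LRRNT) hit covering query residues 33-64.
    query = "ACPTKCTCSAASVDCHGLGLRAVPRGIPRNAE"
    model = "ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT"
    identical, aligned = count_identities(query, model)
    print(f"{identical}/{aligned} identical columns")
```

The identical columns in this hit include the four cysteines, which matches the description of LRR flanking regions as cysteine-rich.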
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
33-60 8.94e-05

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 40.69  E-value: 8.94e-05
                           10        20
                   ....*....|....*....|....*...
gi 1677539436   33 ACPTKCTCSAASVDCHGLGLRAVPRGIP 60
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
188-250 1.02e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 aa. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 47.38  E-value: 1.02e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  188 LNNNNISRILVTSFNHMPKIRTLRLHSNHLYCDCHLAWLSDWLRQR--RTVG-QFTLCMAPVHLRG 250
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvKVRQpEAALCAGPGALAG 67
LRRNT smart00013
Leucine rich repeat N-terminal domain;
725-756 1.56e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 40.38  E-value: 1.56e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436   725 CPEQCTCMETVVRCSNKGLRALPRGMPKDVTE 756
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
65-215 1.60e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 45.42  E-value: 1.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   65 RLDLDRNNITRI-TKMDFAGLKNLRVLHLEDNQVSVIE----RGAFQDLKQLERLRLNKNKLQVLPELL------FQSTP 133
Cdd:cd00116      2 QLSLKGELLKTErATELLPKLLCLQVLRLEGNTLGEEAakalASALRPQPSLKELCLSLNETGRIPRGLqsllqgLTKGC 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  134 KLTRLDLSEN--QIQGIPR-KAFRGITDVKNLQLDNN--------------------------------HISCIE-DGAF 177
Cdd:cd00116     82 GLQELDLSDNalGPDGCGVlESLLRSSSLQELKLNNNglgdrglrllakglkdlppaleklvlgrnrleGASCEAlAKAL 161
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1677539436  178 RALRDLEILTLNNNNIS----RILVTSFNHMPKIRTLRLHSN 215
Cdd:cd00116    162 RANRDLKELNLANNGIGdagiRALAEGLKANCNLEVLDLNNN 203
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1088-1117 1.84e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 40.15  E-value: 1.84e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1677539436 1088 HKCRHGAQCVDTINGYTCTCPQGFSGPF-CE 1117
Cdd:cd00053      6 NPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
1005-1034 1.97e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 40.31  E-value: 1.97e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1677539436  1005 DDCE-DNDCENNATCVDGINNYVCICPPNYT 1034
Cdd:smart00179    3 DECAsGNPCQNGGTCVNTVGSYRCECPPGYT 33
Laminin_G_3 pfam13385
Concanavalin A-like lectin/glucanases superfamily; This domain belongs to the Concanavalin ...
1171-1318 2.15e-04

Concanavalin A-like lectin/glucanases superfamily; This domain belongs to the Concanavalin A-like lectin/glucanases superfamily.


Pssm-ID: 463865 [Multi-domain]  Cd Length: 151  Bit Score: 43.14  E-value: 2.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436 1171 GKDSYVELASAKVRPQAN-ISLQVATDKDNG---ILLYKGDNDPLALELYQ-GHVRLVYDSLSSPPTTVYSVETVNDGQF 1245
Cdd:pfam13385    2 GGSDYVTLPDALLPTSDFtVSAWVKPDSLPGwarAIISSSGGGGYSLGLDGdGRLRFAVNGGNGGWDTVTSGASVPLGQW 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677539436 1246 HSVeLVTLN-QTLNLVVDkGTPKSLGKLQKQPAVGINSPLYLGGiptstglsalRQGTDRPlggFHGCIHEVRI 1318
Cdd:pfam13385   82 THV-AVTYDgGTLRLYVN-GVLVGSSTLTGGPPPGTGGPLYIGR----------SPGGDDY---FNGLIDEVRI 140
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1090-1111 2.89e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 39.24  E-value: 2.89e-04
                           10        20
                   ....*....|....*....|..
gi 1677539436 1090 CRHGAQCVDTINGYTCTCPQGF 1111
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-415 2.93e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 45.61  E-value: 2.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   55 VPR--GIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQ-VLPELLFqS 131
Cdd:PLN00113   204 IPRelGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-S 282
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  132 TPKLTRLDLSENQIQG-IPRKafrgITDVKNLQ---LDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKI 207
Cdd:PLN00113   283 LQKLISLDLSDNSLSGeIPEL----VIQLQNLEilhLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNL 358
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  208 RTLRLHSNHL-------YCDC-HLawlsdwlrqrrtvgqFTLCMAPVHLRGfnvaDVQKKEYVCPAPHSEPPSCNANSIS 279
Cdd:PLN00113   359 TVLDLSTNNLtgeipegLCSSgNL---------------FKLILFSNSLEG----EIPKSLGACRSLRRVRLQDNSFSGE 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  280 CPSPCT----------CSNNIVDCRGKGLMEIPAnlpegIVEIRLEQNSIKA-IPAgaFTQYKKLKRIDISKNQISDIAP 348
Cdd:PLN00113   420 LPSEFTklplvyfldiSNNNLQGRINSRKWDMPS-----LQMLSLARNKFFGgLPD--SFGSKRLENLDLSRNQFSGAVP 492
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1677539436  349 DAFQGLKSLTSLVLYGNKITEIVKGLFDGLVSLQLLLLNANKINCLRVNTFQDLQNLNLLSLYDNKL 415
Cdd:PLN00113   493 RKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
CT smart00041
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ...
1473-1528 3.56e-04

C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.


Pssm-ID: 214482  Cd Length: 82  Bit Score: 40.85  E-value: 3.56e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  1473 SCATASKVPIMECRGGCgpQCCQPTRSKRRKYVFQCTDGSSFVEEVERHLECGCLA 1528
Cdd:smart00041   26 KCGSASSYSIQDVQHSC--SCCQPHKTKTRQVRLRCPDGSTVKKTVMHIEECGCEP 79
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
331-370 4.67e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.15  E-value: 4.67e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1677539436  331 KKLKRIDISKNQISDIapDAFQGLKSLTSLVLYGN-KITEI 370
Cdd:pfam12799    1 PNLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDL 39
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
61-374 5.17e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 44.84  E-value: 5.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   61 RNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSvieRGAFQDL-KQ--LERLRLNKNKLQ-VLPELLFQSTpKLT 136
Cdd:PLN00113   308 QNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFS---GEIPKNLgKHnnLTVLDLSTNNLTgEIPEGLCSSG-NLF 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  137 RLDLSENQIQGIPRKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNH 216
Cdd:PLN00113   384 KLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARNK 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  217 LYCDchlawLSDWLRQRRtvgqftlcmapvhLRGFNVADvqkkeyvcpaphseppscnaNSISCPSPctcsnnivdcrgK 296
Cdd:PLN00113   464 FFGG-----LPDSFGSKR-------------LENLDLSR--------------------NQFSGAVP------------R 493
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  297 GLMeipaNLPEgIVEIRLEQNSIKAIPAGAFTQYKKLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKIT-EIVKGL 374
Cdd:PLN00113   494 KLG----SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
LRRCT smart00082
Leucine rich repeat C-terminal domain;
437-467 7.17e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.95  E-value: 7.17e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1677539436   437 NPFVCDCHLKWLADYLQDNPI--ETSGARCSSP 467
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
66-218 7.31e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 44.45  E-value: 7.31e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   66 LDLDRNNIT-RITKMDFAgLKNLRVLHLEDNQVSvierGAFQDL---KQLERLRLNKNKLQVLPELLFQSTPKLTRLDLS 141
Cdd:PLN00113   433 LDISNNNLQgRINSRKWD-MPSLQMLSLARNKFF----GGLPDSfgsKRLENLDLSRNQFSGAVPRKLGSLSELMQLKLS 507
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677539436  142 ENQIQG-IPRKaFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLRLHSNHLY 218
Cdd:PLN00113   508 ENKLSGeIPDE-LSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVESLVQVNISHNHLH 584
EGF_CA smart00179
Calcium-binding EGF-like domain;
1042-1079 7.87e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 38.38  E-value: 7.87e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1677539436  1042 IDHCVpELNLCQHEAKCIPLDKGFSCECVPGYS-GKLCE 1079
Cdd:smart00179    2 IDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
535-665 7.89e-04

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increase PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell, while its depletion reduces the activity of PP1, leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 42.47  E-value: 7.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  535 TDLRLNDNEVSVLEAtgiFKKLPNLRKINLSNNKIKEVrEGaFDGAASVQELMLTGNQLE-----TVHGRVFRGLSG-LK 608
Cdd:cd21340     49 THLYLQNNQIEKIEN---LENLVNLKKLYLGGNRISVV-EG-LENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLR 123
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1677539436  609 TLMLRSNLIGCVSNdtFAGLSSVRLLSLYDNRITTITP--GAFTTLVSLSTINLLSNPF 665
Cdd:cd21340    124 VLNISGNNIDSLEP--LAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
LRRNT smart00013
Leucine rich repeat N-terminal domain;
280-311 1.54e-03

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 37.30  E-value: 1.54e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1677539436   280 CPSPCTCSNNIVDCRGKGLMEIPANLPEGIVE 311
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
280-306 1.88e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 37.22  E-value: 1.88e-03
                           10        20
                   ....*....|....*....|....*..
gi 1677539436  280 CPSPCTCSNNIVDCRGKGLMEIPANLP 306
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRRNT smart00013
Leucine rich repeat N-terminal domain;
505-535 2.19e-03

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 36.91  E-value: 2.19e-03
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1677539436   505 CPEKCRCEGTIVDCSNQKLVRIPSHLPEYVT 535
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
LRR smart00370
Leucine-rich repeats, outliers;
556-579 2.37e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 36.56  E-value: 2.37e-03
                            10        20
                    ....*....|....*....|....
gi 1677539436   556 LPNLRKINLSNNKIKEVREGAFDG 579
Cdd:smart00370    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
556-579 2.37e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 36.56  E-value: 2.37e-03
                            10        20
                    ....*....|....*....|....
gi 1677539436   556 LPNLRKINLSNNKIKEVREGAFDG 579
Cdd:smart00369    1 LPNLRELDLSNNQLSSLPPGAFQG 24
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1008-1036 2.64e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 2.64e-03
                           10        20
                   ....*....|....*....|....*....
gi 1677539436 1008 EDNDCENNATCVDGINNYVCICPPNYTGE 1036
Cdd:cd00053      4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
725-751 2.86e-03

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 36.45  E-value: 2.86e-03
                           10        20
                   ....*....|....*....|....*..
gi 1677539436  725 CPEQCTCMETVVRCSNKGLRALPRGMP 751
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1078-1113 3.14e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.81  E-value: 3.14e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1677539436 1078 CETDNDDCVAHkcrhgAQCVDTINGYTCTCPQGFSG 1113
Cdd:pfam12947    1 CSDNNGGCHPN-----ATCTNTGGSFTCTCNDGYTG 31
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1012-1031 3.25e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.54  E-value: 3.25e-03
                           10        20
                   ....*....|....*....|
gi 1677539436 1012 CENNATCVDGINNYVCICPP 1031
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPP 20
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1379-1407 3.33e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 3.33e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1677539436 1379 CLGHRCHH-GKCVATGTSYMCKCAEGYGGD 1407
Cdd:pfam00008    1 CAPNPCSNgGTCVDTPGGYTCICPEGYTGK 30
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
71-177 3.55e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 39.07  E-value: 3.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   71 NNITRITKMDFAGLKNLRVLHLEDNqVSVIERGAFQDLKqLERLRLNKNkLQVLPELLFQSTPKLTRLDLSENqIQGIPR 150
Cdd:pfam13306   20 SSLTSIGEYAFSNCTSLKSITLPSS-LTSIGSYAFYNCS-LTSITIPSS-LTSIGEYAFSNCSNLKSITLPSN-LTSIGS 95
                           90       100
                   ....*....|....*....|....*..
gi 1677539436  151 KAFRGiTDVKNLQLDNNHIScIEDGAF 177
Cdd:pfam13306   96 YAFSN-CSLKSITIPSSVTT-IGSYAF 120
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
966-999 3.93e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.21  E-value: 3.93e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1677539436  966 CIQNPCQHGGTCHlsdSHKDGFSCSCPLGFEGQR 999
Cdd:pfam00008    1 CAPNPCSNGGTCV---DTPGGYTCICPEGYTGKR 31
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
324-456 5.26e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 40.93  E-value: 5.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  324 AGAFTQYKKLKRIDISKNQISD-----IApDAFQGLKSLTSLVLYGNKITE-----IVKGLfDGLVSLQLLLLNANKI-- 391
Cdd:COG5238    229 AEALKGNKSLTTLDLSNNQIGDegviaLA-EALKNNTTVETLYLSGNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIgd 306
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677539436  392 -------NCLRVNtfqdlQNLNLLSLYDNKLQTI-SKGLFAPLQ---SIQTLHLAQNPfVCDCHLKWLADYLQDNP 456
Cdd:COG5238    307 egaialaEGLQGN-----KTLHTLNLAYNGIGAQgAIALAKALQentTLHSLDLSDNQ-IGDEGAIALAKYLEGNT 376
EGF_CA smart00179
Calcium-binding EGF-like domain;
969-1001 5.37e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.07  E-value: 5.37e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1677539436   969 NPCQHGGTCHlsdSHKDGFSCSCPLGFE-GQRCE 1001
Cdd:smart00179    9 NPCQNGGTCV---NTVGSYRCECPPGYTdGRNCE 39
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-567 5.52e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 41.37  E-value: 5.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436   55 VPRGIPRNAERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKL--QVLPELLFQST 132
Cdd:PLN00113   134 IPRGSIPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLvgQIPRELGQMKS 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  133 PKLtrLDLSENQIQG-IPrKAFRGITDVKNLQLDNNHISCIEDGAFRALRDLEILTLNNNNISRILVTSFNHMPKIRTLR 211
Cdd:PLN00113   214 LKW--IYLGYNNLSGeIP-YEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLD 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  212 LHSNHLYCDchlawLSDWLRQRRTvgqftlcMAPVHLRGFNVADvQKKEYVCPAPHSEPPSCNANSISCpspctcsnniv 291
Cdd:PLN00113   291 LSDNSLSGE-----IPELVIQLQN-------LEILHLFSNNFTG-KIPVALTSLPRLQVLQLWSNKFSG----------- 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  292 dcrgkglmEIPANLPEgiveirleQNSikaipagaftqykkLKRIDISKNQISDIAPDAFQGLKSLTSLVLYGNKI-TEI 370
Cdd:PLN00113   347 --------EIPKNLGK--------HNN--------------LTVLDLSTNNLTGEIPEGLCSSGNLFKLILFSNSLeGEI 396
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  371 VKGLFDGLVSLQLLLLNaNKINCLRVNTFQDLQNLNLLSLYDNKLQTISKGLFAPLQSIQTLHLAQNPFVCDchlkwLAD 450
Cdd:PLN00113   397 PKSLGACRSLRRVRLQD-NSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARNKFFGG-----LPD 470
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  451 YLQDNPIETsgarcsspRRLANKRISQIKSKKFRcSGSEDYRSRFSSECFMDLVCPEKCRCEGTI-VDCSNQKLV-RIPS 528
Cdd:PLN00113   471 SFGSKRLEN--------LDLSRNQFSGAVPRKLG-SLSELMQLKLSENKLSGEIPDELSSCKKLVsLDLSHNQLSgQIPA 541
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|.
gi 1677539436  529 HLPEY--VTDLRLNDNEVSVlEATGIFKKLPNLRKINLSNN 567
Cdd:PLN00113   542 SFSEMpvLSQLDLSQNQLSG-EIPKNLGNVESLVQVNISHN 581
LRR_9 pfam14580
Leucine-rich repeat;
539-622 5.77e-03

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 39.36  E-value: 5.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677539436  539 LNDNEVSVLEAtgiFKKLPNLRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGrvFRGLSGLKTLMLRSNLIG 618
Cdd:pfam14580   49 FSDNEIRKLDG---FPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLQELGD--LDPLASLKKLTFLSLLRN 123

                   ....
gi 1677539436  619 CVSN 622
Cdd:pfam14580  124 PVTN 127
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
133-155 6.15e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 6.15e-03
                            10        20
                    ....*....|....*....|...
gi 1677539436   133 PKLTRLDLSENQIQGIPRKAFRG 155
Cdd:smart00369    2 PNLRELDLSNNQLSSLPPGAFQG 24
LRR smart00370
Leucine-rich repeats, outliers;
133-155 6.15e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 6.15e-03
                            10        20
                    ....*....|....*....|...
gi 1677539436   133 PKLTRLDLSENQIQGIPRKAFRG 155
Cdd:smart00370    2 PNLRELDLSNNQLSSLPPGAFQG 24
LRR smart00370
Leucine-rich repeats, outliers;
822-845 6.78e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 6.78e-03
                            10        20
                    ....*....|....*....|....
gi 1677539436   822 LRSLRVLTLHGNDISSVPEGSFND 845
Cdd:smart00370    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
822-845 6.78e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.41  E-value: 6.78e-03
                            10        20
                    ....*....|....*....|....
gi 1677539436   822 LRSLRVLTLHGNDISSVPEGSFND 845
Cdd:smart00369    1 LPNLRELDLSNNQLSSLPPGAFQG 24
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
85-121 7.43e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 35.68  E-value: 7.43e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1677539436   85 KNLRVLHLEDNQVSVIErgAFQDLKQLERLRLNKNKL 121
Cdd:pfam12799    1 PNLEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNK 35
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
133-174 8.62e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 35.68  E-value: 8.62e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1677539436  133 PKLTRLDLSENQIQGIPrkAFRGITDVKNLQL-DNNHISCIED 174
Cdd:pfam12799    1 PNLEVLDLSNNQITDIP--PLAKLPNLETLDLsGNNKITDLSD 41
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
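
Every hit listed above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and all reported E-values fall under the 0.01 threshold given in the preset options. The following is a minimal sketch, not part of the CD-Search service, of how such summary lines could be parsed from this text output and checked against a threshold. The names SUMMARY_RE, parse_summary and passes_threshold are hypothetical, and the inclusive comparison (E-value <= threshold) is an assumption.

```python
import re

# Hypothetical pattern for the summary lines shown in this listing, e.g.
# "Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 39.15  E-value: 4.67e-04"
# The "[Multi-domain]" flag is optional; some hits in the listing omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the numeric fields in one summary line, or None."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed inclusive cutoff: keep hits whose E-value is <= threshold."""
    return hit["e_value"] <= threshold
```

For example, parse_summary applied to the LRR_4 summary line (Pssm-ID 463713) yields an E-value of 4.67e-04, which passes_threshold accepts at the default cutoff of 0.01.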

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-3.