NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|574281382|ref|NP_001276065|]
View 

slit homolog 2 protein isoform 3 precursor [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1175-1308 7.94e-37

Laminin G domain;


:

Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.94e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   1175 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1251
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382   1252 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1308
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
507-853 1.25e-22

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.25e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  507 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 586
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  587 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 664
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  665 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 744
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  745 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 822
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 574281382  823 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 853
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 1.38e-22

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.38e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886   114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 574281382  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886   191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
319-659 9.57e-18

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 9.57e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  319 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 398
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  399 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlANKRIGQIKskkfrc 478
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-----------LPEPLGNLT------ 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  479 sgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKKLPQLRKIN 556
Cdd:COG4886   206 --------NL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGNLTNLEELD 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  557 FSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITT 636
Cdd:COG4886   257 LSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNL 310
                         330       340
                  ....*....|....*....|...
gi 574281382  637 VAPGAFDTLHSLSTLNLLANPFN 659
Cdd:COG4886   311 LELLILLLLLTTLLLLLLLLKGL 333
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
824-1005 2.37e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.37e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   824 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 900
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   901 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 980
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 574281382   981 GFEGENCEVNVDDCEDNDCENNSTC 1005
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1068-1104 2.51e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.51e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382 1068 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1104
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
990-1026 4.34e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.34e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382  990 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1026
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.56e-06

Leucine rich repeat N-terminal domain;


:

Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.56e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 574281382     27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-319 7.80e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 245
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   246 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 308
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 574281382   309 QNTIKVIPPGA 319
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1030-1065 1.16e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.16e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 574281382 1030 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1065
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1122-1149 5.68e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.68e-03
                          10        20
                  ....*....|....*....|....*...
gi 574281382 1122 CQNGAQCIVRINEPICQCLPGYQGEKCE 1149
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1175-1308 7.94e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.94e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   1175 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1251
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382   1252 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1308
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1153-1306 8.62e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.62e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382 1153 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1228
Cdd:cd00110     1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382 1229 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1306
Cdd:cd00110    79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1182-1308 8.57e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.57e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  1182 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1259
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 574281382  1260 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1308
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
507-853 1.25e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.25e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  507 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 586
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  587 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 664
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  665 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 744
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  745 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 822
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 574281382  823 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 853
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 1.38e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.38e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886   114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 574281382  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886   191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
319-659 9.57e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 9.57e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  319 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 398
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  399 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlANKRIGQIKskkfrc 478
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-----------LPEPLGNLT------ 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  479 sgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKKLPQLRKIN 556
Cdd:COG4886   206 --------NL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGNLTNLEELD 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  557 FSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITT 636
Cdd:COG4886   257 LSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNL 310
                         330       340
                  ....*....|....*....|...
gi 574281382  637 VAPGAFDTLHSLSTLNLLANPFN 659
Cdd:COG4886   311 LELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
770-829 2.11e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 2.11e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   770 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 829
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
55-115 7.28e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 7.28e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382    55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 115
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
303-360 4.23e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 4.23e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382   303 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 360
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
824-1005 2.37e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.37e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   824 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 900
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   901 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 980
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 574281382   981 GFEGENCEVNVDDCEDNDCENNSTC 1005
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
629-705 1.08e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.08e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   629 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 705
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
60-211 1.61e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.50  E-value: 1.61e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   60 LDLNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQ 138
Cdd:cd21340     7 LYLNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNR 79
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  139 IQAIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 211
Cdd:cd21340    80 ISVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
746-846 2.70e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.70e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  746 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 825
Cdd:cd21340     2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                          90       100
                  ....*....|....*....|.
gi 574281382  826 GNDISVVpEGaFNDLSALSHL 846
Cdd:cd21340    77 GNRISVV-EG-LENLTNLEEL 95
PLN03150 PLN03150
hypothetical protein; Provisional
761-830 1.93e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.93e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  761 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 830
Cdd:PLN03150  434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-211 4.68e-08

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 57.94  E-value: 4.68e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLD 133
Cdd:PLN00113  404 RSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLD 481
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  134 LSENQIQ-AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:PLN00113  482 LSRNQFSgAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1068-1104 2.51e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.51e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382 1068 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1104
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
990-1026 4.34e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.34e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382  990 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1026
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
287-367 2.71e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.71e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  287 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 363
Cdd:PRK15370  228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                  ....
gi 574281382  364 PKSL 367
Cdd:PRK15370  299 PAHL 302
LRRCT smart00082
Leucine rich repeat C-terminal domain;
851-900 4.56e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.56e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 574281382    851 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 900
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.56e-06

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.56e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 574281382     27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-319 7.80e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 245
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   246 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 308
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 574281382   309 QNTIKVIPPGA 319
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF_CA smart00179
Calcium-binding EGF-like domain;
1068-1104 7.87e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.87e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 574281382   1068 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1104
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1072-1100 9.79e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.79e-06
                           10        20
                   ....*....|....*....|....*....
gi 574281382  1072 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1100
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1030-1065 1.16e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.16e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 574281382 1030 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1065
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
719-750 1.25e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.25e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 574281382    719 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 750
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
990-1026 1.66e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.66e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 574281382    990 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1026
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
209-258 2.54e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.54e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 574281382    209 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 258
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
994-1023 1.09e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.09e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382   994 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1023
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-367 1.58e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 46.38  E-value: 1.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLY 130
Cdd:PLN00113  308 QNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLF 383
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  131 RLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSN 209
Cdd:PLN00113  384 KLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARN 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  210 NLYcdchlawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsgHQSFMapscsvlhcpaactcsnnivdcrgk 289
Cdd:PLN00113  463 KFF------------------GGLPDSFGSKRLENLDLSRNQFSGAV---PRKLG------------------------- 496
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  290 glteiptNLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPKSL 367
Cdd:PLN00113  497 -------SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
27-54 1.68e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.68e-04
                           10        20
                   ....*....|....*....|....*...
gi 574281382    27 ACPAQCSCSGSTVDCHGLALRSVPRNIP 54
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
403-466 2.27e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.27e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382   403 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 466
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1035-1064 2.65e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.65e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382  1035 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1064
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
273-299 2.72e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.72e-04
                           10        20
                   ....*....|....*....|....*..
gi 574281382   273 CPAACTCSNNIVDCRGKGLTEIPTNLP 299
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRRCT smart00082
Leucine rich repeat C-terminal domain;
430-460 1.07e-03

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 1.07e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 574281382    430 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 460
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
956-988 1.45e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.45e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 574281382  956 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 988
Cdd:cd00054     9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
953-986 2.74e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.74e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 574281382   953 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 986
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
1030-1066 3.60e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.60e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 574281382   1030 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1066
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1122-1149 5.68e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.68e-03
                          10        20
                  ....*....|....*....|....*...
gi 574281382 1122 CQNGAQCIVRINEPICQCLPGYQGEKCE 1149
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1117-1146 6.40e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.40e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382  1117 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1146
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1175-1308 7.94e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.94e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   1175 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1251
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382   1252 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1308
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1153-1306 8.62e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.62e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382 1153 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1228
Cdd:cd00110     1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382 1229 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1306
Cdd:cd00110    79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1182-1308 8.57e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.57e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  1182 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1259
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 574281382  1260 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1308
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
1180-1311 1.89e-28

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 111.64  E-value: 1.89e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  1180 IATDEDSGILLYKGDKDH---IAVELYRGRVRASYDTGSHPASaIYSVETINDGNFHIVELLALDQSLSLSVDGG-NPKI 1255
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYDLGSGAAV-VRSGDKLNDGKWHSVELERNGRSGTLSVDGEaRPTG 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 574281382  1256 ITNLSKQSTLNFDSPLYVGGMPgkSNVASLRQAPgqNGTSFHGCIRNLYINSELQD 1311
Cdd:pfam00054   80 ESPLGATTDLDVDGPLYVGGLP--SLGVKKRRLA--ISPSFDGCIRDVIVNGKPLD 131
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
507-853 1.25e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.25e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  507 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 586
Cdd:COG4886     3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  587 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 664
Cdd:COG4886    83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  665 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 744
Cdd:COG4886   148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  745 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 822
Cdd:COG4886   179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                         330       340       350
                  ....*....|....*....|....*....|.
gi 574281382  823 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 853
Cdd:COG4886   256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
56-211 1.38e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.32  E-value: 1.38e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   56 NTERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLS 135
Cdd:COG4886   114 NLESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLS 190
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 574281382  136 ENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:COG4886   191 NNQITDLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
512-830 4.61e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 100.78  E-value: 4.61e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  512 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 591
Cdd:COG4886    75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  592 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 671
Cdd:COG4886   132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  672 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 751
Cdd:COG4886   190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  752 YLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLHGNDIS 830
Cdd:COG4886   211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
52-448 6.78e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.31  E-value: 6.78e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   52 NIPRNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKistiergAFQDLKELERLRLNRNHLQLFPELLFLGTaKLYR 131
Cdd:COG4886    69 LSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELANLT-NLKE 140
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  132 LDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIeDGAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRLHSNNl 211
Cdd:COG4886   141 LDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQ- 216
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  212 ycdchlawlsdwlrqrprvglytqcmgpshlrghnvaevqkrefvcsghqsfmapscsvlhcpaactcsnnivdcrgkgL 291
Cdd:COG4886   217 -------------------------------------------------------------------------------L 217
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  292 TEIPTNLPE--TITEIRLEQNTIKVIPpgAFSPYKKLRRIDLSNNQISELAPDAfqGLRSLNSLVLYGNKITELP-KSLF 368
Cdd:COG4886   218 TDLPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKlKELE 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  369 EGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTN 448
Cdd:COG4886   294 LLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLG 373
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
38-195 9.95e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 90.76  E-value: 9.95e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   38 TVDCHGLALRSVPRNIPRNT--ERLDLNGNNITRITKtDFAGLRHLRVLQLMENKISTIErGAFQDLKELERLRLNRNHL 115
Cdd:COG4886   140 ELDLSNNQLTDLPEPLGNLTnlKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQL 217
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  116 QLFPELLFlGTAKLYRLDLSENQIQAIPrkAFRGAVDIKNLQLDYNQISCIEDGAfrALRDLEVLTLNNNNITRLSVASF 195
Cdd:COG4886   218 TDLPEPLA-NLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDLKLKEL 292
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
319-659 9.57e-18

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 87.68  E-value: 9.57e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  319 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 398
Cdd:COG4886   108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  399 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlANKRIGQIKskkfrc 478
Cdd:COG4886   162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-----------LPEPLGNLT------ 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  479 sgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKKLPQLRKIN 556
Cdd:COG4886   206 --------NL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGNLTNLEELD 256
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  557 FSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITT 636
Cdd:COG4886   257 LSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNL 310
                         330       340
                  ....*....|....*....|...
gi 574281382  637 VAPGAFDTLHSLSTLNLLANPFN 659
Cdd:COG4886   311 LELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
770-829 2.11e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 2.11e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   770 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 829
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
283-452 4.43e-16

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 82.29  E-value: 4.43e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  283 IVDCRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKI 360
Cdd:COG4886   140 ELDLSNNQLTDLPEPLGNltNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDLSGNQL 217
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  361 TELPKSLfeglfslqllllnankinclrvdafQDLHNLNLLSLYDNKLQTIAKgtFSPLRAIQTMHLAQN-----PFICD 435
Cdd:COG4886   218 TDLPEPL-------------------------ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNqltdlPPLAN 270
                         170
                  ....*....|....*...
gi 574281382  436 CH-LKWLadYLHTNPIET 452
Cdd:COG4886   271 LTnLKTL--DLSNNQLTD 286
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
302-672 1.87e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 80.36  E-value: 1.87e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  302 ITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLfeglfslqllllna 381
Cdd:COG4886   115 LESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQLTDLPEEL-------------- 178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  382 nkinclrvdafQDLHNLNLLSLYDNKLQTIAKgTFSPLRAIQTMHLAQNPFicdchlkwladylhtNPIETSGARCTspr 461
Cdd:COG4886   179 -----------GNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL---------------TDLPEPLANLT--- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  462 rlankrigqikskkfrcsgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPE--HIPQYTaELRLNNNEFTV 539
Cdd:COG4886   229 -------------------------NL------------------ETLDLSNNQLTDLPElgNLTNLE-ELDLSNNQLTD 264
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  540 LEATGifkKLPQLRKINFSNNKITDIEEGAFEGASGVNeiLLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFI 619
Cdd:COG4886   265 LPPLA---NLTNLKTLDLSNNQLTDLKLKELELLLGLN--SLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTT 339
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 574281382  620 GLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLR 672
Cdd:COG4886   340 LALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTLLLL 392
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
38-192 2.63e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 79.98  E-value: 2.63e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   38 TVDCHGLALRSVPRNIPR--NTERLDLNGNNITRITKTdFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHL 115
Cdd:COG4886   163 SLDLSNNQLTDLPEELGNltNLKELDLSNNQITDLPEP-LGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQL 240
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382  116 QLFPELLFLgtAKLYRLDLSENQIQAIPrkAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSV 192
Cdd:COG4886   241 TDLPELGNL--TNLEELDLSNNQLTDLP--PLANLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLLEL 313
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
294-659 9.31e-15

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 78.44  E-value: 9.31e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  294 IPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNqiselapDAFQGLRSLNSLVLYGNKITELPKSLfeglfs 373
Cdd:COG4886    66 LLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-------EELSNLTNLESLDLSGNQLTDLPEEL------ 132
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  374 lqllllnankinclrvdafQDLHNLNLLSLYDNKLQTIAKGtfsplraiqtmhLAQNPficdcHLKWLadylhtnpiets 453
Cdd:COG4886   133 -------------------ANLTNLKELDLSNNQLTDLPEP------------LGNLT-----NLKSL------------ 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  454 garctsprrlankrigqikskkfrcsgtedyrsklsgdcfadlacpekcrcegttvDCSNQKLNKIPEHIPQYTA--ELR 531
Cdd:COG4886   165 --------------------------------------------------------DLSNNQLTDLPEELGNLTNlkELD 188
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  532 LNNNEFTVLEATgiFKKLPQLRKINFSNNKITDIEEgAFEGASGVNEILLTSNRLENVQHkmFKGLESLKTLMLRSNRIT 611
Cdd:COG4886   189 LSNNQITDLPEP--LGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNQLT 263
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*...
gi 574281382  612 CVGNDSfiGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFN 659
Cdd:COG4886   264 DLPPLA--NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLN 309
LRR_8 pfam13855
Leucine rich repeat;
55-115 7.28e-14

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 67.55  E-value: 7.28e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382    55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 115
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
574-634 2.72e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.72e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382   574 SGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQI 634
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
599-658 3.19e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 3.19e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   599 SLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPF 658
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
303-360 4.23e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 4.23e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382   303 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 360
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
154-211 1.34e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.00  E-value: 1.34e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382   154 KNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
824-1005 2.37e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.37e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   824 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 900
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   901 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 980
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 574281382   981 GFEGENCEVNVDDCEDNDCENNSTC 1005
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
629-705 1.08e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.08e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   629 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 705
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
LRR_8 pfam13855
Leucine rich repeat;
128-187 1.48e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 57.92  E-value: 1.48e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   128 KLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNI 187
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
60-211 1.61e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.50  E-value: 1.61e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   60 LDLNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQ 138
Cdd:cd21340     7 LYLNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNR 79
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  139 IQAIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 211
Cdd:cd21340    80 ISVV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
746-846 2.70e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.70e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  746 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 825
Cdd:cd21340     2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                          90       100
                  ....*....|....*....|.
gi 574281382  826 GNDISVVpEGaFNDLSALSHL 846
Cdd:cd21340    77 GNRISVV-EG-LENLTNLEEL 95
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
55-209 2.80e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 61.73  E-value: 2.80e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   55 RNTERLDLNGNNITRItkTDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQLFPELLF-----LGTAK- 128
Cdd:cd21340    46 TNLTHLYLQNNQIEKI--ENLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFdprslAALSNs 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  129 LYRLDLSENQIQaiprkafrgavDIKNLQldynqisciedgafrALRDLEVLTLNNNNITRLSVAS--FNHMPKLRTFRL 206
Cdd:cd21340   122 LRVLNISGNNID-----------SLEPLA---------------PLRNLEQLDASNNQISDLEELLdlLSSWPSLRELDL 175

                  ...
gi 574281382  207 HSN 209
Cdd:cd21340   176 TGN 178
LRR_8 pfam13855
Leucine rich repeat;
748-805 4.02e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 56.76  E-value: 4.02e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382   748 VTELYLDGNQFTLVPKE-LSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRL 805
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGaFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
793-853 8.62e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 55.99  E-value: 8.62e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382   793 TQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDLSALSHLAIGANPL 853
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
529-586 6.41e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.30  E-value: 6.41e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 574281382   529 ELRLNNNEFTVLEAtGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 586
Cdd:pfam13855    5 SLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
325-408 6.93e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.30  E-value: 6.93e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   325 KLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKITELPKslfeglfslqllllnankinclrvDAFQDLHNLNLLSLY 404
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSP------------------------GAFSGLPSLRYLDLS 57

                   ....
gi 574281382   405 DNKL 408
Cdd:pfam13855   58 GNRL 61
LRR_8 pfam13855
Leucine rich repeat;
79-139 7.00e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.30  E-value: 7.00e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382    79 RHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQLFPELLFLGTAKLYRLDLSENQI 139
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN03150 PLN03150
hypothetical protein; Provisional
761-830 1.93e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.93e-08
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  761 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 830
Cdd:PLN03150  434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
725-853 3.23e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 58.55  E-value: 3.23e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  725 CL---DTVVRCSNKGLKVLPKGIPRDVTELYLDGNQFTLVPKEL-SNYKHLT------------------LIDLSNNRIS 782
Cdd:PRK15370  175 CLknnKTELRLKILGLTTIPACIPEQITTLILDNNELKSLPENLqGNIKTLYansnqltsipatlpdtiqEMELSINRIT 254
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 574281382  783 TLSNQSFSnmtQLLTLILSYNRLRCIPPRTFDGlksLRLLSLHGNDISVVPEgafNDLSALSHLAIGANPL 853
Cdd:PRK15370  255 ELPERLPS---ALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPA---HLPSGITHLNVQSNSL 316
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-211 4.68e-08

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 57.94  E-value: 4.68e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLD 133
Cdd:PLN00113  404 RSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLD 481
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  134 LSENQIQ-AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:PLN00113  482 LSRNQFSgAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
LRR_8 pfam13855
Leucine rich repeat;
380-432 6.65e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.60  E-value: 6.65e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 574281382   380 NANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPF 432
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1068-1104 2.51e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.51e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382 1068 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1104
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
322-855 3.63e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 55.24  E-value: 3.63e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  322 PYkkLRRIDLSNNQISELAPDA-FQGLRSLNSLVLYGNKITelpKSLFEGLFSLQLLLLNANkiNCLRVDAFQDL---HN 397
Cdd:PLN00113   93 PY--IQTINLSNNQLSGPIPDDiFTTSSSLRYLNLSNNNFT---GSIPRGSIPNLETLDLSN--NMLSGEIPNDIgsfSS 165
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  398 LNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDC--------HLKWLadYLHTN------PIETSGarCTSPRRL 463
Cdd:PLN00113  166 LKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLVGQIprelgqmkSLKWI--YLGYNnlsgeiPYEIGG--LTSLNHL 241
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  464 A---NKRIGQIKSKKFRCSGTED---YRSKLSGDCFADLACPEKCrcegTTVDCSNQKLN-KIPEHIPQY-TAE-LRLNN 534
Cdd:PLN00113  242 DlvyNNLTGPIPSSLGNLKNLQYlflYQNKLSGPIPPSIFSLQKL----ISLDLSDNSLSgEIPELVIQLqNLEiLHLFS 317
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  535 NEFTVLEATGIfKKLPQLRKINFSNNKIT----------------DIEEGAFEG-------ASG-VNEILLTSNRLENVQ 590
Cdd:PLN00113  318 NNFTGKIPVAL-TSLPRLQVLQLWSNKFSgeipknlgkhnnltvlDLSTNNLTGeipeglcSSGnLFKLILFSNSLEGEI 396
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  591 HKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITtvapGAFDT----LHSLSTLNLLANPFNCNcylaw 666
Cdd:PLN00113  397 PKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQ----GRINSrkwdMPSLQMLSLARNKFFGG----- 467
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  667 LGEWLRKKRIvtgnprcqkpyflkeipiqdvaiqdftcddGNDDnscspLSRcptectcldtvvrcsNKGLKVLPKGIPR 746
Cdd:PLN00113  468 LPDSFGSKRL------------------------------ENLD-----LSR---------------NQFSGAVPRKLGS 497
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  747 --DVTELYLDGNQFT-LVPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLS 823
Cdd:PLN00113  498 lsELMQLKLSENKLSgEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVESLVQVN 577
                         570       580       590
                  ....*....|....*....|....*....|....
gi 574281382  824 LHGNDI--SVVPEGAFndlSALSHLAIGANPLYC 855
Cdd:PLN00113  578 ISHNHLhgSLPSTGAF---LAINASAVAGNIDLC 608
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
990-1026 4.34e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.34e-07
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 574281382  990 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1026
Cdd:cd00054     1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
287-367 2.71e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.71e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  287 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 363
Cdd:PRK15370  228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                  ....
gi 574281382  364 PKSL 367
Cdd:PRK15370  299 PAHL 302
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
290-411 3.35e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 3.35e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  290 GLTEIPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRslnslvLYGNKITELPKSLfe 369
Cdd:PRK15370  189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLPDTIQEME------LSINRITELPERL-- 260
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 574281382  370 gLFSLQLLLLNANKINCLRvDAFQDlhNLNLLSLYDNKLQTI 411
Cdd:PRK15370  261 -PSALQSLDLFHNKISCLP-ENLPE--ELRYLSVYDNSIRTL 298
LRRCT smart00082
Leucine rich repeat C-terminal domain;
851-900 4.56e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.56e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 574281382    851 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 900
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
27-58 5.56e-06

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 44.23  E-value: 5.56e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 574281382     27 ACPAQCSCSGSTVDCHGLALRSVPRNIPRNTE 58
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTT 32
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
182-319 7.80e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   182 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 245
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   246 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 308
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 574281382   309 QNTIKVIPPGA 319
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF_CA smart00179
Calcium-binding EGF-like domain;
1068-1104 7.87e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.87e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 574281382   1068 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1104
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1072-1100 9.79e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.79e-06
                           10        20
                   ....*....|....*....|....*....
gi 574281382  1072 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1100
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1030-1065 1.16e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.16e-05
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 574281382 1030 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1065
Cdd:cd00054     3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
719-750 1.25e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.25e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 574281382    719 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 750
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
990-1026 1.66e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.66e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 574281382    990 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1026
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
209-258 2.54e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.54e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 574281382    209 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 258
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
272-304 2.95e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 2.95e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 574281382    272 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 304
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
43-214 5.31e-05

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 46.96  E-value: 5.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   43 GLALRSVPRNIPRNT--ERLDLNGNNITRITKTDFAGLRH---LRVLQLMENKIS-TIER---GAFQDLKE-LERLRLNR 112
Cdd:cd00116    67 PRGLQSLLQGLTKGCglQELDLSDNALGPDGCGVLESLLRsssLQELKLNNNGLGdRGLRllaKGLKDLPPaLEKLVLGR 146
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  113 NHL------QLFPELLFLGtaKLYRLDLSENQI--QAIPR--KAFRGAVDIKNLQLDYNQISCIED----GAFRALRDLE 178
Cdd:cd00116   147 NRLegasceALAKALRANR--DLKELNLANNGIgdAGIRAlaEGLKANCNLEVLDLNNNGLTDEGAsalaETLASLKSLE 224
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 574281382  179 VLTLNNNNIT-----RLSVASFNHMPKLRTFRLHSNNLYCD 214
Cdd:cd00116   225 VLNLGDNNLTdagaaALASALLSPNISLLTLSLSCNDITDD 265
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
512-657 7.97e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.09  E-value: 7.97e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  512 SNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEATGIFKKL---PQLRKINFSNNKITDieegafEGASGVNEILLTSNRL 586
Cdd:COG5238   193 GDEGIEELAEALTQNTTvtTLWLKRNPIGDEGAEILAEALkgnKSLTTLDLSNNQIGD------EGVIALAEALKNNTTV 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  587 E-------NVQH-------KMFKGLESLKTLMLRSNRItcvGNDSFIGL-------SSVRLLSLYDNQITTV-APGAFDT 644
Cdd:COG5238   267 EtlylsgnQIGAegaialaKALQGNTTLTSLDLSVNRI---GDEGAIALaeglqgnKTLHTLNLAYNGIGAQgAIALAKA 343
                         170
                  ....*....|....*.
gi 574281382  645 LH---SLSTLNLLANP 657
Cdd:COG5238   344 LQentTLHSLDLSDNQ 359
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
994-1023 1.09e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.09e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382   994 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 1023
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
55-367 1.58e-04

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 46.38  E-value: 1.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   55 RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLY 130
Cdd:PLN00113  308 QNLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLF 383
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  131 RLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSN 209
Cdd:PLN00113  384 KLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARN 462
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  210 NLYcdchlawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsgHQSFMapscsvlhcpaactcsnnivdcrgk 289
Cdd:PLN00113  463 KFF------------------GGLPDSFGSKRLENLDLSRNQFSGAV---PRKLG------------------------- 496
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 574281382  290 glteiptNLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPKSL 367
Cdd:PLN00113  497 -------SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
27-54 1.68e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.68e-04
                           10        20
                   ....*....|....*....|....*...
gi 574281382    27 ACPAQCSCSGSTVDCHGLALRSVPRNIP 54
Cdd:pfam01462    1 ACPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
719-745 1.84e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.84e-04
                           10        20
                   ....*....|....*....|....*..
gi 574281382   719 CPTECTCLDTVVRCSNKGLKVLPKGIP 745
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
LRR_8 pfam13855
Leucine rich repeat;
177-211 1.84e-04

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 40.97  E-value: 1.84e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 574281382   177 LEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 211
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLL 37
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
403-466 2.27e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.27e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 574281382   403 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 466
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1035-1064 2.65e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.65e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382  1035 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1064
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
273-299 2.72e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.72e-04
                           10        20
                   ....*....|....*....|....*..
gi 574281382   273 CPAACTCSNNIVDCRGKGLTEIPTNLP 299
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
749-830 2.92e-04

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 44.01  E-value: 2.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  749 TELYLD------GNQFTLVP---KELSNykHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPP--RTFDGLK 817
Cdd:cd21340    93 EELHIEnqrlppGEKLTFDPrslAALSN--SLRVLNISGNNIDSLE--PLAPLRNLEQLDASNNQISDLEEllDLLSSWP 168
                          90
                  ....*....|...
gi 574281382  818 SLRLLSLHGNDIS 830
Cdd:cd21340   169 SLRELDLTGNPVC 181
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1077-1098 3.01e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 39.24  E-value: 3.01e-04
                           10        20
                   ....*....|....*....|..
gi 574281382  1077 CKNGAHCTDAVNGYTCICPEGY 1098
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
LRRCT smart00082
Leucine rich repeat C-terminal domain;
656-705 6.09e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.95  E-value: 6.09e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 574281382    656 NPFNCNCYLAWLGEWLRKKRIV--TGNPRCQKPYFLKEiPIQDVAIQDFTCD 705
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
497-527 6.37e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 38.45  E-value: 6.37e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 574281382    497 ACPEKCRCEGTTVDCSNQKLNKIPEHIPQYT 527
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDT 31
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
177-211 7.74e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 38.77  E-value: 7.74e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 574281382   177 LEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 211
Cdd:pfam12799    3 LEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNK 35
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1073-1104 1.02e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.23  E-value: 1.02e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 574281382 1073 QDNKCKNGAHCTDAVNGYTCICPEGYSGLF-CE 1104
Cdd:cd00053     4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRRCT smart00082
Leucine rich repeat C-terminal domain;
430-460 1.07e-03

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 1.07e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 574281382    430 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 460
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
35-190 1.16e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 43.24  E-value: 1.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   35 SGSTVDCHGL-ALRSVPRNiPRNTERLDLNGNNIT-----RITKTdFAGLRHLRVLQLMENKIStiERGA------FQDL 102
Cdd:COG5238   244 SNNQIGDEGViALAEALKN-NTTVETLYLSGNQIGaegaiALAKA-LQGNTTLTSLDLSVNRIG--DEGAialaegLQGN 319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  103 KELERLRLNRNHLQLfPELLFLGTA-----KLYRLDLSENQIQAIPRKAF----RGAVDIKNLQLDYNQISciEDGAfRA 173
Cdd:COG5238   320 KTLHTLNLAYNGIGA-QGAIALAKAlqentTLHSLDLSDNQIGDEGAIALakylEGNTTLRELNLGKNNIG--KQGA-EA 395
                         170
                  ....*....|....*..
gi 574281382  174 LRDLevltLNNNNITRL 190
Cdd:COG5238   396 LIDA----LQTNRLHTL 408
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
956-988 1.45e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.45e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 574281382  956 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 988
Cdd:cd00054     9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
390-638 1.60e-03

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 42.34  E-value: 1.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  390 DAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHlaqnpficdcHLKwladyLHTNPIETSGAR--CTSPRRLankr 467
Cdd:cd00116    75 QGLTKGCGLQELDLSDNALGPDGCGVLESLLRSSSLQ----------ELK-----LNNNGLGDRGLRllAKGLKDL---- 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  468 igQIKSKKFRCSgtedyRSKLSGDCFADLAcpekcrcegtTVDCSNQKLNkipehipqytaELRLNNNEFTVLEATGIFK 547
Cdd:cd00116   136 --PPALEKLVLG-----RNRLEGASCEALA----------KALRANRDLK-----------ELNLANNGIGDAGIRALAE 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  548 KLP---QLRKINFSNNKITDIEEGAFEGAsgvneilltsnrlenvqhkmFKGLESLKTLMLRSNRITCVG-----NDSFI 619
Cdd:cd00116   188 GLKancNLEVLDLNNNGLTDEGASALAET--------------------LASLKSLEVLNLGDNNLTDAGaaalaSALLS 247
                         250
                  ....*....|....*....
gi 574281382  620 GLSSVRLLSLYDNQITTVA 638
Cdd:cd00116   248 PNISLLTLSLSCNDITDDG 266
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
49-408 2.26e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 42.53  E-value: 2.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   49 VPRNIP--RNTERLDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQ-LFPELLFlG 125
Cdd:PLN00113  204 IPRELGqmKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-S 282
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  126 TAKLYRLDLSENQIQA-IPRKAfrgaVDIKNLQ---LDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKL 201
Cdd:PLN00113  283 LQKLISLDLSDNSLSGeIPELV----IQLQNLEilhLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNL 358
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  202 RTFRLHSNNL-------YCDC----HLAWLSDWLR-QRPRVGLYTQCMGPSHLRGHNVAEVQKREFVCSGHQSFMAPScs 269
Cdd:PLN00113  359 TVLDLSTNNLtgeipegLCSSgnlfKLILFSNSLEgEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDIS-- 436
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  270 vlhcpaactcSNNIVDCRGKGLTEIPTnlpetITEIRLEQNTIKVIPPGAFSPyKKLRRIDLSNNQISELAPDAFQGLRS 349
Cdd:PLN00113  437 ----------NNNLQGRINSRKWDMPS-----LQMLSLARNKFFGGLPDSFGS-KRLENLDLSRNQFSGAVPRKLGSLSE 500
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  350 LNSLVLYGNKIT-ELPKSLfEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKL 408
Cdd:PLN00113  501 LMQLKLSENKLSgEIPDEL-SSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
995-1026 2.43e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 2.43e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 574281382  995 EDNDCENNSTCVDGINNYTCLCPPEYTGEL-CE 1026
Cdd:cd00053     4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
324-364 2.45e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.22  E-value: 2.45e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 574281382   324 KKLRRIDLSNNQISELapDAFQGLRSLNSLVLYGN-KITELP 364
Cdd:pfam12799    1 PNLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLS 40
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
515-658 2.66e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 40.92  E-value: 2.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  515 KLNKIP--EHIPQYTaELRLNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEegAFEGASGVNEILLTSNRLENVQHK 592
Cdd:cd21340    35 KITKIEnlEFLTNLT-HLYLQNNQIEKIEN---LENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKL 108
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 574281382  593 MF-----KGL-ESLKTLMLRSNRITCVgnDSFIGLSSVRLLSLYDNQITTVAP--GAFDTLHSLSTLNLLANPF 658
Cdd:cd21340   109 TFdprslAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
953-986 2.74e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.74e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 574281382   953 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 986
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
280-449 3.58e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 41.70  E-value: 3.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  280 SNNIVDCRGKGLTEIPTnLPETITEIRLEQNTIKviPPGA------FSPYKKLRRIDLSNNQIS-----ELApDAFQGLR 348
Cdd:COG5238   189 CNQIGDEGIEELAEALT-QNTTVTTLWLKRNPIG--DEGAeilaeaLKGNKSLTTLDLSNNQIGdegviALA-EALKNNT 264
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382  349 SLNSLVLYGNKITE-----LPKSLfEGLFSLQLLLLNANKINCLRVDAFQDL----HNLNLLSLYDNKLQT-----IAKg 414
Cdd:COG5238   265 TVETLYLSGNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIGDEGAIALAEGlqgnKTLHTLNLAYNGIGAqgaiaLAK- 342
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 574281382  415 TFSPLRAIQTMHLAQNPfICDCHLKWLADYLHTNP 449
Cdd:COG5238   343 ALQENTTLHSLDLSDNQ-IGDEGAIALAKYLEGNT 376
EGF_CA smart00179
Calcium-binding EGF-like domain;
1030-1066 3.60e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.60e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 574281382   1030 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1066
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
999-1020 3.74e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.16  E-value: 3.74e-03
                           10        20
                   ....*....|....*....|..
gi 574281382   999 CENNSTCVDGINNYTCLCPPEY 1020
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1037-1064 3.93e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.30  E-value: 3.93e-03
                          10        20
                  ....*....|....*....|....*...
gi 574281382 1037 NPCQHDSKCILTPKGFKCDCTPGYVGEH 1064
Cdd:cd00053     6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
325-346 4.51e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.79  E-value: 4.51e-03
                            10        20
                    ....*....|....*....|..
gi 574281382    325 KLRRIDLSNNQISELAPDAFQG 346
Cdd:smart00369    3 NLRELDLSNNQLSSLPPGAFQG 24
LRR smart00370
Leucine-rich repeats, outliers;
325-346 4.51e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 35.79  E-value: 4.51e-03
                            10        20
                    ....*....|....*....|..
gi 574281382    325 KLRRIDLSNNQISELAPDAFQG 346
Cdd:smart00370    3 NLRELDLSNNQLSSLPPGAFQG 24
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
289-370 5.13e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 38.68  E-value: 5.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   289 KGLTEIptNLPETITEIR--------LE----QNTIKVIPPGAFSPYKKLRRIDLSNNqISELAPDAFQGLrSLNSLVLy 356
Cdd:pfam13306   34 TSLKSI--TLPSSLTSIGsyafyncsLTsitiPSSLTSIGEYAFSNCSNLKSITLPSN-LTSIGSYAFSNC-SLKSITI- 108
                           90
                   ....*....|....
gi 574281382   357 GNKITELPKSLFEG 370
Cdd:pfam13306  109 PSSVTTIGSYAFSN 122
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1122-1149 5.68e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.68e-03
                          10        20
                  ....*....|....*....|....*...
gi 574281382 1122 CQNGAQCIVRINEPICQCLPGYQGEKCE 1149
Cdd:cd00054    11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
914-943 5.91e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 5.91e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382   914 CLSNPCKNDGTCNSDPVDfYRCTCPYGFKG 943
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGG-YTCICPEGYTG 29
LRR_9 pfam14580
Leucine-rich repeat;
532-611 6.00e-03

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 39.36  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 574281382   532 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL-ENVQHKMFKGLESLKTLMLRSNRI 610
Cdd:pfam14580   49 FSDNEIRKLDG---FPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLqELGDLDPLASLKKLTFLSLLRNPV 125

                   .
gi 574281382   611 T 611
Cdd:pfam14580  126 T 126
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1117-1146 6.40e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.40e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 574281382  1117 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1146
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
912-946 8.51e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.31  E-value: 8.51e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 574281382  912 NPCLS-NPCKNDGTCNSDPVDfYRCTCPYGFKGQDC 946
Cdd:cd00054     3 DECASgNPCQNGGTCVNTVGS-YRCSCPPGYTGRNC 37
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1122-1143 9.06e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.00  E-value: 9.06e-03
                           10        20
                   ....*....|....*....|..
gi 574281382  1122 CQNGAQCIVRINEPICQCLPGY 1143
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH