NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034641845|ref|XP_016864334|]
View 

slit homolog 2 protein isoform X4 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1149-1282 7.80e-37

Laminin G domain;


:

Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.80e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  1149 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1225
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  1226 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1282
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
481-827 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  481 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 560
Cdd:COG4886      3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  561 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 638
Cdd:COG4886     83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  639 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 718
Cdd:COG4886    148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  719 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 796
Cdd:COG4886    179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  797 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 827
Cdd:COG4886    256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.34e-21

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.34e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886    119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886    196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
285-633 3.88e-17

Leucine-rich repeat (LRR) protein [Transcription];


:

Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.88e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  285 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 364
Cdd:COG4886    108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  365 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 444
Cdd:COG4886    162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  445 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 522
Cdd:COG4886    197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  523 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 602
Cdd:COG4886    249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  603 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 633
Cdd:COG4886    303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
798-979 2.32e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.32e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  798 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 874
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  875 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 954
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 1034641845  955 GFEGENCEVNVDDCEDNDCENNSTC 979
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1078 2.37e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.37e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845 1042 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1078
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
964-1000 4.10e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.10e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845  964 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1000
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PCC super family cl28216
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-285 7.64e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


The actual alignment was detected with superfamily member TIGR00864:

Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 211
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  212 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 274
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 1034641845  275 QNTIKVIPPGA 285
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1004-1039 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1034641845 1004 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1039
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1096-1123 5.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.37e-03
                           10        20
                   ....*....|....*....|....*...
gi 1034641845 1096 CQNGAQCIVRINEPICQCLPGYQGEKCE 1123
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1149-1282 7.80e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.80e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  1149 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1225
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  1226 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1282
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1127-1280 8.46e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.46e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845 1127 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1202
Cdd:cd00110      1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845 1203 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1280
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1156-1282 8.41e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.41e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845 1156 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1233
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1034641845 1234 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1282
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
481-827 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  481 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 560
Cdd:COG4886      3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  561 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 638
Cdd:COG4886     83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  639 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 718
Cdd:COG4886    148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  719 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 796
Cdd:COG4886    179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  797 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 827
Cdd:COG4886    256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.34e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.34e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886    119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886    196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
285-633 3.88e-17

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.88e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  285 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 364
Cdd:COG4886    108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  365 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 444
Cdd:COG4886    162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  445 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 522
Cdd:COG4886    197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  523 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 602
Cdd:COG4886    249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  603 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 633
Cdd:COG4886    303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
744-803 1.66e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 1.66e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  744 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 803
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
27-81 3.13e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 3.13e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034641845   27 DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 81
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
269-326 3.62e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.62e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845  269 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 326
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
798-979 2.32e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.32e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  798 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 874
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  875 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 954
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 1034641845  955 GFEGENCEVNVDDCEDNDCENNSTC 979
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
603-679 1.05e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.05e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  603 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 679
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
720-820 2.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  720 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 799
Cdd:cd21340      2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                           90       100
                   ....*....|....*....|.
gi 1034641845  800 GNDISVVpEGaFNDLSALSHL 820
Cdd:cd21340     77 GNRISVV-EG-LENLTNLEEL 95
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-177 5.67e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 5.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQIQ 106
Cdd:cd21340      9 LNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNRIS 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  107 AIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 177
Cdd:cd21340     82 VV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
PLN03150 PLN03150
hypothetical protein; Provisional
735-804 1.89e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.89e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  735 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 804
Cdd:PLN03150   434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1078 2.37e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.37e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845 1042 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1078
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
964-1000 4.10e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.10e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845  964 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1000
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-177 8.66e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 53.70  E-value: 8.66e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLDLSENQIQ 106
Cdd:PLN00113   411 LQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLDLSRNQFS 488
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034641845  107 -AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:PLN00113   489 gAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
253-333 2.66e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  253 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 329
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   ....
gi 1034641845  330 PKSL 333
Cdd:PRK15370   299 PAHL 302
LRRCT smart00082
Leucine rich repeat C-terminal domain;
825-874 4.02e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.02e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1034641845   825 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 874
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
EGF_CA smart00179
Calcium-binding EGF-like domain;
1042-1078 7.16e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.16e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1034641845  1042 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1078
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-285 7.64e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 211
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  212 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 274
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 1034641845  275 QNTIKVIPPGA 285
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1046-1074 9.53e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.53e-06
                           10        20
                   ....*....|....*....|....*....
gi 1034641845 1046 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1074
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1004-1039 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1034641845 1004 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1039
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
693-724 1.17e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.17e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1034641845   693 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 724
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
964-1000 1.55e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.39  E-value: 1.55e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1034641845   964 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1000
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
175-224 2.22e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.22e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1034641845   175 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 224
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
968-997 1.06e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.06e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845  968 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 997
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
369-432 2.23e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.23e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  369 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 432
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
239-265 2.42e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.42e-04
                           10        20
                   ....*....|....*....|....*..
gi 1034641845  239 CPAACTCSNNIVDCRGKGLTEIPTNLP 265
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1009-1038 2.55e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.55e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845 1009 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1038
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
LRRCT smart00082
Leucine rich repeat C-terminal domain;
396-426 9.50e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 9.50e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1034641845   396 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 426
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
930-962 1.44e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.44e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1034641845  930 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 962
Cdd:cd00054      9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
927-960 2.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.65e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1034641845  927 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 960
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-333 2.66e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 42.53  E-value: 2.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLYRLDLSEN 103
Cdd:PLN00113   315 LFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLFKLILFSN 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  104 QIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSNNLYcdch 182
Cdd:PLN00113   391 SLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARNKFF---- 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  183 lawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsgHQSFMapscsvlhcpaactcsnnivdcrgkglteipt 262
Cdd:PLN00113   466 --------------GGLPDSFGSKRLENLDLSRNQFSGAV---PRKLG-------------------------------- 496
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034641845  263 NLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPKSL 333
Cdd:PLN00113   497 SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
EGF_CA smart00179
Calcium-binding EGF-like domain;
1004-1040 3.33e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.33e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1034641845  1004 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1040
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1096-1123 5.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.37e-03
                           10        20
                   ....*....|....*....|....*...
gi 1034641845 1096 CQNGAQCIVRINEPICQCLPGYQGEKCE 1123
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1091-1120 6.23e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.23e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845 1091 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1120
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
LamG smart00282
Laminin G domain;
1149-1282 7.80e-37

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 135.54  E-value: 7.80e-37
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  1149 NITLQIATDEDSGILLY---KGDKDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGG 1225
Cdd:smart00282    1 SISFSFRTTSPNGLLLYagsKGGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  1226 NPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYINSE 1282
Cdd:smart00282   81 NRVSGESPGGLTILNLDGPLYLGGLP-----EDLKLPPLPVTPGFRGCIRNLKVNGK 132
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
1127-1280 8.46e-33

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 124.84  E-value: 8.46e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845 1127 SVNFiNKESYLQIP-SAKVRPQTNITLQIATDEDSGILLYKGDK---DHIAVELYRGRVRASYDTGSHPASaIYSVETIN 1202
Cdd:cd00110      1 GVSF-SGSSYVRLPtLPAPRTRLSISFSFRTTSPNGLLLYAGSQnggDFLALELEDGRLVLRYDLGSGSLV-LSSKTPLN 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845 1203 DGNFHIVELLALDQSLSLSVDGGNPKIITNLSKQSTLNFDSPLYVGGMPgksnvASLRQAPGQNGTSFHGCIRNLYIN 1280
Cdd:cd00110     79 DGQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLDGPLYLGGLP-----EDLKSPGLPVSPGFVGCIRDLKVN 151
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
1156-1282 8.41e-32

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 120.99  E-value: 8.41e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845 1156 TDEDSGILLYKGD--KDHIAVELYRGRVRASYDTGSHPASAIYSVETINDGNFHIVELLALDQSLSLSVDGGNPKIITNL 1233
Cdd:pfam02210    3 TRQPNGLLLYAGGggSDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSLPP 82
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1034641845 1234 SKQSTLNFDSPLYVGGMPGKSNVASLRQAPGqngtsFHGCIRNLYINSE 1282
Cdd:pfam02210   83 GESLLLNLNGPLYLGGLPPLLLLPALPVRAG-----FVGCIRDVRVNGE 126
Laminin_G_1 pfam00054
Laminin G domain;
1154-1285 1.86e-28

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 111.64  E-value: 1.86e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845 1154 IATDEDSGILLYKGDKDH---IAVELYRGRVRASYDTGSHPASaIYSVETINDGNFHIVELLALDQSLSLSVDGG-NPKI 1229
Cdd:pfam00054    1 FRTTEPSGLLLYNGTQTErdfLALELRDGRLEVSYDLGSGAAV-VRSGDKLNDGKWHSVELERNGRSGTLSVDGEaRPTG 79
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034641845 1230 ITNLSKQSTLNFDSPLYVGGMPgkSNVASLRQAPgqNGTSFHGCIRNLYINSELQD 1285
Cdd:pfam00054   80 ESPLGATTDLDVDGPLYVGGLP--SLGVKKRRLA--ISPSFDGCIRDVIVNGKPLD 131
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
481-827 1.13e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 102.70  E-value: 1.13e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  481 TTVDCSNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 560
Cdd:COG4886      3 LLLLSLTLKLLLLLLLELLTTLILLLLLLLLLLALLLLSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  561 ENVQ--HKMFKGLESLKTLMLRsnritcvGNDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLLANPfncncyl 638
Cdd:COG4886     83 SLLLlgLTDLGDLTNLTELDLS-------GNEELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQ------- 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  639 awlgewlrkkrivtgnprcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcLdTVVRCSNKGLKVLPKGI 718
Cdd:COG4886    148 ------------------------LTDLP--------------------EPLGNLTN----L-KSLDLSNNQLTDLPEEL 178
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  719 PR--DVTELYLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSNqSFSNMTQLLTLILSYNRLRCIPprTFDGLKSLRLL 796
Cdd:COG4886    179 GNltNLKELDLSNNQITDLPEPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEEL 255
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  797 SLHGNDISVVPEGAfnDLSALSHLAIGANPL 827
Cdd:COG4886    256 DLSNNQLTDLPPLA--NLTNLKTLDLSNNQL 284
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
486-804 4.04e-22

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 100.78  E-value: 4.04e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  486 SNQKLNKIPEHIPQYTAELRLNNNEFTVLEATGIFKKLPQLRKINFSNNKITDIEEGafegasgvneilltsnrlenvqh 565
Cdd:COG4886     75 LLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNEELSNLTNLESLDLSGNQLTDLPEE----------------------- 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  566 kmFKGLESLKTLMLRSNRITCVGnDSFIGLSSVRLLSLYDNQITTVaPGAFDTLHSLSTLNLlanpfncncylawlgewl 645
Cdd:COG4886    132 --LANLTNLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDL------------------ 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  646 rkkrivTGNPrcqkpyfLKEIPiqdvaiqdftcddgnddnscSPLSRCPTectcldtvvrcsnkglkvlpkgiprdVTEL 725
Cdd:COG4886    190 ------SNNQ-------ITDLP--------------------EPLGNLTN--------------------------LEEL 210
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034641845  726 YLDGNQFTLVPKELSNYKHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLHGNDIS 804
Cdd:COG4886    211 DLSGNQLTDLPEPLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPP--LANLTNLKTLDLSNNQLT 285
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
27-177 5.34e-21

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 97.70  E-value: 5.34e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   27 DLNGNNITRITKtDFAGLRHLRVLQLMENKISTIERgAFQDLKELERLRLNRNHLQLFPELLfLGTAKLYRLDLSENQIQ 106
Cdd:COG4886    119 DLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLPEEL-GNLTNLKELDLSNNQIT 195
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  107 AIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:COG4886    196 DLP-EPLGNLTNLEELDLSGNQLTDLPE-PLANLTNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQL 262
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
13-409 5.12e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 91.53  E-value: 5.12e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   13 ESINGVTLRLPASRDLNGNNITRITKTDFAGLRHLRVLQLMENKistiergAFQDLKELERLRLNRNHLQLFPELLFLGT 92
Cdd:COG4886     64 SLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGNE-------ELSNLTNLESLDLSGNQLTDLPEELANLT 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   93 aKLYRLDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIeDGAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRL 172
Cdd:COG4886    137 -NLKELDLSNNQLTDLP-EPLGNLTNLKSLDLSNNQLTDL-PEELGNLTNLKELDLSNNQITDLP-EPLGNLTNLEELDL 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  173 HSNNLYCdchlawLSDWLRQrprvglytqcmgpshlrghnvaevqkrefvcsghqsfmapscsvlhcpaactCSN-NIVD 251
Cdd:COG4886    213 SGNQLTD------LPEPLAN----------------------------------------------------LTNlETLD 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  252 CRGKGLTEIP--TNLPEtITEIRLEQNTIKVIPPGAFSPykKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKITEL 329
Cdd:COG4886    235 LSNNQLTDLPelGNLTN-LEELDLSNNQLTDLPPLANLT--NLKTLDLSNNQLTDLKLKELELLLGLNSLLLLLLLLNLL 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  330 PksLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLAD 409
Cdd:COG4886    312 E--LLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTL 389
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
13-414 5.76e-19

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 91.15  E-value: 5.76e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   13 ESINGVTLRLPASRDLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlqlfPELLFLgt 92
Cdd:COG4886     40 LSLLSLLLLLTLLLSLLLRDLLLSSLLLLLSLLLLLLLSLLLLSLLLLGLTDLGDLTNLTELDLSGN-----EELSNL-- 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   93 AKLYRLDLSENQIQAIPrKAFRGAVDIKNLQLDYNQISCIEDgAFRALRDLEVLTLNNNNITRLSvASFNHMPKLRTFRL 172
Cdd:COG4886    113 TNLESLDLSGNQLTDLP-EELANLTNLKELDLSNNQLTDLPE-PLGNLTNLKSLDLSNNQLTDLP-EELGNLTNLKELDL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  173 HSNNlycdchlawLSDWlrqrprvglytqcmgPSHLRGhnvaevqkrefvcsghqsfmapscsvlhcpaactCSN-NIVD 251
Cdd:COG4886    190 SNNQ---------ITDL---------------PEPLGN----------------------------------LTNlEELD 211
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  252 CRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPpgAFSPYKKLRRIDLSNNQISELAPDAfqGLRSLNSLVLYGNKITEL 329
Cdd:COG4886    212 LSGNQLTDLPEPLANltNLETLDLSNNQLTDLP--ELGNLTNLEELDLSNNQLTDLPPLA--NLTNLKTLDLSNNQLTDL 287
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  330 P-KSLFEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLA 408
Cdd:COG4886    288 KlKELELLLGLNSLLLLLLLLNLLELLILLLLLTTLLLLLLLLKGLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLL 367

                   ....*.
gi 1034641845  409 DYLHTN 414
Cdd:COG4886    368 TLGLLG 373
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
285-633 3.88e-17

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 85.76  E-value: 3.88e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  285 AFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLFEglfslqllllnankinclrvdafqdLHNL 364
Cdd:COG4886    108 ELSNLTNLESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLPEPLGN-------------------------LTNL 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  365 NLLSLYDNKLQTIAKGtFSPLRaiqtmhlaqnpficdcHLKWLadYLHTNPIETsgarctsprrlankrigqikskkfrc 444
Cdd:COG4886    162 KSLDLSNNQLTDLPEE-LGNLT----------------NLKEL--DLSNNQITD-------------------------- 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  445 sakeqyfIPGTEDYRSKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEAtgiFKK 522
Cdd:COG4886    197 -------LPEPLGNLTNL------------------EELDLSGNQLTDLPEPLANLTNleTLDLSNNQLTDLPE---LGN 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  523 LPQLRKINFSNNKITDIEEGAfegasgvneilltsnrlenvqhkmfkGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLS 602
Cdd:COG4886    249 LTNLEELDLSNNQLTDLPPLA--------------------------NLTNLKTLDLSNNQLTDLKLKELELLLGLNSLL 302
                          330       340       350
                   ....*....|....*....|....*....|.
gi 1034641845  603 LYDNQITTVAPGAFDTLHSLSTLNLLANPFN 633
Cdd:COG4886    303 LLLLLLNLLELLILLLLLTTLLLLLLLLKGL 333
LRR_8 pfam13855
Leucine rich repeat;
744-803 1.66e-16

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 74.87  E-value: 1.66e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  744 HLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDI 803
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
249-418 4.22e-16

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 82.67  E-value: 4.22e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  249 IVDCRGKGLTEIPTNLPE--TITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKI 326
Cdd:COG4886    140 ELDLSNNQLTDLPEPLGNltNLKSLDLSNNQLTDLPE-ELGNLTNLKELDLSNNQITDL-PEPLGNLTNLEELDLSGNQL 217
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  327 TELPKSLfeglfslqllllnankinclrvdafQDLHNLNLLSLYDNKLQTIAKgtFSPLRAIQTMHLAQN-----PFICD 401
Cdd:COG4886    218 TDLPEPL-------------------------ANLTNLETLDLSNNQLTDLPE--LGNLTNLEELDLSNNqltdlPPLAN 270
                          170
                   ....*....|....*...
gi 1034641845  402 CH-LKWLadYLHTNPIET 418
Cdd:COG4886    271 LTnLKTL--DLSNNQLTD 286
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
268-646 1.61e-14

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 77.67  E-value: 1.61e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  268 ITEIRLEQNTIKVIPPgAFSPYKKLRRIDLSNNQISELaPDAFQGLRSLNSLVLYGNKITELPKSLfeglfslqllllna 347
Cdd:COG4886    115 LESLDLSGNQLTDLPE-ELANLTNLKELDLSNNQLTDL-PEPLGNLTNLKSLDLSNNQLTDLPEEL-------------- 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  348 nkinclrvdafQDLHNLNLLSLYDNKLQTIAKgTFSPLRAIQTMHLAQNPFicdchlkwladylhtNPIETSGARCTspr 427
Cdd:COG4886    179 -----------GNLTNLKELDLSNNQITDLPE-PLGNLTNLEELDLSGNQL---------------TDLPEPLANLT--- 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  428 rlankrigqikskkfrcsakeqyfipgtedyrsKLsgdcfadlacpekcrcegTTVDCSNQKLNKIPE--HIPQYTaELR 505
Cdd:COG4886    229 ---------------------------------NL------------------ETLDLSNNQLTDLPElgNLTNLE-ELD 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  506 LNNNEFTVLEATGifkKLPQLRKINFSNNKITDIEEGAFEGASGVNeiLLTSNRLENVQHKMFKGLESLKTLMLRSNRIT 585
Cdd:COG4886    257 LSNNQLTDLPPLA---NLTNLKTLDLSNNQLTDLKLKELELLLGLN--SLLLLLLLLNLLELLILLLLLTTLLLLLLLLK 331
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  586 CVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLR 646
Cdd:COG4886    332 GLLVTLTTLALSLSLLALLTLLLLLNLLSLLLTLLLTLGLLGLLEATLLTLALLLLTLLLL 392
LRR_8 pfam13855
Leucine rich repeat;
548-608 2.31e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 63.31  E-value: 2.31e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  548 SGVNEILLTSNRLENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQI 608
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
573-632 2.76e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.76e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  573 SLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITTVAPGAFDTLHSLSTLNLLANPF 632
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
27-81 3.13e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 3.13e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034641845   27 DLNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHL 81
Cdd:pfam13855    7 DLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
269-326 3.62e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.54  E-value: 3.62e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845  269 TEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKI 326
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
120-177 1.14e-11

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 61.39  E-value: 1.14e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845  120 KNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
798-979 2.32e-11

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 69.34  E-value: 2.32e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  798 LHGNDISVVPEGAFNDLSALSHLAIGANPLYCDCNMQWLSDWVKSE---YKEPGIARCAGPGEMADKLLLTTPSKKFTCq 874
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKgvkVRQPEAALCAGPGALAGQPLLGIPLLDSGC- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  875 gpvDVNILAkcnpCLSNPCKNDGTCNSDPVDFYRCTcPYGFKGQDCDVpihACISNPCKHGGTchlkeGEEDgfWCICAD 954
Cdd:TIGR00864   81 ---DEEYVA----CLKDNSSGGGAARSELVIFSAAH-EGLFQPEACNA---FCFSAGHGLAAL-----GEQG--ECLCGA 142
                          170       180
                   ....*....|....*....|....*
gi 1034641845  955 GFEGENCEVNVDDCEDNDCENNSTC 979
Cdd:TIGR00864  143 AQPSEANFACESLCSGPPPPPAAAC 167
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
603-679 1.05e-10

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 67.03  E-value: 1.05e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  603 LYDNQITTVAPGAFDTLHSLSTLNLLANPFNCNCYLAWLGEWLRKKRIVTGNPR---CQKPYFLKEIPIQDVAIQDFTCD 679
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVRQPEaalCAGPGALAGQPLLGIPLLDSGCD 81
LRR_8 pfam13855
Leucine rich repeat;
94-153 1.25e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 58.30  E-value: 1.25e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   94 KLYRLDLSENQIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNI 153
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
720-820 2.46e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 62.11  E-value: 2.46e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  720 RDVTELYLDGNQFTLVPkELSNYKHLTLIDLSNNRISTLSNqsFSNMTQLLTLILSYNRLRCIPPrtFDGLKSLRLLSLH 799
Cdd:cd21340      2 KRITHLYLNDKNITKID-NLSLCKNLKVLYLYDNKITKIEN--LEFLTNLTHLYLQNNQIEKIEN--LENLVNLKKLYLG 76
                           90       100
                   ....*....|....*....|.
gi 1034641845  800 GNDISVVpEGaFNDLSALSHL 820
Cdd:cd21340     77 GNRISVV-EG-LENLTNLEEL 95
LRR_8 pfam13855
Leucine rich repeat;
722-779 3.45e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 57.15  E-value: 3.45e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034641845  722 VTELYLDGNQFTLVPKE-LSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRL 779
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGaFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-177 5.67e-10

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 60.96  E-value: 5.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQlfpELLFLGT-AKLYRLDLSENQIQ 106
Cdd:cd21340      9 LNDKNITKID--NLSLCKNLKVLYLYDNKITKIE--NLEFLTNLTHLYLQNNQIE---KIENLENlVNLKKLYLGGNRIS 81
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  107 AIprKAFRGAVDIKNLQLDYNQIS-----CIEDGAFRALRD-LEVLTLNNNNITrlSVASFNHMPKLRTFRLHSNNL 177
Cdd:cd21340     82 VV--EGLENLTNLEELHIENQRLPpgeklTFDPRSLAALSNsLRVLNISGNNID--SLEPLAPLRNLEQLDASNNQI 154
LRR_8 pfam13855
Leucine rich repeat;
767-827 7.46e-10

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 55.99  E-value: 7.46e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  767 TQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDISVVPEGAFNDLSALSHLAIGANPL 827
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
28-175 1.80e-09

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 59.41  E-value: 1.80e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITktDFAGLRHLRVLQLMENKISTIErgAFQDLKELERLRLNRNHLQLFPELLF-----LGTAK-LYRLDLS 101
Cdd:cd21340     53 LQNNQIEKIE--NLENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKLTFdprslAALSNsLRVLNIS 128
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034641845  102 ENQIQaiprkafrgavDIKNLQldynqisciedgafrALRDLEVLTLNNNNITRLSVAS--FNHMPKLRTFRLHSN 175
Cdd:cd21340    129 GNNID-----------SLEPLA---------------PLRNLEQLDASNNQISDLEELLdlLSSWPSLRELDLTGN 178
LRR_8 pfam13855
Leucine rich repeat;
503-560 5.38e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.38e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845  503 ELRLNNNEFTVLEAtGIFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL 560
Cdd:pfam13855    5 SLDLSNNRLTSLDD-GAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
LRR_8 pfam13855
Leucine rich repeat;
291-374 5.60e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.60e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  291 KLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKITELPKslfeglfslqllllnankinclrvDAFQDLHNLNLLSLY 370
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSP------------------------GAFSGLPSLRYLDLS 57

                   ....
gi 1034641845  371 DNKL 374
Cdd:pfam13855   58 GNRL 61
LRR_8 pfam13855
Leucine rich repeat;
45-105 5.88e-09

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 53.68  E-value: 5.88e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845   45 RHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQLFPELLFLGTAKLYRLDLSENQI 105
Cdd:pfam13855    1 PNLRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN03150 PLN03150
hypothetical protein; Provisional
735-804 1.89e-08

hypothetical protein; Provisional


Pssm-ID: 178695 [Multi-domain]  Cd Length: 623  Bit Score: 59.06  E-value: 1.89e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  735 VPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKSLRLLSLHGNDIS 804
Cdd:PLN03150   434 IPNDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
699-827 3.17e-08

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 58.55  E-value: 3.17e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  699 CL---DTVVRCSNKGLKVLPKGIPRDVTELYLDGNQFTLVPKEL-SNYKHLT------------------LIDLSNNRIS 756
Cdd:PRK15370   175 CLknnKTELRLKILGLTTIPACIPEQITTLILDNNELKSLPENLqGNIKTLYansnqltsipatlpdtiqEMELSINRIT 254
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034641845  757 TLSNQSFSnmtQLLTLILSYNRLRCIPPRTFDGlksLRLLSLHGNDISVVPEgafNDLSALSHLAIGANPL 827
Cdd:PRK15370   255 ELPERLPS---ALQSLDLFHNKISCLPENLPEE---LRYLSVYDNSIRTLPA---HLPSGITHLNVQSNSL 316
LRR_8 pfam13855
Leucine rich repeat;
346-398 5.99e-08

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 50.60  E-value: 5.99e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034641845  346 NANKINCLRVDAFQDLHNLNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPF 398
Cdd:pfam13855    9 SNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAFSGLPSLRYLDLSGNRL 61
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
288-829 1.06e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 56.78  E-value: 1.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  288 PYkkLRRIDLSNNQISELAPDA-FQGLRSLNSLVLYGNKITelpKSLFEGLFSLQLLLLNANkiNCLRVDAFQDL---HN 363
Cdd:PLN00113    93 PY--IQTINLSNNQLSGPIPDDiFTTSSSLRYLNLSNNNFT---GSIPRGSIPNLETLDLSN--NMLSGEIPNDIgsfSS 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  364 LNLLSLYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDC--------HLKWLadYLHTN------PIETSGarCTSPRRL 429
Cdd:PLN00113   166 LKVLDLGGNVLVGKIPNSLTNLTSLEFLTLASNQLVGQIprelgqmkSLKWI--YLGYNnlsgeiPYEIGG--LTSLNHL 241
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  430 A---NKRIGQIKSKKFRCSAKEQYFIpgtedYRSKLSGDCFADLACPEKCrcegTTVDCSNQKLN-KIPEHIPQY-TAE- 503
Cdd:PLN00113   242 DlvyNNLTGPIPSSLGNLKNLQYLFL-----YQNKLSGPIPPSIFSLQKL----ISLDLSDNSLSgEIPELVIQLqNLEi 312
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  504 LRLNNNEFTVLEATGIfKKLPQLRKINFSNNKIT----------------DIEEGAFEG-------ASG-VNEILLTSNR 559
Cdd:PLN00113   313 LHLFSNNFTGKIPVAL-TSLPRLQVLQLWSNKFSgeipknlgkhnnltvlDLSTNNLTGeipeglcSSGnLFKLILFSNS 391
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  560 LENVQHKMFKGLESLKTLMLRSNRITCVGNDSFIGLSSVRLLSLYDNQITtvapGAFDT----LHSLSTLNLLANPFNCN 635
Cdd:PLN00113   392 LEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQ----GRINSrkwdMPSLQMLSLARNKFFGG 467
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  636 cylawLGEWLRKKRIvtgnprcqkpyflkeipiqdvaiqdftcddGNDDnscspLSRcptectcldtvvrcsNKGLKVLP 715
Cdd:PLN00113   468 -----LPDSFGSKRL------------------------------ENLD-----LSR---------------NQFSGAVP 492
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  716 KGIPR--DVTELYLDGNQFT-LVPKELSNYKHLTLIDLSNNRISTLSNQSFSNMTQLLTLILSYNRLRCIPPRTFDGLKS 792
Cdd:PLN00113   493 RKLGSlsELMQLKLSENKLSgEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSGEIPKNLGNVES 572
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 1034641845  793 LRLLSLHGNDI--SVVPEGAFndlSALSHLAIGANPLYC 829
Cdd:PLN00113   573 LVQVNISHNHLhgSLPSTGAF---LAINASAVAGNIDLC 608
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1042-1078 2.37e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 2.37e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845 1042 DFDDCQD-NKCKNGAHCTDAVNGYTCICPEGYSGLFCE 1078
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
964-1000 4.10e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 4.10e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034641845  964 NVDDCED-NDCENNSTCVDGINNYTCLCPPEYTGELCE 1000
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-177 8.66e-07

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 53.70  E-value: 8.66e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNhlQLFPELL-FLGTAKLYRLDLSENQIQ 106
Cdd:PLN00113   411 LQDNSFSGELPSEFTKLPLVYFLDISNNNLQGRINSRKWDMPSLQMLSLARN--KFFGGLPdSFGSKRLENLDLSRNQFS 488
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034641845  107 -AIPRKaFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:PLN00113   489 gAVPRK-LGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
253-333 2.66e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 2.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  253 RGKGLTEIPTNLPETITEIRLEQNTIKVIP---PGAfspykkLRRIDLSNNQISELAPDAFQGLRSLNslvLYGNKITEL 329
Cdd:PRK15370   228 NSNQLTSIPATLPDTIQEMELSINRITELPerlPSA------LQSLDLFHNKISCLPENLPEELRYLS---VYDNSIRTL 298

                   ....
gi 1034641845  330 PKSL 333
Cdd:PRK15370   299 PAHL 302
PRK15370 PRK15370
type III secretion system effector E3 ubiquitin transferase SlrP;
256-377 3.28e-06

type III secretion system effector E3 ubiquitin transferase SlrP;


Pssm-ID: 185268 [Multi-domain]  Cd Length: 754  Bit Score: 52.01  E-value: 3.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  256 GLTEIPTNLPETITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRslnslvLYGNKITELPKSLfe 335
Cdd:PRK15370   189 GLTTIPACIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLPDTIQEME------LSINRITELPERL-- 260
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1034641845  336 gLFSLQLLLLNANKINCLRvDAFQDlhNLNLLSLYDNKLQTI 377
Cdd:PRK15370   261 -PSALQSLDLFHNKISCLP-ENLPE--ELRYLSVYDNSIRTL 298
LRRCT smart00082
Leucine rich repeat C-terminal domain;
825-874 4.02e-06

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 45.11  E-value: 4.02e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1034641845   825 NPLYCDCNMQWLSDWVKSE--YKEPGIARCAGPGEMADKLLLTTPSkKFTCQ 874
Cdd:smart00082    1 NPFICDCELRWLLRWLQANehLQDPVDLRCASPSSLRGPLLELLHS-EFKCP 51
EGF_CA smart00179
Calcium-binding EGF-like domain;
1042-1078 7.16e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 44.16  E-value: 7.16e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1034641845  1042 DFDDCQ-DNKCKNGAHCTDAVNGYTCICPEGYS-GLFCE 1078
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
148-285 7.64e-06

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 50.85  E-value: 7.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  148 LNNNNITRLSVASFNHMPKLRTFRLHSNNLYCDCHLAWLSDWLRQ------RPRVglyTQCMGPSHLRGH---------- 211
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEkgvkvrQPEA---ALCAGPGALAGQpllgipllds 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  212 ----------------NVAEVQKREFVCSGHQSFMAPScsvlHCPAAC-TCSNNIVDCRGKGLTEIPTNLPETITEIRLE 274
Cdd:TIGR00864   79 gcdeeyvaclkdnssgGGAARSELVIFSAAHEGLFQPE----ACNAFCfSAGHGLAALGEQGECLCGAAQPSEANFACES 154
                          170
                   ....*....|.
gi 1034641845  275 QNTIKVIPPGA 285
Cdd:TIGR00864  155 LCSGPPPPPAA 165
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1046-1074 9.53e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 9.53e-06
                           10        20
                   ....*....|....*....|....*....
gi 1034641845 1046 CQDNKCKNGAHCTDAVNGYTCICPEGYSG 1074
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1004-1039 1.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.39  E-value: 1.13e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1034641845 1004 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYVGEHC 1039
Cdd:cd00054      3 DECASG-NPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
LRRNT smart00013
Leucine rich repeat N-terminal domain;
693-724 1.17e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 43.46  E-value: 1.17e-05
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1034641845   693 CPTECTCLDTVVRCSNKGLKVLPKGIPRDVTE 724
Cdd:smart00013    2 CPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
964-1000 1.55e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.39  E-value: 1.55e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1034641845   964 NVDDCE-DNDCENNSTCVDGINNYTCLCPPEYT-GELCE 1000
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
LRRCT smart00082
Leucine rich repeat C-terminal domain;
175-224 2.22e-05

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 43.19  E-value: 2.22e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1034641845   175 NNLYCDCHLAWLSDWLRQRPRV--GLYTQCMGPSHLRGhNVAEVQKREFVCS 224
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
238-270 2.78e-05

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 42.30  E-value: 2.78e-05
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1034641845   238 HCPAACTCSNNIVDCRGKGLTEIPTNLPETITE 270
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDTTL 33
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
486-631 7.82e-05

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 47.09  E-value: 7.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  486 SNQKLNKIPEHIPQYTA--ELRLNNNEFTVLEATGIFKKL---PQLRKINFSNNKITDieegafEGASGVNEILLTSNRL 560
Cdd:COG5238    193 GDEGIEELAEALTQNTTvtTLWLKRNPIGDEGAEILAEALkgnKSLTTLDLSNNQIGD------EGVIALAEALKNNTTV 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  561 E-------NVQH-------KMFKGLESLKTLMLRSNRItcvGNDSFIGL-------SSVRLLSLYDNQITTV-APGAFDT 618
Cdd:COG5238    267 EtlylsgnQIGAegaialaKALQGNTTLTSLDLSVNRI---GDEGAIALaeglqgnKTLHTLNLAYNGIGAQgAIALAKA 343
                          170
                   ....*....|....*.
gi 1034641845  619 LH---SLSTLNLLANP 631
Cdd:COG5238    344 LQentTLHSLDLSDNQ 359
LRR_RI cd00116
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 ...
47-180 1.04e-04

Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).


Pssm-ID: 238064 [Multi-domain]  Cd Length: 319  Bit Score: 46.19  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   47 LRVLQLMENKIS-TIER---GAFQDLKE-LERLRLNRNHL------QLFPELLFLGtaKLYRLDLSENQI--QAIPR--K 111
Cdd:cd00116    110 LQELKLNNNGLGdRGLRllaKGLKDLPPaLEKLVLGRNRLegasceALAKALRANR--DLKELNLANNGIgdAGIRAlaE 187
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034641845  112 AFRGAVDIKNLQLDYNQISCIED----GAFRALRDLEVLTLNNNNIT-----RLSVASFNHMPKLRTFRLHSNNLYCD 180
Cdd:cd00116    188 GLKANCNLEVLDLNNNGLTDEGAsalaETLASLKSLEVLNLGDNNLTdagaaALASALLSPNISLLTLSLSCNDITDD 265
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
968-997 1.06e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.83  E-value: 1.06e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845  968 CEDNDCENNSTCVDGINNYTCLCPPEYTGE 997
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRR_8 pfam13855
Leucine rich repeat;
143-177 1.63e-04

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 40.97  E-value: 1.63e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1034641845  143 LEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL 177
Cdd:pfam13855    3 LRSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLL 37
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
693-719 1.65e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.92  E-value: 1.65e-04
                           10        20
                   ....*....|....*....|....*..
gi 1034641845  693 CPTECTCLDTVVRCSNKGLKVLPKGIP 719
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
PCC TIGR00864
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ...
369-432 2.23e-04

polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.


Pssm-ID: 188093 [Multi-domain]  Cd Length: 2740  Bit Score: 46.23  E-value: 2.23e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034641845  369 LYDNKLQTIAKGTFSPLRAIQTMHLAQNPFICDCHLKWLADYLHTNPIET---SGARCTSPRRLANK 432
Cdd:TIGR00864    2 ISNNKISTIEEGICANLCNLSEIDLSGNPFECDCGLARLPRWAEEKGVKVrqpEAALCAGPGALAGQ 68
LRRNT pfam01462
Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence ...
239-265 2.42e-04

Leucine rich repeat N-terminal domain; Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.


Pssm-ID: 396168 [Multi-domain]  Cd Length: 28  Bit Score: 39.53  E-value: 2.42e-04
                           10        20
                   ....*....|....*....|....*..
gi 1034641845  239 CPAACTCSNNIVDCRGKGLTEIPTNLP 265
Cdd:pfam01462    2 CPVPCHCSATVVNCSDRGLTAVPRDLP 28
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1009-1038 2.55e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 39.67  E-value: 2.55e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845 1009 DLNPCQHDSKCILTPKGFKCDCTPGYVGEH 1038
Cdd:pfam00008    2 APNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
723-804 2.64e-04

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 44.01  E-value: 2.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  723 TELYLD------GNQFTLVP---KELSNykHLTLIDLSNNRISTLSnqSFSNMTQLLTLILSYNRLRCIPP--RTFDGLK 791
Cdd:cd21340     93 EELHIEnqrlppGEKLTFDPrslAALSN--SLRVLNISGNNIDSLE--PLAPLRNLEQLDASNNQISDLEEllDLLSSWP 168
                           90
                   ....*....|...
gi 1034641845  792 SLRLLSLHGNDIS 804
Cdd:cd21340    169 SLRELDLTGNPVC 181
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1051-1072 2.79e-04

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 39.24  E-value: 2.79e-04
                           10        20
                   ....*....|....*....|..
gi 1034641845 1051 CKNGAHCTDAVNGYTCICPEGY 1072
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
LRRCT smart00082
Leucine rich repeat C-terminal domain;
630-679 5.47e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 39.34  E-value: 5.47e-04
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 1034641845   630 NPFNCNCYLAWLGEWLRKKRIV--TGNPRCQKPYFLKEiPIQDVAIQDFTCD 679
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHLqdPVDLRCASPSSLRG-PLLELLHSEFKCP 51
LRRNT smart00013
Leucine rich repeat N-terminal domain;
471-501 6.02e-04

Leucine rich repeat N-terminal domain;


Pssm-ID: 214470 [Multi-domain]  Cd Length: 33  Bit Score: 38.45  E-value: 6.02e-04
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1034641845   471 ACPEKCRCEGTTVDCSNQKLNKIPEHIPQYT 501
Cdd:smart00013    1 ACPAPCNCSGTAVDCSGRGLTEVPLDLPPDT 31
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
143-177 7.25e-04

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 38.77  E-value: 7.25e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1034641845  143 LEVLTLNNNNITRLSvaSFNHMPKLRTFRLHSNNL 177
Cdd:pfam12799    3 LEVLDLSNNQITDIP--PLAKLPNLETLDLSGNNK 35
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1047-1078 9.37e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.23  E-value: 9.37e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1034641845 1047 QDNKCKNGAHCTDAVNGYTCICPEGYSGLF-CE 1078
Cdd:cd00053      4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRRCT smart00082
Leucine rich repeat C-terminal domain;
396-426 9.50e-04

Leucine rich repeat C-terminal domain;


Pssm-ID: 214507 [Multi-domain]  Cd Length: 51  Bit Score: 38.57  E-value: 9.50e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1034641845   396 NPFICDCHLKWLADYLHTNPI--ETSGARCTSP 426
Cdd:smart00082    1 NPFICDCELRWLLRWLQANEHlqDPVDLRCASP 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
930-962 1.44e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.62  E-value: 1.44e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1034641845  930 NPCKHGGTCHLKEGeedGFWCICADGFEGENCE 962
Cdd:cd00054      9 NPCQNGGTCVNTVG---SYRCSCPPGYTGRNCE 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
969-1000 2.19e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 2.19e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1034641845  969 EDNDCENNSTCVDGINNYTCLCPPEYTGEL-CE 1000
Cdd:cd00053      4 ASNPCSNGGTCVNTPGSYRCVCPPGYTGDRsCE 36
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
290-330 2.23e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 37.22  E-value: 2.23e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1034641845  290 KKLRRIDLSNNQISELapDAFQGLRSLNSLVLYGN-KITELP 330
Cdd:pfam12799    1 PNLEVLDLSNNQITDI--PPLAKLPNLETLDLSGNnKITDLS 40
PPP1R42 cd21340
protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 ...
489-632 2.34e-03

protein phosphatase 1 regulatory subunit 42; Protein phosphatase 1 regulatory subunit 42 (PPP1R42), also known as leucine-rich repeat-containing protein 67 (lrrc67) or testis leucine-rich repeat (TLRR) protein, plays a role in centrosome separation. PPP1R42 has been shown to interact with the well-conserved signaling protein phosphatase-1 (PP1) and thereby increasing PP1's activity, which counters centrosome separation. Inhibition of PPP1R42 expression increases the number of centrosomes per cell while its depletion reduces the activity of PP1 leading to activation of NEK2, the kinase responsible for phosphorylation of centrosomal linker proteins promoting centrosome separation.


Pssm-ID: 411060 [Multi-domain]  Cd Length: 220  Bit Score: 41.31  E-value: 2.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  489 KLNKIP--EHIPQYTaELRLNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEegAFEGASGVNEILLTSNRLENVQHK 566
Cdd:cd21340     35 KITKIEnlEFLTNLT-HLYLQNNQIEKIEN---LENLVNLKKLYLGGNRISVVE--GLENLTNLEELHIENQRLPPGEKL 108
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034641845  567 MF-----KGL-ESLKTLMLRSNRITCVgnDSFIGLSSVRLLSLYDNQITTVAP--GAFDTLHSLSTLNLLANPF 632
Cdd:cd21340    109 TFdprslAALsNSLRVLNISGNNIDSL--EPLAPLRNLEQLDASNNQISDLEEllDLLSSWPSLRELDLTGNPV 180
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
927-960 2.65e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 2.65e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1034641845  927 CISNPCKHGGTCHLKEGeedGFWCICADGFEGEN 960
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPG---GYTCICPEGYTGKR 31
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-333 2.66e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 42.53  E-value: 2.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTierGAFQDL---KELERLRLNRNHLQ-LFPELLfLGTAKLYRLDLSEN 103
Cdd:PLN00113   315 LFSNNFTGKIPVALTSLPRLQVLQLWSNKFSG---EIPKNLgkhNNLTVLDLSTNNLTgEIPEGL-CSSGNLFKLILFSN 390
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  104 QIQAIPRKAFRGAVDIKNLQLDYNQISCIEDGAFRALRDLEVLTLNNNNIT-RLSVASFNhMPKLRTFRLHSNNLYcdch 182
Cdd:PLN00113   391 SLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNNNLQgRINSRKWD-MPSLQMLSLARNKFF---- 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  183 lawlsdwlrqrprvGLYTQCMGPSHLRGHNVAEVQKREFVcsgHQSFMapscsvlhcpaactcsnnivdcrgkglteipt 262
Cdd:PLN00113   466 --------------GGLPDSFGSKRLENLDLSRNQFSGAV---PRKLG-------------------------------- 496
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034641845  263 NLPEtITEIRLEQNTIKVIPPGAFSPYKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-ELPKSL 333
Cdd:PLN00113   497 SLSE-LMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQLSgEIPKNL 567
EGF_CA smart00179
Calcium-binding EGF-like domain;
1004-1040 3.33e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 36.46  E-value: 3.33e-03
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1034641845  1004 DFCAQDlNPCQHDSKCILTPKGFKCDCTPGYV-GEHCD 1040
Cdd:smart00179    3 DECASG-NPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
973-994 3.50e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.16  E-value: 3.50e-03
                           10        20
                   ....*....|....*....|..
gi 1034641845  973 CENNSTCVDGINNYTCLCPPEY 994
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
RNA1 COG5238
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ...
246-415 3.54e-03

Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444072 [Multi-domain]  Cd Length: 434  Bit Score: 41.70  E-value: 3.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  246 SNNIVDCRGKGLTEIPTnLPETITEIRLEQNTIKviPPGA------FSPYKKLRRIDLSNNQIS-----ELApDAFQGLR 314
Cdd:COG5238    189 CNQIGDEGIEELAEALT-QNTTVTTLWLKRNPIG--DEGAeilaeaLKGNKSLTTLDLSNNQIGdegviALA-EALKNNT 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  315 SLNSLVLYGNKITE-----LPKSLfEGLFSLQLLLLNANKINCLRVDAFQDL----HNLNLLSLYDNKLQT-----IAKg 380
Cdd:COG5238    265 TVETLYLSGNQIGAegaiaLAKAL-QGNTTLTSLDLSVNRIGDEGAIALAEGlqgnKTLHTLNLAYNGIGAqgaiaLAK- 342
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1034641845  381 TFSPLRAIQTMHLAQNPfICDCHLKWLADYLHTNP 415
Cdd:COG5238    343 ALQENTTLHSLDLSDNQ-IGDEGAIALAKYLEGNT 376
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large ...
1011-1038 3.61e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) presents in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; Subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 36.69  E-value: 3.61e-03
                           10        20
                   ....*....|....*....|....*...
gi 1034641845 1011 NPCQHDSKCILTPKGFKCDCTPGYVGEH 1038
Cdd:cd00053      6 NPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
PLN00113 PLN00113
leucine-rich repeat receptor-like protein kinase; Provisional
28-374 3.83e-03

leucine-rich repeat receptor-like protein kinase; Provisional


Pssm-ID: 215061 [Multi-domain]  Cd Length: 968  Bit Score: 41.76  E-value: 3.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845   28 LNGNNITRITKTDFAGLRHLRVLQLMENKISTIERGAFQDLKELERLRLNRNHLQ-LFPELLFlGTAKLYRLDLSENQIQ 106
Cdd:PLN00113   219 LGYNNLSGEIPYEIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSgPIPPSIF-SLQKLISLDLSDNSLS 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  107 A-IPRKAfrgaVDIKNLQ---LDYNQISCIEDGAFRALRDLEVLTLNNNNITRLSVASFNHMPKLRTFRLHSNNL----- 177
Cdd:PLN00113   298 GeIPELV----IQLQNLEilhLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLtgeip 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  178 --YCDC----HLAWLSDWLR-QRPRVGLYTQCMGPSHLRGHNVAEVQKREFVCSGHQSFMAPScsvlhcpaactcSNNIV 250
Cdd:PLN00113   374 egLCSSgnlfKLILFSNSLEgEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDIS------------NNNLQ 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  251 DCRGKGLTEIPTnlpetITEIRLEQNTIKVIPPGAFSPyKKLRRIDLSNNQISELAPDAFQGLRSLNSLVLYGNKIT-EL 329
Cdd:PLN00113   442 GRINSRKWDMPS-----LQMLSLARNKFFGGLPDSFGS-KRLENLDLSRNQFSGAVPRKLGSLSELMQLKLSENKLSgEI 515
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 1034641845  330 PKSLfEGLFSLQLLLLNANKINCLRVDAFQDLHNLNLLSLYDNKL 374
Cdd:PLN00113   516 PDEL-SSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQNQL 559
LRR smart00370
Leucine-rich repeats, outliers;
291-312 4.26e-03

Leucine-rich repeats, outliers;


Pssm-ID: 197688 [Multi-domain]  Cd Length: 24  Bit Score: 35.79  E-value: 4.26e-03
                            10        20
                    ....*....|....*....|..
gi 1034641845   291 KLRRIDLSNNQISELAPDAFQG 312
Cdd:smart00370    3 NLRELDLSNNQLSSLPPGAFQG 24
LRR_TYP smart00369
Leucine-rich repeats, typical (most populated) subfamily;
291-312 4.26e-03

Leucine-rich repeats, typical (most populated) subfamily;


Pssm-ID: 197687 [Multi-domain]  Cd Length: 24  Bit Score: 35.79  E-value: 4.26e-03
                            10        20
                    ....*....|....*....|..
gi 1034641845   291 KLRRIDLSNNQISELAPDAFQG 312
Cdd:smart00369    3 NLRELDLSNNQLSSLPPGAFQG 24
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1096-1123 5.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.08  E-value: 5.37e-03
                           10        20
                   ....*....|....*....|....*...
gi 1034641845 1096 CQNGAQCIVRINEPICQCLPGYQGEKCE 1123
Cdd:cd00054     11 CQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
888-917 5.76e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 5.76e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845  888 CLSNPCKNDGTCNSDPVDfYRCTCPYGFKG 917
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGG-YTCICPEGYTG 29
LRR_9 pfam14580
Leucine-rich repeat;
506-585 6.00e-03

Leucine-rich repeat;


Pssm-ID: 405295 [Multi-domain]  Cd Length: 175  Bit Score: 39.36  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  506 LNNNEFTVLEAtgiFKKLPQLRKINFSNNKITDIEEGAFEGASGVNEILLTSNRL-ENVQHKMFKGLESLKTLMLRSNRI 584
Cdd:pfam14580   49 FSDNEIRKLDG---FPLLRRLKTLLLNNNRICRIGEGLGEALPNLTELILTNNNLqELGDLDPLASLKKLTFLSLLRNPV 125

                   .
gi 1034641845  585 T 585
Cdd:pfam14580  126 T 126
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1091-1120 6.23e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.82  E-value: 6.23e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034641845 1091 CDNFDCQNGAQCIVRINEPICQCLPGYQGE 1120
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGK 30
LRR_5 pfam13306
BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich ...
255-336 6.29e-03

BspA type Leucine rich repeat region (6 copies); This family includes a number of leucine rich repeats. This family contains a large number of BSPA-like surface antigens from Trichomonas vaginalis.


Pssm-ID: 463839 [Multi-domain]  Cd Length: 127  Bit Score: 38.29  E-value: 6.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034641845  255 KGLTEIptNLPETITEIR--------LE----QNTIKVIPPGAFSPYKKLRRIDLSNNqISELAPDAFQGLrSLNSLVLy 322
Cdd:pfam13306   34 TSLKSI--TLPSSLTSIGsyafyncsLTsitiPSSLTSIGEYAFSNCSNLKSITLPSN-LTSIGSYAFSNC-SLKSITI- 108
                           90
                   ....*....|....
gi 1034641845  323 GNKITELPKSLFEG 336
Cdd:pfam13306  109 PSSVTTIGSYAFSN 122
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
886-920 8.37e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.31  E-value: 8.37e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1034641845  886 NPCLS-NPCKNDGTCNSDPVDfYRCTCPYGFKGQDC 920
Cdd:cd00054      3 DECASgNPCQNGGTCVNTVGS-YRCSCPPGYTGRNC 37
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
1096-1117 8.39e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteriztic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neurogulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin ad integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.00  E-value: 8.39e-03
                           10        20
                   ....*....|....*....|..
gi 1034641845 1096 CQNGAQCIVRINEPICQCLPGY 1117
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH