NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462588154|ref|XP_054201702|]
View 

sterol regulatory element-binding protein cleavage-activating protein isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Sterol-sensing pfam12349
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ...
308-452 7.44e-57

Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of the their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus.


:

Pssm-ID: 463544 [Multi-domain]  Cd Length: 153  Bit Score: 193.57  E-value: 7.44e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349    1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588154  388 ESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349   81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1077-1234 2.26e-25

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 107.81  E-value: 2.26e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1077 HQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQGHSGAITTV-YIDQTMVLASGGQDGAICLWDVLTGSRVS 1153
Cdd:cd00200      8 HTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRLWDLETGECVR 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1154 HVFAHRGDVTSLTCTTSC--VISSGLDDLISIWDRSTGIKFYSIQQDLGCGASLGVISDNLLVTGGQ--GCVSFWDLNYG 1229
Cdd:cd00200     88 TLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdGTIKLWDLRTG 167

                   ....*
gi 2462588154 1230 DLLQT 1234
Cdd:cd00200    168 KCVAT 172
WD40 COG2319
WD40 repeat [General function prediction only];
892-1234 1.50e-23

WD40 repeat [General function prediction only];


:

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.50e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  892 PEPRHRAVCGRSRDSPGYDFSCLVQRVYQEEGLAAVCTPALRPPSPGPVLSQAPEDEGGSPEKGSPSLAWAPSAEGSIWS 971
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  972 LELQ--GNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLD--KRIVAARLNGSLDFFSLETHTALSPLqfRGT 1047
Cdd:COG2319     84 VAFSpdGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTL--TGH 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1048 PGRGSSPA-SP----VYSSS--------DTVACHLTHTVPcAHQKPITALK-AAAGR-LVTGSQDHTLRVFRLEDSCCLF 1112
Cdd:COG2319    162 SGAVTSVAfSPdgklLASGSddgtvrlwDLATGKLLRTLT-GHTGAVRSVAfSPDGKlLASGSADGTVRLWDLATGKLLR 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1113 TLQGHSGAITTVYI--DQTMvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLISIWDRST 1188
Cdd:COG2319    241 TLTGHSGSVRSVAFspDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLAT 319
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462588154 1189 GIKFYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQT 1234
Cdd:COG2319    320 GKLLRTLTGHTGAVRSVAFSPDgKTLASGSDdGTVRLWDLATGELLRT 367
2A060601 super family cl36767
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ...
11-466 5.60e-22

Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unestrified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other]


The actual alignment was detected with superfamily member TIGR00917:

Pssm-ID: 273337 [Multi-domain]  Cd Length: 1205  Bit Score: 103.45  E-value: 5.60e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   11 ISRAFYNHGLLCASYPIPIILFTGFCILACCYPLLKLPLPgtgpvefTTPVKDYSPPpvdsDRKQGEPTEQPEWYVGaPV 90
Cdd:TIGR00917  309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   91 AYVQQIFVKSSVFPWHK---NLLAVDVfrspLSRAFQLVEEIRNHVLRDSSGIRSLEELCLQVTDllPGlrklrnllpeh 167
Cdd:TIGR00917  377 YRIEQLIIATVQTSSHEkapEILTDDN----LKLLFDIQKKVSQLFANYEGELITLDSPCFKPNH--PY----------- 439
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  168 GCLLLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytI 236
Cdd:TIGR00917  440 NCFIYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---V 514
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  237 TLVFQHYHAK-------------FLGSLRARLmLLHPSPNCSLRAESLVHVHFKEEiGVAELIPLVTTYIILFAYIY--- 300
Cdd:TIGR00917  515 TFPVNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltl 592
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  301 -FSTR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV---------- 368
Cdd:TIGR00917  593 gDSPRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqv 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  369 -VSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRM 447
Cdd:TIGR00917  673 gVDNEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRT 752
                          490
                   ....*....|....*....
gi 2462588154  448 EladlNKRLPPEACLPSAK 466
Cdd:TIGR00917  753 E----DKRVDCFPCIKTSK 767
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
771-813 1.59e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.59e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2462588154  771 VLRGHLMDIECLA--SDGMLLVSCCLAGHICVWDAQTGDCLTRIP 813
Cdd:cd00200    214 TLRGHENGVNSVAfsPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258
 
Name Accession Description Interval E-value
Sterol-sensing pfam12349
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ...
308-452 7.44e-57

Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of the their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus.


Pssm-ID: 463544 [Multi-domain]  Cd Length: 153  Bit Score: 193.57  E-value: 7.44e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349    1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588154  388 ESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349   81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1077-1234 2.26e-25

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 107.81  E-value: 2.26e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1077 HQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQGHSGAITTV-YIDQTMVLASGGQDGAICLWDVLTGSRVS 1153
Cdd:cd00200      8 HTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRLWDLETGECVR 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1154 HVFAHRGDVTSLTCTTSC--VISSGLDDLISIWDRSTGIKFYSIQQDLGCGASLGVISDNLLVTGGQ--GCVSFWDLNYG 1229
Cdd:cd00200     88 TLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdGTIKLWDLRTG 167

                   ....*
gi 2462588154 1230 DLLQT 1234
Cdd:cd00200    168 KCVAT 172
WD40 COG2319
WD40 repeat [General function prediction only];
892-1234 1.50e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.50e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  892 PEPRHRAVCGRSRDSPGYDFSCLVQRVYQEEGLAAVCTPALRPPSPGPVLSQAPEDEGGSPEKGSPSLAWAPSAEGSIWS 971
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  972 LELQ--GNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLD--KRIVAARLNGSLDFFSLETHTALSPLqfRGT 1047
Cdd:COG2319     84 VAFSpdGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTL--TGH 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1048 PGRGSSPA-SP----VYSSS--------DTVACHLTHTVPcAHQKPITALK-AAAGR-LVTGSQDHTLRVFRLEDSCCLF 1112
Cdd:COG2319    162 SGAVTSVAfSPdgklLASGSddgtvrlwDLATGKLLRTLT-GHTGAVRSVAfSPDGKlLASGSADGTVRLWDLATGKLLR 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1113 TLQGHSGAITTVYI--DQTMvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLISIWDRST 1188
Cdd:COG2319    241 TLTGHSGSVRSVAFspDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLAT 319
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462588154 1189 GIKFYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQT 1234
Cdd:COG2319    320 GKLLRTLTGHTGAVRSVAFSPDgKTLASGSDdGTVRLWDLATGELLRT 367
2A060601 TIGR00917
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ...
11-466 5.60e-22

Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unestrified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other]


Pssm-ID: 273337 [Multi-domain]  Cd Length: 1205  Bit Score: 103.45  E-value: 5.60e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   11 ISRAFYNHGLLCASYPIPIILFTGFCILACCYPLLKLPLPgtgpvefTTPVKDYSPPpvdsDRKQGEPTEQPEWYVGaPV 90
Cdd:TIGR00917  309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   91 AYVQQIFVKSSVFPWHK---NLLAVDVfrspLSRAFQLVEEIRNHVLRDSSGIRSLEELCLQVTDllPGlrklrnllpeh 167
Cdd:TIGR00917  377 YRIEQLIIATVQTSSHEkapEILTDDN----LKLLFDIQKKVSQLFANYEGELITLDSPCFKPNH--PY----------- 439
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  168 GCLLLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytI 236
Cdd:TIGR00917  440 NCFIYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---V 514
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  237 TLVFQHYHAK-------------FLGSLRARLmLLHPSPNCSLRAESLVHVHFKEEiGVAELIPLVTTYIILFAYIY--- 300
Cdd:TIGR00917  515 TFPVNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltl 592
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  301 -FSTR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV---------- 368
Cdd:TIGR00917  593 gDSPRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqv 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  369 -VSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRM 447
Cdd:TIGR00917  673 gVDNEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRT 752
                          490
                   ....*....|....*....
gi 2462588154  448 EladlNKRLPPEACLPSAK 466
Cdd:TIGR00917  753 E----DKRVDCFPCIKTSK 767
WD40 COG2319
WD40 repeat [General function prediction only];
971-1234 1.29e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 93.05  E-value: 1.29e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  971 SLELQGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLDKRIVAARLNGSLDFFSLETHTALSPLQFRGTPGR 1050
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1051 G-----SSPASPVYSSSD-------TVACHLTHTVPCAHQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQG 1116
Cdd:COG2319     81 VlsvafSPDGRLLASASAdgtvrlwDLATGLLLRTLTGHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTLTG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1117 HSGAITTVYI--DQTMvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLISIWDRSTGIKF 1192
Cdd:COG2319    161 HSGAVTSVAFspDGKL-LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLL 239
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2462588154 1193 YSIQQDLGCGASLGVISDN-LLVTGGQ-GCVSFWDLNYGDLLQT 1234
Cdd:COG2319    240 RTLTGHSGSVRSVAFSPDGrLLASGSAdGTVRLWDLATGELLRT 283
2A060605 TIGR00920
3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, ...
276-442 8.32e-14

3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, Carbohydrates, organic alcohols, and acids]


Pssm-ID: 273339 [Multi-domain]  Cd Length: 886  Bit Score: 76.43  E-value: 8.32e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  276 FKEEIGVAELIPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPT-LNggEIFPYLVV 354
Cdd:TIGR00920   53 FEEEYLSSDVIVMTITRCIAVLYIYYQFCNLRQLGSKYILGIAGLFTIFSSFVFSTAVIHFLGSELTgLN--EALPFFLL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  355 VIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQML 434
Cdd:TIGR00920  131 LIDLSKASALAKFALSSNSQDEVRDNIARGMAILGPTITLDTVVETLVIGVGTMSGVRRLEVLCCFGCMSVLANYFVFMT 210

                   ....*...
gi 2462588154  435 FFTTVLSI 442
Cdd:TIGR00920  211 FFPACLSL 218
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1107-1145 1.56e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.07  E-value: 1.56e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462588154  1107 DSCCLFTLQGHSGAITTVYIDQT-MVLASGGQDGAICLWD 1145
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1110-1145 7.13e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.18  E-value: 7.13e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2462588154 1110 CLFTLQGHSGAITTVYIDQT-MVLASGGQDGAICLWD 1145
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
MMPL COG1033
Predicted exporter protein, RND superfamily [General function prediction only];
285-442 9.98e-04

Predicted exporter protein, RND superfamily [General function prediction only];


Pssm-ID: 440656 [Multi-domain]  Cd Length: 767  Bit Score: 43.31  E-value: 9.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  285 LIPLVTTYIILFAYIYFstrkidmvKSKWGLALAaVVTVLSSLLMSVGLCTLFG--LTPTLNggeIFPYLVVVIGLENVL 362
Cdd:COG1033    223 FFPLALLLILLLLFLFF--------RSLRGVLLP-LLVVLLAVIWTLGLMGLLGipLSPLTI---LVPPLLLAIGIDYGI 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  363 -VLTKsvvstpvdleVKLRIAQGLSSESwsIMKNMATELGI-IL-------IGYFTL----VPAIQEFCLFAVVGLVSDF 429
Cdd:COG1033    291 hLLNR----------YREERRKGLDKRE--ALREALRKLGPpVLltslttaIGFLSLlfsdIPPIRDFGIVAAIGVLLAF 358
                          170
                   ....*....|...
gi 2462588154  430 FLQMLFFTTVLSI 442
Cdd:COG1033    359 LTSLTLLPALLSL 371
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
771-813 1.59e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.59e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2462588154  771 VLRGHLMDIECLA--SDGMLLVSCCLAGHICVWDAQTGDCLTRIP 813
Cdd:cd00200    214 TLRGHENGVNSVAfsPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
771-802 1.65e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 1.65e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2462588154   771 VLRGHLMDIECLA--SDGMLLVSCCLAGHICVWD 802
Cdd:smart00320    7 TLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
 
Name Accession Description Interval E-value
Sterol-sensing pfam12349
Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins ...
308-452 7.44e-57

Sterol-sensing domain of SREBP cleavage-activation; Sterol regulatory element-binding proteins (SREBPs) are membrane-bound transcription factors that promote lipid synthesis in animal cells. They are embedded in the membranes of the endoplasmic reticulum (ER) in a helical hairpin orientation and are released from the ER by a two-step proteolytic process. Proteolysis begins when the SREBPs are cleaved at Site-1, which is located at a leucine residue in the middle of the hydrophobic loop in the lumen of the ER. Upon proteolytic processing SREBP can activate the expression of genes involved in cholesterol biosynthesis and uptake. SCAP stimulates cleavage of SREBPs via fusion of the their two C-termini. This domain is the transmembrane region that traverses the membrane eight times and is the sterol-sensing domain of the cleavage protein. WD40 domains are found towards the C-terminus.


Pssm-ID: 463544 [Multi-domain]  Cd Length: 153  Bit Score: 193.57  E-value: 7.44e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  308 MVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSS 387
Cdd:pfam12349    1 MVKSKFGLGLAGVIIVLASVASSLGLCAYFGLPLTLIISEVIPFLVLAIGVDNIFLLVKAVVRTPRSLDVSERIAEALGE 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462588154  388 ESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRMELADL 452
Cdd:pfam12349   81 VGPSITLTSLTEILAFLLGALTDMPAVQEFCLFAAVAVLFDFLLQMTFFVAVLSLDIRRLESNRL 145
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1077-1234 2.26e-25

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 107.81  E-value: 2.26e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1077 HQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQGHSGAITTV-YIDQTMVLASGGQDGAICLWDVLTGSRVS 1153
Cdd:cd00200      8 HTGGVTCVAFSPdgKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVaASADGTYLASGSSDKTIRLWDLETGECVR 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1154 HVFAHRGDVTSLTCTTSC--VISSGLDDLISIWDRSTGIKFYSIQQDLGCGASLGVISDNLLVTGGQ--GCVSFWDLNYG 1229
Cdd:cd00200     88 TLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSqdGTIKLWDLRTG 167

                   ....*
gi 2462588154 1230 DLLQT 1234
Cdd:cd00200    168 KCVAT 172
WD40 COG2319
WD40 repeat [General function prediction only];
892-1234 1.50e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.50e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  892 PEPRHRAVCGRSRDSPGYDFSCLVQRVYQEEGLAAVCTPALRPPSPGPVLSQAPEDEGGSPEKGSPSLAWAPSAEGSIWS 971
Cdd:COG2319      4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  972 LELQ--GNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLD--KRIVAARLNGSLDFFSLETHTALSPLqfRGT 1047
Cdd:COG2319     84 VAFSpdGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTL--TGH 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1048 PGRGSSPA-SP----VYSSS--------DTVACHLTHTVPcAHQKPITALK-AAAGR-LVTGSQDHTLRVFRLEDSCCLF 1112
Cdd:COG2319    162 SGAVTSVAfSPdgklLASGSddgtvrlwDLATGKLLRTLT-GHTGAVRSVAfSPDGKlLASGSADGTVRLWDLATGKLLR 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1113 TLQGHSGAITTVYI--DQTMvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLISIWDRST 1188
Cdd:COG2319    241 TLTGHSGSVRSVAFspDGRL-LASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPdgKLLASGSDDGTVRLWDLAT 319
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 2462588154 1189 GIKFYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVSFWDLNYGDLLQT 1234
Cdd:COG2319    320 GKLLRTLTGHTGAVRSVAFSPDgKTLASGSDdGTVRLWDLATGELLRT 367
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
966-1234 3.67e-23

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 101.26  E-value: 3.67e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  966 EGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFL--DKRIVAARLNGSLDFFSLEThtalsp 1041
Cdd:cd00200      9 TGGVTCVAFspDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDLET------ 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1042 lqfrgtpgrgsspaspvysssdtvaCHLTHTVPCaHQKPITALKAAAGR--LVTGSQDHTLRVFRLEDSCCLFTLQGHSG 1119
Cdd:cd00200     83 -------------------------GECVRTLTG-HTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTD 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1120 AITTVYIDQT-MVLASGGQDGAICLWDVLTGSRVsHVF-AHRGDVTSLTC--TTSCVISSGLDDLISIWDRSTGIKFYSI 1195
Cdd:cd00200    137 WVNSVAFSPDgTFVASSSQDGTIKLWDLRTGKCV-ATLtGHTGEVNSVAFspDGEKLLSSSSDGTIKLWDLSTGKCLGTL 215
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2462588154 1196 QQDLGCGASLGVISDNLLVTGG--QGCVSFWDLNYGDLLQT 1234
Cdd:cd00200    216 RGHENGVNSVAFSPDGYLLASGseDGTIRVWDLRTGECVQT 256
2A060601 TIGR00917
Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in ...
11-466 5.60e-22

Niemann-Pick C type protein family; The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unestrified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis. [Transport and binding proteins, Other]


Pssm-ID: 273337 [Multi-domain]  Cd Length: 1205  Bit Score: 103.45  E-value: 5.60e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   11 ISRAFYNHGLLCASYPIPIILFTGFCILACCYPLLKLPLPgtgpvefTTPVKDYSPPpvdsDRKQGEPTEQPEWYVGaPV 90
Cdd:TIGR00917  309 LARFFGKYGIWVARHPTLVICLSVSVVLLLCVGLIRFKVE-------TRPVKLWVAP----GSRAALEKQYFDTHFG-PF 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154   91 AYVQQIFVKSSVFPWHK---NLLAVDVfrspLSRAFQLVEEIRNHVLRDSSGIRSLEELCLQVTDllPGlrklrnllpeh 167
Cdd:TIGR00917  377 YRIEQLIIATVQTSSHEkapEILTDDN----LKLLFDIQKKVSQLFANYEGELITLDSPCFKPNH--PY----------- 439
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  168 GCLLLSPGNFWQNDWERF---HADPDIIGTIHQH-EPKTLQTSATLKDllFGVP-------GKYSGVSLYTRKRMVsytI 236
Cdd:TIGR00917  440 NCFIYSTCKKLQNMYSKLkpeNYDDYGGVDYVKYcFEHFTSPESCLSA--FGGPvdpttvlGGFSGNNFSEASAFV---V 514
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  237 TLVFQHYHAK-------------FLGSLRARLmLLHPSPNCSLRAESLVHVHFKEEiGVAELIPLVTTYIILFAYIY--- 300
Cdd:TIGR00917  515 TFPVNNFVNKtnktekavawekaFIQLAKDEL-LPMVQATISFSAERSIEDELKRE-STADVITIAISYLVMFAYISltl 592
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  301 -FSTR-KIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLTKSV---------- 368
Cdd:TIGR00917  593 gDSPRlKSLYVTSKVLLGLSGILIVMLSVLGSVGVFSAVGLKSTLIIMEVIPFLVLAVGVDNIFILVFFYfyleyfyrqv 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  369 -VSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIRRM 447
Cdd:TIGR00917  673 gVDNEQELTLERRLSRALMEVGPSITLASLSEILAFALGALIKMPAVRVFSMFAVLAVFLDFLLQITAFVALLVLDFKRT 752
                          490
                   ....*....|....*....
gi 2462588154  448 EladlNKRLPPEACLPSAK 466
Cdd:TIGR00917  753 E----DKRVDCFPCIKTSK 767
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
975-1185 1.88e-20

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 93.17  E-value: 1.88e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  975 QGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLDKR--IVAARLNGSLDFFSLETHTALSPLQFRGTPGRG- 1051
Cdd:cd00200     62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGriLSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSv 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1052 -SSPASPVYSSS---------DTVACHLTHTVPcAHQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQGHSG 1119
Cdd:cd00200    142 aFSPDGTFVASSsqdgtiklwDLRTGKCVATLT-GHTGEVNSVAFSPdgEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN 220
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462588154 1120 AITTV-YIDQTMVLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTC--TTSCVISSGLDDLISIWD 1185
Cdd:cd00200    221 GVNSVaFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWspDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
971-1234 1.29e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 93.05  E-value: 1.29e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  971 SLELQGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFLDKRIVAARLNGSLDFFSLETHTALSPLQFRGTPGR 1050
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1051 G-----SSPASPVYSSSD-------TVACHLTHTVPCAHQKPITALKAAA--GRLVTGSQDHTLRVFRLEDSCCLFTLQG 1116
Cdd:COG2319     81 VlsvafSPDGRLLASASAdgtvrlwDLATGLLLRTLTGHTGAVRSVAFSPdgKTLASGSADGTVRLWDLATGKLLRTLTG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1117 HSGAITTVYI--DQTMvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLISIWDRSTGIKF 1192
Cdd:COG2319    161 HSGAVTSVAFspDGKL-LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPdgKLLASGSADGTVRLWDLATGKLL 239
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 2462588154 1193 YSIQQDLGCGASLGVISDN-LLVTGGQ-GCVSFWDLNYGDLLQT 1234
Cdd:COG2319    240 RTLTGHSGSVRSVAFSPDGrLLASGSAdGTVRLWDLATGELLRT 283
WD40 COG2319
WD40 repeat [General function prediction only];
966-1188 7.44e-19

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 90.74  E-value: 7.44e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  966 EGSIWSLEL--QGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFL--DKRIVAARLNGSLDFFSLETHTALSP 1041
Cdd:COG2319    162 SGAVTSVAFspDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKLLASGSADGTVRLWDLATGKLLRT 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1042 LqfRGTPGRGSSPA-SP-----VYSSSDTVAC-------HLTHTVPcAHQKPITALKAAA-GR-LVTGSQDHTLRVFRLE 1106
Cdd:COG2319    242 L--TGHSGSVRSVAfSPdgrllASGSADGTVRlwdlatgELLRTLT-GHSGGVNSVAFSPdGKlLASGSDDGTVRLWDLA 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1107 DSCCLFTLQGHSGAITTVYI---DQTmvLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCTT--SCVISSGLDDLI 1181
Cdd:COG2319    319 TGKLLRTLTGHTGAVRSVAFspdGKT--LASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPdgRTLASGSADGTV 396

                   ....*..
gi 2462588154 1182 SIWDRST 1188
Cdd:COG2319    397 RLWDLAT 403
Patched pfam02460
Patched family; The transmembrane protein Patched is a receptor for the morphogene Sonic ...
284-533 1.27e-17

Patched family; The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals.


Pssm-ID: 308203 [Multi-domain]  Cd Length: 793  Bit Score: 88.57  E-value: 1.27e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  284 ELIP-LVTTYIILFAY-----IYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLtPTLNGGEIFPYLVVVIG 357
Cdd:pfam02460  214 TLTPfFVIGFFLLLTFsiivsVTLSSYTIDWVRSKPILAALGLLSPVMAIVSSFGLLFWMGF-PFNSIVCVTPFLVLAIG 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  358 LENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFT 437
Cdd:pfam02460  293 VDDMFLMVAAWQRTTATLSVKKRMGEALSEAGVSITITSLTDVLSFGIGTYTPTPAIQLFCAYTAVAIFFDFIYQITFFA 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  438 TVLSIdirrMELADLNKRLPPEACLPSakpvgQPTRYERQLAVRPSTPHTITLQPSSFRNLRLPkrlrvvyFLARTRLaq 517
Cdd:pfam02460  373 AIMAI----CAKPEAEGRHCLFVWATS-----SPQRIDSEGSEPDKSHNIEQLKSRFFLDIYCP-------FLLNPSV-- 434
                          250
                   ....*....|....*..
gi 2462588154  518 RLIMAGT-VVWIGILVY 533
Cdd:pfam02460  435 RVCMLVLfVVYIAIAIY 451
WD40 COG2319
WD40 repeat [General function prediction only];
958-1146 5.10e-16

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 81.88  E-value: 5.10e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  958 SLAWAPsaegsiwslelQGNLIVVGRSSGRLEVWDAIEGVLCCSSEEVSSGITALVFL--DKRIVAARLNGSLDFFSLET 1035
Cdd:COG2319    209 SVAFSP-----------DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSpdGRLLASGSADGTVRLWDLAT 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1036 HTALSPLqfRGTPGRGSSPA-SP-----VYSSSDTVAC-------HLTHTVPcAHQKPITALKAAA--GRLVTGSQDHTL 1100
Cdd:COG2319    278 GELLRTL--TGHSGGVNSVAfSPdgkllASGSDDGTVRlwdlatgKLLRTLT-GHTGAVRSVAFSPdgKTLASGSDDGTV 354
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2462588154 1101 RVFRLEDSCCLFTLQGHSGAITTVYI--DQTMvLASGGQDGAICLWDV 1146
Cdd:COG2319    355 RLWDLATGELLRTLTGHTGAVTSVAFspDGRT-LASGSADGTVRLWDL 401
2A060605 TIGR00920
3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, ...
276-442 8.32e-14

3-hydroxy-3-methylglutaryl-coenzyme A reductase; [Transport and binding proteins, Carbohydrates, organic alcohols, and acids]


Pssm-ID: 273339 [Multi-domain]  Cd Length: 886  Bit Score: 76.43  E-value: 8.32e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  276 FKEEIGVAELIPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPT-LNggEIFPYLVV 354
Cdd:TIGR00920   53 FEEEYLSSDVIVMTITRCIAVLYIYYQFCNLRQLGSKYILGIAGLFTIFSSFVFSTAVIHFLGSELTgLN--EALPFFLL 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  355 VIGLENVLVLTKSVVSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQML 434
Cdd:TIGR00920  131 LIDLSKASALAKFALSSNSQDEVRDNIARGMAILGPTITLDTVVETLVIGVGTMSGVRRLEVLCCFGCMSVLANYFVFMT 210

                   ....*...
gi 2462588154  435 FFTTVLSI 442
Cdd:TIGR00920  211 FFPACLSL 218
2A060602 TIGR00918
The Eukaryotic (Putative) Sterol Transporter (EST) Family;
286-491 1.26e-13

The Eukaryotic (Putative) Sterol Transporter (EST) Family;


Pssm-ID: 273338 [Multi-domain]  Cd Length: 1145  Bit Score: 76.07  E-value: 1.26e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  286 IPLVTTYIILFAYIYFSTRKIDMVKSKWGLALAAVVTVLSSLLMSVGLCTLFGLTPTLNGGEIFPYLVVVIGLENVLVLT 365
Cdd:TIGR00918  400 IRIVSGYLLMLAYACLTMLRWDCAKSQGSVGLAGVLLVALSVAAGLGLCALLGISFNAATTQVLPFLALGVGVDDVFLLA 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  366 KSVVSTPVDLEVKLRIAQGLSSESWSIMKNMATELGIILIGYFTLVPAIQEFCLFAVVGLVSDFFLQMLFFTTVLSIDIR 445
Cdd:TIGR00918  480 HAFSETGQNIPFEERTGECLKRTGASVVLTSISNVTAFFMAALIPIPALRAFSLQAAIVVVFNFAAVLLVFPAILSLDLR 559
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462588154  446 RMEladlNKRLPPEACL--PSAKPVGQ--PTRYERQLAVRPSTPH-TITLQ 491
Cdd:TIGR00918  560 RRE----DRRLDIFCCFfsPCSARVIQiePQAYADGSAPPVYSSHmQSTVQ 606
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1110-1234 5.72e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 70.83  E-value: 5.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1110 CLFTLQGHSGAITTV-YIDQTMVLASGGQDGAICLWDVLTGSRVSHVFAHRGDVTSLTCT--TSCVISSGLDDLISIWDR 1186
Cdd:cd00200      1 LRRTLKGHTGGVTCVaFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASadGTYLASGSSDKTIRLWDL 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1187 STGIKFYSIQQDLGCGASLGVISDNLLVTGG--QGCVSFWDLNYGDLLQT 1234
Cdd:cd00200     81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSsrDKTIKVWDVETGKCLTT 130
WD40 COG2319
WD40 repeat [General function prediction only];
1068-1235 1.48e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 68.01  E-value: 1.48e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1068 HLTHTVPCAHQKPITALKAAAGRLVTGSQDHTLRVFRLEDSCCLFTLQGHSGAITTV-YIDQTMVLASGGQDGAICLWDV 1146
Cdd:COG2319     28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVaFSPDGRLLASASADGTVRLWDL 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154 1147 LTGSRVSHVFAHRGDVTSLTCT--TSCVISSGLDDLISIWDRSTGIKFYSIQQDLGCGASLGVISD-NLLVTGGQ-GCVS 1222
Cdd:COG2319    108 ATGLLLRTLTGHTGAVRSVAFSpdGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDgKLLASGSDdGTVR 187
                          170
                   ....*....|...
gi 2462588154 1223 FWDLNYGDLLQTV 1235
Cdd:COG2319    188 LWDLATGKLLRTL 200
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1107-1145 1.56e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.07  E-value: 1.56e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462588154  1107 DSCCLFTLQGHSGAITTVYIDQT-MVLASGGQDGAICLWD 1145
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDgKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1110-1145 7.13e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.18  E-value: 7.13e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2462588154 1110 CLFTLQGHSGAITTVYIDQT-MVLASGGQDGAICLWD 1145
Cdd:pfam00400    3 LLKTLEGHTGSVTSLAFSPDgKLLASGSDDGTVKVWD 39
MMPL COG1033
Predicted exporter protein, RND superfamily [General function prediction only];
285-442 9.98e-04

Predicted exporter protein, RND superfamily [General function prediction only];


Pssm-ID: 440656 [Multi-domain]  Cd Length: 767  Bit Score: 43.31  E-value: 9.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  285 LIPLVTTYIILFAYIYFstrkidmvKSKWGLALAaVVTVLSSLLMSVGLCTLFG--LTPTLNggeIFPYLVVVIGLENVL 362
Cdd:COG1033    223 FFPLALLLILLLLFLFF--------RSLRGVLLP-LLVVLLAVIWTLGLMGLLGipLSPLTI---LVPPLLLAIGIDYGI 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462588154  363 -VLTKsvvstpvdleVKLRIAQGLSSESwsIMKNMATELGI-IL-------IGYFTL----VPAIQEFCLFAVVGLVSDF 429
Cdd:COG1033    291 hLLNR----------YREERRKGLDKRE--ALREALRKLGPpVLltslttaIGFLSLlfsdIPPIRDFGIVAAIGVLLAF 358
                          170
                   ....*....|...
gi 2462588154  430 FLQMLFFTTVLSI 442
Cdd:COG1033    359 LTSLTLLPALLSL 371
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
771-813 1.59e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.94  E-value: 1.59e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 2462588154  771 VLRGHLMDIECLA--SDGMLLVSCCLAGHICVWDAQTGDCLTRIP 813
Cdd:cd00200    214 TLRGHENGVNSVAfsPDGYLLASGSEDGTIRVWDLRTGECVQTLS 258
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
771-802 1.65e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 1.65e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2462588154   771 VLRGHLMDIECLA--SDGMLLVSCCLAGHICVWD 802
Cdd:smart00320    7 TLKGHTGPVTSVAfsPDGKYLASGSDDGTIKLWD 40
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH