NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720373120|ref|XP_017170702|]
View 

prospero homeobox protein 2 isoform X1 [Mus musculus]

Protein Classification

homeo-prospero domain-containing protein( domain architecture ID 10523599)

homeo-prospero domain (HPD)-containing protein similar to Drosophila melanogaster homeobox protein prospero, a homeodomain protein that controls neuronal identity

CATH:  1.10.10.500
Gene Ontology:  GO:0003677|GO:0003700
PubMed:  12429095|15837198

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
HPD pfam05044
Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is ...
435-587 1.68e-107

Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is expressed in all neural lineages of drosophila embryos. It is needed for correct expression of several neural proteins and in determining the cell fates of neural stem cells. homologs of prospero are found in a wide range of animals including humans with the highest level of similarity being found in the C-terminal 160 amino acids. This region was identified as containing an atypical homeobox domain followed by a prospero domain. However, the structure shows that these two regions form a single stable structural domain as defined here. This homeo-prospero domain binds to DNA.


:

Pssm-ID: 461534  Cd Length: 154  Bit Score: 319.38  E-value: 1.68e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120 435 GLSPGHLKKAKLMFFFTRYPSSSLLKAYFPDVQFNRCITSQMIKWFSNFREFYYIQMEKYARQALSDGITNAQALAVLRD 514
Cdd:pfam05044   1 GLTPMHLKKAKLMFFYTRYPSSNVLKTYFPDVKFNRCNTSQLIKWFSNFREFYYIQMEKFARQALSEGVTDAEDLLVSRD 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720373120 515 SELFRVLNTHYNKGNDFEVPDCFLEIAALTLKEFFRAVLAGKDSDPSWKKPIYKVISKLDSDVPEMLKSPSFL 587
Cdd:pfam05044  81 SELFRVLNLHYNKNNDFEVPDRFLEVVQLTLREFFNAIQAGKDSDPSWKKAIYKVICKLDSEVPEIFKSPNFL 153
PHA03247 super family cl33720
large tegument protein UL36; Provisional
27-443 1.59e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120   27 QERSPATAEAGRDSFPsGQLPSSSLTEADWFWDEHIQAKRARVETIVRGMCLSP-------SSSVSGRARESLRCPEKGR 99
Cdd:PHA03247  2594 QSARPRAPVDDRGDPR-GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVppperprDDPAPGRVSRPRRARRLGR 2672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  100 --------ERKRKQSLPMHQGPLKSS---PAWERGPKKGGTRVKEQLHLLKQQLRHLQEHVLQATEPRAPAqSPGGTEPR 168
Cdd:PHA03247  2673 aaqassppQRPRRRAARPTVGSLTSLadpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPA-VPAGPATP 2751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  169 SSPRARPRNSCSSGAW--TVENEPHQSSSKDLCGAVKPGAAEVLQYSEEPMlCPSGPRALVETLRKELSRAVSQAV---- 242
Cdd:PHA03247  2752 GGPARPARPPTTAGPPapAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW-DPADPPAAVLAPAAALPPAASPAGplpp 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  243 -DSVLQQVLFDPQRHLTQQERSCQGLASEGRNQPSPPGRSAYKDPLALATLPRKIQPQAGVPLGNSTLARPLDSPMCPVS 321
Cdd:PHA03247  2831 pTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  322 PRGVPRSYQSPLPNCPLTNVPSHTWENQMLRQLL-GRGPDGQWSGSPPQDAAFQSHTSPESAQQPwglsQQQLPLSLTPV 400
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApTTDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSR 2986
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1720373120  401 HLESRPLPPPVKMEQGVLRGVADSLpfsSIHIQEGLSPGHLKK 443
Cdd:PHA03247  2987 EAPASSTPPLTGHSLSRVSSWASSL---ALHEETDPPPVSLKQ 3026
 
Name Accession Description Interval E-value
HPD pfam05044
Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is ...
435-587 1.68e-107

Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is expressed in all neural lineages of drosophila embryos. It is needed for correct expression of several neural proteins and in determining the cell fates of neural stem cells. homologs of prospero are found in a wide range of animals including humans with the highest level of similarity being found in the C-terminal 160 amino acids. This region was identified as containing an atypical homeobox domain followed by a prospero domain. However, the structure shows that these two regions form a single stable structural domain as defined here. This homeo-prospero domain binds to DNA.


Pssm-ID: 461534  Cd Length: 154  Bit Score: 319.38  E-value: 1.68e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120 435 GLSPGHLKKAKLMFFFTRYPSSSLLKAYFPDVQFNRCITSQMIKWFSNFREFYYIQMEKYARQALSDGITNAQALAVLRD 514
Cdd:pfam05044   1 GLTPMHLKKAKLMFFYTRYPSSNVLKTYFPDVKFNRCNTSQLIKWFSNFREFYYIQMEKFARQALSEGVTDAEDLLVSRD 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720373120 515 SELFRVLNTHYNKGNDFEVPDCFLEIAALTLKEFFRAVLAGKDSDPSWKKPIYKVISKLDSDVPEMLKSPSFL 587
Cdd:pfam05044  81 SELFRVLNLHYNKNNDFEVPDRFLEVVQLTLREFFNAIQAGKDSDPSWKKAIYKVICKLDSEVPEIFKSPNFL 153
PHA03247 PHA03247
large tegument protein UL36; Provisional
27-443 1.59e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120   27 QERSPATAEAGRDSFPsGQLPSSSLTEADWFWDEHIQAKRARVETIVRGMCLSP-------SSSVSGRARESLRCPEKGR 99
Cdd:PHA03247  2594 QSARPRAPVDDRGDPR-GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVppperprDDPAPGRVSRPRRARRLGR 2672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  100 --------ERKRKQSLPMHQGPLKSS---PAWERGPKKGGTRVKEQLHLLKQQLRHLQEHVLQATEPRAPAqSPGGTEPR 168
Cdd:PHA03247  2673 aaqassppQRPRRRAARPTVGSLTSLadpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPA-VPAGPATP 2751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  169 SSPRARPRNSCSSGAW--TVENEPHQSSSKDLCGAVKPGAAEVLQYSEEPMlCPSGPRALVETLRKELSRAVSQAV---- 242
Cdd:PHA03247  2752 GGPARPARPPTTAGPPapAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW-DPADPPAAVLAPAAALPPAASPAGplpp 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  243 -DSVLQQVLFDPQRHLTQQERSCQGLASEGRNQPSPPGRSAYKDPLALATLPRKIQPQAGVPLGNSTLARPLDSPMCPVS 321
Cdd:PHA03247  2831 pTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  322 PRGVPRSYQSPLPNCPLTNVPSHTWENQMLRQLL-GRGPDGQWSGSPPQDAAFQSHTSPESAQQPwglsQQQLPLSLTPV 400
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApTTDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSR 2986
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1720373120  401 HLESRPLPPPVKMEQGVLRGVADSLpfsSIHIQEGLSPGHLKK 443
Cdd:PHA03247  2987 EAPASSTPPLTGHSLSRVSSWASSL---ALHEETDPPPVSLKQ 3026
 
Name Accession Description Interval E-value
HPD pfam05044
Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is ...
435-587 1.68e-107

Homeo-prospero domain; Prospero is a large drosophila transcription factor protein that is expressed in all neural lineages of drosophila embryos. It is needed for correct expression of several neural proteins and in determining the cell fates of neural stem cells. homologs of prospero are found in a wide range of animals including humans with the highest level of similarity being found in the C-terminal 160 amino acids. This region was identified as containing an atypical homeobox domain followed by a prospero domain. However, the structure shows that these two regions form a single stable structural domain as defined here. This homeo-prospero domain binds to DNA.


Pssm-ID: 461534  Cd Length: 154  Bit Score: 319.38  E-value: 1.68e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120 435 GLSPGHLKKAKLMFFFTRYPSSSLLKAYFPDVQFNRCITSQMIKWFSNFREFYYIQMEKYARQALSDGITNAQALAVLRD 514
Cdd:pfam05044   1 GLTPMHLKKAKLMFFYTRYPSSNVLKTYFPDVKFNRCNTSQLIKWFSNFREFYYIQMEKFARQALSEGVTDAEDLLVSRD 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720373120 515 SELFRVLNTHYNKGNDFEVPDCFLEIAALTLKEFFRAVLAGKDSDPSWKKPIYKVISKLDSDVPEMLKSPSFL 587
Cdd:pfam05044  81 SELFRVLNLHYNKNNDFEVPDRFLEVVQLTLREFFNAIQAGKDSDPSWKKAIYKVICKLDSEVPEIFKSPNFL 153
PHA03247 PHA03247
large tegument protein UL36; Provisional
27-443 1.59e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120   27 QERSPATAEAGRDSFPsGQLPSSSLTEADWFWDEHIQAKRARVETIVRGMCLSP-------SSSVSGRARESLRCPEKGR 99
Cdd:PHA03247  2594 QSARPRAPVDDRGDPR-GPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVppperprDDPAPGRVSRPRRARRLGR 2672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  100 --------ERKRKQSLPMHQGPLKSS---PAWERGPKKGGTRVKEQLHLLKQQLRHLQEHVLQATEPRAPAqSPGGTEPR 168
Cdd:PHA03247  2673 aaqassppQRPRRRAARPTVGSLTSLadpPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPA-VPAGPATP 2751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  169 SSPRARPRNSCSSGAW--TVENEPHQSSSKDLCGAVKPGAAEVLQYSEEPMlCPSGPRALVETLRKELSRAVSQAV---- 242
Cdd:PHA03247  2752 GGPARPARPPTTAGPPapAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW-DPADPPAAVLAPAAALPPAASPAGplpp 2830
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  243 -DSVLQQVLFDPQRHLTQQERSCQGLASEGRNQPSPPGRSAYKDPLALATLPRKIQPQAGVPLGNSTLARPLDSPMCPVS 321
Cdd:PHA03247  2831 pTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQ 2910
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720373120  322 PRGVPRSYQSPLPNCPLTNVPSHTWENQMLRQLL-GRGPDGQWSGSPPQDAAFQSHTSPESAQQPwglsQQQLPLSLTPV 400
Cdd:PHA03247  2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApTTDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSR 2986
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1720373120  401 HLESRPLPPPVKMEQGVLRGVADSLpfsSIHIQEGLSPGHLKK 443
Cdd:PHA03247  2987 EAPASSTPPLTGHSLSRVSSWASSL---ALHEETDPPPVSLKQ 3026
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH