NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|577861066|ref|NP_001276791|]
View 

period circadian protein homolog 3 isoform 1 [Homo sapiens]

Protein Classification

PAS and Period_C domain-containing protein( domain architecture ID 12888871)

protein containing domains PAS, Herpes_BLLF1, and Period_C

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Period_C super family cl13540
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1083-1185 1.14e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


The actual alignment was detected with superfamily member pfam12114:

Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  1083 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1161
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 577861066  1162 EELAKVYNWIQSQTVTQEIDIQAC 1185
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.80e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


:

Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.80e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130    14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                          90
                  ....*....|...
gi 577861066  365 SRKISFIIGRHKV 377
Cdd:cd00130    91 GEVIGLLGVVRDI 103
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1063 3.83e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.83e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   752 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 831
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   832 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 909
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   910 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 986
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 577861066   987 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1063
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1083-1185 1.14e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  1083 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1161
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 577861066  1162 EELAKVYNWIQSQTVTQEIDIQAC 1185
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.80e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.80e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130    14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                          90
                  ....*....|...
gi 577861066  365 SRKISFIIGRHKV 377
Cdd:cd00130    91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
285-373 6.94e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.94e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   285 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 362
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 577861066   363 pWSRKISFIIG 373
Cdd:pfam08447   78 -ENGKPVRVIG 87
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1063 3.83e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.83e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   752 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 831
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   832 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 909
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   910 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 986
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 577861066   987 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1063
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-329 1.25e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.25e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 577861066    285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 329
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
749-1064 7.99e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 7.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  749 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 814
Cdd:PHA03307   64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  815 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 894
Cdd:PHA03307  144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  895 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 973
Cdd:PHA03307  217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  974 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1053
Cdd:PHA03307  283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                         330
                  ....*....|.
gi 577861066 1054 STGSPPSESPS 1064
Cdd:PHA03307  360 ADPSSPRKRPR 370
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
274-374 3.25e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  274 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 353
Cdd:COG5805   169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                          90       100
                  ....*....|....*....|..
gi 577861066  354 DSSWSSFVNP-WSRKISFIIGR 374
Cdd:COG5805   245 EAVIVPLIDTdGSVKGILVILR 266
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1083-1185 1.14e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 to 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.14e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  1083 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1161
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 577861066  1162 EELAKVYNWIQSQTVTQEIDIQAC 1185
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-377 1.80e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.80e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 364
Cdd:cd00130    14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                          90
                  ....*....|...
gi 577861066  365 SRKISFIIGRHKV 377
Cdd:cd00130    91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
285-373 6.94e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.94e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   285 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 362
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 577861066   363 pWSRKISFIIG 373
Cdd:pfam08447   78 -ENGKPVRVIG 87
PAS_11 pfam14598
PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), ...
275-377 3.16e-09

PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), which binds to an LXXLL motif in the C-terminal region of STAT6 (Signal transducer and activator of transcription 6).


Pssm-ID: 464214 [Multi-domain]  Cd Length: 110  Bit Score: 55.76  E-value: 3.16e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   275 FTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHppfEHSPI-RFCTQNGDYIIL 353
Cdd:pfam14598    4 FTTRHDIDGKIISCDTRAPFSLGYEKDELVGRSIYDLVHPQDLRTAKSHLREIIQTRGR---ATSPSyRLRLRDGDFLSV 80
                           90       100
                   ....*....|....*....|....
gi 577861066   354 DSSWSSFVNPWSRKISFIIGRHKV 377
Cdd:pfam14598   81 HTKSKLFLNQNSNQQPFIMCTHTI 104
PAS pfam00989
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
272-371 3.87e-09

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya. This domain can bind gases (O2, CO and NO), FAD, 4-hydroxycinnamic acid and NAD+ (Matilla et.al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).


Pssm-ID: 395786 [Multi-domain]  Cd Length: 113  Bit Score: 55.50  E-value: 3.87e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   272 KRIFTTTHTPGCV------FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKyAGHPPFEHSpIRFCT 345
Cdd:pfam00989    4 RAILESLPDGIFVvdedgrILYVNAAAEELLGLSREEVIGKSLLDLIPEEDDAEVAELLRQALL-QGEESRGFE-VSFRV 81
                           90       100
                   ....*....|....*....|....*.
gi 577861066   346 QNGDYIILDSSWSSFVNPWSRKISFI 371
Cdd:pfam00989   82 PDGRPRHVEVRASPVRDAGGEILGFL 107
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1063 3.83e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.85  E-value: 3.83e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   752 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 831
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   832 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 909
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   910 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQCVTGNNGSES-SPATTGALSTGSP 986
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 577861066   987 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1063
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
285-329 1.25e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.25e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 577861066    285 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 329
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
756-1066 2.14e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.31  E-value: 2.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   756 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 826
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   827 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 901
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   902 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESpDQMRRNTCPQTEYQCVTGNNGSESSPATTGA 980
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTS-DNSTSHMPLLTSAHPTGGENITQVTPASTST 691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   981 --LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGS 1057
Cdd:pfam05109  692 hhVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGK 765

                   ....*....
gi 577861066  1058 PPSESPSRT 1066
Cdd:pfam05109  766 HTTGHGART 774
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
749-1064 7.99e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 7.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  749 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 814
Cdd:PHA03307   64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  815 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 894
Cdd:PHA03307  144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  895 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSES 973
Cdd:PHA03307  217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRP 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  974 SPATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvl 1053
Cdd:PHA03307  283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP-- 359
                         330
                  ....*....|.
gi 577861066 1054 STGSPPSESPS 1064
Cdd:PHA03307  360 ADPSSPRKRPR 370
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
753-1065 3.79e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 3.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  753 PEPPDSSSSNTGSGPRRGAHQNAQPCC-PSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPL-PAATSPGREYAAPGTAPE 830
Cdd:PHA03307   80 PANESRSTPTWSLSTLAPASPAREGSPtPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPvGSPGPPPAASPPAAGASP 159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  831 GLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMsptldPPPSVTSQRR 910
Cdd:PHA03307  160 AAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA-----SSPAPAPGRS 227
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  911 EEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNN-GSESSPATTGALSTGSPPRE 989
Cdd:PHA03307  228 AADDAGASSSD----SSSSESSGCGWGPENECPLPRPAP-----ITLPTRIWEASGWNGpSSRPGPASSSSSPRERSPSP 298
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 577861066  990 NPSHPTASALSTGSPPMKNPSHPTASALSTGSPpmknpSHPTAStlSMGLPPSRTPSHPTAtvLSTGSPPSESPSR 1065
Cdd:PHA03307  299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS-----SSESSR--GAAVSPGPSPSRSPS--PSRPPPPADPSSP 365
PHA03247 PHA03247
large tegument protein UL36; Provisional
739-1064 3.85e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 3.85e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  739 SAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSP 818
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  819 GREYAAPGTaPEGlHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPS-FLPCPflgATASSAISPSMSSAMSP 897
Cdd:PHA03247 2736 PAAPAPPAV-PAG-PATPGGPARPARPPTT-------AGPPAPAPPAAPAAGPPrRLTRP---AVASLSESRESLPSPWD 2803
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  898 TLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYQcvtgNNGSESSPAT 977
Cdd:PHA03247 2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGGDVR----RRPPSRSPAA 2873
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  978 TGALSTGSP----PRENPSHPTASaLSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSrtPSHPTATVL 1053
Cdd:PHA03247 2874 KPAAPARPPvrrlARPAVSRSTES-FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPA 2950
                         330
                  ....*....|.
gi 577861066 1054 STGSPPSESPS 1064
Cdd:PHA03247 2951 GAGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
755-1063 4.59e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.59e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  755 PPDSSSSNTGSGPRRGAHQNAQPccpsAASSPHTSSPTFPPAAmvPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHG 834
Cdd:PHA03247 2592 PPQSARPRAPVDDRGDPRGPAPP----SPLPPDTHAPDPPPPS--PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR 2665
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  835 LPLSEGLQPYPAFPfpyldtfmtvflPDPPVCPLLSPSFLPCPFLGatassaispsmssamsptlDPPPSvtsQRREEEK 914
Cdd:PHA03247 2666 RARRLGRAAQASSP------------PQRPRRRAARPTVGSLTSLA-------------------DPPPP---PPTPEPA 2711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  915 WEAQSEGHPfitsrssSPLQLNLLQEEMPRPSESPdqmrrNTCPQTEYQCVTGNNGSESSPATTGALSTGSPPRENPSHP 994
Cdd:PHA03247 2712 PHALVSATP-------LPPGPAAARQASPALPAAP-----APPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 577861066  995 ----TASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPShPTATVLSTGSPPSESP 1063
Cdd:PHA03247 2780 prrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
755-1061 8.28e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 8.28e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  755 PPDSSSSNTGSGPRRGAhQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPgreyaAPGTAPEGLHG 834
Cdd:PHA03247 2742 PAVPAGPATPGGPARPA-RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP-----ADPPAAVLAPA 2815
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  835 LPLSEGLQPYPAFPFPyldtfmTVFLPDPPVCPllsPSFLPCPFlgATASSAISPSMSSAMSPTLDPPPSVTSQRREEEK 914
Cdd:PHA03247 2816 AALPPAASPAGPLPPP------TSAQPTAPPPP---PGPPPPSL--PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  915 WEAQSEghpfiTSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNG---SESSPATTGALSTGSPPRENP 991
Cdd:PHA03247 2885 RLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPpprPQPPLAPTTDPAGAGEPSGAV 2959
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 577861066  992 SHPTASALSTGSPPMKN----PSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSE 1061
Cdd:PHA03247 2960 PQPWLGALVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDD 3033
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
753-1066 9.27e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 9.27e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  753 PEPPDSSSSNTGSGPRRG-AHQNAQPCCPSAASSPHTSSPtFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 831
Cdd:PHA03307  114 PDPPPPTPPPASPPPSPApDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAE 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  832 LHGLPLSEGLQPYPafpfPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPS---VTSQ 908
Cdd:PHA03307  193 PPPSTPPAAASPRP----PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitLPTR 268
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  909 RREEEKWEAQSEGHPFITSRSSSPlqlnllqEEMPRPSESPDQMRRNTCPQTeyqcVTGNNGSESSPATTGALSTGSPPR 988
Cdd:PHA03307  269 IWEASGWNGPSSRPGPASSSSSPR-------ERSPSPSPSSPGSGPAPSSPR----ASSSSSSSRESSSSSTSSSSESSR 337
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 577861066  989 ENPSHPtasalstGSPPMKNPSHPTASALSTGSPPMKN-PSHPTASTLSMGlPPSRTPSHPTATVLSTGSPPSESPSRT 1066
Cdd:PHA03307  338 GAAVSP-------GPSPSRSPSPSRPPPPADPSSPRKRpRPSRAPSSPAAS-AGRPTRRRARAAVAGRARRRDATGRFP 408
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1064 4.24e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 4.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   752 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-HTSSPTFP-PAAMVPSQAPYLVPAFPLPAATSPGREYA-APGTA 828
Cdd:pfam03154  252 MTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPqPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   829 PEGLHGLPLSEglQPYPAFPFPyldtfMTVFLPDP--PVCPLLSPSFLPCPflgatasSAISPSMSSAMSPTLDPPPSVt 906
Cdd:pfam03154  332 SQLQSQQPPRE--QPLPPAPLS-----MPHIKPPPttPIPQLPNPQSHKHP-------PHLSGPSPFQMNSNLPPPPAL- 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   907 sqrreeEKWEAQSEGHPfiTSRSSSPLQLNLLQEEMPRPSESPDQMrrntcpqTEYQCVTGnngSESSPATTGALSTGSP 986
Cdd:pfam03154  397 ------KPLSSLSTHHP--PSAHPPPLQLMPQSQQLPPPPAQPPVL-------TQSQSLPP---PAASHPPTSGLHQVPS 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   987 preNPSHPTASALSTGSPPMKNPSHPTASALSTGS---PPMKNP---SHPTASTLSMGLPPSRTPSHPTATVLSTGS--P 1058
Cdd:pfam03154  459 ---QSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgiqPPSSASvssSGPVPAAVSCPLPPVQIKEEALDEAEEPESppP 535

                   ....*.
gi 577861066  1059 PSESPS 1064
Cdd:pfam03154  536 PPRSPS 541
PHA03247 PHA03247
large tegument protein UL36; Provisional
752-1064 7.09e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.63  E-value: 7.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  752 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-------HTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAA 824
Cdd:PHA03247 2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  825 PGTAPE-----GLHGLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPlLSPSFLPCPFLGATASSAISPSMSSAMSPTL 899
Cdd:PHA03247 2704 PPPTPEpaphaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP-GGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  900 DPPPSVTSQrrEEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYQCVTGNNGSESSPATTG 979
Cdd:PHA03247 2783 LTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG 2860
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  980 ALSTGSPPRENPSHPTAsalstgsppmknPSHPTASALStgSPPMKNPSHPTASTlSMGLPPSRTPSHPTATVLSTGSPP 1059
Cdd:PHA03247 2861 DVRRRPPSRSPAAKPAA------------PARPPVRRLA--RPAVSRSTESFALP-PDQPERPPQPQAPPPPQPQPQPPP 2925

                  ....*
gi 577861066 1060 SESPS 1064
Cdd:PHA03247 2926 PPQPQ 2930
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
753-1064 5.35e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 5.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   753 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLP-------AATSPGREYAAP 825
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplqpmTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   826 GTAPEGLHGL--PLSEGLQ--------PYPAFPFPYLDTFMTVFLPDPPVCPLLSPSflpcpflgatassaispsmssAM 895
Cdd:pfam03154  265 PLPQPSLHGQmpPMPHSLQtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQS---------------------QQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   896 SPTLDPPPSVTSQRR--EEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPRPSE----SPDQMRRNTCPQTEYQ---CVT 966
Cdd:pfam03154  324 RIHTPPSQSQLQSQQppREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHlsgpSPFQMNSNLPPPPALKplsSLS 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066   967 GNNGSESSPATTGALSTGSPPRENPSHPTASALSTGSPPmKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPS 1046
Cdd:pfam03154  404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPP-PAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPP 482
                          330
                   ....*....|....*...
gi 577861066  1047 HPTATVLSTGSPPSESPS 1064
Cdd:pfam03154  483 TSTSSAMPGIQPPSSASV 500
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
274-374 3.25e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  274 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 353
Cdd:COG5805   169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                          90       100
                  ....*....|....*....|..
gi 577861066  354 DSSWSSFVNP-WSRKISFIIGR 374
Cdd:COG5805   245 EAVIVPLIDTdGSVKGILVILR 266
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
753-1063 3.65e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 3.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  753 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGL 832
Cdd:PHA03307  129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  833 HGLPLSEG-LQPYPAFP----FPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPP--SV 905
Cdd:PHA03307  209 RSSPISASaSSPAPAPGrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPasSS 288
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  906 TSQRREEEKWEAQSEGHPFITSRSSSPLQLNLLQEEMPrPSESPDQMRRNTCPQTeyqcvTGNNGSES-SPATTGALSTG 984
Cdd:PHA03307  289 SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS-SSTSSSSESSRGAAVS-----PGPSPSRSpSPSRPPPPADP 362
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 577861066  985 SPPRENPshPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPtastlsmgLPPSRTPSHPTATVLSTGSPPSESP 1063
Cdd:PHA03307  363 SSPRKRP--RPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGR--------FPAGRPRPSPLDAGAASGAFYARYP 431
PHA03379 PHA03379
EBNA-3A; Provisional
753-1063 9.08e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 9.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  753 PEPPDSSSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTfPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 831
Cdd:PHA03379  425 PEVPQSLETATSHGSAQVPEPPpVHDLEPGPLHDQHSMAPC-PVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAG 503
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  832 LHGLPLSEGLQPYPAFPF-PYLDTFMTV-FLPDP------PVCPLLSPSFLPCPflGATASSAISPSMSSAMSPTLDPPP 903
Cdd:PHA03379  504 PIVRPWEASLSQVPGVAFaPVMPQPMPVePVPVPtvalerPVCPAPPLIAMQGP--GETSGIVRVRERWRPAPWTPNPPR 581
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  904 SVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLL--QEEMPRPSEsPDQMRRNTCPQTEYQCVTGNNG----------- 970
Cdd:PHA03379  582 SPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVspQQPMEYPLE-PEQQMFPGSPFSQVADVMRAGGvpamqpqyfdl 660
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 577861066  971 SESSPATTGALST-------GSPPR--ENPSH---PTASALSTGSP--------PMKNPSHPtASALSTGSPPMKNPSHP 1030
Cdd:PHA03379  661 PLQQPISQGAPLAplrasmgPVPPVpaTQPQYfdiPLTEPINQGASaahflpqqPMEGPLVP-ERWMFQGATLSQSVRPG 739
                         330       340       350
                  ....*....|....*....|....*....|...
gi 577861066 1031 TASTLSMGLPPSRTPSHPTATVLSTGSPPSESP 1063
Cdd:PHA03379  740 VAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGP 772
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH