Conserved domains on  [gi|1370455087|ref|XP_024306358|]

period circadian protein homolog 3 isoform X11 [Homo sapiens]

Protein Classification

PAS and Period_C domain-containing protein (domain architecture ID 12888871)

protein containing domains PAS, Herpes_BLLF1, and Period_C


List of domain hits

Name Accession Description Interval E-value
Period_C super family cl13540
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1074-1176 1.15e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 and 200 amino acids in length. This domain is found associated with pfam08447.


The actual alignment was detected with superfamily member pfam12114:

Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.15e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087 1074 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1152
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 1370455087 1153 EELAKVYNWIQSQTVTQEIDIQAC 1176
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.78e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.



Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 1370455087  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
748-1057 3.68e-08

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 58.00  E-value: 3.68e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  748 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 818
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  819 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 893
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  894 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGA- 971
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTh 692
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  972 -LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSP 1049
Cdd:pfam05109  693 hVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKH 766

                   ....*...
gi 1370455087 1050 PSESPSRT 1057
Cdd:pfam05109  767 TTGHGART 774
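The statistics line that precedes each alignment (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be read programmatically. The Python sketch below is illustrative only: the names parse_stats and model_coverage are hypothetical and not part of any NCBI tool, and the regular expression assumes the line layout seen in this listing.

```python
import re

# Assumed layout, taken from the lines in this listing:
#   Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.78e-12
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one 'Pssm-ID: ...' statistics line into a dictionary."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(cd_from, cd_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (cd_to - cd_from + 1) / cd_length
```

For the Period_C hit, the alignment spans model positions 68 to 171 of a 171-residue model, so model_coverage(68, 171, 171) gives roughly 0.61: the hit covers only the C-terminal part of the model.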
 
Name Accession Description Interval E-value
Period_C pfam12114
Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is ...
1074-1176 1.15e-25

Period protein 2/3C-terminal region; This domain is found in eukaryotes. This domain is typically between 164 and 200 amino acids in length. This domain is found associated with pfam08447.


Pssm-ID: 463464  Cd Length: 171  Bit Score: 104.79  E-value: 1.15e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087 1074 SVYSSKISQNGQQSQDVQKKETF-PNVAEEPIWRMIRQTPERILMTYQVPERVKEVVLKEDLEKLESMRQQQPQFSHGQK 1152
Cdd:pfam12114   68 SIDSSENNHKAKKTAEVGEEEHFiKCVLQDPIWLLMANTDDSVMMTYQIPSRDLETVLKEDREKLKAMQKMQPRFTEDQK 147
                           90       100
                   ....*....|....*....|....
gi 1370455087 1153 EELAKVYNWIQSQTVTQEIDIQAC 1176
Cdd:pfam12114  148 GELAEVHPWIQKGGLPAALDLSEC 171
PAS cd00130
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-376 1.78e-12

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.


Pssm-ID: 238075 [Multi-domain]  Cd Length: 103  Bit Score: 64.58  E-value: 1.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEhspIRFCTQNGDYIILDSSWSSFVNPW 363
Cdd:cd00130     14 ILYANPAAEQLLGYSPEELIGKSLLDLIHPEDREELRERLENLLSGGEPVTLE---VRLRRKDGSVIWVLVSLTPIRDEG 90
                           90
                   ....*....|...
gi 1370455087  364 SRKISFIIGRHKV 376
Cdd:cd00130     91 GEVIGLLGVVRDI 103
PAS_3 pfam08447
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
284-372 6.88e-12

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.


Pssm-ID: 430001 [Multi-domain]  Cd Length: 89  Bit Score: 62.74  E-value: 6.88e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  284 FLEVDEKAVPLLGYLPQDLIGT--SILSYLHPEDRSLMVAIHQKVLKYAGhpPFEHsPIRFCTQNGDYIILDSSWSSFVN 361
Cdd:pfam08447    1 IIYWSPRFEEILGYTPEELLGKgeSWLDLVHPDDRERVREALWEALKGGE--PYSG-EYRIRRKDGEYRWVEARARPIRD 77
                           90
                   ....*....|.
gi 1370455087  362 pWSRKISFIIG 372
Cdd:pfam08447   78 -ENGKPVRVIG 87
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
748-1057 3.68e-08

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 58.00  E-value: 3.68e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  748 PDSSSSNTGSGPRRGAHQNAQPCCPSAA--------SSPHTSSPTFPPAAMVP-SQAPYLVPAFPLPAATSPGREYAAPG 818
Cdd:pfam05109  466 PTVSTADVTSPTPAGTTSGASPVTPSPSprdngtesKAPDMTSPTSAVTTPTPnATSPTPAVTTPTPNATSPTLGKTSPT 545
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  819 TAPEglhgLPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPS-FLPCPFLGATASSAISPSMS----SAMSPTLDP 893
Cdd:pfam05109  546 SAVT----TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTpNATSPTVGETSPQANTTNHTlggtSSTPVVTSP 621
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  894 PPSVTSqrreeekweAQSEGHPFITSRSSSPLQLNLLQ-EEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGA- 971
Cdd:pfam05109  622 PKNATS---------AVTTGQHNITSSSTSSMSLRPSSiSETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTh 692
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  972 -LSTGSP-PRenpshPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSP 1049
Cdd:pfam05109  693 hVSTSSPaPR-----PGTTSQASGPGNSSTSTKPGEVNVTKGTPP-KNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKH 766

                   ....*...
gi 1370455087 1050 PSESPSRT 1057
Cdd:pfam05109  767 TTGHGART 774
PAS smart00091
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising ...
284-328 1.24e-07

PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels.


Pssm-ID: 214512  Cd Length: 67  Bit Score: 49.71  E-value: 1.24e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 1370455087   284 FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK 328
Cdd:smart00091   23 ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS 67
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
745-1057 1.38e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 1.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  745 PEPPDSSSSNTGSGPRRG-AHQNAQPCCPSAASSPHTSSPtFPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03307   114 PDPPPPTPPPASPPPSPApDLSEMLRPVGSPGPPPAASPP-AAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAE 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  824 LHGLPLSEGLQPYPafpfPYLDTFMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSAMSPTLDPPPS---VTSQ 900
Cdd:PHA03307   193 PPPSTPPAAASPRP----PRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPApitLPTR 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  901 RREEEKWEAQSEGHPFITSRSSSPlqlnllqEEMPRPSESpdqmrrntcpqteycvtgNNGSESSPATTGALSTGSPPRE 980
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPR-------ERSPSPSPS------------------SPGSGPAPSSPRASSSSSSSRE 323
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1370455087  981 NPShptASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTAStlsmglPPSRTPSHPTATVLSTgSPPSESPSRT 1057
Cdd:PHA03307   324 SSS---SSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS------PRKRPRPSRAPSSPAA-SAGRPTRRRA 390
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.22e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 1370455087  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
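Each alignment block above pairs a query row (gi ...) with a domain-model row (Cdd:...). A rough identity count over such a pair can be sketched in Python as follows. This is only an illustration: alignment_identity is a hypothetical helper, and it assumes that "-" marks a gap and that lowercase letters mark residues not aligned to the model, which is how this listing appears to use them.

```python
def alignment_identity(query_row, model_row):
    """Count identical and aligned columns in one pair of alignment rows.

    Both arguments are the residue strings only (no labels or coordinates).
    Columns with a gap ('-') or a lowercase letter in either row are skipped;
    treating lowercase as "unaligned" is an assumption about the display.
    Returns (identical, aligned).
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # gap column
        if q.islower() or m.islower():
            continue  # assumed unaligned residue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Usage with the single-block smart00091 alignment from the listing above.
query = "FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLK"
model = "ILYANPAAEELLGYSPEELIGKSLLELIHPEDRERVQEALQRLLS"
identical, aligned = alignment_identity(query, model)
print(f"{identical}/{aligned} identical aligned columns")
```

For alignments that span several blocks, the residue strings of each row would need to be concatenated in order before calling the helper.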
 
Name Accession Description Interval E-value
PAS_11 pfam14598
PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), ...
274-376 2.98e-09

PAS domain; This family includes the PAS-B domain of NCOA1 (Nuclear receptor coactivator 1), which binds to an LXXLL motif in the C-terminal region of STAT6 (Signal transducer and activator of transcription 6).


Pssm-ID: 464214 [Multi-domain]  Cd Length: 110  Bit Score: 55.76  E-value: 2.98e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  274 FTTTHTPGCVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHppfEHSPI-RFCTQNGDYIIL 352
Cdd:pfam14598    4 FTTRHDIDGKIISCDTRAPFSLGYEKDELVGRSIYDLVHPQDLRTAKSHLREIIQTRGR---ATSPSyRLRLRDGDFLSV 80
                           90       100
                   ....*....|....*....|....
gi 1370455087  353 DSSWSSFVNPWSRKISFIIGRHKV 376
Cdd:pfam14598   81 HTKSKLFLNQNSNQQPFIMCTHTI 104
PAS pfam00989
PAS fold; The PAS fold corresponds to the structural domain that has previously been defined ...
271-370 3.84e-09

PAS fold; The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya. This domain can bind gases (O2, CO and NO), FAD, 4-hydroxycinnamic acid and NAD+ (Matilla et al., FEMS Microbiology Reviews, fuab043, 45, 2021, 1. https://doi.org/10.1093/femsre/fuab043).


Pssm-ID: 395786 [Multi-domain]  Cd Length: 113  Bit Score: 55.50  E-value: 3.84e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  271 KRIFTTTHTPGCV------FLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKyAGHPPFEHSpIRFCT 344
Cdd:pfam00989    4 RAILESLPDGIFVvdedgrILYVNAAAEELLGLSREEVIGKSLLDLIPEEDDAEVAELLRQALL-QGEESRGFE-VSFRV 81
                           90       100
                   ....*....|....*....|....*.
gi 1370455087  345 QNGDYIILDSSWSSFVNPWSRKISFI 370
Cdd:pfam00989   82 PDGRPRHVEVRASPVRDAGGEILGFL 107
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
744-1054 1.62e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 55.93  E-value: 1.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  744 LPEPPDSSSSNTGSgprrgAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPylvPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:pfam03154  148 IPSPQDNESDSDSS-----AQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAA---TAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  824 LHGLPLSEG--LQPYPAFPFPYLDTfmtvflPDPPVCPLLSPSflpcpflgatassaispsmssamsptldPPPSVTSQR 901
Cdd:pfam03154  220 QTQSTAAPHtlIQQTPTLHPQRLPS------PHPPLQPMTQPP----------------------------PPSQVSPQP 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  902 REEEKWEAQSE--GHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYCVTGNNGSESS--PATTGALSTGSP 977
Cdd:pfam03154  266 LPQPSLHGQMPpmPHSLQTGPSHMQHPVP------PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQLQSQQP 339
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1370455087  978 PRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmkNPSHPTASTLSMGLPPSrtPSHPTATVLSTGSPPSESP 1054
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPP--HLSGPSPFQMNSNLPPP--PALKPLSSLSTHHPPSAHP 412
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
741-1055 1.89e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 1.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  741 RKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSP-------------TFPPAAMVPSQAPYLVPAFPL-PA 806
Cdd:PHA03307    64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtppgpsspdppppTPPPASPPPSPAPDLSEMLRPvGS 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  807 ATSPGREYAAPGTAPEGLHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPSFLPCPFLGATASSAISPSMSSA 886
Cdd:PHA03307   144 PGPPPAASPPAAGASPAAVASDAASSRQAALPLS-------SPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  887 MsptldPPPSVTSQRREEEKWEAQSEGhpfiTSRSSSPLQLNLLQEEMPRPSESPDqmRRNTCPQTEycVTGNN-GSESS 965
Cdd:PHA03307   217 A-----SSPAPAPGRSAADDAGASSSD----SSSSESSGCGWGPENECPLPRPAPI--TLPTRIWEA--SGWNGpSSRPG 283
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  966 PATTGALSTGSPPRENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPmKNPSHPTASTLSMGLPPSRTPSHPTATvlS 1045
Cdd:PHA03307   284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS-SESSRGAAVSPGPSPSRSPSPSRPPPP--A 360
                          330
                   ....*....|
gi 1370455087 1046 TGSPPSESPS 1055
Cdd:PHA03307   361 DPSSPRKRPR 370
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1052 2.61e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  747 PPDSSSSNTGSGPRRGAhQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSPgreyaAPGTAPEGLHG 826
Cdd:PHA03247  2742 PAVPAGPATPGGPARPA-RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDP-----ADPPAAVLAPA 2815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  827 LPLSEGLQPYPAFPFPyldtfmTVFLPDPPVCPllsPSFLPCPFlgATASSAISPSMSSAMSPTLDPPPSVTSQRREEEK 906
Cdd:PHA03247  2816 AALPPAASPAGPLPPP------TSAQPTAPPPP---PGPPPPSL--PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR 2884
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  907 WEAQSEghpfiTSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGALSTGSPPRENPS--- 983
Cdd:PHA03247  2885 RLARPA-----VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSgav 2959
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1370455087  984 -HPTASALSTGSPPMKN----PSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSE 1052
Cdd:PHA03247  2960 pQPWLGALVPGRVAVPRfrvpQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDD 3033
PHA03247 PHA03247
large tegument protein UL36; Provisional
731-1055 4.08e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  731 SAGCRKGKHKRKKLPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLPAATSP 810
Cdd:PHA03247  2656 PAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  811 GREYAAPGTaPEGlHGLPLSEGLQPYPAFPfpyldtfMTVFLPDPPVCPLLSPS-FLPCPflgATASSAISPSMSSAMSP 889
Cdd:PHA03247  2736 PAAPAPPAV-PAG-PATPGGPARPARPPTT-------AGPPAPAPPAAPAAGPPrRLTRP---AVASLSESRESLPSPWD 2803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  890 TLDPPPSVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNllqeemPRPSESPDQMRRNTCPQTEYcvtGNNGSESSPATT 969
Cdd:PHA03247  2804 PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP------PGPPPPSLPLGGSVAPGGDV---RRRPPSRSPAAK 2874
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  970 GALSTGSP----PRENPSHPTASaLSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSrtPSHPTATVLS 1045
Cdd:PHA03247  2875 PAAPARPPvrrlARPAVSRSTES-FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPAG 2951
                          330
                   ....*....|
gi 1370455087 1046 TGSPPSESPS 1055
Cdd:PHA03247  2952 AGEPSGAVPQ 2961
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-1054 5.57e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 5.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  747 PPDSSSSNTGSGPRRGAHQNAQPccpsAASSPHTSSPTFPPAAmvPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHG 826
Cdd:PHA03247  2592 PPQSARPRAPVDDRGDPRGPAPP----SPLPPDTHAPDPPPPS--PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR 2665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  827 LPLSEGLQPYPAFPfpyldtfmtvflPDPPVCPLLSPSFLPCPFLGatassaispsmssamsptlDPPPSvtsQRREEEK 906
Cdd:PHA03247  2666 RARRLGRAAQASSP------------PQRPRRRAARPTVGSLTSLA-------------------DPPPP---PPTPEPA 2711
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  907 WEAQSEGHPfitsrssSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEycvtGNNGSESSPATTGALSTGSPPRENPSHP- 985
Cdd:PHA03247  2712 PHALVSATP-------LPPGPAAARQASPALPAAPAPPAVPAGPATP----GGPARPARPPTTAGPPAPAPPAAPAAGPp 2780
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370455087  986 ---TASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPShPTATVLSTGSPPSESP 1054
Cdd:PHA03247  2781 rrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
744-1055 1.06e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  744 LPEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSP-------HTSSPTFPPAAMVPSQAPYlVPAfpLPAATSPGREYAA 816
Cdd:PHA03247  2624 PDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrvsrprRARRLGRAAQASSPPQRPR-RRA--ARPTVGSLTSLAD 2700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  817 PGTAPEGlhglPLSEGLQPYPAFPFPYLDTFMTVFLPDPPVCPLLSPSflpcpflgATASSAISPSMSSAMSPTLDPPPS 896
Cdd:PHA03247  2701 PPPPPPT----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAV--------PAGPATPGGPARPARPPTTAGPPA 2768
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  897 VTSQRreeekweAQSEGHPFITSRSSSPlQLNLLQEEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGALSTGS 976
Cdd:PHA03247  2769 PAPPA-------APAAGPPRRLTRPAVA-SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  977 PPRE--NPSHPTASALSTGSP-PMKNPSHPTASALSTGS-PPMKNPSHP--TASTLSMGLPPS-----RTPSHPTATVLS 1045
Cdd:PHA03247  2841 PPPGppPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAAPArPPVRRLARPavSRSTESFALPPDqperpPQPQAPPPPQPQ 2920
                          330
                   ....*....|
gi 1370455087 1046 TGSPPSESPS 1055
Cdd:PHA03247  2921 PQPPPPPQPQ 2930
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
745-1055 3.84e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.76  E-value: 3.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  745 PEPPDSSSSNTGSGPRRGAHQNAQPCCPSAASSPHTSSPTFPPAAMVPSQAPYLVPAFPLP-------AATSPGREYAAP 817
Cdd:pfam03154  185 SPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPhpplqpmTQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  818 GTAPEGLHGL--PLSEGLQ--------PYPAFPFPYLDTFMTVFLPDPPVCPLLSPS----FLPCPFLGATASSAISPSM 883
Cdd:pfam03154  265 PLPQPSLHGQmpPMPHSLQtgpshmqhPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSqqriHTPPSQSQLQSQQPPREQP 344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  884 SSAMSPTL---DPPPSVTSQRREEekweAQSEGHPfitSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQTEYcvtgnn 960
Cdd:pfam03154  345 LPPAPLSMphiKPPPTTPIPQLPN----PQSHKHP---PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAH------ 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  961 gsessPATTGALSTGSPPRENPSHPTASALSTGSPPmKNPSHPTASALSTGSPPMKNPSHPTASTLSMGLPPSRTPSHPT 1040
Cdd:pfam03154  412 -----PPPLQLMPQSQQLPPPPAQPPVLTQSQSLPP-PAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTST 485
                          330
                   ....*....|....*
gi 1370455087 1041 ATVLSTGSPPSESPS 1055
Cdd:pfam03154  486 SSAMPGIQPPSSASV 500
KinA COG5805
Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle ...
273-373 3.22e-03

Sporulation sensor histidine kinase A (Stage II sporulation protein SpoIIF/SpoIIJ) [Cell cycle control, cell division, chromosome partitioning, Signal transduction mechanisms];


Pssm-ID: 444507 [Multi-domain]  Cd Length: 496  Bit Score: 41.64  E-value: 3.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  273 IFTTTHTPGcVFLEVDEKAVPLLGYLPQDLIGTSILSYLHPEDRSLMVAIHQKVLKYAGHPPFEHSPIrfcTQNGDYIIL 352
Cdd:COG5805    169 LICVIDTDG-RILFINESIERLFGAPREELIGKNLLELLHPCDKEEFKERIESITEVWQEFIIEREII---TKDGRIRYF 244
                           90       100
                   ....*....|....*....|..
gi 1370455087  353 DSSWSSFVNP-WSRKISFIIGR 373
Cdd:COG5805    245 EAVIVPLIDTdGSVKGILVILR 266
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
931-1078 5.38e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.31  E-value: 5.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  931 QEEMPRPSESPDQMRRNTCPQTEYCVTGNNGSESSPATTGALSTGSPPRENPSHPTASALSTGS--PPMKNPSHPTASAL 1008
Cdd:PHA03307    69 TGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPdlSEMLRPVGSPGPPP 148
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087 1009 STGSPPMKNPSHPTASTLSMGLPPSRTPSHPTATVLSTGSPPSESPSRTGSAASGSSDSSIYLTSSVYSS 1078
Cdd:PHA03307   149 AASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASAS 218
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
950-1057 5.75e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 40.84  E-value: 5.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  950 PQTEYCVTGNNGSESSPATTGALSTGSPprENPSHPTASALSTGSPPMKNPSHPTASALSTGSPPMKNPSHptastlSMG 1029
Cdd:PLN02217   556 PYIPGLFAGNPGSTNSTPTGSAASSNTT--FSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS------HLG 627
                           90       100
                   ....*....|....*....|....*...
gi 1370455087 1030 LPPSrTPSHPTATVLSTGSpPSESPSRT 1057
Cdd:PLN02217   628 SPST-TPSSPESSIKVAST-ETASPESS 653
PHA03379 PHA03379
EBNA-3A; Provisional
745-1054 7.86e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 40.43  E-value: 7.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  745 PEPPDSSSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTfPPAAMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEG 823
Cdd:PHA03379   425 PEVPQSLETATSHGSAQVPEPPpVHDLEPGPLHDQHSMAPC-PVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAG 503
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  824 LHGLPLSEGLQPYPAFPF-PYLDTFMTV-FLPDP------PVCPLLSPSFLPCPflGATASSAISPSMSSAMSPTLDPPP 895
Cdd:PHA03379   504 PIVRPWEASLSQVPGVAFaPVMPQPMPVePVPVPtvalerPVCPAPPLIAMQGP--GETSGIVRVRERWRPAPWTPNPPR 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  896 SVTSQRREEEKWEAQSEGHPFITSRSSSPLQLNLL--QEEMPRPSESPDQMRRNTCPQTEYCVTGNNG-----------S 962
Cdd:PHA03379   582 SPSQMSVRDRLARLRAEAQPYQASVEVQPPQLTQVspQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGvpamqpqyfdlP 661
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  963 ESSPATTGALST-------GSPPR--ENPSH---PTASALSTGSP--------PMKNPSHPtASALSTGSPPMKNPSHPT 1022
Cdd:PHA03379   662 LQQPISQGAPLAplrasmgPVPPVpaTQPQYfdiPLTEPINQGASaahflpqqPMEGPLVP-ERWMFQGATLSQSVRPGV 740
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1370455087 1023 ASTLSMGLPPSRTPSHPTATVLSTGSPPSESP 1054
Cdd:PHA03379   741 AQSQYFDLPLTQPINHGAPAAHFLHQPPMEGP 772
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
751-1007 8.07e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.24  E-value: 8.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  751 SSSNTGSGPRRGAHQN-AQPCCPSAASSPHTSSPTFPPA--AMVPSQAPYLVPAFPLPAATSPGREYAAPGTAPEGLHGL 827
Cdd:PRK12323   366 GQSGGGAGPATAAAAPvAQPAPAAAAPAAAAPAPAAPPAapAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  828 PLSEGLQPYPAFPFPyldtfmtvfLPDPPVCPLLSPSFLpcpflgATASSAISPSMSSAMSPTLDPPPsvtsqrreeekW 907
Cdd:PRK12323   446 GAPAPAPAPAAAPAA---------AARPAAAGPRPVAAA------AAAAPARAAPAAAPAPADDDPPP-----------W 499
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370455087  908 EAQSEGHPFITSRSSSPLQLNLLQEEMPRPSESPDQMRRNTCPQteycvtgnngsESSPATTGALSTGSPPRENPSHPTA 987
Cdd:PRK12323   500 EELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAP-----------APAAAPAPRAAAATEPVVAPRPPRA 568
                          250       260
                   ....*....|....*....|
gi 1370455087  988 SAlsTGSPPMKNPSHPTASA 1007
Cdd:PRK12323   569 SA--SGLPDMFDGDWPALAA 586
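
The start and end coordinates printed on each alignment row can be sanity-checked against the residues between them. The following is a minimal sketch, assuming that `-` marks a gap and that every letter (upper or lower case) counts as one residue; the names `ROW_RE` and `row_is_consistent` are hypothetical helpers, not part of any NCBI tool.

```python
import re

# One alignment row: a label ("gi <number>" or "Cdd:<accession>"),
# a start coordinate, the aligned string, and an end coordinate.
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)

def row_is_consistent(line):
    """Return True if end == start + (number of non-gap residues) - 1."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError("not an alignment row: %r" % line)
    # Assumption: '-' is a gap; any letter counts as a residue.
    residues = sum(1 for ch in m.group("seq") if ch != "-")
    return int(m.group("start")) + residues - 1 == int(m.group("end"))

# The last two rows of the PRK12323 alignment in this report.
query_row = "gi 1370455087  988 SAlsTGSPPMKNPSHPTASA 1007"
subject_row = "Cdd:PRK12323   569 SA--SGLPDMFDGDWPALAA 586"
print(row_is_consistent(query_row), row_is_consistent(subject_row))
```

A row that fails this check usually points to a truncated or wrapped line in the copied text rather than to a problem in the alignment itself.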
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
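
Each hit in this report carries a summary line with its Pssm-ID, Cd Length, Bit Score and E-value, and the search was run with an E-value threshold of 0.01. The sketch below shows one way to read those summary lines from the plain text and keep the hits at or below a chosen threshold. The function names are hypothetical, the pattern is fitted only to the lines as they appear here, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for hit summary lines as they appear in this report, e.g.
# "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.08e-06"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }

def hits_at_or_below(lines, threshold=0.01):
    """Keep parsed hits whose E-value is <= threshold (inclusive by assumption)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]

# Two summary lines copied from this report.
report_lines = [
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.08e-06",
    "Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 40.24  E-value: 8.07e-03",
]
for hit in hits_at_or_below(report_lines, threshold=1e-3):
    print(hit["pssm_id"], hit["e_value"])
```

Lowering the threshold in this way separates the stronger hits from the low-scoring matches in the proline- and serine-rich region, which make up most of the multi-domain hits listed above.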

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.