Conserved domains on  [gi|1487193411|ref|NP_001353403|]

histone-lysine N-methyltransferase MECOM isoform i [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
zf-H2C2_2 pfam13465
Zinc-finger double domain;
414-438 1.99e-05

Zinc-finger double domain;



Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.59  E-value: 1.99e-05
                          10        20
                  ....*....|....*....|....*
gi 1487193411 414 NLTRHLRTHTGEQPYRCKYCDRSFS 438
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
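Each hit record on this page carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be parsed, assuming the field labels and order are exactly as shown here; the names `HitSummary` and `parse_summary` are hypothetical and not part of any NCBI tool.

```python
import re
from dataclasses import dataclass


@dataclass
class HitSummary:
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the labels and their order match the lines shown on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not fit the layout."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["score"]),
        e_value=float(m["evalue"]),
    )


hit = parse_summary(
    "Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.59  E-value: 1.99e-05"
)
print(hit)
```

Parsed this way, the hits can be filtered or sorted by `e_value` instead of being compared by eye.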
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
400-422 8.32e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.



Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.98  E-value: 8.32e-05
                          10        20
                  ....*....|....*....|...
gi 1487193411 400 YTCRYCGKIFPRSANLTRHLRTH 422
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
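The C2H2 pattern quoted in the description above, `#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]`, can be written as a regular expression. This is only a sketch: the description says the `#` positions matter for the fold but does not list the allowed residues, so they are treated here as any residue (an assumption), which makes the pattern more permissive than the pfam00096 profile used for the actual hit. The function name `find_c2h2` is hypothetical.

```python
import re

# Direct transcription of  #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C].
# Assumption: '#' (fold-stabilising position) is allowed to be any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"        # first conserved cysteine
    r".{1,5}"   # X(1-5)
    r"C"        # second conserved cysteine
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"        # first conserved histidine
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final position: His or Cys
)


def find_c2h2(seq: str) -> list[tuple[int, int, str]]:
    """Return (start, end, match) for non-overlapping matches, 1-based inclusive."""
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(seq.upper())
    ]


# Query segments copied from the alignments on this page.
for fragment in (
    "YTCRYCGKIFPRSANLTRHLRTH",    # residues 400-422 (zf-C2H2 hit)
    "FKCHLCDRCFGQQTNLDRHLKKH",    # residues 457-479 (zf-C2H2 hit)
    "NLTRHLRTHTGEQPYRCKYCDRSFS",  # residues 414-438 (zf-H2C2_2 hit)
):
    print(fragment, find_c2h2(fragment))
```

The two zf-C2H2 segments match over their full length. The 414-438 segment does not, because that hit spans the end of one finger and the start of the next rather than a single complete finger.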
PHA00733 super family cl26169
hypothetical protein
333-452 1.16e-04

hypothetical protein


The actual alignment was detected with superfamily member PHA00733:

Pssm-ID: 177301  Cd Length: 128  Bit Score: 42.56  E-value: 1.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 333 EKYLRPSPGFLFHPQMsaienmaEKLESFSALKPEASELLQSVPSMFNFRapPNALPEN------LLRKGKERYTCRYCG 406
Cdd:PHA00733   10 KKYLSNHKGIFIHVTL-------EELKRYHSLTPEQKRLIRAVVKTLIYN--PQLLDESsylyklLTSKAVSPYVCPLCL 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1487193411 407 KIFPRSANLTRHLRThtGEQPYRCKYCDRSFSISSNLQRHVRNIHN 452
Cdd:PHA00733   81 MPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKHN 124
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
101-443 3.52e-04

FOG: Zn-finger [General function prediction only];



Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 43.53  E-value: 3.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 101 REYKCDQCPKAFNWKSNLIRHQMSHDSGKHYEC--ENCAKVFTDPSNLQRHIRSQHVGARAHACPECGKTFATSSGLKQH 178
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCsySGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 179 KHIHSSVKPFISFSQSM----YPFPDRDLRSLPLKMEPQSPGE--VKKLQKGSSESPFDLTTKRKDEKPLTPVPSKPPVT 252
Cdd:COG5048   112 SSSSNSNDNNLLSSHSLppssRDPQLPDLLSISNLRNNPLPGNnsSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSN 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 253 PATSQDQPLDLSMGSRSRASGTKLTEPRKNHVFggkkgSNVESRPASDGSLQHarptpffmDPIYRVEKRKLTDPLEALK 332
Cdd:COG5048   192 VSTSIPSSSENSPLSSSYSIPSSSSDQNLENSS-----SSLPLTTNSQLSPKS--------LLSQSPSSLSSSDSSSSAS 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 333 EKYLRPSPgFLFHPQMSAIENMAEKLESFSALKPEASELLQSvPSMFNFRAPPNALPENLlrKGKERYTCRY--CGKIFP 410
Cdd:COG5048   259 ESPRSSLP-TASSQSSSPNESDSSSEKGFSLPIKSKQCNISF-SRSSPLTRHLRSVNHSG--ESLKPFSCPYslCGKLFS 334
                         330       340       350
                  ....*....|....*....|....*....|...
gi 1487193411 411 RSANLTRHLRTHTGEQPYRCKYCDRSFSISSNL 443
Cdd:COG5048   335 RNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367
SET super family cl40432
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
1-16 1.51e-03

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


The actual alignment was detected with superfamily member cd19214:

Pssm-ID: 394802  Cd Length: 158  Bit Score: 39.92  E-value: 1.51e-03
                          10
                  ....*....|....*.
gi 1487193411   1 MKSEDYPHETMAPDIH 16
Cdd:cd19214   143 MKSEDYSHETMAPDIH 158
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
457-479 2.40e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.



Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 2.40e-03
                          10        20
                  ....*....|....*....|...
gi 1487193411 457 FKCHLCDRCFGQQTNLDRHLKKH 479
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
 
 
Name Accession Description Interval E-value
zf-H2C2_2 pfam13465
Zinc-finger double domain;
414-438 1.99e-05

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 41.59  E-value: 1.99e-05
                          10        20
                  ....*....|....*....|....*
gi 1487193411 414 NLTRHLRTHTGEQPYRCKYCDRSFS 438
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
400-422 8.32e-05

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 39.98  E-value: 8.32e-05
                          10        20
                  ....*....|....*....|...
gi 1487193411 400 YTCRYCGKIFPRSANLTRHLRTH 422
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
PHA00733 PHA00733
hypothetical protein
333-452 1.16e-04

hypothetical protein


Pssm-ID: 177301  Cd Length: 128  Bit Score: 42.56  E-value: 1.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 333 EKYLRPSPGFLFHPQMsaienmaEKLESFSALKPEASELLQSVPSMFNFRapPNALPEN------LLRKGKERYTCRYCG 406
Cdd:PHA00733   10 KKYLSNHKGIFIHVTL-------EELKRYHSLTPEQKRLIRAVVKTLIYN--PQLLDESsylyklLTSKAVSPYVCPLCL 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 1487193411 407 KIFPRSANLTRHLRThtGEQPYRCKYCDRSFSISSNLQRHVRNIHN 452
Cdd:PHA00733   81 MPFSSSVSLKQHIRY--TEHSKVCPVCGKEFRNTDSTLDHVCKKHN 124
COG5048 COG5048
FOG: Zn-finger [General function prediction only];
101-443 3.52e-04

FOG: Zn-finger [General function prediction only];


Pssm-ID: 227381 [Multi-domain]  Cd Length: 467  Bit Score: 43.53  E-value: 3.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 101 REYKCDQCPKAFNWKSNLIRHQMSHDSGKHYEC--ENCAKVFTDPSNLQRHIRSQHVGARAHACPECGKTFATSSGLKQH 178
Cdd:COG5048    32 RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCsySGCDKSFSRPLELSRHLRTHHNNPSDLNSKSLPLSNSKASSSSLS 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 179 KHIHSSVKPFISFSQSM----YPFPDRDLRSLPLKMEPQSPGE--VKKLQKGSSESPFDLTTKRKDEKPLTPVPSKPPVT 252
Cdd:COG5048   112 SSSSNSNDNNLLSSHSLppssRDPQLPDLLSISNLRNNPLPGNnsSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSN 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 253 PATSQDQPLDLSMGSRSRASGTKLTEPRKNHVFggkkgSNVESRPASDGSLQHarptpffmDPIYRVEKRKLTDPLEALK 332
Cdd:COG5048   192 VSTSIPSSSENSPLSSSYSIPSSSSDQNLENSS-----SSLPLTTNSQLSPKS--------LLSQSPSSLSSSDSSSSAS 258
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 333 EKYLRPSPgFLFHPQMSAIENMAEKLESFSALKPEASELLQSvPSMFNFRAPPNALPENLlrKGKERYTCRY--CGKIFP 410
Cdd:COG5048   259 ESPRSSLP-TASSQSSSPNESDSSSEKGFSLPIKSKQCNISF-SRSSPLTRHLRSVNHSG--ESLKPFSCPYslCGKLFS 334
                         330       340       350
                  ....*....|....*....|....*....|...
gi 1487193411 411 RSANLTRHLRTHTGEQPYRCKYCDRSFSISSNL 443
Cdd:COG5048   335 RNDALKRHILLHTSISPAKEKLLNSSSKFSPLL 367
zf-H2C2_2 pfam13465
Zinc-finger double domain;
89-113 4.33e-04

Zinc-finger double domain;


Pssm-ID: 463886 [Multi-domain]  Cd Length: 26  Bit Score: 38.12  E-value: 4.33e-04
                          10        20
                  ....*....|....*....|....*
gi 1487193411  89 SLEKHMLSHTEEREYKCDQCPKAFN 113
Cdd:pfam13465   1 NLKRHMRTHTGEKPYKCPECGKSFK 25
PR-SET_PRDM3 cd19214
PR-SET domain found in MDS1 and EVI1 complex locus protein and similar proteins; PRDM3 (also ...
1-16 1.51e-03

PR-SET domain found in MDS1 and EVI1 complex locus protein and similar proteins; PRDM3 (also termed MDS1 and EVI1 complex locus protein, ecotropic virus integration site 1 protein, EVI-1, myelodysplasia syndrome 1 protein, myelodysplasia syndrome-associated protein 1, or MECOM) is a nuclear transcription factor, which is essential for the proliferation/maintenance of hematopoietic stem cells (HSCs). It is closely related to paralog PRDM16, both of which are directly linked to various aspects of oncogenic transformation.


Pssm-ID: 380991  Cd Length: 158  Bit Score: 39.92  E-value: 1.51e-03
                          10
                  ....*....|....*.
gi 1487193411   1 MKSEDYPHETMAPDIH 16
Cdd:cd19214   143 MKSEDYSHETMAPDIH 158
InsA COG3677
Transposase InsA [Mobilome: prophages, transposons];
394-439 1.99e-03

Transposase InsA [Mobilome: prophages, transposons];


Pssm-ID: 442893 [Multi-domain]  Cd Length: 241  Bit Score: 40.62  E-value: 1.99e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1487193411 394 RKGKERYTCRYCGkifprSANLTRHLRTHTGEQPYRCKYCDRSFSI 439
Cdd:COG3677    11 IRWPNGPVCPHCG-----STRIVKNGKTRNGRQRYRCKDCGRTFTV 51
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
457-479 2.40e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 35.74  E-value: 2.40e-03
                          10        20
                  ....*....|....*....|...
gi 1487193411 457 FKCHLCDRCFGQQTNLDRHLKKH 479
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRTH 23
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
430-464 3.87e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15-bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 36.77  E-value: 3.87e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1487193411 430 CKYCDRSFSISSNLQrhvrnIHNKEKPFKCHLCDR 464
Cdd:cd20908     4 CYYCDREFDDEKILI-----QHQKAKHFKCHICHK 33
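The SUF4-like description above refers to a specific 7-bp element (5'-CGGAAAT-3') found in promoter regions. As an illustration only, the sketch below scans a DNA string for such an element on both strands by also searching for its reverse complement. The helper names and the `promoter` string are made up for the example and are not taken from the CDD record or from any real genome.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def find_motif(dna: str, motif: str) -> list[tuple[int, str]]:
    """Return sorted (0-based start, strand) pairs for exact motif occurrences.

    '+' means the motif itself was found; '-' means its reverse complement
    was found, i.e. the motif lies on the opposite strand.
    """
    dna = dna.upper()
    hits = []
    for strand, pattern in (("+", motif.upper()), ("-", reverse_complement(motif))):
        start = dna.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = dna.find(pattern, start + 1)
    return sorted(hits)


OSSUF4_ELEMENT = "CGGAAAT"            # 7-bp element quoted in the description
promoter = "TTCGGAAATGGCATTTCCGAA"    # made-up sequence for illustration

print(reverse_complement(OSSUF4_ELEMENT))
print(find_motif(promoter, OSSUF4_ELEMENT))
```

Exact string matching is the simplest choice here. Real binding-site searches usually tolerate mismatches or use position weight matrices, which this sketch does not attempt.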
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
199-286 4.05e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 40.15  E-value: 4.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1487193411 199 PDRDLRSLPLKMEPQSPGEVKKLQKGSSESPFDLTTKRKDEKPLTPVPSKPPVTPATSQDQPLDLSMGSRSR---ASGTK 275
Cdd:pfam13254 277 PSKSAEASTEKKEPDTESSPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPKDFRANLRSRevpKDKSK 356
                          90
                  ....*....|.
gi 1487193411 276 LTEPRKNHVFG 286
Cdd:pfam13254 357 KDEPEFKNVFG 367
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
131-152 6.94e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.58  E-value: 6.94e-03
                          10        20
                  ....*....|....*....|..
gi 1487193411 131 YECENCAKVFTDPSNLQRHIRS 152
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLRT 22
SUF4-like cd20908
N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), ...
136-178 6.97e-03

N-terminal domain of Oryza sativa transcription factor SUPPRESSOR OF FRI 4 (OsSUF4), Arabidopsis thaliana SUF4 (AtSUF4), and similar proteins; Oryza sativa SUPPRESSOR OF FRI 4 (OsSUF4) is a C2H2-type zinc finger transcription factor which interacts with the major H3K36 methyltransferase SDG725 to promote H3K36me3 (tri-methylation at H3K36) establishment. The transcription factor OsSUF4 recognizes a specific 7-bp DNA element (5'-CGGAAAT-3'), which is contained in the promoter regions of many genes throughout the rice genome. Through interaction with OsSUF4, SDG725 is recruited to the promoters of key florigen genes, RICE FLOWERING LOCUS T1 (RFT1) and Heading date 3a (Hd3a), for H3K36 deposition to promote gene activation and rice plant flowering. OsSUF4 target genes include a number of genes involved in many biological processes. Flowering plant Arabidopsis SUF4 binds to a 15bp DNA element (5'-CCAAATTTTAAGTTT-3') within the promoter of the floral repressor gene FLOWERING LOCUS C (FLC) and recruits the FRI-C transcription activator complex to the FLC promoter. Although the DNA-binding element and target genes of AtSUF4 are different from those of OsSUF4, AtSUF4 is known to interact with the Arabidopsis H3K36 methyltransferase SDG8 (also known as ASHH2/EFS/SET8), and the methylation deposition mechanism mediated by the SUF4 transcription factor and H3K36 methyltransferase may be conserved in Arabidopsis and rice. Proteins in this family have two conserved C2H2-type zinc finger motifs at the N-terminus (included in this model), and a large proline-rich domain at the C-terminus; for OsSUF4, it has been shown that the N-terminal zinc-finger domain is responsible for DNA binding, and that the C-terminal domain interacts with SDG725.


Pssm-ID: 411020 [Multi-domain]  Cd Length: 82  Bit Score: 36.00  E-value: 6.97e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 1487193411 136 CAKVFTDPSNLQRHIRSQHVgarahACPECGKTFATSSGLKQH 178
Cdd:cd20908     7 CDREFDDEKILIQHQKAKHF-----KCHICHKKLYTAGGLAVH 44
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
428-448 7.29e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.58  E-value: 7.29e-03
                          10        20
                  ....*....|....*....|.
gi 1487193411 428 YRCKYCDRSFSISSNLQRHVR 448
Cdd:pfam00096   1 YKCPDCGKSFSRKSNLKRHLR 21
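
One way to read an ungapped alignment block like the one above is to count the columns where the query and the domain model consensus carry the same residue. The sketch below does that for query residues 428-448 against pfam00096 positions 1-21. The function name `count_identities` is hypothetical, and plain identity is only a rough summary: the bit score and E-value reported by CD-Search come from a position-specific scoring matrix, not from identity counts.

```python
def count_identities(query: str, model: str) -> int:
    """Count aligned columns where both rows carry the same residue.

    Assumes an ungapped alignment, so both rows must have equal length.
    """
    if len(query) != len(model):
        raise ValueError("aligned rows must have the same length")
    return sum(1 for q, m in zip(query, model) if q == m)


# Rows copied from the alignment above (query 428-448 vs. pfam00096 1-21).
query_row = "YRCKYCDRSFSISSNLQRHVR"
model_row = "YKCPDCGKSFSRKSNLKRHLR"

identical = count_identities(query_row, model_row)
columns = len(query_row)
print(f"{identical}/{columns} identical columns "
      f"({100 * identical / columns:.1f}%)")
```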
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
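
Each hit on this page carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value), and the search reports only hits below the E-value threshold given above. The following minimal sketch parses such a summary line from the text of this page and compares its E-value with that threshold. The regular expression and the names `SUMMARY_RE`, `parse_summary` and `E_VALUE_THRESHOLD` are assumptions made for illustration; this is not an NCBI API, and the layout of the line is taken only from what is shown here.

```python
import re

# Threshold as listed under "Blast search parameters".
E_VALUE_THRESHOLD = 0.01

# Assumed layout of a hit summary line, as it appears in this page's text.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[0-9.]+)\s*"
    r"E-value:\s*(?P<e_value>[0-9.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one hit summary line into a dict, or return None if it does not fit."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Summary line copied from the last zf-C2H2 hit on this page.
line = "Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 34.58  E-value: 7.29e-03"
hit = parse_summary(line)
print(hit)
print("below threshold:", hit["e_value"] < E_VALUE_THRESHOLD)
```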

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.