Conserved domains on  [gi|23346420|ref|NP_006156|]

nuclear factor related to kappa-B-binding protein isoform 2 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
DEUBAD_NFRKB cd21865
DEUBAD domain found in nuclear factor related to kappa-B-binding protein (NFRKB) and similar ...
55-166 3.00e-53

DEUBAD domain found in nuclear factor related to kappa-B-binding protein (NFRKB) and similar proteins; NFRKB, also called DNA-binding protein R kappa-B, or INO80 complex subunit G (INO80G), is a regulatory component of the metazoan INO80 complex involved in chromatin remodeling, transcription regulation, DNA replication and DNA repair. It modulates the deubiquitinase activity of UCHL5 in the INO80 complex. It binds to the DNA consensus sequence 5'-GGGGAATCTCC-3'. The model corresponds to the DEUBAD domain (conserved domain within the UCH regulatory proteins RPN13, NFRKB/INO80G, and ASX) of NFRKB, which binds primarily to the C-terminal ULD domain of UCH-L5.



Pssm-ID: 439381  Cd Length: 112  Bit Score: 181.64  E-value: 3.00e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   55 LLEDPEIFFDVVSLSTWQEVLSDSQREHLQQFLPQFPEDSAEQQNELILALFSGENFRFGNPLHIAQKLFRDGHFNPEVV 134
Cdd:cd21865    1 LCEDLEIFKEVLSLETWNSLLSEEEREHLMQFLPQFPENDEEEKEETLRMLFSGENFHFGNPLDKFQEKLKAGHFHPDIA 80
                         90       100       110
                 ....*....|....*....|....*....|..
gi 23346420  135 KYRQLCFKSQYKRYLNSQQQYFHRLLKQILAS 166
Cdd:cd21865   81 KYRKLLRKAQRKEYKYRLRKYHNRLLKDLLLS 112
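The DEUBAD description above quotes a DNA consensus sequence, 5'-GGGGAATCTCC-3'. As a rough illustration of how such a consensus might be located in a nucleotide sequence, the sketch below scans both strands for exact matches. The function names and the test sequences are hypothetical, and exact string matching is an assumption made for simplicity; it is not how binding sites are predicted in practice.

```python
# Exact-match scan for the consensus quoted in the cd21865 description.
# Names and behaviour are illustrative assumptions, not part of CD-Search.
MOTIF = "GGGGAATCTCC"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(seq, motif=MOTIF):
    """Return sorted (0-based start, strand) pairs for exact motif matches."""
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)
```

A match reported on the "-" strand means the reverse complement of the consensus occurs at that position of the sequence as given.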
NFRKB_winged pfam14465
NFRKB Winged Helix-like; This domain covers regions 370-495 of human nuclear factor related to ...
417-503 2.58e-41

NFRKB Winged Helix-like; This domain covers regions 370-495 of human nuclear factor related to kappaB binding (NFRKB) protein.



Pssm-ID: 433973  Cd Length: 103  Bit Score: 147.11  E-value: 2.58e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420    417 QASLPMLEERVLDWQSSPASSLNSWFSAAPNWAELVLPALQYLAGESR-AVPSSFSPFVEFKEKTQQWKLLGQSQDNEKE 495
Cdd:pfam14465   16 RATLSELEELVKDWQSSPASPLNDWFSLVPDWSELVQSALQFLAGDSPdALPPDFVPYVEYKEQLQIWQWIGAGRDSDKR 95

                   ....*...
gi 23346420    496 LAALFQLW 503
Cdd:pfam14465   96 LSALCQLW 103
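The numbers on the `Cdd:` rows appear to be positions in the domain model, and `Cd Length` its total length, so the fraction of the model covered by a hit can be estimated from them. The sketch below assumes that reading of the output; `model_coverage` is a hypothetical helper name.

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by an alignment.

    Positions are 1-based and inclusive, as printed on the Cdd: rows.
    """
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("positions must satisfy 1 <= start <= end <= cd_length")
    return (model_end - model_start + 1) / cd_length


# Values read from the two hits above.
deubad = model_coverage(1, 112, 112)   # cd21865: model positions 1-112 of 112
winged = model_coverage(16, 103, 103)  # pfam14465: model positions 16-103 of 103
```

Under this reading the DEUBAD hit spans the whole model, while the winged helix hit omits the first 15 model positions.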
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
793-939 1.89e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.80  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420    793 TMPHLGTMLSPASSQTAPSSqAAARVVSHSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTlpQMPAGPQIRVPATATQTKV 872
Cdd:pfam17823   90 HTPHGTDLSEPATREGAADG-AASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRA--AACRANASAAPRAAIAAAS 166
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 23346420    873 VPQTVM-ATVPVKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGTSAPSASTA-AVIQNVT 939
Cdd:pfam17823  167 APHAASpAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTAlAAVGNSS 235
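In the alignment above, dashes mark gaps and lowercase letters seem to mark residues that are not aligned to the other sequence. The following sketch counts aligned columns and identical residues for one pair of rows under that assumption. The helper name is hypothetical, and the two strings are copied from the first row pair of the pfam17823 alignment.

```python
def aligned_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# First row pair of the pfam17823 alignment (query residues 793-872).
QUERY_ROW = ("TMPHLGTMLSPASSQTAPSSqAAARVVSHSGSAGLSQVRVV"
             "AQPSLPAVPQQSGGPAQTlpQMPAGPQIRVPATATQTKV")
SUBJECT_ROW = ("HTPHGTDLSEPATREGAADG-AASRALAAAASSSPSSAAQS"
               "LPAAIAALPSEAFSAPRA--AACRANASAAPRAAIAAAS")

identical, aligned = aligned_identity(QUERY_ROW, SUBJECT_ROW)
print(f"{identical}/{aligned} aligned columns identical")
```

Dividing `identical` by `aligned` gives a percent identity over aligned columns only; other definitions use the full alignment length as the denominator.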
 
PHA03247 PHA03247
large tegument protein UL36; Provisional
722-931 2.57e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 2.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   722 VLSSGPSEQSQMSLSDSS--MPPTPVTPVTPTTPALPAIPISPPPVSAvnKSGPSTVSEPAKSSSGVLLVSSPTMP--HL 797
Cdd:PHA03247 2788 VASLSESRESLPSPWDPAdpPAAVLAPAAALPPAASPAGPLPPPTSAQ--PTAPPPPPGPPPPSLPLGGSVAPGGDvrRR 2865
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   798 GTMLSPASSQTAPSSQAAARVvshSGSAGLSQVRVVAQPSLPAVPQ-QSGGPAQTLPQMPAGPQIRvPATATQTKVVPQT 876
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRL---ARPAVSRSTESFALPPDQPERPpQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQP 2941
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 23346420   877 VMATVPVKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPgTSAPSAST 931
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSRE-APASSTPP 2995
PHA03378 PHA03378
EBNA-3B; Provisional
759-936 1.45e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.21  E-value: 1.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   759 PISPPPVSAVNKSGPSTVSEPAKSSSGVLLVSSPTMPHLGTMLSPASSQTAPSSQAAARVVSHSGSAGLSQVRVVAQPSL 838
Cdd:PHA03378  703 PMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQA 782
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   839 PAVPQQ--SGGPA-QTLPQMPAGPQIRVPATATQTKVVPQTVMATV--------------PVKAQTTAATVQRPGPGQ-T 900
Cdd:PHA03378  783 PPAPQQrpRGAPTpQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLltggvkrgrpslkkPAALERQAAAGPTPSPGSgT 862
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 23346420   901 GLTVTSLPATASPVSKPAT-----SSPGTSAPSASTAAVIQ 936
Cdd:PHA03378  863 SDKIVQAPVFYPPVLQPIQvmrqlGSVRAAAASTVTQAPTE 903
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
800-933 1.78e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 1.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   800 MLSPASSQTAPS-SQAAARVVSHSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTLPqmPAGPQIRVPATATQTKVVPQTVM 878
Cdd:PRK07764  362 MLLPSASDDERGlLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA--PAAAAAPAPAAAPQPAPAPAPAP 439
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 23346420   879 A--TVPVKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGTSAPSASTAA 933
Cdd:PRK07764  440 AppSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496
PRK10856 PRK10856
cytoskeleton protein RodZ;
819-921 1.27e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 42.71  E-value: 1.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   819 VSHSgSAGLSQVRVVAQP---SLPAVPQQSGGPAQTLPQMPAGPQIRVPATATQTKVVPQTVMATVPVKAQTTAATVQRP 895
Cdd:PRK10856  147 ADQS-SAELSQNSGQSVPldtSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAP 225
                          90       100
                  ....*....|....*....|....*.
gi 23346420   896 GPGQTGLTVTSLPATASPVSKPATSS 921
Cdd:PRK10856  226 AAPATPDGAAPLPTDQAGVSTPAADP 251
PRK11901 PRK11901
hypothetical protein; Reviewed
758-928 1.39e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.36  E-value: 1.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   758 IPISPPPVSAVNKSGPSTVSEPAKSSSGVLLVSSPTMPHLGTM--LSPASSQTAPSSQAAA--RV-VSHSGSAGLSQvrv 832
Cdd:PRK11901   79 IDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQDISAppISPTPTQAAPPQTPNGqqRIeLPGNISDALSQ--- 155
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   833 vAQPSLPAVPQQSGGPAQTLPQMPAgpqIRVPATATQTKVVPQTVMATVPVKAQTTAATVQRPGPgqtglTVTSLPAT-A 911
Cdd:PRK11901  156 -QQGQVNAASQNAQGNTSTLPTAPA---TVAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTA-----TVAVPPATsG 226
                         170
                  ....*....|....*..
gi 23346420   912 SPVSKPATSSPGTSAPS 928
Cdd:PRK11901  227 KPKSGAASARALSSAPA 243
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
762-958 3.07e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.84  E-value: 3.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   762 PPPVSAVNKSGPSTVS--EPAKSSSGVLLVSSPTMPHLGtmLSPASSQT-APSSQAAARVVSHSGSAGLSQVRVVAQP-S 837
Cdd:PLN03209  341 PVPTKPVTPEAPSPPIeeEPPQPKAVVPRPLSPYTAYED--LKPPTSPIpTPPSSSPASSKSVDAVAKPAEPDVVPSPgS 418
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   838 LPAVPQQSGGPAQTLPQMPAGPQIRVP---------ATATQTKVVPQTVMATVPVKAQTTAATV-----QRPGPGQTGLT 903
Cdd:PLN03209  419 ASNVPEVEPAQVEAKKTRPLSPYARYEdlkpptspsPTAPTGVSPSVSSTSSVPAVPDTAPATAatdaaAPPPANMRPLS 498
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 23346420   904 VTSLPATASPVSKPATSSPGTSAPSASTAAVIQnvTGQNIIKQVAITGQLGVKPQ 958
Cdd:PLN03209  499 PYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVK--VGNSAPPTALADEQHHAQPK 551
PHA03247 PHA03247
large tegument protein UL36; Provisional
756-931 3.50e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 3.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   756 PAIPISPPPVSAVNKSGPSTVSEPAKSSSGVLLVSSPTMPHLGTMLSPASSQTAPSSQAAARVVSHSGSAGLSQVRVVAQ 835
Cdd:PHA03247 2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL 2783
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   836 PSLPAVPQQSGGPAQTLPQMPAGPQIRV--PATATQTKVVPQTVMATVPVKAQTTAATVqrPGPGQTGLTVTSLPATASP 913
Cdd:PHA03247 2784 TRPAVASLSESRESLPSPWDPADPPAAVlaPAAALPPAASPAGPLPPPTSAQPTAPPPP--PGPPPPSLPLGGSVAPGGD 2861
                         170
                  ....*....|....*...
gi 23346420   914 VSKPATSSPGTSAPSAST 931
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPA 2879
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
794-936 4.17e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 4.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   794 MPHLGTMLSPASSQTAPSSQAAARVVSHSGSAGLSQVRVVAQPSLPAVPQQSGGPAqtlPQMPAGPQIRVPATATQTKVV 873
Cdd:PRK07764  385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPA---PPSPAGNAPAGGAPSPPPAAA 461
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 23346420   874 PQTVMATVPvkaQTTAATVQRPGPGQtgltvtslPATASPVSKPATSSPGTSAPSASTAAVIQ 936
Cdd:PRK07764  462 PSAQPAPAP---AAAPEPTAAPAPAP--------PAAPAPAAAPAAPAAPAAPAGADDAATLR 513
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
802-986 4.53e-03

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9.
The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 41.07  E-value: 4.53e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420  802 SPASSQtaPSSQAAARVVSHSGSAGLSQV-----RVV--AQPSLPAVPQQSGGPAQ--TLPQMPAGPQIRVPATATQTKV 872
Cdd:cd22540  277 SPGTGQ--PAVLQQVQVLQPKQEQQVVQIpqqalRVVqaASATLPTVPQKPLQNIQiqNSEPTPTQVYIKTPSGEVQTVL 354
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420  873 VPQTVMATVPVKAQTTAATVQRPGPGQTGL-TVTSLPATASPVSKPATSSPGTSAPSASTAAVIQNVTGQNI----IKQV 947
Cdd:cd22540  355 LQEAPAATATPSSSTSTVQQQVTANNGTGTsKPNYNVRKERTLPKIAPAGGIISLNAAQLAAAAQAIQTINIngvqVQGV 434
                        170       180       190
                 ....*....|....*....|....*....|....*....
gi 23346420  948 AITGQLGVKPQTGNSIPLTATNFRIQGkdvlrLPPSSIT 986
Cdd:cd22540  435 PVTITNAGGQQQLTVQTVSSNNLTISG-----LSPTQIQ 468
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
803-934 6.26e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.99  E-value: 6.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   803 PASSQTAPSSQAAARVVSHSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTLPQMPAGpqirvpATATQTKVVPQTVMATVP 882
Cdd:PRK07003  381 PAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATA------DRGDDAADGDAPVPAKAN 454
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 23346420   883 VKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGTSAPSASTAAV 934
Cdd:PRK07003  455 ARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAA 506
PRK11901 PRK11901
hypothetical protein; Reviewed
801-934 6.28e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 40.44  E-value: 6.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   801 LSPASSQTAPSSQAAARVVSHSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTLPQMPAGPQIRV--PA------TATQTKV 872
Cdd:PRK11901   81 LSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQAAPPQTPNGQQRIelPGnisdalSQQQGQV 160
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 23346420   873 --VPQTVMA---TVPVKAQTTAATVQRPGPGQTGLTVT--SLPATASPVSKPATSSPGTSAPSASTAAV 934
Cdd:PRK11901  161 naASQNAQGntsTLPTAPATVAPSKGAKVPATAETHPTppQKPATKKPAVNHHKTATVAVPPATSGKPK 229
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
848-933 6.96e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 6.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   848 PAQTLPQMPAGPQIRVPATATQTKVVPQTVMA---TVPVKAQTTAATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGT 924
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAppqAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440

                  ....*....
gi 23346420   925 SAPSASTAA 933
Cdd:PRK07994  441 SEPAAASRA 449
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
809-965 8.09e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 8.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   809 APSSQAAARvvshSGSAGLSQVRVVAQPSLPAVPQQSGGPAQTLPQMPAGPQIRVPATATQTKVVPQTVMATVPVKAQTT 888
Cdd:PRK07764  589 GPAPGAAGG----EGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 23346420   889 AATVQRPGPGQTGLTVTSLPATASPVSKPATSSPGTSAPSASTAAVIQNVTGQNIIKQVAITGQLGVKPQTGNSIPL 965
Cdd:PRK07764  665 GGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPL 741
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
780-967 8.52e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.44  E-value: 8.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420    780 AKSSSGVLLVSSPTMPHLGTMLSPASSQTAPSSQAAA-RVVSHSGSAGLSQVRVVAQPslPAVPQQSGGPAQTLPQMPAG 858
Cdd:pfam03546  189 AKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAeRSKEDSESSEESSDSEEEAP--AAATPAQAKPALKTPQTKAS 266
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420    859 PQIRVPATATQTKVVPQTVMATVPVKAQTtaatvqrpgpgQTGLTVTSLPATASPVSKPA--TSSPGTSAPSASTAAVIQ 936
Cdd:pfam03546  267 PRKGTPITPTSAKVPPVRVGTPAPWKAGT-----------VTSPACASSPAVARGAQRPEedSSSSEESESEEETAPAAA 335
                          170       180       190
                   ....*....|....*....|....*....|.
gi 23346420    937 NVTGQNIIKqvAITGQLGVKPQTGNSIPLTA 967
Cdd:pfam03546  336 VGQAKSVGK--GLQGKAASAPTKGPSGQGTA 364
PHA03247 PHA03247
large tegument protein UL36; Provisional
754-936 9.70e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 9.70e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   754 ALPAIPISPPPVSAVNKSGPSTVSEPAKSSSGVLLVSSPTMPhlgtmlsPASSQTAPSSQAAARVVSHSGSAGLSQVRVV 833
Cdd:PHA03247 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW-------DPADPPAAVLAPAAALPPAASPAGPLPPPTS 2833
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23346420   834 AQPSLPAVPqqSGGPAQTLPQ----MPAGPQIRVPAtatqtkvvPQTVMATVPVKAQTTAATVQRPGPGQTGLTVTSLPA 909
Cdd:PHA03247 2834 AQPTAPPPP--PGPPPPSLPLggsvAPGGDVRRRPP--------SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPD 2903
                         170       180
                  ....*....|....*....|....*..
gi 23346420   910 TASPVSKPATSSPGTSAPSASTAAVIQ 936
Cdd:PHA03247 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
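Each hit on this page carries a summary line of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, and the parameters above list an E-value threshold of 0.01. The sketch below parses such lines and keeps hits at or below a threshold. The regular expression, the function names and the inclusive comparison are assumptions based only on the text shown here, not on any documented CD-Search output format.

```python
import re

SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict, or return None if it does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["multi"] is not None,
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def filter_hits(lines, threshold=0.01):
    """Return parsed hits whose E-value is at or below the threshold."""
    hits = (parse_summary(line) for line in lines)
    return [hit for hit in hits if hit is not None and hit["e_value"] <= threshold]


# Two summary lines copied from this page.
PAGE_LINES = [
    "Pssm-ID: 439381  Cd Length: 112  Bit Score: 181.64  E-value: 3.00e-53",
    "Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.80  E-value: 1.89e-05",
]
kept = filter_hits(PAGE_LINES)
```

Lowering `threshold` (for example to `1e-10`) would keep only the strongest hits, such as the DEUBAD and winged helix domains.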
