NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|23397666|ref|NP_056292|]
View 

paired amphipathic helix protein Sin3a [Homo sapiens]

Protein Classification

Sin3 family protein( domain architecture ID 1001704)

Sin3 family protein similar to Homo sapiens paired amphipathic helix proteins Sin3a and Sin3b, which act as transcriptional repressors

CATH:  1.20.1160.11
Gene Ontology:  GO:0003714

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Sin3 super family cl35027
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
5-1213 2.87e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


The actual alignment was detected with superfamily member COG5602:

Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 534.50  E-value: 2.87e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    5 LDDQESPVYAAQQRRIP----GSTEAFPHQHrVLAPAPPVYEAVSetMQSATGIQY--SVTPSY--QVSAMPQSSGSHGP 76
Cdd:COG5602   23 SLKQISPKYHPSMSRDHdvntSLQNSEPWPD-NGSARKPSYSPES--MSSYGGPMDerSGTRRYyeDVEQILSSLAGDNP 99
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666   77 AIAAVHSSHHHptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKS 156
Cdd:COG5602  100 DPSTDHGSASN------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKS 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  157 QSIDTPGVISRVSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpsa 232
Cdd:COG5602  162 QAIDTPGVIERVSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP-------------------- 221
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  233 qsapapaqpapqppPAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVN 312
Cdd:COG5602  222 --------------PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVN 277
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  313 KIKNRFQGQPDIYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKT 392
Cdd:COG5602  278 KVKVRFQNNPEMYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTA 344
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  393 TAEKVDSvRNDHGGTVKKPQLNNKPQRPSQ-NGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESLFFDKVR 471
Cdd:COG5602  345 NAQKPSK-RLPPIGSFSLPTAAPEQNRPSLlWESPRSISNISRYRADLLTSFSRNFVPIRIHMPISHLSNESEDFFERIK 423
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  472 KALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVH--LETYPKerategiameiDYA 549
Cdd:COG5602  424 QYLKDKNLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEKepLRETRK-----------DLE 492
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  550 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 628
Cdd:COG5602  493 KFEKCGPSYRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQR 572
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  629 TIRVLEAIQKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWR 707
Cdd:COG5602  573 TIKALEQIIDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWR 652
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  708 EAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQIL 787
Cdd:COG5602  653 SCKREWNKIWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIF 728
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  788 EDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEeemdvdeatgaVKKHNGVGGSP---- 861
Cdd:COG5602  729 VNILDLKDTLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLN-----------VKGLNIDGLKAdrss 797
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  862 --------------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQIL 902
Cdd:COG5602  798 rsdesaqryskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDS 876
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  903 CLRLLRicsqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDS 982
Cdd:COG5602  877 YERLYE------------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKY 923
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  983 LREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTqnsrsllesTYQRKAEQLMS-DE 1061
Cdd:COG5602  924 LRPFYNNTCYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSKRKQDQT---------KYRLEVESLLNpDE 994
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666 1062 NCFKLMFIQSQGQVQLTIELLD--TEEENSDDpveAERWSDYVERYMNSDTTspELREHLAQKPVFLPRNLRRIRKC-QR 1138
Cdd:COG5602  995 ILFRFCWINKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQL 1069
                       1210      1220      1230      1240      1250      1260      1270
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 23397666 1139 GREQQEKEGKEGNSKKTmenvdsldkLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1213
Cdd:COG5602 1070 VSRLQTKLLRSHELVSG---------LQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
5-1213 2.87e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 534.50  E-value: 2.87e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    5 LDDQESPVYAAQQRRIP----GSTEAFPHQHrVLAPAPPVYEAVSetMQSATGIQY--SVTPSY--QVSAMPQSSGSHGP 76
Cdd:COG5602   23 SLKQISPKYHPSMSRDHdvntSLQNSEPWPD-NGSARKPSYSPES--MSSYGGPMDerSGTRRYyeDVEQILSSLAGDNP 99
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666   77 AIAAVHSSHHHptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKS 156
Cdd:COG5602  100 DPSTDHGSASN------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKS 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  157 QSIDTPGVISRVSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpsa 232
Cdd:COG5602  162 QAIDTPGVIERVSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP-------------------- 221
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  233 qsapapaqpapqppPAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVN 312
Cdd:COG5602  222 --------------PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVN 277
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  313 KIKNRFQGQPDIYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKT 392
Cdd:COG5602  278 KVKVRFQNNPEMYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTA 344
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  393 TAEKVDSvRNDHGGTVKKPQLNNKPQRPSQ-NGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESLFFDKVR 471
Cdd:COG5602  345 NAQKPSK-RLPPIGSFSLPTAAPEQNRPSLlWESPRSISNISRYRADLLTSFSRNFVPIRIHMPISHLSNESEDFFERIK 423
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  472 KALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVH--LETYPKerategiameiDYA 549
Cdd:COG5602  424 QYLKDKNLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEKepLRETRK-----------DLE 492
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  550 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 628
Cdd:COG5602  493 KFEKCGPSYRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQR 572
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  629 TIRVLEAIQKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWR 707
Cdd:COG5602  573 TIKALEQIIDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWR 652
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  708 EAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQIL 787
Cdd:COG5602  653 SCKREWNKIWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIF 728
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  788 EDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEeemdvdeatgaVKKHNGVGGSP---- 861
Cdd:COG5602  729 VNILDLKDTLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLN-----------VKGLNIDGLKAdrss 797
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  862 --------------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQIL 902
Cdd:COG5602  798 rsdesaqryskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDS 876
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  903 CLRLLRicsqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDS 982
Cdd:COG5602  877 YERLYE------------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKY 923
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  983 LREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTqnsrsllesTYQRKAEQLMS-DE 1061
Cdd:COG5602  924 LRPFYNNTCYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSKRKQDQT---------KYRLEVESLLNpDE 994
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666 1062 NCFKLMFIQSQGQVQLTIELLD--TEEENSDDpveAERWSDYVERYMNSDTTspELREHLAQKPVFLPRNLRRIRKC-QR 1138
Cdd:COG5602  995 ILFRFCWINKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQL 1069
                       1210      1220      1230      1240      1250      1260      1270
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 23397666 1139 GREQQEKEGKEGNSKKTmenvdsldkLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1213
Cdd:COG5602 1070 VSRLQTKLLRSHELVSG---------LQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
886-1186 1.75e-102

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 326.49  E-value: 1.75e-102
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    886 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 965
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    966 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATggqlNTQNsrsl 1045
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666   1046 lESTYQRKAEQLM-SDENCFKLMFIQSqgQVQLTIELLDTEEENSDDPV--EAERWSDYVERYMNSDTTspELREHLAQK 1122
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 23397666   1123 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVDSLDKLECRFKLNSYKMVYVIKSEDYM 1186
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
550-650 2.67e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.67e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666     550 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 628
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 23397666     629 TIRVLEAIQKKLSRLSAEEQAK 650
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.32e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.34  E-value: 1.32e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 23397666   48 MQSATGIQYSVTPSYQVSAMPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056  213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Name Accession Description Interval E-value
Sin3 COG5602
Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];
5-1213 2.87e-169

Histone deacetylase complex, regulatory component SIN3 [Chromatin structure and dynamics];


Pssm-ID: 227889 [Multi-domain]  Cd Length: 1163  Bit Score: 534.50  E-value: 2.87e-169
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    5 LDDQESPVYAAQQRRIP----GSTEAFPHQHrVLAPAPPVYEAVSetMQSATGIQY--SVTPSY--QVSAMPQSSGSHGP 76
Cdd:COG5602   23 SLKQISPKYHPSMSRDHdvntSLQNSEPWPD-NGSARKPSYSPES--MSSYGGPMDerSGTRRYyeDVEQILSSLAGDNP 99
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666   77 AIAAVHSSHHHptavqphggqvvQSHAHPAPPVAPvqgqqqFQRLKVEDALSYLDQVKLQFGSQPQVYNDFLDIMKEFKS 156
Cdd:COG5602  100 DPSTDHGSASN------------QNMERSPPPRIP------YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKS 161
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  157 QSIDTPGVISRVSQLFKGHPDLIMGFNTFLPPGYKIEVQTND----MVNVTTPGQVHQIPthgiqpqpqpppqhpsqpsa 232
Cdd:COG5602  162 QAIDTPGVIERVSVLFRGYPHLIEGFNTFLPSGYKIEGSLPQpngsRLHVTTPQGPLSSP-------------------- 221
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  233 qsapapaqpapqppPAKVSKPSQLQAHtpASQQTPPLPPYASPRSPPVQPHTPvtislgtaPSLQNNQPVEFNHAINYVN 312
Cdd:COG5602  222 --------------PVQSSYYVAPCNH--DQRTSHPTLPSDSQPEPSAPSHMP--------SDARGKHQVDFNQAIIFVN 277
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  313 KIKNRFQGQPDIYKAFLEILHTYQKEQRNAkeaggnytpalteQEVYAQVARLFKNQEDLLSEFGQFLPDANSSVLLSKT 392
Cdd:COG5602  278 KVKVRFQNNPEMYYDFLDSLRTYQMKQRSI-------------QEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTA 344
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  393 TAEKVDSvRNDHGGTVKKPQLNNKPQRPSQ-NGCQIRRHPTGTTPPVKKKPKLLNLKDSSMADASKHGGGTESLFFDKVR 471
Cdd:COG5602  345 NAQKPSK-RLPPIGSFSLPTAAPEQNRPSLlWESPRSISNISRYRADLLTSFSRNFVPIRIHMPISHLSNESEDFFERIK 423
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  472 KALRSAEAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFLGYKESVH--LETYPKerategiameiDYA 549
Cdd:COG5602  424 QYLKDKNLYHEFLKLLNLYSQKIIDKNDLVERLFAFLGSNEELIRWFKAFINYSEPEKepLRETRK-----------DLE 492
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  550 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 628
Cdd:COG5602  493 KFEKCGPSYRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWaSEDSGFIAHRKNQYEEALFKIEEERYEYDRHIEATQR 572
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  629 TIRVLEAIQKKLSRLSAEEQAKFRLDNTLGGTSEVIHRKALQRIY-ADKAADIIDGLRKNPSIAVPIVLKRLKMKEEEWR 707
Cdd:COG5602  573 TIKALEQIIDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYdKEHAPEILEALLKKPHVTIPIVLKRLKMKDEEWR 652
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  708 EAQRGFNKVWREQNEKYYLKSLDHQGINFKQNDTKVLRSKSLLNEIESIYDERQEQATEENAGVpvgpHLSLAYEDKQIL 787
Cdd:COG5602  653 SCKREWNKIWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHVSIDDKKVF----QFVFVLCDTYIF 728
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  788 EDAAALIIHHVKRQTGIQKEDKYKIKQIMHHFIPD--LLFAQRGDLSDVEEEEEeemdvdeatgaVKKHNGVGGSP---- 861
Cdd:COG5602  729 VNILDLKDTLITNMSSYLESDKERLKANLKTIVSLffLLCIFRIIIIVYERLLN-----------VKGLNIDGLKAdrss 797
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  862 --------------------------------------PKSKLLFSNTAA-QKLRGMDEVYNlFYVNNNWYIFMRLHQIL 902
Cdd:COG5602  798 rsdesaqryskvksgnleqvskqideyaiedeikepthPDGLKEHNISKGiSENEKQVIALT-EFGNSLMYVFFRLRFDS 876
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  903 CLRLLRicsqaerqieeenrerewerevlgIKRDKSDSPAIQLRLkepmdvdveDYYPAFLDMVRSLLDGNIDSSQYEDS 982
Cdd:COG5602  877 YERLYE------------------------IKKIKEAVTIGMMEP---------GHYENELWDSYRLIFGALTPCEFEKY 923
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666  983 LREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATGGQLNTqnsrsllesTYQRKAEQLMS-DE 1061
Cdd:COG5602  924 LRPFYNNTCYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSKRKQDQT---------KYRLEVESLLNpDE 994
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666 1062 NCFKLMFIQSQGQVQLTIELLD--TEEENSDDpveAERWSDYVERYMNSDTTspELREHLAQKPVFLPRNLRRIRKC-QR 1138
Cdd:COG5602  995 ILFRFCWINKFKSFGIQIMKRAdlTVDQSLDT---QRVWKYYVQSYAIQHLT--EGISYKNYKCPFLCRNIEKERTVeQL 1069
                       1210      1220      1230      1240      1250      1260      1270
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 23397666 1139 GREQQEKEGKEGNSKKTmenvdsldkLECRFKLNSYKMVYVIKSEDyMYRRTALLRAHQSHERVSKRLHQRFQAW 1213
Cdd:COG5602 1070 VSRLQTKLLRSHELVSG---------LQAFLCLDTFKLLYLPNTED-SYIDASYLRLRDSDFLDCQKRKQRWRNW 1134
Sin3a_C pfam16879
C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at ...
886-1186 1.75e-102

C-terminal domain of Sin3a protein; Sin3a_C is a family of eukaryotic species. It is found at the C-terminus of the co-repressor Sin3a, and downstream of family Sin3_corepress, pfam08295.


Pssm-ID: 465294  Cd Length: 281  Bit Score: 326.49  E-value: 1.75e-102
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    886 FYVNNNWYIFMRLHQILCLRLLRIcSQAERQIEEENREREWEREVLGIKRDKSDSPAIQLRLkepmdVDVEDYYPAFLDM 965
Cdd:pfam16879    1 FFANTTIYVFFRLFQILYERLLKI-KQAEEEVAEEIKRRKANKPAKDLGLISKELEDFGLDL-----SDPEDYYPQLLEL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    966 VRSLLDGNIDSSQYEDSLREMFTIHAYIAFTMDKLIQSIVRQLQHIVSDEICVQVTDLYLAENNNGATggqlNTQNsrsl 1045
Cdd:pfam16879   75 CERLIDGEIDQSQFEDSLRQMYGNKAYKLYTIDKLIQALVKQLQTIVSDSKSQELLELFEKERSKEET----TAQD---- 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666   1046 lESTYQRKAEQLM-SDENCFKLMFIQSqgQVQLTIELLDTEEENSDDPV--EAERWSDYVERYMNSDTTspELREHLAQK 1122
Cdd:pfam16879  147 -EILYRKQAEKLLgKDENLFRIEWNPQ--SKTVTIQLLDKDDLTFDDDEltAEERWQYYVSSYVLSHPT--EGVPRSKLR 221
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 23397666   1123 PVFLPRNLRRIRKCQRGREQQEKEGKEgnskKTMENVDSLDKLECRFKLNSYKMVYVIKSEDYM 1186
Cdd:pfam16879  222 KPFLKRNLKAEREEEEDEEEEPKADAE----EPEGPLEFESGLEIRICLNSYKLFYVPGTEDYF 281
HDAC_interact smart00761
Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It ...
550-650 2.67e-52

Histone deacetylase (HDAC) interacting; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 214808 [Multi-domain]  Cd Length: 102  Bit Score: 178.27  E-value: 2.67e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666     550 SCKRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDSTFVSSKKTQYEEHIYRCEDERFELDVVLETNLA 628
Cdd:smart00761    1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWaSEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS 80
                            90       100
                    ....*....|....*....|..
gi 23397666     629 TIRVLEAIQKKLSRLSAEEQAK 650
Cdd:smart00761   81 TIKLLEEILNKIEDMSDEERAN 102
Sin3_corepress pfam08295
Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms ...
552-646 5.70e-51

Sin3 family co-repressor; This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.


Pssm-ID: 462418  Cd Length: 97  Bit Score: 174.34  E-value: 5.70e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 23397666    552 KRLGSSYRALPKSYQQPKCTGRTPLCKEVLNDTWVSFPSW-SEDS-TFVSSKKTQYEEHIYRCEDERFELDVVLETNLAT 629
Cdd:pfam08295    1 KRCGPSYRLLPKSEPQLPCSGRDELCWEVLNDEWVSHPTWaSEDSgSFVAHRKNQYEEALFRCEDERYEYDMHIESNLRT 80
                           90
                   ....*....|....*..
gi 23397666    630 IRVLEAIQKKLSRLSAE 646
Cdd:pfam08295   81 IKLLEPIAEKINNMSPE 97
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
142-186 8.97e-20

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 83.63  E-value: 8.97e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 23397666    142 QVYNDFLDIMKEFKSQSIDTPGVISRVSQLFKGHPDLIMGFNTFL 186
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
478-522 1.94e-13

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 65.52  E-value: 1.94e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 23397666    478 EAYENFLRCLVIFNQEVISRAELVQLVSPFLGKFPELFNWFKNFL 522
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAEVIARVSELLKGHPDLLEGFNTFL 45
PAH pfam02671
Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The ...
323-380 3.63e-12

Paired amphipathic helix repeat; This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-loop-helix motif, which mediate protein-protein interactions.


Pssm-ID: 460645 [Multi-domain]  Cd Length: 45  Bit Score: 62.05  E-value: 3.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 23397666    323 DIYKAFLEILHTYQKEQRNAKEaggnytpalteqeVYAQVARLFKNQEDLLSEFGQFL 380
Cdd:pfam02671    1 EVYDEFLKILNDYKQEIIDIAE-------------VIARVSELLKGHPDLLEGFNTFL 45
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
48-119 1.32e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 42.34  E-value: 1.32e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 23397666   48 MQSATGIQYSVTPSYQVSAMPQSSGSHGPAIAAVHSSHHHPTAVQPHGgqvVQSHAHPAPPVAPvQGQQQFQ 119
Cdd:cd22056  213 MHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQY---QYMNAPYPPHYAH-QGAPQFH 280
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH