NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|672035056|ref|XP_008757158|]
View 

protein capicua homolog isoform X2 [Rattus norvegicus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
HMG-box_CIC-like cd21990
high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a ...
1105-1182 2.23e-59

high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a transcriptional repressor which plays a role in the development of the central nervous system (CNS). In concert with ATXN1 and ATXN1L, CIC is involved in brain development.


:

Pssm-ID: 438806 [Multi-domain]  Cd Length: 78  Bit Score: 198.45  E-value: 2.23e-59
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKK 1182
Cdd:cd21990     1 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPEEKQKYHDLAFQVKEAHFKAHPDWKWCSKDRKK 78
DUF4819 super family cl24605
Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. ...
252-345 7.99e-21

Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is typically between 82 and 99 amino acids in length.


The actual alignment was detected with superfamily member pfam16090:

Pssm-ID: 465014  Cd Length: 84  Bit Score: 88.89  E-value: 7.99e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   252 LDATPPPGALMVGTAVCTCV----EPGVAAYREGVVVEVATKPAAYKVRLNPGPSShagspgtlpqaqqplhrepEEAIW 327
Cdd:pfam16090    4 LDASPSLSDVAVGTRVCVRLdpglEGGENVYREGVVVEVNNKPVRYVVKVSGGDEA-------------------KGGVW 64
                           90
                   ....*....|....*...
gi 672035056   328 VTRSSLRLLRPPWEPEAL 345
Cdd:pfam16090   65 VKRADLRLLRPPWWDELE 82
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1638-2196 5.22e-11

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 5.22e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1638 AAATSPAPHLVAGPLLGTVGKAPATVTNLLVGTPGYGAPASPAVQFIAQGAPGsatpagSGASAGSGPNGPVPLGILQPG 1717
Cdd:PHA03247 2445 AGLAADGDPFFARTILGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPD------PGGGGPPDPDAPPAPSRLAPA 2518
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1718 ALGKAGGITQVQYILPTLPQQLQVAPAPAPAPGTKAAAPSGPAPTTSIRFTLP-PGTSTNGKVLAATAPTAGIPILQSVP 1796
Cdd:PHA03247 2519 ILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRARRPDAPPQSARP 2598
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1797 SAP-PPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVS 1875
Cdd:PHA03247 2599 RAPvDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1876 TPSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYVQSAGGHTLPLGTSSAcsqtgtvtsygPASSVALGFTSLGPSGPAf 1955
Cdd:PHA03247 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA-----------RQASPALPAAPAPPAVPA- 2746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1956 vQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAI 2035
Cdd:PHA03247 2747 -GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2036 LPKGPPASATATPAPTSPFPSATAGSMTYSLV--APKAQRPSPKAPQKVKAAIASIPVGSFESGTTGR-TGPTPRQSLDS 2112
Cdd:PHA03247 2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRsTESFALPPDQP 2905
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2113 GVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPLPAEERPGTKGPETASKFPSSSSDWRVPGLGLESRGEPPT 2192
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                  ....
gi 672035056 2193 PPSP 2196
Cdd:PHA03247 2986 REAP 2989
PHA03247 super family cl33720
large tegument protein UL36; Provisional
749-1060 3.27e-07

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.49  E-value: 3.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  749 VSHTPTPSTPAGFRAVSPAVPFSRSrQPSPLLLLPPPAGLTSDPGPSVRRVPAvqrdSPVIVRNPDVPLPSKFPgevgaa 828
Cdd:PHA03247 2716 VSATPLPPGPAAARQASPALPAAPA-PPAVPAGPATPGGPARPARPPTTAGPP----APAPPAAPAAGPPRRLT------ 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  829 gearaggpgrgcretpvPPGVASGKPSLPPPLPAPVpitvPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVP 908
Cdd:PHA03247 2785 -----------------RPAVASLSESRESLPSPWD----PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  909 SSYPSHPAPKKEVIMG-----RPGTVWTNVEPRSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAES 983
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGgdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  984 AAVAHEQPPGGTGGADPGRPP----GATCPESPGPGPPLTLGGVDPGKSLPPTTEEEAPGPPGE-PRLDSETESDHDDAF 1058
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLApttdPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREaPASSTPPLTGHSLSR 3003

                  ..
gi 672035056 1059 LS 1060
Cdd:PHA03247 3004 VS 3005
PRK07003 super family cl35530
DNA polymerase III subunit gamma/tau;
1552-1694 6.80e-03

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK07003:

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 6.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1552 SAPAPSLAYGAPAAPLCRPAATMVTNVVRPVSSTPVPIASKPFPTSGRAEASSNDTVGARTEMGTGSRVPGGSPLGVSLV 1631
Cdd:PRK07003  383 PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSR 462
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056 1632 YSDKKSAAATSPAPHLVAGPllgtvgKAPATVTNllvgtpgygAPASPAVQFIAQGAPGSATP 1694
Cdd:PRK07003  463 CDERDAQPPADSGSASAPAS------DAPPDAAF---------EPAPRAAAPSAATPAAVPDA 510
 
Name Accession Description Interval E-value
HMG-box_CIC-like cd21990
high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a ...
1105-1182 2.23e-59

high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a transcriptional repressor which plays a role in the development of the central nervous system (CNS). In concert with ATXN1 and ATXN1L, CIC is involved in brain development.


Pssm-ID: 438806 [Multi-domain]  Cd Length: 78  Bit Score: 198.45  E-value: 2.23e-59
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKK 1182
Cdd:cd21990     1 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPEEKQKYHDLAFQVKEAHFKAHPDWKWCSKDRKK 78
DUF4819 pfam16090
Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. ...
252-345 7.99e-21

Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is typically between 82 and 99 amino acids in length.


Pssm-ID: 465014  Cd Length: 84  Bit Score: 88.89  E-value: 7.99e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   252 LDATPPPGALMVGTAVCTCV----EPGVAAYREGVVVEVATKPAAYKVRLNPGPSShagspgtlpqaqqplhrepEEAIW 327
Cdd:pfam16090    4 LDASPSLSDVAVGTRVCVRLdpglEGGENVYREGVVVEVNNKPVRYVVKVSGGDEA-------------------KGGVW 64
                           90
                   ....*....|....*...
gi 672035056   328 VTRSSLRLLRPPWEPEAL 345
Cdd:pfam16090   65 VKRADLRLLRPPWWDELE 82
HMG smart00398
high mobility group;
1105-1174 9.61e-19

high mobility group;


Pssm-ID: 197700 [Multi-domain]  Cd Length: 70  Bit Score: 82.36  E-value: 9.61e-19
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:smart00398    1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70
HMG_box pfam00505
HMG (high mobility group) box;
1106-1173 2.90e-17

HMG (high mobility group) box;


Pssm-ID: 459837 [Multi-domain]  Cd Length: 68  Bit Score: 78.04  E-value: 2.90e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056  1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDW 1173
Cdd:pfam00505    1 PKRPMSAFFLFSKEQRAKLKAENPGLKNAEISKILGEKWKALSEEEKKPYEEKAEKEKARYEKEHPEY 68
PHA03247 PHA03247
large tegument protein UL36; Provisional
1638-2196 5.22e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 5.22e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1638 AAATSPAPHLVAGPLLGTVGKAPATVTNLLVGTPGYGAPASPAVQFIAQGAPGsatpagSGASAGSGPNGPVPLGILQPG 1717
Cdd:PHA03247 2445 AGLAADGDPFFARTILGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPD------PGGGGPPDPDAPPAPSRLAPA 2518
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1718 ALGKAGGITQVQYILPTLPQQLQVAPAPAPAPGTKAAAPSGPAPTTSIRFTLP-PGTSTNGKVLAATAPTAGIPILQSVP 1796
Cdd:PHA03247 2519 ILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRARRPDAPPQSARP 2598
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1797 SAP-PPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVS 1875
Cdd:PHA03247 2599 RAPvDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1876 TPSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYVQSAGGHTLPLGTSSAcsqtgtvtsygPASSVALGFTSLGPSGPAf 1955
Cdd:PHA03247 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA-----------RQASPALPAAPAPPAVPA- 2746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1956 vQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAI 2035
Cdd:PHA03247 2747 -GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2036 LPKGPPASATATPAPTSPFPSATAGSMTYSLV--APKAQRPSPKAPQKVKAAIASIPVGSFESGTTGR-TGPTPRQSLDS 2112
Cdd:PHA03247 2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRsTESFALPPDQP 2905
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2113 GVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPLPAEERPGTKGPETASKFPSSSSDWRVPGLGLESRGEPPT 2192
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                  ....
gi 672035056 2193 PPSP 2196
Cdd:PHA03247 2986 REAP 2989
PHA03247 PHA03247
large tegument protein UL36; Provisional
749-1060 3.27e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.49  E-value: 3.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  749 VSHTPTPSTPAGFRAVSPAVPFSRSrQPSPLLLLPPPAGLTSDPGPSVRRVPAvqrdSPVIVRNPDVPLPSKFPgevgaa 828
Cdd:PHA03247 2716 VSATPLPPGPAAARQASPALPAAPA-PPAVPAGPATPGGPARPARPPTTAGPP----APAPPAAPAAGPPRRLT------ 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  829 gearaggpgrgcretpvPPGVASGKPSLPPPLPAPVpitvPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVP 908
Cdd:PHA03247 2785 -----------------RPAVASLSESRESLPSPWD----PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  909 SSYPSHPAPKKEVIMG-----RPGTVWTNVEPRSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAES 983
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGgdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  984 AAVAHEQPPGGTGGADPGRPP----GATCPESPGPGPPLTLGGVDPGKSLPPTTEEEAPGPPGE-PRLDSETESDHDDAF 1058
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLApttdPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREaPASSTPPLTGHSLSR 3003

                  ..
gi 672035056 1059 LS 1060
Cdd:PHA03247 3004 VS 3005
NHP6B COG5648
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics];
1077-1221 5.24e-07

Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics];


Pssm-ID: 227935 [Multi-domain]  Cd Length: 211  Bit Score: 52.94  E-value: 5.24e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1077 TQSLSALPKERDSSSEKDGRSPnkrekdhiRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYH 1156
Cdd:COG5648    50 TKPRKKTKSKRLVRKKKDPNGP--------KRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYY 121
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1157 DLAFQVKEAHFKA-HPDwkwcnkDRKKSSSEAKPASLGLAGGHKETRERSMSETG----TAAAPGVSSEL 1221
Cdd:COG5648   122 KEANSDRERYQREkEEY------NKKLPNKAPIGPFIENEPKIRPKVEGPSPDKAlveeTKIISKAWSEL 185
PTZ00199 PTZ00199
high mobility group protein; Provisional
1089-1163 1.26e-06

high mobility group protein; Provisional


Pssm-ID: 185511 [Multi-domain]  Cd Length: 94  Bit Score: 48.70  E-value: 1.26e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1089 SSSEKDGRSPNKREKDHI--RRPMNAFMIFSKRHRALVHQRHPN--QDNRTVSKILGEWWYALGPKEKQKYHDLAFQVK 1163
Cdd:PTZ00199    4 KQGKVLVRKNKRKKKDPNapKRALSAYMFFAKEKRAEIIAENPElaKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDK 82
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1873-2082 1.83e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 1.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1873 PVSTPSGLVPPLSPATMPGPTSQPQKVL-LPSSTRITYVQSAGGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLGPS 1951
Cdd:COG3469    11 TAGGASATAVTLLGAAATAASVTLTAATaTTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAAT 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1952 GPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPP 2031
Cdd:COG3469    91 STSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 672035056 2032 AAAILPKGPPASATATPAPTSPFPSATAGSMtyslvAPKAQRPSPKAPQKV 2082
Cdd:COG3469   171 TTTSASTTPSATTTATATTASGATTPSATTT-----ATTTGPPTPGLPKHV 216
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1921-2109 3.08e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 3.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1921 GTSSACSQTGTVTSYGPASSV-ALGFTSLGPSGPAFVQPLLSAGQAP--LLAPGQVG--VSPVPSPQLPPACTAPGGPvi 1995
Cdd:pfam05109  409 ATNATTTTHKVIFSKAPESTTtSPTLNTTGFAAPNTTTGLPSSTHVPtnLTAPASTGptVSTADVTSPTPAGTTSGAS-- 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1996 tafyPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAilpkgPPASATATPAPTSPFPSATAGSMTYSLVAPKAQRPS 2075
Cdd:pfam05109  487 ----PVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT-----SPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATS 557
                          170       180       190
                   ....*....|....*....|....*....|....
gi 672035056  2076 PKAPQKVKAAIASIPVGSFESGTTGRTGPTPRQS 2109
Cdd:pfam05109  558 PTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNAT 591
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1951-2090 1.19e-03

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 43.60  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1951 SGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSA--PLGPPSQAPPSLVYTVATST 2028
Cdd:NF040712  188 IDPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASArrRRAGVEQPEDEPVGPGAAPA 267
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 2029 TPPAAAILPKGPP-----ASATATPAPTSPFPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIP 2090
Cdd:NF040712  268 AEPDEATRDAGEPpapgaAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1044 4.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   752 TPTPSTPAGFRAVSPAVPFSRSRQPSPLLLLPPPAGLTSDPGPSVRRVPAvqrdspvivrNPDVPLPSKFPGEVGAAGEA 831
Cdd:pfam03154  262 SPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP----------GPSPAAPGQSQQRIHTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   832 RAGGPGRGCRETPVPPGVASgkpslpPPLPAPVPITVPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVlvpSSY 911
Cdd:pfam03154  332 SQLQSQQPPREQPLPPAPLS------MPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPL---SSL 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   912 PSHPAPKkevimgrpgtvwtnveprsvAVFPWHSLVPFLAPSQPDPsVQPSEAQQPASHPVASNQSKEPAESAAVAhEQP 991
Cdd:pfam03154  403 STHHPPS--------------------AHPPPLQLMPQSQQLPPPP-AQPPVLTQSQSLPPPAASHPPTSGLHQVP-SQS 460
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 672035056   992 PGGTGGADPGRPPGATCPESPGPGPPLTLGGVDPGKSLPPTTEEEAPGPPGEP 1044
Cdd:pfam03154  461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCP 513
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1552-1694 6.80e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 6.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1552 SAPAPSLAYGAPAAPLCRPAATMVTNVVRPVSSTPVPIASKPFPTSGRAEASSNDTVGARTEMGTGSRVPGGSPLGVSLV 1631
Cdd:PRK07003  383 PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSR 462
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056 1632 YSDKKSAAATSPAPHLVAGPllgtvgKAPATVTNllvgtpgygAPASPAVQFIAQGAPGSATP 1694
Cdd:PRK07003  463 CDERDAQPPADSGSASAPAS------DAPPDAAF---------EPAPRAAAPSAATPAAVPDA 510
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
2002-2073 8.80e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 41.03  E-value: 8.80e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056  2002 SPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFPSATA-GSMTYSLVAPKAQR 2073
Cdd:TIGR00601   84 VAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSsGSDAASTLVVGSER 156
 
Name Accession Description Interval E-value
HMG-box_CIC-like cd21990
high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a ...
1105-1182 2.23e-59

high mobility group (HMG)-box found in protein capicua (CIC) and similar proteins; CIC is a transcriptional repressor which plays a role in the development of the central nervous system (CNS). In concert with ATXN1 and ATXN1L, CIC is involved in brain development.


Pssm-ID: 438806 [Multi-domain]  Cd Length: 78  Bit Score: 198.45  E-value: 2.23e-59
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKK 1182
Cdd:cd21990     1 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPEEKQKYHDLAFQVKEAHFKAHPDWKWCSKDRKK 78
HMG-box_HBP2 cd21989
high mobility group (HMG)-box found in HMG box-containing protein 2 (HBP2) and similar ...
1107-1175 4.47e-34

high mobility group (HMG)-box found in HMG box-containing protein 2 (HBP2) and similar proteins; HBP2, also called HMG box transcription factor BBX, or Bobby sox homolog, is a transcription factor that is necessary for cell cycle progression from the G1 to S phase.


Pssm-ID: 438805 [Multi-domain]  Cd Length: 69  Bit Score: 125.98  E-value: 4.47e-34
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKW 1175
Cdd:cd21989     1 RRPMNAFLLFCKRHRSLVRERHPRLDNRGITKILGDWWAVLDPDEKQKYTDLAKQYKEAFMKANPNFKW 69
HMG-box_SOX cd22004
high mobility group (HMG)-box found in sex-determining region Y (SRY)-box (SOX) family ...
1105-1178 2.74e-22

high mobility group (HMG)-box found in sex-determining region Y (SRY)-box (SOX) family transcription factors; The SOX gene family of transcription factors are characterized by the evolutionarily conserved SRY-type HMG box, which is a DNA binding domain that binds the minor groove of DNA on a common consensus site, (A/T)(A/T)CAA(A/T)G but with different levels of efficiency. Members include SRY and its homologs identified in mammals that can be subdivided into 8 groups (A, B1, B2, C, D, E, F, G, H). They are involved in embryonic development, regulating processes such as cell differentiation, maintenance of stemness, sex determination, and development of the central nervous, haematopoietic and other organ systems. The SOX gene family has a crucial role in carcinogenesis and cancer progression.


Pssm-ID: 438820 [Multi-domain]  Cd Length: 75  Bit Score: 92.61  E-value: 2.74e-22
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNK 1178
Cdd:cd22004     1 HIKRPMNAFMVWSQQERKKIAKQNPKLHNSEISKILGKEWKKLTEEEKRPYVEEAERLREEHKKEYPDYKYRPR 74
HMG-box_ROX1-like cd01389
high mobility group (HMG)-box found in Saccharomyces cerevisiae repressor ROX1 and similar ...
1106-1174 1.32e-21

high mobility group (HMG)-box found in Saccharomyces cerevisiae repressor ROX1 and similar proteins; This family includes class I members of the HMG-box superfamily of DNA-binding proteins, including Saccharomyces cerevisiae repressor ROX1, Schizosaccharomyces pombe mating-type M-specific polypeptide Mc (mat-Mc), Schizosaccharomyces pombe transcription factor ste11, Podospora anserina MAT+ sexual cell fertilization-promoting factor (FPR1), Podospora anserina sporulation minus regulator 2 (SMR2) and Candida albicans repressor of filamentous growth 1 (RFG1). These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. ROX1, also called heme-dependent repression factor, or hypoxic function repressor, is a transcription factor that represses the expression of HEM13, COX5B, ANB1, CYC7 or AAC3. It binds to the DNA sequence 5'-RRRTAACAAGAG-3'. mat-Mc belongs to the mating type protein family, which contains sequence specific DNA-binding proteins that act as master switches in yeast differentiation by controlling gene expression in a cell type-specific fashion. mat-Mc is a positive regulator of MFM genes and is required for conjugation and efficient meiosis. Its HMG box recognizes the DNA sequence 5'-AACAAAG-3'. Ste11 is a key transcription factor for sexual development. It activates the transcription of the matp, matm, mei2, mfm, ste6 and rgs1 genes. It binds specifically to a DNA fragment carrying a 10-base motif 5'-TTCTTTGTTY-3'. FPR1 controls fertilization, probably by determining the mating type. SMR2 is a transcription factor that is required for post-fertilization events. It is required for the developmental events that occur in the female organ after fertilization. RFG1 is a transcription regulator that functions in both the positive and negative regulation of filamentous growth, depending upon environmental conditions.


Pssm-ID: 438791 [Multi-domain]  Cd Length: 72  Bit Score: 90.34  E-value: 1.32e-21
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:cd01389     1 IPRPPNAFILFRKEKHKQLRAQNPGLTNSEISKIIGKMWKNLSEEEKEPYKELAEEEKEEHKLKYPDYK 69
DUF4819 pfam16090
Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. ...
252-345 7.99e-21

Domain of unknown function (DUF4819); This presumed domain is functionally uncharacterized. This domain family is found in eukaryotes, and is typically between 82 and 99 amino acids in length.


Pssm-ID: 465014  Cd Length: 84  Bit Score: 88.89  E-value: 7.99e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   252 LDATPPPGALMVGTAVCTCV----EPGVAAYREGVVVEVATKPAAYKVRLNPGPSShagspgtlpqaqqplhrepEEAIW 327
Cdd:pfam16090    4 LDASPSLSDVAVGTRVCVRLdpglEGGENVYREGVVVEVNNKPVRYVVKVSGGDEA-------------------KGGVW 64
                           90
                   ....*....|....*...
gi 672035056   328 VTRSSLRLLRPPWEPEAL 345
Cdd:pfam16090   65 VKRADLRLLRPPWWDELE 82
HMG-box_SoxH_SOX30 cd22033
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 30 (SOX30) and ...
1105-1176 7.30e-20

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 30 (SOX30) and similar proteins; SOX-30 is a crucial transcription factor that controls the transition from a late meiotic to a post-meiotic gene expression program and subsequent round spermatid development. It specially prevents Wnt-signaling to suppress metastasis. SOX-30 binds to the DNA sequence 5'-ACAAT-3' and shows a preference for guanine residues surrounding this core motif. SOX-30 is the only member of the group H of SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438842 [Multi-domain]  Cd Length: 75  Bit Score: 85.55  E-value: 7.30e-20
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWC 1176
Cdd:cd22033     1 HIKRPMNAFMVWARIHRPALAKANPNANNADISVQLGEEWNKLTEEQKKPYYDEAQKLKEQHRKEHPGWVYQ 72
HMG smart00398
high mobility group;
1105-1174 9.61e-19

high mobility group;


Pssm-ID: 197700 [Multi-domain]  Cd Length: 70  Bit Score: 82.36  E-value: 9.61e-19
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:smart00398    1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70
HMG-box_HBP1 cd21988
high mobility group (HMG)-box found in HMG box-containing protein 1 (HBP1) and similar ...
1107-1174 1.95e-18

high mobility group (HMG)-box found in HMG box-containing protein 1 (HBP1) and similar proteins; HBP1, also called HMG box transcription factor 1, or high mobility group box transcription factor 1, is a transcriptional repressor that binds to the promoter region of target genes. It plays a role in the regulation of the cell cycle and the Wnt pathway. HBP1 binds preferentially to the sequence 5'-TTCATTCATTCA-3'.


Pssm-ID: 438804 [Multi-domain]  Cd Length: 69  Bit Score: 81.35  E-value: 1.95e-18
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPD-WK 1174
Cdd:cd21988     1 KRPMNAFMLFAKKYRVEYTQMHPGKDNRAISVILGDRWKKMKNEERRVYTLEAKALAEEHKRLHPDcWK 69
HMG-box_SoxA_SoxB_SoxG cd22028
high mobility group (HMG)-box found in group A, group B and group G of SRY-related ...
1105-1182 2.16e-18

high mobility group (HMG)-box found in group A, group B and group G of SRY-related high-mobility group (HMG) box (Sox) transcription factors; This subfamily includes SoxA, SoxB, and SoxG proteins. SRY is the only member of group A SRY-related high-mobility group (HMG) box (Sox) transcription factors. SRY, also called testis-determining factor, is a transcriptional regulator that controls a genetic switch in male development. It promotes DNA bending and is also involved in pre-mRNA splicing. The SRY HMG box recognizes DNA by partial intercalation in the minor groove. It binds to the DNA consensus sequence 5'-[AT]AACAA[AT]-3'. SoxB transcription factors play critical roles in the regulation of neurogenesis. They can be divided into two main subgroups, SoxB1 (Sox-1/Sox-2/Sox-3) and SoxB2 (SOX-14/SOX-21). SoxB1 proteins suppress neurogenesis by maintaining neural cells in an undifferentiated state. SoxB2 proteins may have the opposite activity and promote neuronal differentiation. SOX-5 is the only member of group G Sox transcription factors. It is also called SOX-12, SOX-20, SOX-26, or SOX-27. SOX-5 is a crucial transcription factor involved in the regulation of embryonic development and in cell fate determination. It binds to the 5'-AACAAT-3' sequence. SOX-15 can be a tumor suppressor in multiple cancer types.


Pssm-ID: 438837 [Multi-domain]  Cd Length: 76  Bit Score: 81.63  E-value: 2.16e-18
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKK 1182
Cdd:cd22028     1 RVKRPMNAFMVWSREQRRKIAQENPKMHNSEISKRLGAEWKQLSEDEKRPFIDEAKRLRAQHMKEHPDYKY--RPRRK 76
HMG_box pfam00505
HMG (high mobility group) box;
1106-1173 2.90e-17

HMG (high mobility group) box;


Pssm-ID: 459837 [Multi-domain]  Cd Length: 68  Bit Score: 78.04  E-value: 2.90e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056  1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDW 1173
Cdd:pfam00505    1 PKRPMSAFFLFSKEQRAKLKAENPGLKNAEISKILGEKWKALSEEEKKPYEEKAEKEKARYEKEHPEY 68
HMG-box_EGL13-like cd22042
high mobility group (HMG)-box found in Caenorhabditis elegans protein egg laying defective 13 ...
1105-1175 9.16e-17

high mobility group (HMG)-box found in Caenorhabditis elegans protein egg laying defective 13 (EGL-13) and similar proteins; EGL-13 may act as a transcription factor that is required for uterine cell fate decisions in Caenorhabditis elegans. It controls genes required for the specification and differentiation of O(2) and CO(2)-sensing neurons and for maintaining URX sensory neuronal cell fate.


Pssm-ID: 438848 [Multi-domain]  Cd Length: 78  Bit Score: 76.86  E-value: 9.16e-17
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKW 1175
Cdd:cd22042     2 HIKRPMNAFMVWAKDERRKILKACPDMHNSNISKILGAKWKAMSNAEKQPYYEEQSRLSKLHMEKHPDYRY 72
HMG-box_SF cd00084
high mobility group (HMG)-box domain superfamily; The High Mobility Group (HMG)-box is found ...
1108-1166 2.76e-16

high mobility group (HMG)-box domain superfamily; The High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenetically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4, and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.


Pssm-ID: 438789 [Multi-domain]  Cd Length: 59  Bit Score: 74.86  E-value: 2.76e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAH 1166
Cdd:cd00084     1 RPLSAYLLFSKEKRPKLKKENPDLSFTEISKLLGERWKELSEEEKQPYEEKAKEDKERY 59
HMG-box_SoxB cd01388
high mobility group (HMG)-box found in group B SRY-related high-mobility group (HMG) box (Sox) ...
1104-1185 1.17e-15

high mobility group (HMG)-box found in group B SRY-related high-mobility group (HMG) box (Sox) transcription factors; SoxB transcription factors play critical roles in the regulation of neurogenesis. They can be divided into two main subgroups, SoxB1 (Sox-1/Sox-2/Sox-3) and SoxB2 (SOX-14/SOX-21). SoxB1 proteins suppress neurogenesis by maintaining neural cells in an undifferentiated state. SoxB2 proteins may have the opposite activity and promote neuronal differentiation. SOX-1 is involved in the regulation of embryonic development and cell fate determination. It also acts as a tumor suppressor that plays an anti-tumorigenicity role in different cells and its expression is inhibited in a variety of cancers. SOX-2 plays an important role in various phases of embryonic development, including cell fate and differentiation. Its overexpression and gene amplification may be associated with tumor aggression and metastasis in various cancer types, including breast, prostate, lung, ovarian and colon cancer. SOX-3 is required during the formation of the hypothalamic-pituitary axis. It plays a role in both normal neural development and carcinogenesis. SOX-14, also called SOX-28, acts as a negative regulator of transcription. It is mainly involved in the regulation of neural development. It can also promote proliferation and invasion capacity of cervical cancer cells by activating the Wnt/beta-catenin pathway. SOX-21, also called SOX-A, or SOX-25, promotes the progression of vertebrate neurogenesis.


Pssm-ID: 438790 [Multi-domain]  Cd Length: 80  Bit Score: 73.79  E-value: 1.17e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1104 DHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKKS 1183
Cdd:cd01388     1 DRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSEAEKRPFIDEAKRLRALHMKEHPDYKY--RPRRKT 78

                  ..
gi 672035056 1184 SS 1185
Cdd:cd01388    79 KT 80
HMG-box_SoxD cd22030
high mobility group (HMG)-box found in group D SRY-related high-mobility group (HMG) box (Sox) ...
1105-1182 4.77e-15

high mobility group (HMG)-box found in group D SRY-related high-mobility group (HMG) box (Sox) transcription factors; SoxD transcription factors includes three members, Sox5, Sox6, and Sox13. They function as important transcriptional regulators of glial development in the central nervous system. SoxD proteins influence multiple stages of oligodendrocyte development and modulate SoxE protein function. This subfamily also contains SoxD transcription factor homolog EGL-13, which specifies distinct O2 and CO2 sensory neuron fates in Caenorhabditis elegans. SOX-5 plays important roles in various cancer types. It binds specifically to the DNA sequence 5'-AACAAT-3'. SOX-6 plays a key role in cell proliferation, differentiation, and cell fate determination, as well as neurogenesis and skeleton formation. It is a tumor suppressor and downregulated in various cancers. SOX-6 binds specifically to the DNA sequence 5'-AACAAT-3'. SOX-13, also called Islet cell antigen 12 (ICA12), or type 1 diabetes auto-antigen ICA12, modulates T cell specification and is an autoimmune antigen. It binds to the sequence 5'-AACAAT-3'.


Pssm-ID: 438839 [Multi-domain]  Cd Length: 77  Bit Score: 71.96  E-value: 4.77e-15
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKK 1182
Cdd:cd22030     1 HIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKSMSNQEKQPYYEEQARLSKQHLEKYPDYKY--KPRPK 76
HMG-box_SoxE cd22031
high mobility group (HMG)-box found in group E SRY-related high-mobility group (HMG) box (Sox) ...
1105-1175 8.01e-15

high mobility group (HMG)-box found in group E SRY-related high-mobility group (HMG) box (Sox) transcription factors; SoxE transcription factors includes three members: Sox8, Sox9, and Sox10. They function as important transcriptional regulators of neural differentiation and nervous system development. They regulate mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. SOX-8 is specifically expressed by M cells in the intestinal epithelium. It acts as a transcription factor necessary for the maintenance of spermatogenesis. It may play a role in central nervous system, limb and facial development. SOX-9 is involved in cell differentiation, sex determination, and tumorigenesis. It plays a role in chondrocyte differentiation and skeletal development. SOX-10 plays a central role in the development of melanocytes and glial cells from neural crest precursors.


Pssm-ID: 438840 [Multi-domain]  Cd Length: 75  Bit Score: 71.27  E-value: 8.01e-15
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKW 1175
Cdd:cd22031     1 HVKRPMNAFMVWAQAARRKLADQYPHLHNAELSKTLGKLWRLLSESEKRPFIEEAERLRVQHKKDHPDYKY 71
HMG-box_TCF7-like cd21996
high mobility group (HMG)-box found in the transcription factor 7 (TCF-7)-like family; The ...
1105-1182 2.35e-14

high mobility group (HMG)-box found in the transcription factor 7 (TCF-7)-like family; The TCF7-like family includes TCF-7, TCF7-like 1 (TF7L1), lymphoid enhancer-binding factor 1 (LEF-1), and similar proteins. TCF-7, also called T-cell-specific transcription factor 1, or T-cell factor 1 (TCF-1), is a T lymphocyte-specific transcriptional activator involved in T-cell lymphocyte differentiation. It is necessary for the survival of CD4(+) CD8(+) immature thymocytes. It binds to the T-lymphocyte-specific enhancer element (5'-WWCAAAG-3') found in the promoter of the CD3E gene. TF7L1, also called HMG box transcription factor 3 (TCF-3), participates in the Wnt signaling pathway. It binds to DNA and acts as a repressor in the absence of CTNNB1, and as an activator in its presence. TF7L2, also called HMG box transcription factor 4, or T-cell-specific transcription factor 4, or T-cell factor 4, or TCF-4, participates in the Wnt signaling pathway and modulates MYC expression by binding to its promoter in a sequence-specific manner. It acts as a repressor in the absence of CTNNB1, and it activates transcription from promoters with several copies of the TCF motif 5'-CCTTTGATC-3' in the presence of CTNNB1. LEF-1, also called T cell-specific transcription factor 1-alpha (TCF1-alpha), is a transcription factor that binds DNA in a sequence-specific manner. It may participate in the Wnt signaling pathway. All family members contain one HMG-box domain.


Pssm-ID: 438812 [Multi-domain]  Cd Length: 85  Bit Score: 70.44  E-value: 2.35e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK-----WCNKD 1179
Cdd:cd21996     1 HIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSardnyGKKKK 80

                  ...
gi 672035056 1180 RKK 1182
Cdd:cd21996    81 RKR 83
HMG-box_SoxF cd22032
high mobility group (HMG)-box found in group F SRY-related high-mobility group (HMG) box (Sox) ...
1105-1175 2.96e-14

high mobility group (HMG)-box found in group F SRY-related high-mobility group (HMG) box (Sox) transcription factors; SoxF transcription factors includes three members: Sox7, Sox17, and Sox18. They regulate endothelial cell fate as well as development and differentiation of the developing heart, blood cells, and lymphatic vessels.


Pssm-ID: 438841 [Multi-domain]  Cd Length: 76  Bit Score: 69.75  E-value: 2.96e-14
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKW 1175
Cdd:cd22032     1 RIRRPMNAFMVWAKDERKRLADENPDLHNAELSKMLGKKWKSLSLAEKRPFVEEAERLRVQHMQDYPNYKY 71
HMG-box_SoxC cd22029
high mobility group (HMG)-box found in group C SRY-related high-mobility group (HMG) box (Sox) ...
1105-1182 6.62e-14

high mobility group (HMG)-box found in group C SRY-related high-mobility group (HMG) box (Sox) transcription factors; SoxC transcription factors includes three members: SOX4, SOX11 and SOX12. They play key roles, often in redundancy, in multiple developmental pathways, including neurogenesis and skeletogenesis. SOX-4 is a transcriptional activator that promotes neuronal differentiation both in the adult and embryonic neural progenitors. It binds with high affinity to the T-cell enhancer motif 5'-AACAAAG-3' motif. SOX-4 is abnormally expressed in various cancers. SOX-11 is a neural transcriptional factor involved in precursor survival, neuronal fate determination, migration and morphogenesis. SOX-12, also called SOX-22, is a transcription factor essential for embryonic development and cell fate determination. It binds to the sequence 5'-AACAAT-3'.


Pssm-ID: 438838 [Multi-domain]  Cd Length: 76  Bit Score: 68.62  E-value: 6.62e-14
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKK 1182
Cdd:cd22029     1 HIKRPMNAFMVWSQIERRKIMEQQPDMHNAEISKRLGKRWKLLSDSEKIPFIEEAERLRLLHMQEYPDYKY--RPRKK 76
HMG-box_SoxF_SOX7 cd22046
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 7 (SOX7) and similar ...
1096-1182 1.30e-13

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 7 (SOX7) and similar proteins; SOX-7 is an endothelial-associated transcription factor that acts as a tumor suppressor in a number of cancer types. It binds the DNA sequence 5'-AACAAT-3'. SOX-7 belongs to the group F of SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438849 [Multi-domain]  Cd Length: 88  Bit Score: 68.56  E-value: 1.30e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1096 RSP-NKREKDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:cd22046     1 RSPgEKGSEPRIRRPMNAFMVWAKDERKRLAVQNPDLHNAELSKMLGKSWKALTPSQKRPYVEEAERLRVQHMQDYPNYK 80

                  ....*...
gi 672035056 1175 WCNKDRKK 1182
Cdd:cd22046    81 YRPRRKKQ 88
HMG-box_SoxF_SOX18 cd22048
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 18 (SOX18) and ...
1106-1182 1.70e-13

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 18 (SOX18) and similar proteins; SOX-18 is a transcription factor involved in the development of cardiovascular and lymphatic vessels during embryonic development. It binds to the consensus sequence 5'-AACAAAG-3' and is able to trans-activate transcription via this site. SOX-18 belongs to the group F of SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438851 [Multi-domain]  Cd Length: 80  Bit Score: 67.79  E-value: 1.70e-13
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKK 1182
Cdd:cd22048     4 IRRPMNAFMVWAKDERKRLAQQNPDLHNAVLSKMLGQSWKSLSAAEKRPFVEEAERLRVQHLQDHPNYKYRPRRKKQ 80
HMG-box_SoxF_SOX17 cd22047
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 17 (SOX17) and ...
1106-1175 1.09e-12

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 17 (SOX17) and similar proteins; SOX-17 is a developmental transcription regulator involved in endoderm formation, angiogenesis, and carcinogenesis. It acts as a tumor suppressor and modulates WNT signaling. SOX-17 binds target promoter DNA and bends the DNA. It binds to the sequences 5'-AACAAT-'3 or 5'-AACAAAG-3'. SOX-17 belongs to the group F of SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438850 [Multi-domain]  Cd Length: 81  Bit Score: 65.49  E-value: 1.09e-12
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKW 1175
Cdd:cd22047     7 IRRPMNAFMVWAKDERKRLAQQNPDLHNAELSKMLGKSWKALTLAEKRPFVEEAERLRVQHMQDHPNYKY 76
HMG-box_SoxG_SOX15 cd22035
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 15 (SOX15) and ...
1104-1183 2.16e-12

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 15 (SOX15) and similar proteins; SOX-15, also called SOX-12, SOX-20, SOX-26, or SOX-27, is a crucial transcription factor involved in the regulation of embryonic development and in cell fate determination. It binds to the 5'-AACAAT-3' sequence. SOX-15 can be a tumor suppressor in multiple cancer types. SOX-5 is the only member of group G SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438844 [Multi-domain]  Cd Length: 84  Bit Score: 64.72  E-value: 2.16e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1104 DHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKKS 1183
Cdd:cd22035     6 EKVKRPMNAFMVWSSAQRRQMAQEHPKMHNSEISKRLGAAWKLLPEAEKRPFVEEAKRLRARHMRDYPDYKY--RPRRKG 83
PHA03247 PHA03247
large tegument protein UL36; Provisional
1638-2196 5.22e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 5.22e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1638 AAATSPAPHLVAGPLLGTVGKAPATVTNLLVGTPGYGAPASPAVQFIAQGAPGsatpagSGASAGSGPNGPVPLGILQPG 1717
Cdd:PHA03247 2445 AGLAADGDPFFARTILGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPD------PGGGGPPDPDAPPAPSRLAPA 2518
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1718 ALGKAGGITQVQYILPTLPQQLQVAPAPAPAPGTKAAAPSGPAPTTSIRFTLP-PGTSTNGKVLAATAPTAGIPILQSVP 1796
Cdd:PHA03247 2519 ILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRARRPDAPPQSARP 2598
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1797 SAP-PPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVS 1875
Cdd:PHA03247 2599 RAPvDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1876 TPSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYVQSAGGHTLPLGTSSAcsqtgtvtsygPASSVALGFTSLGPSGPAf 1955
Cdd:PHA03247 2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAA-----------RQASPALPAAPAPPAVPA- 2746
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1956 vQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAI 2035
Cdd:PHA03247 2747 -GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2036 LPKGPPASATATPAPTSPFPSATAGSMTYSLV--APKAQRPSPKAPQKVKAAIASIPVGSFESGTTGR-TGPTPRQSLDS 2112
Cdd:PHA03247 2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApgGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRsTESFALPPDQP 2905
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2113 GVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPLPAEERPGTKGPETASKFPSSSSDWRVPGLGLESRGEPPT 2192
Cdd:PHA03247 2906 ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985

                  ....
gi 672035056 2193 PPSP 2196
Cdd:PHA03247 2986 REAP 2989
HMG-box_SoxC_SOX12 cd22038
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 12 (SOX12) and ...
1100-1183 1.10e-10

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 12 (SOX12) and similar proteins; SOX-12, also called SOX-22, is a transcription factor essential for embryonic development and cell fate determination. It binds to the sequence 5'-AACAAT-3'. SOX-12 belongs to the group C of SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438847 [Multi-domain]  Cd Length: 88  Bit Score: 60.08  E-value: 1.10e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1100 KREKDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKD 1179
Cdd:cd22038     6 KTPSGHIKRPMNAFMVWSQIERRKIMEQWPDMHNAEISKRLGRRWKLLQDYEKIPFIKEAERLRLKHMADYPDYKY--RP 83

                  ....
gi 672035056 1180 RKKS 1183
Cdd:cd22038    84 RKKS 87
HMG-box_SoxA_SRY cd22034
high mobility group (HMG)-box found in sex-determining region Y protein (SRY) and similar ...
1103-1182 6.46e-10

high mobility group (HMG)-box found in sex-determining region Y protein (SRY) and similar proteins; SRY, also called testis-determining factor, is a transcriptional regulator that controls a genetic switch in male development. It promotes DNA bending and is also involved in pre-mRNA splicing. The SRY HMG box recognizes DNA by partial intercalation in the minor groove. It binds to the DNA consensus sequence 5'-[AT]AACAA[AT]-3'. SRY is the only member of group A SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438843 [Multi-domain]  Cd Length: 85  Bit Score: 57.72  E-value: 6.46e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1103 KDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKDRKK 1182
Cdd:cd22034     2 QDRVKRPMNAFIVWSRDQRRKMALENPRMRNSEISKQLGYQWKMLTEAEKWPFFQEAQKLQAMHREKYPNYKYRPRRKAK 81
HMG-box_NHP6-like cd01390
high mobility group (HMG)-box found in Saccharomyces cerevisiae non-histone chromosomal ...
1093-1166 6.66e-10

high mobility group (HMG)-box found in Saccharomyces cerevisiae non-histone chromosomal proteins NHP6A, NHP6B and similar proteins; This subfamily includes Saccharomyces cerevisiae high-mobility-group proteins NHP6A and its closely related paralog NHP6B. NHP6A and NHP6B seem to be functionally redundant. They are DNA-binding proteins that induce severe bending of DNA and are required for DNA-binding by the FACT complex, a general chromatin factor that acts to reorganize nucleosomes. They augment the fidelity of transcription by RNA polymerase III independently of any role in the FACT complex. They may also play essential roles in transcriptional initiation fidelity of some but not all tRNA genes.


Pssm-ID: 438792 [Multi-domain]  Cd Length: 81  Bit Score: 57.76  E-value: 6.66e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 672035056 1093 KDGRSPNKREKDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAH 1166
Cdd:cd01390     1 KKKKGRKKKDPNAPKRGLSAYMFFSQDNREKVKEENPDATFGEVGKLLGEKWKELSEEEKAPYEEKAAKDKKRY 74
HMG-box_SoxC_SOX4 cd22036
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 4 (SOX4) and similar ...
1105-1182 1.39e-09

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 4 (SOX4) and similar proteins; SOX-4 is a transcriptional activator that promotes neuronal differentiation both in the adult and embryonic neural progenitors. It binds with high affinity to the T-cell enhancer motif 5'-AACAAAG-3' motif. SOX-4 is abnormally expressed in various cancers. SOX-4 belongs to group C SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438845 [Multi-domain]  Cd Length: 79  Bit Score: 56.68  E-value: 1.39e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1105 HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWcnKDRKK 1182
Cdd:cd22036     2 HIKRPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHMADYPDYKY--RPRKK 77
HMG-box_SoxC_SOX11 cd22037
high mobility group (HMG)-box found in sex determining region Y (SRY)-box 11 (SOX11) and ...
1100-1185 1.61e-09

high mobility group (HMG)-box found in sex determining region Y (SRY)-box 11 (SOX11) and similar proteins; SOX-11 is a neural transcriptional factor involved in precursor survival, neuronal fate determination, migration and morphogenesis. SOX-11 belongs to group C SRY-related high-mobility group (HMG) box (Sox) transcription factors.


Pssm-ID: 438846 [Multi-domain]  Cd Length: 92  Bit Score: 57.03  E-value: 1.61e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1100 KREKDHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWKWCNKD 1179
Cdd:cd22037     6 KTATGHIKRPMNAFMVWSKIERRKIMEQSPDMHNAEISKRLGKRWKMLKDSEKIPFIREAERLRLKHMADYPDYKYRPRK 85

                  ....*.
gi 672035056 1180 RKKSSS 1185
Cdd:cd22037    86 KPKVDP 91
HMG-box_SSRP1-like cd21994
high mobility group (HMG)-box found in structure-specific recognition protein 1 (SSRP1) and ...
1108-1174 1.37e-08

high mobility group (HMG)-box found in structure-specific recognition protein 1 (SSRP1) and similar proteins; SSRP1, also called FACT complex subunit SSRP1, chromatin-specific transcription elongation factor 80 kDa subunit, facilitates chromatin transcription complex 80 kDa subunit (FACT 80 kDa subunit or FACTp80), facilitates chromatin transcription complex subunit SSRP1, recombination signal sequence recognition protein 1, or T160, is a factor that facilitates transcript elongation through nucleosomes. It is a component of the FACT complex, a general chromatin factor that acts to reorganize nucleosomes. The FACT complex is involved in multiple processes that require DNA as a template such as mRNA elongation, DNA replication, and DNA repair.


Pssm-ID: 438810 [Multi-domain]  Cd Length: 67  Bit Score: 53.46  E-value: 1.37e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:cd21994     1 RPMSAYMLWLNENREKIKKENPGISVTEISKKAGEIWKELDEEDKEKWEQKAEKAKERYDKAMKEYK 67
HMG-box_HMG20 cd21980
high mobility group (HMG)-box found in the high mobility group protein 20 (HMG20) subfamily; ...
1107-1164 1.80e-08

high mobility group (HMG)-box found in the high mobility group protein 20 (HMG20) subfamily; The HMG20 subfamily includes HMG20A and HMG20B. HMG20A, also called HMG box-containing protein 20A, HMG domain-containing protein 1, HMG domain-containing protein HMGX1, HMGXB1, or iBRAF, is a chromatin-associated protein involved in neuronal differentiation and maturation. It is required for SNAI1-mediated epithelial to mesenchymal transition. HMG20A acts as an inhibitor of HMG20B. HMG20B, also called SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1-related, SMARCE1-related protein (SMARCE1R), BRCA2-associated factor 35 (BRAF35), HMG box-containing protein 20B, HMG domain-containing protein 2, HMG domain-containing protein HMGX2, Sox-like transcriptional factor, or structural DNA-binding protein BRAF35, is a DNA binding factor that acts as a repressor of erythroid differentiation. It is required for correct progression through the G2 phase of the cell cycle and entry into mitosis. It is also required for RCOR1/CoREST mediated repression of neuronal specific gene promoters. HMG20B is a core subunit of the Lys-specific demethylase 1/REST co-repressor 1 (LSD1-CoREST) histone demethylase complex. Both HMG20A and HMG20B contain one HMG-box.


Pssm-ID: 438796 [Multi-domain]  Cd Length: 81  Bit Score: 53.32  E-value: 1.80e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKE 1164
Cdd:cd21980     3 KAPLTGYVRFLNERREKLRAENPDLSFPEITKILGAEWSSLSAEEKQKYLDEAEKDKE 60
HMG-box_ABF2_IXR1-like_rpt2 cd22012
second high mobility group (HMG)-box found in Saccharomyces cerevisiae mitochondrial ...
1108-1157 6.68e-08

second high mobility group (HMG)-box found in Saccharomyces cerevisiae mitochondrial ARS-binding factor 2 (ABF2), intrastrand cross-link recognition protein (Ixr1) and similar proteins; ABF2 is a close relative of the nuclear, chromosomal high-mobility group protein HMG1 in yeast mitochondria. It specifically binds to the autonomously replicating sequence 1 (ARS1). It might play a positive role in gene expression and replication. Ixr1, also called structure-specific recognition protein (SSRP), is a homolog of the yeast mitochondrial regulator ABF2. It binds to platinated DNA and confers sensitivity to the anticancer drug cisplatin. Both ABF2 and Ixr1 contain two HMG-box domains. This model corresponds to the second one.


Pssm-ID: 438828 [Multi-domain]  Cd Length: 64  Bit Score: 51.13  E-value: 6.68e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHD 1157
Cdd:cd22012     4 RPASAYFLFAKEVRPKLKEENPNEKYTDITKLISEKWRSLDESEKEPYQE 53
HMG-box_AtHMGB6-like_rpt1 cd22006
first high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B ...
1107-1168 1.45e-07

first high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B protein 6 (HMGB6) and similar proteins; HMGB6, also called nucleosome/chromatin assembly factor group D 06 (or D 6), WRKY transcription factor 53 (WRKY53), or WRKY DNA-binding protein 53, is a master regulator of age-induced leaf senescence. It acts in a complex transcription factor signaling network regulating senescence specific gene expression; hydrogen peroxide might be involved in signal transduction. The subfamily also includes Arabidopsis thaliana HMGB13 (also known as nucleosome/chromatin assembly factor group D 13). Both HMGB6 and HMGB13 contain three HMG-box domains. This model corresponds to the first one.


Pssm-ID: 438822 [Multi-domain]  Cd Length: 68  Bit Score: 50.53  E-value: 1.45e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFK 1168
Cdd:cd22006     1 KKPKTAYFLWCKDQREEVKKENPNADFSEVSKILGAKWKNLSEEEKKPYEEKYKEEKEKYLK 62
PHA03247 PHA03247
large tegument protein UL36; Provisional
749-1060 3.27e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.49  E-value: 3.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  749 VSHTPTPSTPAGFRAVSPAVPFSRSrQPSPLLLLPPPAGLTSDPGPSVRRVPAvqrdSPVIVRNPDVPLPSKFPgevgaa 828
Cdd:PHA03247 2716 VSATPLPPGPAAARQASPALPAAPA-PPAVPAGPATPGGPARPARPPTTAGPP----APAPPAAPAAGPPRRLT------ 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  829 gearaggpgrgcretpvPPGVASGKPSLPPPLPAPVpitvPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVP 908
Cdd:PHA03247 2785 -----------------RPAVASLSESRESLPSPWD----PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  909 SSYPSHPAPKKEVIMG-----RPGTVWTNVEPRSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAES 983
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGgdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  984 AAVAHEQPPGGTGGADPGRPP----GATCPESPGPGPPLTLGGVDPGKSLPPTTEEEAPGPPGE-PRLDSETESDHDDAF 1058
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQPPLApttdPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREaPASSTPPLTGHSLSR 3003

                  ..
gi 672035056 1059 LS 1060
Cdd:PHA03247 3004 VS 3005
NHP6B COG5648
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics];
1077-1221 5.24e-07

Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics];


Pssm-ID: 227935 [Multi-domain]  Cd Length: 211  Bit Score: 52.94  E-value: 5.24e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1077 TQSLSALPKERDSSSEKDGRSPnkrekdhiRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYH 1156
Cdd:COG5648    50 TKPRKKTKSKRLVRKKKDPNGP--------KRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYY 121
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1157 DLAFQVKEAHFKA-HPDwkwcnkDRKKSSSEAKPASLGLAGGHKETRERSMSETG----TAAAPGVSSEL 1221
Cdd:COG5648   122 KEANSDRERYQREkEEY------NKKLPNKAPIGPFIENEPKIRPKVEGPSPDKAlveeTKIISKAWSEL 185
HMG-box_AtHMGB6-like_rpt3 cd22008
third high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B ...
1098-1169 5.44e-07

third high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B protein 6 (HMGB6) and similar proteins; HMGB6, also called nucleosome/chromatin assembly factor group D 06 (or D 6), WRKY transcription factor 53 (WRKY53), or WRKY DNA-binding protein 53, is a master regulator of age-induced leaf senescence. It acts in a complex transcription factor signaling network regulating senescence specific gene expression; hydrogen peroxide might be involved in signal transduction. The subfamily also includes Arabidopsis thaliana HMGB13 (also known as nucleosome/chromatin assembly factor group D 13). Both HMGB6 and HMGB13 contain three HMG-box domains. This model corresponds to the third one.


Pssm-ID: 438824 [Multi-domain]  Cd Length: 78  Bit Score: 49.18  E-value: 5.44e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1098 PNKREKdhirrPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKA 1169
Cdd:cd22008     7 PNKPKK-----PASSYLLFGKEYRKKLQEERPGANNATVTALISLKWKELGEEEKQVYNDKAAVLMEKYKKE 73
HMG-box_HMGB_rpt2 cd21979
second high mobility group (HMG)-box found in the high mobility group protein B (HMGB) family; ...
1107-1168 5.56e-07

second high mobility group (HMG)-box found in the high mobility group protein B (HMGB) family; HMGB proteins are chromatin-associated nuclear proteins that act as architectural factors in nucleoprotein structures, which regulate DNA-dependent processes including transcription. In mammals, four family members are present: HMGB1, HMGB2, HMGB3 and HMGB4. They regulate the expression of a wide range of genes through architectural remodeling of the chromatin structure. HMGB1, also called high mobility group protein 1 (HMG-1), is a prototypical alarmin or damage-associated molecular pattern (DAMP) molecule when released from cells. It plays important roles in the regulation of a wide range of processes, including transcription, replication, DNA repair, and nucleosome formation, in the nucleus. It also plays multiple roles in regulating inflammation and responses to cell and tissue stress. HMGB2, also called high mobility group protein 2 (HMG-2), has been implicated in numerous cellular processes, including proliferation, differentiation, apoptosis, and tumor growth. It acts as a chromatin-associated nonhistone protein involved in transcriptional regulation and nucleic-acid-mediated innate immune responses in mammalian. It binds DNA to stabilize nucleosomes and promote transcription. HMGB3, also called high mobility group protein 2a (HMG-2a), or high mobility group protein 4 (HMG-4), is an X-linked member of HMGB family and functions as a universal sentinel for nucleic acid-mediated innate immune responses. HMGB3 has been implicated in the regulation of cellular proliferation and differentiation, as well as inflammatory response. HMGB4 is expressed by neuronal cells and affects the expression of genes involved in neural differentiation. It is a factor that regulates chromatin and expression of neuronal differentiation markers. The family also includes high mobility group protein B1 pseudogene 1 (HMGB1P1) and nuclear auto-antigen Sp-100. HMGB1P1, also called putative high mobility group protein B1-like 1 (HMGB1L1), or putative high mobility group protein 1-like 1 (HMG-1L1), is an HMG-box containing protein that binds preferentially single-stranded DNA and unwinds double-stranded DNA. Sp-100, also called nuclear dot-associated Sp100 protein, or speckled 100 kDa. It is a tumor suppressor that is a major constituent of the promyelocytic leukemia (PML) bodies, a subnuclear organelle involved in many physiological processes including cell growth, differentiation and apoptosis. Through the regulation of ETS1, Sp-100 may play a role in angiogenesis, controlling endothelial cell motility and invasion. It may also play roles in the regulation of telomeres lengthening, TP53-mediated transcription, FAS-mediated apoptosis, etc. In addition, the family includes Drosophila melanogaster high mobility group protein DSP1 (dDSP1) and similar proteins. dDSP1, also called protein dorsal switch 1, is a Drosophila HMG1 protein that binds preferentially single-stranded DNA and unwinds double-stranded DNA. It converts Dorsal and nuclear factor (NF)-kappa B from transcriptional activators to repressors. Members of the HMGB family contain two HMG-box domains. This model corresponds to the second one.


Pssm-ID: 438795 [Multi-domain]  Cd Length: 71  Bit Score: 48.95  E-value: 5.56e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFK 1168
Cdd:cd21979     4 KRPPSAFFLFCSEHRPKIKGEHPGLSIGDVAKKLGEMWNNTSAKDKQPYEKKAAKLKEKYEK 65
PTZ00199 PTZ00199
high mobility group protein; Provisional
1089-1163 1.26e-06

high mobility group protein; Provisional


Pssm-ID: 185511 [Multi-domain]  Cd Length: 94  Bit Score: 48.70  E-value: 1.26e-06
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 1089 SSSEKDGRSPNKREKDHI--RRPMNAFMIFSKRHRALVHQRHPN--QDNRTVSKILGEWWYALGPKEKQKYHDLAFQVK 1163
Cdd:PTZ00199    4 KQGKVLVRKNKRKKKDPNapKRALSAYMFFAKEKRAEIIAENPElaKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDK 82
HMG-box_ABF2-like_rpt1 cd22010
first high mobility group (HMG)-box found in Saccharomyces cerevisiae mitochondrial ...
1107-1173 1.30e-06

first high mobility group (HMG)-box found in Saccharomyces cerevisiae mitochondrial ARS-binding factor 2 (ABF2) and similar proteins; ABF2 is a close relative of the nuclear, chromosomal high-mobility group protein HMG1 in yeast mitochondria. It specifically binds to the autonomously replicating sequence 1 (ARS1). It might play a positive role in gene expression and replication. ABF2 contains two HMG-box domains. This model corresponds to the first one.


Pssm-ID: 438826 [Multi-domain]  Cd Length: 68  Bit Score: 47.92  E-value: 1.30e-06
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDW 1173
Cdd:cd22010     2 KRPLSAYFLYFQEHRSDFVKENPDAKMTEISKIGGDKWKNLSADDKKKYEDDFQRELSEYQKAKAEF 68
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1949-2188 2.00e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.34  E-value: 2.00e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1949 GPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPV--ITAFYPGSPAPTSAPLGPPSQAPPSLVYTVAT 2026
Cdd:PRK12323  370 GGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAarAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2027 STTPPAAAilpkgpPASATATPAPTSPFPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPVGSFESGTTGRTGPTP 2106
Cdd:PRK12323  450 PAPAPAAA------PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2107 RQSLDSGVAREPAAPESELEGqptppappppTETWPPTARSSPPPPLPAEERPGTKGPETASKFPsssSDWRVPGLGLES 2186
Cdd:PRK12323  524 ESIPDPATADPDDAFETLAPA----------PAAAPAPRAAAATEPVVAPRPPRASASGLPDMFD---GDWPALAARLPV 590

                  ..
gi 672035056 2187 RG 2188
Cdd:PRK12323  591 RG 592
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1781-2167 2.37e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.07  E-value: 2.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1781 AATAPTAGIPILQSVPSAPPPKAQSVSPvQATPSGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGA 1860
Cdd:PRK07764  402 AAAAPAAAPAPAAAAPAAAAAPAPAAAP-QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAP 480
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1861 QQPSKIIQLTPVPVSTPSGLVPPLSPATMPGPTSQPQKVL--LPSSTRIT------YVQSAG--GHTLPLGTSS-----A 1925
Cdd:PRK07764  481 APAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEILaaVPKRSRKTwaillpEATVLGvrGDTLVLGFSTgglarR 560
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1926 CSQTGTVTSYGPASSVALGFT------------SLGPSGPAFVQPLLSAGQAPllAPGQVGVSPVPSPQLPPACTAPGGP 1993
Cdd:PRK07764  561 FASPGNAEVLVTALAEELGGDwqveavvgpapgAAGGEGPPAPASSGPPEEAA--RPAAPAAPAAPAAPAPAGAAAAPAE 638
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1994 VITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAilPKGPPASATATPAPTSPFPSATAGSMTYSLVAPKAQR 2073
Cdd:PRK07764  639 ASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA--APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2074 PSPKAPQkvkaaiasipvgsfesGTTGRTGPTPRQSLDSGVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPL 2153
Cdd:PRK07764  717 PAAQPPQ----------------AAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
                         410
                  ....*....|....
gi 672035056 2154 PAEERPGTKGPETA 2167
Cdd:PRK07764  781 EEEEMAEDDAPSMD 794
HMG_box_2 pfam09011
HMG-box domain; This short 71 residue domain is an HMG-box domain. HMG-box domains mediate ...
1098-1173 7.69e-06

HMG-box domain; This short 71 residue domain is an HMG-box domain. HMG-box domains mediate re-modelling of chromatin-structure. Mammalian HMG-box proteins are of two types: those that are non-sequence-specific DNA-binding proteins with two HMG-box domains and a long highly acidic C-tail; and a diverse group of sequence-specific transcription factor-proteins with either a single HMG-box or up to six copies, and no acidic C-tail.


Pssm-ID: 430369 [Multi-domain]  Cd Length: 72  Bit Score: 45.86  E-value: 7.69e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 672035056  1098 PNKRekdhiRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDW 1173
Cdd:pfam09011    1 PNKP-----KRARNAYAFFVQEMIPEHKRQNPVIGFAEVSKLCSERWKNLSEEEKEKYEEMAKEDKNRYDREMGTY 71
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1947-2109 7.75e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 51.22  E-value: 7.75e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1947 SLGPSGPAFVQPLLSAGQAPllAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLG--PPSQAPPslvytv 2024
Cdd:PRK14959  370 SLRPSGGGASAPSGSAAEGP--ASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPrvPWDDAPP------ 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2025 atstTPPAAAILPKGPPASATATPAPTSPFPSATAGSMTYSLVAPKAqrPSPKAPQKVKAAIASIPVGSFESGTTGRTGP 2104
Cdd:PRK14959  442 ----APPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSD--TAEHTPSGPRTWDGFLEFCQGRNGQGGRLAT 515

                  ....*
gi 672035056 2105 TPRQS 2109
Cdd:PRK14959  516 VLRQA 520
HMG-box_AtHMGB6-like_rpt2 cd22007
second high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B ...
1107-1169 1.02e-05

second high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B protein 6 (HMGB6) and similar proteins; HMGB6, also called nucleosome/chromatin assembly factor group D 06 (or D 6), WRKY transcription factor 53 (WRKY53), or WRKY DNA-binding protein 53, is a master regulator of age-induced leaf senescence. It acts in a complex transcription factor signaling network regulating senescence specific gene expression; hydrogen peroxide might be involved in signal transduction. The subfamily also includes Arabidopsis thaliana HMGB13 (also known as nucleosome/chromatin assembly factor group D 13). Both HMGB6 and HMGB13 contain three HMG-box domains. This model corresponds to the second one.


Pssm-ID: 438823 [Multi-domain]  Cd Length: 68  Bit Score: 45.32  E-value: 1.02e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKA 1169
Cdd:cd22007     2 KRPQSAYFLYANDRRAALKEENKNVKITEIAKMLGAEWKNLSDAKKKPYEEKAKKLKEAYLQE 64
HMG-box_UBF1_rpt4 cd22001
fourth high mobility group (HMG)-box found in upstream-binding factor 1 (UBF1) and similar ...
1104-1164 1.15e-05

fourth high mobility group (HMG)-box found in upstream-binding factor 1 (UBF1) and similar proteins; UBF1, also called UBTF, nucleolar transcription factor 1, or auto-antigen NOR-90, is a nucleolar transcription factor that recognizes the ribosomal RNA gene promoter and activates transcription mediated by RNA polymerase I through cooperative interactions with the transcription factor SL1/TIF-IB complex. It binds specifically to the upstream control element. UBF1 contains six HMG-box domains. This model corresponds to the fourth one.


Pssm-ID: 438817 [Multi-domain]  Cd Length: 66  Bit Score: 44.98  E-value: 1.15e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 1104 DHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKE 1164
Cdd:cd22001     1 EKPKRPISAMFIYSKEKRSKLKKKHPELSEQELTRLLAKKYNELPDKKKAKYKKKEALAKA 61
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1759-2125 1.25e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.94  E-value: 1.25e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1759 PAPTTSIRFTLPPGTSTNGKVLAATAPTAGIPilqSVPSAPPPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRG 1838
Cdd:PHA03307   80 PANESRSTPTWSLSTLAPASPAREGSPTPPGP---SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAG 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1839 GGAGQ-PLPLVSSPFSVPVQNGAQQPSKiiqltpvPVSTPSGLVPPLSPATMPGPTSQPqkvllPSSTRITYVQSAGGHT 1917
Cdd:PHA03307  157 ASPAAvASDAASSRQAALPLSSPEETAR-------APSSPPAEPPPSTPPAAASPRPPR-----RSSPISASASSPAPAP 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1918 LPLGTSSACSQTGTVTSYGPASSvALGFTSLGPSGPAFVQPLLSagqAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITA 1997
Cdd:PHA03307  225 GRSAADDAGASSSDSSSSESSGC-GWGPENECPLPRPAPITLPT---RIWEASGWNGPSSRPGPASSSSSPRERSPSPSP 300
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1998 FYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFPSATAGSMT---YSLVAPKAQRP 2074
Cdd:PHA03307  301 SSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRkrpRPSRAPSSPAA 380
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 672035056 2075 SP--KAPQKVKAAIASIPVGSFESGTTGRTGPTPRQSLDSGVAREPAAPESEL 2125
Cdd:PHA03307  381 SAgrPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLL 433
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1873-2082 1.83e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.14  E-value: 1.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1873 PVSTPSGLVPPLSPATMPGPTSQPQKVL-LPSSTRITYVQSAGGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLGPS 1951
Cdd:COG3469    11 TAGGASATAVTLLGAAATAASVTLTAATaTTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAAT 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1952 GPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPP 2031
Cdd:COG3469    91 STSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 672035056 2032 AAAILPKGPPASATATPAPTSPFPSATAGSMtyslvAPKAQRPSPKAPQKV 2082
Cdd:COG3469   171 TTTSASTTPSATTTATATTASGATTPSATTT-----ATTTGPPTPGLPKHV 216
HMG-box_NHP10-like cd22016
high mobility group (HMG)-box found in Saccharomyces cerevisiae non-histone protein 10 (NHP10) ...
1092-1164 1.93e-05

high mobility group (HMG)-box found in Saccharomyces cerevisiae non-histone protein 10 (NHP10) and similar proteins; NHP10, also called high mobility group protein 2, is probably involved in transcription regulation via its interaction with the INO80 complex, a chromatin remodeling complex.


Pssm-ID: 438832 [Multi-domain]  Cd Length: 79  Bit Score: 45.02  E-value: 1.93e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1092 EKDGRSPnkrekdhiRRPMNAFMIFSKRHRALVHQR----HPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKE 1164
Cdd:cd22016     2 EKDPNAP--------KRPANAFFLFCQEQREKVREEykeeHQEIDHHDLTKALAQAWRNLDAEDKKPYYELYEKDKE 70
PHA03247 PHA03247
large tegument protein UL36; Provisional
1508-1894 2.96e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 2.96e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1508 PESVGSLEAPGSSVIAAPPSGGGNILQTLVLPPSKEDREGTRVPSAPAPSLAYGAPAAPlcrPAATMVTNVVRPVSSTPV 1587
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR---PTVGSLTSLADPPPPPPT 2707
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1588 PIASKPFPTSGRAEASSNDTVGARTEMGTGSRVPGGSPLGVSLVYSDKKSAAATSPAPHLVAGPLLGTVGKAPATVTNLL 1667
Cdd:PHA03247 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPA 2787
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1668 VGTPGYGAPASPAVQFIAQgAPGSATPAGSGASAGSGPNGPVPlgilQPGALGKAGGITQVQYILPTLPQQLQVapapAP 1747
Cdd:PHA03247 2788 VASLSESRESLPSPWDPAD-PPAAVLAPAAALPPAASPAGPLP----PPTSAQPTAPPPPPGPPPPSLPLGGSV----AP 2858
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1748 APGTKAAAPSGPAPTTSIRFTLPPGTSTNGKVLAATAPTAGIPILQSVPSAPPPKAQSVSPVQATPSGGSAQLLPGKVLV 1827
Cdd:PHA03247 2859 GGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR 2938
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1828 PLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVP-VSTPSGLVPPLSPATMPGPTS 1894
Cdd:PHA03247 2939 PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPsREAPASSTPPLTGHSLSRVSS 3006
HMG-box_PMS1 cd21985
high mobility group (HMG)-box found in PMS1 protein homolog 1 (PMS1) and similar proteins; ...
1106-1166 3.21e-05

high mobility group (HMG)-box found in PMS1 protein homolog 1 (PMS1) and similar proteins; PMS1, also called DNA mismatch repair protein PMS1, is probably involved in the repair of mismatches in DNA.


Pssm-ID: 438801 [Multi-domain]  Cd Length: 73  Bit Score: 44.07  E-value: 3.21e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAH 1166
Cdd:cd21985     2 IRKPMSASALFEQETRPQFLAENPKASLQDITLKIEERWKNLSEEEKKKYEEKAAKDLERY 62
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1961-2088 3.30e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.42  E-value: 3.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1961 SAGQAPL-LAPGQVGVSPVPSPQlppactaPGGPviTAFYPGSPAPTSAPLGPPSQAPPSlvytvATSTTPPAAAILPKG 2039
Cdd:PRK14950  345 SYGQLPLeLAVIEALLVPVPAPQ-------PAKP--TAAAPSPVRPTPAPSTRPKAAAAA-----NIPPKEPVRETATPP 410
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 672035056 2040 PPASATATPAPTSPFPSA--TAGSMTYSLVAPKAQRPSPKaPQKVKAAIAS 2088
Cdd:PRK14950  411 PVPPRPVAPPVPHTPESApkLTRAAIPVDEKPKYTPPAPP-KEEEKALIAD 460
HMG-box_AtHMGB1-like cd22005
high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B protein 1 ...
1108-1163 3.68e-05

high mobility group (HMG)-box found in Arabidopsis thaliana high mobility group B protein 1 (HMGB1) and similar proteins; This subfamily contains a group of Arabidopsis thaliana HMGB family proteins, including HMGB1, 2, 3, 4, 5, 7, 12 and 14. They bind preferentially to double-stranded DNA. HMGB1 modulates general plant growth and stress tolerance and confers sensitivity to salt and genotoxic (methyl methanesulfonate, MMS) stresses. HMGB2 and HMGB5 confer sensitivity to salt and drought stresses. HMGB7 is required for karyogamy during female gametophyte development, when the two polar nuclei fuse to form the diploid central cell nucleus. Members of this subfamily contain only one HMG-box domain.


Pssm-ID: 438821 [Multi-domain]  Cd Length: 60  Bit Score: 43.45  E-value: 3.68e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVK 1163
Cdd:cd22005     2 RPPTAFFLFLADFRKTYKKKNPDKSVKAVGKAAGEKWKSMSDEEKAPYVEKAEKEK 57
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1835-2061 7.42e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 48.21  E-value: 7.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1835 SVRGGGAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVSTPSGLVPPLSPATMP-GPTSQPQKVLLPSSTRITYVQSA 1913
Cdd:COG3469     3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGsGTGTTAASSTAATSSTTSTTATA 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1914 GGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSpvpspqlppACTAPGGP 1993
Cdd:COG3469    83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAG---------STTTTTTV 153
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1994 VITAFYPGSPAPTSAPLGPPSQAPPslvYTVATSTTPPAAAilpkGPPASATATPAPTSPFPSATAGS 2061
Cdd:COG3469   154 SGTETATGGTTTTSTTTTTTSASTT---PSATTTATATTAS----GATTPSATTTATTTGPPTPGLPK 214
dnaA PRK14086
chromosomal replication initiator protein DnaA;
868-1043 1.02e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.51  E-value: 1.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  868 VPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVPSSYP----SHPAPKKEVIMGRPGTvWtnvePRSVAVFPW 943
Cdd:PRK14086   93 GEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDqlptARPAYPAYQQRPEPGA-W----PRAADDYGW 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  944 HSlvPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAESAA-------VAHEQPPGGTGGADPGRPPGATCPESPGPGP 1016
Cdd:PRK14086  168 QQ--QRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRrrdydhpRPDWDRPRRDRTDRPEPPPGAGHVHRGGPGP 245
                         170       180       190
                  ....*....|....*....|....*....|..
gi 672035056 1017 P-----LTLGGVDPGKSLPPTTEEEAPGPPGE 1043
Cdd:PRK14086  246 PerddaPVVPIRPSAPGPLAAQPAPAPGPGEP 277
HMG-box_HMGB_rpt1 cd21978
first high mobility group (HMG)-box found in the high mobility group protein B (HMGB) family; ...
1107-1164 1.14e-04

first high mobility group (HMG)-box found in the high mobility group protein B (HMGB) family; HMGB proteins are chromatin-associated nuclear proteins that act as architectural factors in nucleoprotein structures, which regulate DNA-dependent processes including transcription. In mammals, four family members are present: HMGB1, HMGB2, HMGB3 and HMGB4. They regulate the expression of a wide range of genes through architectural remodeling of the chromatin structure. HMGB1, also called high mobility group protein 1 (HMG-1), is a prototypical alarmin or damage-associated molecular pattern (DAMP) molecule when released from cells. It plays important roles in the regulation of a wide range of processes, including transcription, replication, DNA repair, and nucleosome formation, in the nucleus. It also plays multiple roles in regulating inflammation and responses to cell and tissue stress. HMGB2, also called high mobility group protein 2 (HMG-2), has been implicated in numerous cellular processes, including proliferation, differentiation, apoptosis, and tumor growth. It acts as a chromatin-associated nonhistone protein involved in transcriptional regulation and nucleic-acid-mediated innate immune responses in mammalian cells. It binds DNA to stabilize nucleosomes and promote transcription. HMGB3, also called high mobility group protein 2a (HMG-2a), or high mobility group protein 4 (HMG-4), is an X-linked member of the HMGB family that functions as a universal sentinel for nucleic acid-mediated innate immune responses. HMGB3 has been implicated in the regulation of cellular proliferation and differentiation, as well as inflammatory responses. HMGB4 is expressed by neuronal cells and affects the expression of genes involved in neural differentiation. It is a factor that regulates chromatin and expression of neuronal differentiation markers. This family also includes high mobility group protein B1 pseudogene 1 (HMGB1P1) and nuclear auto-antigen Sp-100. HMGB1P1, also called putative high mobility group protein B1-like 1 (HMGB1L1), or putative high mobility group protein 1-like 1 (HMG-1L1), is an HMG-box containing protein that binds preferentially single-stranded DNA and unwinds double-stranded DNA. Sp-100, also called nuclear dot-associated Sp100 protein, or speckled 100 kDa, is a tumor suppressor that is a major constituent of promyelocytic leukemia (PML) bodies, a subnuclear organelle involved in many physiological processes including cell growth, differentiation and apoptosis. Through the regulation of ETS1, Sp-100 may play a role in angiogenesis, controlling endothelial cell motility and invasion. It may also play roles in the regulation of telomere lengthening, TP53-mediated transcription, FAS-mediated apoptosis, etc. In addition, the family includes Drosophila melanogaster high mobility group protein DSP1 (dDSP1) and similar proteins. dDSP1, also called protein dorsal switch 1, binds preferentially to single-stranded DNA and unwinds double-stranded DNA. It converts Dorsal and nuclear factor (NF)-kappa B from transcriptional activators to repressors. Members of the HMGB family contain two HMG-box domains. This model corresponds to the first one.


Pssm-ID: 438794 [Multi-domain]  Cd Length: 69  Bit Score: 42.29  E-value: 1.14e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQ--DNRTVSKILGEWWYALGPKEKQKYHDLAFQVKE 1164
Cdd:cd21978     3 RGKMSSYAFFVQTCREEHKKKHPNEsvNFSEFSKKCSERWKTMSAKEKKKFEDMAKKDKA 62
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1914-2115 1.52e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 1.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1914 GGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGP 1993
Cdd:PRK12323  369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP 448
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1994 VITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAA-AILPKGPPASATATPAPTSPFP----SATAGSMTYSLVA 2068
Cdd:PRK12323  449 APAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAApAPADDDPPPWEELPPEFASPAPaqpdAAPAGWVAESIPD 528
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 672035056 2069 PKAQRPSPKAPQKVKAAIASiPVGSFESGTTGRTGPTPRQSLDSGVA 2115
Cdd:PRK12323  529 PATADPDDAFETLAPAPAAA-PAPRAAAATEPVVAPRPPRASASGLP 574
PHA03378 PHA03378
EBNA-3B; Provisional
1950-2116 2.30e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 2.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1950 PSGPAFVQPLLSA---GQAPLLAPGqvgvsPVPSPQLPPACTAPGGPVITAFYPGSPAPTSA--PLGPPSQAPPSlvYTV 2024
Cdd:PHA03378  678 PTGANTMLPIQWApgtMQPPPRAPT-----PMRPPAAPPGRAQRPAAATGRARPPAAAPGRArpPAAAPGRARPP--AAA 750
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2025 ATSTTPPAAAILPKGPPASATATPAPTSPfPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPvgsfesGTTGRTGP 2104
Cdd:PHA03378  751 PGRARPPAAAPGRARPPAAAPGAPTPQPP-PQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAP------GQQGPTKQ 823
                         170
                  ....*....|..
gi 672035056 2105 TPRQSLDSGVAR 2116
Cdd:PHA03378  824 ILRQLLTGGVKR 835
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
847-1051 2.99e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 2.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  847 PGVASGKPSLPPPLPAPVPITVPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVPSSYPSHPAPKKEVIMGRP 926
Cdd:PRK12323  365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  927 GTvwtnveprsvAVFPWHSLVPFLAPSQPDPSVQPSEAqqPASHPVASNQSKEPAESAAVAHEQPPGGTGGADPGRPPGA 1006
Cdd:PRK12323  445 GG----------APAPAPAPAAAPAAAARPAAAGPRPV--AAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPA 512
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 672035056 1007 tcPESPGPGPPLTLGGVDPGKSLPPT---TEEEAPGPPGEPRLDSETE 1051
Cdd:PRK12323  513 --QPDAAPAGWVAESIPDPATADPDDafeTLAPAPAAAPAPRAAAATE 558
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1921-2109 3.08e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 3.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1921 GTSSACSQTGTVTSYGPASSV-ALGFTSLGPSGPAFVQPLLSAGQAP--LLAPGQVG--VSPVPSPQLPPACTAPGGPvi 1995
Cdd:pfam05109  409 ATNATTTTHKVIFSKAPESTTtSPTLNTTGFAAPNTTTGLPSSTHVPtnLTAPASTGptVSTADVTSPTPAGTTSGAS-- 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1996 tafyPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAilpkgPPASATATPAPTSPFPSATAGSMTYSLVAPKAQRPS 2075
Cdd:pfam05109  487 ----PVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT-----SPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATS 557
                          170       180       190
                   ....*....|....*....|....*....|....
gi 672035056  2076 PKAPQKVKAAIASIPVGSFESGTTGRTGPTPRQS 2109
Cdd:pfam05109  558 PTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNAT 591
PHA03247 PHA03247
large tegument protein UL36; Provisional
1563-2152 3.10e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 3.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1563 PAAPLCRPAAtmvtnvvrPVSSTPVPiasKPFPTSGRAEASSNDTVGARTEMGTGSRVPGGSPlgvslvySDKKSAAATS 1642
Cdd:PHA03247 2554 PLPPAAPPAA--------PDRSVPPP---RPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDR-------GDPRGPAPPS 2615
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1643 PAPHLVAGPLLGTVGKAPATVTnllvGTPGYGAPASPAVQFIAQGAPGSATPAGSGASAGSGPNGPVPLGILQPGALGKA 1722
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANE----PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPT 2691
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1723 GGItqvqyiLPTLPQQLQVAPAPAPAPGTKAAAPSGPAPTTSIRFTLPPGTSTngkvlAATAPTAGIPILQSVPSAPPPK 1802
Cdd:PHA03247 2692 VGS------LTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAA-----PAPPAVPAGPATPGGPARPARP 2760
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1803 AQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRgggagQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVSTPSGLVP 1882
Cdd:PHA03247 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESR-----ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1883 PLSPATMPGPtsqpqkvlLPSSTRITYVQSAGGHTLPLGTSSACSQTGTVTSYGPASSVALgfTSLGPSGPAFVQPLLS- 1961
Cdd:PHA03247 2836 PTAPPPPPGP--------PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR--PAVSRSTESFALPPDQp 2905
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1962 ------AGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPaaai 2035
Cdd:PHA03247 2906 erppqpQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQ---- 2981
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2036 lpkgpPASATATPAPTSPFPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPVGSFESGTTGRTGPTPRQSLDSGVA 2115
Cdd:PHA03247 2982 -----PAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALD 3056
                         570       580       590
                  ....*....|....*....|....*....|....*..
gi 672035056 2116 REPAAPeselegqptppappppteTWPPTARSSPPPP 2152
Cdd:PHA03247 3057 PLPPEP------------------HDPFAHEPDPATP 3075
HMG-box_BHMG1 cd21977
high mobility group (HMG)-box found in basic helix-loop-helix and HMG box domain-containing ...
1110-1166 3.11e-04

high mobility group (HMG)-box found in basic helix-loop-helix and HMG box domain-containing protein 1 (BHMG1) and similar proteins; BHMG1 is an uncharacterized HMG-box containing protein that contains a degenerate basic motif not likely to bind DNA.


Pssm-ID: 438793 [Multi-domain]  Cd Length: 66  Bit Score: 41.15  E-value: 3.11e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1110 MNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAH 1166
Cdd:cd21977     5 VNGFIMFCRLNRKNYIDKHPGLASTELTKELGQLWRELSAEEKKPYCVRARELSQLH 61
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1773-1984 3.36e-04

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 45.90  E-value: 3.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1773 TSTNGKVLAATAPTAGIPILQSVPSAPPPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPSMSVRGGGAGQPLPLVSSPF 1852
Cdd:COG3469     2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1853 SVPVQNGAQQPSKIIQLTPVPVSTPSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYVQSAGGHTLPLGTSSACSQTGTV 1932
Cdd:COG3469    82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1933 TSYGPASSVALGFTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLP 1984
Cdd:COG3469   162 GTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1953-2090 4.26e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.48  E-value: 4.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1953 PAFVQPLLSAGQAPLLAPGQVGvSPVPSPQLPPActAPGGPVITAfypgsPAPTSAPLGPPSQAPPslvytvATSTTPPA 2032
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAA-APAAAPVAQAA--AAPAPAAAP-----AAAASAPAAPPAAAPP------APVAAPAA 431
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 672035056 2033 AAILPKGPPASATATPAPTSPFPSATAGSMTYSLVAP-KAQRPSPKAPQKVKAAIASIP 2090
Cdd:PRK14951  432 AAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPePAVASAAPAPAAAPAAARLTP 490
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1875-2059 5.03e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 45.61  E-value: 5.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1875 STPSGLVPPLSPATMPGPtsQPQKVLLPSSTRITYVQSAGGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLGPSGPA 1954
Cdd:PRK07003  366 GAPGGGVPARVAGAVPAP--GARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAA 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1955 fvqpllsAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAA 2034
Cdd:PRK07003  444 -------DGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAA 516
                         170       180
                  ....*....|....*....|....*
gi 672035056 2035 ILPKGPPASATATPAPTSPFPSATA 2059
Cdd:PRK07003  517 SREDAPAAAAPPAPEARPPTPAAAA 541
HMG-box_SMARCE1 cd21983
high mobility group (HMG)-box found in SWI/SNF-related matrix-associated actin-dependent ...
1108-1157 5.19e-04

high mobility group (HMG)-box found in SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1 (SMARCE1) and similar proteins; SMARCE1, also called BRG1-associated factor 57 (BAF57), is a ubiquitously expressed protein involved in transcriptional activation and repression of select genes by chromatin remodeling. It is a component of SWI/SNF chromatin remodeling complexes that carry out key enzymatic activities, changing chromatin structure by altering DNA-histone contacts within a nucleosome in an ATP-dependent manner. SMARCE1 has a single HMG domain that displays non-specific DNA-binding characteristics. It also contains a kinesin-like coiled-coil (KLCC) domain.


Pssm-ID: 438799 [Multi-domain]  Cd Length: 73  Bit Score: 40.74  E-value: 5.19e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHD 1157
Cdd:cd21983     7 KPLMPYMRYSRKVWDQVKASNPDLKLWEIGKIIGQMWRELSDEEKQEYIE 56
HMG-box_PB1 cd21984
high mobility group (HMG)-box found in protein polybromo-1 (PB1) and similar proteins; PB1, ...
1107-1164 6.04e-04

high mobility group (HMG)-box found in protein polybromo-1 (PB1) and similar proteins; PB1, also called BRG1-associated factor 180 (BAF180), or polybromo-1D, is a subunit of the PBAF (polybromo/Brg1-associated factor) chromatin-remodeling complex required for kinetochore localization during mitosis and the transcription of estrogen-responsive genes. It is involved in transcriptional activation and repression of select genes by chromatin remodeling. It acts as a negative regulator of cell proliferation.


Pssm-ID: 438800 [Multi-domain]  Cd Length: 60  Bit Score: 39.92  E-value: 6.04e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKE 1164
Cdd:cd21984     1 KRNPSGYILFSSEVRKSIKAENPDYSFGEISRLVGTEWRNLPAEKKAEYEERAQKQAE 58
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
840-1050 7.95e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 7.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  840 CRETPVPPGVASGKPSLPPPLPAPVPITVPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVLVPSSYPSHPAPKK 919
Cdd:PRK07764  586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  920 EVIMGRPGT--VWTNVEPRSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAESAAVAHEQ------- 990
Cdd:PRK07764  666 GDGWPAKAGgaAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPvplppep 745
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056  991 --PPGGTGGADPGRPPGATCPESPGPGPPLTLGGVDPgkslPPTTEEEAPGPPGEPRLDSET 1050
Cdd:PRK07764  746 ddPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE----EEMAEDDAPSMDDEDRRDAEE 803
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1942-2323 1.14e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 1.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1942 ALGFTSLGPSGPAFVQPLLSAGQAPLLAP-----GQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQA 2016
Cdd:PHA03307    4 APDLYDLIEAAAEGGEFFPRPPATPGDAAddllsGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANE 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2017 PPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFPSATAGSMTYSLVAPKAQR-PSPKAPQKVKAAIASIPVGSFE 2095
Cdd:PHA03307   84 SRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPvGSPGPPPAASPPAAGASPAAVA 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2096 SGTTGRTGPTPRQSLDSGVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPLPAEERPGTKGPETASKFPSSSS 2175
Cdd:PHA03307  164 SDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSE 243
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2176 DWRVPGLGLESRGEPPTPPSPAPAPATGPSGSSSGSSEGSSGRAAGDTPERKEVTSSGKKMKVRPPPLKKTFDSVDKVLS 2255
Cdd:PHA03307  244 SSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 672035056 2256 EVDFEERFAELPEfRPEEVLPSPTLQSLATSPR---AILGSYRKKRKNSTDLDSAPedPTSPKRKMRRRSS 2323
Cdd:PHA03307  324 SSSSSTSSSSESS-RGAAVSPGPSPSRSPSPSRpppPADPSSPRKRPRPSRAPSSP--AASAGRPTRRRAR 391
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1951-2090 1.19e-03

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 43.60  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1951 SGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSA--PLGPPSQAPPSLVYTVATST 2028
Cdd:NF040712  188 IDPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASArrRRAGVEQPEDEPVGPGAAPA 267
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 2029 TPPAAAILPKGPP-----ASATATPAPTSPFPSATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIP 2090
Cdd:NF040712  268 AEPDEATRDAGEPpapgaAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVP 334
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1961-2094 1.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 1.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1961 SAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPlgPPSQAPPSL-VYTVATSTTPPAAAILPKG 2039
Cdd:PRK14971  370 SGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSAT--QPAGTPPTVsVDPPAAVPVNPPSTAPQAV 447
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 672035056 2040 PPASAtatpAPTSPFPSATAGSMTYSLVAPKaQRPSPKAPQKVKAAIASIPVGSF 2094
Cdd:PRK14971  448 RPAQF----KEEKKIPVSKVSSLGPSTLRPI-QEKAEQATGNIKEAPTGTQKEIF 497
HMG-box_AtSSRP1 cd22013
high mobility group (HMG)-box found in Arabidopsis thaliana FACT complex subunit SSRP1 and ...
1100-1174 1.39e-03

high mobility group (HMG)-box found in Arabidopsis thaliana FACT complex subunit SSRP1 and similar proteins; SSRP1, also called facilitates chromatin transcription complex subunit SSRP1, high mobility group B protein 8, nucleosome/chromatin assembly factor group D 08 (or D 8), protein NUCLEAR FUSION DEFECTIVE 8, or recombination signal sequence recognition protein 1, is a component of the FACT complex, a general chromatin factor that acts to reorganize nucleosomes. The FACT complex is involved in multiple processes that require DNA as a template such as mRNA elongation, DNA replication, and DNA repair. SSRP1 may bind specifically to double-stranded DNA. It is required for karyogamy during female gametophyte development, when the two polar nuclei fuse to form the diploid central cell nucleus. SSRP1 contains only one HMG-box domain.


Pssm-ID: 438829 [Multi-domain]  Cd Length: 80  Bit Score: 39.84  E-value: 1.39e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056 1100 KREKD--HIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKAHPDWK 1174
Cdd:cd22013     1 KKKKDpnAPKRALSGFMFFSLMERENLKKEKPGISFGEVGKVLGEKWKNMSADDKAPYEAKAQVDKERYKKEMSGYK 77
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1953-2084 1.54e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.70  E-value: 1.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1953 PAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPvitafyPGSPAPTSAPLGPPSQAPPSLVytVATSTTPPA 2032
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPP------PPASAPQQAPAVPLPETTSQLL--AARQQLQRA 432
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 672035056 2033 AAILPkgPPASATATPAPTSPFPSATAgsmtysLVAPKAQRPSPKAPQKVKA 2084
Cdd:PRK07994  433 QGATK--AKKSEPAAASRARPVNSALE------RLASVRPAPSALEKAPAKK 476
HMG-box_NSD2 cd21991
high mobility group (HMG)-box found in nuclear SET domain-containing protein 2 (NSD2) and ...
1106-1155 1.65e-03

high mobility group (HMG)-box found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2, also called multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or wolf-Hirschhorn syndrome candidate 1 protein (WHSC1), acts as a histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness.


Pssm-ID: 438807 [Multi-domain]  Cd Length: 62  Bit Score: 38.83  E-value: 1.65e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 672035056 1106 IRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKY 1155
Cdd:cd21991     2 KKVKKGQFEVFCQKHREEVAQEHPDLSEEEIEEYLKKQWNNMSEKQRARY 51
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1729-2055 1.79e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.99  E-value: 1.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1729 QYILPTLPQQLQVAPAPAPAPGTKAAAPSG---PAPTTSIRFTLPPGTSTNGKVLAATAPTAG-IPILQSVPSAPPPKAQ 1804
Cdd:pfam03154  164 QQILQTQPPVLQAQSGAASPPSPPPPGTTQaatAGPTPSAPSVPPQGSPATSQPPNQTQSTAApHTLIQQTPTLHPQRLP 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1805 SV-SPVQ-----ATPSGGSAQLLPGKVL-VPLAAPSMSVRGGGAGQPLPLVSSPFSVPVQNGAQQ-----------PSKI 1866
Cdd:pfam03154  244 SPhPPLQpmtqpPPPSQVSPQPLPQPSLhGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQvppgpspaapgQSQQ 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1867 IQLTPVPVSTPSGLVPPLSPATMPGPTSQPQkVLLPSSTRITYVQSAGGHTLPLGTSSAcsQTGTVTSYGPASSVALGFT 1946
Cdd:pfam03154  324 RIHTPPSQSQLQSQQPPREQPLPPAPLSMPH-IKPPPTTPIPQLPNPQSHKHPPHLSGP--SPFQMNSNLPPPPALKPLS 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1947 SLGPSGPAFVQP----LLSAGQAPLLAPGQVGV------SPVPSPQLPPACTAPGGPVIT-----AFYPGSPAPTSAPLG 2011
Cdd:pfam03154  401 SLSTHHPPSAHPpplqLMPQSQQLPPPPAQPPVltqsqsLPPPAASHPPTSGLHQVPSQSpfpqhPFVPGGPPPITPPSG 480
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 672035056  2012 P-------------PSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFP 2055
Cdd:pfam03154  481 PptstssampgiqpPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPP 537
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1997-2167 1.84e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 1.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1997 AFYP----GSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPT------SPFPSATAGSMTYSL 2066
Cdd:PRK12323  362 AFRPgqsgGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAaaparrSPAPEALAAARQASA 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2067 VAPKAQRPSPKAPQKVKAAIASIPVGSfesgttgrTGPTPRQSLDSGVAREPAAPESELEGQPTPPAPPPPTETWPPTAR 2146
Cdd:PRK12323  442 RGPGGAPAPAPAPAAAPAAAARPAAAG--------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQ 513
                         170       180
                  ....*....|....*....|....
gi 672035056 2147 SSPPPPLPAEE---RPGTKGPETA 2167
Cdd:PRK12323  514 PDAAPAGWVAEsipDPATADPDDA 537
HMG-box_HMG20B cd22018
high mobility group (HMG)-box found in high mobility group protein 20B (HMG20B) and similar ...
1107-1168 1.94e-03

high mobility group (HMG)-box found in high mobility group protein 20B (HMG20B) and similar proteins; HMG20B, also called SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1-related, SMARCE1-related protein (SMARCE1R), BRCA2-associated factor 35 (BRAF35), HMG box-containing protein 20B, HMG domain-containing protein 2 (HMGXB2), HMG domain-containing protein HMGX2, Sox-like transcriptional factor, or structural DNA-binding protein BRAF35, is a DNA binding factor that acts as a repressor of erythroid differentiation. It is required for correct progression through the G2 phase of the cell cycle and entry into mitosis. It is also required for RCOR1/CoREST mediated repression of neuronal specific gene promoters. HMG20B is a core subunit of the Lys-specific demethylase 1/REST co-repressor 1 (LSD1-CoREST) histone demethylase complex.


Pssm-ID: 438834 [Multi-domain]  Cd Length: 85  Bit Score: 39.57  E-value: 1.94e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFK 1168
Cdd:cd22018     3 KAPVTGYVRFLNERREQIRTQHPDLPFPEITKMLGAEWSKLQPHEKQRYLDEAERDKQQYMK 64
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1757-2052 2.04e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.75  E-value: 2.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1757 SGPAPTtsirftlPPGTSTNGKVLAATAPTAGIpilqsvpSAPPPKAQSVSPVQATPSGGSAQLLPGKVlVPLAApsMSV 1836
Cdd:pfam05109  488 VTPSPS-------PRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKT-SPTSA--VTT 550
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1837 RGGGAGQPLPLVSSPF---SVPVQNGAQQPSKIIQLTPVPVSTPSGLVPPLSPA---TMPGPTSQPqkVLLPSSTRITYV 1910
Cdd:pfam05109  551 PTPNATSPTPAVTTPTpnaTIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTtnhTLGGTSSTP--VVTSPPKNATSA 628
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1911 QSAGGHTLPLGTSSACSQTGTVTSYGPASSVALGFTSLgpsgpafvQPLLSAGQaPLLAPGQVGVSPVPSPQLPPACTAP 1990
Cdd:pfam05109  629 VTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSH--------MPLLTSAH-PTGGENITQVTPASTSTHHVSTSSP 699
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056  1991 ggpvitAFYPGSPAPTSAPLGPPSQAPPSLVYTvaTSTTPPAAAILPKGPPASATATPAPTS 2052
Cdd:pfam05109  700 ------APRPGTTSQASGPGNSSTSTKPGEVNV--TKGTPPKNATSPQAPSGQKTAVPTVTS 753
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1950-2056 2.25e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.23  E-value: 2.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1950 PSGPAFVQP----LLSAGQAPLLAPGQVGVSPvPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSLVY--- 2022
Cdd:PRK14971  378 HIKPVFTQPaaapQPSAAAAASPSPSQSSAAA-QPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKeek 456
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 672035056 2023 TVATSTTPPAAAIL--PKGPPASATATPAPTSPFPS 2056
Cdd:PRK14971  457 KIPVSKVSSLGPSTlrPIQEKAEQATGNIKEAPTGT 492
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1760-2111 2.37e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.02  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1760 APTTSIRFTLP---PGTSTNGKVLAATAPTAGIPILQSVPSAPPPKAQSVSPVQATPSGGSAQLLPGKVLVPLAAPsmsv 1836
Cdd:pfam17823  117 AAASSSPSSAAqslPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAAS---- 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1837 rgggagqplplvsspfSVPVQNGAQQPSKIIQLTPVPVSTPSGLVPPLSPATMPGPTSQPqkVLLPSSTRITYVQSAGGH 1916
Cdd:pfam17823  193 ----------------SAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSP--AAGTVTAAVGTVTPAALA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1917 TLPLGTSSACSQTGTVTSYGPASsvalgfTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQlpPACTAPGGPVit 1996
Cdd:pfam17823  255 TLAAAAGTVASAAGTINMGDPHA------RRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQ--PVHNTAGEPT-- 324
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1997 afyPGSPAPTSAPLGPPSQAPPSLvyTVATSTTPPAaailpKGPPASATATPaPTSPFPSATAGSMTyslvapkaQRPSP 2076
Cdd:pfam17823  325 ---PSPSNTTLEPNTPKSVASTNL--AVVTTTKAQA-----KEPSASPVPVL-HTSMIPEVEATSPT--------TQPSP 385
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 672035056  2077 KAPQKVKAA----IASIPVGSFESGTTGRTGPTPRQSLD 2111
Cdd:pfam17823  386 LLPTQGAAGpgilLAPEQVATEATAGTASAGPTPRSSGD 424
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1996-2089 2.46e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.34  E-value: 2.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1996 TAFYPGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFPSATAGSMTYSLVAPKAQRPS 2075
Cdd:PRK12270   34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAV 113
                          90
                  ....*....|....
gi 672035056 2076 PKAPQKVKAAIASI 2089
Cdd:PRK12270  114 EDEVTPLRGAAAAV 127
HMG-box_CMB1-like cd22014
high mobility group (HMG)-box found in Schizosaccharomyces pombe mismatch-binding protein cmb1 ...
1108-1169 2.65e-03

high mobility group (HMG)-box found in Schizosaccharomyces pombe mismatch-binding protein cmb1 and similar proteins; Cmb1 binds to cytosines in base mismatches and opposite chemically altered guanines. It contains only one HMG-box domain.


Pssm-ID: 438830 [Multi-domain]  Cd Length: 62  Bit Score: 38.12  E-value: 2.65e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 672035056 1108 RPMNAFMIFSKRHRALVHQRHPNQDNrtvSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKA 1169
Cdd:cd22014     3 RPPSPFLLFMEEFRRNEDNGKNLVEL---SRIAAEAWKNMSEDEKQPYIDRAKELLEEYKKQ 61
HMG-box_HMGXB4 cd21982
high mobility group (HMG)-box found in HMG domain-containing protein 4 (HMGXB4) and similar ...
1107-1153 2.72e-03

high mobility group (HMG)-box found in HMG domain-containing protein 4 (HMGXB4) and similar proteins; HMGXB4, also called HMG box-containing protein 4, high mobility group protein 2-like 1 (HMG2L1), or protein HMGBCG, is a non-histone chromosomal protein that negatively regulates Wnt/beta-catenin signaling during development. It plays a role in the hematopoietic system.


Pssm-ID: 438798 [Multi-domain]  Cd Length: 61  Bit Score: 38.41  E-value: 2.72e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQ 1153
Cdd:cd21982     2 KKNMSAYQVFCKEYRVSIVAEHPGIDFGELSKKLAEVWKQLPEKDKL 48
HMG-box_UBF1_rpt1-like cd21998
first high mobility group (HMG)-box found in upstream-binding factor 1 (UBF1) and similar ...
1104-1172 3.02e-03

first high mobility group (HMG)-box found in upstream-binding factor 1 (UBF1) and similar proteins; UBF1, also called UBTF, nucleolar transcription factor 1, or auto-antigen NOR-90, is a nucleolar transcription factor that recognizes the ribosomal RNA gene promoter and activates transcription mediated by RNA polymerase I through cooperative interactions with the transcription factor SL1/TIF-IB complex. It binds specifically to the upstream control element. UBF1 contains six HMG-box domains. This model corresponds to the first one. This model also includes the first HMG-box domain of upstream-binding factor 1-like protein 1 (UBTFL1), which contains two HMG-box domains.


Pssm-ID: 438814 [Multi-domain]  Cd Length: 77  Bit Score: 38.46  E-value: 3.02e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 672035056 1104 DHIRRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHD------LAFQVKEAHFKA-HPD 1172
Cdd:cd21998     2 DFPKKPLTPYFRFFMEKRAKYAKKHPEMSNLELTKILSKKYKELPEKKKQKYIQdyekekEEYEQKMARFREeHPD 77
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
752-1044 4.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   752 TPTPSTPAGFRAVSPAVPFSRSRQPSPLLLLPPPAGLTSDPGPSVRRVPAvqrdspvivrNPDVPLPSKFPGEVGAAGEA 831
Cdd:pfam03154  262 SPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP----------GPSPAAPGQSQQRIHTPPSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   832 RAGGPGRGCRETPVPPGVASgkpslpPPLPAPVPITVPPAAPTAVAQPMPTLGLASSPFQPVAFHPSPAALLPVlvpSSY 911
Cdd:pfam03154  332 SQLQSQQPPREQPLPPAPLS------MPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPL---SSL 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056   912 PSHPAPKkevimgrpgtvwtnveprsvAVFPWHSLVPFLAPSQPDPsVQPSEAQQPASHPVASNQSKEPAESAAVAhEQP 991
Cdd:pfam03154  403 STHHPPS--------------------AHPPPLQLMPQSQQLPPPP-AQPPVLTQSQSLPPPAASHPPTSGLHQVP-SQS 460
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 672035056   992 PGGTGGADPGRPPGATCPESPGPGPPLTLGGVDPGKSLPPTTEEEAPGPPGEP 1044
Cdd:pfam03154  461 PFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCP 513
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1949-2123 6.04e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 6.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1949 GPSGPAFVQPLLSAGQAPllAPGQVGVSPVPSPQLPPACTAPGGPVITAF-YPGSPAPTSAPLGPPSQAPPSLVYTVATS 2027
Cdd:PRK07003  364 GGGAPGGGVPARVAGAVP--APGARAAAAVGASAVPAVTAVTGAAGAALApKAAAAAAATRAEAPPAAPAPPATADRGDD 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2028 TTP-PAAAILPKGPPASATATPAPTSPFP-SATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPVGSFESgttgrtgPT 2105
Cdd:PRK07003  442 AADgDAPVPAKANARASADSRCDERDAQPpADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARA-------PA 514
                         170
                  ....*....|....*...
gi 672035056 2106 PRQSLDSGVAREPAAPES 2123
Cdd:PRK07003  515 AASREDAPAAAAPPAPEA 532
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1552-1694 6.80e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 6.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1552 SAPAPSLAYGAPAAPLCRPAATMVTNVVRPVSSTPVPIASKPFPTSGRAEASSNDTVGARTEMGTGSRVPGGSPLGVSLV 1631
Cdd:PRK07003  383 PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSR 462
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056 1632 YSDKKSAAATSPAPHLVAGPllgtvgKAPATVTNllvgtpgygAPASPAVQFIAQGAPGSATP 1694
Cdd:PRK07003  463 CDERDAQPPADSGSASAPAS------DAPPDAAF---------EPAPRAAAPSAATPAAVPDA 510
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1930-2058 6.98e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 41.62  E-value: 6.98e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1930 GTVTSYGPASSVALGFTSLGPSGPAfVQPLLSAGQAPLLAPGQVGVSPVPSP-QLPPACTAPGGPVI-TAFYPGSPAPTS 2007
Cdd:PRK14951  368 AAAEAAAPAEKKTPARPEAAAPAAA-PVAQAAAAPAPAAAPAAAASAPAAPPaAAPPAPVAAPAAAApAAAPAAAPAAVA 446
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 672035056 2008 APLGPPSQAPPSLVY---TVATSTTPPAAAILPKGPPASATATPAPTSPFPSAT 2058
Cdd:PRK14951  447 LAPAPPAQAAPETVAipvRVAPEPAVASAAPAPAAAPAAARLTPTEEGDVWHAT 500
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1870-2240 7.57e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 7.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1870 TPVPVSTPSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYvqsagghtlplgtssacsQTGTVTSYGPASSVAlGFTSLG 1949
Cdd:pfam05109  429 TTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTV------------------STADVTSPTPAGTTS-GASPVT 489
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1950 PSgPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLP-PACTAPGgPVITAFYPGSPAPTSAPLGPpsqAPPSLVYTVATST 2028
Cdd:pfam05109  490 PS-PSPRDNGTESKAPDMTSPTSAVTTPTPNATSPtPAVTTPT-PNATSPTLGKTSPTSAVTTP---TPNATSPTPAVTT 564
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  2029 TPPAAAI--LPKGPPASATATPAPTSPFPS----ATAGSMTYSLVAPKAQRPSPKAPQKVKAAIASIPVGSFESGTTGRT 2102
Cdd:pfam05109  565 PTPNATIptLGKTSPTSAVTTPTPNATSPTvgetSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSM 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  2103 GPTPRQ--------SLDSGVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPplpaeeRPGTKGPETAskfPSSS 2174
Cdd:pfam05109  645 SLRPSSisetlspsTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAP------RPGTTSQASG---PGNS 715
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 672035056  2175 SDWRVPGLGLESRGeppTPPSPAPAPATGPSGSSSGSSEGSSGRAAGDTPERKEVTSSGKKMKVRP 2240
Cdd:pfam05109  716 STSTKPGEVNVTKG---TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEP 778
PHA03247 PHA03247
large tegument protein UL36; Provisional
870-1041 8.30e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 8.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  870 PAAPTAVaqPMPTLGLASSPFQPvafhPSPAALLPVLVPSSYPSHPAPKKevimgrpgtVWTNVE-----PRSVAVFPWH 944
Cdd:PHA03247 2489 PFAAGAA--PDPGGGGPPDPDAP----PAPSRLAPAILPDEPVGEPVHPR---------MLTWIRgleelASDDAGDPPP 2553
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  945 SLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAESAAVAHEQPPGGTGGADPGR------PPGATCPESPGPGP-- 1016
Cdd:PHA03247 2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPappsplPPDTHAPDPPPPSPsp 2633
                         170       180
                  ....*....|....*....|....*.
gi 672035056 1017 -PLTLGGVDPGKSLPPTTEEEAPGPP 1041
Cdd:PHA03247 2634 aANEPDPHPPPTVPPPERPRDDPAPG 2659
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1770-2152 8.59e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 8.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1770 PPGTSTNGKVLAATAPTAGIPilqSVPSAPPPkAQSVSPVQATPSGGSAQLL-PGKVLVPLAAPSMSvrggGAGQPLPLV 1848
Cdd:pfam03154  184 PSPPPPGTTQAATAGPTPSAP---SVPPQGSP-ATSQPPNQTQSTAAPHTLIqQTPTLHPQRLPSPH----PPLQPMTQP 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1849 SSPFSVPVQnGAQQPSKIIQLTPVPVST---PSGLVPPLSPATMPGPTSQPQKVLLPSSTRITYVQSAGGHTLPLGTSSA 1925
Cdd:pfam03154  256 PPPSQVSPQ-PLPQPSLHGQMPPMPHSLqtgPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQL 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1926 CSQTGTVTSYGPASSVALGFTSLGPSGPAfvqPLLSAGQAPLLAPGQVGVSP--VPSPQLPPACTAPGGPVITAFYPGSP 2003
Cdd:pfam03154  335 QSQQPPREQPLPPAPLSMPHIKPPPTTPI---PQLPNPQSHKHPPHLSGPSPfqMNSNLPPPPALKPLSSLSTHHPPSAH 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  2004 AP------TSAPLGPPSQAPPSLVytvATSTTPPAAAilpKGPPASATATPAPTSPFPSATAGSMTYSLVAPkaqrPSPK 2077
Cdd:pfam03154  412 PPplqlmpQSQQLPPPPAQPPVLT---QSQSLPPPAA---SHPPTSGLHQVPSQSPFPQHPFVPGGPPPITP----PSGP 481
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 672035056  2078 APQKVKAAIASIPVGSFESGTTGRTGPTPRQSLDSGVAREPAAPESElegQPTPPAPPPptetwpptaRSSPPPP 2152
Cdd:pfam03154  482 PTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAE---EPESPPPPP---------RSPSPEP 544
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
1961-2068 8.61e-03

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 40.68  E-value: 8.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  1961 SAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPSlvyTVATSTTPPAAAILPKGP 2040
Cdd:pfam07174   25 GASAVAVALPAVAHADPEPAPPPPSTATAPPAPPPPPPAPAAPAPPPPPAAPNAPNAPP---PPADPNAPPPPPADPNAP 101
                           90       100
                   ....*....|....*....|....*...
gi 672035056  2041 PASATATPAPTSPFPSATAGSMTYSLVA 2068
Cdd:pfam07174  102 PPPAVDPNAPEPGRIDNAVGGFSYVVPA 129
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1840-2175 8.73e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 8.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1840 GAGQPLPLVSSPFSVPVQNGAQQPSKIIQLTPVPVSTPSGLVPPLSPATMPGPTSQPQKVLLPSSTRityVQSAGGHTLP 1919
Cdd:PHA03307   38 GSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREG---SPTPPGPSSP 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1920 LGTSSacsqtgTVTSYGPASSVALGFTSLGPSGPAFVQPLLSAGQAPLLAPGQVGVSPVPSPQLPPACTAPGGPVITAFY 1999
Cdd:PHA03307  115 DPPPP------TPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSS 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2000 PGSPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAiLPKGPPASATATPAPTSPFPSATAGSMTYSLvAPKAQRPSPKAP 2079
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA-PGRSAADDAGASSSDSSSSESSGCGWGPENE-CPLPRPAPITLP 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 2080 QKVKAAIASIPVG------SFESGTTGRTGPTPRQSLDSGVAREPAAPESELEGQPTPPAPPPPTETWPPTARSSPPPPL 2153
Cdd:PHA03307  267 TRIWEASGWNGPSsrpgpaSSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS 346
                         330       340
                  ....*....|....*....|..
gi 672035056 2154 PaeERPGTKGPETASKFPSSSS 2175
Cdd:PHA03307  347 P--SRSPSPSRPPPPADPSSPR 366
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
2002-2073 8.80e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 41.03  E-value: 8.80e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056  2002 SPAPTSAPLGPPSQAPPSLVYTVATSTTPPAAAILPKGPPASATATPAPTSPFPSATA-GSMTYSLVAPKAQR 2073
Cdd:TIGR00601   84 VAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTSVPSsGSDAASTLVVGSER 156
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
1974-2071 8.91e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.41  E-value: 8.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056 1974 GVSPVPSPQLPPACTAPGGPVITAFYPGSPAPTSAPLGPPSQAPPslvytVATSTTPPAAAILPKGPPASATATPAPTSP 2053
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPA-----APPKPAAAAAAAAAPAAPPAAAAAAAPAAA 111
                          90       100
                  ....*....|....*....|....*....
gi 672035056 2054 FPS-----------ATAGSMTYSLVAPKA 2071
Cdd:PRK12270  112 AVEdevtplrgaaaAVAKNMDASLEVPTA 140
HMG-box_TOX-like cd21995
high mobility group (HMG)-box found in the TOX high mobility group box family; The TOX family ...
1107-1169 8.91e-03

high mobility group (HMG)-box found in the TOX high mobility group box family; The TOX family includes four members: TOX, TOX2, TOX3 and TOX4. TOX, also called thymus high mobility group box protein TOX, is a transcriptional regulator with a major role in neural stem cell commitment and corticogenesis as well as in lymphoid cell development and lymphoid tissue organogenesis. It binds to GC-rich DNA sequences in the proximity of transcription start sites and may alter chromatin structure, modifying access of transcription factors to DNA. TOX2, also called granulosa cell HMG box protein 1 (GCX-1), is a putative transcriptional activator involved in the hypothalamic-pituitary-gonadal system. TOX3, also called CAG trinucleotide repeat-containing gene F9 protein (CAGF9), or trinucleotide repeat-containing gene 9 protein (TNRC9), is a transcriptional coactivator of the p300/CBP-mediated transcription complex. It activates transactivation through cAMP response element (CRE) sites. It protects against cell death by inducing anti-apoptotic and repressing pro-apoptotic transcripts. TOX4, also called epidermal Langerhans cell protein LCP1, is a component of the PTW/PP1 phosphatase complex, which plays a role in the control of chromatin structure and cell cycle progression during the transition from mitosis into interphase. All family members contain one HMG-box domain.


Pssm-ID: 438811 [Multi-domain]  Cd Length: 70  Bit Score: 37.07  E-value: 8.91e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 672035056 1107 RRPMNAFMIFSKRHRALVHQRHPNQDNRTVSKILGEWWYALGPKEKQKYHDLAFQVKEAHFKA 1169
Cdd:cd21995     1 QKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLDEEQKQVYKKKTEAAKKEYLKA 63
PHA03247 PHA03247
large tegument protein UL36; Provisional
703-993 9.11e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 9.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  703 TVPISPGRRKTELLPHPGTLGASGAGGGGAAPDFPKSDSLDSGVDSVSHTPTPSTPAGFRAVSPAVPFSRSRQPSPLLLL 782
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  783 PPPAGLTSDPGPSVRRVPAVQRDSPVIVRNPDVPLPSKFPGEVGAAGEARAGGPGRGCRETPVPPGVASGKPSLPPPLPA 862
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPP 2882
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 672035056  863 PVPITVPPAAPTAVAQPMPTLGL-------ASSPFQPVAFHPSPAALLPVLVPSSYPSHPAPKKEVIMGRPGTVWTNVEP 935
Cdd:PHA03247 2883 VRRLARPAVSRSTESFALPPDQPerppqpqAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 672035056  936 RSVAVFPWHSLVPFLAPSQPDPSVQPSEAQQPASHPVASNQSKEPAESAAVAHEQPPG 993
Cdd:PHA03247 2963 WLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPP 3020
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH