NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1877233802|ref|NP_001372520|]
View 

cAMP-regulated phosphoprotein 21 isoform 7 [Homo sapiens]

Protein Classification

R3H_encore_like and SUZ domain-containing protein( domain architecture ID 12927641)

protein containing domains R3H_encore_like, SUZ, and PAT1

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
163-224 8.46e-26

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


:

Pssm-ID: 100071  Cd Length: 63  Bit Score: 100.75  E-value: 8.46e-26
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1877233802 163 DRMILLKMEQEIIDFIADNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
245-300 2.54e-13

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


:

Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 65.04  E-value: 2.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1877233802 245 ESQKRFILKRDNSSIDKEDNQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752   1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
606-793 6.35e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.49  E-value: 6.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 606 PPSGPVYPSSLmPQPAQQPSYVIASTG-----QQLPTGGFSGSGPPISQQVLQPPPSPqgfvQQPPPAQMPVYYYPSGQY 680
Cdd:pfam09770 166 APKKAAAPAPA-PQPAAQPASLPAPSRkmmslEEVEAAMRAQAKKPAQQPAPAPAQPP----AAPPAQQAQQQQQFPPQI 240
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 681 PTSTTQQYRPMAPVQYNAQrsqqmpqaaqqaGYQPVLSGQQGFQGLIGVQQPPQSQNVINNQQGTPVQS---------VM 751
Cdd:pfam09770 241 QQQQQPQQQPQQPQQHPGQ------------GHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVqptqilqnpNR 308
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 1877233802 752 VSYPTMSSYQVPMTQGSQGLPQQSYQQPIMLPNQAGQGSLPA 793
Cdd:pfam09770 309 LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ 350
PHA03247 super family cl33720
large tegument protein UL36; Provisional
426-693 1.50e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  426 PPLQSTPLVSGVAAGSPGCVPYPENGIGGQVAPSSTSYILLPLEAATGIPPGSillnphtgqpfVNPDGTPAIYNPPTSQ 505
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG-----------PARPARPPTTAGPPAP 2769
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  506 QPLRSAMVGQSQQQPPQQQPSPQPQQQVQPPQPQMAGPlvTQSVQGLQASSQSVQYPAVSFPPqhllPVSPTQhfpmrdd 585
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP--PAAVLAPAAALPPAASPAGPLPP----PTSAQP------- 2836
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  586 vatqfgqmtlsrqssgETPEPPSGPVyPSSLMPQPAQQPSYVIASTG--QQLPTGGFSGSGPPISQqVLQPPPSPQGFVQ 663
Cdd:PHA03247  2837 ----------------TAPPPPPGPP-PPSLPLGGSVAPGGDVRRRPpsRSPAAKPAAPARPPVRR-LARPAVSRSTESF 2898
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1877233802  664 -QPPPAQMPVYYYPSGQYPTSTTQQYRPMAP 693
Cdd:PHA03247  2899 aLPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
163-224 8.46e-26

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 100.75  E-value: 8.46e-26
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1877233802 163 DRMILLKMEQEIIDFIADNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
147-224 4.29e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 68.10  E-value: 4.29e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIAdNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393   1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
245-300 2.54e-13

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 65.04  E-value: 2.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1877233802 245 ESQKRFILKRDNSSIDKEDNQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752   1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
165-223 3.76e-12

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 61.74  E-value: 3.76e-12
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1877233802 165 MILLKMEQEIIDFIADNNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424   1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
606-793 6.35e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.49  E-value: 6.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 606 PPSGPVYPSSLmPQPAQQPSYVIASTG-----QQLPTGGFSGSGPPISQQVLQPPPSPqgfvQQPPPAQMPVYYYPSGQY 680
Cdd:pfam09770 166 APKKAAAPAPA-PQPAAQPASLPAPSRkmmslEEVEAAMRAQAKKPAQQPAPAPAQPP----AAPPAQQAQQQQQFPPQI 240
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 681 PTSTTQQYRPMAPVQYNAQrsqqmpqaaqqaGYQPVLSGQQGFQGLIGVQQPPQSQNVINNQQGTPVQS---------VM 751
Cdd:pfam09770 241 QQQQQPQQQPQQPQQHPGQ------------GHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVqptqilqnpNR 308
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 1877233802 752 VSYPTMSSYQVPMTQGSQGLPQQSYQQPIMLPNQAGQGSLPA 793
Cdd:pfam09770 309 LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ 350
PHA03247 PHA03247
large tegument protein UL36; Provisional
426-693 1.50e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  426 PPLQSTPLVSGVAAGSPGCVPYPENGIGGQVAPSSTSYILLPLEAATGIPPGSillnphtgqpfVNPDGTPAIYNPPTSQ 505
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG-----------PARPARPPTTAGPPAP 2769
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  506 QPLRSAMVGQSQQQPPQQQPSPQPQQQVQPPQPQMAGPlvTQSVQGLQASSQSVQYPAVSFPPqhllPVSPTQhfpmrdd 585
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP--PAAVLAPAAALPPAASPAGPLPP----PTSAQP------- 2836
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  586 vatqfgqmtlsrqssgETPEPPSGPVyPSSLMPQPAQQPSYVIASTG--QQLPTGGFSGSGPPISQqVLQPPPSPQGFVQ 663
Cdd:PHA03247  2837 ----------------TAPPPPPGPP-PPSLPLGGSVAPGGDVRRRPpsRSPAAKPAAPARPPVRR-LARPAVSRSTESF 2898
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1877233802  664 -QPPPAQMPVYYYPSGQYPTSTTQQYRPMAP 693
Cdd:PHA03247  2899 aLPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
PRK10263 PRK10263
DNA translocase FtsK; Provisional
607-799 3.51e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 3.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  607 PSGPVYPSSLMPQPAQQPSYVIASTgQQLPtggfsgsGPPISQQVLQPppSPQGFVQQPPPAQmpvyyyPSGQYPTSTTQ 686
Cdd:PRK10263   336 PVEPVTQTPPVASVDVPPAQPTVAW-QPVP-------GPQTGEPVIAP--APEGYPQQSQYAQ------PAVQYNEPLQQ 399
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  687 QYRPMAPVQYNAQRSQQMPQAAQQAGYQPVLSGQQGFQgligVQQPPQSQNVINNQQGTPVQSvMVSYPTMSSYQVPMTQ 766
Cdd:PRK10263   400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA----PEQPVAGNAWQAEEQQSTFAP-QSTYQTEQTYQQPAAQ 474
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1877233802  767 GSQGLPQQSYQQPIMLPNQAGQGSLPATGMPVY 799
Cdd:PRK10263   475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPLY 507
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
576-819 3.96e-03

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 40.39  E-value: 3.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 576 PTQHFPMRDDVATQFGQMTLSRQSSGETPEPPSGPVYPSSLMPQ--PAQQPsyVIASTGQ------QLPTGGFSGSGPPI 647
Cdd:cd22553   113 ANQQTLIRPNTVQGQANASNVLQNIAQIASGGNAVQLPLNNMTQtiPVQVP--VSTANGQtvyqtiQVPIQAIQSGNAGG 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 648 SQQVLQPPPSPQgfVQQPPPAQMPVYYYPSGQ-----YPTSTTQQYRPMAPVQYNaQRSQQMPQAAQQAGYQPVLSGQQG 722
Cdd:cd22553   191 GNQALQAQVIPQ--LAQAAQLQPQQLAQVSSQgyiqqIPANASQQQPQMVQQGPN-QSGQIIGQVASASSIQAAAIPLTV 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 723 FQGLIGvqqppqSQNVINNQQGTPVQSV--------MVSYPTMSSYQVPMTQGSQGLPQQSYQQPIMLPNQAGQGSLPAT 794
Cdd:cd22553   268 YTGALA------GQNGSNQQQVGQIVTSpiqgmtqgLTAPASSSIPTVVQQQAIQGNPLPPGTQIIAAGQQLQQDPNDPT 341
                         250       260
                  ....*....|....*....|....*
gi 1877233802 795 GMPVYCNVTPPTPQnNLRLIGPHCP 819
Cdd:cd22553   342 KWQVVADGTPGSKK-RLRRVACTCP 365
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
163-224 8.46e-26

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 100.75  E-value: 8.46e-26
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1877233802 163 DRMILLKMEQEIIDFIADNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG-KSVIINKT 224
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
147-224 4.29e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 68.10  E-value: 4.29e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  147 IDLHEFLINTLKNNSRDRMILLKMEQEIIDFIAdNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINKT 224
Cdd:smart00393   1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGpkRRVVISKK 79
SUZ pfam12752
SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched ...
245-300 2.54e-13

SUZ domain; The SUZ domain is a conserved RNA-binding domain found in eukaryotes and enriched in positively charged amino acids. It was first characterized in the C.elegans protein Szy-20 where it has been shown to bind RNA and allow their localization to the centrosome. Warning- the domain has a compositionally biased character.


Pssm-ID: 463689 [Multi-domain]  Cd Length: 56  Bit Score: 65.04  E-value: 2.54e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 1877233802 245 ESQKRFILKRDNSSIDKEDNQQNRMHPFRDDRRSKSIEEREEEYQRVRERIFAHDS 300
Cdd:pfam12752   1 PPPKMKILRRPSSGSSSSSSAGSSGASSSSGSDSKTLEEREAEYAEARARIFGSSE 56
R3H cd02325
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ...
167-223 1.89e-12

R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100064  Cd Length: 59  Bit Score: 62.63  E-value: 1.89e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1877233802 167 LLKMEQEIIDFIADNNNHYKKFPQMSSYQRMLVHRVAAYFGLDHNVDQTG--KSVIINK 223
Cdd:cd02325     1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
165-223 3.76e-12

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 61.74  E-value: 3.76e-12
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1877233802 165 MILLKMEQEIIDFIADNNNHYKkFPQMSSYQRMLVHRVAAYFGLDHNV--DQTGKSVIINK 223
Cdd:pfam01424   1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
606-793 6.35e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 43.49  E-value: 6.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 606 PPSGPVYPSSLmPQPAQQPSYVIASTG-----QQLPTGGFSGSGPPISQQVLQPPPSPqgfvQQPPPAQMPVYYYPSGQY 680
Cdd:pfam09770 166 APKKAAAPAPA-PQPAAQPASLPAPSRkmmslEEVEAAMRAQAKKPAQQPAPAPAQPP----AAPPAQQAQQQQQFPPQI 240
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 681 PTSTTQQYRPMAPVQYNAQrsqqmpqaaqqaGYQPVLSGQQGFQGLIGVQQPPQSQNVINNQQGTPVQS---------VM 751
Cdd:pfam09770 241 QQQQQPQQQPQQPQQHPGQ------------GHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVqptqilqnpNR 308
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 1877233802 752 VSYPTMSSYQVPMTQGSQGLPQQSYQQPIMLPNQAGQGSLPA 793
Cdd:pfam09770 309 LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ 350
PHA03247 PHA03247
large tegument protein UL36; Provisional
426-693 1.50e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  426 PPLQSTPLVSGVAAGSPGCVPYPENGIGGQVAPSSTSYILLPLEAATGIPPGSillnphtgqpfVNPDGTPAIYNPPTSQ 505
Cdd:PHA03247  2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG-----------PARPARPPTTAGPPAP 2769
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  506 QPLRSAMVGQSQQQPPQQQPSPQPQQQVQPPQPQMAGPlvTQSVQGLQASSQSVQYPAVSFPPqhllPVSPTQhfpmrdd 585
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP--PAAVLAPAAALPPAASPAGPLPP----PTSAQP------- 2836
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  586 vatqfgqmtlsrqssgETPEPPSGPVyPSSLMPQPAQQPSYVIASTG--QQLPTGGFSGSGPPISQqVLQPPPSPQGFVQ 663
Cdd:PHA03247  2837 ----------------TAPPPPPGPP-PPSLPLGGSVAPGGDVRRRPpsRSPAAKPAAPARPPVRR-LARPAVSRSTESF 2898
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1877233802  664 -QPPPAQMPVYYYPSGQYPTSTTQQYRPMAP 693
Cdd:PHA03247  2899 aLPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
PHA03247 PHA03247
large tegument protein UL36; Provisional
423-842 1.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  423 RTHPPLQSTPLVSGVAAGS----PGCVPYPENGIG---------GQVAPSStsyilLPLEAATGIPPgsillnPHTGQPF 489
Cdd:PHA03247  2566 RSVPPPRPAPRPSEPAVTSrarrPDAPPQSARPRApvddrgdprGPAPPSP-----LPPDTHAPDPP------PPSPSPA 2634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  490 VNPDGTPAIYNPPTSQQPLRSAMVGQSQQQPPQQQPSPQPQQQVQPPQ-PQMAGPLVTQSVQGL------QASSQSVQYP 562
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRpRRRAARPTVGSLTSLadppppPPTPEPAPHA 2714
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  563 AVSFPPQHLLPVSPTQHFPMR----------DDVATQFGQMTLSRQSSGETPEPPSGPVYPSS----------------- 615
Cdd:PHA03247  2715 LVSATPLPPGPAAARQASPALpaapappavpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgpprrltrpavaslses 2794
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  616 -----LMPQPAQQPSYVIASTGQQLPTGGFSGSGPPISQQVLQPPPSPQGFVqqPPPAQMPVYYYPSGQY---PTSTTQQ 687
Cdd:PHA03247  2795 reslpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP--PPSLPLGGSVAPGGDVrrrPPSRSPA 2872
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  688 YRPMAPVQYNAQRSQqmpqaaqqagyQPVLSGQQGFQGLIGVQQPPQSQNVINNQQGTPVQSVMVSYPTMSsyqvPMTQG 767
Cdd:PHA03247  2873 AKPAAPARPPVRRLA-----------RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP----PPPPP 2937
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1877233802  768 SQGLPQQSYQQPIMLPNQAGQGSLPATGMPVYCNVTPPTpqnnlRLIGPHCPSSTVPVMSASCRTNCASMSNAGW 842
Cdd:PHA03247  2938 RPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPR-----FRVPQPAPSREAPASSTPPLTGHSLSRVSSW 3007
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
505-843 1.90e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 505 QQPLRSAMVGQSQQQPPQQQPSPQPQQQVQPPQPQMAGPLVTQSVQGLQASSQSVQYPAVSFPPQHLLPVSPTQHFPMRD 584
Cdd:pfam03154 164 QQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLP 243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 585 DVATQFGQMTLSRQSSGETPEPPSGPVY--PSSLMPQPAQ------------QPSYVIASTGQ-QLPTGGFSGSGPPISQ 649
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQVSPQPLPQPSLhgQMPPMPHSLQtgpshmqhpvppQPFPLTPQSSQsQVPPGPSPAAPGQSQQ 323
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 650 QVLQPPPSPQGFVQQP------PPAQMPVyyyPSGQYPTSTtqqyrPMAPVQyNAQRSQqmpqaaqqagYQPVLSGQQGF 723
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPpreqplPPAPLSM---PHIKPPPTT-----PIPQLP-NPQSHK----------HPPHLSGPSPF 384
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 724 QGLIGVQQPPQSQnvinnqqgtPVQSVMVSYPTmSSYQVP---MTQgSQGLPQQSYQQPIMLPNQ---AGQGSLPATGMP 797
Cdd:pfam03154 385 QMNSNLPPPPALK---------PLSSLSTHHPP-SAHPPPlqlMPQ-SQQLPPPPAQPPVLTQSQslpPPAASHPPTSGL 453
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*.
gi 1877233802 798 VYCNVTPPTPQNnlrligPHCPSSTVPVMSASCRTNCASMSNAGWQ 843
Cdd:pfam03154 454 HQVPSQSPFPQH------PFVPGGPPPITPPSGPPTSTSSAMPGIQ 493
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
483-807 2.46e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 2.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 483 PHTGQPFVNPDGTPAIYNPPTSQQPLRSAMvgQSQQQPPQQQPSPQPQQQVQPPQPQMAGPLVTQSVQGLQASSQSVQYP 562
Cdd:pfam03154 199 PTPSAPSVPPQGSPATSQPPNQTQSTAAPH--TLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMP 276
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 563 AVSFP----PQHLLPVSPTQHFPMRDDVATQFGQMTLSRQSSGETPEPPSGPvyPSSLMPQPAQQPSYviastgQQLPTG 638
Cdd:pfam03154 277 PMPHSlqtgPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTP--PSQSQLQSQQPPRE------QPLPPA 348
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 639 gfsgsgpPISQQVLQPPPS--------PQG-----FVQQPPPAQMPVYY-YPSGQYPTSTTQQYRPMAPVQYNAQRSQQM 704
Cdd:pfam03154 349 -------PLSMPHIKPPPTtpipqlpnPQShkhppHLSGPSPFQMNSNLpPPPALKPLSSLSTHHPPSAHPPPLQLMPQS 421
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 705 PQAAQQAGYQPVLSGQQGFQGLIGVQQPPQSQNVINNQQGTPVQSVMVSYPT-MSSYQVPMTQGSQGLPqqSYQQPIMLP 783
Cdd:pfam03154 422 QQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPpITPPSGPPTSTSSAMP--GIQPPSSAS 499
                         330       340
                  ....*....|....*....|....
gi 1877233802 784 nqagqgslPATGMPVYCNVTPPTP 807
Cdd:pfam03154 500 --------VSSSGPVPAAVSCPLP 515
R3H_unknown_2 cd06006
R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain ...
169-209 2.84e-03

R3H domain of a group of fungal proteins with unknown function. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100076  Cd Length: 59  Bit Score: 36.58  E-value: 2.84e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 1877233802 169 KMEQEIIDFIADNNNHYKKFPQMSSYQRMLVHRVAAYFGLD 209
Cdd:cd06006     3 QIESTLRKFINDKSKRSLRFPPMRSPQRAFIHELAKDYGLY 43
PRK10263 PRK10263
DNA translocase FtsK; Provisional
607-799 3.51e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 3.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  607 PSGPVYPSSLMPQPAQQPSYVIASTgQQLPtggfsgsGPPISQQVLQPppSPQGFVQQPPPAQmpvyyyPSGQYPTSTTQ 686
Cdd:PRK10263   336 PVEPVTQTPPVASVDVPPAQPTVAW-QPVP-------GPQTGEPVIAP--APEGYPQQSQYAQ------PAVQYNEPLQQ 399
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802  687 QYRPMAPVQYNAQRSQQMPQAAQQAGYQPVLSGQQGFQgligVQQPPQSQNVINNQQGTPVQSvMVSYPTMSSYQVPMTQ 766
Cdd:PRK10263   400 PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA----PEQPVAGNAWQAEEQQSTFAP-QSTYQTEQTYQQPAAQ 474
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1877233802  767 GSQGLPQQSYQQPIMLPNQAGQGSLPATGMPVY 799
Cdd:PRK10263   475 EPLYQQPQPVEQQPVVEPEPVVEETKPARPPLY 507
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
576-819 3.96e-03

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 40.39  E-value: 3.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 576 PTQHFPMRDDVATQFGQMTLSRQSSGETPEPPSGPVYPSSLMPQ--PAQQPsyVIASTGQ------QLPTGGFSGSGPPI 647
Cdd:cd22553   113 ANQQTLIRPNTVQGQANASNVLQNIAQIASGGNAVQLPLNNMTQtiPVQVP--VSTANGQtvyqtiQVPIQAIQSGNAGG 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 648 SQQVLQPPPSPQgfVQQPPPAQMPVYYYPSGQ-----YPTSTTQQYRPMAPVQYNaQRSQQMPQAAQQAGYQPVLSGQQG 722
Cdd:cd22553   191 GNQALQAQVIPQ--LAQAAQLQPQQLAQVSSQgyiqqIPANASQQQPQMVQQGPN-QSGQIIGQVASASSIQAAAIPLTV 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 723 FQGLIGvqqppqSQNVINNQQGTPVQSV--------MVSYPTMSSYQVPMTQGSQGLPQQSYQQPIMLPNQAGQGSLPAT 794
Cdd:cd22553   268 YTGALA------GQNGSNQQQVGQIVTSpiqgmtqgLTAPASSSIPTVVQQQAIQGNPLPPGTQIIAAGQQLQQDPNDPT 341
                         250       260
                  ....*....|....*....|....*
gi 1877233802 795 GMPVYCNVTPPTPQnNLRLIGPHCP 819
Cdd:cd22553   342 KWQVVADGTPGSKK-RLRRVACTCP 365
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
599-695 5.67e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.14  E-value: 5.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1877233802 599 SSGETPEPPSGP---VYPSSLMPQPAQQPSYVIASTGQQLPTGgfSGSGPPISQQVLQPPPS-PQGFVQQPPPAQMPVYY 674
Cdd:PRK14971  364 QKGDDASGGRGPkqhIKPVFTQPAAAPQPSAAAAASPSPSQSS--AAAQPSAPQSATQPAGTpPTVSVDPPAAVPVNPPS 441
                          90       100
                  ....*....|....*....|.
gi 1877233802 675 YPSGQYPTSTTQQYRPMAPVQ 695
Cdd:PRK14971  442 TAPQAVRPAQFKEEKKIPVSK 462
R3H_Smubp-2_like cd02641
R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and ...
174-221 6.37e-03

R3H domain of Smubp-2_like proteins. Smubp-2_like proteins also contain a helicase_like and an AN1-like Zinc finger domain and have been shown to bind single-stranded DNA. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA.


Pssm-ID: 100070  Cd Length: 60  Bit Score: 35.79  E-value: 6.37e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1877233802 174 IIDFIADNNNHYKKFP-QMSSYQRMLVHRVAAYFGLDHNVDQTGKSVII 221
Cdd:cd02641     8 VKAFMKDPKATELEFPpTLSSHDRLLVHELAEELGLRHESTGEGSDRVI 56
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH