NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|15240970|ref|NP_195760|]
View 

HEAT repeat-containing protein [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Symplekin_C pfam12295
Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is ...
1033-1196 2.19e-69

Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is approximately 180 amino acids in length. There is a single completely conserved residue P that may be functionally important. Symplekn has been localized, by light and electron microscopy, to the plaque associated with the cytoplasmic face of the tight junction-containing zone (zonula occludens) of polar epithelial cells and of Sertoli cells of testis. However, both the mRNA and the protein can also be detected in a wide range of cell types that do not form tight junctions. Careful analyses have revealed that the protein occurs in all these diverse cells in the nucleoplasm, and only in those cells forming tight junctions is it recruited, partly but specifically, to the plaque structure of the zonula occludens.


:

Pssm-ID: 463523  Cd Length: 185  Bit Score: 230.97  E-value: 2.19e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1033 DVEILFPILPFLPRDDVLRIFPHMVNLPMEKFQVALSRVLQGSSQS-----GPVLSPSEALIAIHSIDPARDGIPLKQVT 1107
Cdd:pfam12295    1 DVRFLIPILGGLEKDEVIALLPRLIKLPPEVVKEVFNRLLQAPPSRvnqteGSPLTPVELLVALHNIDPKDDGVPLKRIV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1108 DACNTCFAQRQTFTQQVLAGVLNQLVQQIPLPMLFMRTVLQAIGAFPALSDFILEILSRLVSKQIWKYPKLWVGFLKCTQ 1187
Cdd:pfam12295   81 DALDLCFSMTDVFTQEVLAAALQQIVEQPPLPLLFMRTVIQALQKYPSLRSFVANILPRLIQKQVWKNKKLWEGFIKCAK 160

                   ....*....
gi 15240970   1188 TTQPQSYKV 1196
Cdd:pfam12295  161 RTKPQSFPV 169
SYMPK_PTA1_N pfam11935
Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from ...
96-291 1.00e-40

Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from animals and PTA1 protein from budding yeasts and is typically between 239 to 261 amino acids in length. Symplekin is a scaffold protein that functions as a component of a multimolecular complex involved in histone mRNA 3'-end processing. PTA1 is involved in pre-tRNA processing. PTA1 is a subunit of the cleavage and polyadenylation factor (CPF), which plays a key role in polyadenylation-dependent pre-mRNA 3'-end formation and cooperates with cleavage factors including the CFIA complex and NAB4/CFIB. This domain has the ARM or HEAT fold, with seven pairs of antiparallel alpha-helices arranged in the shape of an arc. It is important for interaction with Ssu72 and stimulates Ssu72 C-terminal domain phosphatase activity in vitro.


:

Pssm-ID: 463403  Cd Length: 218  Bit Score: 150.06  E-value: 1.00e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970     96 TPAVARQVIACGADLFRSTLERVAvqglhsseLNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTP 175
Cdd:pfam11935    1 DPAVVKRVIQAAASIYPLVFRWVA--------KNSNDSELWEKMTAIKSQILNLWDSADNDGVKLCAIKFLERVVLVQTP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970    176 HEGIEA-----DFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAakSLNSSTIIVLINSLSSVAKKRPAYCGRIL 250
Cdd:pfam11935   73 GDISDRrpdgnDFSLSIVPRNHPLLKPQNLEAEAKGLLDRLLKVLQESP--SISSPLVMATLNSLAIIAKQRPQFASRIL 150
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 15240970    251 PVLLSLDPLSFLKGVYAAATNL---------ALKTVFLSCLKctHPAAAP 291
Cdd:pfam11935  151 SALLSFNPLKKAQSPNLPPTLSklqvksvekTLRIFLLHLLK--NPPSSP 198
HEAT COG1413
HEAT repeat [General function prediction only];
48-104 1.38e-04

HEAT repeat [General function prediction only];


:

Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 43.46  E-value: 1.38e-04
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 15240970   48 LPHLFDLLSDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVI 104
Cdd:COG1413   18 VPALIAALADEDPDVRAAAARALGRLGD------PRAVPALLEALKDPDPEVRAAAA 68
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1286-1433 6.07e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.14  E-value: 6.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1286 LSHSQvDHQEPSQVVASQSQSSPIGTVQSAmSQSQNSPIDTGRS-------------EMSQSQNSPIDTGRSEMSQSQNS 1352
Cdd:NF033849  353 ISHSE-SSSESTGTSVGHSTSSSVSSSESS-SRSSSSGVSGGFSggiagggvtseglGASQGGSEGWGSGDSVQSVSQSY 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1353 PIDTGRSEMSQSQNSpidTGRSEMSeSQSSPIGQSQSSPIGTGQSDMSQTPQVSDSSAPEPTSHTrTSDPQASSQTLRDD 1432
Cdd:NF033849  431 GSSSSTGTSSGHSDS---SSHSTSS-GQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDS-VGDSTGTSESVSQG 505

                  .
gi 15240970  1433 D 1433
Cdd:NF033849  506 D 506
 
Name Accession Description Interval E-value
Symplekin_C pfam12295
Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is ...
1033-1196 2.19e-69

Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is approximately 180 amino acids in length. There is a single completely conserved residue P that may be functionally important. Symplekn has been localized, by light and electron microscopy, to the plaque associated with the cytoplasmic face of the tight junction-containing zone (zonula occludens) of polar epithelial cells and of Sertoli cells of testis. However, both the mRNA and the protein can also be detected in a wide range of cell types that do not form tight junctions. Careful analyses have revealed that the protein occurs in all these diverse cells in the nucleoplasm, and only in those cells forming tight junctions is it recruited, partly but specifically, to the plaque structure of the zonula occludens.


Pssm-ID: 463523  Cd Length: 185  Bit Score: 230.97  E-value: 2.19e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1033 DVEILFPILPFLPRDDVLRIFPHMVNLPMEKFQVALSRVLQGSSQS-----GPVLSPSEALIAIHSIDPARDGIPLKQVT 1107
Cdd:pfam12295    1 DVRFLIPILGGLEKDEVIALLPRLIKLPPEVVKEVFNRLLQAPPSRvnqteGSPLTPVELLVALHNIDPKDDGVPLKRIV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1108 DACNTCFAQRQTFTQQVLAGVLNQLVQQIPLPMLFMRTVLQAIGAFPALSDFILEILSRLVSKQIWKYPKLWVGFLKCTQ 1187
Cdd:pfam12295   81 DALDLCFSMTDVFTQEVLAAALQQIVEQPPLPLLFMRTVIQALQKYPSLRSFVANILPRLIQKQVWKNKKLWEGFIKCAK 160

                   ....*....
gi 15240970   1188 TTQPQSYKV 1196
Cdd:pfam12295  161 RTKPQSFPV 169
SYMPK_PTA1_N pfam11935
Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from ...
96-291 1.00e-40

Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from animals and PTA1 protein from budding yeasts and is typically between 239 to 261 amino acids in length. Symplekin is a scaffold protein that functions as a component of a multimolecular complex involved in histone mRNA 3'-end processing. PTA1 is involved in pre-tRNA processing. PTA1 is a subunit of the cleavage and polyadenylation factor (CPF), which plays a key role in polyadenylation-dependent pre-mRNA 3'-end formation and cooperates with cleavage factors including the CFIA complex and NAB4/CFIB. This domain has the ARM or HEAT fold, with seven pairs of antiparallel alpha-helices arranged in the shape of an arc. It is important for interaction with Ssu72 and stimulates Ssu72 C-terminal domain phosphatase activity in vitro.


Pssm-ID: 463403  Cd Length: 218  Bit Score: 150.06  E-value: 1.00e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970     96 TPAVARQVIACGADLFRSTLERVAvqglhsseLNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTP 175
Cdd:pfam11935    1 DPAVVKRVIQAAASIYPLVFRWVA--------KNSNDSELWEKMTAIKSQILNLWDSADNDGVKLCAIKFLERVVLVQTP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970    176 HEGIEA-----DFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAakSLNSSTIIVLINSLSSVAKKRPAYCGRIL 250
Cdd:pfam11935   73 GDISDRrpdgnDFSLSIVPRNHPLLKPQNLEAEAKGLLDRLLKVLQESP--SISSPLVMATLNSLAIIAKQRPQFASRIL 150
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 15240970    251 PVLLSLDPLSFLKGVYAAATNL---------ALKTVFLSCLKctHPAAAP 291
Cdd:pfam11935  151 SALLSFNPLKKAQSPNLPPTLSklqvksvekTLRIFLLHLLK--NPPSSP 198
HEAT COG1413
HEAT repeat [General function prediction only];
48-104 1.38e-04

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 43.46  E-value: 1.38e-04
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 15240970   48 LPHLFDLLSDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVI 104
Cdd:COG1413   18 VPALIAALADEDPDVRAAAARALGRLGD------PRAVPALLEALKDPDPEVRAAAA 68
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
48-104 1.85e-04

HEAT repeats; This family includes multiple HEAT repeats.


Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 41.55  E-value: 1.85e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 15240970     48 LPHLFDLL-SDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVI 104
Cdd:pfam13646    1 LPALLQALlRDPDPEVRAAAIRALGRIGD------PEAVPALLELLKDEDPAVRRAAA 52
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1286-1433 6.07e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.14  E-value: 6.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1286 LSHSQvDHQEPSQVVASQSQSSPIGTVQSAmSQSQNSPIDTGRS-------------EMSQSQNSPIDTGRSEMSQSQNS 1352
Cdd:NF033849  353 ISHSE-SSSESTGTSVGHSTSSSVSSSESS-SRSSSSGVSGGFSggiagggvtseglGASQGGSEGWGSGDSVQSVSQSY 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1353 PIDTGRSEMSQSQNSpidTGRSEMSeSQSSPIGQSQSSPIGTGQSDMSQTPQVSDSSAPEPTSHTrTSDPQASSQTLRDD 1432
Cdd:NF033849  431 GSSSSTGTSSGHSDS---SSHSTSS-GQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDS-VGDSTGTSESVSQG 505

                  .
gi 15240970  1433 D 1433
Cdd:NF033849  506 D 506
 
Name Accession Description Interval E-value
Symplekin_C pfam12295
Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is ...
1033-1196 2.19e-69

Symplekin tight junction protein C terminal; This domain family is found in eukaryotes, and is approximately 180 amino acids in length. There is a single completely conserved residue P that may be functionally important. Symplekn has been localized, by light and electron microscopy, to the plaque associated with the cytoplasmic face of the tight junction-containing zone (zonula occludens) of polar epithelial cells and of Sertoli cells of testis. However, both the mRNA and the protein can also be detected in a wide range of cell types that do not form tight junctions. Careful analyses have revealed that the protein occurs in all these diverse cells in the nucleoplasm, and only in those cells forming tight junctions is it recruited, partly but specifically, to the plaque structure of the zonula occludens.


Pssm-ID: 463523  Cd Length: 185  Bit Score: 230.97  E-value: 2.19e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1033 DVEILFPILPFLPRDDVLRIFPHMVNLPMEKFQVALSRVLQGSSQS-----GPVLSPSEALIAIHSIDPARDGIPLKQVT 1107
Cdd:pfam12295    1 DVRFLIPILGGLEKDEVIALLPRLIKLPPEVVKEVFNRLLQAPPSRvnqteGSPLTPVELLVALHNIDPKDDGVPLKRIV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970   1108 DACNTCFAQRQTFTQQVLAGVLNQLVQQIPLPMLFMRTVLQAIGAFPALSDFILEILSRLVSKQIWKYPKLWVGFLKCTQ 1187
Cdd:pfam12295   81 DALDLCFSMTDVFTQEVLAAALQQIVEQPPLPLLFMRTVIQALQKYPSLRSFVANILPRLIQKQVWKNKKLWEGFIKCAK 160

                   ....*....
gi 15240970   1188 TTQPQSYKV 1196
Cdd:pfam12295  161 RTKPQSFPV 169
SYMPK_PTA1_N pfam11935
Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from ...
96-291 1.00e-40

Symplekin/PTA1 N-terminal; This is the N-terminal domain found in the symplekin protein from animals and PTA1 protein from budding yeasts and is typically between 239 to 261 amino acids in length. Symplekin is a scaffold protein that functions as a component of a multimolecular complex involved in histone mRNA 3'-end processing. PTA1 is involved in pre-tRNA processing. PTA1 is a subunit of the cleavage and polyadenylation factor (CPF), which plays a key role in polyadenylation-dependent pre-mRNA 3'-end formation and cooperates with cleavage factors including the CFIA complex and NAB4/CFIB. This domain has the ARM or HEAT fold, with seven pairs of antiparallel alpha-helices arranged in the shape of an arc. It is important for interaction with Ssu72 and stimulates Ssu72 C-terminal domain phosphatase activity in vitro.


Pssm-ID: 463403  Cd Length: 218  Bit Score: 150.06  E-value: 1.00e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970     96 TPAVARQVIACGADLFRSTLERVAvqglhsseLNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTP 175
Cdd:pfam11935    1 DPAVVKRVIQAAASIYPLVFRWVA--------KNSNDSELWEKMTAIKSQILNLWDSADNDGVKLCAIKFLERVVLVQTP 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970    176 HEGIEA-----DFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAakSLNSSTIIVLINSLSSVAKKRPAYCGRIL 250
Cdd:pfam11935   73 GDISDRrpdgnDFSLSIVPRNHPLLKPQNLEAEAKGLLDRLLKVLQESP--SISSPLVMATLNSLAIIAKQRPQFASRIL 150
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 15240970    251 PVLLSLDPLSFLKGVYAAATNL---------ALKTVFLSCLKctHPAAAP 291
Cdd:pfam11935  151 SALLSFNPLKKAQSPNLPPTLSklqvksvekTLRIFLLHLLK--NPPSSP 198
HEAT COG1413
HEAT repeat [General function prediction only];
48-104 1.38e-04

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 43.46  E-value: 1.38e-04
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 15240970   48 LPHLFDLLSDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVI 104
Cdd:COG1413   18 VPALIAALADEDPDVRAAAARALGRLGD------PRAVPALLEALKDPDPEVRAAAA 68
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
48-104 1.85e-04

HEAT repeats; This family includes multiple HEAT repeats.


Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 41.55  E-value: 1.85e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 15240970     48 LPHLFDLL-SDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVI 104
Cdd:pfam13646    1 LPALLQALlRDPDPEVRAAAIRALGRIGD------PEAVPALLELLKDEDPAVRRAAA 52
HEAT COG1413
HEAT repeat [General function prediction only];
45-105 6.09e-04

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 41.54  E-value: 6.09e-04
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15240970   45 TELLPHLFDLLSDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVIA 105
Cdd:COG1413   46 PRAVPALLEALKDPDPEVRAAAAEALGRIGD------PEAVPALIAALKDEDPEVRRAAAE 100
HEAT COG1413
HEAT repeat [General function prediction only];
46-104 1.17e-03

HEAT repeat [General function prediction only];


Pssm-ID: 441023 [Multi-domain]  Cd Length: 137  Bit Score: 40.77  E-value: 1.17e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 15240970   46 ELLPHLFDLLSDQFGAVRKFVAEILGEIGlkyvelIPEIVPLLIKSLEDETPAVARQVI 104
Cdd:COG1413   78 EAVPALIAALKDEDPEVRRAAAEALGRLG------DPAAVPALLEALKDPDWEVRRAAA 130
HEAT_2 pfam13646
HEAT repeats; This family includes multiple HEAT repeats.
46-105 1.38e-03

HEAT repeats; This family includes multiple HEAT repeats.


Pssm-ID: 433376 [Multi-domain]  Cd Length: 88  Bit Score: 39.24  E-value: 1.38e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970     46 ELLPHLFDLLSDQFGAVRKFVAEILGEIGLkyveliPEIVPLLIKSLEDETPAVARQVIA 105
Cdd:pfam13646   31 EAVPALLELLKDEDPAVRRAAAEALGKIGD------PEALPALLELLRDDDDDVVRAAAA 84
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
1286-1433 6.07e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.14  E-value: 6.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1286 LSHSQvDHQEPSQVVASQSQSSPIGTVQSAmSQSQNSPIDTGRS-------------EMSQSQNSPIDTGRSEMSQSQNS 1352
Cdd:NF033849  353 ISHSE-SSSESTGTSVGHSTSSSVSSSESS-SRSSSSGVSGGFSggiagggvtseglGASQGGSEGWGSGDSVQSVSQSY 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15240970  1353 PIDTGRSEMSQSQNSpidTGRSEMSeSQSSPIGQSQSSPIGTGQSDMSQTPQVSDSSAPEPTSHTrTSDPQASSQTLRDD 1432
Cdd:NF033849  431 GSSSSTGTSSGHSDS---SSHSTSS-GQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDS-VGDSTGTSESVSQG 505

                  .
gi 15240970  1433 D 1433
Cdd:NF033849  506 D 506
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH