Conserved domains on  [gi|1373511542|ref|WP_107106657|]

MULTISPECIES: GSU2403 family nucleotidyltransferase fold protein [Rhizobium]

Protein Classification

COG5397 family protein (domain architecture ID 10009251)

COG5397 family protein

List of domain hits

Name     Accession  Description                                           Interval  E-value
COG5397  COG5397    Uncharacterized conserved protein [Function unknown]  1-335     3.57e-145

Pssm-ID: 444156  Cd Length: 334  Bit Score: 413.60  E-value: 3.57e-145
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542   1 MKEIDLVYRTMFAELAQRSLDArfhADFPLEGRFVTVPVKGRDYWYFDLPTPNGDKRS-YVGPhSDEEITARVNAHKEIK 79
Cdd:COG5397     3 MKELSLAAQTAYADLLQALRDA---ALFNLRGSFVWKTVKGRVYWYRRYRIRGGERRRrYLGP-DSPETRARIERFKALK 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542  80 DNIKERR----RMVSSLRRAGLPGPDPFAGDITKALADAGLFRLRAVLIGSVAFGTYAGLLGVRLPSSAMQTGDADFAQD 155
Cdd:COG5397    79 ADAEARRkeraRLVRLLRAAGLGRTDRQTGSVLEALAAAGLFRLGGTLVGTHAFRAYEGELGVRLPADAAATGDIDIAQF 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542 156 FAISAEVGDSL-PPVLKILQSIDPGFRAVPHRSDQAKVVAFVNGKGYRVEFLTGNRGSDehTGKPSPMPALgGASAENLR 234
Cdd:COG5397   159 ERLSLALGDVVePPLLDVLRSVDPGFEPVPHLSDGRVWRWAQNRSGYLVEFLTPNRGSD--DEEPVPLPAL-GVSAQALR 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542 235 FLDYLIYEPVRTVLLHRDGVNVLVPAPERYAVHKLIVASRRLTDAlgrAKADKDRSQAALLFEALvETRQGDVLADAYEE 314
Cdd:COG5397   236 FLDYLLADPIRAVALYRSGVLVQVPDPERFAVHKLIVADRRGRDP---AKARKDRAQAAFLIEAL-AERRPDDLAEAYEE 311
                         330       340
                  ....*....|....*....|.
gi 1373511542 315 AWERGPSWQEGITSSLTRLPD 335
Cdd:COG5397   312 ALSRGPKWRERIRASLSRLPE 332
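
The alignment rows above can be summarized as an identity count. The following is a minimal sketch, not part of the CD-Search output. It assumes that a dash marks a gap and that lowercase letters mark unaligned residues, and it skips both. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column is skipped when either row holds a gap ('-')
    or a lowercase (unaligned) residue.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue  # not an aligned column under the stated assumption
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Final block of the COG5397 alignment (query residues 315-335).
query = "AWERGPSWQEGITSSLTRLPD"
model = "ALSRGPKWRERIRASLSRLPE"
identical, aligned = aligned_identity(query, model)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
# prints: 12/21 identical (57.1%)
```

Running the same function over every block and summing the two counts would give an identity figure for the whole hit.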
 
Name          Accession  Description                                                                     Interval  E-value
NTP_transf_8  pfam12281  Nucleotidyltransferase; This is a family of bacterial proteins that have a ...  105-311   7.00e-76

Nucleotidyltransferase; This is a family of bacterial proteins that have a nucleotidyltransferase fold. The fold prediction is backed up by conservation of three highly characteristic sequence motifs found in all other nucleotidyltransferases: i) pDhDhhh(h/p), where p is a polar residue and h is a hydrophobic residue; ii) upstream of the first, a GG/S; iii) a conserved D/E in a hydrophobic surround. In a proposed classification of nucleotidyltransferases, this family is a group XVIII NTP-transferase. Many of these sequences were classified in the COG database as COG5397. The exact function is not known.


Pssm-ID: 463520  Cd Length: 209  Bit Score: 232.59  E-value: 7.00e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542 105 GDITKALADAGLFRLRAVLIGSVAFGTYAGLLGVRLPSSAMQTGDADF--AQDFAISAEVGDSLP-PVLKILQSIDPGFR 181
Cdd:pfam12281   1 GRVLRALAAAGLFRLGGVLVGTNAFYAYEGLLGVRLPGEMLATGDIDLlfAQFRRLSLALGDSVPePLLGVLRSVDPTFE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1373511542 182 AVPHRSDQAKVVAFVNGKGYRVEFLTGNRGSDEHTGKPSPMPALGGASAENLRFLDYLIYEPVRTVLLHRDG--VNVLVP 259
Cdd:pfam12281  81 PVPRLSDRAAWTTYRNSDGYLVDLLTPSRGSGEKEDGPAPLPALDGLSAQPLRFLDWLLNDPVRAVALDRNGapVLVQVP 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1373511542 260 APERYAVHKLIVASRRltDALGRAKADKDRSQAALLFEALVETRQgDVLADA 311
Cdd:pfam12281 161 DPRRFAVHKLIISQRR--EGRDPLKRAKDLAQAAALIELLAETRP-LLLDDA 209
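
The intervals and Cd Length values reported for the two hits can be turned into coverage fractions. This is a minimal sketch rather than anything CD-Search computes itself. It assumes the query is 335 residues long, which is inferred from the last residue number in the COG5397 alignment, and the function name `coverage` is hypothetical.

```python
def coverage(start: int, end: int, length: int) -> float:
    """Fraction of a sequence covered by the 1-based, inclusive interval start..end."""
    if not 1 <= start <= end <= length:
        raise ValueError(f"interval {start}-{end} does not fit length {length}")
    return (end - start + 1) / length


# Assumption: the query is 335 residues long, inferred from the last residue
# number shown in the COG5397 alignment.
QUERY_LENGTH = 335

# (query interval, model interval, model length = "Cd Length") per hit,
# copied from the hit list and the alignment coordinates.
HITS = {
    "COG5397": ((1, 335), (3, 332), 334),
    "pfam12281": ((105, 311), (1, 209), 209),
}

for accession, (query_iv, model_iv, model_length) in HITS.items():
    query_cov = coverage(*query_iv, QUERY_LENGTH)
    model_cov = coverage(*model_iv, model_length)
    print(f"{accession}: query {query_cov:.1%}, model {model_cov:.1%}")
```

Under these assumptions the pfam12281 model is matched end to end while covering only part of the query, whereas COG5397 spans the whole query.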
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  • Database: CDSEARCH/cdd
  • Low complexity filter: no
  • Composition Based Adjustment: yes
  • E-value threshold: 0.01
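
The E-value threshold in the preset options decides which domain hits are listed. The sketch below illustrates that filter on the two hits from this page. The `DomainHit` class and `passes_threshold` function are hypothetical names, the third hit is invented purely for illustration, and treating a hit exactly at the threshold as reported is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    accession: str
    interval: tuple[int, int]
    e_value: float


E_VALUE_THRESHOLD = 0.01  # from the preset options


def passes_threshold(hit: DomainHit, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit.e_value <= threshold


hits = [
    DomainHit("COG5397", (1, 335), 3.57e-145),
    DomainHit("pfam12281", (105, 311), 7.00e-76),
    # Hypothetical weak hit, not present on this page, shown only to
    # illustrate what the filter would discard.
    DomainHit("hypothetical_weak_hit", (10, 60), 0.5),
]

reported = [hit.accession for hit in hits if passes_threshold(hit)]
print(reported)
```

Both hits on this page fall many orders of magnitude below the threshold, so the choice of cutoff does not affect this result.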
