NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|530387583|ref|XP_005273466|]
View 

heparan-alpha-glucosaminide N-acetyltransferase isoform X1 [Homo sapiens]

Protein Classification

acyltransferase family protein( domain architecture ID 10008472)

DUF5009 domain-containing protein may act as acyltransferase

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COG4299 COG4299
Predicted acyltransferase, DUF1624 domain [General function prediction only];
237-672 3.33e-75

Predicted acyltransferase, DUF1624 domain [General function prediction only];


:

Pssm-ID: 443440 [Multi-domain]  Cd Length: 370  Bit Score: 245.85  E-value: 3.33e-75
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 237 PPRLRSVDTFRGIALILMVFVNYGGG---KYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 313
Cdd:COG4299    2 SKRLLSLDVLRGLTIALMILVNNPGSwshVYAPLLHAEWHGFTPTDLVFPFFLFIVGVAMPFSLSKRLAKGAPKSALYRK 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 314 IAWRSFLLICIGIIIvnpnYCLG--PLSWDKVRIPGVLQRLGVTYFVVAVLELLFakpvpehcasersclslrditSSWP 391
Cdd:COG4299   82 ILKRSLILFLLGLFL----NWFPffLKDFSEIRIPGVLQRIALAYLFAALLYLYL---------------------SRKT 136
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 392 QWLLILV-LEGLWLGLTFlLPVPGCPTGYLGPGGigdfgkypNctggAAGYIDRLLLGDDHLYQhpssavlyHTEVAYDP 470
Cdd:COG4299  137 QLIIAAGlLLGYWLLLAF-VPVPGFGAGPLSPEG--------N----LAAYIDRLLLGKGHLYK--------GEGKTFDP 195
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 471 EGILGTINSIVMAFLGVqccpdwvtkqaclteplsplwrilfgpclevratepaQAGKILLYYKARTKDIlirftAWCCI 550
Cdd:COG4299  196 EGLLSTLPAIVTVLLGY-------------------------------------LAGRLLRSKKSNREKV-----LKLLI 233
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 551 LGLISVALtkvseneGFI-----PVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGL--WTgTPFFYPGMNSILVYVGH 623
Cdd:COG4299  234 AGVLLLLL-------GLLwnlvfPINKKLWTSSFVLLTGGLALLLLALFYWLIDVKGYrkWT-FPFVVFGMNAIFIYLLS 305
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530387583 624 EVFENYFpFQWKLKDNQSH-KEHLTQNIVA----------------TALWVLIAYILYRKKIFWKI 672
Cdd:COG4299  306 ELVARLL-GLIKVGGTATSlFGWLYQNLFQpilgpynasllfalafVLLWWLILYWLYKKKIFIKV 370
 
Name Accession Description Interval E-value
COG4299 COG4299
Predicted acyltransferase, DUF1624 domain [General function prediction only];
237-672 3.33e-75

Predicted acyltransferase, DUF1624 domain [General function prediction only];


Pssm-ID: 443440 [Multi-domain]  Cd Length: 370  Bit Score: 245.85  E-value: 3.33e-75
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 237 PPRLRSVDTFRGIALILMVFVNYGGG---KYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 313
Cdd:COG4299    2 SKRLLSLDVLRGLTIALMILVNNPGSwshVYAPLLHAEWHGFTPTDLVFPFFLFIVGVAMPFSLSKRLAKGAPKSALYRK 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 314 IAWRSFLLICIGIIIvnpnYCLG--PLSWDKVRIPGVLQRLGVTYFVVAVLELLFakpvpehcasersclslrditSSWP 391
Cdd:COG4299   82 ILKRSLILFLLGLFL----NWFPffLKDFSEIRIPGVLQRIALAYLFAALLYLYL---------------------SRKT 136
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 392 QWLLILV-LEGLWLGLTFlLPVPGCPTGYLGPGGigdfgkypNctggAAGYIDRLLLGDDHLYQhpssavlyHTEVAYDP 470
Cdd:COG4299  137 QLIIAAGlLLGYWLLLAF-VPVPGFGAGPLSPEG--------N----LAAYIDRLLLGKGHLYK--------GEGKTFDP 195
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 471 EGILGTINSIVMAFLGVqccpdwvtkqaclteplsplwrilfgpclevratepaQAGKILLYYKARTKDIlirftAWCCI 550
Cdd:COG4299  196 EGLLSTLPAIVTVLLGY-------------------------------------LAGRLLRSKKSNREKV-----LKLLI 233
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 551 LGLISVALtkvseneGFI-----PVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGL--WTgTPFFYPGMNSILVYVGH 623
Cdd:COG4299  234 AGVLLLLL-------GLLwnlvfPINKKLWTSSFVLLTGGLALLLLALFYWLIDVKGYrkWT-FPFVVFGMNAIFIYLLS 305
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530387583 624 EVFENYFpFQWKLKDNQSH-KEHLTQNIVA----------------TALWVLIAYILYRKKIFWKI 672
Cdd:COG4299  306 ELVARLL-GLIKVGGTATSlFGWLYQNLFQpilgpynasllfalafVLLWWLILYWLYKKKIFIKV 370
DUF5009 pfam16401
Domain of unknown function (DUF5009); This small family of proteins is functionally ...
239-328 1.05e-04

Domain of unknown function (DUF5009); This small family of proteins is functionally uncharacterized. This family is mainly found in various Bacteroides species. The members in this family are around 470 residues in length.


Pssm-ID: 293010  Cd Length: 260  Bit Score: 44.40  E-value: 1.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583  239 RLRSVDTFRGIALILMVF---VNYGGGKYWYFkHAS-----------WNGLTVADLVFPWFVFIMGSSIFLSMTSILQRG 304
Cdd:pfam16401   1 RALSLDALRGYAIILMVLsgsIAFSILPGWMY-HAQtpppghifnpeIPGITWVDLVFPFFLFAMGAAIPLALGKKAEKG 79
                          90       100
                  ....*....|....*....|....
gi 530387583  305 CSKFRLLGKIAWRSFLLICIGIII 328
Cdd:pfam16401  80 SSKLLLLYDAIKRFVLLTFFALFT 103
 
Name Accession Description Interval E-value
COG4299 COG4299
Predicted acyltransferase, DUF1624 domain [General function prediction only];
237-672 3.33e-75

Predicted acyltransferase, DUF1624 domain [General function prediction only];


Pssm-ID: 443440 [Multi-domain]  Cd Length: 370  Bit Score: 245.85  E-value: 3.33e-75
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 237 PPRLRSVDTFRGIALILMVFVNYGGG---KYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 313
Cdd:COG4299    2 SKRLLSLDVLRGLTIALMILVNNPGSwshVYAPLLHAEWHGFTPTDLVFPFFLFIVGVAMPFSLSKRLAKGAPKSALYRK 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 314 IAWRSFLLICIGIIIvnpnYCLG--PLSWDKVRIPGVLQRLGVTYFVVAVLELLFakpvpehcasersclslrditSSWP 391
Cdd:COG4299   82 ILKRSLILFLLGLFL----NWFPffLKDFSEIRIPGVLQRIALAYLFAALLYLYL---------------------SRKT 136
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 392 QWLLILV-LEGLWLGLTFlLPVPGCPTGYLGPGGigdfgkypNctggAAGYIDRLLLGDDHLYQhpssavlyHTEVAYDP 470
Cdd:COG4299  137 QLIIAAGlLLGYWLLLAF-VPVPGFGAGPLSPEG--------N----LAAYIDRLLLGKGHLYK--------GEGKTFDP 195
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 471 EGILGTINSIVMAFLGVqccpdwvtkqaclteplsplwrilfgpclevratepaQAGKILLYYKARTKDIlirftAWCCI 550
Cdd:COG4299  196 EGLLSTLPAIVTVLLGY-------------------------------------LAGRLLRSKKSNREKV-----LKLLI 233
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 551 LGLISVALtkvseneGFI-----PVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGL--WTgTPFFYPGMNSILVYVGH 623
Cdd:COG4299  234 AGVLLLLL-------GLLwnlvfPINKKLWTSSFVLLTGGLALLLLALFYWLIDVKGYrkWT-FPFVVFGMNAIFIYLLS 305
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530387583 624 EVFENYFpFQWKLKDNQSH-KEHLTQNIVA----------------TALWVLIAYILYRKKIFWKI 672
Cdd:COG4299  306 ELVARLL-GLIKVGGTATSlFGWLYQNLFQpilgpynasllfalafVLLWWLILYWLYKKKIFIKV 370
COG3503 COG3503
Uncharacterized membrane protein, DUF1624 family [Function unknown];
237-340 2.47e-05

Uncharacterized membrane protein, DUF1624 family [Function unknown];


Pssm-ID: 442726  Cd Length: 273  Bit Score: 46.37  E-value: 2.47e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583 237 PPRLRSVDTFRGIALILMV------FVNYGGGKYWYFKHASWNGLtVADLVFPWFVFIMGSSIFLSmtsiLQRGCSKFRL 310
Cdd:COG3503    1 TSRLASIDALRGLAMVLMAldhvrdDLHFFGLVPTDLATTPPWRW-FTHLCAPLFLFLAGVSLYLA----HSRGIRWRAL 75
                         90       100       110
                 ....*....|....*....|....*....|
gi 530387583 311 LGKIAWRSFLLICIGIIIVNPNYCLGPLSW 340
Cdd:COG3503   76 SRFLLKRGLWLILLALLITLFTWLFFPDSF 105
DUF5009 pfam16401
Domain of unknown function (DUF5009); This small family of proteins is functionally ...
239-328 1.05e-04

Domain of unknown function (DUF5009); This small family of proteins is functionally uncharacterized. This family is mainly found in various Bacteroides species. The members in this family are around 470 residues in length.


Pssm-ID: 293010  Cd Length: 260  Bit Score: 44.40  E-value: 1.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583  239 RLRSVDTFRGIALILMVF---VNYGGGKYWYFkHAS-----------WNGLTVADLVFPWFVFIMGSSIFLSMTSILQRG 304
Cdd:pfam16401   1 RALSLDALRGYAIILMVLsgsIAFSILPGWMY-HAQtpppghifnpeIPGITWVDLVFPFFLFAMGAAIPLALGKKAEKG 79
                          90       100
                  ....*....|....*....|....
gi 530387583  305 CSKFRLLGKIAWRSFLLICIGIII 328
Cdd:pfam16401  80 SSKLLLLYDAIKRFVLLTFFALFT 103
HGSNAT_cat pfam07786
Heparan-alpha-glucosaminide N-acetyltransferase, catalytic; This entry includes the catalytic ...
239-433 4.65e-03

Heparan-alpha-glucosaminide N-acetyltransferase, catalytic; This entry includes the catalytic domain of HGSNAT (Heparan-alpha-glucosaminide N-acetyltransferase). It contains the conserved histidine in the active site (His269), thought to hold the acetyl group during the transfer across the membrane and required for its enzymatic activity. HGSNAT transfers an acetyl group from cytoplasmically derived acetyl-CoA to terminal N-glucosamine residues of heparan sulfate within the lysosomes.


Pssm-ID: 377915  Cd Length: 222  Bit Score: 39.12  E-value: 4.65e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583  239 RLRSVDTFRGIALILMVF--------------VNYGGGKYWYFKHAswngltVADLvfpwFVFIMGSSIFLSmtsilqrg 304
Cdd:pfam07786   1 RYWEIDALRGIALILMIIfhflwdleffgyldVDLTSGFWVYFARL------IASL----FLFIAGISLVLA-------- 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530387583  305 CSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGPLSWdkVRIpGVLQRLGvtyfVVAVLELLFAKpvpehcasersclslr 384
Cdd:pfam07786  63 HGRGLRWRKFLKRGLKIFAAALLITAATYIAFPDSF--IYF-GILHFIG----LASLLGLLFLR---------------- 119
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 530387583  385 ditssWPQWLLILVLeGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPN 433
Cdd:pfam07786 120 -----LPKWLLLLGA-LLFLALGLFLRSPTFDTPLLLWLGLSPLPFRTL 162
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH