Conserved domains on  [gi|530390903|ref|XP_005251942|]

uncharacterized protein C9orf43 isoform X3 [Homo sapiens]

List of domain hits

Name                  Accession  Interval  E-value
DUF4647 super family  cl21314    1-312     3.29e-153

Description: Domain of unknown function (DUF4647). This family of proteins is found in eukaryotes. Proteins in this family are typically between 282 and 480 amino acids in length.


The actual alignment was detected with superfamily member pfam15504:

Pssm-ID: 464752  Cd Length: 467  Bit Score: 437.76  E-value: 3.29e-153
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903    1 MVVLWIPEETEIHV--------SQHGKKKRKNSAVKSKSFLGLSGNQSAGTRVGTPGMIVPPPTPVQLSEQFSSDFLPLW 72
Cdd:pfam15504 140 MVVIWIPEEPEKHVaeekpdvtSQDGKKKRKKSTVKSKSSLGLSGKQYRETQLRSPGMIVPPPSPVHLLEQLSSESIPLW 219
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903   73 AQSEALPQDLLKELLPGGKQTMLCPEMKIKLAMMKKNLPLEKNRPDSVISSKMFLSIHRLTLERPALRYPERLKKL-HNL 151
Cdd:pfam15504 220 AQFDMLPQDLLKDLLLDEGKTMPCPEMKIQLAMMKKSLPLEKSRPDSAISSKMFLSVHRLTLQRPSLRYPEHLKKLrHNL 299
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903  152 KTEGYrkQQQRQQQQQQQQKKVKTPIKKQEAKKKAKSDPGIQSTSHKHPVTTVHDRLyGYRTLPGQNSDMKQQQQM-EKG 230
Cdd:pfam15504 300 KTEGL--RKQQQWQQQQQQRKVKTPTKKQEAKKKAKSDPGSQYTSRKHSGHIFHDPV-GLRTLRGQESDKKQQQEGkEKG 376
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903  231 TTSKQDSTERPKMNYYDH-ADFHHSVKSPELYETEPTNKDISAPVDAVPEAQAARQKKISFNFSEIMASTGWNSELKLLR 309
Cdd:pfam15504 377 PTLKQVSTERPQMDYAEKyLDYYHSPESPELYETESTYKDISTQVEAVLESQASSREETPKNLSASMDGISWNPELKLLR 456

                  ...
gi 530390903  310 ILQ 312
Cdd:pfam15504 457 ILQ 459
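
The alignment rows above follow a fixed pattern (label, start coordinate, residues, end coordinate), so they can be checked mechanically. The following is a minimal Python sketch, not part of CD-Search itself. It assumes that lowercase letters and dashes mark unaligned positions, and the helper names (`parse_rows`, `residues_match_coordinates`, `column_stats`) are hypothetical. It verifies that each row's residue count agrees with its coordinates and counts identical residues over aligned columns, using two blocks copied from the alignment.

```python
import re

# One alignment row: label, start coordinate, residues, end coordinate.
ROW = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)

# First and last alignment blocks copied from the listing (rulers included).
SAMPLE = """
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903    1 MVVLWIPEETEIHV--------SQHGKKKRKNSAVKSKSFLGLSGNQSAGTRVGTPGMIVPPPTPVQLSEQFSSDFLPLW 72
Cdd:pfam15504 140 MVVIWIPEEPEKHVaeekpdvtSQDGKKKRKKSTVKSKSSLGLSGKQYRETQLRSPGMIVPPPSPVHLLEQLSSESIPLW 219
                  ...
gi 530390903  310 ILQ 312
Cdd:pfam15504 457 ILQ 459
"""


def parse_rows(text):
    """Return (query_rows, subject_rows), each a list of (start, seq, end)."""
    query, subject = [], []
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match is None:
            continue  # ruler, coordinate scale, or blank line
        row = (int(match["start"]), match["seq"], int(match["end"]))
        (query if match["label"].startswith("gi ") else subject).append(row)
    return query, subject


def residues_match_coordinates(row):
    """True if the number of letters in the row equals end - start + 1."""
    start, seq, end = row
    return sum(ch.isalpha() for ch in seq) == end - start + 1


def column_stats(query_seq, subject_seq):
    """Count aligned columns (both uppercase) and identical ones among them."""
    aligned = identical = 0
    for q, s in zip(query_seq, subject_seq):
        if q.isupper() and s.isupper():  # assumption: lowercase/dash = unaligned
            aligned += 1
            identical += q == s
    return aligned, identical


query, subject = parse_rows(SAMPLE)
consistent = all(residues_match_coordinates(r) for r in query + subject)

aligned_total = identical_total = 0
for (_, q_seq, _), (_, s_seq, _) in zip(query, subject):
    aligned, identical = column_stats(q_seq, s_seq)
    aligned_total += aligned
    identical_total += identical

print("coordinates consistent:", consistent)
print(f"{identical_total} of {aligned_total} aligned columns identical")
```

Feeding the full alignment text to `parse_rows` instead of `SAMPLE` would give the identity over the whole 1-312 interval.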
 
Name     Accession  Interval  E-value
DUF4647  pfam15504  1-312     3.29e-153

Description: Domain of unknown function (DUF4647). This family of proteins is found in eukaryotes. Proteins in this family are typically between 282 and 480 amino acids in length.


Pssm-ID: 464752  Cd Length: 467  Bit Score: 437.76  E-value: 3.29e-153
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903    1 MVVLWIPEETEIHV--------SQHGKKKRKNSAVKSKSFLGLSGNQSAGTRVGTPGMIVPPPTPVQLSEQFSSDFLPLW 72
Cdd:pfam15504 140 MVVIWIPEEPEKHVaeekpdvtSQDGKKKRKKSTVKSKSSLGLSGKQYRETQLRSPGMIVPPPSPVHLLEQLSSESIPLW 219
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903   73 AQSEALPQDLLKELLPGGKQTMLCPEMKIKLAMMKKNLPLEKNRPDSVISSKMFLSIHRLTLERPALRYPERLKKL-HNL 151
Cdd:pfam15504 220 AQFDMLPQDLLKDLLLDEGKTMPCPEMKIQLAMMKKSLPLEKSRPDSAISSKMFLSVHRLTLQRPSLRYPEHLKKLrHNL 299
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903  152 KTEGYrkQQQRQQQQQQQQKKVKTPIKKQEAKKKAKSDPGIQSTSHKHPVTTVHDRLyGYRTLPGQNSDMKQQQQM-EKG 230
Cdd:pfam15504 300 KTEGL--RKQQQWQQQQQQRKVKTPTKKQEAKKKAKSDPGSQYTSRKHSGHIFHDPV-GLRTLRGQESDKKQQQEGkEKG 376
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530390903  231 TTSKQDSTERPKMNYYDH-ADFHHSVKSPELYETEPTNKDISAPVDAVPEAQAARQKKISFNFSEIMASTGWNSELKLLR 309
Cdd:pfam15504 377 PTLKQVSTERPQMDYAEKyLDYYHSPESPELYETESTYKDISTQVEAVLESQASSREETPKNLSASMDGISWNPELKLLR 456

                  ...
gi 530390903  310 ILQ 312
Cdd:pfam15504 457 ILQ 459
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd  Low complexity filter: no  Composition Based Adjustment: yes  E-value threshold: 0.01
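
As an illustration of how the E-value threshold relates to the hit list, here is a small Python sketch. It is an assumption-laden example rather than CD-Search's own logic: `parse_preset_options` is a hypothetical helper, fields are assumed to be separated by runs of two or more spaces, and the cutoff is assumed to be inclusive. The hit values are the ones reported on this page.

```python
import re

# Preset options line, with fields separated by two or more spaces.
PRESET_LINE = (
    "Preset Options: Database: CDSEARCH/cdd  Low complexity filter: no  "
    "Composition Based Adjustment: yes  E-value threshold: 0.01"
)

# (name, accession, E-value) for the hits listed on this page.
HITS = [
    ("DUF4647 super family", "cl21314", 3.29e-153),
    ("DUF4647", "pfam15504", 3.29e-153),
]


def parse_preset_options(line):
    """Split the preset options line into a {name: value} dictionary."""
    _, _, body = line.partition("Preset Options:")
    options = {}
    for field in re.split(r"\s{2,}", body.strip()):
        key, _, value = field.partition(":")
        options[key.strip()] = value.strip()
    return options


options = parse_preset_options(PRESET_LINE)
threshold = float(options["E-value threshold"])

# Assumption: a hit is reported when its E-value does not exceed the threshold.
reported = [accession for _, accession, e_value in HITS if e_value <= threshold]
print(options)
print(reported)
```

Both hits have an E-value of 3.29e-153, far below the 0.01 cutoff, which is consistent with both appearing in the list of domain hits.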
