Conserved domains on  [gi|333609254|ref|NP_542138|]

uncharacterized protein C20orf96 isoform 2 [Homo sapiens]

Protein Classification

DUF4618 domain-containing protein (domain architecture ID 12173338)

DUF4618 domain-containing protein similar to Homo sapiens protein C20orf96


List of domain hits

Name     Accession  Description                           Interval  E-value
DUF4618  pfam15397  Domain of unknown function (DUF4618)  104-361   3.29e-122

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins in this family are typically between 238 and 363 amino acids in length. There are two conserved sequence motifs: EYP and KCTPD.
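The two motifs named in the family description can be checked against the query directly. The following is a minimal sketch, not part of the CD-Search output: it assumes the query segment is transcribed from the gi 333609254 rows of the pfam15397 alignment on this page (residues 104-361), and `find_motif` is a hypothetical helper name.

```python
# Query residues 104-361, transcribed from the pfam15397 alignment rows.
QUERY_START = 104
QUERY_SEGMENT = (
    "LRSGRAALRELRSRENFLSKLNRELIETIQEMENSTTLHVRALLQQQDTLATIIDILEYSNKKRLQQLKSELQEWEEKKK"
    "CKMSYLEQQAEQLNAKIEKTQEEVNFLSTYMDHEYSIKSVQISTLMRQLQQVKDSQQDELDDLGEMRRKVLESLSDKIQK"
    "KKKKILSSVVAETQRPYEEALLQKMWESQDFLKCMQRFREIIDQFEENMPVLRAEVEELQAQTREPREVIFEDVLLRRPK"
    "CTPDMDVILNIPVEEPLP"
)


def find_motif(segment: str, motif: str, first_residue: int) -> list[int]:
    """Return the 1-based residue numbers at which `motif` starts."""
    positions = []
    index = segment.find(motif)
    while index != -1:
        positions.append(first_residue + index)
        index = segment.find(motif, index + 1)
    return positions


for motif in ("EYP", "KCTPD"):
    print(motif, find_motif(QUERY_SEGMENT, motif, QUERY_START))
```

In this segment KCTPD starts at residue 343, while an exact EYP match is absent: the query carries EYS at the column where the family consensus shows EYP.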



Pssm-ID: 464704 [Multi-domain]  Cd Length: 258  Bit Score: 352.72  E-value: 3.29e-122
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254  104 LRSGRAALRELRSRENFLSKLNRELIETIQEMENSTTLHVRALLQQQDTLATIIDILEYSNKKRLQQLKSELQEWEEKKK 183
Cdd:pfam15397   1 IRNRRTSLEELKKHEDFLTKLNLELIKAIQDTEDSTALKVRKLLQQYEKFGTIISILEYSNKKQLQQAKAELQEWEEKEE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254  184 CKMSYLEQQAEQLNAKIEKTQEEVNFLSTYMDHEYSIKSVQISTLMRQLQQVKDSQQDELDDLGEMRRKVLESLSDKIQK 263
Cdd:pfam15397  81 SKLNKLEQQLEQLNAKIQKTQEELNFLSTYKDKEYPVKAVQIANLVRQLQQLKDSQQDELDELEEMRRMVLESLSRKIQK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254  264 KKKKILSSVVAETQRPYEEALLQKMWESQDFLKCMQRFREIIDQFEENMPVLRAEVEELQAQTREPREVIFEDVLLRRPK 343
Cdd:pfam15397 161 KKEKILSSLAEKTLSPYQESLLQKTRDNQVMLKEIEQFREFIDELEEEIPKLKAEVQQLQAQRQEPREVIFADVLLRRPK 240
                         250
                  ....*....|....*...
gi 333609254  344 CTPDMDVILNIPVEEPLP 361
Cdd:pfam15397 241 CTPDMDVILNIPTEELLP 258
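An alignment block like the one above can be summarized as a count of identical columns. This is an illustrative sketch only: `count_identities` is a hypothetical helper, and skipping gap characters and lowercase residues rests on the assumption that lowercase letters mark unaligned positions in this display. The example uses the final block (query residues 344-361 against consensus positions 241-258).

```python
def count_identities(query_row: str, consensus_row: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for two equal-length rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    """
    if len(query_row) != len(consensus_row):
        raise ValueError("alignment rows must have the same length")
    identical = aligned = 0
    for q, c in zip(query_row, consensus_row):
        if q == "-" or c == "-" or q.islower() or c.islower():
            continue
        aligned += 1
        if q == c:
            identical += 1
    return identical, aligned


# Last block of the pfam15397 alignment shown above.
same, total = count_identities("CTPDMDVILNIPVEEPLP", "CTPDMDVILNIPTEELLP")
print(f"{same}/{total} identical ({100 * same / total:.1f}%)")
```

For this 18-column block the two rows differ at two positions (V/T and P/L), so 16 of 18 columns are identical.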
 
SMC_prok_B  TIGR02168  chromosome segregation protein SMC, common bacterial type  94-335  8.84e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 41.20  E-value: 8.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254    94 HAKIWLMKTSLRSGRAALRELRSRENFLSKLNRELIETIQEMENSTTLHVRALLQQQDTLATIidileysnKKRLQQLKS 173
Cdd:TIGR02168  224 ELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEEL--------QKELYALAN 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254   174 ELQEWEEKK---KCKMSYLEQQAEQLNAKIEKTQEEVNflstymdhEYSIKSVQISTLMRQLQQVKDSQQDELDDLgEMR 250
Cdd:TIGR02168  296 EISRLEQQKqilRERLANLERQLEELEAQLEELESKLD--------ELAEELAELEEKLEELKEELESLEAELEEL-EAE 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254   251 RKVLESLSDkIQKKKKKILSSVVAETQRpyEEALLQKMWESQDFLKCM---QRFREIIDQFEENMPVLRAEVEELQAQTR 327
Cdd:TIGR02168  367 LEELESRLE-ELEEQLETLRSKVAQLEL--QIASLNNEIERLEARLERledRRERLQQEIEELLKKLEEAELKELQAELE 443

                   ....*...
gi 333609254   328 EPREVIFE 335
Cdd:TIGR02168  444 ELEEELEE 451
EnvC  COG4942  Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB  163-328  2.93e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 39.36  E-value: 2.93e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254 163 SNKKRLQQLKSELQE---WEEKKKCKMSYLEQQAEQLNAKIEKTQEEVNFLSTymdhEYSIKSVQISTLMRQLQQVKDSQ 239
Cdd:COG4942   24 EAEAELEQLQQEIAElekELAALKKEEKALLKQLAALERRIAALARRIRALEQ----ELAALEAELAELEKEIAELRAEL 99
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254 240 QDELDDLGEMrrkvleslsdkiqkkkkkiLSSVVAETQRPYEEALLQKMwESQDFLKCMQRFREIIDQFEENMPVLRAEV 319
Cdd:COG4942  100 EAQKEELAEL-------------------LRALYRLGRQPPLALLLSPE-DFLDAVRRLQYLKYLAPARREQAEELRADL 159

                 ....*....
gi 333609254 320 EELQAQTRE 328
Cdd:COG4942  160 AELAALRAE 168
PRK03918  PRK03918  DNA double-strand break repair ATPase Rad50  113-255  8.68e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 38.12  E-value: 8.68e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254 113 ELRSRENFLSKLNRELIETIQEMENsttlhVRALLQQQDTLATIIDILEY------SNKKRLQQLKSELQEWEEKKKCKM 186
Cdd:PRK03918 201 ELEEVLREINEISSELPELREELEK-----LEKEVKELEELKEEIEELEKelesleGSKRKLEEKIRELEERIEELKKEI 275
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 333609254 187 SYLEQQAEQLNaKIEKTQEEVNFLSTYMDhEYSIKSVQISTLMRQLQQVKDSQQDELDDLGEMRRKVLE 255
Cdd:PRK03918 276 EELEEKVKELK-ELKEKAEEYIKLSEFYE-EYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE 342
 
Mplasa_alph_rch  TIGR04523  helix-rich Mycoplasma protein  111-248  4.93e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 38.85  E-value: 4.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254  111 LRELRSRENFLSKLN---RELIETIQEMENSTTLHVRALLQQQDTLATIidileYSNKKRLQQLKSELQEWEEKKK---C 184
Cdd:TIGR04523 144 LTEIKKKEKELEKLNnkyNDLKKQKEELENELNLLEKEKLNIQKNIDKI-----KNKLLKLELLLSNLKKKIQKNKsleS 218
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 333609254  185 KMSYLEQQAEQLNAKIEKTQEEVNFLSTymdheysiksvQISTLMRQLQQVKDSQQDELDDLGE 248
Cdd:TIGR04523 219 QISELKKQNNQLKDNIEKKQQEINEKTT-----------EISNTQTQLNQLKDEQNKIKKQLSE 271
YhaN  COG4717  Uncharacterized conserved protein YhaN, contains AAA domain  143-341  8.78e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 38.21  E-value: 8.78e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254 143 VRALLQQQDTLATIIDILEYSNKKRLQQLKSELQEWEEKKKcKMSYLEQQAEQLNAKIEKTQEEVNFLSTYMD-HEYSIK 221
Cdd:COG4717   48 LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEE-EYAELQEELEELEEELEELEAELEELREELEkLEKLLQ 126
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 333609254 222 SVQISTLMRQLQQVKDSQQDELDDLgEMRRKVLESLSDKIQKkkkkiLSSVVAETQRPYEEAL----LQKMWESQDFLKC 297
Cdd:COG4717  127 LLPLYQELEALEAELAELPERLEEL-EERLEELRELEEELEE-----LEAELAELQEELEELLeqlsLATEEELQDLAEE 200
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....
gi 333609254 298 MQRFREIIDQFEENMPVLRAEVEELQAQTREPREVIFEDVLLRR 341
Cdd:COG4717  201 LEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEER 244
 
Blast search parameters
Data source: precalculated data, version cdd.v.3.21
Preset options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition-based adjustment: yes; E-value threshold: 0.01
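The effect of the E-value threshold can be reproduced on the hits reported on this page. This is a sketch under stated assumptions: the `DomainHit` record and `filter_hits` function are hypothetical names, the values are copied from the hit list above, and treating the threshold as inclusive is a guess rather than documented CD-Search behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int   # first query residue of the hit interval
    end: int     # last query residue of the hit interval
    evalue: float


# Values copied from the list of domain hits on this page.
HITS = [
    DomainHit("DUF4618", "pfam15397", 104, 361, 3.29e-122),
    DomainHit("SMC_prok_B", "TIGR02168", 94, 335, 8.84e-04),
    DomainHit("EnvC", "COG4942", 163, 328, 2.93e-03),
    DomainHit("Mplasa_alph_rch", "TIGR04523", 111, 248, 4.93e-03),
    DomainHit("PRK03918", "PRK03918", 113, 255, 8.68e-03),
    DomainHit("YhaN", "COG4717", 143, 341, 8.78e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits at or below the E-value threshold, best (lowest) first."""
    kept = [hit for hit in hits if hit.evalue <= threshold]
    return sorted(kept, key=lambda hit: hit.evalue)


for hit in filter_hits(HITS):
    print(f"{hit.name:<16} {hit.accession:<10} {hit.start}-{hit.end:<4} {hit.evalue:.2e}")
```

All six hits pass the 0.01 threshold; tightening it to 1e-3 would leave only DUF4618 and SMC_prok_B.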
