NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1085007603|gb|OGS16897|]
View 

hypothetical protein A2251_05845 [Elusimicrobia bacterium RIFOXYA2_FULL_47_53]

Protein Classification

rhoptry family protein( domain architecture ID 1000160)

rhoptry family protein such as the merozoite surface protein 1 (MSP-1) which is the most abundant protein on the surface of the erythrocyte-invading Plasmodium merozoite, the causative agent of malaria

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4175 super family cl44629
Domain of unknown function (DUF4175);
286-1005 3.00e-06

Domain of unknown function (DUF4175);


The actual alignment was detected with superfamily member pfam13779:

Pssm-ID: 463981 [Multi-domain]  Cd Length: 833  Bit Score: 51.53  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  286 PRYTGLGEITVADEPD----ISALPGTKVNIEFNSSVALSAAAINNSWGEPIPVKISGKKLSAY-FTAEEAGEYFIESad 360
Cdd:pfam13779  193 PAYTGRPPVYLTAGADaaaaLTVPAGSTLTVRVTGLSGSALLEADVGGTAPLPFAAVDAGAREQrFTLTRSGTLTIRG-- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  361 ektpGDPDPVKYSVNIIQDKKPSVDLISPDSdllSSVDSEIPVVFQAGDDFGVTRVELVYSKDGQGEQ----VAELPVDK 436
Cdd:pfam13779  271 ----GGGELRSWTITVIPDAPPTIAFAGPPE---RSARGALQLSYKASDDYGVTSAEAEIRLDPAAPDrrpgLYVAPEFP 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  437 KKISEVTDYLIR--LYKyNLTP----GDKIRFYIKAWDndtaAGPKYGKSADF---LVEttnyaiehgkieeelKKFQKQ 507
Cdd:pfam13779  344 LPLPLAGSREAEgqTSK-DLSEhpwaGLPVTLTLVATD----GAGQEGRSEPLeirLPE---------------RRFSDP 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  508 LLDLLSDQmsanswlkdyavKRSSAaypgiiDAQASIQESAAGgiesLDGILnkMESDPVTDFSTlkeykgiksqlVYLK 587
Cdd:pfam13779  404 LARALIEQ------------RRRLA------LDRENRPRVARA----LDALT--LAPEEFGPDAG-----------VYLG 448
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  588 dtpMRRAVEALrELNINEAERRqnEIIASLEKLSLLSEDimryqnmKDIIDSAAELSKSSDDILDSLKGKPEAGELARIL 667
Cdd:pfam13779  449 ---LRSALARL-ELARSDEALD--EVADLLWELALRIED-------GDLSDAERRLRAAQERLSEALERGASDEEIAKLM 515
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  668 D----KINDLMAQVGEQLSKFPQSLPEDfvNSPAVKSI---DISKSRDLASELSAAVNRgdwktAEnlARELKKQLEDML 740
Cdd:pfam13779  516 QelreALDDYMQALAEQAQQNPQDLQQP--DDPNAQEMtqqDLQRMLDRIEELARSGRR-----AE--AQQMLSQLQQML 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  741 ETVRsAGQDIGFSSGAMSKSAEEMNRLGnelsKLAERQSRILAKTEtinnKRIREAMKKQEDMllllaKEQQriiasage 820
Cdd:pfam13779  587 ENLQ-AGQPQQQQQQGQSEMQQAMDELG----DLLREQQQLLDETF----RQLQQQGGQQQGQ-----PGQQ-------- 644
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  821 vlpllvekkpmaGDSTQQSMAlmnkvydefaskraynsqkylediishwsdlsalfKEDAVSPRINAVLAAEKNILLKLK 900
Cdd:pfam13779  645 ------------GQQGQGQQP-----------------------------------GQGGQQPGAQMPPQGGAEALGDLA 677
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  901 EndtnsvsgaaaagdteaaskEQSALARDTVSLRQSLETVGRKSaliPPETLGslnEAGASMESAASELDSGRVPQAVES 980
Cdd:pfam13779  678 E--------------------RQQALRRRLEELQDELKELGGKE---PGQALG---DAGRAMRDAEEALGQGDLAGAVDA 731
                          730       740
                   ....*....|....*....|....*
gi 1085007603  981 QRNALNLLLSGADNLkgtSESMQES 1005
Cdd:pfam13779  732 QGRALEALRKGAQQL---AEAMQQQ 753
 
Name Accession Description Interval E-value
DUF4175 pfam13779
Domain of unknown function (DUF4175);
286-1005 3.00e-06

Domain of unknown function (DUF4175);


Pssm-ID: 463981 [Multi-domain]  Cd Length: 833  Bit Score: 51.53  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  286 PRYTGLGEITVADEPD----ISALPGTKVNIEFNSSVALSAAAINNSWGEPIPVKISGKKLSAY-FTAEEAGEYFIESad 360
Cdd:pfam13779  193 PAYTGRPPVYLTAGADaaaaLTVPAGSTLTVRVTGLSGSALLEADVGGTAPLPFAAVDAGAREQrFTLTRSGTLTIRG-- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  361 ektpGDPDPVKYSVNIIQDKKPSVDLISPDSdllSSVDSEIPVVFQAGDDFGVTRVELVYSKDGQGEQ----VAELPVDK 436
Cdd:pfam13779  271 ----GGGELRSWTITVIPDAPPTIAFAGPPE---RSARGALQLSYKASDDYGVTSAEAEIRLDPAAPDrrpgLYVAPEFP 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  437 KKISEVTDYLIR--LYKyNLTP----GDKIRFYIKAWDndtaAGPKYGKSADF---LVEttnyaiehgkieeelKKFQKQ 507
Cdd:pfam13779  344 LPLPLAGSREAEgqTSK-DLSEhpwaGLPVTLTLVATD----GAGQEGRSEPLeirLPE---------------RRFSDP 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  508 LLDLLSDQmsanswlkdyavKRSSAaypgiiDAQASIQESAAGgiesLDGILnkMESDPVTDFSTlkeykgiksqlVYLK 587
Cdd:pfam13779  404 LARALIEQ------------RRRLA------LDRENRPRVARA----LDALT--LAPEEFGPDAG-----------VYLG 448
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  588 dtpMRRAVEALrELNINEAERRqnEIIASLEKLSLLSEDimryqnmKDIIDSAAELSKSSDDILDSLKGKPEAGELARIL 667
Cdd:pfam13779  449 ---LRSALARL-ELARSDEALD--EVADLLWELALRIED-------GDLSDAERRLRAAQERLSEALERGASDEEIAKLM 515
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  668 D----KINDLMAQVGEQLSKFPQSLPEDfvNSPAVKSI---DISKSRDLASELSAAVNRgdwktAEnlARELKKQLEDML 740
Cdd:pfam13779  516 QelreALDDYMQALAEQAQQNPQDLQQP--DDPNAQEMtqqDLQRMLDRIEELARSGRR-----AE--AQQMLSQLQQML 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  741 ETVRsAGQDIGFSSGAMSKSAEEMNRLGnelsKLAERQSRILAKTEtinnKRIREAMKKQEDMllllaKEQQriiasage 820
Cdd:pfam13779  587 ENLQ-AGQPQQQQQQGQSEMQQAMDELG----DLLREQQQLLDETF----RQLQQQGGQQQGQ-----PGQQ-------- 644
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  821 vlpllvekkpmaGDSTQQSMAlmnkvydefaskraynsqkylediishwsdlsalfKEDAVSPRINAVLAAEKNILLKLK 900
Cdd:pfam13779  645 ------------GQQGQGQQP-----------------------------------GQGGQQPGAQMPPQGGAEALGDLA 677
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  901 EndtnsvsgaaaagdteaaskEQSALARDTVSLRQSLETVGRKSaliPPETLGslnEAGASMESAASELDSGRVPQAVES 980
Cdd:pfam13779  678 E--------------------RQQALRRRLEELQDELKELGGKE---PGQALG---DAGRAMRDAEEALGQGDLAGAVDA 731
                          730       740
                   ....*....|....*....|....*
gi 1085007603  981 QRNALNLLLSGADNLkgtSESMQES 1005
Cdd:pfam13779  732 QGRALEALRKGAQQL---AEAMQQQ 753
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
532-802 2.88e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 48.51  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  532 AAYPGIIDAQASIQESAAGGIESLD-GILNKMESDpVTDFSTLKEykgiKSQLVYLKDTPMRRAVEALRELN-----INE 605
Cdd:TIGR01612 1048 AIHTSIYNIIDEIEKEIGKNIELLNkEILEEAEIN-ITNFNEIKE----KLKHYNFDDFGKEENIKYADEINkikddIKN 1122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  606 AERRQNEIIASLEKLSLLSE---DIMRYQ--NMKDIIDSA------AELSKSSDDILDSLKGKpeagelARILDKINDLM 674
Cdd:TIGR01612 1123 LDQKIDHHIKALEEIKKKSEnyiDEIKAQinDLEDVADKAisnddpEEIEKKIENIVTKIDKK------KNIYDEIKKLL 1196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  675 AQVGEqLSKFPQSLPEdfvnspaVKSIDISKSRDLASELSAAVNRgDWKTAENLARELKKQLEDmLETVRSAGQDIGFSS 754
Cdd:TIGR01612 1197 NEIAE-IEKDKTSLEE-------VKGINLSYGKNLGKLFLEKIDE-EKKKSEHMIKAMEAYIED-LDEIKEKSPEIENEM 1266
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1085007603  755 GAMSKSAEEMNRLgnELSKLAERQSRILAKTETINNKRIRE-AMKKQED 802
Cdd:TIGR01612 1267 GIEMDIKAEMETF--NISHDDDKDHHIISKKHDENISDIREkSLKIIED 1313
 
Name Accession Description Interval E-value
DUF4175 pfam13779
Domain of unknown function (DUF4175);
286-1005 3.00e-06

Domain of unknown function (DUF4175);


Pssm-ID: 463981 [Multi-domain]  Cd Length: 833  Bit Score: 51.53  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  286 PRYTGLGEITVADEPD----ISALPGTKVNIEFNSSVALSAAAINNSWGEPIPVKISGKKLSAY-FTAEEAGEYFIESad 360
Cdd:pfam13779  193 PAYTGRPPVYLTAGADaaaaLTVPAGSTLTVRVTGLSGSALLEADVGGTAPLPFAAVDAGAREQrFTLTRSGTLTIRG-- 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  361 ektpGDPDPVKYSVNIIQDKKPSVDLISPDSdllSSVDSEIPVVFQAGDDFGVTRVELVYSKDGQGEQ----VAELPVDK 436
Cdd:pfam13779  271 ----GGGELRSWTITVIPDAPPTIAFAGPPE---RSARGALQLSYKASDDYGVTSAEAEIRLDPAAPDrrpgLYVAPEFP 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  437 KKISEVTDYLIR--LYKyNLTP----GDKIRFYIKAWDndtaAGPKYGKSADF---LVEttnyaiehgkieeelKKFQKQ 507
Cdd:pfam13779  344 LPLPLAGSREAEgqTSK-DLSEhpwaGLPVTLTLVATD----GAGQEGRSEPLeirLPE---------------RRFSDP 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  508 LLDLLSDQmsanswlkdyavKRSSAaypgiiDAQASIQESAAGgiesLDGILnkMESDPVTDFSTlkeykgiksqlVYLK 587
Cdd:pfam13779  404 LARALIEQ------------RRRLA------LDRENRPRVARA----LDALT--LAPEEFGPDAG-----------VYLG 448
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  588 dtpMRRAVEALrELNINEAERRqnEIIASLEKLSLLSEDimryqnmKDIIDSAAELSKSSDDILDSLKGKPEAGELARIL 667
Cdd:pfam13779  449 ---LRSALARL-ELARSDEALD--EVADLLWELALRIED-------GDLSDAERRLRAAQERLSEALERGASDEEIAKLM 515
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  668 D----KINDLMAQVGEQLSKFPQSLPEDfvNSPAVKSI---DISKSRDLASELSAAVNRgdwktAEnlARELKKQLEDML 740
Cdd:pfam13779  516 QelreALDDYMQALAEQAQQNPQDLQQP--DDPNAQEMtqqDLQRMLDRIEELARSGRR-----AE--AQQMLSQLQQML 586
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  741 ETVRsAGQDIGFSSGAMSKSAEEMNRLGnelsKLAERQSRILAKTEtinnKRIREAMKKQEDMllllaKEQQriiasage 820
Cdd:pfam13779  587 ENLQ-AGQPQQQQQQGQSEMQQAMDELG----DLLREQQQLLDETF----RQLQQQGGQQQGQ-----PGQQ-------- 644
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  821 vlpllvekkpmaGDSTQQSMAlmnkvydefaskraynsqkylediishwsdlsalfKEDAVSPRINAVLAAEKNILLKLK 900
Cdd:pfam13779  645 ------------GQQGQGQQP-----------------------------------GQGGQQPGAQMPPQGGAEALGDLA 677
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  901 EndtnsvsgaaaagdteaaskEQSALARDTVSLRQSLETVGRKSaliPPETLGslnEAGASMESAASELDSGRVPQAVES 980
Cdd:pfam13779  678 E--------------------RQQALRRRLEELQDELKELGGKE---PGQALG---DAGRAMRDAEEALGQGDLAGAVDA 731
                          730       740
                   ....*....|....*....|....*
gi 1085007603  981 QRNALNLLLSGADNLkgtSESMQES 1005
Cdd:pfam13779  732 QGRALEALRKGAQQL---AEAMQQQ 753
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
532-802 2.88e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 48.51  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  532 AAYPGIIDAQASIQESAAGGIESLD-GILNKMESDpVTDFSTLKEykgiKSQLVYLKDTPMRRAVEALRELN-----INE 605
Cdd:TIGR01612 1048 AIHTSIYNIIDEIEKEIGKNIELLNkEILEEAEIN-ITNFNEIKE----KLKHYNFDDFGKEENIKYADEINkikddIKN 1122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  606 AERRQNEIIASLEKLSLLSE---DIMRYQ--NMKDIIDSA------AELSKSSDDILDSLKGKpeagelARILDKINDLM 674
Cdd:TIGR01612 1123 LDQKIDHHIKALEEIKKKSEnyiDEIKAQinDLEDVADKAisnddpEEIEKKIENIVTKIDKK------KNIYDEIKKLL 1196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1085007603  675 AQVGEqLSKFPQSLPEdfvnspaVKSIDISKSRDLASELSAAVNRgDWKTAENLARELKKQLEDmLETVRSAGQDIGFSS 754
Cdd:TIGR01612 1197 NEIAE-IEKDKTSLEE-------VKGINLSYGKNLGKLFLEKIDE-EKKKSEHMIKAMEAYIED-LDEIKEKSPEIENEM 1266
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1085007603  755 GAMSKSAEEMNRLgnELSKLAERQSRILAKTETINNKRIRE-AMKKQED 802
Cdd:TIGR01612 1267 GIEMDIKAEMETF--NISHDDDKDHHIISKKHDENISDIREkSLKIIED 1313
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH