Conserved domains on  [gi|2238311084|gb|UQK84922|]
hapless 2 [Zea mays]

Protein Classification

HAP2/GCS1 family protein (domain architecture ID 10566459)

HAP2/GCS1 family protein similar to Chlamydomonas reinhardtii Hapless 2 that is required on male (minus) gametes for their fusion with female (plus) gametes during fertilization

List of domain hits

Name        Accession   Description                 Interval   E-value
HAP2-GCS1   pfam10699   Male gamete fusion factor   49-495     3.06e-150

Male gamete fusion factor; The gene encoding Arabidopsis HAP2 is allelic with GCS1 (Generative cell-specific protein 1). HAP2 is expressed only in the haploid sperm and is required for efficient guidance of the pollen tube to the ovules. In Arabidopsis, the protein is a predicted membrane protein with an N-terminal secretion signal, a single transmembrane domain and a C-terminal histidine-rich domain. HAP2-GCS1 is found from plants to lower eukaryotes and is necessary for the fusion of the gametes in fertilization. Studies in the green alga Chlamydomonas and the malaria organism Plasmodium showed that it is involved in a novel mechanism for gamete fusion in which a first, species-specific protein binds the male and female gamete membranes together, after which a second, broadly conserved protein, either directly or indirectly, causes fusion of the two membranes. The broadly conserved protein is represented by this HAP2-GCS1 domain, conserved from plants to lower eukaryotes. In Plasmodium berghei, the protein is expressed only in male gametocytes and gametes, having a male-specific function during the interaction with female gametes, and being indispensable for parasite fertilization. The gene in plants and eukaryotes might well have originated from acquisition of plastids from red algae.
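
For readers who want to handle the hit programmatically, the fields of the hit row above (name, accession, interval, E-value) can be held in a small record. This is a minimal Python sketch; `DomainHit` and `parse_hit` are illustrative names and are not part of any NCBI tool or library.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    name: str
    accession: str
    start: int      # first query residue covered by the hit (1-based)
    end: int        # last query residue covered by the hit (inclusive)
    evalue: float

    @property
    def span(self) -> int:
        """Number of query residues covered by the hit."""
        return self.end - self.start + 1


def parse_hit(name: str, accession: str, interval: str, evalue: str) -> DomainHit:
    """Build a DomainHit from the text fields of one hit row."""
    start, end = (int(x) for x in interval.split("-"))
    return DomainHit(name, accession, start, end, float(evalue))


# Values copied from the hit row above.
hit = parse_hit("HAP2-GCS1", "pfam10699", "49-495", "3.06e-150")
print(hit.span)  # 495 - 49 + 1 = 447 query residues
```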



Pssm-ID: 431445  Cd Length: 475  Bit Score: 446.13  E-value: 3.06e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2238311084  49 LSCDRKVVVDMAVPSGSNGGEAWLVAQVAHV--NDTKQSKTIRNPPVITVNKGAVFALYALNYIRDVAYKPEEQYVET-- 124
Cdd:pfam10699   2 LTCKKKLVLTLAVPNGQSGNEESELAELSVVdeNGTNKKLELARPIRIKITKSDVYVKYPLVYLKDVNSKPYEVVIKKnn 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2238311084 125 ------RKCEPDAGsdvvraCERLRYENGSIIEHSEPVCCPCGPNRRVPSSCG---NVFEKIFKGKANTAHCLRFPGDWF 195
Cdd:pfam10699  82 snfcvdSKSSSSPT------CGVLRDEKGERIPYSQGFCCSCGPFDGIGLSRGgagNLSCKLFGGDPSSAHCLRFSPLWY 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2238311084 196 HVFGIGTRSLGFNIRVQVKK------------GSSVSEVVVGPENRTVVSKDNFLRVNLIGDFGGYTSIPAFEDFYLVTP 263
Cdd:pfam10699 156 SAYSIGTPSISFSISINVTKsnspsnsgiswnAYSTETLRLGPSNKVAEASDGTIIAKYIGDFAPSEPLPSLEDKYLLIP 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2238311084 264 RKSAGSGEpqnLGAEYRKWMLL--ERVRFTdGVECNKIGVGYEAFQNQPNFCASPFESCLNNQLWTFLESDKNRISMSRQ 341
Cdd:pfam10699 236 RSPNQHPR---KNAGERRWLIVdkSQVDLT-GSSCNKIGVSYEAFRQQPNRCSAPFGSCLGNQLWDLRKQDLALLAAGRL 311
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2238311084 342 PQ--YVVQGRFQRINQHPDASVHSFSIGVTEVINSNLRIELSADDIEYMYQRSPGNITDISVPAFEVLSQYGTAKVTTKN 419
Cdd:pfam10699 312 GLalYSLGRPGGRSNSSLNNDNRSLQFEVLGVQNSLVTLEIAADDLAFVYQRSPGKIVSARIPDFEALSKGGTLTVQVMN 391
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2238311084 420 IGTLEASYTLTFHCSSGISFMEEQYYILKPNEASTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTAT 495
Cdd:pfam10699 392 TGKVEASYGVTVTCSDNILPIQAQAVSIKPNESVIFSFPLNTLTDQAGNASCTVTLKNALGNIVDRREIKFNTTAT 467
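
The alignment above can also be summarized numerically. The Python sketch below counts identical residues in the final block (query residues 420-495 against model positions 392-467) and computes how much of the 475-position model the whole alignment spans (model positions 2-467). It assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned blocks, which is how the display appears to use them; the function names are illustrative only.

```python
def block_identity(query: str, model: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment block.

    Columns containing a gap ('-') or a lowercase residue are skipped,
    on the assumption that lowercase marks unaligned positions.
    """
    if len(query) != len(model):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, m in zip(query, model):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


def model_coverage(first: int, last: int, model_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (inclusive range)."""
    return (last - first + 1) / model_length


# Final block of the alignment: query 420-495, model 392-467.
query_row = "IGTLEASYTLTFHCSSGISFMEEQYYILKPNEASTRLFYLHASTDQAAKYQCTAILKASDSSELDRQECVFSTTAT"
model_row = "TGKVEASYGVTVTCSDNILPIQAQAVSIKPNESVIFSFPLNTLTDQAGNASCTVTLKNALGNIVDRREIKFNTTAT"

identical, aligned = block_identity(query_row, model_row)
print(f"{identical}/{aligned} identical columns in the final block")

# The alignment runs from model position 2 to 467; Cd Length is 475.
print(f"model coverage: {model_coverage(2, 467, 475):.1%}")
```

Per-block identity is only a rough indicator; the reported bit score and E-value come from the position-specific scoring matrix, not from simple residue matching.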
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
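
The E-value threshold acts as a cutoff on which hits are reported. As a rough illustration in Python, assuming a hit is kept when its E-value does not exceed the threshold (the exact comparison CD-Search applies is not stated on this page), the pfam10699 hit passes by a wide margin. The names below are hypothetical.

```python
E_VALUE_THRESHOLD = 0.01  # from the preset options above


def passes_threshold(evalue: float, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """Return True when a hit's E-value is at or below the cutoff (assumed rule)."""
    return evalue <= threshold


# accession -> E-value, taken from the list of domain hits
hits = {"pfam10699": 3.06e-150}
reported = [acc for acc, e in hits.items() if passes_threshold(e)]
print(reported)
```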

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.