NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063708407|ref|NP_001319627|]
View 

Tetratricopeptide repeat (TPR)-like superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 1000225)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

Gene Ontology:  GO:0003723|GO:0009451
PubMed:  24471833|19004664

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03081 super family cl33631
pentatricopeptide (PPR) repeat-containing protein; Provisional
59-633 0e+00

pentatricopeptide (PPR) repeat-containing protein; Provisional


The actual alignment was detected with superfamily member PLN03081:

Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 535.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  59 DRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQ 138
Cdd:PLN03081  122 PASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVD 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 139 HDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLV 218
Cdd:PLN03081  202 AGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCV 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 219 FDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAF 298
Cdd:PLN03081  282 FDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIV 361
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 299 AGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 378
Cdd:PLN03081  362 ANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGL 441
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 379 LDEGWHYYELMKKD-GIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVF 457
Cdd:PLN03081  442 SEQGWEIFQSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLY 521
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 458 ELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKI 537
Cdd:PLN03081  522 GMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEI 601
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 538 KELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTN 617
Cdd:PLN03081  602 SEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTPLQITQSHRICKDCHKVIKFIALVTKREIVVRDAS 681
                         570
                  ....*....|....*.
gi 1063708407 618 RFHHFKDGNCSCKDYW 633
Cdd:PLN03081  682 RFHHFKLGKCSCGDYW 697
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
59-633 0e+00

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 535.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  59 DRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQ 138
Cdd:PLN03081  122 PASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVD 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 139 HDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLV 218
Cdd:PLN03081  202 AGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCV 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 219 FDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAF 298
Cdd:PLN03081  282 FDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIV 361
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 299 AGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 378
Cdd:PLN03081  362 ANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGL 441
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 379 LDEGWHYYELMKKD-GIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVF 457
Cdd:PLN03081  442 SEQGWEIFQSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLY 521
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 458 ELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKI 537
Cdd:PLN03081  522 GMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEI 601
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 538 KELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTN 617
Cdd:PLN03081  602 SEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTPLQITQSHRICKDCHKVIKFIALVTKREIVVRDAS 681
                         570
                  ....*....|....*.
gi 1063708407 618 RFHHFKDGNCSCKDYW 633
Cdd:PLN03081  682 RFHHFKLGKCSCGDYW 697
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
541-633 2.79e-45

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 155.66  E-value: 2.79e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 541 GYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTNRFH 620
Cdd:pfam14432   1 GYVPDLKFVLHDVDEEEKKQLLHGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIFGREIVVRDASRFH 80
                          90
                  ....*....|...
gi 1063708407 621 HFKDGNCSCKDYW 633
Cdd:pfam14432  81 HFKNGLCSCGDFW 93
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
329-363 3.79e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.90  E-value: 3.79e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063708407 329 VSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNE 363
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
396-510 4.56e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 43.46  E-value: 4.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 396 PEAWHyvtvvdLLGRA----GDLNRALRFIEE-MPIEP-TAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVI 469
Cdd:COG4235    17 AEGWL------LLGRAylrlGRYDEALAAYEKaLRLDPdNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYL 90
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 1063708407 470 LYNIYASGGRWNDAARVRKKMKESGVKKEPacsWVEIENAI 510
Cdd:COG4235    91 LGLAAFQQGDYAEAIAAWQKLLALLPADAP---ARLLEASI 128
 
Name Accession Description Interval E-value
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
59-633 0e+00

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 535.22  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  59 DRRFYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQ 138
Cdd:PLN03081  122 PASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVD 201
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 139 HDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLV 218
Cdd:PLN03081  202 AGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCV 281
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 219 FDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAF 298
Cdd:PLN03081  282 FDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIV 361
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 299 AGNTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGL 378
Cdd:PLN03081  362 ANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGL 441
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 379 LDEGWHYYELMKKD-GIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVF 457
Cdd:PLN03081  442 SEQGWEIFQSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLY 521
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 458 ELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKI 537
Cdd:PLN03081  522 GMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLDELMKEI 601
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 538 KELGYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTN 617
Cdd:PLN03081  602 SEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTPLQITQSHRICKDCHKVIKFIALVTKREIVVRDAS 681
                         570
                  ....*....|....*.
gi 1063708407 618 RFHHFKDGNCSCKDYW 633
Cdd:PLN03081  682 RFHHFKLGKCSCGDYW 697
PLN03077 PLN03077
Protein ECB2; Provisional
79-631 9.52e-165

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 492.83  E-value: 9.52e-165
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  79 GRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYS 158
Cdd:PLN03077  307 GREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVS 386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 159 PNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHA 238
Cdd:PLN03077  387 PDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLR 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 239 RRSGTEKALELFQGMLRDgFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARK 318
Cdd:PLN03077  467 LNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWN 545
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 319 IFDrLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKD-GIVPE 397
Cdd:PLN03077  546 QFN-SHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPN 624
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 398 AWHYVTVVDLLGRAGDLNRALRFIEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASG 477
Cdd:PLN03077  625 LKHYACVVDLLGRAGKLTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADA 704
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 478 GRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKELGYvpdTSHVIVHVDQQE 557
Cdd:PLN03077  705 GKWDEVARVRKTMRENGLTVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTVLEGFYEKMKASGL---AGSESSSMDEIE 781
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1063708407 558 --REVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTNRFHHFKDGNCSCKD 631
Cdd:PLN03077  782 vsKDDIFCGHSERLAIAFGLINTVPGMPIWVTKNLYMCENCHNTVKFISKIVRREISVRDTEQFHHFKDGECSCGD 857
DYW_deaminase pfam14432
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ...
541-633 2.79e-45

DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.


Pssm-ID: 464172 [Multi-domain]  Cd Length: 93  Bit Score: 155.66  E-value: 2.79e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 541 GYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAIKLASKVVGREIIVRDTNRFH 620
Cdd:pfam14432   1 GYVPDLKFVLHDVDEEEKKQLLHGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIFGREIVVRDASRFH 80
                          90
                  ....*....|...
gi 1063708407 621 HFKDGNCSCKDYW 633
Cdd:pfam14432  81 HFKNGLCSCGDFW 93
E_motif pfam20431
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ...
443-505 1.64e-21

E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.


Pssm-ID: 466580 [Multi-domain]  Cd Length: 63  Bit Score: 88.37  E-value: 1.64e-21
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063708407 443 HKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRWNDAARVRKKMKESGVKKEPACSWVE 505
Cdd:pfam20431   1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
PLN03077 PLN03077
Protein ECB2; Provisional
244-373 1.32e-15

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 80.66  E-value: 1.32e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 244 EKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRL 323
Cdd:PLN03077   68 EQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKM 147
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063708407 324 AKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTAC 373
Cdd:PLN03077  148 PERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTC 197
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-449 2.05e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 73.76  E-value: 2.05e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  181 HQLhgfcVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRN----DVSWNALIAGHARRSGTEKALELFQGMLRD 256
Cdd:PLN03218   496 HEM----VNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNvkpdRVVFNALISACGQSGAVDRAFDVLAEMKAE 571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  257 G--FRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAKRDV----VS 330
Cdd:PLN03218   572 ThpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVkpdeVF 651
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  331 WNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGR 410
Cdd:PLN03218   652 FSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCE 731
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1063708407  411 AGDLNRALRFIEEMP---IEPTAAIWKALLNACRMHKNTELG 449
Cdd:PLN03218   732 GNQLPKALEVLSEMKrlgLCPNTITYSILLVASERKDDADVG 773
PLN03218 PLN03218
maturation of RBCL 1; Provisional
233-447 9.13e-13

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 71.83  E-value: 9.13e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  233 LIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLeqGKWVHAYMIKSGEKL----VAFagNTLLDMYA 308
Cdd:PLN03218   478 LISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQV--AKAFGAYGIMRSKNVkpdrVVF--NALISACG 553
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  309 KSGSIHDArkiFDRLA---------KRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLL 379
Cdd:PLN03218   554 QSGAVDRA---FDVLAemkaethpiDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDW 630
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1063708407  380 DEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEMP---IEPTAAIWKALLNACRMHKNTE 447
Cdd:PLN03218   631 DFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARkqgIKLGTVSYSSLMGACSNAKNWK 701
PLN03218 PLN03218
maturation of RBCL 1; Provisional
51-396 1.50e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 67.59  E-value: 1.50e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407   51 LEGSYIPADRRFYNTLLKKCTVfklliQGRIvhaHILQSIFRHDIVMG-----NTLLNMYAKCGSLEEARKVF------- 118
Cdd:PLN03218   463 VQEAGLKADCKLYTTLISTCAK-----SGKV---DAMFEVFHEMVNAGveanvHTFGALIDGCARAGQVAKAFgaygimr 534
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  119 EKMPQRDFVTWTTLISGYSQH---DRPCDALlffNQMLRFGY--SPNEFTLSSVIKAAA----AERRGCCGHQLHGFCVK 189
Cdd:PLN03218   535 SKNVKPDRVVFNALISACGQSgavDRAFDVL---AEMKAETHpiDPDHITVGALMKACAnagqVDRAKEVYQMIHEYNIK 611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  190 cGFDSnvhVGSALLDLYTRYGLMDDAQLVFDALESR----NDVSWNALI--AGHARRsgTEKALELFQGMLRDGFRPSHF 263
Cdd:PLN03218   612 -GTPE---VYTIAVNSCSQKGDWDFALSIYDDMKKKgvkpDEVFFSALVdvAGHAGD--LDKAFEILQDARKQGIKLGTV 685
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  264 SYASLFGACSSTGfleqgKWVHAymiksgeklvafagntlLDMYAKSGSIhdarKIfdrlaKRDVVSWNSLLTAYAQHGF 343
Cdd:PLN03218   686 SYSSLMGACSNAK-----NWKKA-----------------LELYEDIKSI----KL-----RPTVSTMNALITALCEGNQ 734
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1063708407  344 GKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKDGIVP 396
Cdd:PLN03218   735 LPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKP 787
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
326-375 1.59e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.91  E-value: 1.59e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063708407 326 RDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSH 375
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
127-179 8.21e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 8.21e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1063708407 127 VTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKaaaaerrGCC 179
Cdd:pfam13041   4 VTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILIN-------GLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
225-272 1.06e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 48.51  E-value: 1.06e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063708407 225 RNDVSWNALIAGHARRSGTEKALELFQGMLRDGFRPSHFSYASLFGAC 272
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
94-136 3.27e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 47.36  E-value: 3.27e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063708407  94 DIVMGNTLLNMYAKCGSLEEARKVFEKMPQR----DFVTWTTLISGY 136
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGL 48
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
329-359 6.23e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 45.92  E-value: 6.23e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063708407 329 VSWNSLLTAYAQHGFGKEAVWWFEEMRRVGI 359
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PLN03218 PLN03218
maturation of RBCL 1; Provisional
301-496 4.28e-06

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 49.88  E-value: 4.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  301 NTLLDMYAKSGSIHDARKIFdRLA-----KRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSH 375
Cdd:PLN03218   441 NMLMSVCASSQDIDGALRVL-RLVqeaglKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCAR 519
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  376 SGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEM-----PIEPTAAIWKALLNAC---------- 440
Cdd:PLN03218   520 AGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMkaethPIDPDHITVGALMKACanagqvdrak 599
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  441 ----RMHKNTELGAyaaehvfeldpddPGPHVILYNIYASGGRWNDAARVRKKMKESGVK 496
Cdd:PLN03218   600 evyqMIHEYNIKGT-------------PEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVK 646
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
327-373 5.32e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.27  E-value: 5.32e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063708407 327 DVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTAC 373
Cdd:pfam13812  14 NVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
127-156 2.04e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 41.68  E-value: 2.04e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063708407 127 VTWTTLISGYSQHDRPCDALLFFNQMLRFG 156
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
329-363 3.79e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.90  E-value: 3.79e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063708407 329 VSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNE 363
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
396-510 4.56e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 43.46  E-value: 4.56e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 396 PEAWHyvtvvdLLGRA----GDLNRALRFIEE-MPIEP-TAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVI 469
Cdd:COG4235    17 AEGWL------LLGRAylrlGRYDEALAAYEKaLRLDPdNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYL 90
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 1063708407 470 LYNIYASGGRWNDAARVRKKMKESGVKKEPacsWVEIENAI 510
Cdd:COG4235    91 LGLAAFQQGDYAEAIAAWQKLLALLPADAP---ARLLEASI 128
PLN03218 PLN03218
maturation of RBCL 1; Provisional
358-501 8.57e-05

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 46.02  E-value: 8.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407  358 GIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRAL---RFIEEMPIEPTAAIWK 434
Cdd:PLN03218   467 GLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFgayGIMRSKNVKPDRVVFN 546
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1063708407  435 ALLNACrmhknTELGAYA------AEHVFELDPDDPGpHVI---LYNIYASGGRWNDAARVRKKMKESGVKKEPAC 501
Cdd:PLN03218   547 ALISAC-----GQSGAVDrafdvlAEMKAETHPIDPD-HITvgaLMKACANAGQVDRAKEVYQMIHEYNIKGTPEV 616
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
228-258 2.24e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 2.24e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063708407 228 VSWNALIAGHARRSGTEKALELFQGMLRDGF 258
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
99-126 2.49e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.60  E-value: 2.49e-04
                          10        20
                  ....*....|....*....|....*...
gi 1063708407  99 NTLLNMYAKCGSLEEARKVFEKMPQRDF 126
Cdd:pfam01535   4 NSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
351-396 5.55e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 38.49  E-value: 5.55e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063708407 351 FEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHYYELMKKDGIVP 396
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKP 48
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
127-161 5.60e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.82  E-value: 5.60e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063708407 127 VTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNE 161
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
201-424 6.32e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.02  E-value: 6.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 201 ALLDLYTRYGLMDDAQLVFDALESRNDVSWNAL--IAGHARRSG-TEKALELFQGMLRDgfRPSHFSYASLFGACsstgF 277
Cdd:COG2956    47 ALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALleLAQDYLKAGlLDRAEELLEKLLEL--DPDDAEALRLLAEI----Y 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 278 LEQGKWVHAymIKSGEKLVAFAGNT------LLDMYAKSGSIHDARKIFDRLAKRD---VVSWNSLLTAYAQHGFGKEAV 348
Cdd:COG2956   121 EQEGDWEKA--IEVLERLLKLGPENahayceLAELYLEQGDYDEAIEALEKALKLDpdcARALLLLAELYLEQGDYEEAI 198
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1063708407 349 WWFEEMRRvgIRPNEISFLSVLTACS-HSGLLDEGWHYYElmKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEM 424
Cdd:COG2956   199 AALERALE--QDPDYLPALPRLAELYeKLGDPEEALELLR--KALELDPSDDLLLALADLLERKEGLEAALALLERQ 271
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
228-261 6.95e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 6.95e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063708407 228 VSWNALIAGHARRSGTEKALELFQGMLRDGFRPS 261
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
302-486 1.17e-03

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 41.25  E-value: 1.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 302 TLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLT---AYAQHGFGKEAVWWFEEMrrVGIRPNEISFLSVLTAC-SHSG 377
Cdd:COG2956    47 ALGNLYRRRGEYDRAIRIHQKLLERDPDRAEALLElaqDYLKAGLLDRAEELLEKL--LELDPDDAEALRLLAEIyEQEG 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 378 LLDEGWHYYE-LMKKDGivPEAWHYVTVVDLLGRAGDLNRALRFIEE-MPIEPTAA-IWKALLNACRMHKNTELGAYAAE 454
Cdd:COG2956   125 DWEKAIEVLErLLKLGP--ENAHAYCELAELYLEQGDYDEAIEALEKaLKLDPDCArALLLLAELYLEQGDYEEAIAALE 202
                         170       180       190
                  ....*....|....*....|....*....|..
gi 1063708407 455 HVFELDPDDPGPHVILYNIYASGGRWNDAARV 486
Cdd:COG2956   203 RALEQDPDYLPALPRLAELYEKLGDPEEALEL 234
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
99-127 1.90e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 36.28  E-value: 1.90e-03
                          10        20
                  ....*....|....*....|....*....
gi 1063708407  99 NTLLNMYAKCGSLEEARKVFEKMPQRDFV 127
Cdd:TIGR00756   4 NTLIDGLCKAGRVEEALELFKEMKERGIE 32
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
301-340 1.98e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 36.57  E-value: 1.98e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 1063708407 301 NTLLDMYAKSGSIHDARKIFDRLAKR----DVVSWNSLLTAYAQ 340
Cdd:pfam13041   7 NTLINGYCKKGKVEEAFKLFNEMKKRgvkpNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
94-121 2.56e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.78  E-value: 2.56e-03
                          10        20
                  ....*....|....*....|....*...
gi 1063708407  94 DIVMGNTLLNMYAKCGSLEEARKVFEKM 121
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLDEM 33
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
322-540 2.82e-03

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 39.99  E-value: 2.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 322 RLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRvgIRPNEISFLSVL-TACSHSGLLDEGWHYYE-LMKKDGIVPEAW 399
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE--LDPDDAEALYNLgLAYLRLGRYEEALADYEqALELDPDDAEAL 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 400 HYVtvVDLLGRAGDLNRALRFIEE-MPIEPT-AAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASG 477
Cdd:COG0457    80 NNL--GLALQALGRYEEALEDYDKaLELDPDdAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKL 157
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063708407 478 GRWNDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKEL 540
Cdd:COG0457   158 GRYEEALELLEKLEAAALAALLAAALGEAALALAAAEVLLALLLALEQALRKKLAILTLAALA 220
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
304-440 3.25e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 3.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 304 LDMYAKSGSIHDARKIFDRLAKRDVV----SWNSLL---------TAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVL 370
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRlaqyHYNVLLylcskaadaTDLKPQLAADRGFEVFEAMKAQGVSPNEATYTAVA 97
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063708407 371 TACSHSGLLDEGWHYYELMKKDGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEM---PIEPTAAIWKALLNAC 440
Cdd:pfam17177  98 RLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMlahGVELEEPELAALLKVS 170
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
377-485 6.59e-03

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 39.59  E-value: 6.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063708407 377 GLLDEGWHYYE-LMKKDGIVPEAWHYVTVvdLLGRAGDLNRALRFIEE-MPIEP-TAAIWKALLNACRMHKNTELGAYAA 453
Cdd:COG3914    92 GRYEEALALYRrALALNPDNAEALFNLGN--LLLALGRLEEALAALRRaLALNPdFAEAYLNLGEALRRLGRLEEAIAAL 169
                          90       100       110
                  ....*....|....*....|....*....|..
gi 1063708407 454 EHVFELDPDDPGPHVILYNIYASGGRWNDAAR 485
Cdd:COG3914   170 RRALELDPDNAEALNNLGNALQDLGRLEEAIA 201
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
392-424 8.78e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 34.24  E-value: 8.78e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063708407 392 DGIVPEAWHYVTVVDLLGRAGDLNRALRFIEEM 424
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH