|
Name |
Accession |
Description |
Interval |
E-value |
| OGFr_N |
pfam04664 |
Opioid growth factor receptor (OGFr) conserved region; Opioid peptides act as growth factors ... |
55-262 |
5.35e-152 |
|
Opioid growth factor receptor (OGFr) conserved region; Opioid peptides act as growth factors in neural and non-neural cells and tissues, in addition to serving in neurotransmission/neuromodulation in the nervous system. The Opioid growth factor receptor is an integral membrane protein associated with the nucleus. The conserved region is situated at the N-terminus of the member proteins with a series of imperfect repeats lying immediately to its C-terminus.
Pssm-ID: 461383 Cd Length: 208 Bit Score: 435.99 E-value: 5.35e-152
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 55 DCNGDMCNLSFYKNEICFQPNGFLIEDILQNWKDNYDLLEENHSYIQWLFPLREPGVNWHAKPLTLKEVEAFKSSKEVRE 134
Cdd:pfam04664 1 DQPNDMANLKFYKNEIPFQPDGIYIEEFLQKWKGDYDKLEHNHSYIQWLFPLREPGVNWRAKPLTPKEIEAFKKSEEAKR 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 135 RLVRAYELMLGFYGIQLEDRNTGAVCRAQNFQPRFHNLNSHSHNNLRITRILKSLGELGLEHYQAPLVRFFLEETLVQHK 214
Cdd:pfam04664 81 RLLKSYKLMLGFYGIELLDEKTGEVKRASNWQERFQNLNRNSHNNLRITRILKSLGELGYEHYQAPLVRFFLEETLVHFT 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 1039761998 215 LPSVRQSALDYFLFAVRCRHQRRELVHFAWEHFKPRREFVWGPRDKLR 262
Cdd:pfam04664 161 LPNVKQSALDYFVFTVRDKRERRELVRFAWQHYKPRGKFVWGPWDKLQ 208
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
414-612 |
4.40e-14 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 75.88 E-value: 4.40e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 414 AEGDGVASNTQVQASALSPTPSE--CPESQKDGNGPEDPksqvGP--EDPKSQVgPEDPKSQVGPEDPKSqvgPEDPKgq 489
Cdd:PTZ00449 523 APGDKEGEEGEHEDSKESDEPKEggKPGETKEGEVGKKP----GPakEHKPSKI-PTLSKKPEFPKDPKH---PKDPE-- 592
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 490 vEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQS 569
Cdd:PTZ00449 593 -EPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKF 671
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1039761998 570 QVGPEQAASKSLGEDPDSDTTGTSMSESEELARIEASVEPPKP 612
Cdd:PTZ00449 672 KEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTP 714
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
424-588 |
4.64e-13 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 72.11 E-value: 4.64e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 424 QVQASALSPTPSECPESQKDG----NGPEDPKSQVGP--EDPKSQVG--PEDPKSQVGP--EDPKSQVG--PEDPKGQV- 490
Cdd:NF033839 309 EVKPEPETPKPEVKPQLEKPKpevkPQPEKPKPEVKPqlETPKPEVKpqPEKPKPEVKPqpEKPKPEVKpqPETPKPEVk 388
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 491 -EPEDPKGQVGP--EDPKGQVGP--EDPKGQVGP--EDPKSQVGP--EDPKSQV--EPEDPKSQVEPedpksQVEPEDPK 559
Cdd:NF033839 389 pQPEKPKPEVKPqpEKPKPEVKPqpEKPKPEVKPqpEKPKPEVKPqpEKPKPEVkpQPEKPKPEVKP-----QPETPKPE 463
|
170 180
....*....|....*....|....*....
gi 1039761998 560 SQVGPEDPQSQVGPEQAASKSLGEDPDSD 588
Cdd:NF033839 464 VKPQPEKPKPEVKPQPEKPKPDNSKPQAD 492
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
382-611 |
5.16e-12 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 68.95 E-value: 5.16e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 382 SPTSQEPREAEQPclvarvanevrkrRKVEEGAEGDGVASNTQVQASALsPTPSECPESQKDGNGPEDPKSqvgPEDPKS 461
Cdd:PTZ00449 537 SKESDEPKEGGKP-------------GETKEGEVGKKPGPAKEHKPSKI-PTLSKKPEFPKDPKHPKDPEE---PKKPKR 599
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 462 QVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPK--------- 532
Cdd:PTZ00449 600 PRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKfkekfyddy 679
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 533 ------------------------SQVEPEDPKSQ---------VEPEDPKSQVEP-EDPKSQvGPEDPQSQVGPEQAAS 578
Cdd:PTZ00449 680 ldaaaksketkttvvldesfesilKETLPETPGTPfttprplppKLPRDEEFPFEPiGDPDAE-QPDDIEFFTPPEEERT 758
|
250 260 270
....*....|....*....|....*....|....*.
gi 1039761998 579 ---KSLGEDPDSDTTGTSMSESEELARIEASVEPPK 611
Cdd:PTZ00449 759 ffhETPADTPLPDILAEEFKEEDIHAETGEPDEAMK 794
|
|
| gly_rich_SclB |
NF038329 |
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ... |
470-586 |
6.24e-09 |
|
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.
Pssm-ID: 468478 [Multi-domain] Cd Length: 440 Bit Score: 58.38 E-value: 6.24e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 470 SQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSqvePEDPKSQVEPEDP 549
Cdd:NF038329 111 QQLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQG---PAGKDGEAGAKGP 187
|
90 100 110
....*....|....*....|....*....|....*..
gi 1039761998 550 KSQVEPEDPKSQVGPEDPQSQVGPEQAASKSLGEDPD 586
Cdd:NF038329 188 AGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGED 224
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
424-578 |
8.79e-08 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 55.16 E-value: 8.79e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 424 QVQASALSPTPSECPEsqkdgngPEDPKSQVGP--EDPKSQVGP--EDPKSQVGP--EDPKSQVGP--EDPKGQV--EPE 493
Cdd:NF033839 364 EVKPQPEKPKPEVKPQ-------PETPKPEVKPqpEKPKPEVKPqpEKPKPEVKPqpEKPKPEVKPqpEKPKPEVkpQPE 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 494 DPKGQVGP--EDPKGQVGPedpkgQVGPEDPKSQVGPEDPKSQVEPEDPKsqvePEDPKSQVEPEDPKSQVGPEDPQSQV 571
Cdd:NF033839 437 KPKPEVKPqpEKPKPEVKP-----QPETPKPEVKPQPEKPKPEVKPQPEK----PKPDNSKPQADDKKPSTPNNLSKDKQ 507
|
....*..
gi 1039761998 572 GPEQAAS 578
Cdd:NF033839 508 PSNQAST 514
|
|
| PHA03169 |
PHA03169 |
hypothetical protein; Provisional |
405-612 |
1.46e-06 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 50.74 E-value: 1.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 405 RKRRKVEEGAEGDGVASNTQVQASAL---SPTPSECPESQKDGNGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQV 481
Cdd:PHA03169 38 GTAARAAKPAPPAPTTSGPQVRAVAEqghRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASG 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 482 GPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDP-----KGQVGPEDPKSQVGPEDPKSQVEPEDPKSQVEPEDPKSQvEPE 556
Cdd:PHA03169 118 LSPENTSGSSPESPASHSPPPSPPSHPGPHEPappesHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPP-QSE 196
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1039761998 557 DPKSQVGPEDPQSQVGPEQAASKSLGEDPDSDTTGTSMSESEELARIEAsvEPPKP 612
Cdd:PHA03169 197 TPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGP--PFPGH 250
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
448-612 |
1.76e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 1.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 448 EDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPkGQVEPEDPKGQVGPEDPkgqvgPEDPKGQVGPEDPKSQVG 527
Cdd:NF033839 249 DNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEP-GNKKPSAPKPGMQPSPQ-----PEKKEVKPEPETPKPEVK 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 528 PEDPKSQVEPEDpksqvEPEDPKSQV-------------EPEDPKSQVGP--EDPQSQVGPEQAASKSlGEDPDSDTTGT 592
Cdd:NF033839 323 PQLEKPKPEVKP-----QPEKPKPEVkpqletpkpevkpQPEKPKPEVKPqpEKPKPEVKPQPETPKP-EVKPQPEKPKP 396
|
170 180
....*....|....*....|
gi 1039761998 593 SMSESEELARIEASVEPPKP 612
Cdd:NF033839 397 EVKPQPEKPKPEVKPQPEKP 416
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
411-612 |
1.98e-05 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 47.37 E-value: 1.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 411 EEGAEGDGVASNTQVQASALSPTP------SECPESQKDGNGPEDPkSQVGPEDPKSQVGPEDPKSQVGPEDPKS----- 479
Cdd:pfam03546 31 ESDSEEETPAAKTPLQAKPSGKTPqvraasAPAKESPRKGAPPVPP-GKTGPAAAQAQAGKPEEDSESSSEESDSdgetp 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 480 ----------QVGPEDPKGQVEP-----EDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSQVEPEDPKSQV 544
Cdd:pfam03546 110 aaatlttspaQVKPLGKNSQVRPastvgKGPSGKGANPAPPGKAGSAAPLVQVGKKEEDSESSSEESDSEGEAPPAATQA 189
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761998 545 EPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASKSLGEDPDSDTTGTSmSESEELARIEASVEPPKP 612
Cdd:pfam03546 190 KPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSEES-SDSEEEAPAAATPAQAKP 256
|
|
| PHA03169 |
PHA03169 |
hypothetical protein; Provisional |
411-591 |
2.26e-05 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 47.27 E-value: 2.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 411 EEGAEGDGVASNTQVQASALSPTPSECPESQKDGNGPEDPKSQVGPEDPKSQVGPEDPksqvGPEDPKSQVGPEDPKGQV 490
Cdd:PHA03169 92 PSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEP----APPESHNPSPNQQPSSFL 167
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 491 EPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSQVEPEDPKSQvEPEDPKSQVEPEDPKSQVGPEDPQS- 569
Cdd:PHA03169 168 QPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQ-QAPSPNTQQAVEHEDEPTEPEREGPp 246
|
170 180
....*....|....*....|..
gi 1039761998 570 QVGPEQAASKSLGEDPDSDTTG 591
Cdd:PHA03169 247 FPGHRSHSYTVVGWKPSTRPGG 268
|
|
| DUF4573 |
pfam15140 |
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... |
447-573 |
7.27e-05 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 43.66 E-value: 7.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 447 PEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQV 526
Cdd:pfam15140 49 PLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEIEPPQAASEMKPLGTAENILPLEAAREIHPLEAVGKIEPLQLVETI 128
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1039761998 527 GPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGP 573
Cdd:pfam15140 129 PKENESPEIHPLEGSQEIEPLEPVQLIEPLGEVEQIQPLETVPKENP 175
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
254-570 |
7.71e-05 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 45.76 E-value: 7.71e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 254 VWGPRDKLRRFRPQTISRPLMGLGQADKDEGPGDPS-QEAGTQGRTCGSGRDLSGDSGTAEDLSLLSAKPQDVGTLDGDQ 332
Cdd:TIGR00927 610 LWVKEQLSRRPVAKVMALGDLSKGDVAEAEHTGERTgEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAER 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 333 RHEAKSPSPKESKKRKLEGnrqEQVPGEPDPQGVSEVEKIALNLEGCALSPTSQEPREAEqpclvarvaNEVRKRRKVEe 412
Cdd:TIGR00927 690 KGEQEGEGEIEAKEADHKG---ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE---------GEAEGKHEVE- 756
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 413 gAEGDGVASNTQVQASALSPTPSECPESQKDGNGpeDPKSQVGPEDPKSQVGPEDPKSQV-GPEDPKSQVGPEDPKGQVE 491
Cdd:TIGR00927 757 -TEGDRKETEHEGETEAEGKEDEDEGEIQAGEDG--EMKGDEGAEGKVEHEGETEAGEKDeHEGQSETQADDTEVKDETG 833
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761998 492 PEDPKGQVGPEDPKGQVGPEDPKGQVGPEDpksqvgpEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQ 570
Cdd:TIGR00927 834 EQELNAENQGEAKQDEKGVDGGGGSDGGDS-------EEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
|
|
| DUF4573 |
pfam15140 |
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... |
447-579 |
1.08e-04 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 43.28 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 447 PEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQV 526
Cdd:pfam15140 40 DLRAVTEVEPLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEIEPPQAASEMKPLGTAENILPLEAAREIHPLEAVGKI 119
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1039761998 527 GPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASK 579
Cdd:pfam15140 120 EPLQLVETIPKENESPEIHPLEGSQEIEPLEPVQLIEPLGEVEQIQPLETVPK 172
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
366-496 |
1.61e-04 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 44.62 E-value: 1.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 366 VSEVEKIALNLEGCALSPTSQEPREAEqpclVARVANEVRKRRKVEEGAEGDGVASNTQVQASALSPTPSECPESQKDGN 445
Cdd:NF033838 358 VKEEAKEPRNEEKIKQAKAKVESKKAE----ATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQPQPAPAPQPEKPAP 433
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1039761998 446 GPEDPKSQVGPEDPKSQVGPEDpKSQVGPEDPK--SQVGPEDPKGQVEPEDPK 496
Cdd:NF033838 434 KPEKPAEQPKAEKPADQQAEED-YARRSEEEYNrlTQQQPPKTEKPAQPSTPK 485
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
379-602 |
2.30e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 2.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 379 CALSPTSQEPREAEQPCLVARVANEVRKRRKVEEGAEGD-----GVASNTQVQASALSPTPSECPESQKDGNGPEDPKSQ 453
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPaapapAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 454 VGPEDPKsqVGPEDPKSQVGPEDPKSQVGPEDPKGQvePEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKS 533
Cdd:PRK07764 666 GDGWPAK--AGGAAPAAPPPAPAPAAPAAPAGAAPA--QPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPL 741
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761998 534 QVEPEDPKsqvEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASKSLGEDPDSDTTgTSMSESEELAR 602
Cdd:PRK07764 742 PPEPDDPP---DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDD-EDRRDAEEVAM 806
|
|
| DUF4573 |
pfam15140 |
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... |
456-612 |
3.23e-04 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 41.73 E-value: 3.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 456 PEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSQV 535
Cdd:pfam15140 4 PSRPTSEIQPLKGVREIEPPQPGGKDDPLGAEEKKKDLRAVTEVEPLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEI 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039761998 536 EPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASKSLGEDPDSDTTGTSMSESEELARIEASVEPPKP 612
Cdd:pfam15140 84 EPPQAASEMKPLGTAENILPLEAAREIHPLEAVGKIEPLQLVETIPKENESPEIHPLEGSQEIEPLEPVQLIEPLGE 160
|
|
| PHA03169 | PHA03169 | hypothetical protein; Provisional | 263-527 | 4.92e-04 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 43.04 E-value: 4.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 263 RFRPQTISRPLMGLGQADKDEGPgdPSQEAGTQGRtcgsgRDLSGDSGTAEDLSLLSAKPQDVGTLDGDQRHEAKSPSPK 342
Cdd:PHA03169 34 GRRRGTAARAAKPAPPAPTTSGP--QVRAVAEQGH-----RQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPS 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 343 ESKKRKLEGNrqeqvpgEPDPQGVSEVEKIALNLEGCALSPTSQEPREAEQPclvarvanevrkrRKVEEGAEGDGVASN 422
Cdd:PHA03169 107 PSGSAEELAS-------GLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAP-------------PESHNPSPNQQPSSF 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 423 TQVQASAlSPTPSECPESQKDGNGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEdPKSQVGPEDPKGQVEPEDPKGQVGPE 502
Cdd:PHA03169 167 LQPSHED-SPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPT-PQQAPSPNTQQAVEHEDEPTEPEREG 244
|
250 260
....*....|....*....|....*
gi 1039761998 503 DPkgQVGPEDPKGQVGPEDPKSQVG 527
Cdd:PHA03169 245 PP--FPGHRSHSYTVVGWKPSTRPG 267
|
|
| DUF4573 | pfam15140 | Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... | 434-580 | 7.86e-04 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 40.58 E-value: 7.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 434 PSECPESQKDGN-GPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPED 512
Cdd:pfam15140 17 VREIEPPQPGGKdDPLGAEEKKKDLRAVTEVEPLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEIEPPQAASEMKPLG 96
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039761998 513 PKGQVGPEDPKSQVGPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASKS 580
Cdd:pfam15140 97 TAENILPLEAAREIHPLEAVGKIEPLQLVETIPKENESPEIHPLEGSQEIEPLEPVQLIEPLGEVEQI 164
|
|
| PRK10263 | PRK10263 | DNA translocase FtsK; Provisional | 446-574 | 8.66e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 8.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 446 GPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQ 525
Cdd:PRK10263 365 GPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAG 444
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1039761998 526 vGPEDPKSQVEPEDPKSQVEPEDPKSQ-VEPEDPKSQVGPEDPQSQVGPE 574
Cdd:PRK10263 445 -NAWQAEEQQSTFAPQSTYQTEQTYQQpAAQEPLYQQPQPVEQQPVVEPE 493
|
|
| PRK13108 | PRK13108 | prolipoprotein diacylglyceryl transferase; Reviewed | 410-592 | 2.99e-03 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 40.35 E-value: 2.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 410 VEEGAEGDGVASNTQVQASALSPTPsecPESQKDGNGPEDPKSQVGPEDPK-SQVGPEDPKSQVGPEDPKSQVGPEDPKg 488
Cdd:PRK13108 292 VDEALEREPAELAAAAVASAASAVG---PVGPGEPNQPDDVAEAVKAEVAEvTDEVAAESVVQVADRDGESTPAVEETS- 367
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 489 qvEPEDPKGQvgPEDPKGQvGPEDPKgqvgPEDPKSQVGPEDPKSQvEPEDPkSQVEPEDPksqvEPEDPKSQVGPEDPQ 568
Cdd:PRK13108 368 --EADIEREQ--PGDLAGQ-APAAHQ----VDAEAASAAPEEPAAL-ASEAH-DETEPEVP----EKAAPIPDPAKPDEL 432
|
170 180
....*....|....*....|....
gi 1039761998 569 SQVGPEQAASKSLGEDPDSDTTGT 592
Cdd:PRK13108 433 AVAGPGDDPAEPDGIRRQDDFSSR 456
|
|
| DUF4573 | pfam15140 | Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... | 447-578 | 3.88e-03 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 38.65 E-value: 3.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 447 PEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQV 526
Cdd:pfam15140 13 PLKGVREIEPPQPGGKDDPLGAEEKKKDLRAVTEVEPLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEIEPPQAASEM 92
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1039761998 527 GPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAAS 578
Cdd:pfam15140 93 KPLGTAENILPLEAAREIHPLEAVGKIEPLQLVETIPKENESPEIHPLEGSQ 144
|
|
| DUF4573 | pfam15140 | Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins ... | 430-578 | 5.48e-03 |
|
Domain of unknown function (DUF4573); This family of proteins is found in eukaryotes. Proteins in this family are typically approximately 360 amino acids in length.
Pssm-ID: 434493 [Multi-domain] Cd Length: 176 Bit Score: 38.27 E-value: 5.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 430 LSPTPSECPESQKDGNGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPKGQVGPEDPKGQVG 509
Cdd:pfam15140 5 SRPTSEIQPLKGVREIEPPQPGGKDDPLGAEEKKKDLRAVTEVEPLKGVAEIEPLGPVSEIQPLRAVSERDPLGAVEEIE 84
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761998 510 PEDPKGQVGPEDPKSQVGPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAAS 578
Cdd:pfam15140 85 PPQAASEMKPLGTAENILPLEAAREIHPLEAVGKIEPLQLVETIPKENESPEIHPLEGSQEIEPLEPVQ 153
|
|
| FAM47 | pfam14642 | FAM47 family; The function of this Chordate family of proteins is not known. | 465-562 | 5.98e-03 |
|
FAM47 family; The function of this Chordate family of proteins is not known.
Pssm-ID: 405345 [Multi-domain] Cd Length: 257 Bit Score: 39.09 E-value: 5.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 465 PEDPKSQVgpEDPKSQVGPEdpKGQVEPEDPK----GQVGPEDPKGQVG--PEDPkgqvgPEDPKSQVGPEDPK---SQV 535
Cdd:pfam14642 152 PLDPERKL--EDAGSCEGQE--KTTDEPTEPGkypcGEFSPRPPETRVSclPPEP-----PKTPVSSLRPEPPEtgvSHL 222
|
90 100 110
....*....|....*....|....*....|...
gi 1039761998 536 EPEDPKSQVE------PEDPKSQVEPEDPKSQV 562
Cdd:pfam14642 223 RPQPPKTQVSslhlepPETGVSHLRPEPPKTQV 255
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 432-612 | 8.44e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 39.38 E-value: 8.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 432 PTPSECPESQKDGNGPEDPKS--QVGPEDPKSQVGPEDPKSQVGPEDPKSQVGPEDPKGQVEPEDPK--GQVGPEDPKGQ 507
Cdd:PHA03307 73 PGPGTEAPANESRSTPTWSLStlAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRpvGSPGPPPAASP 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 508 VGPEDPKGQVgPEDPKSQVGPEDPKSQVEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQVGPEQAASKSLGEDPDS 587
Cdd:PHA03307 153 PAAGASPAAV-ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADD 231
|
170 180
....*....|....*....|....*
gi 1039761998 588 DTTGTSMSESEELARIEASVEPPKP 612
Cdd:PHA03307 232 AGASSSDSSSSESSGCGWGPENECP 256
|
|
| NESP55 | pfam06390 | Neuroendocrine-specific golgi protein P55 (NESP55); This family consists of several mammalian ... | 412-570 | 9.48e-03 |
|
Neuroendocrine-specific golgi protein P55 (NESP55); This family consists of several mammalian neuroendocrine-specific golgi protein P55 (NESP55) sequences. NESP55 is a novel member of the chromogranin family and is a soluble, acidic, heat-stable secretory protein that is expressed exclusively in endocrine and nervous tissues, although less widely than chromogranins.
Pssm-ID: 115071 [Multi-domain] Cd Length: 261 Bit Score: 38.31 E-value: 9.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039761998 412 EGAEGDGVASNTQVQASALSPtpsECPESQKDGNGPEDPKSQvgpeDPKSQVGPEDpKSQVGPEDPKSQVGPEDPkgQVE 491
Cdd:pfam06390 82 EPSEPESDHEDEDFEPELARP---ECLEYDEDDFDTETDSET----EPESDIESET-EFETEPETEPDTAPTTEP--ETE 151
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039761998 492 PEDPKGQVGPEDPKGQVGPEDPKGQVGPEDPKSQVGPEDPKSQvEPEDPKSQVEPEDPKSQVEPEDPKSQVGPEDPQSQ 570
Cdd:pfam06390 152 PEDEPGPVVPKGATFHQSLTERLHALKLQSADASPRRAPPSTQ-EPESAREGEEPERGPLDKDPRDPEEEEEEKEEEKQ 229
|
|
|