|
Name |
Accession |
Description |
Interval |
E-value |
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
498-666 |
7.16e-21 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 91.56 E-value: 7.16e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 498 LLQFLLVKDQSKYPIRESEMREYIVKEY-RNQFPEILRRAAAHLECIFRFELRELDPE--------------------AH 556
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENrKRLFKKVFEEAQKILRDVFGMELVELPAKeekkttvtsqqrraaakssrSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 557 TYILLNKL----------GPVPF-EGLEESPNGPKMGLLMMILGQIFLNGNQAKEAEIWEMLWRMGVQRERRL---SIFG 622
Cdd:pfam01454 81 SYILVSTLppeyrvpaiiWPSKApSFVLDQDEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDTDGTKeipPLNG 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 20143482 623 NPKRLLSvEFVWQRYLDYR--PVTDCKPVEYEFFWGPRSHLETTKM 666
Cdd:pfam01454 161 NTDDLLK-RLVKQGYLVRTkeGASDDGEEIIEYRVGPRAKVEFGPE 205
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
34-420 |
1.20e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 85.76 E-value: 1.20e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 34 PGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPS---TFVPPTISEASSASGQPTISEGPGTSVLP---- 106
Cdd:PHA03247 2589 PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPrrar 2668
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 107 TPSEGLSTSGPPTISKGLCTSVTLAASEgrNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEG----TSTSVPPTAYE 182
Cdd:PHA03247 2669 RLGRAAQASSPPQRPRRRAARPTVGSLT--SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAspalPAAPAPPAVPA 2746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 183 GPSTSVVPTPDEGP---STSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATeglsTPVPPTRDEGPSTS 259
Cdd:PHA03247 2747 GPATPGGPARPARPpttAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPP----AAVLAPAAALPPAA 2822
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 260 VPATPGEgPSTSVLPAASDGQSISLVPTRGKGSstSVPPTATEGLSTSVQPTAGEGSSTSVPPT---PGGGLSTSVPPTA 336
Cdd:PHA03247 2823 SPAGPLP-PPTSAQPTAPPPPPGPPPPSLPLGG--SVAPGGDVRRRPPSRSPAAKPAAPARPPVrrlARPAVSRSTESFA 2899
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 337 TEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLF 416
Cdd:PHA03247 2900 LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979
|
....
gi 20143482 417 SSSA 420
Cdd:PHA03247 2980 PQPA 2983
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
30-441 |
7.07e-15 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 79.19 E-value: 7.07e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 30 APNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPSTFVPPtiseassASGQPTISEGPGTSvlPTPS 109
Cdd:pfam05109 409 ATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAP-------ASTGPTVSTADVTS--PTPA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 110 EGLSTSGPptiskglctsVTLAASEGRNTsrppTSSEEPSTSVPPTASEVPS-TSLPPTPGEGTST--SVPPTAYEGPST 186
Cdd:pfam05109 480 GTTSGASP----------VTPSPSPRDNG----TESKAPDMTSPTSAVTTPTpNATSPTPAVTTPTpnATSPTLGKTSPT 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 187 SVV--PTPDEGPSTSVLPTPgeGPGTSVPlaaTEGLSTSVQATPDEGPSTSVPptaTEGLSTPVPPTRDE----GPSTSV 260
Cdd:pfam05109 546 SAVttPTPNATSPTPAVTTP--TPNATIP---TLGKTSPTSAVTTPTPNATSP---TVGETSPQANTTNHtlggTSSTPV 617
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 261 PATPGEGPSTSVLPA-----ASDGQSISLVP---TRGKGSSTSVPPTATEGLSTSVQPTAGEG--------------SST 318
Cdd:pfam05109 618 VTSPPKNATSAVTTGqhnitSSSTSSMSLRPssiSETLSPSTSDNSTSHMPLLTSAHPTGGENitqvtpaststhhvSTS 697
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 319 SVPPTPGGGLSTSVP-PTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDG----SDTSVPPTPGEGASTLVQ 393
Cdd:pfam05109 698 SPAPRPGTTSQASGPgNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGgkanSTTGGKHTTGHGARTSTE 777
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 20143482 394 PTAPDGpGSSVLPNPGEGPSTLFSSSASVDRNPskcSLVLPSPRVTKA 441
Cdd:pfam05109 778 PTTDYG-GDSTTPRTRYNATTYLPPSTSSKLRP---RWTFTSPPVTTA 821
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
752-912 |
3.55e-13 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 69.22 E-value: 3.55e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 752 LVQLFLLMDSTKLPIPKKGILYYIGRECSKV-FPDLLNRAARTLNHVYGTELVVLDPRNH-------------------- 810
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKRlFKKVFEEAQKILRDVFGMELVELPAKEEkkttvtsqqrraaakssrsk 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 811 SYTLYN-----------RREMEETEEIVDSPNRPGNNFLMQVLSFIFIMGNHARESAVWAFLRGLGV---QAGRKHVITC 876
Cdd:pfam01454 81 SYILVStlppeyrvpaiIWPSKAPSFVLDQDEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIdtdGTKEIPPLNG 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 20143482 877 -------RYLSQRYIDSLRVPDSDP--VQYEFVWGPRARLETSKM 912
Cdd:pfam01454 161 ntddllkRLVKQGYLVRTKEGASDDgeEIIEYRVGPRAKVEFGPE 205
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
61-353 |
1.07e-11 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 69.26 E-value: 1.07e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 61 SEGPSTSVLPTSAEGPSTFVP--PTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNT 138
Cdd:NF033849 252 SQGQSHSVGTSESHSVGTSQSqsHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSY 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 139 SRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPST----SVLPTPGEGPGTSVPL 214
Cdd:NF033849 332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGgfsgGIAGGGVTSEGLGASQ 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 215 AATEGLSTSvqaTPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISlvptRGKGSST 294
Cdd:NF033849 412 GGSEGWGSG---DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG----TSESWST 484
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 20143482 295 SVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGG--LSTSVPPTATEELSTSVPPTPGEGPS 353
Cdd:NF033849 485 SQSETDSVGDSTGTSESVSQGDGRSTGRSESQGtsLGTSGGRTSGAGGSMGLGPSISLGKS 545
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
87-390 |
3.56e-11 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 67.34 E-value: 3.56e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 87 ASSASGQPTiSEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRpptsseepSTSVPPTASEVPSTSLPP 166
Cdd:NF033849 231 YAANLGQSA-GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR--------GWSHTQSTSESESTGQSS 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 167 TpgEGTSTSVPPTAYEGPSTSvvptpdEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLST 246
Cdd:NF033849 302 S--VGTSESQSHGTTEGTSTT------DSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 247 PVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVpptatEGLSTSVQPTAGEGSSTSVppTPGG 326
Cdd:NF033849 374 SVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSV-----QSVSQSYGSSSSTGTSSGH--SDSS 446
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 20143482 327 GLSTSVPPTATEELSTSVPPTPGEGPSTSVlpIPGEGLSTSVPPTASDGSDTSVPPTPGEGAST 390
Cdd:NF033849 447 SHSTSSGQADSVSQGTSWSEGTGTSQGQSV--GTSESWSTSQSETDSVGDSTGTSESVSQGDGR 508
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
60-272 |
1.75e-10 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 64.77 E-value: 1.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 60 ASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVL-----PTPSEGLSTSGPPTISKGLCTSVTLAASE 134
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAasgsaGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 135 GrNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPL 214
Cdd:COG3469 87 A-AATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 20143482 215 AATEGLSTSVQATPdeGPSTSVPPTATEGLSTPvpptrdegpSTSVPATPGEGPSTSV 272
Cdd:COG3469 166 TSTTTTTTSASTTP--SATTTATATTASGATTP---------SATTTATTTGPPTPGL 212
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
47-323 |
1.21e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 59.25 E-value: 1.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 47 QGPSDSQilqGLCASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCT 126
Cdd:NF033849 303 VGTSESQ---SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSE 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 127 SVTLAASEGRNTSRPPTSseepstSVPPTASEVPSTSLPPTPGEGTSTSvpptaYEGPSTSVVPTPDEGPSTSVLPTPGE 206
Cdd:NF033849 380 SSSRSSSSGVSGGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDS-----VQSVSQSYGSSSSTGTSSGHSDSSSH 448
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 207 GPGTSvplaATEGLSTSVqatpdegpSTSVPPTATEGLSTpvppTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLvp 286
Cdd:NF033849 449 STSSG----QADSVSQGT--------SWSEGTGTSQGQSV----GTSESWSTSQSETDSVGDSTGTSESVSQGDGRST-- 510
|
250 260 270
....*....|....*....|....*....|....*..
gi 20143482 287 TRGKGSSTSvpptategLSTSVQPTAGEGSSTSVPPT 323
Cdd:NF033849 511 GRSESQGTS--------LGTSGGRTSGAGGSMGLGPS 539
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
194-427 |
4.25e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 57.32 E-value: 4.25e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 194 EGPSTSVLPTPGEGPGTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLSTPVPPTrdEGPSTSVPATPGEGPSTSVL 273
Cdd:NF033849 253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTR--------GWSHTQSTSESESTGQSSS--VGTSESQSHGTTEGTSTTDS 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 274 PAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSvqPTAGEGSSTSVpptpGGGLSTSVPPTATEELSTSVPPTPGEGPS 353
Cdd:NF033849 323 SSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS--ESSSESTGTSV----GHSTSSSVSSSESSSRSSSSGVSGGFSGG 396
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 20143482 354 TSVLPIPGEGLSTSVPPTASDGSDTSVpPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRNPS 427
Cdd:NF033849 397 IAGGGVTSEGLGASQGGSEGWGSGDSV-QSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTG 469
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
205-419 |
1.37e-06 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 52.31 E-value: 1.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 205 GEGPGTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSvlpaASDGQSISl 284
Cdd:NF033849 236 GQSAGTGYGESVGHSTSQGQ--------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSES----ESTGQSSS- 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 285 vptrgKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSvpPTPGEGPSTSVlpipGEGL 364
Cdd:NF033849 303 -----VGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS--ESSSESTGTSV----GHST 371
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 365 STSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSS 419
Cdd:NF033849 372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV 426
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
209-422 |
3.27e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 48.08 E-value: 3.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 209 GTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLStpvpptrdEGPSTSVPATPGEGPSTSVLPAASDGQSIslvpTR 288
Cdd:NF033849 224 GVSLPMMYAANLGQSA--------GTGYGESVGHSTS--------QGQSHSVGTSESHSVGTSQSQSHTTGHGS----TR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 289 GKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVpptateelSTSVPPTPGEGPSTSVLPIPGEGLSTSV 368
Cdd:NF033849 284 GWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQ--------SSSYNVSSGTGVSSSHSDGTSQSTSISH 355
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 20143482 369 PPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASV 422
Cdd:NF033849 356 SESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
70-417 |
1.49e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 45.76 E-value: 1.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 70 PTSAEGPSTFVPPTISeaSSASGQPTISEGPGTSVLPTPSEGLSTSgPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPS 149
Cdd:TIGR00927 112 PSPPRRTAKITPTTPK--NNYSPTAAGTERVKEDTPATPSRALNHY-ISTSGRQRVKSYTPKPRGEVKSSSPTQTREKVR 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 150 TSVPPTASEVPSTSLPPTPGEG-TSTSVPPTAYEGPSTSVV------PTPDEGPSTSVLPTPGEGPGTSVPLAATEGLST 222
Cdd:TIGR00927 189 KYTPSPLGRMVNSYAPSTFMTMpRSHGITPRTTVKDSEITAtykmleTNPSKRTAGKTTPTPLKGMTDNTPTFLTREVET 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 223 SVQATPDE--GPSTSVPPTATEGLSTpvppTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKgsstsvpPTA 300
Cdd:TIGR00927 269 DLLTSPRSvvEKNTLTTPRRVESNSS----TNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSS-------PAE 337
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 301 TEGlSTSVQPTAGEGSSTSVPptpggglSTSVPPTATEELSTSvpptPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSV 380
Cdd:TIGR00927 338 TKA-STAAWKIRNPLSRTSAP-------AVRIASATFRGLEKN----PSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAV 405
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 20143482 381 PPTPGEGASTLVQPTAPDgPGSSVLPN-------PGEGPSTLFS 417
Cdd:TIGR00927 406 PTTPSPSLTTALFPEAPS-PSPSALPPgqpdlhpKAEYPPDLFS 448
|
|
| Streccoc_I_II |
NF033804 |
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ... |
62-204 |
1.25e-03 |
|
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral Streptococci, and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii, antigen I/II from S. mutans, etc.
Pssm-ID: 468188 [Multi-domain] Cd Length: 1552 Bit Score: 43.01 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 62 EGPSTSVLPTSAEGPSTFVPPTISEASSAsgqPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRP 141
Cdd:NF033804 830 EKPTPPVAPTAPQAPTYEVEKPLEPAPVA---PTYENEPTPPVKTPDQPEPSKPEEPTYETEKPLEPAPVAPTYENEPTP 906
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 142 PTSS---EEPSTSVPPT-ASEVPSTSLP--------PTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTP 204
Cdd:NF033804 907 PVKTpdqPEPSKPEEPTyETEKPLEPAPvapsyenePTPPVKTPDQPEPSKPVEPTYDPLPTPPVAPTPKQLPTP 981
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
21-225 |
7.50e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 40.37 E-value: 7.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 21 HNSSWGEMQAPNAPGLPADVPGSDVPQGPSDSQilqGLcaSEGPSTSVLPTSAEGPSTFVPPTISEA-SSASGQPTISEG 99
Cdd:NF033849 355 HSESSSESTGTSVGHSTSSSVSSSESSSRSSSS---GV--SGGFSGGIAGGGVTSEGLGASQGGSEGwGSGDSVQSVSQS 429
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 100 PGTSVLPTPSEGLSTSgpptISKGLCTSVTLAASEGRNTSRPPTSSEepSTSVppTASEVPSTSLPPTPGEGTSTSVPPT 179
Cdd:NF033849 430 YGSSSSTGTSSGHSDS----SSHSTSSGQADSVSQGTSWSEGTGTSQ--GQSV--GTSESWSTSQSETDSVGDSTGTSES 501
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 20143482 180 AYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQ 225
Cdd:NF033849 502 VSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSYQ 547
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
498-666 |
7.16e-21 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 91.56 E-value: 7.16e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 498 LLQFLLVKDQSKYPIRESEMREYIVKEY-RNQFPEILRRAAAHLECIFRFELRELDPE--------------------AH 556
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENrKRLFKKVFEEAQKILRDVFGMELVELPAKeekkttvtsqqrraaakssrSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 557 TYILLNKL----------GPVPF-EGLEESPNGPKMGLLMMILGQIFLNGNQAKEAEIWEMLWRMGVQRERRL---SIFG 622
Cdd:pfam01454 81 SYILVSTLppeyrvpaiiWPSKApSFVLDQDEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDTDGTKeipPLNG 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 20143482 623 NPKRLLSvEFVWQRYLDYR--PVTDCKPVEYEFFWGPRSHLETTKM 666
Cdd:pfam01454 161 NTDDLLK-RLVKQGYLVRTkeGASDDGEEIIEYRVGPRAKVEFGPE 205
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
34-420 |
1.20e-16 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 85.76 E-value: 1.20e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 34 PGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPS---TFVPPTISEASSASGQPTISEGPGTSVLP---- 106
Cdd:PHA03247 2589 PDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPrrar 2668
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 107 TPSEGLSTSGPPTISKGLCTSVTLAASEgrNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEG----TSTSVPPTAYE 182
Cdd:PHA03247 2669 RLGRAAQASSPPQRPRRRAARPTVGSLT--SLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAspalPAAPAPPAVPA 2746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 183 GPSTSVVPTPDEGP---STSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATeglsTPVPPTRDEGPSTS 259
Cdd:PHA03247 2747 GPATPGGPARPARPpttAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPP----AAVLAPAAALPPAA 2822
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 260 VPATPGEgPSTSVLPAASDGQSISLVPTRGKGSstSVPPTATEGLSTSVQPTAGEGSSTSVPPT---PGGGLSTSVPPTA 336
Cdd:PHA03247 2823 SPAGPLP-PPTSAQPTAPPPPPGPPPPSLPLGG--SVAPGGDVRRRPPSRSPAAKPAAPARPPVrrlARPAVSRSTESFA 2899
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 337 TEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLF 416
Cdd:PHA03247 2900 LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979
|
....
gi 20143482 417 SSSA 420
Cdd:PHA03247 2980 PQPA 2983
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
30-441 |
7.07e-15 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 79.19 E-value: 7.07e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 30 APNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPSTFVPPtiseassASGQPTISEGPGTSvlPTPS 109
Cdd:pfam05109 409 ATNATTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAP-------ASTGPTVSTADVTS--PTPA 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 110 EGLSTSGPptiskglctsVTLAASEGRNTsrppTSSEEPSTSVPPTASEVPS-TSLPPTPGEGTST--SVPPTAYEGPST 186
Cdd:pfam05109 480 GTTSGASP----------VTPSPSPRDNG----TESKAPDMTSPTSAVTTPTpNATSPTPAVTTPTpnATSPTLGKTSPT 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 187 SVV--PTPDEGPSTSVLPTPgeGPGTSVPlaaTEGLSTSVQATPDEGPSTSVPptaTEGLSTPVPPTRDE----GPSTSV 260
Cdd:pfam05109 546 SAVttPTPNATSPTPAVTTP--TPNATIP---TLGKTSPTSAVTTPTPNATSP---TVGETSPQANTTNHtlggTSSTPV 617
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 261 PATPGEGPSTSVLPA-----ASDGQSISLVP---TRGKGSSTSVPPTATEGLSTSVQPTAGEG--------------SST 318
Cdd:pfam05109 618 VTSPPKNATSAVTTGqhnitSSSTSSMSLRPssiSETLSPSTSDNSTSHMPLLTSAHPTGGENitqvtpaststhhvSTS 697
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 319 SVPPTPGGGLSTSVP-PTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDG----SDTSVPPTPGEGASTLVQ 393
Cdd:pfam05109 698 SPAPRPGTTSQASGPgNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGgkanSTTGGKHTTGHGARTSTE 777
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 20143482 394 PTAPDGpGSSVLPNPGEGPSTLFSSSASVDRNPskcSLVLPSPRVTKA 441
Cdd:pfam05109 778 PTTDYG-GDSTTPRTRYNATTYLPPSTSSKLRP---RWTFTSPPVTTA 821
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
71-464 |
8.89e-15 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 79.44 E-value: 8.89e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 71 TSAEGPSTFVPPTISEA-----SSASGQPTISEGPGTSVLPTPSEGLSTSGPPTiskglctsvTLAASEGRNTSRPPTSS 145
Cdd:PHA03307 15 AEGGEFFPRPPATPGDAaddllSGSQGQLVSDSAELAAVTVVAGAAACDRFEPP---------TGPPPGPGTEAPANESR 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 146 EEPSTSVPPTASEVPSTslPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQ 225
Cdd:PHA03307 86 STPTWSLSTLAPASPAR--EGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 226 ATPDEGPSTSVPPTATEGLS-TPVPPTRDEGPSTSVPATPGEGPSTSvlPAASDGQSiSLVPTRGKGSSTSVPPTAtEGL 304
Cdd:PHA03307 164 SDAASSRQAALPLSSPEETArAPSSPPAEPPPSTPPAAASPRPPRRS--SPISASAS-SPAPAPGRSAADDAGASS-SDS 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 305 STSVQPTAGEGSSTSvppTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVP-------PTASDGSD 377
Cdd:PHA03307 240 SSSESSGCGWGPENE---CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSspgsgpaPSSPRASS 316
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 378 TSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRNPSKCSLVLPSPRVTKASvDSDSEGPKGAEGPI 457
Cdd:PHA03307 317 SSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAAS-AGRPTRRRARAAVA 395
|
....*..
gi 20143482 458 EFEVLRD 464
Cdd:PHA03307 396 GRARRRD 402
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
752-912 |
3.55e-13 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 69.22 E-value: 3.55e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 752 LVQLFLLMDSTKLPIPKKGILYYIGRECSKV-FPDLLNRAARTLNHVYGTELVVLDPRNH-------------------- 810
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKRlFKKVFEEAQKILRDVFGMELVELPAKEEkkttvtsqqrraaakssrsk 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 811 SYTLYN-----------RREMEETEEIVDSPNRPGNNFLMQVLSFIFIMGNHARESAVWAFLRGLGV---QAGRKHVITC 876
Cdd:pfam01454 81 SYILVStlppeyrvpaiIWPSKAPSFVLDQDEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIdtdGTKEIPPLNG 160
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 20143482 877 -------RYLSQRYIDSLRVPDSDP--VQYEFVWGPRARLETSKM 912
Cdd:pfam01454 161 ntddllkRLVKQGYLVRTKEGASDDgeEIIEYRVGPRAKVEFGPE 205
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
42-395 |
6.44e-13 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 72.30 E-value: 6.44e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 42 GSDVPQGPSDSQILQGLCASEGPSTSVLPT----SAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGP 117
Cdd:pfam17823 44 GDAVPRADNKSSEQ*NFCAATAAPAPVTLTkgtsAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSP 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 118 PTISKGLCTSVTLAASEGRNTsrPPTSSEEPSTSVPPTASeVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPS 197
Cdd:pfam17823 124 SSAAQSLPAAIAALPSEAFSA--PRAAACRANASAAPRAA-IAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 198 TSVLP-TPGEGPGTSVPLAATEGLSTSVQATPDegpSTSVPPTATEGLSTPVPPTrdegpSTSVPATPGEGPSTSVLPAA 276
Cdd:pfam17823 201 SAPATlTPARGISTAATATGHPAAGTALAAVGN---SSPAAGTVTAAVGTVTPAA-----LATLAAAAGTVASAAGTINM 272
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 277 SDGQSISLVPTRGKGSSTS----VPPTATEGLSTSVQPTAGEGSSTSVP-PTPGGGLSTSVPPTATEELSTS---VPPTP 348
Cdd:pfam17823 273 GDPHARRLSPAKHMPSDTMarnpAAPMGAQAQGPIIQVSTDQPVHNTAGePTPSPSNTTLEPNTPKSVASTNlavVTTTK 352
|
330 340 350 360
....*....|....*....|....*....|....*....|....*....
gi 20143482 349 GEG--PSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPT 395
Cdd:pfam17823 353 AQAkePSASPVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAP 401
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
31-424 |
8.68e-13 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 73.05 E-value: 8.68e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 31 PNAPGLPADVPGSDVPQGPSDSqilqGLCASEGPSTSVLPTSAEGPSTFVPPTiseassASGQPTISEGPGTsvlPTPSE 110
Cdd:PHA03247 2707 TPEPAPHALVSATPLPPGPAAA----RQASPALPAAPAPPAVPAGPATPGGPA------RPARPPTTAGPPA---PAPPA 2773
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 111 GLSTSGPPTISKGLCTSvtlaASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSlPPTPGEGTSTSVPPTAyegpstsvvP 190
Cdd:PHA03247 2774 APAAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAA-SPAGPLPPPTSAQPTA---------P 2839
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 191 TPDEGPSTSVLPTPGE-GPGTSVPLAATEGLSTSVQATPDEGPSTSV--PPTATEGLSTPVPPTRDEGPSTSVPATPGEG 267
Cdd:PHA03247 2840 PPPPGPPPPSLPLGGSvAPGGDVRRRPPSRSPAAKPAAPARPPVRRLarPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 268 PSTSVLPAASDGQSislvPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVP---PTPGGGLSTSVPPTATEELSTSV 344
Cdd:PHA03247 2920 QPQPPPPPQPQPPP----PPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgrvAVPRFRVPQPAPSREAPASSTPP 2995
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 345 P---PTPGEGPSTSVL-------PIPGEGLSTSVPPTASDGSDT--------------SVPPTPGEGASTLVQPTAPDGP 400
Cdd:PHA03247 2996 LtghSLSRVSSWASSLalheetdPPPVSLKQTLWPPDDTEDSDAdslfdsdsersdleALDPLPPEPHDPFAHEPDPATP 3075
|
410 420
....*....|....*....|....*.
gi 20143482 401 GSSVLPNPGE--GPSTLfSSSASVDR 424
Cdd:PHA03247 3076 EAGARESPSSqfGPPPL-SANAALSR 3100
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
15-358 |
3.20e-12 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 70.97 E-value: 3.20e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 15 VAKATAHNSSWGEMQAPNAPGLPADVPGSDVPQGPSDSQILQGLCASEGP---STSVLPTSAEGPSTFVPPTISEASSAS 91
Cdd:PHA03307 97 PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPppaASPPAAGASPAAVASDAASSRQAALPL 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 92 GQP-----TISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPP 166
Cdd:PHA03307 177 SSPeetarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECP 256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 167 TPGEGTSTSvpPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATegLST 246
Cdd:PHA03307 257 LPRPAPITL--PTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSST--SSS 332
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 247 PVPPtrdEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGG 326
Cdd:PHA03307 333 SESS---RGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPA 409
|
330 340 350
....*....|....*....|....*....|..
gi 20143482 327 GLSTSVPPTATEELSTSVPPTPGEGPSTSVLP 358
Cdd:PHA03307 410 GRPRPSPLDAGAASGAFYARYPLLTPSGEPWP 441
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
61-353 |
1.07e-11 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 69.26 E-value: 1.07e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 61 SEGPSTSVLPTSAEGPSTFVP--PTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNT 138
Cdd:NF033849 252 SQGQSHSVGTSESHSVGTSQSqsHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSY 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 139 SRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPST----SVLPTPGEGPGTSVPL 214
Cdd:NF033849 332 NVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGgfsgGIAGGGVTSEGLGASQ 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 215 AATEGLSTSvqaTPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISlvptRGKGSST 294
Cdd:NF033849 412 GGSEGWGSG---DSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVG----TSESWST 484
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 20143482 295 SVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGG--LSTSVPPTATEELSTSVPPTPGEGPS 353
Cdd:NF033849 485 SQSETDSVGDSTGTSESVSQGDGRSTGRSESQGtsLGTSGGRTSGAGGSMGLGPSISLGKS 545
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
91-455 |
1.65e-11 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 68.40 E-value: 1.65e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 91 SGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCtsVTLAASEGRNTSRPPTSSEEP-STSVPPTASEVPSTSLPPTPG 169
Cdd:pfam05109 370 SGTPSGCENISGAFASNRTFDITVSGLGTAPKTLI--ITRTATNATTTTHKVIFSKAPeSTTTSPTLNTTGFAAPNTTTG 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 170 EGTSTSVPPTAYEGPSTS-VVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSvqaTPDEGPSTSVPPTATEGLSTPV 248
Cdd:pfam05109 448 LPSSTHVPTNLTAPASTGpTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESK---APDMTSPTSAVTTPTPNATSPT 524
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 249 PPTRDEGPSTSVPATPGEGPSTSVL----------PAASDGQSISLVPTRGKGSSTSV----PPTATeglstsvQPTAGE 314
Cdd:pfam05109 525 PAVTTPTPNATSPTLGKTSPTSAVTtptpnatsptPAVTTPTPNATIPTLGKTSPTSAvttpTPNAT-------SPTVGE 597
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 315 GS--STSVPPTPGGGLSTSVPPTATEELSTSVppTPGE----GPSTSVLPIPGEGLSTSVPPTASDGSDTSVP------P 382
Cdd:pfam05109 598 TSpqANTTNHTLGGTSSTPVVTSPPKNATSAV--TTGQhnitSSSTSSMSLRPSSISETLSPSTSDNSTSHMPlltsahP 675
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 20143482 383 TPGEGAsTLVQPTAPDG---PGSSVLPNPGEGPSTLFSSSASVDRNPSKCSlvlpsprVTKASVDSDSEGPKGAEG 455
Cdd:pfam05109 676 TGGENI-TQVTPASTSThhvSTSSPAPRPGTTSQASGPGNSSTSTKPGEVN-------VTKGTPPKNATSPQAPSG 743
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
87-390 |
3.56e-11 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 67.34 E-value: 3.56e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 87 ASSASGQPTiSEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRpptsseepSTSVPPTASEVPSTSLPP 166
Cdd:NF033849 231 YAANLGQSA-GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTR--------GWSHTQSTSESESTGQSS 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 167 TpgEGTSTSVPPTAYEGPSTSvvptpdEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLST 246
Cdd:NF033849 302 S--VGTSESQSHGTTEGTSTT------DSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSS 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 247 PVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVpptatEGLSTSVQPTAGEGSSTSVppTPGG 326
Cdd:NF033849 374 SVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSV-----QSVSQSYGSSSSTGTSSGH--SDSS 446
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 20143482 327 GLSTSVPPTATEELSTSVPPTPGEGPSTSVlpIPGEGLSTSVPPTASDGSDTSVPPTPGEGAST 390
Cdd:NF033849 447 SHSTSSGQADSVSQGTSWSEGTGTSQGQSV--GTSESWSTSQSETDSVGDSTGTSESVSQGDGR 508
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
60-272 |
1.75e-10 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 64.77 E-value: 1.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 60 ASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVL-----PTPSEGLSTSGPPTISKGLCTSVTLAASE 134
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAasgsaGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 135 GrNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPL 214
Cdd:COG3469 87 A-AATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTT 165
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 20143482 215 AATEGLSTSVQATPdeGPSTSVPPTATEGLSTPvpptrdegpSTSVPATPGEGPSTSV 272
Cdd:COG3469 166 TSTTTTTTSASTTP--SATTTATATTASGATTP---------SATTTATTTGPPTPGL 212
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
27-414 |
4.96e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 63.80 E-value: 4.96e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 27 EMQAPNAPGlPADVPGSDVPQGPSDSQilqglcASEGPSTSVLPTSAEGPStfVPPTI--------SEASSASGQPTiSE 98
Cdd:PHA03247 2485 EARFPFAAG-AAPDPGGGGPPDPDAPP------APSRLAPAILPDEPVGEP--VHPRMltwirgleELASDDAGDPP-PP 2554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 99 GPGTSVLPTPSEGLSTSGPPTISKGlctsvTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPP 178
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRPSE-----PAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP 2629
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 179 T----AYEGPSTSVVPTP-------DEGPSTSVLP----TPGEGPGTSVPL---------AATEGLSTSVQATPDEGPST 234
Cdd:PHA03247 2630 SpspaANEPDPHPPPTVPpperprdDPAPGRVSRPrrarRLGRAAQASSPPqrprrraarPTVGSLTSLADPPPPPPTPE 2709
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 235 SVPPTATEGLSTPVPPTRDEGPSTSVPATPGEgPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEG---LSTSVQPT 311
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgppRRLTRPAV 2788
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 312 AGEGSSTSVPPTPGGGLSTSVP-PTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTAS------DGSDTSVPPTP 384
Cdd:PHA03247 2789 ASLSESRESLPSPWDPADPPAAvLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvaPGGDVRRRPPS 2868
|
410 420 430
....*....|....*....|....*....|
gi 20143482 385 GEGASTlvqPTAPDGPGSSVLPNPGEGPST 414
Cdd:PHA03247 2869 RSPAAK---PAAPARPPVRRLARPAVSRST 2895
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
49-383 |
6.90e-10 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 63.01 E-value: 6.90e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 49 PSDSQILQGLCA--SEGPSTSVL----PTSA---EGPSTFVP-PTISEASSASGQPTISEGPGTSVLPTP-----SEGLS 113
Cdd:pfam05109 449 PSSTHVPTNLTApaSTGPTVSTAdvtsPTPAgttSGASPVTPsPSPRDNGTESKAPDMTSPTSAVTTPTPnatspTPAVT 528
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 114 TSGP----PTISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSL-PPTPgEGTSTSVPPTAYEG----- 183
Cdd:pfam05109 529 TPTPnatsPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVtTPTP-NATSPTVGETSPQAnttnh 607
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 184 -----PSTSVVPTPDEGPSTSVlpTPGEGPGTSVPLAATEGLSTSVQATpdEGPSTSVPPTATEGLSTPVPPTRDEGPST 258
Cdd:pfam05109 608 tlggtSSTPVVTSPPKNATSAV--TTGQHNITSSSTSSMSLRPSSISET--LSPSTSDNSTSHMPLLTSAHPTGGENITQ 683
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 259 SVPA----------TPGEGPSTSVLPAASDGQSISLVP-----TRGKGSSTSVPPTATEGLSTSVQPTAGEG----SSTS 319
Cdd:pfam05109 684 VTPAststhhvstsSPAPRPGTTSQASGPGNSSTSTKPgevnvTKGTPPKNATSPQAPSGQKTAVPTVTSTGgkanSTTG 763
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 20143482 320 VPPTPGGGLSTSVPPTATEELSTSVPPTPGEG-----PSTSVLPIPGEGLSTsvPPTASDGSDTSVPPT 383
Cdd:pfam05109 764 GKHTTGHGARTSTEPTTDYGGDSTTPRTRYNAttylpPSTSSKLRPRWTFTS--PPVTTAQATVPVPPT 830
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
117-450 |
2.31e-09 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 61.13 E-value: 2.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 117 PPTISKGL------CTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSV--PPT-AYEGPSTS 187
Cdd:pfam17823 69 PVTLTKGTsaahlnSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIaaLPSeAFSAPRAA 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 188 VVPTPDE-GPSTSVLPTPGEGPGTSVPLAAteglSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGE 266
Cdd:pfam17823 149 ACRANASaAPRAAIAAASAPHAASPAPRTA----ASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 267 GPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLstsvpPTATEELSTSVPP 346
Cdd:pfam17823 225 GTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHM-----PSDTMARNPAAPM 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 347 TPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGAS--------TLVQPTAPDGPGSSVLPNP------GEGP 412
Cdd:pfam17823 300 GAQAQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVAStnlavvttTKAQAKEPSASPVPVLHTSmipeveATSP 379
|
330 340 350
....*....|....*....|....*....|....*...
gi 20143482 413 STLFSSSASVDRNPSKCSLVLPSPRVTKASVDSDSEGP 450
Cdd:pfam17823 380 TTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGP 417
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
15-405 |
2.96e-09 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 61.34 E-value: 2.96e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 15 VAKATAHNSSWGEMQAPNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPstfvPPTISEASSASGQP 94
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPP----PPTPPPASPPPSPA 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 95 TISEGPGTSVLPTPSEGLSTSGPPTISKGlctSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVP-STSLPPTPGEGTS 173
Cdd:PHA03307 132 PDLSEMLRPVGSPGPPPAASPPAAGASPA---AVASDAASSRQAALPLSSPEETARAPSSPPAEPPpSTPPAAASPRPPR 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 174 TSVPPTAYEGPSTSVVPTPDEGP--------STSVLPTPGEGPGTSVPLAateGLSTSVQATPDEGPSTSVPPTATEGLS 245
Cdd:PHA03307 209 RSSPISASASSPAPAPGRSAADDagasssdsSSSESSGCGWGPENECPLP---RPAPITLPTRIWEASGWNGPSSRPGPA 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 246 TPVPPTRDEGPSTSvPATPGEGPSTSVLPAASDGQSISLVPTRGKGSS------TSVPPTATEGLSTSVQPTAG--EGSS 317
Cdd:PHA03307 286 SSSSSPRERSPSPS-PSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSsessrgAAVSPGPSPSRSPSPSRPPPpaDPSS 364
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 318 TSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAP 397
Cdd:PHA03307 365 PRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPWPGSP 444
|
....*...
gi 20143482 398 DGPGSSVL 405
Cdd:PHA03307 445 PPPPGRVR 452
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
24-249 |
2.97e-09 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 60.54 E-value: 2.97e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 24 SWGEMQAPNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVlPTSAEGPSTFVPPTISEASSASGQPTISEGPGTS 103
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVST-TGSVVVAASGSAGSGTGTTAASSTAATSSTTSTT 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 104 VLPTPSEGLSTSGPPTISKGLcTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEG 183
Cdd:COG3469 80 ATATAAAAAATSTSATLVATS-TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 20143482 184 PSTSVVPTPDEGPSTSVLPTPGEGPGTSVPlaateglstsvQATPDEGPSTSVPPTATEGLSTPVP 249
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTATAT-----------TASGATTPSATTTATTTGPPTPGLP 213
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
47-323 |
1.21e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 59.25 E-value: 1.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 47 QGPSDSQilqGLCASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCT 126
Cdd:NF033849 303 VGTSESQ---SHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSE 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 127 SVTLAASEGRNTSRPPTSseepstSVPPTASEVPSTSLPPTPGEGTSTSvpptaYEGPSTSVVPTPDEGPSTSVLPTPGE 206
Cdd:NF033849 380 SSSRSSSSGVSGGFSGGI------AGGGVTSEGLGASQGGSEGWGSGDS-----VQSVSQSYGSSSSTGTSSGHSDSSSH 448
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 207 GPGTSvplaATEGLSTSVqatpdegpSTSVPPTATEGLSTpvppTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLvp 286
Cdd:NF033849 449 STSSG----QADSVSQGT--------SWSEGTGTSQGQSV----GTSESWSTSQSETDSVGDSTGTSESVSQGDGRST-- 510
|
250 260 270
....*....|....*....|....*....|....*..
gi 20143482 287 TRGKGSSTSvpptategLSTSVQPTAGEGSSTSVPPT 323
Cdd:NF033849 511 GRSESQGTS--------LGTSGGRTSGAGGSMGLGPS 539
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
197-406 |
1.30e-08 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 58.61 E-value: 1.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 197 STSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAA 276
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATA 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 277 SDGQSISLVPTrGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSV 356
Cdd:COG3469 85 AAAAATSTSAT-LVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGT 163
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 20143482 357 LPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLP 406
Cdd:COG3469 164 TTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
131-456 |
1.50e-08 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 58.84 E-value: 1.50e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 131 AASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGT 210
Cdd:PRK07764 395 AAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAP 474
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 211 SVPLAAteglstsvQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVP----ATPGEGPSTS--VLPAAS----DGQ 280
Cdd:PRK07764 475 EPTAAP--------APAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPeilaAVPKRSRKTWaiLLPEATvlgvRGD 546
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 281 SISLVPTRG--------KGSSTSVPPTATEGLSTSVQPTA-------GEGSSTSVPPTPGGGLSTSVPPTATEELSTSVP 345
Cdd:PRK07764 547 TLVLGFSTGglarrfasPGNAEVLVTALAEELGGDWQVEAvvgpapgAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAA 626
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 346 PTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRN 425
Cdd:PRK07764 627 PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA 706
|
330 340 350
....*....|....*....|....*....|.
gi 20143482 426 PSKCSLVLPSPRVTKASVDSDSEGPKGAEGP 456
Cdd:PRK07764 707 ATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
194-427 |
4.25e-08 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 57.32 E-value: 4.25e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 194 EGPSTSVLPTPGEGPGTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLSTPVPPTrdEGPSTSVPATPGEGPSTSVL 273
Cdd:NF033849 253 QGQSHSVGTSESHSVGTSQSQSHTTGHGSTR--------GWSHTQSTSESESTGQSSS--VGTSESQSHGTTEGTSTTDS 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 274 PAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSvqPTAGEGSSTSVpptpGGGLSTSVPPTATEELSTSVPPTPGEGPS 353
Cdd:NF033849 323 SSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS--ESSSESTGTSV----GHSTSSSVSSSESSSRSSSSGVSGGFSGG 396
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 20143482 354 TSVLPIPGEGLSTSVPPTASDGSDTSVpPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRNPS 427
Cdd:NF033849 397 IAGGGVTSEGLGASQGGSEGWGSGDSV-QSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTG 469
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
56-387 |
4.81e-08 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 57.00 E-value: 4.81e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 56 QGLCASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPgtSVLPTPSEglstsgPPTISKGLCTSvTLAASEG 135
Cdd:PHA03378 601 HPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNV--LVFPTPHQ------PPQVEITPYKP-TWTQIGH 671
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 136 RNTSRPPTSseePSTSVPPTASevPSTSLPP--TPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVP 213
Cdd:PHA03378 672 IPYQPSPTG---ANTMLPIQWA--PGTMQPPprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARP 746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 214 LAATEGLSTSVQATPDEGPstsvPPTATEGLSTP-----VPPTRDEGPSTS-VPATPGEGPSTS--VLPAASDGQSISLV 285
Cdd:PHA03378 747 PAAAPGRARPPAAAPGRAR----PPAAAPGAPTPqpppqAPPAPQQRPRGApTPQPPPQAGPTSmqLMPRAAPGQQGPTK 822
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 286 PTRGKGSSTSVpptaTEGLSTSVQPTAGEGSSTSVP-PTPGGGLSTSV-------PP------------TATEELSTSVP 345
Cdd:PHA03378 823 QILRQLLTGGV----KRGRPSLKKPAALERQAAAGPtPSPGSGTSDKIvqapvfyPPvlqpiqvmrqlgSVRAAAASTVT 898
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 20143482 346 PTPGEGPSTSVLPIPGEglSTSVPPTASDGSDTSVPPTPGEG 387
Cdd:PHA03378 899 QAPTEYTGERRGVGPMH--PTDIPPSKRAKTDAYVESQPPHG 938
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
40-448 |
9.41e-08 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 56.31 E-value: 9.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 40 VPGSDVPQGPSDSQILQGLCASEGPstsVLPTSAEGPSTFVPPTISEASSASGQPTisegPGTSVLPtPSEGLSTSGPPT 119
Cdd:pfam03154 148 IPSPQDNESDSDSSAQQQILQTQPP---VLQAQSGAASPPSPPPPGTTQAATAGPT----PSAPSVP-PQGSPATSQPPN 219
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 120 ISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPtasevpstslPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTS 199
Cdd:pfam03154 220 QTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQ----------PPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHM 289
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 200 VLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDG 279
Cdd:pfam03154 290 QHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNP 369
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 280 QSIS-----LVPTRGKGSSTSVPPTATEGLSTsvqptagegSSTSVPPtpggglSTSVPPTATEELSTSVPPTPGEGP-- 352
Cdd:pfam03154 370 QSHKhpphlSGPSPFQMNSNLPPPPALKPLSS---------LSTHHPP------SAHPPPLQLMPQSQQLPPPPAQPPvl 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 353 -STSVLPIPGeglsTSVPPTASDGSDTSVPPTPGE----GASTLVQPtaPDGPGSSVLPN-PGEGPSTLFSSSASVDRnP 426
Cdd:pfam03154 435 tQSQSLPPPA----ASHPPTSGLHQVPSQSPFPQHpfvpGGPPPITP--PSGPPTSTSSAmPGIQPPSSASVSSSGPV-P 507
|
410 420
....*....|....*....|..
gi 20143482 427 SKCSLVLPSPRVTKASVDSDSE 448
Cdd:pfam03154 508 AAVSCPLPPVQIKEEALDEAEE 529
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
64-289 |
3.37e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 54.11 E-value: 3.37e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 64 PSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPtiskglctSVTLAASEGRNTSRPPT 143
Cdd:PRK12323 375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPA--------PEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 144 SSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVP----PTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEG 219
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAParaaPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20143482 220 LSTSVQATPDEGPSTSVPPTAteglstPVPPTRDEGPSTSVPATPGEGpSTSVLPAASDGQSISL---VPTRG 289
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAA------APAPRAAAATEPVVAPRPPRA-SASGLPDMFDGDWPALaarLPVRG 592
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
125-354 |
6.95e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 6.95e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 125 CTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTStsvPPTAYEGPSTSVVPTPDEGPSTSVLPTP 204
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAA---APAEASAAPAPGVAAPEHHPKHVAVPDA 662
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 205 GEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQsisl 284
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDP---- 738
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 285 vptrgkgsstsVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPST 354
Cdd:PRK07764 739 -----------VPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
50-348 |
7.13e-07 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 53.01 E-value: 7.13e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 50 SDSQILQGLCASEGPSTSVLPTSAEGPSTFVPPTISEAssASGQPTISEGPGTSVLPTPSEGlstSGPPTiSKGLCTSvT 129
Cdd:PLN03209 298 SYCKVVEVIAETTAPLTPMEELLAKIPSQRVPPKESDA--ADGPKPVPTKPVTPEAPSPPIE---EEPPQ-PKAVVPR-P 370
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 130 LAASEGRNTSRPPTS-SEEPSTSVPPTASEVPSTSLPPT----PGEGTSTSVP-------PTAYEGPSTSVVPTPDEGPS 197
Cdd:PLN03209 371 LSPYTAYEDLKPPTSpIPTPPSSSPASSKSVDAVAKPAEpdvvPSPGSASNVPevepaqvEAKKTRPLSPYARYEDLKPP 450
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 198 TSVLPTPGEGPGTSVPLAAteglstSVQATPDEGPSTSVPPTATeglstpvPPTRDEGPSTSVPATPGEGPSTSVLPAAs 277
Cdd:PLN03209 451 TSPSPTAPTGVSPSVSSTS------SVPAVPDTAPATAATDAAA-------PPPANMRPLSPYAVYDDLKPPTSPSPAA- 516
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 20143482 278 dgqsislvpTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPgggLStsvPPTATEELSTSVPPTP 348
Cdd:PLN03209 517 ---------PVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRP---LS---PYTMYEDLKPPTSPTP 572
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
205-419 |
1.37e-06 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 52.31 E-value: 1.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 205 GEGPGTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSvlpaASDGQSISl 284
Cdd:NF033849 236 GQSAGTGYGESVGHSTSQGQ--------SHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSES----ESTGQSSS- 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 285 vptrgKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSvpPTPGEGPSTSVlpipGEGL 364
Cdd:NF033849 303 -----VGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS--ESSSESTGTSV----GHST 371
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 365 STSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSS 419
Cdd:NF033849 372 SSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV 426
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
70-270 |
1.51e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 52.30 E-value: 1.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 70 PTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPtiSKGLCTSVTLAASEGRNTSRPPTSSEEPS 149
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEA--SAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 150 TSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLP--------TPGEGPGTSVPLAATEGLS 221
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPpqaaqgasAPSPAADDPVPLPPEPDDP 748
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 20143482 222 TSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPST 270
Cdd:PRK07764 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
66-235 |
2.91e-06 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 49.52 E-value: 2.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 66 TSVLPTSAeGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSeglSTSGPPTISKGLCTSVTLAASEGrnTSRPPTSS 145
Cdd:PHA03255 20 TSLIWTSS-GSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLT---TTSAPITTTAILSTNTTTVTSTG--TTVTPVPT 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 146 EEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPgegpgTSVPLAATEGLSTSVQ 225
Cdd:PHA03255 94 TSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTL-----SSKGTSNATKTTAELP 168
|
170
....*....|.
gi 20143482 226 ATPDE-GPSTS 235
Cdd:PHA03255 169 TVPDErQPSLS 179
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
151-383 |
3.41e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 51.03 E-value: 3.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 151 SVPPTASEVPSTSLPPtpgegtSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPlaATEGLSTSVQATPDE 230
Cdd:PRK12323 372 AGPATAAAAPVAQPAP------AAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP--APEALAAARQASARG 443
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 231 GPSTSVPPTATeglstpvpptrdegPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEglstsVQP 310
Cdd:PRK12323 444 PGGAPAPAPAP--------------AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEE-----LPP 504
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20143482 311 TAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVpPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPT 383
Cdd:PRK12323 505 EFASPAPAQPDAAPAGWVAESIPDPATADPDDAF-ETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
58-414 |
4.41e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.75 E-value: 4.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 58 LCA-----SEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPtisegPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAA 132
Cdd:PRK07764 358 LCArmllpSASDDERGLLARLERLERRLGVAGGAGAPAAAAP-----SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 133 SEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTStsVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGP---- 208
Cdd:PRK07764 433 PAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA--PEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAddaa 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 209 ----------------------------------GTSVPLA-ATEGL--------------------------------- 220
Cdd:PRK07764 511 tlrerwpeilaavpkrsrktwaillpeatvlgvrGDTLVLGfSTGGLarrfaspgnaevlvtalaeelggdwqveavvgp 590
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 221 -STSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPST-SVPATPGEGPSTSVLPAASDGQSISLVPtrGKGSSTSVPP 298
Cdd:PRK07764 591 aPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPaGAAAAPAEASAAPAPGVAAPEHHPKHVA--VPDASDGGDG 668
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 299 TATEGlsTSVQPTAGEGSSTSVPPTPGGGLSTSVPptateelSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDT 378
Cdd:PRK07764 669 WPAKA--GGAAPAAPPPAPAPAAPAAPAGAAPAQP-------APAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPV 739
|
410 420 430
....*....|....*....|....*....|....*....
gi 20143482 379 SVPPTPGE---GASTLVQPTAPDGPGSSVLPNPGEGPST 414
Cdd:PRK07764 740 PLPPEPDDppdPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
108-339 |
5.79e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 50.26 E-value: 5.79e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 108 PSEGLSTSGPPTISKGLCTSVTLAAsegrntsRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTS 187
Cdd:PRK12323 365 PGQSGGGAGPATAAAAPVAQPAPAA-------AAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAAR 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 188 VVPTPDEGPSTSVLPTPgegpgTSVPLAATEGLSTSVQATPDEGPStsvPPTATEGLSTPVPPTRDEGPSTSVPATPGEg 267
Cdd:PRK12323 438 QASARGPGGAPAPAPAP-----AAAPAAAARPAAAGPRPVAAAAAA---APARAAPAAAPAPADDDPPPWEELPPEFAS- 508
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 20143482 268 PSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEE 339
Cdd:PRK12323 509 PAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
62-383 |
7.54e-06 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 50.07 E-value: 7.54e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 62 EGPSTSVLPTSAEGPSTFV-----PPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKglctsvtlaASEGR 136
Cdd:PTZ00449 512 EGPEASGLPPKAPGDKEGEegeheDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSK---------KPEFP 582
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 137 NTSRPPTSSEEPSTSVPPTASEVPSTslPPTPGEGTSTSVP--PTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPL 214
Cdd:PTZ00449 583 KDPKHPKDPEEPKKPKRPRSAQRPTR--PKSPKLPELLDIPksPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPP 660
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 215 ---------------------AATEGLSTSVQATPDEGPSTSVPPTATEGLST------PVPPTRDEGPSTsvPATPGEG 267
Cdd:PTZ00449 661 kspkppfdpkfkekfyddyldAAAKSKETKTTVVLDESFESILKETLPETPGTpfttprPLPPKLPRDEEF--PFEPIGD 738
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 268 PSTsvlPAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPggglsTSVPPTATEELSTSvppt 347
Cdd:PTZ00449 739 PDA---EQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEA-----MKRPDSPSEHEDKP---- 806
|
330 340 350
....*....|....*....|....*....|....*.
gi 20143482 348 PGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPT 383
Cdd:PTZ00449 807 PGDHPSLPKKRHRLDGLALSTTDLESDAGRIAKDAS 842
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
138-408 |
1.36e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 48.77 E-value: 1.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 138 TSRPPTSSEE-----PSTSVPPTASEVPS--TSLPPTPGEGTSTSVPPTAYEGPSTSVVPTP--------DEGPSTSVLP 202
Cdd:PLN03209 309 TTAPLTPMEEllakiPSQRVPPKESDAADgpKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPlspytayeDLKPPTSPIP 388
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 203 TPGegpgTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPpTRDEGPSTSVPATPGEGPSTSvlpaasdgqsi 282
Cdd:PLN03209 389 TPP----SSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVE-AKKTRPLSPYARYEDLKPPTS----------- 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 283 slvPTrgkgsstsvpPTATEGLSTSVQptagegSSTSVPPTPGgglstSVPPTATEElsTSVPPTPGEGPSTSVLPIPGE 362
Cdd:PLN03209 453 ---PS----------PTAPTGVSPSVS------STSSVPAVPD-----TAPATAATD--AAAPPPANMRPLSPYAVYDDL 506
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 20143482 363 GLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNP 408
Cdd:PLN03209 507 KPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
177-402 |
2.10e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 48.44 E-value: 2.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 177 PPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGlstsvQATPDEGPSTSVPPTATEGLSTPVP--PTRDE 254
Cdd:PRK07764 590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA-----AAAPAEASAAPAPGVAAPEHHPKHVavPDASD 664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 255 GPSTSVPATPGEGPSTSVLPAASDGQSIslvPTRGKGSSTSVPPTATEglstsvqptAGEGSSTSVPPTPGGGLSTSVPP 334
Cdd:PRK07764 665 GGDGWPAKAGGAAPAAPPPAPAPAAPAA---PAGAAPAQPAPAPAATP---------PAGQADDPAAQPPQAAQGASAPS 732
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 20143482 335 TATEElstSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGS 402
Cdd:PRK07764 733 PAADD---PVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
29-400 |
2.13e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.61 E-value: 2.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 29 QAPNAPGLPADvPGSDVPQGPS-DSQILQGLCASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPtISEGPGTSVLPT 107
Cdd:pfam03154 216 QPPNQTQSTAA-PHTLIQQTPTlHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHS-LQTGPSHMQHPV 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 108 PSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPtaSEVPstsLPPTPGEGTSTSVPPTAyegpSTS 187
Cdd:pfam03154 294 PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP--REQP---LPPAPLSMPHIKPPPTT----PIP 364
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 188 VVPTPD--EGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPdegPSTSVPPTATEGLSTPVPPTrdegpstsvPATPG 265
Cdd:pfam03154 365 QLPNPQshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHP---PSAHPPPLQLMPQSQQLPPP---------PAQPP 432
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 266 EGPSTSVLPAASDGQSislvPTRGKGSSTSVPPTATEGLSTsvqptageGSSTSVPPTPGgglstsvPPTATEELSTSVP 345
Cdd:pfam03154 433 VLTQSQSLPPPAASHP----PTSGLHQVPSQSPFPQHPFVP--------GGPPPITPPSG-------PPTSTSSAMPGIQ 493
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 20143482 346 PtPGEGPSTSVLPIPGeGLSTSVPPTA------SDGSDTSVPPTPGEGASTlvQPTAPDGP 400
Cdd:pfam03154 494 P-PSSASVSSSGPVPA-AVSCPLPPVQikeealDEAEEPESPPPPPRSPSP--EPTVVNTP 550
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
143-324 |
2.93e-05 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 47.74 E-value: 2.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 143 TSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSvpptayEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSvplaaTEGLST 222
Cdd:pfam05539 177 TTSWPTEVSHPTYPSQVTPQSQPATQGHQTATA------NQRLSSTEPVGTQGTTTSSNPEPQTEPPPS-----QRGPSG 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 223 svqaTPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATE 302
Cdd:pfam05539 246 ----SPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPP 321
|
170 180
....*....|....*....|..
gi 20143482 303 GLSTSVQPTAGEGSSTSVPPTP 324
Cdd:pfam05539 322 GVQANPTTQNLVDCKELDPPKP 343
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
159-280 |
2.95e-05 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 47.40 E-value: 2.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 159 VPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPP 238
Cdd:PRK12799 299 VPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQP 378
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 20143482 239 ---TATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSvlPAASDGQ 280
Cdd:PRK12799 379 mstTETQQSSTGNITSTANGPTTSLPAAPASNIPVS--PTSRDAQ 421
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
23-264 |
3.23e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.84 E-value: 3.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 23 SSWGEMQAPNAPGLPADVPGSDVPQGP-SDSQILQGLCASEGPstsVLPTSAEGPSTFVPPTISEASSASgqPTISEGPG 101
Cdd:pfam03154 302 PQSSQSQVPPGPSPAAPGQSQQRIHTPpSQSQLQSQQPPREQP---LPPAPLSMPHIKPPPTTPIPQLPN--PQSHKHPP 376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 102 TSVLPTPSEGLSTSGPPTISKGLCTsvtLAASEGRNTSRPPTSSEEPSTSVPPTASEVPS-TSLPPTPGEGTSTSVPPTA 180
Cdd:pfam03154 377 HLSGPSPFQMNSNLPPPPALKPLSS---LSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVlTQSQSLPPPAASHPPTSGL 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 181 YEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAAT-----EGLSTSVQATPDEGPSTSVPPT--------ATEGLSTP 247
Cdd:pfam03154 454 HQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgiqppSSASVSSSGPVPAAVSCPLPPVqikeealdEAEEPESP 533
|
250
....*....|....*..
gi 20143482 248 VPPTRDEGPSTSVPATP 264
Cdd:pfam03154 534 PPPPRSPSPEPTVVNTP 550
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
209-422 |
3.27e-05 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 48.08 E-value: 3.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 209 GTSVPLAATEGLSTSVqatpdegpSTSVPPTATEGLStpvpptrdEGPSTSVPATPGEGPSTSVLPAASDGQSIslvpTR 288
Cdd:NF033849 224 GVSLPMMYAANLGQSA--------GTGYGESVGHSTS--------QGQSHSVGTSESHSVGTSQSQSHTTGHGS----TR 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 289 GKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVpptateelSTSVPPTPGEGPSTSVLPIPGEGLSTSV 368
Cdd:NF033849 284 GWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQ--------SSSYNVSSGTGVSSSHSDGTSQSTSISH 355
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 20143482 369 PPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASV 422
Cdd:NF033849 356 SESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
152-460 |
3.39e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 47.76 E-value: 3.39e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 152 VPPTASEVPSTSLPPTPGEGTStSVPPTayeGPSTSVVPTPDEGPST-SVLPTPGEGPGTSVPLAATEGLSTSVQATPDE 230
Cdd:PTZ00449 496 LAPIEEEDSDKHDEPPEGPEAS-GLPPK---APGDKEGEEGEHEDSKeSDEPKEGGKPGETKEGEVGKKPGPAKEHKPSK 571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 231 GPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTateglsTSVQP 310
Cdd:PTZ00449 572 IPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQ------RPSSP 645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 311 TAGEGssTSVPPTPGGGLSTSVP--PTATEELSTSVPPTPG---EGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPG 385
Cdd:PTZ00449 646 ERPEG--PKIIKSPKPPKSPKPPfdPKFKEKFYDDYLDAAAkskETKTTVVLDESFESILKETLPETPGTPFTTPRPLPP 723
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 386 EGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRNPSKCSL------VLPSPRVTkASVDSDSEGPKGAEGPIEF 459
Cdd:PTZ00449 724 KLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLpdilaeEFKEEDIH-AETGEPDEAMKRPDSPSEH 802
|
.
gi 20143482 460 E 460
Cdd:PTZ00449 803 E 803
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
94-386 |
3.52e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 47.92 E-value: 3.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 94 PTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRPPTSSeePSTSVPPTASEVPSTSLPPTPGEGTS 173
Cdd:PRK07003 362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPK--AAAAAAATRAEAPPAAPAPPATADRG 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 174 TSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGT-SVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTR 252
Cdd:PRK07003 440 DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSaSAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 253 DEgpstsvPATPGEgPSTSVLPAASDGQSislVPTRGKGSSTSVPPTATEGLSTSvqptAGEGSSTSVPPTPGGGLSTSV 332
Cdd:PRK07003 520 DA------PAAAAP-PAPEARPPTPAAAA---PAARAGGAAAALDVLRNAGMRVS----SDRGARAAAAAKPAAAPAAAP 585
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 333 PPTATEelsTSVP-PTPGEGPSTSVLPIPGEGlstsvppTASDGSDTSVPPTPGE 386
Cdd:PRK07003 586 KPAAPR---VAVQvPTPRARAATGDAPPNGAA-------RAEQAAESRGAPPPWE 630
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
138-253 |
4.79e-05 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 47.02 E-value: 4.79e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 138 TSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTS-TSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAA 216
Cdd:PRK12799 302 AAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSaTTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMST 381
|
90 100 110
....*....|....*....|....*....|....*....
gi 20143482 217 TEGL--STSVQATPDEGPSTSVpPTATEGLSTPVPPTRD 253
Cdd:PRK12799 382 TETQqsSTGNITSTANGPTTSL-PAAPASNIPVSPTSRD 419
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
204-412 |
6.43e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 6.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 204 PGEGPGTSVPlaATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGE-GPSTSVLPAASdgQSI 282
Cdd:PRK12323 365 PGQSGGGAGP--ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARrSPAPEALAAAR--QAS 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 283 SLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGglstsvpPTATEELSTSVPPTPGEGPSTSVLPIPGE 362
Cdd:PRK12323 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA-------PAAAPAPADDDPPPWEELPPEFASPAPAQ 513
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 20143482 363 GLSTSVPPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGP 412
Cdd:PRK12323 514 PDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
29-283 |
7.04e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 46.99 E-value: 7.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 29 QAPNAPGLPADVPGSDVPQGPSDsqilqglcasegpstsvlPTSAEGPSTFVPPTiseassasgQPTISEGP-GTSVLPT 107
Cdd:PTZ00449 604 QRPTRPKSPKLPELLDIPKSPKR------------------PESPKSPKRPPPPQ---------RPSSPERPeGPKIIKS 656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 108 PSEGLSTSGP--PTISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPST------SLPPT-PGEGTSTSVPP 178
Cdd:PTZ00449 657 PKPPKSPKPPfdPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTpfttprPLPPKlPRDEEFPFEPI 736
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 179 TAYEGPSTSVV--PTPDEGPSTSVLPTPGEGPgtsvplaaTEGLSTSVQATPDEGPSTSVPPTateglstpvPPTRDEGP 256
Cdd:PTZ00449 737 GDPDAEQPDDIefFTPPEEERTFFHETPADTP--------LPDILAEEFKEEDIHAETGEPDE---------AMKRPDSP 799
|
250 260
....*....|....*....|....*..
gi 20143482 257 STSVPATPGEGPSTSVLPAASDGQSIS 283
Cdd:PTZ00449 800 SEHEDKPPGDHPSLPKKRHRLDGLALS 826
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
231-455 |
7.62e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 7.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 231 GPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAAsdgqsislvPTRGKGSSTSVPPTATEGLSTSVQP 310
Cdd:PRK12323 369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAA---------AARAVAAAPARRSPAPEALAAARQA 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 311 TAGEGSSTSVP-PTPggglsTSVPPTATEelstsvPPTPGEGPSTSVLPIPGeglSTSVPPTASDGSDTSVPP---TPGE 386
Cdd:PRK12323 440 SARGPGGAPAPaPAP-----AAAPAAAAR------PAAAGPRPVAAAAAAAP---ARAAPAAAPAPADDDPPPweeLPPE 505
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 20143482 387 GASTLVQPTAPDGPGSSVLPNPgeGPSTLFSSSASVDRNPSKCSLVLPSPRVTKASVDSDSEGPKGAEG 455
Cdd:PRK12323 506 FASPAPAQPDAAPAGWVAESIP--DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
158-456 |
8.63e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.60 E-value: 8.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 158 EVPSTSLPPTPGEGTSTSVPPTAYEGP--STSVVPTPDEGPSTSVLPTPGEGPGTSVPLAA--TEGLSTSVQA---TPD- 229
Cdd:PHA03378 519 RVMATLLPPSPPQPRAGRRAPCVYTEDldIESDEPASTEPVHDQLLPAPGLGPLQIQPLTSptTSQLASSAPSyaqTPWp 598
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 230 --EGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAA--SDGQSISLVPTRGKGSSTSVPPT----AT 301
Cdd:PHA03378 599 vpHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVfpTPHQPPQVEITPYKPTWTQIGHIpyqpSP 678
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 302 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVP 381
Cdd:PHA03378 679 TGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 758
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 382 PTPGEGASTLVQPTAPdgpgsSVLPNPGEGPSTLfsssasvdRNPSKCSLVLPSPRVTKASVDSDSEGPKGAEGP 456
Cdd:PHA03378 759 AAPGRARPPAAAPGAP-----TPQPPPQAPPAPQ--------QRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGP 820
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
29-251 |
1.34e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 45.69 E-value: 1.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 29 QAPNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVL-PTSAE---GPSTFVPPTISEASSASGQPTISEGPGTSV 104
Cdd:PLN03209 331 KESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLsPYTAYedlKPPTSPIPTPPSSSPASSKSVDAVAKPAEP 410
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 105 LPTPSEGLSTSGPPTISKGLCTSVT--LAASEGRNTSRPPTS-SEEPSTSVPPTASE---VPSTSLPPTPGEGTSTSVPP 178
Cdd:PLN03209 411 DVVPSPGSASNVPEVEPAQVEAKKTrpLSPYARYEDLKPPTSpSPTAPTGVSPSVSStssVPAVPDTAPATAATDAAAPP 490
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 179 TAYEGPSTSVVPTPDEGPSTSVLP--TPGEGPGTSVPLAATEGLSTSVQATPDEG------PSTSVPPTATEGLSTPVPP 250
Cdd:PLN03209 491 PANMRPLSPYAVYDDLKPPTSPSPaaPVGKVAPSSTNEVVKVGNSAPPTALADEQhhaqpkPRPLSPYTMYEDLKPPTSP 570
|
.
gi 20143482 251 T 251
Cdd:PLN03209 571 T 571
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
70-417 |
1.49e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 45.76 E-value: 1.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 70 PTSAEGPSTFVPPTISeaSSASGQPTISEGPGTSVLPTPSEGLSTSgPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPS 149
Cdd:TIGR00927 112 PSPPRRTAKITPTTPK--NNYSPTAAGTERVKEDTPATPSRALNHY-ISTSGRQRVKSYTPKPRGEVKSSSPTQTREKVR 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 150 TSVPPTASEVPSTSLPPTPGEG-TSTSVPPTAYEGPSTSVV------PTPDEGPSTSVLPTPGEGPGTSVPLAATEGLST 222
Cdd:TIGR00927 189 KYTPSPLGRMVNSYAPSTFMTMpRSHGITPRTTVKDSEITAtykmleTNPSKRTAGKTTPTPLKGMTDNTPTFLTREVET 268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 223 SVQATPDE--GPSTSVPPTATEGLSTpvppTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKgsstsvpPTA 300
Cdd:TIGR00927 269 DLLTSPRSvvEKNTLTTPRRVESNSS----TNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSS-------PAE 337
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 301 TEGlSTSVQPTAGEGSSTSVPptpggglSTSVPPTATEELSTSvpptPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSV 380
Cdd:TIGR00927 338 TKA-STAAWKIRNPLSRTSAP-------AVRIASATFRGLEKN----PSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAV 405
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 20143482 381 PPTPGEGASTLVQPTAPDgPGSSVLPN-------PGEGPSTLFS 417
Cdd:TIGR00927 406 PTTPSPSLTTALFPEAPS-PSPSALPPgqpdlhpKAEYPPDLFS 448
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
172-360 |
1.62e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 44.12 E-value: 1.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 172 TSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSvqatpdegpSTSVPPTATEglSTPVPPT 251
Cdd:PHA03255 26 SSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTN---------TTTVTSTGTT--VTPVPTT 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 252 RD-EGPSTSVPATPGEGPSTSVlpaasdgqsislvptrGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPggglsT 330
Cdd:PHA03255 95 SNaSTINVTTKVTAQNITATEA----------------GTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTL-----S 153
|
170 180 190
....*....|....*....|....*....|
gi 20143482 331 SVPPTATEELSTSVPPTPGEGPSTSVLPIP 360
Cdd:PHA03255 154 SKGTSNATKTTAELPTVPDERQPSLSYGLP 183
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
133-408 |
1.62e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.53 E-value: 1.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 133 SEGRNTSRPPTSSEEPS-TSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTS 211
Cdd:pfam03154 79 SAKRQREKGASDTEEPErATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDS 158
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 212 VPLAATEGLSTS---VQATPDEGPSTSVPPTATEGLSTPVP-PTRDEGPSTSVPAT--PGEGPSTSVLPAASDGQSISLV 285
Cdd:pfam03154 159 DSSAQQQILQTQppvLQAQSGAASPPSPPPPGTTQAATAGPtPSAPSVPPQGSPATsqPPNQTQSTAAPHTLIQQTPTLH 238
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 286 PTRgkgssTSVPPTATEGLSTSVQPtagegSSTSVPPTPGGGLSTSVPPtateelstsVPPTPGEGPSTSVLPIPGEGLs 365
Cdd:pfam03154 239 PQR-----LPSPHPPLQPMTQPPPP-----SQVSPQPLPQPSLHGQMPP---------MPHSLQTGPSHMQHPVPPQPF- 298
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 20143482 366 tsvpPTASDGSDTSVPPTPGEGASTLVQPTAPDGPGSSVLPNP 408
Cdd:pfam03154 299 ----PLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
72-298 |
1.72e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 44.88 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 72 SAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPptisKGLCTSVTLAASEGRNTSRPPTSSEEPSTS 151
Cdd:COG5651 162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQ----VGIGGLNSGSGPIGLNSGPGNTGFAGTGAA 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 152 VPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTS---VLPTPGEGPGTSVPLAATEGLSTSVQATP 228
Cdd:COG5651 238 AGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNlglAGSPLGLAGGGAGAAAATGLGLGAGGAAG 317
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 229 DEGPSTSVPPTATEGLSTPVPPTrdeGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPP 298
Cdd:COG5651 318 AAGATGAGAALGAGAAAAAAGAA---AGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
140-266 |
1.76e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 45.48 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 140 RPPTSSE-----EPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPdEGPSTSVLPTPGEGPGTSVPL 214
Cdd:PRK14951 365 KPAAAAEaaapaEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAP-PAPVAAPAAAAPAAAPAAAPA 443
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 20143482 215 AATEGLSTSVQATPDegpSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGE 266
Cdd:PRK14951 444 AVALAPAPPAQAAPE---TVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
236-408 |
2.21e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 44.65 E-value: 2.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 236 VPPTATEGLSTPVPPTRDEGP-------STSVPATPGEGPSTSVLPAASDGqsisLVPTRGKGSSTSVPPTATEGLSTSv 308
Cdd:pfam05539 167 EPKTAVTTSKTTSWPTEVSHPtypsqvtPQSQPATQGHQTATANQRLSSTE----PVGTQGTTTSSNPEPQTEPPPSQR- 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 309 qptaGEGSSTSVPPtpggglSTSVPPTATEELSTSVPPTPGEGPSTSVlpiPGEGLSTSVPPTASDGSDTSvPPTPGEGA 388
Cdd:pfam05539 242 ----GPSGSPQHPP------STTSQDQSTTGDGQEHTQRRKTPPATSN---RRSPHSTATPPPTTKRQETG-RPTPRPTA 307
|
170 180
....*....|....*....|
gi 20143482 389 STLVQPTAPDGPGSSVLPNP 408
Cdd:pfam05539 308 TTQSGSSPPHSSPPGVQANP 327
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
131-253 |
3.06e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 44.71 E-value: 3.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 131 AASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGT 210
Cdd:PRK14951 371 EAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPA 450
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 20143482 211 SVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRD 253
Cdd:PRK14951 451 PPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEE 493
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
257-398 |
4.08e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 42.97 E-value: 4.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 257 STSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTA 336
Cdd:PHA03255 27 SGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKV 106
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 20143482 337 TEELSTSVPPTPGEGPSTSVlPIPGEGLSTSVPPTASDGSDTSVPPTPGEGAS-----TLVQPTAPD 398
Cdd:PHA03255 107 TAQNITATEAGTGTSTGVTS-NVTTRSSSTTSATTRITNATTLAPTLSSKGTSnatktTAELPTVPD 172
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
4-226 |
4.12e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 4.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 4 VSQNSRRRRRRVAKATAHNSSWGEMQAPNAPGLPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPSTFVPPT 83
Cdd:PRK07764 588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 84 ISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPtiskglcTSVTLAASEGRNTSRPPTSSEEPSTSVPPTASEVPSTS 163
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPA-------PAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20143482 164 LPPTPGEGTstsVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQA 226
Cdd:PRK07764 741 LPPEPDDPP---DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
206-335 |
4.27e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.93 E-value: 4.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 206 EGPGTSVPLAATEGLSTSVQATPDEGPstsVPPTATEGLSTPVPPTrdEGPSTSVPATPGEGPSTSVLPAASDGQSISLV 285
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAAPAAAP---VAQAAAAPAPAAAPAA--AASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 20143482 286 PTRGKGSSTSVPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPT 335
Cdd:PRK14951 442 PAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPT 491
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
55-181 |
5.60e-04 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 43.56 E-value: 5.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 55 LQGLCASEGPSTSVLPTSAEgpSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASE 134
Cdd:PHA03269 31 LHTSAATQKPDPAPAPHQAA--SRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAAP 108
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 20143482 135 GRNTSRPPTSSeePSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAY 181
Cdd:PHA03269 109 KPDAAEAFTSA--AQAHEAPADAGTSAASKKPDPAAHTQHSPPPFAY 153
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
100-218 |
6.38e-04 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 43.54 E-value: 6.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 100 PGTSVLPTPseGLSTSGPPTISkglcTSVTLAASEGRNTSrpptSSEEPSTSVPPTASevPSTSLPPTPGE------GTS 173
Cdd:PLN02217 551 PGKGVPYIP--GLFAGNPGSTN----STPTGSAASSNTTF----SSDSPSTVVAPSTS--PPAGHLGSPPAtpskivSPS 618
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 20143482 174 TSVPPTAYEGPSTSvvPTPDEGPSTSVlPTPGEGPGTSVPLAATE 218
Cdd:PLN02217 619 TSPPASHLGSPSTT--PSSPESSIKVA-STETASPESSIKVASTE 660
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
231-419 |
7.47e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 43.12 E-value: 7.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 231 GPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEG-------PSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATEG 303
Cdd:pfam15967 23 GAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGgglfgqkPATGFTFGTPASSTAATGPTGLTLGTPAATTAASTG 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 304 LSTSVQPTAGEGSSTSVPPTP--GGGLSTSVPPTATEELSTSVPPTPGEG--PSTSVLPIPGEGLSTSVPPTASDGSDTS 379
Cdd:pfam15967 103 FSLGFNKPAASATPFSLPASStsGGGLSLGSVLTSTAAQQGATGFTLNLGgtPATTTAVSTGLSLGSTLTSLGGSLFQNT 182
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 20143482 380 VPPTPGEGASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSS 419
Cdd:pfam15967 183 NSTGLGQTTLGLTLLATSTAPVSAPAASEGLGGLDFSTSS 222
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
232-414 |
7.65e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.62 E-value: 7.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 232 PSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGpstsvlpAASDGQsislvPTRGKGSSTSVPPTATEGLSTSVQPT 311
Cdd:PHA03307 760 NPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAA-------FRRPGR-----LRRSGPAADAASRTASKRKSRSHTPD 827
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 312 AGEGSSTsvPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDGSDTSVPPTPGEgastl 391
Cdd:PHA03307 828 GGSESSG--PARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGA----- 900
|
170 180
....*....|....*....|....
gi 20143482 392 vQPTAPDGPGSSVL-PNPGEGPST 414
Cdd:PHA03307 901 -PAPRPRPAPRVKLgPMPPGGPDP 923
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
80-246 |
9.13e-04 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 42.73 E-value: 9.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 80 VPPTISEASSASGQPTISEGPgtsvlPTPSEGLSTSGPPTISKGLCTSVTLAASEG-RNTSRPPTSSEEPSTSVPPTASE 158
Cdd:pfam05539 167 EPKTAVTTSKTTSWPTEVSHP-----TYPSQVTPQSQPATQGHQTATANQRLSSTEpVGTQGTTTSSNPEPQTEPPPSQR 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 159 VPSTslppTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTS------VLPTPGEGPGTSVPLAATEGLSTSVQATPDEGP 232
Cdd:pfam05539 242 GPSG----SPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSnrrsphSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH 317
|
170
....*....|....
gi 20143482 233 STsvpPTATEGLST 246
Cdd:pfam05539 318 SS---PPGVQANPT 328
|
|
| Streccoc_I_II |
NF033804 |
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins ... |
62-204 |
1.25e-03 |
|
antigen I/II family LPXTG-anchored adhesin; Members of the antigen I/II family are adhesins with a glucan-binding domain, two types of repetitive regions, an isopeptide bond-forming domain associated with shear resistance, and a C-terminal LPXTG motif for anchoring to the cell wall. They occur in oral Streptococci, and tend to be major cell surface adhesins. Members of this family include SspA and SspB from Streptococcus gordonii, antigen I/II from S. mutans, etc.
Pssm-ID: 468188 [Multi-domain] Cd Length: 1552 Bit Score: 43.01 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 62 EGPSTSVLPTSAEGPSTFVPPTISEASSAsgqPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRP 141
Cdd:NF033804 830 EKPTPPVAPTAPQAPTYEVEKPLEPAPVA---PTYENEPTPPVKTPDQPEPSKPEEPTYETEKPLEPAPVAPTYENEPTP 906
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 20143482 142 PTSS---EEPSTSVPPT-ASEVPSTSLP--------PTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTP 204
Cdd:NF033804 907 PVKTpdqPEPSKPEEPTyETEKPLEPAPvapsyenePTPPVKTPDQPEPSKPVEPTYDPLPTPPVAPTPKQLPTP 981
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
26-276 |
1.94e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.14 E-value: 1.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 26 GEMQAPNAPGLPAdvPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSA--EGPSTFVPPTISEASSASGQPTISEGPGTS 103
Cdd:PRK07003 370 GGVPARVAGAVPA--PGARAAAAVGASAVPAVTAVTGAAGAALAPKAAaaAAATRAEAPPAAPAPPATADRGDDAADGDA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 104 VLPTPSEGLSTSGPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPST-SVPPTASEVPSTSLPPTPGEGTSTSVPPTAYE 182
Cdd:PRK07003 448 PVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPrAAAPSAATPAAVPDARAPAAASREDAPAAAAP 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 183 GPSTSVVPTPDEGPStsvlptPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRdegPSTSVPa 262
Cdd:PRK07003 528 PAPEARPPTPAAAAP------AARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPR---VAVQVP- 597
|
250
....*....|....
gi 20143482 263 TPGEGPSTSVLPAA 276
Cdd:PRK07003 598 TPRARAATGDAPPN 611
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
60-199 |
2.27e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 40.66 E-value: 2.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 60 ASEGPSTSVLPTSAEGPSTFVPPTISEAS--SASGQPTISEGPGTSVLPTPSEGLSTSGPPTISKGLCTSVT-------L 130
Cdd:PHA03255 32 ASAGNVTGTTAVTTPSPSASGPSTNQSTTltTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTtkvtaqnI 111
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 20143482 131 AASEGRNTSRPPTSSE---EPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPStsvvpTPDE-GPSTS 199
Cdd:PHA03255 112 TATEAGTGTSTGVTSNvttRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPT-----VPDErQPSLS 179
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
148-275 |
2.44e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 41.62 E-value: 2.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 148 PSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAyeGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQAT 227
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAA--AAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 20143482 228 PDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPA 275
Cdd:PRK14951 444 AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPT 491
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
193-423 |
2.79e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 2.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 193 DEGPSTSVLPTPGEG--PGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEG--LSTPVPPTR-DEGPSTSVPATPGE- 266
Cdd:PHA03247 252 IAAPAPPPVVGEGADraPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGapLALPAPPDPpPPAPAGDAEEEDDEd 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 267 GPSTSVLPAASDGQSISL-VPTRGKGSSTsvPPTATEGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVP 345
Cdd:PHA03247 332 GAMEVVSPLPRPRQHYPLgFPKRRRPTWT--PPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 20143482 346 PTPGEGPSTSVLPIPGeglstSVPPTASDGSDTsvpPTPGEGAStlvqPTAPDGPGSSVLPNPGEGPSTLFSSSASVD 423
Cdd:PHA03247 410 PVPASVPTPAPTPVPA-----SAPPPPATPLPS---AEPGSDDG----PAPPPERQPPAPATEPAPDDPDDATRKALD 475
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
113-264 |
2.98e-03 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 40.27 E-value: 2.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 113 STSGPPTISKGLCTSVTLAASEGRNTSRPPT--SSEEPSTSVPPTASEVPSTSlpPTPGEGTSTSVPPTAYEGPSTSVVP 190
Cdd:PHA03255 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTnqSTTLTTTSAPITTTAILSTN--TTTVTSTGTTVTPVPTTSNASTINV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 191 TPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQATPDEGPSTSVPPT-------ATEGLSTPVPPTRDEGPSTSVPAT 263
Cdd:PHA03255 103 TTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTlsskgtsNATKTTAELPTVPDERQPSLSYGL 182
|
.
gi 20143482 264 P 264
Cdd:PHA03255 183 P 183
|
|
| Metaviral_G |
pfam09595 |
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ... |
78-203 |
3.63e-03 |
|
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.
Pssm-ID: 462833 [Multi-domain] Cd Length: 183 Bit Score: 39.55 E-value: 3.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 78 TFVPPTISEASSASgqpTISEGPGTSVLPTPSEGLS-TSGPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPSTSVPPTA 156
Cdd:pfam09595 32 SLILIGESNKEAAL---IITDIIDININKQHPEQEHhENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAA 108
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 20143482 157 SEVPSTSLPPTPGEGTSTSVPP----------TAYEGPSTSVVPTPDEGPSTSVLPT 203
Cdd:pfam09595 109 KTKPSEHEPANPPDASNRLSPPdastaaireaRTFRKPSTGKRNNPSSAQSDQSPPR 165
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
141-430 |
4.19e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 4.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 141 PPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEGPGtsvplAATEGL 220
Cdd:PHA03247 257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAE-----EEDDED 331
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 221 STSVQATPDEGPSTSVPptatEGLSTPVPPTRDEgPSTSVPATPGEGPSTSVLPaasdgqsislvPTRGKGSSTSVPPTA 300
Cdd:PHA03247 332 GAMEVVSPLPRPRQHYP----LGFPKRRRPTWTP-PSSLEDLSAGRHHPKRASL-----------PTRKRRSARHAATPF 395
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 301 TEGLSTSVQPTAGEGSSTSVP-PTPGGGLSTSVPPTATeelstsvPPTPGEGPSTSVLPIPGEGlstSVPPTASDGSDTS 379
Cdd:PHA03247 396 ARGPGGDDQTRPAAPVPASVPtPAPTPVPASAPPPPAT-------PLPSAEPGSDDGPAPPPER---QPPAPATEPAPDD 465
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 20143482 380 VPPTPGEGASTLVQPTAPDGPGSS------VLPNPGEGPSTLFSSSASVDRNPSKCS 430
Cdd:PHA03247 466 PDDATRKALDALRERRPPEPPGADlaellgRHPDTAGTVVRLAAREAAIAREVAECS 522
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
149-248 |
4.21e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 40.39 E-value: 4.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 149 STSVPPTASEvpSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTPGEgpgTSVPLAATeglstsVQATP 228
Cdd:PRK10856 159 GQSVPLDTST--TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQ---ANVDTAAT------PAPAA 227
|
90 100
....*....|....*....|.
gi 20143482 229 DEGPSTSVP-PTATEGLSTPV 248
Cdd:PRK10856 228 PATPDGAAPlPTDQAGVSTPA 248
|
|
| DamX |
COG3266 |
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ... |
36-240 |
4.79e-03 |
|
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442497 [Multi-domain] Cd Length: 455 Bit Score: 40.60 E-value: 4.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 36 LPADVPGSDVPQGPSDSQILQGLCASEGPSTSVLPTSAEGPSTFVPPTISEASSASGQPTISEGPGTSVLPTPSEGLSTS 115
Cdd:COG3266 176 ALGAVAALLGLRKAEEALALRAGSAAADALALLLLLLASALGEAVAAAAELAALALLAAGAAEVLTARLVLLLLIIGSAL 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 116 GPPTISKGLCTSVTLAASEGRNTSRPPTSSEEPstsVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEg 195
Cdd:COG3266 256 KAPSQASSASAPATTSLGEQQEVSLPPAVAAQP---AAAAAAQPSAVALPAAPAAAAAAAAPAEAAAPQPTAAKPVVTE- 331
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 20143482 196 PSTSVLPTPGEGPGTSVPLAAteglSTSVQATPDEGPSTSVPPTA 240
Cdd:COG3266 332 TAAPAAPAPEAAAAAAAPAAP----AVAKKLAADEQWLASQPASH 372
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
223-348 |
5.94e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 40.47 E-value: 5.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 223 SVQATPDEGPSTSVPPTATEGLSTPVPPTrdEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPTRGKGSSTSVPPTATE 302
Cdd:PRK14951 369 AAEAAAPAEKKTPARPEAAAPAAAPVAQA--AAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVA 446
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 20143482 303 GLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTP 348
Cdd:PRK14951 447 LAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
|
|
| PHA03292 |
PHA03292 |
envelope glycoprotein I; Provisional |
242-396 |
6.07e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 177577 Cd Length: 413 Bit Score: 40.33 E-value: 6.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 242 EGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDgQSISLVPTrgkGSSTSVP-PTATEGLSTSVQPTAGEGSSTSV 320
Cdd:PHA03292 170 PTVPDPEPTTARPEPAAGYVATPTPRYLNAVTTSTYS-RSMSSQPA---GAATATPtPTLDTGLTTVAPPNETVVTGETA 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 321 PPTPGGGLSTSVPPTATEELST--SVPPTPGEGPSTSVLPIPGEGLSTSVPPTASDG--SDTSVPPTPGE--GASTLVqP 394
Cdd:PHA03292 246 LLCHWFQPSTRVPTLYLHLLGTtgNLTEDVLLTEDSEILRTPPPDPSSSRSPGAGDDfkQTNSTSPKRRNkiVAMIVI-P 324
|
..
gi 20143482 395 TA 396
Cdd:PHA03292 325 TA 326
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
209-320 |
6.13e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 40.01 E-value: 6.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 209 GTSVPLaateGLSTSVQATPDEGPSTSVPPTateglstpvpPTRDEGPSTSVPATPGEGPSTSVLPAASdgqsislvPTR 288
Cdd:PRK10856 159 GQSVPL----DTSTTTDPATTPAPAAPVDTT----------PTNSQTPAVATAPAPAVDPQQNAVVAPS--------QAN 216
|
90 100 110
....*....|....*....|....*....|..
gi 20143482 289 GKGSSTSVPPTATEGLSTSVQPTAGEGSSTSV 320
Cdd:PRK10856 217 VDTAATPAPAAPATPDGAAPLPTDQAGVSTPA 248
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
21-225 |
7.50e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 40.37 E-value: 7.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 21 HNSSWGEMQAPNAPGLPADVPGSDVPQGPSDSQilqGLcaSEGPSTSVLPTSAEGPSTFVPPTISEA-SSASGQPTISEG 99
Cdd:NF033849 355 HSESSSESTGTSVGHSTSSSVSSSESSSRSSSS---GV--SGGFSGGIAGGGVTSEGLGASQGGSEGwGSGDSVQSVSQS 429
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 100 PGTSVLPTPSEGLSTSgpptISKGLCTSVTLAASEGRNTSRPPTSSEepSTSVppTASEVPSTSLPPTPGEGTSTSVPPT 179
Cdd:NF033849 430 YGSSSSTGTSSGHSDS----SSHSTSSGQADSVSQGTSWSEGTGTSQ--GQSV--GTSESWSTSQSETDSVGDSTGTSES 501
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 20143482 180 AYEGPSTSVVPTPDEGPSTSVLPTPGEGPGTSVPLAATEGLSTSVQ 225
Cdd:NF033849 502 VSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLGKSYQ 547
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
70-426 |
8.12e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 40.12 E-value: 8.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 70 PTSAEGPSTFVPPTISEaSSASGQPTISEGPGTSVLPTpSEGLSTSG---PPTISKGLCTSVTLAASEGRNT-------S 139
Cdd:COG5099 74 SSSRRKPSGSWSVAISS-STSGSQSLLMELPSSSFNPS-TSSRNKSNsalSSTQQGNANSSVTLSSSTASSMfnsnklpL 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 140 RPPTSSEEPST--------SVPPTASEVPSTSL---PPTPGEGTSTSVPPTAYEGPSTSVVPT------PDEGPSTSVLP 202
Cdd:COG5099 152 PNPNHSNSATTnqsgssfiNTPASSSSQPLTNLvvsSIKRFPYLTSLSPFFNYLIDPSSDSATasadtsPSFNPPPNLSP 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 203 TPGEGPGTSVPLAATEglstSVQATPDEGPSTSV-PPTATEGlstPVPPTRDEGPSTSVPATPGEGPSTSVLPAA--SDG 279
Cdd:COG5099 232 NNLFSTSDLSPLPDTQ----SVENNIILNSSSSInELTSIYG---SVPSIRNLRGLNSALVSFLNVSSSSLAFSAlnGKE 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 280 QSISLVPTRGKGSStSVPPTATEGLSTSvQPTAGEGSSTSVPPtpggGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPI 359
Cdd:COG5099 305 VSPTGSPSTRSFAR-VLPKSSPNNLLTE-ILTTGVNPPQSLPS----LLNPVFLSTSTGFSLTNLSGYLNPNKNLKKNTL 378
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 20143482 360 P-GEGLSTSVPPTASDGSDTSvpptpgegASTLVQPTAPDGPGSSVLPNPGEGPSTLFSSSASVDRNP 426
Cdd:COG5099 379 SsLSNLGYSSNVPSPSSSEST--------RNILGNISPNFKTSSNLTNLNSLLKEKLSNSSSVSATDI 438
|
|
| PRK10856 |
PRK10856 |
cytoskeleton protein RodZ; |
131-227 |
9.16e-03 |
|
cytoskeleton protein RodZ;
Pssm-ID: 236776 [Multi-domain] Cd Length: 331 Bit Score: 39.24 E-value: 9.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 131 AASEGRNTSRPPTSSEEPSTSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTP---DEGPSTSVLPTPGEG 207
Cdd:PRK10856 151 SAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPsqaNVDTAATPAPAAPAT 230
|
90 100
....*....|....*....|.
gi 20143482 208 PGTSVPL-AATEGLSTSVQAT 227
Cdd:PRK10856 231 PDGAAPLpTDQAGVSTPAADP 251
|
|
| Gag_spuma |
pfam03276 |
Spumavirus gag protein; |
133-304 |
9.48e-03 |
|
Spumavirus gag protein;
Pssm-ID: 460872 [Multi-domain] Cd Length: 614 Bit Score: 39.73 E-value: 9.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 133 SEGRNTSRPPTSSEepstSVPPTASEVPSTSLPPTPGEGTSTSVPPTAYEGPSTSVVPTPDEGPSTSVLPTP-----GEG 207
Cdd:pfam03276 179 SPGAQGGIPPGASF----SGLPSLPAIGGIHLPAIPGIHARAPPGNIARSLGDDIMPSLGDAGMPQPRFAFHpgnpfAEA 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20143482 208 PGTSVPLAATEGLSTSVQATPDEGPSTSVPPTATEGLSTPVPPTRDEGPSTSVPATPGEGPSTSVLPAASDGQSISLVPT 287
Cdd:pfam03276 255 EGHPFAEAEGERPRDIPRAPRIDAPSAPAIPAIQPIAPPMIPPIGAPIPIPHGASIPGEHIRNPREEPIRLGREAPAIDG 334
|
170
....*....|....*..
gi 20143482 288 RGKGSSTSVPPTATEGL 304
Cdd:pfam03276 335 RFAPAIDDLFCRIINAL 351
|
|
|