|
Name |
Accession |
Description |
Interval |
E-value |
| Cadherin_repeat |
cd11304 |
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ... |
274-343 |
5.65e-10 |
|
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
Pssm-ID: 206637 [Multi-domain] Cd Length: 98 Bit Score: 56.94 E-value: 5.65e-10
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1677538249 274 IYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSVP--SPMTFLLLVKGQ-QADLARYSVTQVTVE 343
Cdd:cd11304 19 VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTATdGGGPPLSSTATVTIT 91
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
454-645 |
2.26e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 2.26e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENstshqPATPGGDT 533
Cdd:PHA03247 2681 QRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-----PATPGGPA 2755
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 534 AQTPKPGTSQPMPPG--VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPA-TPGGGTAQTPEAG-----TSQ 605
Cdd:PHA03247 2756 RPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAASPAGPlppptSAQ 2835
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1677538249 606 PMPPGMGTSTSHQPTTPGGGTA------QTPEPGTSQPMPLSKSTP 645
Cdd:PHA03247 2836 PTAPPPPPGPPPPSLPLGGSVApggdvrRRPPSRSPAAKPAAPARP 2881
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
451-646 |
1.15e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 55.69 E-value: 1.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 451 VSEQEPPSTEAGGTTGPwTSTTSEVPRPPEpSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQP---- 526
Cdd:pfam05109 595 VGETSPQANTTNHTLGG-TSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllts 672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 527 ATPGG------------------DTAQTPKPGT-SQPMPPGvGTSTSHQPATPSGgTAQTPEPGTSQPMPPSmGTSTSHQ 587
Cdd:pfam05109 673 AHPTGgenitqvtpaststhhvsTSSPAPRPGTtSQASGPG-NSSTSTKPGEVNV-TKGTPPKNATSPQAPS-GQKTAVP 749
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 588 PATPGGGTAQTPEAGTSQPmppGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPS 646
Cdd:pfam05109 750 TVTSTGGKANSTTGGKHTT---GHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSS 805
|
|
| CA |
smart00112 |
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ... |
276-314 |
2.85e-06 |
|
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Pssm-ID: 214520 [Multi-domain] Cd Length: 81 Bit Score: 45.80 E-value: 2.85e-06
10 20 30
....*....|....*....|....*....|....*....
gi 1677538249 276 AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS 314
Cdd:smart00112 2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP 40
|
|
| Cadherin_repeat |
cd11304 |
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ... |
37-120 |
4.34e-06 |
|
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
Pssm-ID: 206637 [Multi-domain] Cd Length: 98 Bit Score: 45.77 E-value: 4.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 37 FEVEENTNVTEPLVDIHV-----PEGQEVT--LGALSTPFAFRIQGN--QLFLNVTPDYEEKSLLEAQLLCQSGGT--LV 105
Cdd:cd11304 4 VSVPENAPPGTVVLTVSAtdpdsGENGEVTysIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTATDGGGppLS 83
|
90
....*....|....*
gi 1677538249 106 TQLRVFVSVLDVNDN 120
Cdd:cd11304 84 STATVTITVLDVNDN 98
|
|
| Cadherin |
pfam00028 |
Cadherin domain; |
253-343 |
6.34e-06 |
|
Cadherin domain;
Pssm-ID: 394985 [Multi-domain] Cd Length: 92 Bit Score: 45.37 E-value: 6.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028 1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
|
90
....*....|....
gi 1677538249 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028 77 GGpPLSSTATVTIT 90
|
|
| CA |
smart00112 |
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ... |
71-122 |
3.23e-05 |
|
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Pssm-ID: 214520 [Multi-domain] Cd Length: 81 Bit Score: 42.72 E-value: 3.23e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112 26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
419-617 |
7.36e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 7.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 419 TTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvseqepPSTEAGGTTGPWTSTTSEVPRPPEPSqgpsTTSSGGGTGPHP 498
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATSSTTSTTATATAAAAAATS----TSATLVATSTAS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 499 PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmpPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPP 578
Cdd:COG3469 104 GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGST----TTTTTVSGTETATGGTTTTSTTTTTTSASTTP 179
|
170 180 190
....*....|....*....|....*....|....*....
gi 1677538249 579 SMGTSTShqpATPGGGTAQTPEAGTSQPMPPGMGTSTSH 617
Cdd:COG3469 180 SATTTAT---ATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
476-637 |
6.86e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 39.75 E-value: 6.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 547
Cdd:NF033839 292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 548 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 623
Cdd:NF033839 371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
|
170
....*....|....
gi 1677538249 624 gGTAQTPEPGTSQP 637
Cdd:NF033839 450 -KPEVKPQPETPKP 462
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Cadherin_repeat |
cd11304 |
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ... |
274-343 |
5.65e-10 |
|
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
Pssm-ID: 206637 [Multi-domain] Cd Length: 98 Bit Score: 56.94 E-value: 5.65e-10
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1677538249 274 IYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSVP--SPMTFLLLVKGQ-QADLARYSVTQVTVE 343
Cdd:cd11304 19 VSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDreEQSSYTLTVTATdGGGPPLSSTATVTIT 91
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
454-645 |
2.26e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.49 E-value: 2.26e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENstshqPATPGGDT 533
Cdd:PHA03247 2681 QRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-----PATPGGPA 2755
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 534 AQTPKPGTSQPMPPG--VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPA-TPGGGTAQTPEAG-----TSQ 605
Cdd:PHA03247 2756 RPARPPTTAGPPAPAppAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAASPAGPlppptSAQ 2835
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1677538249 606 PMPPGMGTSTSHQPTTPGGGTA------QTPEPGTSQPMPLSKSTP 645
Cdd:PHA03247 2836 PTAPPPPPGPPPPSLPLGGSVApggdvrRRPPSRSPAAKPAAPARP 2881
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
311-637 |
1.36e-08 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 58.46 E-value: 1.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 311 VARSVPSPMTFLLLVKGQQADLARYSVTQVTVEAVAAAGSPPRFPQRLyrgTVARGAGAGVVVKDAAAPSQPLRIQAQDP 390
Cdd:PRK07764 438 APAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP---AAPAPAAAPAAPAAPAAPAGADDAATLRE 514
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 391 EFSDLNSAITyritNHSHFRME-------------GEVVLTTTTLAQAGAFYAEVEAHNTVTSGTATTVIEIQVS-EQEP 456
Cdd:PRK07764 515 RWPEILAAVP----KRSRKTWAillpeatvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEaVVGP 590
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 457 PSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQT 536
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 537 PKPGTSQPMPPGvGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPP------- 609
Cdd:PRK07764 671 AKAGGAAPAAPP-PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPpepddpp 749
|
330 340 350
....*....|....*....|....*....|
gi 1677538249 610 --GMGTSTSHQPTTPGGGTAQTPEPGTSQP 637
Cdd:PRK07764 750 dpAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
451-646 |
1.15e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 55.69 E-value: 1.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 451 VSEQEPPSTEAGGTTGPwTSTTSEVPRPPEpSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQP---- 526
Cdd:pfam05109 595 VGETSPQANTTNHTLGG-TSSTPVVTSPPK-NATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPllts 672
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 527 ATPGG------------------DTAQTPKPGT-SQPMPPGvGTSTSHQPATPSGgTAQTPEPGTSQPMPPSmGTSTSHQ 587
Cdd:pfam05109 673 AHPTGgenitqvtpaststhhvsTSSPAPRPGTtSQASGPG-NSSTSTKPGEVNV-TKGTPPKNATSPQAPS-GQKTAVP 749
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 588 PATPGGGTAQTPEAGTSQPmppGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPS 646
Cdd:pfam05109 750 TVTSTGGKANSTTGGKHTT---GHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSS 805
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
454-647 |
1.82e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 1.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPG--- 530
Cdd:PHA03247 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarp 2690
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 531 -----GDTAQTPKPG-TSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPggGTAQTPeAGTS 604
Cdd:PHA03247 2691 tvgslTSLADPPPPPpTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP--ARPPTT-AGPP 2767
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1677538249 605 QPMPPGmGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSS 647
Cdd:PHA03247 2768 APAPPA-APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA 2809
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
433-648 |
2.03e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 2.03e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 433 EAHNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTP 512
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPA 2872
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 513 GGPPGAENSTSHQPATPggdtaQTPKPGTSQPMPPgVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPG 592
Cdd:PHA03247 2873 AKPAAPARPPVRRLARP-----AVSRSTESFALPP-DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 593 GGTAQTPEAGTSQPMPPGMGTSTSHQPTTpgggTAQTPEPGTSQPMPLSKSTPSSG 648
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQPWLGALVPGRVAVP----RFRVPQPAPSREAPASSTPPLTG 2998
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
467-653 |
1.23e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 52.38 E-value: 1.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 467 PWTSTTSEVP----RPPEPSQGPSTTSSGGGTGPHP--PSGTTLRPPTSSTPGGPPGAENS-----------TSHQPATP 529
Cdd:PHA03378 600 PHPSQTPEPPttqsHIPETSAPRQWPMPLRPIPMRPlrMQPITFNVLVFPTPHQPPQVEITpykptwtqighIPYQPSPT 679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 530 GGDTAQTPK--PGTSQP-------MPPGVGTSTSHQPATPSGGTAQTPE--PGTSQP-------MPPSMGTSTSHQPATP 591
Cdd:PHA03378 680 GANTMLPIQwaPGTMQPppraptpMRPPAAPPGRAQRPAAATGRARPPAaaPGRARPpaaapgrARPPAAAPGRARPPAA 759
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249 592 GGGTAQTPEA--GTSQPMPPGMGTSTSHQptTPGGGTAQTPEP-GTSQPMPLSKSTPSSGGGPSE 653
Cdd:PHA03378 760 APGRARPPAAapGAPTPQPPPQAPPAPQQ--RPRGAPTPQPPPqAGPTSMQLMPRAAPGQQGPTK 822
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
452-653 |
1.39e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 51.99 E-value: 1.39e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 452 SEQEPPSTEAGGTTGPWTSTtsevprPPEPSQGPSTTSSGGGTGPHPPsGTTLRPPTSSTPGGPPGA--------ENSTS 523
Cdd:PHA03378 650 TPHQPPQVEITPYKPTWTQI------GHIPYQPSPTGANTMLPIQWAP-GTMQPPPRAPTPMRPPAAppgraqrpAAATG 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 524 HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE--PGTSQPMPPSMGTSTSHQPATPGGGTAQTPEA 601
Cdd:PHA03378 723 RARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAaaPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQA 802
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 602 G-----TSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKS-----TPSSGGGPSE 653
Cdd:PHA03378 803 GptsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALERQaaagpTPSPGSGTSD 864
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
455-724 |
2.55e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.48 E-value: 2.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 455 EPPSTEAGGTTGPwTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTT----------LRPPTSSTPGGP-PGAENSTS 523
Cdd:PHA03247 2700 DPPPPPPTPEPAP-HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGpatpggparpARPPTTAGPPAPaPPAAPAAG 2778
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 524 HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPA-TPSGGTAQTPEPG-----TSQPMPPSMGTSTSHQPATPGGGTAq 597
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLApAAALPPAASPAGPlppptSAQPTAPPPPPGPPPPSLPLGGSVA- 2857
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 598 tPEAGTSQPMPPGmgtSTSHQPTTPgggtAQTPEPGTSQPmPLSKSTPSSGGGPSEDKRFSVVDMAALGGVLGALLLLAL 677
Cdd:PHA03247 2858 -PGGDVRRRPPSR---SPAAKPAAP----ARPPVRRLARP-AVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ 2928
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1677538249 678 LGLAVLVHKHYGPRLKCCCGKAPEPQPQGFDNQAFLPDHKANWAPVP 724
Cdd:PHA03247 2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP 2975
|
|
| CA |
smart00112 |
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ... |
276-314 |
2.85e-06 |
|
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Pssm-ID: 214520 [Multi-domain] Cd Length: 81 Bit Score: 45.80 E-value: 2.85e-06
10 20 30
....*....|....*....|....*....|....*....
gi 1677538249 276 AEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARS 314
Cdd:smart00112 2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKP 40
|
|
| Cadherin_repeat |
cd11304 |
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ... |
37-120 |
4.34e-06 |
|
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
Pssm-ID: 206637 [Multi-domain] Cd Length: 98 Bit Score: 45.77 E-value: 4.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 37 FEVEENTNVTEPLVDIHV-----PEGQEVT--LGALSTPFAFRIQGN--QLFLNVTPDYEEKSLLEAQLLCQSGGT--LV 105
Cdd:cd11304 4 VSVPENAPPGTVVLTVSAtdpdsGENGEVTysIVSGNEDGLFSIDPStgEITTAKPLDREEQSSYTLTVTATDGGGppLS 83
|
90
....*....|....*
gi 1677538249 106 TQLRVFVSVLDVNDN 120
Cdd:cd11304 84 STATVTITVLDVNDN 98
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
474-653 |
4.39e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.54 E-value: 4.39e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 474 EVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSt 553
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMT- 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 554 shQPATPSGGTAQ-TPEPGTSQPMPPsmgtstshQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaQTPEP 632
Cdd:pfam03154 254 --QPPPPSQVSPQpLPQPSLHGQMPP--------MPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGP-----SPAAP 318
|
170 180
....*....|....*....|....
gi 1677538249 633 GTSQPM---PLSKSTPSSGGGPSE 653
Cdd:pfam03154 319 GQSQQRihtPPSQSQLQSQQPPRE 342
|
|
| Cadherin |
pfam00028 |
Cadherin domain; |
253-343 |
6.34e-06 |
|
Cadherin domain;
Pssm-ID: 394985 [Multi-domain] Cd Length: 92 Bit Score: 45.37 E-value: 6.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 253 YHGAVPTGhILPSPLVLRpgpIYAEDGDRGINQPIIYSIFRGNVNGTFIIHPDSGNLTVARSV--PSPMTFLLLVKGQQA 330
Cdd:pfam00028 1 YSASVPEN-APVGTEVLT---VTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLdrESIGEYELTVEATDS 76
|
90
....*....|....
gi 1677538249 331 DL-ARYSVTQVTVE 343
Cdd:pfam00028 77 GGpPLSSTATVTIT 90
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
456-651 |
6.80e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 6.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 456 PPSTEAGGTTGPwtSTTSEVPRPPEPSQGPSTTSSGGGTGPHP-PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTA 534
Cdd:PHA03247 2569 PPPRPAPRPSEP--AVTSRARRPDAPPQSARPRAPVDDRGDPRgPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 535 QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS-HQPATPGGGTAQTPEAGTSQ-PMPPGMG 612
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSlADPPPPPPTPEPAPHALVSAtPLPPGPA 2726
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1677538249 613 TSTSHQPTTPGGGTAQTPEPGTSQPM-PLSKSTPSSGGGP 651
Cdd:PHA03247 2727 AARQASPALPAAPAPPAVPAGPATPGgPARPARPPTTAGP 2766
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
498-613 |
1.80e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 48.44 E-value: 1.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP 577
Cdd:PRK07764 398 APSAAAAAPAAAPAPAAAAPAAAAAPAPAAAP--QPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPE 475
|
90 100 110
....*....|....*....|....*....|....*.
gi 1677538249 578 PSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGT 613
Cdd:PRK07764 476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
504-624 |
2.16e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 48.14 E-value: 2.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 504 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQPATPSGGTAQTPEP--GTSQPMPP 578
Cdd:PRK14959 365 LMPVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDAPPAPP 444
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 1677538249 579 SMGTSTSHQPATPGGGTAQTPEAGTSQP--MPPGMGTSTSHQPTTPGG 624
Cdd:PRK14959 445 RSGIPPRPAPRMPEASPVPGAPDSVASAsdAPPTLGDPSDTAEHTPSG 492
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
417-652 |
2.59e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.40 E-value: 2.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 417 LTTTTLAQAGAFYAEVEAHNTVTSGTATTVieiQVSEQEPPSTEAggTTGPWTSTTSEVP-RPPEPSQGPSTTSSGGGTG 495
Cdd:PHA03247 2721 LPPGPAAARQASPALPAAPAPPAVPAGPAT---PGGPARPARPPT--TAGPPAPAPPAAPaAGPPRRLTRPAVASLSESR 2795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 496 PHPPSGTTLRPPTSSTPggPPGAENSTSHQPAT---PGGDTAQTPKPGTSQPMPPGVGTSTSHQPATP--SGGTAQTPEP 570
Cdd:PHA03247 2796 ESLPSPWDPADPPAAVL--APAAALPPAASPAGplpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAA 2873
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 571 GTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSH-QPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGG 649
Cdd:PHA03247 2874 KPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
|
...
gi 1677538249 650 GPS 652
Cdd:PHA03247 2954 EPS 2956
|
|
| CA |
smart00112 |
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ... |
71-122 |
3.23e-05 |
|
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
Pssm-ID: 214520 [Multi-domain] Cd Length: 81 Bit Score: 42.72 E-value: 3.23e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 71 FRI--QGNQLFLNVTPDYEEKSLLEAQLLCQSGGT--LVTQLRVFVSVLDVNDNAP 122
Cdd:smart00112 26 FSIdpETGEITTTKPLDREEQPEYTLTVEATDGGGppLSSTATVTITVLDVNDNAP 81
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
431-661 |
3.61e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 47.47 E-value: 3.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 431 EVEAHNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPH---PPSGTTLRPP 507
Cdd:PHA03307 83 ESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAaspPAAGASPAAV 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQtPEPGTSQPMPPSMGTSTSHQ 587
Cdd:PHA03307 163 ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA-PAPGRSAADDAGASSSDSSS 241
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249 588 PATPGGGTAQTPEAGTSQPMPPGMGTST-SHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 661
Cdd:PHA03307 242 SESSGCGWGPENECPLPRPAPITLPTRIwEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASS 316
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
502-645 |
4.73e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.29 E-value: 4.73e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 502 TTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPAtpsggtaQTPEPGTSQPMPPSMG 581
Cdd:PRK07764 376 ARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPA-------PAPAPAPAPPSPAGNA 448
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 582 TSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTP 645
Cdd:PRK07764 449 PAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL 512
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
508-652 |
4.82e-05 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 46.21 E-value: 4.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTsqpMPPGVGTSTSHQPATPSGGTAQTPEPGT-----SQ------PM 576
Cdd:PRK11901 87 LSSGNQSSPSAANNTSDGHDASGVKNTAPPQDIS---APPISPTPTQAAPPQTPNGQQRIELPGNisdalSQqqgqvnAA 163
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 577 PPSMGTSTSHQP---ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPgggtaqTPEPGTSQPmPLSKSTPSSGGGPS 652
Cdd:PRK11901 164 SQNAQGNTSTLPtapATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH------HKTATVAVP-PATSGKPKSGAASA 235
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
456-640 |
5.63e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 5.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 456 PPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGT---------GPHPPSGTTLRPPTSSTPGGPPGAENSTS--- 523
Cdd:pfam03154 188 PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAphtliqqtpTLHPQRLPSPHPPLQPMTQPPPPSQVSPQplp 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 524 ----HQPATPGGDTAQT-----PKPGTSQPMP------------------PGVGTSTSHQPatPSGGTAQTPEPGTSQPM 576
Cdd:pfam03154 268 qpslHGQMPPMPHSLQTgpshmQHPVPPQPFPltpqssqsqvppgpspaaPGQSQQRIHTP--PSQSQLQSQQPPREQPL 345
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677538249 577 PPSmGTSTSHQPATPGGGTAQTPEAGT-------SQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPL 640
Cdd:pfam03154 346 PPA-PLSMPHIKPPPTTPIPQLPNPQShkhpphlSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPL 415
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
435-654 |
5.86e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 47.09 E-value: 5.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 435 HNTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPwtsTTSEVPRPPEP-SQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG 513
Cdd:PHA03307 39 SQGQLVSDSAELAAVTVVAGAAACDRFEPPTGP---PPGPGTEAPANeSRSTPTWSLSTLAPASPAREGSPTPPGPSSPD 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 514 GPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGG 593
Cdd:PHA03307 116 PPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPP 195
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1677538249 594 GTAqtPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 654
Cdd:PHA03307 196 STP--PAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE 254
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
374-657 |
5.99e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 5.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 374 KDAAAPSQPLRIQAQDPEFSDL----------NSAITYRITNHShfrmEGEVVLT-------TTTLAQAGAFyaeveaHN 436
Cdd:PRK12323 296 KIALAQVVPAAVQDDWPEADDIrrlagrfdaqEVQLFYQIANLG----RSELALApdeyagfTMTLLRMLAF------RP 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 437 TVTSGTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPP 516
Cdd:PRK12323 366 GQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG 445
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPggdtAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE-PGTSQPMPPSMGTSTSHQ--PATPGG 593
Cdd:PRK12323 446 GAPAPAPAPAAAP----AAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDdPPPWEELPPEFASPAPAQpdAAPAGW 521
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 594 GTAQTPEAGTSQPMPPGmgtstshqPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRF 657
Cdd:PRK12323 522 VAESIPDPATADPDDAF--------ETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMF 577
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
454-661 |
6.64e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 46.46 E-value: 6.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAENSTSHQPATP 529
Cdd:PLN03209 326 QRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTayedLKPPTSPIPTPPSSSPASSKSVDAVA 405
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 530 GGDTAQT-PKPGTSQPMP---PGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPS-MGTSTSHQPATPG-GGTAQTPEAGT 603
Cdd:PLN03209 406 KPAEPDVvPSPGSASNVPevePAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTgVSPSVSSTSSVPAvPDTAPATAATD 485
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538249 604 SQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKRFSVVD 661
Cdd:PLN03209 486 AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALAD 543
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
419-617 |
7.36e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 7.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 419 TTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvseqepPSTEAGGTTGPWTSTTSEVPRPPEPSqgpsTTSSGGGTGPHP 498
Cdd:COG3469 38 TATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATSSTTSTTATATAAAAAATS----TSATLVATSTAS 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 499 PSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmpPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPP 578
Cdd:COG3469 104 GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGST----TTTTTVSGTETATGGTTTTSTTTTTTSASTTP 179
|
170 180 190
....*....|....*....|....*....|....*....
gi 1677538249 579 SMGTSTShqpATPGGGTAQTPEAGTSQPMPPGMGTSTSH 617
Cdd:COG3469 180 SATTTAT---ATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| PRK13700 |
PRK13700 |
conjugal transfer protein TraD; Provisional |
538-614 |
8.56e-05 |
|
conjugal transfer protein TraD; Provisional
Pssm-ID: 184256 [Multi-domain] Cd Length: 732 Bit Score: 46.11 E-value: 8.56e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 538 KPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMG-TSTSHQPATPGGGT----AQTPEAGTSQPMPPGMG 612
Cdd:PRK13700 604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQP-QQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGIS 682
|
..
gi 1677538249 613 TS 614
Cdd:PRK13700 683 ES 684
|
|
| PHA03255 |
PHA03255 |
BDLF3; Provisional |
508-650 |
1.84e-04 |
|
BDLF3; Provisional
Pssm-ID: 165513 [Multi-domain] Cd Length: 234 Bit Score: 43.74 E-value: 1.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTP-----GGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATpSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PHA03255 25 TSSGSstasaGNVTGTTAVTTPSPSASGPSTNQSTtLTTTSAPITTTAILSTNTTTVT-STGTTVTPVPTTSNASTINVT 103
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1677538249 582 TSTSHQ--PATPGGGTAQTPEAGTSQPMPPGMGTSTSH-------QPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGG 650
Cdd:PHA03255 104 TKVTAQniTATEAGTGTSTGVTSNVTTRSSSTTSATTRitnattlAPTLSSKGTSNATKTTAELPTVPDERQPSLSYG 181
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
444-622 |
1.84e-04 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 44.92 E-value: 1.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 444 TTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPePSQGPSTTSSGGGTGPHPPSGTT----LRPPTSSTPGGPPGAE 519
Cdd:PLN03209 384 TSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSA-SNVPEVEPAQVEAKKTRPLSPYAryedLKPPTSPSPTAPTGVS 462
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 520 NSTSHQPATPG-GDTA--------QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPAT 590
Cdd:PLN03209 463 PSVSSTSSVPAvPDTApataatdaAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
|
170 180 190
....*....|....*....|....*....|...
gi 1677538249 591 PGGGTAQ-TPEAGTSQPMPPGMGTSTSHQPTTP 622
Cdd:PLN03209 543 DEQHHAQpKPRPLSPYTMYEDLKPPTSPTPSPV 575
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
417-652 |
2.39e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 44.57 E-value: 2.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 417 LTTTTLAQAGAFYAEVEAHNTVTSGTATTVieiqvsEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTssgggtgp 496
Cdd:pfam17823 97 LSEPATREGAADGAASRALAAAASSSPSSA------AQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAI-------- 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 497 hppsgTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSH-QPATPSGGTA------QTPE 569
Cdd:pfam17823 163 -----AAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAAtATGHPAAGTAlaavgnSSPA 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 570 PGT-----SQPMPPSMGTSTSH-QPATPGGGTAQT--PEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLS 641
Cdd:pfam17823 238 AGTvtaavGTVTPAALATLAAAaGTVASAAGTINMgdPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVH 317
|
250
....*....|.
gi 1677538249 642 KSTPSSGGGPS 652
Cdd:pfam17823 318 NTAGEPTPSPS 328
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
452-656 |
2.63e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.78 E-value: 2.63e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 452 SEQEPPSteaGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGG 531
Cdd:PHA03307 67 PPTGPPP---GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGS 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 532 DT---AQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQT-PEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPM 607
Cdd:PHA03307 144 PGpppAASPPAAGASPAAVASDAASSRQAALPLSSPEETaRAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPA 223
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 608 PPGM-----GTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSEDKR 656
Cdd:PHA03307 224 PGRSaaddaGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNG 277
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
558-648 |
3.57e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 44.00 E-value: 3.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 558 ATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQP------ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 628
Cdd:PRK14971 369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
|
90 100
....*....|....*....|
gi 1677538249 629 TPEPGTSQPMPLSKsTPSSG 648
Cdd:PRK14971 449 PAQFKEEKKIPVSK-VSSLG 467
|
|
| PHA03377 |
PHA03377 |
EBNA-3C; Provisional |
454-652 |
3.60e-04 |
|
EBNA-3C; Provisional
Pssm-ID: 177614 [Multi-domain] Cd Length: 1000 Bit Score: 44.27 E-value: 3.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSTTSEVPRP-PEPSQGPSTTSSGGGTGPHPPSGTtlRP------PTSSTPGGP------PGAEN 520
Cdd:PHA03377 663 QQEPSSRRQPATQSTPPRPSWLPSVfVLPSVDAGRAQPSEESHLSSMSPT--QPisheeqPRYEDPDDPldlslhPDQAP 740
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 521 STSHQPATPGGD---TAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPE---PGTSQPMPPSMGTSTSHQPATPGGG 594
Cdd:PHA03377 741 PPSHQAPYSGHEepqAQQAPYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGyagPWGLRAQHPRYRHSWAYWSQYPGHG 820
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 595 TAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPG------TSQPMPLSKSTPSSGGGPS 652
Cdd:PHA03377 821 HPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGpprlqlSQVPQLPYSQTLVSSSAPS 884
|
|
| PHA03264 |
PHA03264 |
envelope glycoprotein D; Provisional |
501-614 |
3.80e-04 |
|
envelope glycoprotein D; Provisional
Pssm-ID: 223029 [Multi-domain] Cd Length: 416 Bit Score: 43.84 E-value: 3.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 501 GTTLRPPTSSTPGGPPGAENSTShqPATPGGDTAQT-PKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPM 576
Cdd:PHA03264 252 GVVPPYFEESKGYEPPPAPSGGS--PAPPGDDRPEAkPEPGPVEDGAPGretGGEGEGPEPAGRDGAAGGEPKPGPPRPA 329
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1677538249 577 PPSMGTS-------TSHQPATPGggtaqTPEAGTSQPMPPGMGTS 614
Cdd:PHA03264 330 PDADRPEgwpsleaITFPPPTPA-----TPAVPRARPVIVGTGIA 369
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
504-654 |
4.26e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 44.01 E-value: 4.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 504 LRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTP-KPGTSQPMPPGVGTSTSHQPATPSGGTAQtPEPGTSQPMPPSMGT 582
Cdd:PHA03307 774 LLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPaADAASRTASKRKSRSHTPDGGSESSGPAR-PPGAAARPPPARSSE 852
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 583 STSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPSED 654
Cdd:PHA03307 853 SSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMPPGGPDPR 924
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
534-653 |
4.43e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 43.90 E-value: 4.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 534 AQTPKPGTSQPMPPGVG----TSTSHQPATPSGGTAQTPEPGTSQPM---PPSMGTSTSHQPATPGGGTAQTPEA--GTS 604
Cdd:PRK14959 360 AMLPRLMPVESLRPSGGgasaPSGSAAEGPASGGAATIPTPGTQGPQgtaPAAGMTPSSAAPATPAPSAAPSPRVpwDDA 439
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 1677538249 605 QPMPPGMGTSTSHQPTTPGGgtaqTPEPGTSQPMPLSKSTPSSGGGPSE 653
Cdd:PRK14959 440 PPAPPRSGIPPRPAPRMPEA----SPVPGAPDSVASASDAPPTLGDPSD 484
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
497-652 |
4.64e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.80 E-value: 4.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 497 HPPSGTTLRPPTSSTpGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMP------PGVGTSTSHQPATPSGGTAQTPEP 570
Cdd:pfam17823 90 HTPHGTDLSEPATRE-GAADGAASRALAAAASSSPSSAAQSLPAAIAALPseafsaPRAAACRANASAAPRAAIAAASAP 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 571 GTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTS-QPMPPGMGTSTSHQPT-TPGGGTAqTPEPGTSQPMPLSKSTPSSG 648
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSApATLTPARGISTAATATgHPAAGTA-LAAVGNSSPAAGTVTAAVGT 247
|
....
gi 1677538249 649 GGPS 652
Cdd:pfam17823 248 VTPA 251
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
497-652 |
5.11e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 5.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 497 HPPSGTTLRPPTSSTPggPPGAENSTS---HQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTP-EPGT 572
Cdd:PHA03247 346 HYPLGFPKRRRPTWTP--PSSLEDLSAgrhHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPaPTPV 423
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 573 SQPMPPSmgTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGG----GTAQTPEPGTSQPMPLSKSTPSSG 648
Cdd:PHA03247 424 PASAPPP--PATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKAldalRERRPPEPPGADLAELLGRHPDTA 501
|
....
gi 1677538249 649 GGPS 652
Cdd:PHA03247 502 GTVV 505
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
527-623 |
5.57e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 43.61 E-value: 5.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 527 ATPGGDTAQTPKPGTSQPM---PPGVGTSTSHQP------ATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQ 597
Cdd:PRK14971 369 ASGGRGPKQHIKPVFTQPAaapQPSAAAAASPSPsqssaaAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVR 448
|
90 100
....*....|....*....|....*.
gi 1677538249 598 TPEAGTSQPMPPgMGTSTSHQPTTPG 623
Cdd:PRK14971 449 PAQFKEEKKIPV-SKVSSLGPSTLRP 473
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
441-644 |
5.84e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 43.52 E-value: 5.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 441 GTATTVIEIQVSEQEPPSTEAGGTTGPWTSTTSEVP-------RPPEPSQGPSTTSSGGGTGPHPPSgttlRPPTSSTPG 513
Cdd:PTZ00449 547 GKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPkdpkhpkDPEEPKKPKRPRSAQRPTRPKSPK----LPELLDIPK 622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 514 GPPGAENSTS-------HQPATP----GGDTAQTPKPGTSqPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGT 582
Cdd:PTZ00449 623 SPKRPESPKSpkrppppQRPSSPerpeGPKIIKSPKPPKS-PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFES 701
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 583 STSHQPATPGGGTAQTPeagtsQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPMPLSKST 644
Cdd:PTZ00449 702 ILKETLPETPGTPFTTP-----RPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERT 758
|
|
| PRK13700 |
PRK13700 |
conjugal transfer protein TraD; Provisional |
508-583 |
6.99e-04 |
|
conjugal transfer protein TraD; Provisional
Pssm-ID: 184256 [Multi-domain] Cd Length: 732 Bit Score: 43.41 E-value: 6.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 508 TSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVG-TSTSHQPATPSGGT----AQTPEPGTSQPMPPSMGT 582
Cdd:PRK13700 604 EPDVPEVASGEDVTQAEQPQQPQQPQQPQQPQQPQQPVSPVINdKKSDAGVNVPAGGIeqelKMKPEEEMEQQLPPGISE 683
|
.
gi 1677538249 583 S 583
Cdd:PRK13700 684 S 684
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
418-586 |
8.58e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.82 E-value: 8.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 418 TTTTLAQAGAFYAEVEAHNTVTSGTATTVIeiQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPepsqgpsttssgGGTGPH 497
Cdd:COG3469 64 TAASSTAATSSTTSTTATATAAAAAATSTS--ATLVATSTASGANTGTSTVTTTSTGAGSVT------------STTSST 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTShqpATPSGGTAQTPEPGTSQPMP 577
Cdd:COG3469 130 AGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTAT---ATTASGATTPSATTTATTTG 206
|
....*....
gi 1677538249 578 PSMGTSTSH 586
Cdd:COG3469 207 PPTPGLPKH 215
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
517-636 |
9.35e-04 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 42.40 E-value: 9.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPG---VGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGG 593
Cdd:PRK12799 294 DTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSpavIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVN 373
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1677538249 594 GTAQTPEAGTSQPMPPGMGTSTSHQPTT--PGGGTAQTPEPGTSQ 636
Cdd:PRK12799 374 MQPQPMSTTETQQSSTGNITSTANGPTTslPAAPASNIPVSPTSR 418
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
469-652 |
1.02e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 1.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 469 TSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTL----RPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQP 544
Cdd:pfam05109 413 TTTTHKVIFSKAPESTTTSPTLNTTGFAAPNTTTGLpsstHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSP 492
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 545 MPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMP-----------PSMGTSTSHQPATPGGGTAQTPEAGTSQPMP----P 609
Cdd:pfam05109 493 SPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPavttptpnatsPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPnatiP 572
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1677538249 610 GMG-TSTSHQPTTPgggTAQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:pfam05109 573 TLGkTSPTSAVTTP---TPNATSPTVGETSPQANTTNHTLGGTS 613
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
509-652 |
1.10e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 42.53 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 509 SSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQP 588
Cdd:PRK07003 395 AVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADS 474
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 589 ATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGT------------AQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:PRK07003 475 GSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARApaaasredapaaAAPPAPEARPPTPAAAAPAARAGGAA 550
|
|
| SPT5 |
COG5164 |
Transcription elongation factor SPT5 [Transcription]; |
459-652 |
1.54e-03 |
|
Transcription elongation factor SPT5 [Transcription];
Pssm-ID: 444063 [Multi-domain] Cd Length: 495 Bit Score: 41.94 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 459 TEAGGTTGPWTSTTSEVPrppePSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPK 538
Cdd:COG5164 84 AQNQGGTRPAGNTGGTTP----AGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPP 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 539 PGTSQPMPPGVGTSTShqpATPSGGTAQTPEPGTSQPMPPSMGTSTSHQ--PATPGGGTAQTPEAGTSQPMPPGMGTSTS 616
Cdd:COG5164 160 GDGGSTTPPGPGGSTT---PPDDGGSTTPPNKGETGTDIPTGGTPRQGPdgPVKKDDKNGKGNPPDDRGGKTGPKDQRPK 236
|
170 180 190
....*....|....*....|....*....|....*.
gi 1677538249 617 HQPTTPGGGTAQTPEPGTSQPMPLSKSTPSSGGGPS 652
Cdd:COG5164 237 TNPIERRGPERPEAAALPAELTALEAENRAANPEPA 272
|
|
| G_path_suppress |
pfam15991 |
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ... |
498-651 |
1.61e-03 |
|
G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.
Pssm-ID: 464961 [Multi-domain] Cd Length: 272 Bit Score: 41.44 E-value: 1.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTsHQPATP---------GGDTAQTPKPGTSQPMPPG----VGTSTSHQPATPSGGT 564
Cdd:pfam15991 114 PQLSMQGQPHHQQHPGPQVGVLKRT-RSPSPPvqqqayykqPAFSPGYAEHGQQKHDDGRrgydVARFGSWNKSTAQYPP 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 565 AQTPEPGTSQPMPPSmgtSTSHQPATpgGGTAQTPEAGTSQPMPPGMgtstsHQPTTPGGgtaqTPEPGTSQPMPLSKST 644
Cdd:pfam15991 193 SGQLFYPTHQYLPPP---QTQGQADA--RLQTIYPQPGYALPLQQQY-----EHANQPSP----FVSSSPLKQMQSPKAG 258
|
....*..
gi 1677538249 645 PSSGGGP 651
Cdd:pfam15991 259 PGPQPMQ 265
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
498-623 |
1.82e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 42.01 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPgTSQPMP 577
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP-AAAPAA 444
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1677538249 578 PSMGTSTSHQPA----TPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPG 623
Cdd:PRK14951 445 VALAPAPPAQAApetvAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
507-632 |
1.93e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 41.64 E-value: 1.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAENSTSHQPATPggDTAQTPKPGTSQPMPPGVgtsTSHQPATpsggtaQTPEPGTSqpmppsmgtSTSH 586
Cdd:PHA03269 46 PHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAASEKFDPAP---APHQAAS------RAPDPAVA---------PQLA 105
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1677538249 587 QPATPGGGTAQTpEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEP 632
Cdd:PHA03269 106 AAPKPDAAEAFT-SAAQAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
502-610 |
2.29e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 41.33 E-value: 2.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 502 TTLRPPTSSTPGGPpgaeNSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PRK14950 358 ALLVPVPAPQPAKP----TAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTR 433
|
90 100 110
....*....|....*....|....*....|
gi 1677538249 582 TStshqpatpgggtAQTPEAGTS-QPMPPG 610
Cdd:PRK14950 434 AA------------IPVDEKPKYtPPAPPK 451
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
456-610 |
2.40e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.70 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 456 PPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQ 535
Cdd:PHA03307 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249 536 TPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPG 610
Cdd:PHA03307 363 SSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSG 437
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
449-652 |
2.43e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 2.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 449 IQVSEQEPPSTEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSSGGGtgphPPSGTTLRPPTSSTPGGP------PGAENST 522
Cdd:pfam03154 249 LQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPV----PPQPFPLTPQSSQSQVPPgpspaaPGQSQQR 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 523 SHQPatPGGDTAQTPKPGTSQPMPPGvGTSTSHQPATPSGGTAQTPEPGT-------SQPMPPSMGT------------- 582
Cdd:pfam03154 325 IHTP--PSQSQLQSQQPPREQPLPPA-PLSMPHIKPPPTTPIPQLPNPQShkhpphlSGPSPFQMNSnlppppalkplss 401
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 583 -STSHQPATPGGGTAQTPEAGTSQP---MPPGMGTSTSHQPTTPGGGTAQTPEPGTSQPmPLSKSTPSSGGGPS 652
Cdd:pfam03154 402 lSTHHPPSAHPPPLQLMPQSQQLPPppaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQS-PFPQHPFVPGGPPP 474
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
445-570 |
2.52e-03 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 41.25 E-value: 2.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 445 TVIEIQVSEQEPPSTEAGGTT-GPWTSTTSEVPRPPEPSQGpsttssgggtgphPPSGTTLRP-----PTSSTPGGPPGA 518
Cdd:PHA03269 33 TSAATQKPDPAPAPHQAASRApDPAVAPTSAASRKPDLAQA-------------PTPAASEKFdpapaPHQAASRAPDPA 99
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 519 ENSTSHQPATPGGDTAQTPKPgTSQPMPPGVGTSTSHQPATPSGGTAQTPEP 570
Cdd:PHA03269 100 VAPQLAAAPKPDAAEAFTSAA-QAHEAPADAGTSAASKKPDPAAHTQHSPPP 150
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
442-621 |
2.70e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 2.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 442 TATTVIEIQVSEQEPPSTEAGGTT---GPWTSTTSEVPRPPEPSqgpsttssgggTGPHPPSGTTLRP----PTSSTPGG 514
Cdd:PHA03247 2833 SAQPTAPPPPPGPPPPSLPLGGSVapgGDVRRRPPSRSPAAKPA-----------APARPPVRRLARPavsrSTESFALP 2901
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 515 PPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTShqPATPSGGTAQTPEPGTSQPMPPSMGTSTSHQPATpggg 594
Cdd:PHA03247 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP--PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVP---- 2975
|
170 180
....*....|....*....|....*..
gi 1677538249 595 TAQTPEAGTSQPMPPGMGTSTSHQPTT 621
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLS 3002
|
|
| Med15 |
pfam09606 |
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ... |
507-649 |
3.19e-03 |
|
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.
Pssm-ID: 312941 [Multi-domain] Cd Length: 732 Bit Score: 41.15 E-value: 3.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAE--------------NSTSHQPATPGGDTAQT------PKPGTSQPMPPGVGTSTSHQPATPSGGTAQ 566
Cdd:pfam09606 101 PMGPGPGGPMGQQmggpgtasnllaslGRPQMPMGGAGFPSQMSrvgrmqPGGQAGGMMQPSSGQPGSGTPNQMGPNGGP 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 567 ------TPEPGTSQPM----PPSMGTSTSHQPATPGGGTAQtpEAGTSQPMPPG------MGTSTSHQPTTPGGGTAQTp 630
Cdd:pfam09606 181 gqgqagGMNGGQQGPMggqmPPQMGVPGMPGPADAGAQMGQ--QAQANGGMNPQqmggapNQVAMQQQQPQQQGQQSQL- 257
|
170
....*....|....*....
gi 1677538249 631 EPGTSQPMPLSKSTPSSGG 649
Cdd:pfam09606 258 GMGINQMQQMPQGVGGGAG 276
|
|
| PRK12495 |
PRK12495 |
hypothetical protein; Provisional |
507-628 |
3.69e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 183558 [Multi-domain] Cd Length: 226 Bit Score: 39.85 E-value: 3.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 507 PTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPePGTSQPMPPSMGTSTSH 586
Cdd:PRK12495 62 PTCQQPVTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSAT-DEAATDPPATAAARDGP 140
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1677538249 587 QPATPGGGTAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQ 628
Cdd:PRK12495 141 TPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQ 182
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
498-651 |
4.09e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 40.79 E-value: 4.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 498 PPSGTTLRPPTSSTPGGPPGAEN-------------STSHQPAT-PGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGG 563
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamrAQAKKPAQqPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 564 TAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMGTSTS--HQPTTPGGGTAQTPEPGT----SQP 637
Cdd:pfam09770 249 QPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQilQNPNRLSAARVGYPQNPQpgvqPAP 328
|
170
....*....|....
gi 1677538249 638 MPLSKSTPSSGGGP 651
Cdd:pfam09770 329 AHQAHRQQGSFGRQ 342
|
|
| Pneumo_att_G |
pfam05539 |
Pneumovirinae attachment membrane glycoprotein G; |
454-631 |
4.44e-03 |
|
Pneumovirinae attachment membrane glycoprotein G;
Pssm-ID: 114270 [Multi-domain] Cd Length: 408 Bit Score: 40.42 E-value: 4.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 454 QEPPSTEAGGTTGPWTSttsEVPRPPEPSQGPSTTssgggtgpHPPSG--TTLRPPTSSTPGGPPGAENSTSHQPATPgg 531
Cdd:pfam05539 166 KEPKTAVTTSKTTSWPT---EVSHPTYPSQVTPQS--------QPATQghQTATANQRLSSTEPVGTQGTTTSSNPEP-- 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 532 dtAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSmGTSTSHQPATPgGGTAQTPEAGTSQPMPPGM 611
Cdd:pfam05539 233 --QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATS-NRRSPHSTATP-PPTTKRQETGRPTPRPTAT 308
|
170 180
....*....|....*....|
gi 1677538249 612 GTSTSHQPTTPGGGTAQTPE 631
Cdd:pfam05539 309 TQSGSSPPHSSPPGVQANPT 328
|
|
| motB |
PRK12799 |
flagellar motor protein MotB; Reviewed |
517-647 |
6.10e-03 |
|
flagellar motor protein MotB; Reviewed
Pssm-ID: 183756 [Multi-domain] Cd Length: 421 Bit Score: 40.08 E-value: 6.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 517 GAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTStshQPATpSGGTAQTPEPGTSQPMPPSMGTSTSHQPATPGGGTA 596
Cdd:PRK12799 289 GLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIP---SPAV-IPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTV 364
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1677538249 597 QTPEAGTSQPMPPGMGTSTSHQPTTpGGGTAQTPEPGTSQP-MPLSKSTPSS 647
Cdd:PRK12799 365 ALPAAEPVNMQPQPMSTTETQQSST-GNITSTANGPTTSLPaAPASNIPVSP 415
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
470-606 |
6.27e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 40.28 E-value: 6.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 470 STTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSST-PGGPPGAENSTSHQPATPGGDTAQTPKPGTSqpmppG 548
Cdd:pfam05109 695 STSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATsPQAPSGQKTAVPTVTSTGGKANSTTGGKHTT-----G 769
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1677538249 549 VGTSTSHQPATPSGGTAQTPEP--GTSQPMPPSmgTSTSHQP----ATPGGGTAQT--PEAGTSQP 606
Cdd:pfam05109 770 HGARTSTEPTTDYGGDSTTPRTryNATTYLPPS--TSSKLRPrwtfTSPPVTTAQAtvPVPPTSQP 833
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
506-638 |
6.40e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.45 E-value: 6.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 506 PPTSSTPGGPPGAENSTSHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMGTSTS 585
Cdd:PRK10263 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY 830
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1677538249 586 HQPATPgggTAQTPEAGTSQPMPPGMGTSTS-HQPTTP-GGGTAQTPEPGTSQPM 638
Cdd:PRK10263 831 QQPQQP---VAPQPQDTLLHPLLMRNGDSRPlHKPTTPlPSLDLLTPPPSEVEPV 882
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
476-637 |
6.86e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 39.75 E-value: 6.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPSGTTLRPPTSSTPG-----GPPGAENSTSHQPATPGGDT---AQTPKPGTsQPMPP 547
Cdd:NF033839 292 PSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKpevkpQPEKPKPEVKPQLETPKPEVkpqPEKPKPEV-KPQPE 370
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 548 GVGTSTSHQPATPSGGTAQTPEPGTS--QPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQ--PMPPGMGTSTSHQPTTPg 623
Cdd:NF033839 371 KPKPEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPEKPKPEVKPQPEKP- 449
|
170
....*....|....
gi 1677538249 624 gGTAQTPEPGTSQP 637
Cdd:NF033839 450 -KPEVKPQPETPKP 462
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
436-646 |
7.01e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 39.94 E-value: 7.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 436 NTVTSGTATTVIEIQVSEQEPPSTEAGGTTGPW----------TSTTSEVPRPPEPSQGPSTTSSGGGTGPHPPSGTTLR 505
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSaahlnstevtAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 506 PPTSSTP---GGPPGAENST-------SHQPATPGGDTAQTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTS-Q 574
Cdd:pfam17823 125 SAAQSLPaaiAALPSEAFSApraaacrANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSApA 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 575 PMPPSMGTSTSH-QPATPGGGTA-----------QTPEAGTSQPMPPGMGTSTSH-QPTTPGGGTAQTPEPGTSQPMPlS 641
Cdd:pfam17823 205 TLTPARGISTAAtATGHPAAGTAlaavgnsspaaGTVTAAVGTVTPAALATLAAAaGTVASAAGTINMGDPHARRLSP-A 283
|
....*
gi 1677538249 642 KSTPS 646
Cdd:pfam17823 284 KHMPS 288
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
459-596 |
7.71e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 39.97 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 459 TEAGGTTGPWTSTTSEVPRPPEPSQGPSTTSsgggtgphPPSGTTLRPPTSSTPGGPPGAENSTSHQPATPGGDTAQTPK 538
Cdd:PRK07764 386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAA--------PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPP 457
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1677538249 539 PGTSQPMPPG-VGTSTSHQPATPSGGTAQTPEPGTSQPMPPsmgtstshQPATPGGGTA 596
Cdd:PRK07764 458 PAAAPSAQPApAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA--------APAAPAGADD 508
|
|
| SOBP |
pfam15279 |
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ... |
476-639 |
8.32e-03 |
|
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.
Pssm-ID: 464609 [Multi-domain] Cd Length: 325 Bit Score: 39.41 E-value: 8.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 476 PRPPEPSQGPSTTSSGGGTGPHPPsgtTLRPPTSSTPGGPPGAEnsTSHQPATPGGDTAQTPK-----PGTSQPMPPgvg 550
Cdd:pfam15279 128 PKPHEPPSLPPPPLPPKKGRRHRP---GLHPPLGRPPGSPPMSM--TPRGLLGKPQQHPPPSPlpafmEPSSMPPPF--- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 551 TSTSHQPATPSGGTAQTPEPGT-SQPMPPSMGTSTSHQPATPGGGTAQTPEAGTSQPMPPGMgtstshQPTTPGGGTAQT 629
Cdd:pfam15279 200 LRPPPSIPQPNSPLSNPMLPGIgPPPKPPRNLGPPSNPMHRPPFSPHHPPPPPTPPGPPPGL------PPPPPRGFTPPF 273
|
170
....*....|
gi 1677538249 630 PEPGTSQPMP 639
Cdd:pfam15279 274 GPPFPPVNMM 283
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
505-635 |
9.00e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 39.70 E-value: 9.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1677538249 505 RPPTSStpGGPPGAENSTSHQPATPGGDTA---QTPKPGTSQPMPPGVGTSTSHQPATPSGGTAQTPEPGTSQPMPPSMG 581
Cdd:PRK14951 365 KPAAAA--EAAAPAEKKTPARPEAAAPAAApvaQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1677538249 582 TSTSHQPATPgggtAQTPEAGTSQPMPPGMGTSTSHQPTTPGGGTAQTPEPGTS 635
Cdd:PRK14951 443 AAVALAPAPP----AQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
|
|
|