| Name | Accession | Description | Interval | E-value |
| ATF7IP_BD | pfam16788 | ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ... | 564-779 | 1.12e-76 |
|
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well as the transcriptional regulator Sp1, which are part of the transcriptional machinery.
Pssm-ID: 465271 [Multi-domain] Cd Length: 214 Bit Score: 252.29 E-value: 1.12e-76
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788 1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788 76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1917203715 722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788 149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
|
|
| fn3_4 | pfam16794 | Fibronectin-III type domain; | 1160-1260 | 6.58e-49 |
|
Fibronectin-III type domain;
Pssm-ID: 465273 [Multi-domain] Cd Length: 101 Bit Score: 168.68 E-value: 6.58e-49
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794 2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
|
90 100
....*....|....*....|..
gi 1917203715 1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794 80 YYFAVRAVDIHGRYGPFSDPKT 101
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 822-1158 | 4.84e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
|
330 340 350
....*....|....*....|....*....|.
gi 1917203715 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| PTZ00341 | PTZ00341 | Ring-infected erythrocyte surface antigen; Provisional | 319-574 | 4.54e-08 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 57.87 E-value: 4.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341 929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341 990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
|
250
....*....|....*...
gi 1917203715 557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 708-1149 | 6.28e-07 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 54.00 E-value: 6.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154 223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154 380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154 435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1917203715 1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154 511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
|
|
| MSCRAMM_ClfA | NF033609 | MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... | 122-433 | 1.87e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of its sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609 624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609 704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1917203715 355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 110-287 | 4.10e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 4.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307 157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
|
170 180
....*....|....*....|....*....
gi 1917203715 259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307 237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
|
|
| SMC_prok_B | TIGR02168 | chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... | 324-662 | 2.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168 684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168 753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168 821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168 895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1917203715 633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168 971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
|
|
| MDN1 | COG5271 | Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ... | 80-552 | 4.01e-03 |
|
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444083 [Multi-domain] Cd Length: 1028 Bit Score: 41.54 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271 274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271 354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271 434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271 514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271 594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271 674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748
|
.
gi 1917203715 552 S 552
Cdd:COG5271 749 A 749
|
|
| COG1340 | COG1340 | Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; | 532-665 | 9.66e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.51 E-value: 9.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340 29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1917203715 610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340 102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
|
|
|
| Name | Accession | Description | Interval | E-value |
| ATF7IP_BD | pfam16788 | ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ... | 564-779 | 1.12e-76 |
|
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well as the transcriptional regulator Sp1, which are part of the transcriptional machinery.
Pssm-ID: 465271 [Multi-domain] Cd Length: 214 Bit Score: 252.29 E-value: 1.12e-76
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788 1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788 76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1917203715 722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788 149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
|
|
| fn3_4 | pfam16794 | Fibronectin-III type domain; | 1160-1260 | 6.58e-49 |
|
Fibronectin-III type domain;
Pssm-ID: 465273 [Multi-domain] Cd Length: 101 Bit Score: 168.68 E-value: 6.58e-49
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794 2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
|
90 100
....*....|....*....|..
gi 1917203715 1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794 80 YYFAVRAVDIHGRYGPFSDPKT 101
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 822-1158 | 4.84e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
|
330 340 350
....*....|....*....|....*....|.
gi 1917203715 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 820-1174 | 4.96e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.96e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 820 QSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSiQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2595 SARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA 2673
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 900 P---STNRGPIQMKIPISAFS-TSSAAEQNSNTTPriENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmddee 975
Cdd:PHA03247 2674 AqasSPPQRPRRRAARPTVGSlTSLADPPPPPPTP--EPAPHALVSATPLPPGPAAARQASPALPAAPA----------- 2740
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 976 sgasqdPKKLNHTPVSTMSSSQPVSRPLQ--PIQPAPPLQPSG-------VPTSGPSQTTIHLLPTAPTTVNVThRPVTQ 1046
Cdd:PHA03247 2741 ------PPAVPAGPATPGGPARPARPPTTagPPAPAPPAAPAAgpprrltRPAVASLSESRESLPSPWDPADPP-AAVLA 2813
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1047 VTTRLPVPRAPANHQVVYTT-LPAPPAQAP--------LRGTVMQAPAVRQVNPQNSvTVRVPQTTTYVVNNGLTlgstG 1117
Cdd:PHA03247 2814 PAAALPPAASPAGPLPPPTSaQPTAPPPPPgppppslpLGGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLA----R 2888
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 1917203715 1118 PQLTvhhRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKPHLKLARVQSQ 1174
Cdd:PHA03247 2889 PAVS---RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
|
|
| PTZ00341 | PTZ00341 | Ring-infected erythrocyte surface antigen; Provisional | 319-574 | 4.54e-08 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 57.87 E-value: 4.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341 929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341 990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
|
250
....*....|....*...
gi 1917203715 557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 708-1149 | 6.28e-07 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 54.00 E-value: 6.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154 223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154 380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154 435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 1917203715 1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154 511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
|
|
| PTZ00121 | PTZ00121 | MAEBL; Provisional | 319-684 | 2.44e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.45 E-value: 2.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 319 KNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSD--RPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEeaKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 397 KNEDETSADLVETINE--------NVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDEtNPDLE 468
Cdd:PTZ00121 1517 KAEEAKKADEAKKAEEakkadeakKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE-EARIE 1595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 469 EKMEssFGSPSKQESSESLPKEAFLVLSDEEdISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSE- 547
Cdd:PTZ00121 1596 EVMK--LYEEEKKMKAEEAKKAEEAKIKAEE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEe 1672
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 548 ---------KNEFSRRKRS-----KSEDMDNVQSKRRRYMEEEYEAEfQVKitakgdinqKLQKVIQWLLEEklcALQCA 613
Cdd:PTZ00121 1673 dkkkaeeakKAEEDEKKAAealkkEAEEAKKAEELKKKEAEEKKKAE-ELK---------KAEEENKIKAEE---AKKEA 1739
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1917203715 614 VFDKTLAElKTRVEKIECNKRHKTVLTELQAKIARLTKRFEAAKEDLKKRHEhppNPPVSPGKTVNDVNSN 684
Cdd:PTZ00121 1740 EEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE---KRRMEVDKKIKDIFDN 1806
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 792-1172 | 5.64e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.92 E-value: 5.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 792 SSQPQLLQSHPGTLVTNQPSgnvefiSVQSPPTVSGLTKNPVSLPSLPNPTKPNNV-PSVPSPSIQRNPTASAAPL---G 867
Cdd:pfam03154 161 SAQQQILQTQPPVLQAQSGA------ASPPSPPPPGTTQAATAGPTPSAPSVPPQGsPATSQPPNQTQSTAAPHTLiqqT 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 868 TTLAVQAVPTAHSIVQ-ATRTSLPTVGPSGLYSPSTNRGPIQ-MKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 235 PTLHPQRLPSPHPPLQpMTQPPPPSQVSPQPLPQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS 314
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 946 KKAADSTSQCGKATGSDSSGvidltmddeesgASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPT--SGPS 1023
Cdd:pfam03154 315 PAAPGQSQQRIHTPPSQSQL------------QSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlSGPS 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1024 QTTIHL-LPTAPTTvnvthRPVTQVTTRLPVPRAPANHQVVYTT--LPAPPAQAPLRGTVMQAPAVRQVNPQNSVTVRVP 1100
Cdd:pfam03154 383 PFQMNSnLPPPPAL-----KPLSSLSTHHPPSAHPPPLQLMPQSqqLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVP 457
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1917203715 1101 QTTTYVVNNGLTLGStgpqltvhhrPPQVHTEPPRPVHPAPLPEAPQPQRLPPeAASTSLPQKPHLKLARVQ 1172
Cdd:pfam03154 458 SQSPFPQHPFVPGGP----------PPITPPSGPPTSTSSAMPGIQPPSSASV-SSSGPVPAAVSCPLPPVQ 518
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 708-1024 | 9.15e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 9.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILHV 787
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 788 PVAVSSQPQLLQSHPGTLVTNQPSGNVEFISVQSP--PTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASA-- 863
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAParPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPqp 2923
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 864 -APLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDA 942
Cdd:PHA03247 2924 pPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP--SREAPASSTPPLTGHSL 3001
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 943 SVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSsqpvsrpLQPIQPAPPLQPSGVPTSGP 1022
Cdd:PHA03247 3002 SRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEA-------LDPLPPEPHDPFAHEPDPAT 3074
|
..
gi 1917203715 1023 SQ 1024
Cdd:PHA03247 3075 PE 3076
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
122-433 |
1.87e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609 624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609 704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1917203715 355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
110-287 |
4.10e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 4.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307 157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
|
170 180
....*....|....*....|....*....
gi 1917203715 259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307 237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
708-1191 |
6.55e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.22 E-value: 6.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 708 SAPPSFQTPVNTVSSTNLVTP------PAVVSSQPKLQTPVTSGSLTATSVL--PAPNTATVVATTQVPSGNPQ------ 773
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPntttglPSSTHVPTNLTAPASTGPTVSTADVtsPTPAGTTSGASPVTPSPSPRdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 774 --PTISLQPLPVILHVPVAVSSQPQLLQSHPGTlvTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVP 851
Cdd:pfam05109 502 kaPDMTSPTSAVTTPTPNATSPTPAVTTPTPNA--TSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 852 SPSIQR-NPTASAAPLGTTlAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRgpiqmKIPISAFSTSSAAEQNSNTTP 930
Cdd:pfam05109 580 TSAVTTpTPNATSPTVGET-SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTG-----QHNITSSSTSSMSLRPSSISE 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 931 RIENQTNKtidasvskkaaDSTSQCGKATGSDSSGVIDLTMDDEESgasqdpkklnhTPVSTMSSSQPVSRPLQPIQPAP 1010
Cdd:pfam05109 654 TLSPSTSD-----------NSTSHMPLLTSAHPTGGENITQVTPAS-----------TSTHHVSTSSPAPRPGTTSQASG 711
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1011 PlqpsgvptsGPSQTTihllpTAPTTVNVTH-RPVTQVTTrlpvPRAPANHQVVYTTLPAPPAQA-PLRGTVMQAPAVRQ 1088
Cdd:pfam05109 712 P---------GNSSTS-----TKPGEVNVTKgTPPKNATS----PQAPSGQKTAVPTVTSTGGKAnSTTGGKHTTGHGAR 773
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1089 VNPQNSVTVRVPQTTTYVVNNGLTLgsTGPQLTVHHRPPQVHTEPPRPVHPAPLPeapqpqrLPPeaasTSLPQKPHLKL 1168
Cdd:pfam05109 774 TSTEPTTDYGGDSTTPRTRYNATTY--LPPSTSSKLRPRWTFTSPPVTTAQATVP-------VPP----TSQPRFSNLSM 840
|
490 500
....*....|....*....|...
gi 1917203715 1169 ARVQSQNGIVLSWSVLEVDRSCA 1191
Cdd:pfam05109 841 LVLQWASLAVLTLLLLLVMADCA 863
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
142-291 |
7.16e-05 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 46.90 E-value: 7.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 142 GDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDvsPGDATSGDATADDLSS-------------------- 201
Cdd:PRK13108 278 GREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVG--PGEPNQPDDVAEAVKAevaevtdevaaesvvqvadr 355
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 202 -GDPTSSDPIPGEPVPVEPISGDCAADDIASSEitsVDLASGAPASTDPAsdDLASGDLSSSELASDDLATGELAS---D 277
Cdd:PRK13108 356 dGESTPAVEETSEADIEREQPGDLAGQAPAAHQ---VDAEAASAAPEEPA--ALASEAHDETEPEVPEKAAPIPDPakpD 430
|
170
....*....|....
gi 1917203715 278 ELTSESTFDRTFEP 291
Cdd:PRK13108 431 ELAVAGPGDDPAEP 444
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
963-1164 |
7.59e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 7.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 963 SSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHR 1042
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1043 PVT-----QVTTRLPVPRAPANHQVVY-TTLPAPPAQAP--------LRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVN 1108
Cdd:pfam03154 224 TAAphtliQQTPTLHPQRLPSPHPPLQpMTQPPPPSQVSpqplpqpsLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQ 303
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1917203715 1109 NGLTLGSTGPQLTV--------HHRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKP 1164
Cdd:pfam03154 304 SSQSQVPPGPSPAApgqsqqriHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLP 367
|
|
| PTZ00341 |
PTZ00341 |
Ring-infected erythrocyte surface antigen; Provisional |
321-578 |
1.94e-04 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 45.93 E-value: 1.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 321 GADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIitelalgEDAISSSMEIDQGEKNED 400
Cdd:PTZ00341 897 GGGKKDKKAKKKDAKDLSGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDA-------EENVEEDAEENVEENVEE 969
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 401 ETSADLVETINENViEDNKSENILENTDSMETDEIIPILEklapsedeltcfsktsllpiDETNPDLEEKMESSFGSPSK 480
Cdd:PTZ00341 970 NVEENVEENVEENV-EENVEENVEENVEENVEENIEENVE--------------------ENVEENIEENVEEYDEENVE 1028
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 481 QESSESLPKEAFLVLSDEEDIsgEKDESEVISQN----ETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1029 EVEENVEEYDEENVEEIEENA--EENVEENIEENieeyDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEEN 1106
|
250 260
....*....|....*....|..
gi 1917203715 557 SKSEDMDNVQSKRRRYMEEEYE 578
Cdd:PTZ00341 1107 AEENAEENAEENAEEYDDENPE 1128
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
976-1162 |
2.32e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.83 E-value: 2.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 976 SGASQDPKKLNHTPVSTMSSSQPVsrPLQPIQPAP------PLQPSGVPTsgPSQT-TIHLLPTAPTTVNVTHRPV---- 1044
Cdd:PHA03378 603 SQTPEPPTTQSHIPETSAPRQWPM--PLRPIPMRPlrmqpiTFNVLVFPT--PHQPpQVEITPYKPTWTQIGHIPYqpsp 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1045 TQVTTRLPVPRAPANHQvvyttlpaPPAQAPLRGTVMQAPAVRQVNPQNSVT-VRVPQTTTYVVN--NGLTLGSTGPQLT 1121
Cdd:PHA03378 679 TGANTMLPIQWAPGTMQ--------PPPRAPTPMRPPAAPPGRAQRPAAATGrARPPAAAPGRARppAAAPGRARPPAAA 750
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 1917203715 1122 -VHHRPPQVHTEPPRPVHPAPLPEAPQPQ-------RLPPEAASTSLPQ 1162
Cdd:PHA03378 751 pGRARPPAAAPGRARPPAAAPGAPTPQPPpqappapQQRPRGAPTPQPP 799
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
366-663 |
2.84e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.44 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 366 ENEKKVEEDIITELALGEDAISSSMEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDS--METDEIIPILEKLA 443
Cdd:PRK03918 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevKELEELKEEIEELE 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 444 PSEDELTCFSKTSLLPIDETNPDLEEKMESSFGSPSKQESSESLPKEA--------FLVLSDEEDISGEKDESEVISQ-- 513
Cdd:PRK03918 245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAeeyiklseFYEEYLDELREIEKRLSRLEEEin 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 514 --NETCSPAEvESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEEEYEAEFQVKITAKGDI 591
Cdd:PRK03918 325 giEERIKELE-EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1917203715 592 NQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECNKRH-KTVLTELQAKIARLTKR---FEAAKEDLKKR 663
Cdd:PRK03918 404 EEEISKITARIgeLKKEIKELKKAIEELKKAKGKCPVCGRELTEEHrKELLEEYTAELKRIEKElkeIEEKERKLRKE 481
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
317-600 |
3.35e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.13 E-value: 3.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 317 LEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTicsdRPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL----KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 397 KN-------EDETSADLVETINENVIEDNKSENilENTDSMETDEIIPilEKLAPSEDELTCFSKTSLLPIDETNPDLEE 469
Cdd:PTZ00121 1756 KKkiahlkkEEEKKAEEIRKEKEAVIEEELDEE--DEKRRMEVDKKIK--DIFDNFANIIEGGKEGNLVINDSKEMEDSA 1831
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 470 KMESSFGSPSKQESSESLPKEAFlvlsDEEDISGEKDESEVISqnetcspaeveSNEKDNKPEEEEQVIHEDDERPSEKN 549
Cdd:PTZ00121 1832 IKEVADSKNMQLEEADAFEKHKF----NKNNENGEDGNKEADF-----------NKEKDLKEDDEEEIEEADEIEKIDKD 1896
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1917203715 550 EFSRRKRSKSEDMDNVQSKRRRYMEEEYEaefqvkitaKGDINQKLQKVIQ 600
Cdd:PTZ00121 1897 DIEREIPNNNMAGKNNDIIDDKLDKDEYI---------KRDAEETREEIIK 1938
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
126-252 |
1.66e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.67 E-value: 1.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 126 PAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLS---SG 202
Cdd:PRK07764 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQaaqGA 728
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1917203715 203 DPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASD 252
Cdd:PRK07764 729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1001-1164 |
2.21e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1001 RPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPttvnvtHRPVTQVttrlPVPRAPANHQVVYTTLPAPPAQAPLRgtv 1080
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDT------HAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRD--- 2654
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1081 mqAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGP---QLTVHHRPPqvhtEPPRPVHPAPLPEAPQ-PQRLPPEAA 1156
Cdd:PHA03247 2655 --DPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvgSLTSLADPP----PPPPTPEPAPHALVSAtPLPPGPAAA 2728
|
....*...
gi 1917203715 1157 STSLPQKP 1164
Cdd:PHA03247 2729 RQASPALP 2736
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
324-662 |
2.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168 684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168 753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168 821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168 895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1917203715 633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168 971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
860-1164 |
2.71e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.87 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 860 TASAAPLGTTLAVQAvpTAHSIVQATRTSLPTVGPSGlySPSTNRGpiqmkipiSAFSTSSAAEQNSNTTPRIEN-QTNK 938
Cdd:pfam17823 87 TAEHTPHGTDLSEPA--TREGAADGAASRALAAAASS--SPSSAAQ--------SLPAAIAALPSEAFSAPRAAAcRANA 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 939 TIDASVSKKAADSTSQCGKATGSDSSGVidlTMDDEESGASQDPKKLNHTPVSTMSSSQPVS-RPLQPIQPAPPLQPSGV 1017
Cdd:pfam17823 155 SAAPRAAIAAASAPHAASPAPRTAASST---TAASSTTAASSAPTTAASSAPATLTPARGIStAATATGHPAAGTALAAV 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1018 PTSGPSQTTIHL---------LPTAPTTVNVTHRPVTQVTTRLPVPR--APANHQVVYTTL--PAPPAQAPLRGTVMQAP 1084
Cdd:pfam17823 232 GNSSPAAGTVTAavgtvtpaaLATLAAAAGTVASAAGTINMGDPHARrlSPAKHMPSDTMArnPAAPMGAQAQGPIIQVS 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1085 AVRQV-------NPQNSVTVRVPQTTTYVVNNGLTLGSTgpqLTVHHRPPQVHTEPPRPVHPAPLPEA----PQPQRLPP 1153
Cdd:pfam17823 312 TDQPVhntagepTPSPSNTTLEPNTPKSVASTNLAVVTT---TKAQAKEPSASPVPVLHTSMIPEVEAtsptTQPSPLLP 388
|
330
....*....|...
gi 1917203715 1154 E--AASTSLPQKP 1164
Cdd:pfam17823 389 TqgAAGPGILLAP 401
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
994-1170 |
3.16e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 3.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 994 SSSQPVSRPLQPIQPAPPLqPSgvPTSGPSQTTIHLLPT-APTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPAPPA 1072
Cdd:PRK10263 328 TATQSWAAPVEPVTQTPPV-AS--VDVPPAQPTVAWQPVpGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQ 404
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 1073 QAPLRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTVHHRPPQVH---TEPPRPVHPAPLPEAPQPq 1149
Cdd:PRK10263 405 QPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYqteQTYQQPAAQEPLYQQPQP- 483
|
170 180
....*....|....*....|.
gi 1917203715 1150 rLPPEAASTSLPQKPHLKLAR 1170
Cdd:PRK10263 484 -VEQQPVVEPEPVVEETKPAR 503
|
|
| PHA02664 |
PHA02664 |
hypothetical protein; Provisional |
126-282 |
3.68e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 177447 Cd Length: 534 Bit Score: 41.52 E-value: 3.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 126 PAEP----VSGDPApgdLDAGDPASGVLASGDSTSGdptssePSSSDAASGDATSgdapsgdvSPGDATSGDATADDLSS 201
Cdd:PHA02664 368 PAEPaalfVDGNEV---IAAGAAAAMIAAAERAANG------ARGSPMAAPEEGR--------AAAAAAAANAPADQDVE 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 202 GDPtSSDPIPGEPVPVEPISGDCAADDIASSEIT-----------SVDLASGAPASTDPASDDLASGDLSSSELASDDLA 270
Cdd:PHA02664 431 AEA-HDEFDQDPGAPAHADRADSDEDDMDEQESGderadgeddsdSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGG 509
|
170
....*....|..
gi 1917203715 271 TGELASDELTSE 282
Cdd:PHA02664 510 VGQAIEEEEEEE 521
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
302-662 |
3.86e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 41.54 E-value: 3.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 302 EIDNIEPSSNKDDdfleKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRppENEKKVEEDIITELAL 381
Cdd:TIGR04523 125 ELNKLEKQKKENK----KNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK--LNIQKNIDKIKNKLLK 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 382 GEDAISSSMEIDQGEK-------NEDETSADLVETINENVIEDNKSENILENTDSM---ETDEIIPILEKLAPSEDELTC 451
Cdd:TIGR04523 199 LELLLSNLKKKIQKNKslesqisELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQlnqLKDEQNKIKKQLSEKQKELEQ 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 452 FSKTsllpIDETNPDLEE-KMESsfgSPSKQESSESLPKEaflVLSDEEDISGEKDESE--------VISQ-NETCSPAE 521
Cdd:TIGR04523 279 NNKK----IKELEKQLNQlKSEI---SDLNNQKEQDWNKE---LKSELKNQEKKLEEIQnqisqnnkIISQlNEQISQLK 348
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 522 VESNEKDNKPEEEEQVIHE-DDERPSEKNEfsrrKRSKSEDMDNVQSKRRRY-----MEEEYEAEFQVKITAKGDINQKL 595
Cdd:TIGR04523 349 KELTNSESENSEKQRELEEkQNEIEKLKKE----NQSYKQEIKNLESQINDLeskiqNQEKLNQQKDEQIKKLQQEKELL 424
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1917203715 596 QKVIQWLLEEKLcalqcaVFDKTLAELKTR--VEKIECNKrHKTVLTELQAKIARLTKRFEAAKEDLKK 662
Cdd:TIGR04523 425 EKEIERLKETII------KNNSEIKDLTNQdsVKELIIKN-LDNTRESLETQLKVLSRSINKIKQNLEQ 486
|
|
| MDN1 |
COG5271 |
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ... |
80-552 |
4.01e-03 |
|
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444083 [Multi-domain] Cd Length: 1028 Bit Score: 41.54 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271 274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271 354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271 434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271 514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271 594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271 674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748
|
.
gi 1917203715 552 S 552
Cdd:COG5271 749 A 749
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
992-1091 |
4.64e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 41.30 E-value: 4.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 992 TMSSSQPVSRPLQPI--QPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTTRLPVPRApanhqvvyttLPA 1069
Cdd:PRK14971 368 DASGGRGPKQHIKPVftQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA----------VPV 437
|
90 100
....*....|....*....|..
gi 1917203715 1070 PPAQAPLRGTVMQAPAVRQVNP 1091
Cdd:PRK14971 438 NPPSTAPQAVRPAQFKEEKKIP 459
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
121-313 |
5.09e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 41.37 E-value: 5.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 121 PVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPtSSEPSSSDAASGDATSGDAPSGDVSPgdATSGDATADDLS 200
Cdd:PRK07003 368 PGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAA-GAALAPKAAAAAAATRAEAPPAAPAP--PATADRGDDAAD 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 201 SGDPTSSDpipgEPVPVEPisgDCAADDIASSEITSVDLASGAPASTDPASddlASGDLSSSELASDDLATGELASDELT 280
Cdd:PRK07003 445 GDAPVPAK----ANARASA---DSRCDERDAQPPADSGSASAPASDAPPDA---AFEPAPRAAAPSAATPAAVPDARAPA 514
|
170 180 190
....*....|....*....|....*....|...
gi 1917203715 281 SESTFDRtfepkSVPVCEPVPEIDNIEPSSNKD 313
Cdd:PRK07003 515 AASREDA-----PAAAAPPAPEARPPTPAAAAP 542
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
655-1076 |
7.14e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.92 E-value: 7.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 655 AAKEDLKKRHEHPPNPPVSPGKTVNDVNSNNNMSYRNAGTVRQMLESKRNVSESAPPSFQTPVNTvSSTNLVTPPAVVSS 734
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPT-PPPASPPPSPAPDL 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 735 QPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVilHVPVAVSSQPQLLQSHPGTLVTNQPSGNV 814
Cdd:PHA03307 135 SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA--RAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 815 EFISVQSPPtvsgltknpvslPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTlavqAVPTAHSIVQATR--TSLPTV 892
Cdd:PHA03307 213 ISASASSPA------------PAPGRSAADDAGASSSDSSSSESSGCGWGPENEC----PLPRPAPITLPTRiwEASGWN 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 893 GPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmd 972
Cdd:PHA03307 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGP--APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS-------- 346
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 973 DEESGASQDPkklnhtPVSTMSSSQPVSRPLQPIQPAPPlQPSGVPTSgpsqttihllPTAPTTVNVTHRPvTQVTTRLP 1052
Cdd:PHA03307 347 PSRSPSPSRP------PPPADPSSPRKRPRPSRAPSSPA-ASAGRPTR----------RRARAAVAGRARR-RDATGRFP 408
|
410 420
....*....|....*....|....
gi 1917203715 1053 VPRAPANHQVVYTTLPAPPAQAPL 1076
Cdd:PHA03307 409 AGRPRPSPLDAGAASGAFYARYPL 432
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
532-665 |
9.66e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.51 E-value: 9.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1917203715 532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340 29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1917203715 610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340 102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
|
|
|