Conserved domains on  [gi|578837041|ref|XP_006724242|]

calcineurin-binding protein cabin-1 isoform X3 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2162-2196 8.71e-16

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.



Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.58  E-value: 8.71e-16
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 578837041  2162 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2196
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
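As a rough check on the alignment above, the following Python sketch counts identical columns between the query row and the pfam09047 row. The function name `percent_identity` is an illustrative assumption, and the calculation simply weights every column equally; it does not reproduce how CD-Search derives the bit score.

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical columns, aligned columns, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    # Compare case-insensitively and skip columns where either row has a gap.
    pairs = [(q.upper(), s.upper()) for q, s in zip(query, subject)
             if q != "-" and s != "-"]
    if not pairs:
        raise ValueError("no aligned columns to compare")
    identical = sum(1 for q, s in pairs if q == s)
    return identical, len(pairs), 100.0 * identical / len(pairs)


# Rows copied from the pfam09047 alignment (query residues 2162-2196).
QUERY = "TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL"
MODEL = "TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL"

identical, aligned, pct = percent_identity(QUERY, MODEL)
print(f"{identical}/{aligned} identical ({pct:.1f}%)")
```

For these two rows the sketch reports 32 identical positions out of 35; the three differences are S/N, R/K and E/D substitutions.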
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 2.36e-11

Tetratricopeptide (TPR) repeat [General function prediction only];



Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 66.18  E-value: 2.36e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 578837041  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
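Each hit carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) ahead of its alignment. The sketch below shows one way such a line might be parsed and how much of the model the aligned span covers. The regular expression and the names `parse_summary` and `model_coverage` are assumptions made for illustration, not part of any NCBI tool, and the coverage figure is span-based, so it ignores gaps inside the alignment.

```python
import re

# Pattern for the per-hit summary line as it appears in this page dump.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (inclusive ends)."""
    return (model_end - model_start + 1) / cd_length


line = "Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 66.18  E-value: 2.36e-11"
hit = parse_summary(line)
# The COG0457 rows above run from model position 2 to 157.
coverage = model_coverage(2, 157, hit["cd_length"])
print(hit, f"coverage={coverage:.2f}")
```

Here the alignment spans 156 of the 245 model positions, so the query matches only part of the TPR model.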
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1948-2200 1.58e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 1.58e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1948 GAAAQRQASGDTPTTPKHPKDSRENFfpvtVVPTAPDPVPADSvQRPSDAHTKPRPAlaaATTIITCPPSASASTLDQSK 2027
Cdd:PHA03247 2583 TSRARRPDAPPQSARPRAPVDDRGDP----RGPAPPSPLPPDT-HAPDPPPPSPSPA---ANEPDPHPPPTVPPPERPRD 2654
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2028 DPGPPRPHRP-----EATPSMASLGPEG-------EELARVAEGTSFPPQEPRHSPQvkmAPTSSPAEPhcWPAEAALGT 2095
Cdd:PHA03247 2655 DPAPGRVSRPrrarrLGRAAQASSPPQRprrraarPTVGSLTSLADPPPPPPTPEPA---PHALVSATP--LPPGPAAAR 2729
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2096 GAEPTCSQAASSKAPSSGSAQPpeghpgkPEPSRAKSRPLPNMPklviPSAAtkfPPEITVTPPTPTLLSPKGSISEETK 2175
Cdd:PHA03247 2730 QASPALPAAPAPPAVPAGPATP-------GGPARPARPPTTAGP----PAPA---PPAAPAAGPPRRLTRPAVASLSESR 2795
                         250       260
                  ....*....|....*....|....*
gi 578837041 2176 QKLKSAILSAQSAANVRKESLCQPA 2200
Cdd:PHA03247 2796 ESLPSPWDPADPPAAVLAPAAALPP 2820
 
MEF2_binding cd13839
Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ...
2162-2196 5.11e-14

Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the myocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.


Pssm-ID: 260103 [Multi-domain]  Cd Length: 35  Bit Score: 67.79  E-value: 5.11e-14
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 578837041 2162 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2196
Cdd:cd13839     1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1925-2189 2.63e-08

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 59.54  E-value: 2.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  1925 HLPVKVDEEAALEQAVKFCQVHLGAAAQrQASGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPR 2002
Cdd:pfam05109  453 HVPTNLTAPASTGPTVSTADVTSPTPAG-TTSGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPT 531
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2003 PALAAATTIITCPPSASASTLDQSKDPGP----PRPH------------------RPEAT-PSMASLGPEGEELARVAEG 2059
Cdd:pfam05109  532 PNATSPTLGKTSPTSAVTTPTPNATSPTPavttPTPNatiptlgktsptsavttpTPNATsPTVGETSPQANTTNHTLGG 611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2060 TSFPP---QEPRH----------------SPQVKMAPTS-----SPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSA 2115
Cdd:pfam05109  612 TSSTPvvtSPPKNatsavttgqhnitsssTSSMSLRPSSisetlSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTST 691
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578837041  2116 QPPEGHPGKPEPSRAKSRPLPNMpklviPSAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2189
Cdd:pfam05109  692 HHVSTSSPAPRPGTTSQASGPGN-----SSTSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1910-2161 1.66e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.69  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1910 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1987
Cdd:NF033839  229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1988 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2067
Cdd:NF033839  298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2068 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSsgsAQPPEGHPgKPEPSRAKSRPLPNMPKLVIPSAA 2147
Cdd:NF033839  349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPE---KPKPEVKP-QPEKPKPEVKPQPEKPKPEVKPQP 424
                         250
                  ....*....|....
gi 578837041 2148 TKFPPEITVTPPTP 2161
Cdd:NF033839  425 EKPKPEVKPQPEKP 438
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
2050-2156 6.63e-04

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived and differ in having an extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 44.34  E-value: 6.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2050 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQAASSKAPSsgSAQPPEGHPG------ 2123
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAKEHGID--LSAVPGTGVTgrvtke 143
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 578837041  2124 ---KPEPSRAKSRPLPNMPKLVIPSAATKfpPEITV 2156
Cdd:TIGR01347  144 diiKKTEAPASAQPPAAAAAAAAPAAATR--PEERV 177
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 1.85e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 1.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 578837041   116 AVML 119
Cdd:pfam13424   71 ALAL 74
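To compare the hits reported so far, the sketch below keeps those under an E-value cutoff and lists which ones overlap the MEF2-binding interval on the query. The `Hit` record, the helper names, and the `1e-5` cutoff are illustrative assumptions rather than CD-Search settings; the intervals and E-values are copied from the hit rows above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int      # first query residue of the hit
    end: int        # last query residue of the hit
    e_value: float


HITS = [
    Hit("MEF2_binding", "pfam09047", 2162, 2196, 8.71e-16),
    Hit("MEF2_binding", "cd13839", 2162, 2196, 5.11e-14),
    Hit("TPR", "COG0457", 30, 205, 2.36e-11),
    Hit("PHA03247", "PHA03247", 1948, 2200, 1.58e-09),
    Hit("Herpes_BLLF1", "pfam05109", 1925, 2189, 2.63e-08),
    Hit("PspC_subgroup_2", "NF033839", 1910, 2161, 1.66e-04),
    Hit("sucB", "TIGR01347", 2050, 2156, 6.63e-04),
    Hit("TPR_12", "pfam13424", 36, 119, 1.85e-03),
]


def filter_hits(hits, max_e_value):
    """Keep hits at or below the cutoff, best (smallest) E-value first."""
    return sorted((h for h in hits if h.e_value <= max_e_value),
                  key=lambda h: h.e_value)


def overlaps(a: Hit, b: Hit) -> bool:
    """True when two inclusive query intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


strong = filter_hits(HITS, max_e_value=1e-5)   # hypothetical cutoff
mef2 = HITS[0]
sharing = [h.accession for h in HITS if h is not mef2 and overlaps(h, mef2)]

print([h.accession for h in strong])
print(sharing)
```

With this cutoff, five hits remain. Three other hits (cd13839, PHA03247 and pfam05109) share residues with the MEF2-binding interval 2162-2196.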
 
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
34-199 4.61e-10

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 62.33  E-value: 4.61e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457    25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457    92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
                         170
                  ....*....|....
gi 578837041  186 EKDCRYSKGLVLKE 199
Cdd:COG0457   172 AAALAALLAAALGE 185
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
30-188 2.28e-09

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 62.70  E-value: 2.28e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914    72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578837041  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914   132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
30-205 8.37e-09

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 58.97  E-value: 8.37e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956    70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956   150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
                         170       180       190
                  ....*....|....*....|....*....|
gi 578837041  176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956   230 EALELLRKALELDPSDDLLLALADLLERKE 259
PHA03247 PHA03247
large tegument protein UL36; Provisional
1960-2166 4.74e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 58.80  E-value: 4.74e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1960 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 2038
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2039 ATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSsgSAQPP 2118
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--RLTRP 2786
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 578837041 2119 EGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSP 2166
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
33-188 4.80e-08

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 54.04  E-value: 4.80e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783     1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 578837041  113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783    61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
PHA03247 PHA03247
large tegument protein UL36; Provisional
1980-2162 1.77e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 1.77e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1980 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPP-SASAST-LDQSKD--------PGPPRPHRPEATPSMASlgPE 2049
Cdd:PHA03247 2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPqSARPRApVDDRGDprgpappsPLPPDTHAPDPPPPSPS--PA 2634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2050 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGT-GAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPS 2128
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHA 2714
                         170       180       190
                  ....*....|....*....|....*....|....
gi 578837041 2129 RAKSRPLPNMPKlvIPSAATKFPPEITVTPPTPT 2162
Cdd:PHA03247 2715 LVSATPLPPGPA--AARQASPALPAAPAPPAVPA 2746
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1949-2189 1.86e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 56.12  E-value: 1.86e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  1949 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 2024
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2025 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVAEGTSFPPQEPRHSPQVKMAP-------------------TSSPAEP 2084
Cdd:pfam17823  199 ASSAPATLTPARGISTAATATGHPAaGTALAAVGNSSPAAGTVTAAVGTVTPAAlatlaaaagtvasaagtinMGDPHAR 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2085 HCWPAEAAlgtgaePTCSQAASSKAPSSGSAQ----------PPEGHPGKPEPSRAKSRPLPNMPKLVIPS--------- 2145
Cdd:pfam17823  279 RLSPAKHM------PSDTMARNPAAPMGAQAQgpiiqvstdqPVHNTAGEPTPSPSNTTLEPNTPKSVASTnlavvtttk 352
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 578837041  2146 AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2189
Cdd:pfam17823  353 AQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
PHA03247 PHA03247
large tegument protein UL36; Provisional
1782-2166 2.30e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.49  E-value: 2.30e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1782 SPRagPTEPMDTSEATvchsdlertPPLLPGRPARDRGPESRPTelsleelsiSARQQPTPLTPAQPAPAPAPATTTGTR 1861
Cdd:PHA03247 2574 APR--PSEPAVTSRAR---------RPDAPPQSARPRAPVDDRG---------DPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1862 AGGHPEEPLSRLSRKRKLLEDTESGKTLLLdAYRVWQQGQKGVAYDLGRVER------IMSETYMLIKQHLPVKVDEEAA 1935
Cdd:PHA03247 2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSR-PRRARRLGRAAQASSPPQRPRrraarpTVGSLTSLADPPPPPPTPEPAP 2712
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1936 lEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCP 2015
Cdd:PHA03247 2713 -HALVSATPLPPGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2016 PSASASTLDQSKdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGT 2095
Cdd:PHA03247 2787 AVASLSESRESL-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAP 2858
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837041 2096 GAE-----PTCSQAASSKAPSsgsaQPPEGHPGKPEPSRA-KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSP 2166
Cdd:PHA03247 2859 GGDvrrrpPSRSPAAKPAAPA----RPPVRRLARPAVSRStESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP 2931
PHA03247 PHA03247
large tegument protein UL36; Provisional
1949-2200 3.50e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 3.50e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1949 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 2021
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2022 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA------PTSSPAEPHCWPAEAALG 2094
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpavsrSTESFALPPDQPERPPQP 2911
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2095 TGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPN-MPKLVIPSAATKFPPEITVT-----PPTPTLLSPKG 2168
Cdd:PHA03247 2912 QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEpSGAVPQPWLGALVPGRVAVPrfrvpQPAPSREAPAS 2991
                         250       260       270
                  ....*....|....*....|....*....|..
gi 578837041 2169 SISEETKQKLkSAILSAQSAANVRKESLCQPA 2200
Cdd:PHA03247 2992 STPPLTGHSL-SRVSSWASSLALHEETDPPPV 3022
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1954-2159 9.84e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 54.11  E-value: 9.84e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1954 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 2031
Cdd:PRK12323  367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2032 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPS 2111
Cdd:PRK12323  445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 578837041 2112 SGSAQPpegHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2159
Cdd:PRK12323  522 VAESIP---DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
38-188 2.45e-06

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 49.57  E-value: 2.45e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010     2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578837041  118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010    82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
99-188 4.04e-06

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 47.09  E-value: 4.04e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063     1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
                          90
                  ....*....|
gi 578837041  179 YFICKALEKD 188
Cdd:COG3063    80 AYLERALELD 89
PHA03379 PHA03379
EBNA-3A; Provisional
1954-2199 6.10e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 6.10e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1954 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 2022
Cdd:PHA03379  407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2023 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2097
Cdd:PHA03379  481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2098 EPTCS-QAASSKAPSSGSAQPPEghpgKPEPSRAKSRPLPNMPKLVIPSAATKF-PPEITVTPPTPTLLSPKGSISEETK 2175
Cdd:PHA03379  559 ETSGIvRVRERWRPAPWTPNPPR----SPSQMSVRDRLARLRAEAQPYQASVEVqPPQLTQVSPQQPMEYPLEPEQQMFP 634
                         250       260       270
                  ....*....|....*....|....*....|.
gi 578837041 2176 QKLKSAILSAQSAANV-------RKESLCQP 2199
Cdd:PHA03379  635 GSPFSQVADVMRAGGVpamqpqyFDLPLQQP 665
PHA03378 PHA03378
EBNA-3B; Provisional
1958-2202 7.60e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.22  E-value: 7.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1958 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 2031
Cdd:PHA03378  607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2032 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSP-------AEPHCWPAEAALGTGAEPtcSQA 2104
Cdd:PHA03378  682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPgrarppaAAPGRARPPAAAPGRARP--PAA 759
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2105 ASSKAPSSGSA-------QPPEGHPGKPEPSRAKSRPLPnmpklvipsaatkfPPEItvtPPTPTLLSPKGSISEETKQK 2177
Cdd:PHA03378  760 APGRARPPAAApgaptpqPPPQAPPAPQQRPRGAPTPQP--------------PPQA---GPTSMQLMPRAAPGQQGPTK 822
                         250       260
                  ....*....|....*....|....*
gi 578837041 2178 LKSAILSAQSAANVRKESLCQPALE 2202
Cdd:PHA03378  823 QILRQLLTGGVKRGRPSLKKPAALE 847
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1949-2192 9.08e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 9.08e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1949 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSdahtkPRPALAAATtiitcPPSASASTLDQSKD 2028
Cdd:PHA03307  175 PLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPA-----PAPGRSAAD-----DAGASSSDSSSSES 244
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2029 PGP---PRPHRPEATPSM-ASLGPEGEELARVAEGTSFPPQEPRHSPQVKmAPTSSPAEPHCWPAEAALGTGAEPTCSQA 2104
Cdd:PHA03307  245 SGCgwgPENECPLPRPAPiTLPTRIWEASGWNGPSSRPGPASSSSSPRER-SPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2105 ASSKAPSSGSAQP-PEGHPGKPEPSRAKSRPLPNmpklviPSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAI 2182
Cdd:PHA03307  324 SSSSSTSSSSESSrGAAVSPGPSPSRSPSPSRPP------PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGR 397
                         250
                  ....*....|
gi 578837041 2183 LSAQSAANVR 2192
Cdd:PHA03307  398 ARRRDATGRF 407
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1949-2190 1.20e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 50.62  E-value: 1.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1949 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 2028
Cdd:PRK07003  395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2029 PGPPRPHRPEATPSMASLGPegeelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSK 2108
Cdd:PRK07003  468 AQPPADSGSASAPASDAPPD--------------AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEAR 533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2109 APSSGSAQPPEGHPGKPEP-----------SRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSiseeTKQK 2177
Cdd:PRK07003  534 PPTPAAAAPAARAGGAAAAldvlrnagmrvSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAAT----GDAP 609
                         250
                  ....*....|...
gi 578837041 2178 LKSAILSAQSAAN 2190
Cdd:PRK07003  610 PNGAARAEQAAES 622
NlpI COG4785
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
29-174 1.29e-05

Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443815 [Multi-domain]  Cd Length: 223  Bit Score: 48.76  E-value: 1.29e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785     8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578837041  102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785    85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1948-2173 1.42e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.55  E-value: 1.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1948 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 2027
Cdd:PHA03307   76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2028 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2097
Cdd:PHA03307  143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837041 2098 EPTCSQAAS-SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPeitvtPPTPTLLSPKGSISEE 2173
Cdd:PHA03307  223 APGRSAADDaGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGP-----SSRPGPASSSSSPRER 294
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
95-174 1.89e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 46.15  E-value: 1.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235    22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1993-2174 2.13e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.87  E-value: 2.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1993 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2068
Cdd:PRK12323  364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2069 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPklvipsaat 2148
Cdd:PRK12323  438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELP--------- 503
                         170       180
                  ....*....|....*....|....*.
gi 578837041 2149 kfpPEITVTPPTPTLLSPKGSISEET 2174
Cdd:PRK12323  504 ---PEFASPAPAQPDAAPAGWVAESI 526
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1943-2163 2.77e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 2.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1943 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASAST 2022
Cdd:PRK07764  582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAP--GVAAPEHHPK 655
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2023 LDQSKDPGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPT-C 2101
Cdd:PRK07764  656 HVAVPDASDGGDGWPAKAGGAAPAAPPPA-----------PAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPqA 724
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 578837041 2102 SQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2163
Cdd:PRK07764  725 AQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PHA03269 PHA03269
envelope glycoprotein C; Provisional
1983-2143 2.95e-05

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 49.34  E-value: 2.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1983 PDPVPADSVQrpsdAHTKPRPALAAATTIITCPPSASASTLDQSKDPGP-PRPHrpeatpSMASLGPEgeelarvaegts 2061
Cdd:PHA03269   40 PDPAPAPHQA----ASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPaPAPH------QAASRAPD------------ 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2062 fppqePRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQAASSKAPSsgsaqpPEGHPGKPEPSRAKSRPLPNMPKL 2141
Cdd:PHA03269   98 -----PAVAPQLAAAPKPDAAEA---FTSAAQAHEAPADAGTSAASKKPD------PAAHTQHSPPPFAYTRSMEHIACT 163

                  ..
gi 578837041 2142 VI 2143
Cdd:PHA03269  164 HG 165
PHA02682 PHA02682
ORF080 virion core protein; Provisional
1975-2081 3.40e-05

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 47.93  E-value: 3.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1975 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 2045
Cdd:PHA02682   76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 578837041 2046 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2081
Cdd:PHA02682  156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
1-155 4.03e-05

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 46.11  E-value: 4.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041    1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010    21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578837041   81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010    84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1977-2167 5.85e-05

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 48.28  E-value: 5.85e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1977 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 2056
Cdd:PRK14086   91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2057 AEGTSFPPQEPRH---SPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQ-----AASSKAPSSGSAQPPEGHPGKPEPS 2128
Cdd:PRK14086  160 RAADDYGWQQQRLgfpPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRdydhpRPDWDRPRRDRTDRPEPPPGAGHVH 239
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 578837041 2129 RAkSRPLPNMPKLVIPSAATKFP------PEITVTPPTPTL-LSPK 2167
Cdd:PRK14086  240 RG-GPGPPERDDAPVVPIRPSAPgplaaqPAPAPGPGEPTArLNPK 284
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1949-2119 6.98e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 6.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1949 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 2028
Cdd:PRK07764  622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2029 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2102
Cdd:PRK07764  701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
                         170
                  ....*....|....*..
gi 578837041 2103 QAASSKAPSSGSAQPPE 2119
Cdd:PRK07764  781 EEEEMAEDDAPSMDDED 797
PHA03381 PHA03381
tegument protein VP22; Provisional
1966-2109 8.13e-05

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 46.93  E-value: 8.13e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1966 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 2034
Cdd:PHA03381   11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 578837041 2035 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQAASSKA 2109
Cdd:PHA03381   91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQKTPA 167
PRK13863 PRK13863
T-DNA border endonuclease VirD2;
1944-2133 8.72e-05

T-DNA border endonuclease VirD2;


Pssm-ID: 237533 [Multi-domain]  Cd Length: 446  Bit Score: 47.63  E-value: 8.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1944 QVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTA-----PDPVPADSVQRPSDAHTKPRPALAAATtiitcppsa 2018
Cdd:PRK13863  258 EVRLQEPAGSSIKADARIRVSLESERRAQPSASKIPVADdfgieTSYVAEGDVRKLEGNSGTPRLATEVAT--------- 328
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2019 sASTLDQSkdPGPPRPHRPEATPSMASLGpegeELARVAEGTSFPPQEP--RHSPQVKMAPTSSPAEPHCwPAEAALGTG 2096
Cdd:PRK13863  329 -HTTSERQ--QRRKRPRDDEGEPSGAKRT----RLNGIAVGPEANAGEQdgRDDPITSPAQPPRSNPLAD-PVRASIATD 400
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 578837041 2097 AEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2133
Cdd:PRK13863  401 SLPATADRQQQREPSSKRPRDDDGEPSIRKRARDGRS 437
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1963-2166 1.05e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 1.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1963 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 2030
Cdd:PLN03209  330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2031 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSS-----PAEPHCWPAEAALGTGA 2097
Cdd:PLN03209  409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSstssvPAVPDTAPATAATDAAA 488
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2098 EPTCSQAASSKAPSSGSAQPPEG-HPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPEITVTPPTPTLLSP 2166
Cdd:PLN03209  489 PPPANMRPLSPYAVYDDLKPPTSpSPAAPVGKVAPSST-NEVVKVGNSAPPTALADEQHHAQPKPRPLSP 557
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
1976-2161 1.56e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 46.91  E-value: 1.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1976 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSA-----SASTLDQSKDPGPPRPhRPEATPSMASLGPEG 2050
Cdd:PRK12727   62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAaedmiAAMALRQPVSVPRQAP-AAAPVRAASIPSPAA 140
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2051 EELARVAEGTSFPPQE----PRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPG-KP 2125
Cdd:PRK12727  141 QALAHAAAVRTAPRQEhalsAVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAYAQDDDEQLDDDGfDL 220
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 578837041 2126 EPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2161
Cdd:PRK12727  221 DDALPQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1910-2161 1.66e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 46.69  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1910 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvp 1987
Cdd:NF033839  229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-- 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1988 adsvqrpsdaHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2067
Cdd:NF033839  298 ----------GMQPSPQPEKK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2068 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSsgsAQPPEGHPgKPEPSRAKSRPLPNMPKLVIPSAA 2147
Cdd:NF033839  349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPE---KPKPEVKP-QPEKPKPEVKPQPEKPKPEVKPQP 424
                         250
                  ....*....|....
gi 578837041 2148 TKFPPEITVTPPTP 2161
Cdd:NF033839  425 EKPKPEVKPQPEKP 438
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1981-2169 1.84e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 1.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1981 TAPDPVPAdsvqrPSDAHTKPRPALAAATTIITCPP-SASASTLDQSKDPGPPRPHRPEATPSMASLGpegeELARVAEG 2059
Cdd:PHA03307  254 ECPLPRPA-----PITLPTRIWEASGWNGPSSRPGPaSSSSSPRERSPSPSPSSPGSGPAPSSPRASS----SSSSSRES 324
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2060 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHP-------GKPEPSRAKS 2132
Cdd:PHA03307  325 SSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRrraraavAGRARRRDAT 404
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 578837041 2133 RPLPNMPKLVIPSAATKFPPEITVTPPtptLLSPKGS 2169
Cdd:PHA03307  405 GRFPAGRPRPSPLDAGAASGAFYARYP---LLTPSGE 438
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1975-2171 1.92e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.70  E-value: 1.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1975 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 2049
Cdd:PHA03307   25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2050 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAeptcsqAASSKAPSSGSAQPPEGHPGKPEPS 2128
Cdd:PHA03307  104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPP------AASPPAAGASPAAVASDAASSRQAA 173
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|...
gi 578837041 2129 RAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSIS 2171
Cdd:PHA03307  174 LPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISAS 216
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1943-2167 2.19e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.68  E-value: 2.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  1943 CQVHLGAAAQRQASGDTPTTPKHPKDSREnffPVTVVPTAPDPV--PADSVQRPSDAHTKPRPALAAATTIITCPPsasa 2020
Cdd:pfam03154  184 PSPPPPGTTQAATAGPTPSAPSVPPQGSP---ATSQPPNQTQSTaaPHTLIQQTPTLHPQRLPSPHPPLQPMTQPP---- 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2021 stldqskdpgPPRPHRPEATPSMASLGPegeelarvaegtsFPPQEprHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPT 2100
Cdd:pfam03154  257 ----------PPSQVSPQPLPQPSLHGQ-------------MPPMP--HSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837041  2101 CSQAASSKAPSSGSAQPP-EGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2167
Cdd:pfam03154  312 GPSPAAPGQSQQRIHTPPsQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
PHA03247 PHA03247
large tegument protein UL36; Provisional
1948-2135 2.76e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1948 GAAAQRQASGD---TPTTPKHPKDSRENFFPVTvVPTAPDPVPADSVQRPSdahtkprpalaaattiitcPPSASASTLD 2024
Cdd:PHA03247 2860 GDVRRRPPSRSpaaKPAAPARPPVRRLARPAVS-RSTESFALPPDQPERPP-------------------QPQAPPPPQP 2919
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2025 QSKDPGPPRPHRPEATPSMaslgpegeelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEP--TCS 2102
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPR-------------------PQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPrfRVP 2980
                         170       180       190
                  ....*....|....*....|....*....|...
gi 578837041 2103 QAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPL 2135
Cdd:PHA03247 2981 QPAPSREAPASSTPPLTGHSLSRVSSWASSLAL 3013
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1987-2190 3.19e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 3.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1987 PADSVQRPSDAhtkPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR---PEATPSMASLGPEGEELARVAEGTSFP 2063
Cdd:PRK12323  374 PATAAAAPVAQ---PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAaapARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2064 PQEPRHSPqvkmAPTSSPAEPHcwPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGK-PEPSRAKSRPLPNM---- 2138
Cdd:PRK12323  451 APAPAAAP----AAAARPAAAG--PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEfASPAPAQPDAAPAGwvae 524
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 578837041 2139 ----PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2190
Cdd:PRK12323  525 sipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1971-2127 3.26e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 3.26e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1971 ENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAattiitcPPSASASTLDQSKDPGPPrphrPEATPSMASLGP-E 2049
Cdd:PHA03307   62 CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPA-------SPAREGSPTPPGPSSPDP----PPPTPPPASPPPsP 130
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 578837041 2050 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEP 2127
Cdd:PHA03307  131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1927-2119 4.96e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.25  E-value: 4.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1927 PVKVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADS-----VQRPSDAHTKP 2001
Cdd:PRK12323  392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAaapaaAARPAAAGPRP 471
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2002 RPALAAATTIITCPPSASAStldqSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPP--QEPRHSPQVKMAPTS 2079
Cdd:PRK12323  472 VAAAAAAAPARAAPAAAPAP----ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPAtaDPDDAFETLAPAPAA 547
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 578837041 2080 SPAEphcwPAEAALGTGAEPTCSQAASSKAPSSGSAQPPE 2119
Cdd:PRK12323  548 APAP----RAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
92-158 5.80e-04

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 40.92  E-value: 5.80e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837041   92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063    28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1950-2134 6.34e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 6.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1950 AAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPA------DSVQRPSDAHTKPRPALAAATTIITCPPSASASTL 2023
Cdd:PRK12323  393 AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEalaaarQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPV 472
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2024 DQSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsfPPQEPRHSP-QVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2102
Cdd:PRK12323  473 AAAAAAAPARAAPAAAPAPADDDPPPWEEL---------PPEFASPAPaQPDAAPAGWVAESIPDPATADPDDAFETLAP 543
                         170       180       190
                  ....*....|....*....|....*....|..
gi 578837041 2103 QAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2134
Cdd:PRK12323  544 APAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
2050-2156 6.63e-04

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived and differ in having an extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 44.34  E-value: 6.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2050 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQAASSKAPSsgSAQPPEGHPG------ 2123
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAKEHGID--LSAVPGTGVTgrvtke 143
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 578837041  2124 ---KPEPSRAKSRPLPNMPKLVIPSAATKfpPEITV 2156
Cdd:TIGR01347  144 diiKKTEAPASAQPPAAAAAAAAPAAATR--PEERV 177
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1956-2175 7.98e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 7.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1956 SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkPRPALAAATTIITCPpsasasTLDQSKDPGPp 2032
Cdd:PRK10263  295 SGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-QTPPVASVDVPPAQP------TVAWQPVPGP- 366
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2033 rpHRPEatPSMAslgPEGEELARVAEGTSfpPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAePTCSQAASSKAPSS 2112
Cdd:PRK10263  367 --QTGE--PVIA---PAPEGYPQQSQYAQ--PAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYA-PAPEQPAQQPYYAP 436
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578837041 2113 GSAQPPEGHPGKPEPSRAKSRPLPN-MPKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETK 2175
Cdd:PRK10263  437 APEQPVAGNAWQAEEQQSTFAPQSTyQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETK 500
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
2018-2169 8.51e-04

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 44.10  E-value: 8.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2018 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2092
Cdd:PHA03325  259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 578837041 2093 LGTGAEPTCSqAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPsaATKFPPEITVTPPTPTLLSPKGS 2169
Cdd:PHA03325  336 NAEAKEPAQP-ATSTSSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGSP--PLDLTTSLRHMPSPSVTSAPEPP 409
PHA03379 PHA03379
EBNA-3A; Provisional
1924-2170 9.16e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 44.66  E-value: 9.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1924 QHLPVKVDEEAALEQAVK--FCQVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKP 2001
Cdd:PHA03379  436 SHGSAQVPEPPPVHDLEPgpLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWEASLSQ 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2002 RPALAAATTIitcpPSASASTLDQSKDPGPPRPHRPeATPSMASLGP-EGEELARVAE---GTSFPPQEPRHSPQVKMAP 2077
Cdd:PHA03379  516 VPGVAFAPVM----PQPMPVEPVPVPTVALERPVCP-APPLIAMQGPgETSGIVRVRErwrPAPWTPNPPRSPSQMSVRD 590
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2078 TSSPAEPHCWPAEAALGTgaEPTCSQAASSKAPSSGSAQPPEG-HPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITV 2156
Cdd:PHA03379  591 RLARLRAEAQPYQASVEV--QPPQLTQVSPQQPMEYPLEPEQQmFPGSPFSQVADVMRAGGVPAMQPQYFDLPLQQPISQ 668
                         250
                  ....*....|....
gi 578837041 2157 TPPTPTLLSPKGSI 2170
Cdd:PHA03379  669 GAPLAPLRASMGPV 682
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1991-2161 1.65e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 1.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  1991 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPR- 2068
Cdd:pfam03154  174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLPSp 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041  2069 HSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmPKLVIPSAAT 2148
Cdd:pfam03154  246 HPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS-PAAPGQSQQR 324
                          170
                   ....*....|...
gi 578837041  2149 KFPPEITVTPPTP 2161
Cdd:pfam03154  325 IHTPPSQSQLQSQ 337
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
1984-2168 1.79e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 43.43  E-value: 1.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1984 DPVPADSVQRPSDAHTKPRPALAAATTIitcppsASASTLDQSKDPGPP-RPHRPEATPSMASLGPEGEELARVAEGTSF 2062
Cdd:PRK13108  281 APGALRGSEYVVDEALEREPAELAAAAV------ASAASAVGPVGPGEPnQPDDVAEAVKAEVAEVTDEVAAESVVQVAD 354
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2063 PPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlv 2142
Cdd:PRK13108  355 RDGESTPAVEETSEADIEREQPGDLAGQAP----AAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAK-- 428
                         170       180
                  ....*....|....*....|....*.
gi 578837041 2143 ipsaatkfPPEITVTPPTPTLLSPKG 2168
Cdd:PRK13108  429 --------PDELAVAGPGDDPAEPDG 446
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 1.85e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 1.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 578837041   116 AVML 119
Cdd:pfam13424   71 ALAL 74
PHA03291 PHA03291
envelope glycoprotein I; Provisional
1973-2167 3.02e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 42.25  E-value: 3.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1973 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEE 2052
Cdd:PHA03291  203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASR 280
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2053 -----------------LARVAEGTSF-----PPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGT-GAEptcsqaasska 2109
Cdd:PHA03291  281 yeltvtqiiqiaipasiIACVFLGSCAcclhrRCRRRRRRPARIYRPPSPVAPSISAVNEAALARlGDE----------- 349
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578837041 2110 pssgsaqpPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2167
Cdd:PHA03291  350 --------LKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
1927-2134 3.17e-03

DNA topoisomerase 2; Provisional


Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 42.93  E-value: 3.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1927 PVKVDEEAALEQAVKFCQVHLGAAAQrqASGDTPTTPKHPKDSRENFFPVtvvPTAPDPVPADSVQRPSDAHTKPRPAla 2006
Cdd:PLN03237 1255 KEKEEEDEILDLKDRLAAYNLDSAPA--QSAKMEETVKAVPARRAAARKK---PLASVSVISDSDDDDDDFAVEVSLA-- 1327
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2007 aATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLgpEGEELARVAEGTSFPPQEPRhspqvKMAPTSSPAEphc 2086
Cdd:PLN03237 1328 -ERLKKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGQKL--LTEMLKPAEAIGISPEKKVR-----KMRASPFNKK--- 1396
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 578837041 2087 wpAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2134
Cdd:PLN03237 1397 --SGSVLGRAATNKETESSENVSGSSSSEKDEIDVSAKPRPQRANRKQ 1442
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2004-2176 3.17e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 3.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2004 ALAAATTIITCPPSASAstlDQSKDP--GPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHspqvkMAPTSSP 2081
Cdd:PHA03307   13 AAAEGGEFFPRPPATPG---DAADDLlsGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGT-----EAPANES 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2082 AEPHCWPAEAALGTGAEPTCSQAASSkaPSSGSAQPPEGHPGKPEPSRA-------KSRPLPNMPKLVIPSAATKFPPEI 2154
Cdd:PHA03307   85 RSTPTWSLSTLAPASPAREGSPTPPG--PSSPDPPPPTPPPASPPPSPApdlsemlRPVGSPGPPPAASPPAAGASPAAV 162
                         170       180
                  ....*....|....*....|..
gi 578837041 2155 TVTPPTPTLLSPKGSISEETKQ 2176
Cdd:PHA03307  163 ASDAASSRQAALPLSSPEETAR 184
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
34-174 3.57e-03

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 41.64  E-value: 3.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041   34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956     6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578837041  114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956    66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
TPR_21 pfam09976
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
48-151 3.60e-03

Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.


Pssm-ID: 430959 [Multi-domain]  Cd Length: 194  Bit Score: 41.03  E-value: 3.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041    48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976   32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 578837041   115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976  100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
1993-2123 4.48e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 4.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1993 RPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPhRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ 2072
Cdd:PRK14951  365 KPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAA-PAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 578837041 2073 VKMAPTSSPAEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPG 2123
Cdd:PRK14951  444 AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
2002-2200 4.52e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 4.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2002 RPALAAATtiitcPPSASASTldqskdpgpprPHRPEATPSMASlgpegeelarvaegtsfpPQEPRHSPQVKMAPTSSP 2081
Cdd:PRK14951  365 KPAAAAEA-----AAPAEKKT-----------PARPEAAAPAAA------------------PVAQAAAAPAPAAAPAAA 410
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2082 AEPHCWPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAAtkfPPEITVTPPTP 2161
Cdd:PRK14951  411 ASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAA---PAPAAAPAAAR 487
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 578837041 2162 TLLSPKGSISEETKQKLKSAI--------LSAQS---AAN-------VRKESLCQPA 2200
Cdd:PRK14951  488 LTPTEEGDVWHATVQQLAAAEaitalareLALQSelvARDgdqwllrVERESLNQPG 544
PHA03247 PHA03247
large tegument protein UL36; Provisional
1771-2163 4.60e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 4.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1771 ADQSGERKDKESPRAGPTEPmdTSEATVCH-SDLERTPPllPGRPardrgPESRPTELS----LEELSISARQQ--PTPL 1843
Cdd:PHA03247 2667 ARRLGRAAQASSPPQRPRRR--AARPTVGSlTSLADPPP--PPPT-----PEPAPHALVsatpLPPGPAAARQAspALPA 2737
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1844 TPAQPAPAPAPATTTGTRAGGHPEEPLSRLSRKrklledtesgktllldayrvwqqgqkgvaydlgrverimsetymlik 1923
Cdd:PHA03247 2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAPA----------------------------------------------- 2770
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1924 qhlPVKVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSdahtkPRP 2003
Cdd:PHA03247 2771 ---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP-----PPP 2842
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2004 AlaaattiitcPPSASASTLDQSKDPGPP----RPHRPEATPSMASLGPEGEELARVA-----EGTSFPPQEPRHSPQVK 2074
Cdd:PHA03247 2843 P----------GPPPPSLPLGGSVAPGGDvrrrPPSRSPAAKPAAPARPPVRRLARPAvsrstESFALPPDQPERPPQPQ 2912
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2075 MAPTSSPAEPhcwPAEAALGTGAEPTCSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAtkfPPE 2153
Cdd:PHA03247 2913 APPPPQPQPQ---PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA---PSR 2986
                         410
                  ....*....|
gi 578837041 2154 ITVTPPTPTL 2163
Cdd:PHA03247 2987 EAPASSTPPL 2996
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
2029-2130 5.83e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 5.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2029 PGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPP-QEPRHSPQVKMAPTSSPAEPHCWPAEAALGTgAEPTCSQAASS 2107
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPaPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPS-AQPAPAPAAAP 474
                          90       100
                  ....*....|....*....|...
gi 578837041 2108 KAPSSGSAQPPEGHPGKPEPSRA 2130
Cdd:PRK07764  475 EPTAAPAPAPPAAPAPAAAPAAP 497
PHA03247 PHA03247
large tegument protein UL36; Provisional
1963-2161 7.80e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1963 PKHPKDSREnffPVTVVPTAPDPVPADSVQRPSDAHTKPrpalaaattiitcPPSASASTLDQSKDPGPPRPhrpeatPS 2042
Cdd:PHA03247 2475 PGAPVYRRP---AEARFPFAAGAAPDPGGGGPPDPDAPP-------------APSRLAPAILPDEPVGEPVH------PR 2532
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2043 MASLGPEGEELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAEPHCWPAEAALGT--GAEPtcsQAASSKAPSSGSAQPP 2118
Cdd:PHA03247 2533 MLTWIRGLEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPAPRPSEPAVTSRARrpDAPP---QSARPRAPVDDRGDPR 2609
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 578837041 2119 -EGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTP 2161
Cdd:PHA03247 2610 gPAPPSPLPPDTHAPDPPPPSPS---PAANEPDPHPPPTVPPPE 2650
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1993-2195 8.61e-03

Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.39  E-value: 8.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 1993 RPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPrPHRPEATPSMASLGPEGEELARVAEGTsfpPQEPRHSPQ 2072
Cdd:PRK07994  360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAV-PPPPASAPQQAPAVPLPETTSQLLAAR---QQLQRAQGA 435
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578837041 2073 VKmAPTSSPAEPhcwpaeaalgTGAEPTcSQAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFP 2151
Cdd:PRK07994  436 TK-AKKSEPAAA----------SRARPV-NSALERLASVRPAPSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKA 503
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 578837041 2152 PEITVTPPTPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2195
Cdd:PRK07994  504 LEHEKTPELAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
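
The E-value threshold of 0.01 listed above can be checked against the per-hit summary lines. The following is a minimal Python sketch, not part of the CD-Search tooling: the names `SUMMARY_RE`, `parse_summary` and `passes_threshold` are hypothetical, the three input lines are copied from the PRK07764, PHA03247 and PRK07994 hits on this page, and treating the cutoff as inclusive is an assumption.

```python
import re

# Summary lines copied verbatim from three of the hits listed on this page.
SUMMARY_LINES = [
    "Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 5.83e-03",
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 7.80e-03",
    "Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.39  E-value: 8.61e-03",
]

# Hypothetical pattern for the "Pssm-ID: ..." line layout seen above.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Turn one summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognised summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the cutoff."""
    return hit["e_value"] <= threshold


hits = [parse_summary(line) for line in SUMMARY_LINES]
reported = [hit["pssm_id"] for hit in hits if passes_threshold(hit)]
print(reported)
```

All three hits fall under the 0.01 cutoff, which is consistent with their appearing in the list; lowering `threshold` in the sketch shows which hits a stricter search would drop.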

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.