|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2161-2195 |
1.20e-15 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.19 E-value: 1.20e-15
10 20 30
....*....|....*....|....*....|....*
gi 2462584232 2161 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2195
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2161-2195 |
6.91e-14 |
|
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.41 E-value: 6.91e-14
10 20 30
....*....|....*....|....*....|....*
gi 2462584232 2161 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2195
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
3.01e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 65.80 E-value: 3.01e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 2462584232 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1919-2162 |
4.74e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.81 E-value: 4.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1991
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1992 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2068
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2069 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2147
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
|
250
....*....|....*
gi 2462584232 2148 kfPPEITVTPPTPTL 2162
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1919-2188 |
3.61e-10 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 64.98 E-value: 3.61e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1994
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1995 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2072
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2073 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2144
Cdd:pfam17823 266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2462584232 2145 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2188
Cdd:pfam17823 346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1880-2160 |
8.47e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 8.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1880 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPVP 1957
Cdd:NF033839 229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKPGM 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1958 ADSVQRPSDahtkprpalaaattiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2037
Cdd:NF033839 300 QPSPQPEKK------------------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2038 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2116
Cdd:NF033839 349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 2462584232 2117 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2160
Cdd:NF033839 420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1998-2138 |
1.72e-04 |
|
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1998 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2071
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462584232 2072 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2138
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1918-2160 |
1.90e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.13 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1918 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1993
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1994 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2059
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2060 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2133
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 2462584232 2134 LPNMPKLVIPSAATKFPPeiTVTPPTP 2160
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.92e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 38.91 E-value: 1.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 2462584232 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
2020-2130 |
1.98e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2020 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2099
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 2462584232 2100 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2130
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
2005-2148 |
5.47e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 40.06 E-value: 5.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2005 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2084
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462584232 2085 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2148
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2161-2195 |
1.20e-15 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.19 E-value: 1.20e-15
10 20 30
....*....|....*....|....*....|....*
gi 2462584232 2161 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2195
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2161-2195 |
6.91e-14 |
|
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.41 E-value: 6.91e-14
10 20 30
....*....|....*....|....*....|....*
gi 2462584232 2161 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2195
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
3.01e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 65.80 E-value: 3.01e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 2462584232 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1919-2162 |
4.74e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.81 E-value: 4.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1991
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1992 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2068
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2069 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2147
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
|
250
....*....|....*
gi 2462584232 2148 kfPPEITVTPPTPTL 2162
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1917-2167 |
1.26e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.26e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1917 LGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQS 1996
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1997 KdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTcsqegk 2076
Cdd:PHA03247 2798 L-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVR------ 2863
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2077 lRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSgSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVT 2156
Cdd:PHA03247 2864 -RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
250
....*....|.
gi 2462584232 2157 PPTPTLLSPKG 2167
Cdd:PHA03247 2941 PPLAPTTDPAG 2951
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1930-2200 |
1.38e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1930 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 2008
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2009 ATPSMASLGPEGEELARVAEGTSFPPqeprhsPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqeGKLRPEPRRDGEAQ 2088
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTA----GPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2089 EAASETQPLSSPPTAASSKAPS-SGSAQPPEGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTPTL----- 2162
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSpWDPADPPAAVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLplggs 2855
|
250 260 270
....*....|....*....|....*....|....*...
gi 2462584232 2163 LSPKGSISEETKQKLKSAILSAQSAANVRkeSLCQPAL 2200
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVR--RLARPAV 2891
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1925-2191 |
1.89e-10 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 66.73 E-value: 1.89e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1925 ASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATT--IITCPPSASA----STLDQSKD 1998
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPAepppSTPPAAAS 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1999 PGPPRPHRPEATPSM--ASLGPEGEELARVAEGTSFPPQEPRHSP--QVKMAPTSSPAEPHC--WPAEAALGTGAEPTCS 2072
Cdd:PHA03307 204 PRPPRRSSPISASASspAPAPGRSAADDAGASSSDSSSSESSGCGwgPENECPLPRPAPITLptRIWEASGWNGPSSRPG 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2073 QEGKLRPEPRRDGEAQEAASETQPLSSPPT----------AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmpklvi 2142
Cdd:PHA03307 284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRasssssssreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP------ 357
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2143 PSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2191
Cdd:PHA03307 358 PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1919-2188 |
3.61e-10 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 64.98 E-value: 3.61e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1994
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1995 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2072
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2073 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2144
Cdd:pfam17823 266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2462584232 2145 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2188
Cdd:pfam17823 346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
34-199 |
5.81e-10 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 61.95 E-value: 5.81e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457 25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457 92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
|
170
....*....|....
gi 2462584232 186 EKDCRYSKGLVLKE 199
Cdd:COG0457 172 AAALAALLAAALGE 185
|
|
| Spy |
COG3914 |
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ... |
30-188 |
2.26e-09 |
|
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443119 [Multi-domain] Cd Length: 658 Bit Score: 62.70 E-value: 2.26e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914 72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462584232 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914 132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1926-2199 |
6.38e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.88 E-value: 6.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1926 SGDTPttPKHPKDSRENFFPVTVVPTAPDPVPADSVQRpSDAHTKPRPALAAATTIITCPPSASASTLDQSkdPGPPRPH 2005
Cdd:PHA03247 2548 AGDPP--PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRGDPRGPAPPS--PLPPDTH 2622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2006 RPEATPSMASlgPEGEELARVAEGTSFPPQEPRHSPQVK-----------------MAPTSSPAEPHCWPAEAALGTGAE 2068
Cdd:PHA03247 2623 APDPPPPSPS--PAANEPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgraaqaSSPPQRPRRRAARPTVGSLTSLAD 2700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2069 PTcsqegklrPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKP----EPSRAKSRPLPNMPklviPS 2144
Cdd:PHA03247 2701 PP--------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpgGPARPARPPTTAGP----PA 2768
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2462584232 2145 AAtkfPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2199
Cdd:PHA03247 2769 PA---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
30-205 |
9.77e-09 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 58.97 E-value: 9.77e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956 70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956 150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
|
170 180 190
....*....|....*....|....*....|
gi 2462584232 176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956 230 EALELLRKALELDPSDDLLLALADLLERKE 259
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
33-188 |
6.22e-08 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 53.66 E-value: 6.22e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783 1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462584232 113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783 61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1895-2188 |
7.31e-08 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 58.00 E-value: 7.31e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1895 HLPVKVDEEAALEQAVKFCQVHLGAAAQrQASGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPR 1972
Cdd:pfam05109 453 HVPTNLTAPASTGPTVSTADVTSPTPAG-TTSGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPT 531
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1973 PALAAATTIITCPPSASASTLDQSKDPGPP-RPHRPEAT-PSMASLGPEGEELARVAEGTSfpPQEPRHSPQVK-----M 2045
Cdd:pfam05109 532 PNATSPTLGKTSPTSAVTTPTPNATSPTPAvTTPTPNATiPTLGKTSPTSAVTTPTPNATS--PTVGETSPQANttnhtL 609
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2046 APTSSPAEPHCWP--AEAALGTGAEPTCSQEG---KLRPEPRRDG---EAQEAASETQPL--SSPPTAA---SSKAPSSG 2112
Cdd:pfam05109 610 GGTSSTPVVTSPPknATSAVTTGQHNITSSSTssmSLRPSSISETlspSTSDNSTSHMPLltSAHPTGGeniTQVTPAST 689
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462584232 2113 SAQPPEGHPGKPEPSRAKSRPLPNMpklviPSAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2188
Cdd:pfam05109 690 STHHVSTSSPAPRPGTTSQASGPGN-----SSTSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1950-2189 |
2.56e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 56.01 E-value: 2.56e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1950 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTldqskdpgPPRPHRPEATPSMASLGPEGEELARVAEG 2029
Cdd:PRK07003 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAADG 445
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2030 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG-AEPTCSQEgklrPEPRRDGEAQEAASETQPLSSPPTAASSKA 2108
Cdd:PRK07003 446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFE----PAPRAAAPSAATPAAVPDARAPAAASREDA 521
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2109 PSSGSAQPPEGHPGKP----EPSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLS 2164
Cdd:PRK07003 522 PAAAAPPAPEARPPTPaaaaPAARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRA 601
|
250 260
....*....|....*....|....*
gi 2462584232 2165 PKGSiseeTKQKLKSAILSAQSAAN 2189
Cdd:PRK07003 602 RAAT----GDAPPNGAARAEQAAES 622
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1928-2201 |
1.10e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 54.30 E-value: 1.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1928 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 2001
Cdd:PHA03378 607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2002 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPaephcwpaeaalgtgaeptcsqeGKLRPep 2081
Cdd:PHA03378 682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAP-----------------------GRARP-- 736
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2082 rrdgeAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSrpLPNMPKLVIPSAATKFPPeiTVTPPTPT 2161
Cdd:PHA03378 737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ--APPAPQQRPRGAPTPQPP--PQAGPTSM 807
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 2462584232 2162 LLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPALE 2201
Cdd:PHA03378 808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
1947-2166 |
1.27e-06 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 53.68 E-value: 1.27e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1947 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 2026
Cdd:PRK14086 91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2027 AEGTSFPPQEPRHSPqvkmaptsSPAEPHCWPAEAAlgTGAEPTCSQEGKLRPE---PRRDGEAQEaasetqPLSSPPTA 2103
Cdd:PRK14086 160 RAADDYGWQQQRLGF--------PPRAPYASPASYA--PEQERDREPYDAGRPEydqRRRDYDHPR------PDWDRPRR 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462584232 2104 ASSKAP--SSGSAQPPEGHPGKPEPSRAKSRPlpnmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2166
Cdd:PRK14086 224 DRTDRPepPPGAGHVHRGGPGPPERDDAPVVP-------IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1925-2175 |
1.31e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1925 ASGDTPTTP--KHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIitcPPSASAstldqskdPGPP 2002
Cdd:PHA03378 646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAP--------PGRA 714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2003 RPhrPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEGKLRPEPR 2082
Cdd:PHA03378 715 QR--PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG-----RARPPAAAPGAPTPQPPPQAPPAPQ 787
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2083 RdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTPtl 2162
Cdd:PHA03378 788 Q--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP-- 856
|
250
....*....|...
gi 2462584232 2163 lSPKGSISEETKQ 2175
Cdd:PHA03378 857 -SPGSGTSDKIVQ 868
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1924-2150 |
1.40e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 53.73 E-value: 1.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1924 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 2001
Cdd:PRK12323 367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2002 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEP 2081
Cdd:PRK12323 445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462584232 2082 RRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRpLPNMPKLVIPSAATKFP 2150
Cdd:PRK12323 522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1913-2138 |
1.71e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 1.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1913 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAST 1992
Cdd:PRK07764 582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1993 LDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcWPAEAALGTGAEPTCS 2072
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-PPQAAQGASAPSPAAD 736
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462584232 2073 QEGKLRPEPRRDGEAQEAasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMP 2138
Cdd:PRK07764 737 DPVPLPPEPDDPPDPAGA-----PAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1951-2158 |
2.48e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 52.96 E-value: 2.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1951 TAPDPVPADSVQRPSDAHTKPRPALAAATTiitcPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGT 2030
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAA----PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2031 SFPPQEPRHSPQVKMAPTSSPAEPhcwpaEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASEtqpLSSPPTAASSKAPs 2110
Cdd:PRK12323 448 PAPAPAPAAAPAAAARPAAAGPRP-----VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE---FASPAPAQPDAAP- 518
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 2462584232 2111 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2158
Cdd:PRK12323 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
38-188 |
2.88e-06 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 49.19 E-value: 2.88e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010 2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462584232 118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010 82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1918-2138 |
2.96e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.87 E-value: 2.96e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1918 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 1997
Cdd:PHA03307 76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1998 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2067
Cdd:PHA03307 143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2068 EPTCSQEGKLR--------PEPRRDGEAQE-------AASETQP----LSSPPTAASSKAPSSGSAQPPEGHPGKPEPSR 2128
Cdd:PHA03307 223 APGRSAADDAGasssdsssSESSGCGWGPEnecplprPAPITLPtriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
|
250
....*....|
gi 2462584232 2129 AKSRPLPNMP 2138
Cdd:PHA03307 303 PGSGPAPSSP 312
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
99-188 |
4.41e-06 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 47.09 E-value: 4.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063 1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
|
90
....*....|
gi 2462584232 179 YFICKALEKD 188
Cdd:COG3063 80 AYLERALELD 89
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1961-2166 |
4.74e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 52.08 E-value: 4.74e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1961 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPrh 2039
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLP-- 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2040 SPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQEGKLRPEPRRDGEAQ-EAASETQPLSSPPTAASSKAPSSGSAQ--- 2115
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQVSPQPL----PQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFPLTPQSSQSQVPPGPSPAapg 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2116 ---------PPEGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2166
Cdd:pfam03154 320 qsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1918-2180 |
6.26e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 6.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1918 GAAAQRQASGDTPTTPKHPKDSRENffpvtvvPTAPDPVPADSVQRPSdahTKPRPALAAATTIiTCPPSASASTLDQSK 1997
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSH-KHPPHLSGPSPFQMN 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1998 DPGPPRP-----------HRPEATPSMASLGPEGEELARvaegtsfPPQEPRHSPQVKMAPTSSPAEPHcwpaeaalGTG 2066
Cdd:pfam03154 388 SNLPPPPalkplsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHPP--------TSG 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2067 AEPTCSQEgklrPEPRRDGEAQEAASETQPlSSPPTAASSKAPSSgsaQPPEghpgkpEPSRAKSRPLPNMPKLVIPSAA 2146
Cdd:pfam03154 453 LHQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGI---QPPS------SASVSSSGPVPAAVSCPLPPVQ 518
|
250 260 270
....*....|....*....|....*....|....*....
gi 2462584232 2147 TKFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2180
Cdd:pfam03154 519 IKEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1924-2169 |
9.62e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.21 E-value: 9.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1924 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 1992
Cdd:PHA03379 407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1993 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2067
Cdd:PHA03379 481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2068 EPTCSQEGKLR---------------PEPRRDGEAQ---EAASETQPLSSPP---TAASSKAPSSGSAQPPEG-HPGKPE 2125
Cdd:PHA03379 559 ETSGIVRVRERwrpapwtpnpprspsQMSVRDRLARlraEAQPYQASVEVQPpqlTQVSPQQPMEYPLEPEQQmFPGSPF 638
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 2462584232 2126 PSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI 2169
Cdd:PHA03379 639 SQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPV 682
|
|
| NlpI |
COG4785 |
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis]; |
29-174 |
1.58e-05 |
|
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 443815 [Multi-domain] Cd Length: 223 Bit Score: 48.37 E-value: 1.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785 8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462584232 102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785 85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1919-2172 |
1.94e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.17 E-value: 1.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTT-PKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSasastldqsk 1997
Cdd:PHA03307 60 AACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPS---------- 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1998 dPGPPRPH-----RPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlGTGAEPTCs 2072
Cdd:PHA03307 130 -PAPDLSEmlrpvGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP-PAAASPRP- 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2073 qegklrpePRRDGEAQEAASETQPlSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPpe 2152
Cdd:PHA03307 207 --------PRRSSPISASASSPAP-APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-- 275
|
250 260
....*....|....*....|
gi 2462584232 2153 iTVTPPTPTLLSPKGSISEE 2172
Cdd:PHA03307 276 -NGPSSRPGPASSSSSPRER 294
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
95-174 |
2.27e-05 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 46.15 E-value: 2.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235 22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1946-2162 |
2.82e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 2.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1946 VTVVPtAPDPVPADSVQRPSDAHTKPRPALAAATtiitcPPSASASTlDQSKDPGPPRPHRPEATPSMASLGPEGEELAR 2025
Cdd:PRK07764 584 VEAVV-GPAPGAAGGEGPPAPASSGPPEEAARPA-----APAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2026 VAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwpAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQplsSPPTAAS 2105
Cdd:PRK07764 657 VAVPDASDGGDGWPAKAGGAAPAAPPPAP----APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---QAAQGAS 729
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462584232 2106 SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2162
Cdd:PRK07764 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1919-2091 |
3.37e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.21 E-value: 3.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1998
Cdd:PRK07764 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1999 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2072
Cdd:PRK07764 701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
170 180
....*....|....*....|..
gi 2462584232 2073 QEGKLRPEPRR---DGEAQEAA 2091
Cdd:PRK07764 781 EEEEMAEDDAPsmdDEDRRDAE 802
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1919-2138 |
4.09e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 49.08 E-value: 4.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1919 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1998
Cdd:PRK07003 395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1999 PGPPRPHRPEATPSMASLGPEGEELA--------------RVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC-WPAEAAL 2063
Cdd:PRK07003 468 AQPPADSGSASAPASDAPPDAAFEPApraaapsaatpaavPDARAPAAASREDAPAAAAPPAPEARPPTPAAaAPAARAG 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2064 GTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSK---------APSSGSAQPPEghPGKPEPSRAKSR-- 2132
Cdd:PRK07003 548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRvavqvptprARAATGDAPPN--GAARAEQAAESRga 625
|
....*...
gi 2462584232 2133 --PLPNMP 2138
Cdd:PRK07003 626 ppPWEDIP 633
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
1945-2051 |
4.69e-05 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 47.55 E-value: 4.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1945 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 2015
Cdd:PHA02682 76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
|
90 100 110
....*....|....*....|....*....|....*..
gi 2462584232 2016 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2051
Cdd:PHA02682 156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
1-155 |
4.87e-05 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 45.72 E-value: 4.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010 21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462584232 81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010 84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1945-2133 |
5.23e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 5.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1945 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 2019
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2020 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAEPtcsqegklRPEPRRDGEAQEAASETQPLS 2098
Cdd:PHA03307 104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPPAA--------SPPAAGASPAAVASDAASSRQ 171
|
170 180 190
....*....|....*....|....*....|....*
gi 2462584232 2099 SPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2133
Cdd:PHA03307 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1985-2198 |
5.90e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 48.38 E-value: 5.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1985 PPSASASTldQSKDPGPPRPHRPEATPSMASLGPEGEELAR---VAEGTSF----PPQEPRHSPQVKMAPTSSPAEPHCW 2057
Cdd:PLN03209 329 PPKESDAA--DGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2058 PAEAALGTGAEPTCS-QEGKLRPEPR---RDGEAQEAASETQPLSSP-PTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2132
Cdd:PLN03209 407 PAEPDVVPSPGSASNvPEVEPAQVEAkktRPLSPYARYEDLKPPTSPsPTAPTGVSPSVSSTSSVPAVPDTAPATAATDA 486
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462584232 2133 PLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2198
Cdd:PLN03209 487 AAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1880-2160 |
8.47e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 8.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1880 RVERIMSETYMLIKQHLPV--KVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPVP 1957
Cdd:NF033839 229 QIVALIKELDELKKQALSEidNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKPGM 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1958 ADSVQRPSDahtkprpalaaattiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEP 2037
Cdd:NF033839 300 QPSPQPEKK------------------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2038 RHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP- 2116
Cdd:NF033839 349 TPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPe 419
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 2462584232 2117 --PEGHPGKPE--PSRAKS----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2160
Cdd:NF033839 420 vkPQPEKPKPEvkPQPEKPkpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1930-2201 |
9.26e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.75 E-value: 9.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1930 PTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPHRPEA 2009
Cdd:PHA03378 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA------RPPAAAPGRARPPAAAPGRARPPAA 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2010 TPSMA---SLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPA---EPHCWPAEAALGTGAEPTCSQEGKLRPEPRR 2083
Cdd:PHA03378 750 APGRArppAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTpqpPPQAGPTSMQLMPRAAPGQQGPTKQILRQLL 829
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2084 DGEAQEA-ASETQPLSSPPTAASSKAPSSGSA------QPPEGHPGKPEPSRAKSRplPNMPKLVIPSAATKFPPEIT-- 2154
Cdd:PHA03378 830 TGGVKRGrPSLKKPAALERQAAAGPTPSPGSGtsdkivQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPTEYTge 907
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 2462584232 2155 ---VTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLC---QPALE 2201
Cdd:PHA03378 908 rrgVGPMHPTDIPPSKRAKTDAYVESQPPHGGQSHSFSVIWENVSqgqQQTLE 960
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1998-2138 |
1.72e-04 |
|
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1998 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2071
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462584232 2072 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2138
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1943-2166 |
2.52e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 45.72 E-value: 2.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1943 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEgee 2022
Cdd:PHA03291 203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE--- 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2023 larvaegtsfppqEPRHspQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRdgeaqeaASETQPLSSPPT 2102
Cdd:PHA03291 278 -------------ASRY--ELTVTQIIQIAIPASIIACVFLGSCACCLHRRCRRRRRRPAR-------IYRPPSPVAPSI 335
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462584232 2103 AASSKAPSSGSAQPPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2166
Cdd:PHA03291 336 SAVNEAALARLGDELKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2008-2165 |
3.19e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.02 E-value: 3.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2008 EATPSMASLGPEGEELARVAEGTSFPPqeprhspqvkmAPTSSPAEPHCWPAEAAlgtGAEPTCSQEGKLRPEPRRDGEA 2087
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAA 436
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462584232 2088 QEAASETQPLSSPPTAASSKAPSsgSAQPPEGHPGKPEPSRAKSRPLPNMPKLViPSAATKFPPEITVTPPTPTLLSP 2165
Cdd:PRK12323 437 RQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVAAAAAAAPARAAPAAA-PAPADDDPPPWEELPPEFASPAP 511
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1926-2135 |
3.65e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.85 E-value: 3.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1926 SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkPRPALAAATTIITCPpsasasTLDQSKDPGPp 2002
Cdd:PRK10263 295 SGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-QTPPVASVDVPPAQP------TVAWQPVPGP- 366
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2003 rpHRPEatPSMASlGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPR 2082
Cdd:PRK10263 367 --QTGE--PVIAP-APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP 441
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 2462584232 2083 RDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP 2135
Cdd:PRK10263 442 VAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEP 494
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1963-2189 |
4.04e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.64 E-value: 4.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1963 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2038
Cdd:PRK12323 364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2039 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEgkLRPEPRrdgeaqeAASETQPLSSPPTAAsskAPSSGSAQPPE 2118
Cdd:PRK12323 438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAG--PRPVAA-------AAAAAPARAAPAAAP---APADDDPPPWE 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2119 GHPGK-PEPSRAKSRPLPNM--------PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2189
Cdd:PRK12323 501 ELPPEfASPAPAQPDAAPAGwvaesipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| PHA03381 |
PHA03381 |
tegument protein VP22; Provisional |
1936-2076 |
5.23e-04 |
|
tegument protein VP22; Provisional
Pssm-ID: 177618 [Multi-domain] Cd Length: 290 Bit Score: 44.23 E-value: 5.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1936 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 2004
Cdd:PHA03381 11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462584232 2005 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQEGK 2076
Cdd:PHA03381 91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQK 164
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
92-158 |
6.39e-04 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 40.92 E-value: 6.39e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462584232 92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063 28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1954-2165 |
8.09e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 8.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1954 DPVPADSV-QRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGP-------------- 2018
Cdd:PHA03247 2452 DPFFARTIlGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvgepvhp 2531
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2019 ------EG-EELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAePHcwPAEAALGTGAE----PTCSQEGKLRPEPRRDG 2085
Cdd:PHA03247 2532 rmltwiRGlEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPA-PR--PSEPAVTSRARrpdaPPQSARPRAPVDDRGDP 2608
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2086 EAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHP-GKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTP--PTPTL 2162
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPpTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRA 2687
|
...
gi 2462584232 2163 LSP 2165
Cdd:PHA03247 2688 ARP 2690
|
|
| PHA03325 |
PHA03325 |
nuclear-egress-membrane-like protein; Provisional |
1988-2157 |
9.52e-04 |
|
nuclear-egress-membrane-like protein; Provisional
Pssm-ID: 223044 Cd Length: 418 Bit Score: 44.10 E-value: 9.52e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1988 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2062
Cdd:PHA03325 259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2063 LGTGAEPTCSQEGKLRPeprrdgeAQEAASETQPLSSPPTAASSKApSSGSAQPPEGHPGKPEPSRAKSRPLPnmpklvi 2142
Cdd:PHA03325 336 NAEAKEPAQPATSTSSK-------GSSSAQNKDSGSTGPGSSLAAA-SSFLEDDDFGSPPLDLTTSLRHMPSP------- 400
|
170
....*....|....*
gi 2462584232 2143 PSAATKFPPEITVTP 2157
Cdd:PHA03325 401 SVTSAPEPPSIPLTY 415
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1999-2136 |
1.07e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 1.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1999 PGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPtcsqegklr 2078
Cdd:PRK07764 396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAP--------- 454
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 2462584232 2079 pePRRDGEAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHPGKPEPSRAKSRPLPN 2136
Cdd:PRK07764 455 --SPPPAAAPSAQPAPAPAAAPEPTAAP-APAPPAAPAPAAAPAAPAAPAAPAGADDA 509
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1933-2165 |
1.34e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.76 E-value: 1.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1933 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 2000
Cdd:PLN03209 330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2001 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCS 2072
Cdd:PLN03209 409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSST-----SSVPAVPDTAPATAA 483
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2073 QEGKLRPEPRrdgeaqeaaseTQPLSSPPTAASSKAPSSGSaqppeghPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPE 2152
Cdd:PLN03209 484 TDAAAPPPAN-----------MRPLSPYAVYDDLKPPTSPS-------PAAPVGKVAPSST-NEVVKVGNSAPPTALADE 544
|
250
....*....|...
gi 2462584232 2153 ITVTPPTPTLLSP 2165
Cdd:PLN03209 545 QHHAQPKPRPLSP 557
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1920-2162 |
1.36e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1920 AAQRQASGDTPT-TPKHPKDSRENFFPV---------TVVPTAPDPVPADSVQRPSDAHTKPRpalaAATTIITCP-PSA 1988
Cdd:PHA03247 270 ETARGATGPPPPpEAAAPNGAAAPPDGVwgaalagapLALPAPPDPPPPAPAGDAEEEDDEDG----AMEVVSPLPrPRQ 345
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1989 SASTldqskdpGPPRPHRPEATP--SMASLGpEGEELARVAEgtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG 2066
Cdd:PHA03247 346 HYPL-------GFPKRRRPTWTPpsSLEDLS-AGRHHPKRAS----LPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2067 AEPTCSQEGKLRPEPrrdgeaqeaasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlVIPSAA 2146
Cdd:PHA03247 414 SVPTPAPTPVPASAP--------------PPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK-ALDALR 478
|
250
....*....|....*.
gi 2462584232 2147 TKFPPEitvtPPTPTL 2162
Cdd:PHA03247 479 ERRPPE----PPGADL 490
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1897-2134 |
1.53e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.71 E-value: 1.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1897 PVKVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPkhpkdsrenffpvtvvptAPDPVPADSVQRPSDAHTKPRPALA 1976
Cdd:PRK12323 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP------------------APEALAAARQASARGPGGAPAPAPA 453
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1977 -AATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLG-PEGEELarvaegtsfpPQEPrhspqvkmaPTSSPAEP 2054
Cdd:PRK12323 454 pAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpPPWEEL----------PPEF---------ASPAPAQP 514
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2055 HCWPAEAALGTGAEPTCSQEGKLRPEPRrdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKpEPSRAKSRPL 2134
Cdd:PRK12323 515 DAAPAGWVAESIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD-WPALAARLPV 590
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1999-2117 |
1.86e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.44 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1999 PGPPRPHRPEATPSMASLGPEgeelarvAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQEGKLR 2078
Cdd:PRK07764 394 PAAAAPSAAAAAPAAAPAPAA-------AAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPAGGAPSPPPAAAPS 463
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 2462584232 2079 PEPRR---DGEAQEAASETQPLSSPPTAASSKAPSSGSAQPP 2117
Cdd:PRK07764 464 AQPAPapaAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1918-2160 |
1.90e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.13 E-value: 1.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1918 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1993
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1994 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2059
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2060 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2133
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 2462584232 2134 LPNMPKLVIPSAATKFPPeiTVTPPTP 2160
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.92e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 38.91 E-value: 1.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 2462584232 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
2020-2130 |
1.98e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2020 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2099
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 2462584232 2100 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2130
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
1957-2191 |
2.36e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 43.14 E-value: 2.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1957 PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARvaEGTSFPPQE 2036
Cdd:pfam03546 168 DSESSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE--ESSDSEEEA 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2037 PRHSPQVKMAPTSSPAEPHCWPAEaalGTGAEPTCSQEGKLR---PEPRRDGEAQEAASetqpLSSPPTAASSKAP---S 2110
Cdd:pfam03546 246 PAAATPAQAKPALKTPQTKASPRK---GTPITPTSAKVPPVRvgtPAPWKAGTVTSPAC----ASSPAVARGAQRPeedS 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2111 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI-----------SEETKQKLKS 2179
Cdd:pfam03546 319 SSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAVAQvkaeaqedsesSEEESDSEEA 398
|
250
....*....|..
gi 2462584232 2180 AILSAQSAANVR 2191
Cdd:pfam03546 399 AATPAQVKASGK 410
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
1923-2167 |
2.40e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 43.02 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1923 RQASGDTPTTPKHPKDSRENFFPVT-----------VVPTAPDPVPAdSVQRPSDAHTK---PRPAlaaattiitcPPSA 1988
Cdd:PHA03321 447 RARPGSTPACARRARAQRARDAGPEyvdplgalrrlPAGAAPPPEPA-AAPSPATYYTRmggGPPR----------LPPR 515
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1989 SASTLDQSKDPGPPRPHRPEATPSmASLGPEGEELARVAEGTSFPPQEPRHSPqvkmAPTSSPaephcwPAEaALGTGAE 2068
Cdd:PHA03321 516 NRATETLRPDWGPPAAAPPEQMED-PYLEPDDDRFDRRDGAAAAATSHPREAP----APDDDP------IYE-GVSDSEE 583
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2069 PTCSQegklRPEPR----RDGEAQEAASETQPLSSPptaassKAPSSGSAQPPEGHPGKP--EPSRAKSRPLPnmpklvi 2142
Cdd:PHA03321 584 PVYEE----IPTPRvyqnPLPRPMEGAGEPPDLDAP------TSPWVEEENPIYGWGDSPlfSPPPAARFPPP------- 646
|
250 260
....*....|....*....|....*
gi 2462584232 2143 PSAATKFPPEITVTPPTPTLLSPKG 2167
Cdd:PHA03321 647 DPALSPEPPALPAHRPRPGALAPDG 671
|
|
| TPR_21 |
pfam09976 |
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat. |
48-151 |
3.60e-03 |
|
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
Pssm-ID: 430959 [Multi-domain] Cd Length: 194 Bit Score: 41.03 E-value: 3.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976 32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
|
90 100 110
....*....|....*....|....*....|....*...
gi 2462584232 115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976 100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
34-174 |
4.04e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.64 E-value: 4.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956 6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462584232 114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956 66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
|
|
| PRK12727 |
PRK12727 |
flagellar biosynthesis protein FlhF; |
1946-2160 |
4.33e-03 |
|
flagellar biosynthesis protein FlhF;
Pssm-ID: 237182 [Multi-domain] Cd Length: 559 Bit Score: 42.28 E-value: 4.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1946 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKdpgpprPHRPEATPSMASLGpegeelAR 2025
Cdd:PRK12727 62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA------LRQPVSVPRQAPAA------AP 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2026 VAEGTSFPPQEPRHSPQVKMapTSSPAEPHCWPAEAALGTGAEPTCSQegklRPEPRRDGEAQEAASETqPLSSPPTAAS 2105
Cdd:PRK12727 130 VRAASIPSPAAQALAHAAAV--RTAPRQEHALSAVPEQLFADFLTTAP----VPRAPVQAPVVAAPAPV-PAIAAALAAH 202
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 2462584232 2106 SKAPSSGSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2160
Cdd:PRK12727 203 AAYAQDDDEQLDDDGFDLDDAL-PQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
2012-2199 |
4.46e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 42.29 E-value: 4.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2012 SMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALgTGAEPTCSQEGKLRPEPRRDGEAQEAA 2091
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPM-TAYPPVPQFCGDPGLVSPYNPQSPGTS 427
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2092 SETQPLSS-PPT-AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTPPTPTLLSPKGS 2168
Cdd:PHA03369 428 YGPEPVGPvPPQpTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLAKELEATAHKSE 507
|
170 180 190
....*....|....*....|....*....|.
gi 2462584232 2169 ISEETKQKLKSAILSAQSAANVRKESLCQPA 2199
Cdd:PHA03369 508 IKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
1921-2053 |
4.62e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.07 E-value: 4.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 1921 AQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA---------- 1990
Cdd:PRK14971 360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdppaavpvnp 439
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462584232 1991 -STLDQSKDPGPPRPHRPEATPSMASLGPegeelarvaeGTSFPPQEPRHSPQ--VKMAPTSSPAE 2053
Cdd:PRK14971 440 pSTAPQAVRPAQFKEEKKIPVSKVSSLGP----------STLRPIQEKAEQATgnIKEAPTGTQKE 495
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
2005-2148 |
5.47e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 40.06 E-value: 5.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2005 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2084
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462584232 2085 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2148
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
2003-2194 |
9.53e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 41.00 E-value: 9.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2003 RPHRPEATPSM---ASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEgklRP 2079
Cdd:PRK07994 360 HPAAPLPEPEVppqSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG---AT 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462584232 2080 EPRRDGEAqeAASETQPLSSPPTAASSKAPssgSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFPPEITVTPP 2158
Cdd:PRK07994 437 KAKKSEPA--AASRARPVNSALERLASVRP---APSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKALEHEKTPE 511
|
170 180 190
....*....|....*....|....*....|....*....
gi 2462584232 2159 TPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2194
Cdd:PRK07994 512 LAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
|
|
|