Conserved domains on  [gi|332164762|ref|NP_001193719|]
transmembrane protease serine 13 isoform 3 [Homo sapiens]

Protein Classification

LDL receptor domain-containing protein (domain architecture ID 11517102)

Low Density Lipoprotein (LDL) receptor class A domain is a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; similar to Bos taurus CD320 antigen and to Caenorhabditis elegans LDL receptor repeat-containing protein egg-2

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
325-441 1.01e-31

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 121.63  E-value: 1.01e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   325 RIVGGALASDSKWPWQVSLHF-GTTHICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPEA--ASIAEII 401
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYgGGRHFCGGSLISPRWVLTAAHCV---RGSDPSNIRVRLGSHDLSSGEEGqvIKVSKVI 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 332164762   402 INSNYTDEEDDYDIALMRLSKPLTLSG--EGICTPRSPAPQP 441
Cdd:smart00020  78 IHPNYNPSTYDNDIALLKLKEPVTLSDnvRPICLPSSNYNVP 119
SRCR_2 pfam15494
Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain ...
231-321 2.07e-29

Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain family found largely in vertebrate sequences upstream of the trypsin-like transmembrane serine protease, Spinesin.


Pssm-ID: 464747  Cd Length: 99  Bit Score: 110.88  E-value: 2.07e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  231 WDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRD----FANSFSIL---RYNSTIQESL-HRS 302
Cdd:pfam15494   1 GENFLLQVYSSARPSWLPVCSDDWNPAYGRAACQQLGYLRLTHHKSVNLTDissnSSQSFMKLnssSLNTDLYEALqPRD 80
                          90
                  ....*....|....*....
gi 332164762  303 ECPSQRYISLQCSHCGLRA 321
Cdd:pfam15494  81 SCSSGSVVSLRCSECGLRS 99
PHA03378 super family cl33729
EBNA-3B; Provisional
9-148 2.22e-12

EBNA-3B; Provisional


The actual alignment was detected with superfamily member PHA03378:

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 69.71  E-value: 2.22e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03378 689 WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 768
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  89 ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPiRSSPARSAPATRATRE 148
Cdd:PHA03378 769 AAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP-RAAPGQQGPTKQILRQ 827
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
210-226 4.41e-03

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal region of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins that are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 34.87  E-value: 4.41e-03
                         10
                 ....*....|....*..
gi 332164762 210 RCDGVVDCKLKSDELGC 226
Cdd:cd00112   19 VCDGEDDCGDGSDEENC 35
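
Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally tagged "[Multi-domain]". As a minimal sketch, assuming this page has been saved as plain text in the layout shown here, such a line could be parsed as follows. The function name `parse_stat_line` and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears on this page.
# The "[Multi-domain]" tag is optional and sits between the ID and "Cd Length".
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    m = STAT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Statistics line of the Tryp_SPc (smart00020) hit, copied from the page.
    print(parse_stat_line(
        "Pssm-ID: 214473  Cd Length: 229  Bit Score: 121.63  E-value: 1.01e-31"
    ))
```

The parsed `cd_length` can then be compared with the model coordinates in the alignment rows; the LDLa hit, for instance, aligns to model positions 19-35 of a 35-residue model.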
 
Name Accession Description Interval E-value
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
325-441 1.01e-31

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 121.63  E-value: 1.01e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   325 RIVGGALASDSKWPWQVSLHF-GTTHICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPEA--ASIAEII 401
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYgGGRHFCGGSLISPRWVLTAAHCV---RGSDPSNIRVRLGSHDLSSGEEGqvIKVSKVI 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 332164762   402 INSNYTDEEDDYDIALMRLSKPLTLSG--EGICTPRSPAPQP 441
Cdd:smart00020  78 IHPNYNPSTYDNDIALLKLKEPVTLSDnvRPICLPSSNYNVP 119
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
326-427 7.20e-30

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. The alignment also contains inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 116.61  E-value: 7.20e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762 326 IVGGALASDSKWPWQVSLHFGT-THICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPE---AASIAEII 401
Cdd:cd00190    1 IVGGSEAKIGSFPWQVSLQYTGgRHFCGGSLISPRWVLTAAHCV---YSSAPSNYTVRLGSHDLSSNEGggqVIKVKKVI 77
                         90       100
                 ....*....|....*....|....*.
gi 332164762 402 INSNYTDEEDDYDIALMRLSKPLTLS 427
Cdd:cd00190   78 VHPNYNPSTYDNDIALLKLKRPVTLS 103
SRCR_2 pfam15494
Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain ...
231-321 2.07e-29

Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain family found largely in vertebrate sequences upstream of the trypsin-like transmembrane serine protease, Spinesin.


Pssm-ID: 464747  Cd Length: 99  Bit Score: 110.88  E-value: 2.07e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  231 WDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRD----FANSFSIL---RYNSTIQESL-HRS 302
Cdd:pfam15494   1 GENFLLQVYSSARPSWLPVCSDDWNPAYGRAACQQLGYLRLTHHKSVNLTDissnSSQSFMKLnssSLNTDLYEALqPRD 80
                          90
                  ....*....|....*....
gi 332164762  303 ECPSQRYISLQCSHCGLRA 321
Cdd:pfam15494  81 SCSSGSVVSLRCSECGLRS 99
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
317-425 2.55e-26

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 107.43  E-value: 2.55e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762 317 CGLRAMTGRIVGGALASDSKWPWQVSLHF---GTTHICGGTLIDAQWVLTAAHCFFvtrEKVLEGWKVYAGTSNLH-QLP 392
Cdd:COG5640   22 APAADAAPAIVGGTPATVGEYPWMVALQSsngPSGQFCGGTLIAPRWVLTAAHCVD---GDGPSDLRVVIGSTDLStSGG 98
                         90       100       110
                 ....*....|....*....|....*....|...
gi 332164762 393 EAASIAEIIINSNYTDEEDDYDIALMRLSKPLT 425
Cdd:COG5640   99 TVVKVARIVVHPDYDPATPGNDIALLKLATPVP 131
Trypsin pfam00089
Trypsin;
326-441 6.18e-25

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 102.52  E-value: 6.18e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  326 IVGGALASDSKWPWQVSLHFGT-THICGGTLIDAQWVLTAAHCFfvtreKVLEGWKVYAGTSNLHQLPEA---ASIAEII 401
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSgKHFCGGSLISENWVLTAAHCV-----SGASDVKVVLGAHNIVLREGGeqkFDVEKII 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 332164762  402 INSNYTDEEDDYDIALMRLSKPLTLSG--EGICTPRSPAPQP 441
Cdd:pfam00089  76 VHPNYNPDTLDNDIALLKLESPVTLGDtvRPICLPDASSDLP 117
PHA03378 PHA03378
EBNA-3B; Provisional
9-148 2.22e-12

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 69.71  E-value: 2.22e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03378 689 WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 768
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  89 ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPiRSSPARSAPATRATRE 148
Cdd:PHA03378 769 AAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP-RAAPGQQGPTKQILRQ 827
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
245-316 2.17e-06

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 46.18  E-value: 2.17e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762   245 QWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHrDFANSFSILRYNSTI--QESlHRSECPSQRYISLQCSH 316
Cdd:smart00202  21 QWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAY-FGPGSGPIWLDNVRCsgTEA-SLSDCPHSGWGSHNCSH 92
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2-155 2.83e-05

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.49  E-value: 2.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    2 ERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASP 81
Cdd:pfam17823  89 EHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAP 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   82 AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSA--------PATRATRESPGTS 153
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTAlaavgnssPAAGTVTAAVGTV 248

                  ..
gi 332164762  154 LP 155
Cdd:pfam17823 249 TP 250
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
9-70 4.31e-04

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.57  E-value: 4.31e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGraspAQASPAQASPAGTPPGRA--SPAQASPAGTPPGRAS 70
Cdd:TIGR00601  85 APPAATPTSAPTPTPSPPASPASG----MSAAPASAVEEKSPSEESatATAPESPSTSVPSSGS 144
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
6-91 2.24e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 40.37  E-value: 2.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQAS-PAGTPP--GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA 82
Cdd:NF041121  14 QMGRAAAPPSPEGPAPTAASqPATPPPpaAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAG--AAPGAALPV 91

                 ....*....
gi 332164762  83 QASPARASP 91
Cdd:NF041121  92 RVPAPPALP 100
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
6-91 2.51e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 40.24  E-value: 2.51e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQAS 85
Cdd:cd23959  159 HPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAPSSGAPDGFPAEASAPSPFAAPASAASFPAAPVAN 238

                 ....*.
gi 332164762  86 PARASP 91
Cdd:cd23959  239 GEAATP 244
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
210-226 4.41e-03

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal region of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins that are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 34.87  E-value: 4.41e-03
                         10
                 ....*....|....*..
gi 332164762 210 RCDGVVDCKLKSDELGC 226
Cdd:cd00112   19 VCDGEDDCGDGSDEENC 35
 
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
15-142 3.54e-10

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 62.42  E-value: 3.54e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  15 PSAGASPAQASPAGTP--PGRASPAqASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPA 92
Cdd:PRK14951 366 PAAAAEAAAPAEKKTParPEAAAPA-AAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 332164762  93 LASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPA 142
Cdd:PRK14951 445 VALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAP---AAARLTPT 491
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
7-91 7.12e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 61.54  E-value: 7.12e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   7 GNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASP 86
Cdd:PRK07764 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471

                 ....*
gi 332164762  87 ARASP 91
Cdd:PRK07764 472 AAPEP 476
PHA03378 PHA03378
EBNA-3B; Provisional
8-155 7.85e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 61.62  E-value: 7.85e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   8 NASPARTPSAgaSPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PHA03378 680 GANTMLPIQW--APGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP 757
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332164762  88 RASPALASLSRSSSGRSSSARSASVTTSPtrvylvRATPVGA-VPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03378 758 AAAPGRARPPAAAPGAPTPQPPPQAPPAP------QQRPRGApTPQPPPQAGPTSMQLMPRAAPGQQGP 820
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-155 9.28e-10

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 61.02  E-value: 9.28e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAS---------PAGTPPGRASPGRASPAQA 79
Cdd:PRK07003 385 ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAdrgddaadgDAPVPAKANARASADSRCD 464
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332164762  80 SPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVR-ATPVGAVPIRSSPARSA-PATRATRESPGTSLP 155
Cdd:PRK07003 465 ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPdARAPAAASREDAPAAAApPAPEARPPTPAAAAP 542
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
3-170 3.37e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 59.12  E-value: 3.37e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PRK12323 420 AAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  83 QASP---ARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLPKFT- 158
Cdd:PRK12323 500 EELPpefASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDg 579
                        170
                 ....*....|...
gi 332164762 159 -WREGQKQLPLIG 170
Cdd:PRK12323 580 dWPALAARLPVRG 592
PHA03378 PHA03378
EBNA-3B; Provisional
4-150 3.79e-09

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 59.31  E-value: 3.79e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQ 83
Cdd:PHA03378 644 NVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGR 723
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 332164762  84 ASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PHA03378 724 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRP 790
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
15-155 5.37e-09

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 58.84  E-value: 5.37e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  15 PSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPALA 94
Cdd:PRK07764 365 PSASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332164762  95 SLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PRK07764 445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
21-151 5.51e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 58.57  E-value: 5.51e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  21 PAQASPAGTPPGRASPAQasPAQASPAGTPPGRASPAQAsPAGTPPGRASPGRASPAQASPAQASPARASPALASLSRSS 100
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPAR--PEAAAPAAAPVAQAAAAPA-PAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 332164762 101 SGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPA-TRATRESPG 151
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAaARLTPTEEG 494
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
9-130 5.51e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 58.57  E-value: 5.51e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQAS----PAQASPAGTPPGRASPAQASP----AGTPPGRASPGRASPAQAS 80
Cdd:PRK14951 372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPaaapAAAASAPAAPPAAAPPAPVAApaaaAPAAAPAAAPAAVALAPAP 451
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332164762  81 PAQAS------PARASPALASLSRSSSGRSSSARSASVTTSPTRVY------LVRATPVGAV 130
Cdd:PRK14951 452 PAQAApetvaiPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDVWhatvqqLAAAEAITAL 513
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
15-155 1.03e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 57.94  E-value: 1.03e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  15 PSAGASPAQASPAGTPPGRASP---AQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:PRK07003 368 PGGGVPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762  92 alASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPATRATRESPGTSLP 155
Cdd:PRK07003 448 --PVPAKANARASADSRCDERDAQPPADSGSASAPASDAP---PDAAFEPAPRAAAPSAATPAA 506
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
5-150 1.63e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.30  E-value: 1.63e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   5 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK07764 637 AEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332164762  85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYlvrATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PRK07764 717 PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPA---GAPAQPPPPPAPAPAAAPAAAPPPSPP 779
eMpr COG3591
V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, ...
346-425 2.27e-08

V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 442810 [Multi-domain]  Cd Length: 194  Bit Score: 53.91  E-value: 2.27e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762 346 GTTHICGGTLIDAQWVLTAAHCFF-VTREKVLEGWKVYAGTSNLHqlPEAASIAEIIINSNYTDEED-DYDIALMRLSKP 423
Cdd:COG3591    9 GGGGVCTGTLIGPNLVLTAGHCVYdGAGGGWATNIVFVPGYNGGP--YGTATATRFRVPPGWVASGDaGYDYALLRLDEP 86

                 ..
gi 332164762 424 LT 425
Cdd:COG3591   87 LG 88
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
31-163 5.24e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 55.49  E-value: 5.24e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  31 PGRASPAQASPAQASPAgtPPGRASPAQASPAGTPPGRAsPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSA 110
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPA--RPEAAAPAAAPVAQAAAAPA-PAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|...
gi 332164762 111 SVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLPKFTWREGQ 163
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGD 495
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-172 6.01e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 6.01e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAgTPPGRASPAQASPAGTPPGRASPGRaSPAQASPAQASPAR 88
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPA 2773
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   89 ---ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPAR-SAPATRATRESPGTSLPKFtwregQK 164
Cdd:PHA03247 2774 apaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGpLPPPTSAQPTAPPPPPGPP-----PP 2848

                  ....*...
gi 332164762  165 QLPLIGCV 172
Cdd:PHA03247 2849 SLPLGGSV 2856
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
9-91 6.03e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 55.26  E-value: 6.03e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAS-PAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQlLAARQQLQRAQGATKAKKSEPAAASRA 449

                 ....
gi 332164762  88 RASP 91
Cdd:PRK07994 450 RPVN 453
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1-91 1.00e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 1.00e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   1 MERdSHGNASPARTPSAGASP-AQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQA 79
Cdd:PRK07764 381 LER-RLGVAGGAGAPAAAAPSaAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPA 459
                         90
                 ....*....|..
gi 332164762  80 SPAQASPARASP 91
Cdd:PRK07764 460 AAPSAQPAPAPA 471
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
4-91 1.41e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 53.95  E-value: 1.41e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASP----AQASPAQASPAGTPPGRASPAQASPAG-TPPGRASPGRASPAQ 78
Cdd:PRK14951 396 QAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAApaaaAPAAAPAAAPAAVALAPAPPAQAAPETvAIPVRVAPEPAVASA 475
                         90
                 ....*....|...
gi 332164762  79 ASPAQASPARASP 91
Cdd:PRK14951 476 APAPAAAPAAARL 488
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
5-170 1.94e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 1.94e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   5 SHGNASPArtpsagaSPAQASPAGTPPGRASPAQASPAQASPAGTPPgrasPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK12323 368 SGGGAGPA-------TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPA----AAPAAAAAARAVAAAPARRSPAPEALAAA 436
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  85 SPARAspalaSLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPATRATRESPGTSlpkfTWREGQK 164
Cdd:PRK12323 437 RQASA-----RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAA---APARAAPAAAPAPADDDPP----PWEELPP 504

                 ....*.
gi 332164762 165 QLPLIG 170
Cdd:PRK12323 505 EFASPA 510
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
7-91 3.44e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 52.86  E-value: 3.44e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   7 GNASPARTPSAGASPAQASPA-GTPPGRASPAQASPAQASPAGTPPgrASPAQASPAGTPPGRASPgRASPAQASPAQAS 85
Cdd:PRK14971 367 DDASGGRGPKQHIKPVFTQPAaAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVD-PPAAVPVNPPSTA 443

                 ....*.
gi 332164762  86 PARASP 91
Cdd:PRK14971 444 PQAVRP 449
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-150 3.58e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 3.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    5 SHGNASPArTPSAGASPAQASPAGTPPGRASPAQ----ASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03247 2727 AARQASPA-LPAAPAPPAVPAGPATPGGPARPARppttAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332164762   81 PAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGA--VP----IRSSPARSAPATRATRESP 150
Cdd:PHA03247 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvAPggdvRRRPPSRSPAAKPAAPARP 2881
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
14-87 5.32e-07

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 52.59  E-value: 5.32e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762   14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAA 112
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
5-152 9.66e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.71  E-value: 9.66e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    5 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PHA03307  101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332164762   85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAvPIRSSPARSAPATRATRESPGT 152
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPG-RSAADDAGASSSDSSSSESSGC 247
PHA03378 PHA03378
EBNA-3B; Provisional
3-87 1.08e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.61  E-value: 1.08e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSagASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASP---AGTPPGRASPG-----RA 74
Cdd:PHA03378 715 QRPAAATGRARPPA--AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPgapTPQPPPQAPPApqqrpRG 792
                         90
                 ....*....|...
gi 332164762  75 SPAQASPAQASPA 87
Cdd:PHA03378 793 APTPQPPPQAGPT 805
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-152 1.31e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 1.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    4 DSHGNASPartPSAGASPAQASPAGTPPGRASPAQASPAQASPA---GTPPGRASP-AQASPAGTPPGRASPGRASPAQA 79
Cdd:PHA03247 2546 DDAGDPPP---PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRArrpDAPPQSARPrAPVDDRGDPRGPAPPSPLPPDTH 2622
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332164762   80 SPAQASPARaSPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGT 152
Cdd:PHA03247 2623 APDPPPPSP-SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-155 1.62e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.75  E-value: 1.62e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQAS--PAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASP 86
Cdd:PRK07764 609 PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPgvAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPA 688
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332164762  87 ARASPALASLSRSSSGRSSSARSASVTTSPTRvylVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PRK07764 689 APAAPAGAAPAQPAPAPAATPPAGQADDPAAQ---PPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGA 754
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3-140 1.89e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 1.89e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332164762  83 -----QASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSA 140
Cdd:PRK07764 743 pepddPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVA 805
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-150 2.11e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 50.37  E-value: 2.11e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASP-------------AQASPAGTPPGRASPGRAS 75
Cdd:PRK07764 613 ARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPdasdggdgwpakaGGAAPAAPPPAPAPAAPAA 692
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  76 PAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATP------VGAVPIRSSPARSAPATRATRES 149
Cdd:PRK07764 693 PAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPepddppDPAGAPAQPPPPPAPAPAAAPAA 772

                 .
gi 332164762 150 P 150
Cdd:PRK07764 773 A 773
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
245-316 2.17e-06

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 46.18  E-value: 2.17e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762   245 QWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHrDFANSFSILRYNSTI--QESlHRSECPSQRYISLQCSH 316
Cdd:smart00202  21 QWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAY-FGPGSGPIWLDNVRCsgTEA-SLSDCPHSGWGSHNCSH 92
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-153 2.90e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 2.90e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PHA03307  141 VGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSP 220
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332164762   83 QASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPArSAPATRATRESPGTS 153
Cdd:PHA03307  221 APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-NGPSSRPGPASSSSS 290
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
14-153 3.94e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 3.94e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGtPPGRASPAQASPAGTPPGrasPGRASPAQASPAQASPARASPAL 93
Cdd:PRK07764 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAA-PAAPAAPAAPAPAGAAAA---PAEASAAPAPGVAAPEHHPKHVA 658
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  94 ASLSRSSSGRSSSARSASVTTSPTrvylVRATPVGAVPIRSSPARSAPATRATRESPGTS 153
Cdd:PRK07764 659 VPDASDGGDGWPAKAGGAAPAAPP----PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
PHA03247 PHA03247
large tegument protein UL36; Provisional
11-168 1.07e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 1.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   11 PARTPSAGASPAQASPAGTPPGRASP-AQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARA 89
Cdd:PHA03247 2573 PAPRPSEPAVTSRARRPDAPPQSARPrAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP 2652
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   90 SPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSS----PARSAPATRATRESPGTSLPKFTWREGQKQ 165
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLadppPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732

                  ...
gi 332164762  166 LPL 168
Cdd:PHA03247 2733 PAL 2735
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-155 1.07e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 1.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    7 GNASPARTPSAGASPAQASpagTPPGRASPAQASPAQAS---PAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPaq 83
Cdd:PHA03247 2659 GRVSRPRRARRLGRAAQAS---SPPQRPRRRAARPTVGSltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP-- 2733
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332164762   84 ASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRAtPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03247 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA-PAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
9-156 1.60e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 1.60e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQA---SPAGTPPGRASPGRAsPAQASPAQAS 85
Cdd:PHA03307  157 ASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPrrsSPISASASSPAPAPG-RSAADDAGAS 235
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332164762   86 PARASPALASLSRSSSGRSSSARSASVTTSPTRVYlvRATPVGAVPIRSSPARSAPATRATRESPGTSLPK 156
Cdd:PHA03307  236 SSDSSSSESSGCGWGPENECPLPRPAPITLPTRIW--EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG 304
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-88 2.39e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.15  E-value: 2.39e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAqasPAQASPAR 88
Cdd:PRK07003 468 AQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT---PAAAAPAA 544
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
2-155 2.83e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 46.49  E-value: 2.83e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    2 ERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASP 81
Cdd:pfam17823  89 EHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAP 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   82 AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSA--------PATRATRESPGTS 153
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTAlaavgnssPAAGTVTAAVGTV 248

                  ..
gi 332164762  154 LP 155
Cdd:pfam17823 249 TP 250
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1-91 3.57e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 46.21  E-value: 3.57e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   1 MERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PRK14959 386 AEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGA 465
                         90
                 ....*....|.
gi 332164762  81 PAQASPARASP 91
Cdd:PRK14959 466 PDSVASASDAP 476
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
6-147 3.90e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 3.90e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQASPAGTPPgrASPAQASPAQASPAGTPPGRASPAQAS-PAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPA--APAAPAAPAAPAPAGAAAAPAEASAAPaPGVAAPEHHPKHVAVPDASDGGDG 668
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332164762  85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATR 147
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-91 9.15e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 9.15e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGT------PPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PRK07764 429 PQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPApaaapePTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508

                 ....*....
gi 332164762  83 QASPARASP 91
Cdd:PRK07764 509 AATLRERWP 517
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-167 1.00e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 1.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   10 SPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRA-SPGRASPAQASPAQASPAR 88
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRArRLGRAAQASSPPQRPRRRA 2687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   89 ASPALASLSRSSSGRSSSArsasvTTSPTRVYLVRATPVGAVPIRSSPARSA-PATRATRESP-GTSLPKFTWREGQKQL 166
Cdd:PHA03247 2688 ARPTVGSLTSLADPPPPPP-----TPEPAPHALVSATPLPPGPAAARQASPAlPAAPAPPAVPaGPATPGGPARPARPPT 2762

                  .
gi 332164762  167 P 167
Cdd:PHA03247 2763 T 2763
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-91 1.10e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    3 RDSHGNASPARTPSaGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PHA03307  339 AAVSPGPSPSRSPS-PSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPL 417

                  ....*....
gi 332164762   83 QASPARASP 91
Cdd:PHA03307  418 DAGAASGAF 426
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-91 1.95e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQAS---------PAGTPPGRASPAQASPAGTPPGRASPGRASPAQ- 78
Cdd:PHA03247  378 ASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASvptpaptpvPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAp 457
                          90
                  ....*....|....
gi 332164762   79 -ASPAQASPARASP 91
Cdd:PHA03247  458 aTEPAPDDPDDATR 471
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-146 2.10e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.07  E-value: 2.10e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASPAGT----PPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGrASPAQASPAQA 84
Cdd:PRK07003 420 ATRAEAPPAAPAPPATADRGDdaadGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPD-AAFEPAPRAAA 498
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332164762  85 SPARASPALASLSRSSSGRSSSARSASVTTSPtrvylvRATPvgavpirSSPARSAPATRAT 146
Cdd:PRK07003 499 PSAATPAAVPDARAPAAASREDAPAAAAPPAP------EARP-------PTPAAAAPAARAG 547
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
7-149 2.27e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.01  E-value: 2.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    7 GNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASP 86
Cdd:PHA03307   68 PTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPP 147
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332164762   87 ARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRA--TPVGAVPIRSSPARSAPATRATRES 149
Cdd:PHA03307  148 PAASPPAAGASPAAVASDAASSRQAALPLSSPEETARApsSPPAEPPPSTPPAAASPRPPRRSSP 212
motB PRK05996
MotB family protein;
1-90 2.33e-04

MotB family protein;


Pssm-ID: 235665 [Multi-domain]  Cd Length: 423  Bit Score: 43.53  E-value: 2.33e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   1 MERDSHGNASP---ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPA 77
Cdd:PRK05996 190 VEVTTAGDLLPpgqAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAATGEPAANAAKAAKPEPMPDD 269
                         90
                 ....*....|...
gi 332164762  78 QASPAQASPARAS 90
Cdd:PRK05996 270 QQKEAEQLQAAIA 282
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
8-77 2.59e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.73  E-value: 2.59e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    8 NASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPA 77
Cdd:PRK12270   57 APAAAPAAKAPAAPAPAPPAAAAP-AAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGAAA 125
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
9-87 2.70e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 2.70e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGaSPAQASPAGTPPGRASPAQasPAQASPAGTPPGRASPAQASPAGTPP-GRASPGRASPAQASPAQASPA 87
Cdd:PRK14950 372 TAAAPSPVRP-TPAPSTRPKAAAAANIPPK--EPVRETATPPPVPPRPVAPPVPHTPEsAPKLTRAAIPVDEKPKYTPPA 448
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-153 3.10e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 3.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   12 ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPaQASPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER-PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332164762   92 ALASLSRSSSGRSSSARSASV---TTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTS 153
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPwlgALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-155 3.77e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 3.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASP---AGTPPGRASPAQASPAGTPPGRASPGRASPAQASP---- 81
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADppaAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGpppp 2848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   82 --------------AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVylvraTPVGAVPIRSSPARSAPATRATR 147
Cdd:PHA03247 2849 slplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-----PPDQPERPPQPQAPPPPQPQPQP 2923

                  ....*...
gi 332164762  148 ESPGTSLP 155
Cdd:PHA03247 2924 PPPPQPQP 2931
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
14-86 4.05e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.87  E-value: 4.05e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332164762  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGraSPAQASPAgTPPGRASPGRASPAQASPAQASP 86
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP--KEPVRETA-TPPPVPPRPVAPPVPHTPESAPK 430
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
9-70 4.31e-04

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.57  E-value: 4.31e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGraspAQASPAQASPAGTPPGRA--SPAQASPAGTPPGRAS 70
Cdd:TIGR00601  85 APPAATPTSAPTPTPSPPASPASG----MSAAPASAVEEKSPSEESatATAPESPSTSVPSSGS 144
PHA02682 PHA02682
ORF080 virion core protein; Provisional
8-91 5.18e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 41.77  E-value: 5.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   8 NASPARTPSaGASPAQASPAGTPPGRASPAQASPAQAsPAGTPPgraSPAQASPAGT----PPGRASPGRASPAQASPAQ 83
Cdd:PHA02682  69 NSACMQRPS-GQSPLAPSPACAAPAPACPACAPAAPA-PAVTCP---APAPACPPATaptcPPPAVCPAPARPAPACPPS 143

                 ....*...
gi 332164762  84 ASPARASP 91
Cdd:PHA02682 144 TRQCPPAP 151
PHA03377 PHA03377
EBNA-3C; Provisional
19-150 5.40e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 42.73  E-value: 5.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   19 ASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPALASLSR 98
Cdd:PHA03377  550 ATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSV 629
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 332164762   99 SSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PHA03377  630 VRMFLRERLLEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQSTPPRP 681
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
10-91 5.66e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.42  E-value: 5.66e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  10 SPARTPSAGASPAQASPAGTPPGRASPAQAspaqasPAGTPPGRASPAQASPAGTPPGraspgrasPAQASPAQASPARA 89
Cdd:PRK14965 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPP------AAPARPAAARPAPAPAPPAAAA--------PPARSADPAAAASA 446

                 ..
gi 332164762  90 SP 91
Cdd:PRK14965 447 GD 448
PRK10856 PRK10856
cytoskeleton protein RodZ;
4-91 5.69e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.94  E-value: 5.69e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPpgraSPAQASPAGT--PPGRASPGRASPAQASP 81
Cdd:PRK10856 166 TSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAP----SQANVDTAATpaPAAPATPDGAAPLPTDQ 241
                         90
                 ....*....|
gi 332164762  82 AQASPARASP 91
Cdd:PRK10856 242 AGVSTPAADP 251
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-86 6.46e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 6.46e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    3 RDSHGNASPARTPSAGASPaqasPAGTPPGRASPAQASPAQASPAGTPPGRASPAqASPAGTPPGRASPgrasPAQASPA 82
Cdd:PHA03247  397 RGPGGDDQTRPAAPVPASV----PTPAPTPVPASAPPPPATPLPSAEPGSDDGPA-PPPERQPPAPATE----PAPDDPD 467

                  ....
gi 332164762   83 QASP 86
Cdd:PHA03247  468 DATR 471
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-156 7.15e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 7.15e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    4 DSHGNASPARTPSAGAS-PAQASPAGTPPGRASPAQASPAQASPA--GTPPGRASPAQASPAGTPPGRASPGRAS----- 75
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANePDPHPPPTVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSltsla 2699
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   76 -PAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSL 154
Cdd:PHA03247 2700 dPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779

                  ..
gi 332164762  155 PK 156
Cdd:PHA03247 2780 PR 2781
flhF PRK06995
flagellar biosynthesis protein FlhF;
11-91 7.26e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 41.88  E-value: 7.26e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  11 PARTPSAGASPAQASPAGTPP------GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK06995  66 PAAAPAAVSRPAAPAAEPAPWlvehakRLTAQREQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVP 145

                 ....*..
gi 332164762  85 SPARASP 91
Cdd:PRK06995 146 ADAAAAV 152
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
1-72 7.53e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.04  E-value: 7.53e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332164762   1 MERDSHGNASPARTPSAGASPAQASPAGTPP----GRASPAQASPAQASPAGTPPGrasPAQASPAGTPPGRASPG 72
Cdd:PRK14965 377 LERGAPAPPSAAWGAPTPAAPAAPPPAAAPPvppaAPARPAAARPAPAPAPPAAAA---PPARSADPAAAASAGDR 449
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
16-90 7.87e-04

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 41.98  E-value: 7.87e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332164762   16 SAGASPAQASPA-GTPPGRASPAQASPAQASPAGTPPGRA---SPAQASPAGTPPGRASPGRASPAQASPAQASPARAS 90
Cdd:pfam03546 246 PAAATPAQAKPAlKTPQTKASPRKGTPITPTSAKVPPVRVgtpAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEES 324
PHA03377 PHA03377
EBNA-3C; Provisional
3-86 8.16e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 41.96  E-value: 8.16e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGrASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PHA03377  545 RRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPS-TGPRQQAKCKDGPPASGPHEKQPPSSAPR 623

                  ....
gi 332164762   83 QASP 86
Cdd:PHA03377  624 DMAP 627
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
14-87 9.85e-04

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 41.33  E-value: 9.85e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRAsPAQASPAQASPA 87
Cdd:PRK12373 231 LAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPA-PKEAAKAAAAAA 303
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
1-90 1.10e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 41.60  E-value: 1.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    1 MERDSHGNASPARTPSAGASPAQASPAG-TPPGRASPAQASPAQASPAGTPPGRASPA-----QASPA-GTPPGRASPGR 73
Cdd:pfam03546 382 AQEDSESSEEESDSEEAAATPAQVKASGkTPQAKANPAPTKASSAKGAASAPGKVVAAaaqakQGSPAkVKPPARTPQNS 461
                          90       100       110
                  ....*....|....*....|....*....|.
gi 332164762   74 ASPAQ--------------ASPAQASPARAS 90
Cdd:pfam03546 462 AISVRgqasvpavgkavatAAQAQKGPVGGP 492
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
8-152 1.12e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 1.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    8 NASPARTPSAGASP-AQASPAGTPPGRASPAQASPAQASPAGTPPGRASP-AQASPAGTPPGRA-SPGRASPAQASPAQA 84
Cdd:PHA03307  287 SSSSPRERSPSPSPsSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSPSRSpSPSRPPPPADPSSPR 366
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332164762   85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGT 152
Cdd:PHA03307  367 KRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLT 434
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
3-90 1.16e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 41.59  E-value: 1.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQA-SPAGTPPGRASPAQA-SPA-------GTPPGRASPGR 73
Cdd:PRK14959 372 RPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGmTPSSAAPATPAPSAApSPRvpwddapPAPPRSGIPPR 451
                         90
                 ....*....|....*....
gi 332164762  74 ASPA--QASPAQASPARAS 90
Cdd:PRK14959 452 PAPRmpEASPVPGAPDSVA 470
PHA03378 PHA03378
EBNA-3B; Provisional
3-83 1.17e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.59  E-value: 1.17e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSAG---ASPAQASPAGTPPGRASPAQASPAQaSPAGTPPGRASPaQASPAGTPPGRASPG------R 73
Cdd:PHA03378 735 RPPAAAPGRARPPAAApgrARPPAAAPGRARPPAAAPGAPTPQP-PPQAPPAPQQRP-RGAPTPQPPPQAGPTsmqlmpR 812
                         90
                 ....*....|
gi 332164762  74 ASPAQASPAQ 83
Cdd:PHA03378 813 AAPGQQGPTK 822
flhF PRK06995
flagellar biosynthesis protein FlhF;
9-91 1.23e-03

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 41.11  E-value: 1.23e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASPAQASP-AGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK06995  42 ALADSDLAALAPPAAAAPaAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAAPAE 121

                 ....
gi 332164762  88 RASP 91
Cdd:PRK06995 122 RAAA 125
PHA03381 PHA03381
tegument protein VP22; Provisional
10-91 1.70e-03

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 40.38  E-value: 1.70e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  10 SPARTPSAGASPAQASPAG---------TPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASP-AQA 79
Cdd:PHA03381  39 EPADRARRGAGQARGRSQAerrfhhydeARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAGSrGRG 118
                         90
                 ....*....|..
gi 332164762  80 SPAQASPARASP 91
Cdd:PHA03381 119 RRAESPSPRDPP 130
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
6-91 2.24e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 40.37  E-value: 2.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQAS-PAGTPP--GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA 82
Cdd:NF041121  14 QMGRAAAPPSPEGPAPTAASqPATPPPpaAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAG--AAPGAALPV 91

                 ....*....
gi 332164762  83 QASPARASP 91
Cdd:NF041121  92 RVPAPPALP 100
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
21-150 2.36e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 2.36e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  21 PAQASPAgtppgraspAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPALASLSRSS 100
Cdd:PRK07994 361 PAAPLPE---------PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQR 431
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 332164762 101 SGRSSSARSASVTTSPtrvylvRATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PRK07994 432 AQGATKAKKSEPAAAS------RARPVNSALERLASVRPAPSALEKAPAK 475
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-161 2.38e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.54  E-value: 2.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    3 RDSHGNASPARTPSAG--ASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03307  271 EASGWNGPSSRPGPASssSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   81 PaqaSPARASPALASLSRSSSGRSSSARSASVTTSPT-RVYLVRATPVGAVPIRSSPARS-APATRATRESPGTSLPKFT 158
Cdd:PHA03307  351 P---SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRpTRRRARAAVAGRARRRDATGRFpAGRPRPSPLDAGAASGAFY 427

                  ...
gi 332164762  159 WRE 161
Cdd:PHA03307  428 ARY 430
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
3-144 2.42e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 40.32  E-value: 2.42e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   3 RDSHGNASPARTPSAGASPAQA-SPAGTPPGRASPAQASPAQA-SPAGTPPGRASPAQASPAGTPPGRASPgrASPAQAS 80
Cdd:PTZ00436 214 KKSAKAAAPAKAAAAPAKAAAPpAKAAAAPAKAAAAPAKAAAPpAKAAAPPAKAAAPPAKAAAPPAKAAAP--PAKAAAP 291
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332164762  81 PAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPATR 144
Cdd:PTZ00436 292 PAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKA----ATPPAKAAAPPAKAAAAPVGK 351
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
21-90 2.44e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.26  E-value: 2.44e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332164762   21 PAQASPAGTPPGRASPAQASPAQASPAGTPPGR-ASPAQASPAGTPPGRASPgraSPAQASPAQASPARAS 90
Cdd:TIGR00601  77 PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMsAAPASAVEEKSPSEESAT---ATAPESPSTSVPSSGS 144
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
6-91 2.51e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 40.24  E-value: 2.51e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQAS 85
Cdd:cd23959  159 HPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAPSSGAPDGFPAEASAPSPFAAPASAASFPAAPVAN 238

                 ....*.
gi 332164762  86 PARASP 91
Cdd:cd23959  239 GEAATP 244
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
9-155 2.78e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 39.93  E-value: 2.78e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPARTPSAGASpAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASpAGTPPGRASpgrASPAQAS--PAQASP 86
Cdd:PTZ00436 209 AAPSGKKSAKAA-APAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAK-AAAPPAKAA---APPAKAAapPAKAAA 283
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332164762  87 ARASPALASLSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PTZ00436 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKA----AAPPAKAAAPPAKAATPPAKAAAPPAKAAAAP 348
PRK12495 PRK12495
hypothetical protein; Provisional
4-90 2.79e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 39.47  E-value: 2.79e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   4 DSHGNASPARTPSAGAS----PAQASPAGTPPGRASPAQASPAQASPA---GTPPGRASPAQASPAGTPPGRAS--PGRA 74
Cdd:PRK12495  76 DDAGDGAEATAPSDAGSqaspDDDAQPAAEAEAADQSAPPEASSTSATdeaATDPPATAAARDGPTPDPTAQPAtpDERR 155
                         90
                 ....*....|....*.
gi 332164762  75 SPAQASPAQASPARAS 90
Cdd:PRK12495 156 SPRQRPPVSGEPPTPS 171
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
4-155 2.97e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.22  E-value: 2.97e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAspAGTPPGRAS------------- 70
Cdd:PRK07003 483 DAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPA--AAAPAARAGgaaaaldvlrnag 560
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  71 ------PGRASPAQASPAQASPARASPalaslsrsssgrsssarsasvttSPTRVYLVRATPVGavPIRSSPARSAPATR 144
Cdd:PRK07003 561 mrvssdRGARAAAAAKPAAAPAAAPKP-----------------------AAPRVAVQVPTPRA--RAATGDAPPNGAAR 615
                        170
                 ....*....|...
gi 332164762 145 ATR--ESPGTSLP 155
Cdd:PRK07003 616 AEQaaESRGAPPP 628
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
36-91 3.01e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.06  E-value: 3.01e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 332164762   36 PAQASPAQASPAG-TPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:pfam03546 397 EAAATPAQVKASGkTPQAKANPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKP 453
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
12-155 3.26e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 39.55  E-value: 3.26e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  12 ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPA----GTPPGRASPAQASpAGTPPGRASPGRASpAQASPAQASPA 87
Cdd:PTZ00436 193 AAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAApakaAAPPAKAAAAPAK-AAAAPAKAAAPPAK-AAAPPAKAAAP 270
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332164762  88 RASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSP---ARSAPATRATRESPGTSLP 155
Cdd:PTZ00436 271 PAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPpakAAAPPAKAATPPAKAAAPP 341
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
20-90 3.41e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 39.85  E-value: 3.41e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 332164762  20 SPAQASPAGTPPGRASPAQASPAQ----ASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQA-SPAQASPARAS 90
Cdd:cd23959  155 MFGQHPPPAKPLPAAAAAQQSSASpgevASPFASGTVSASPFATATDTAPSSGAPDGFPAEASApSPFAAPASAAS 230
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
23-143 3.57e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 40.05  E-value: 3.57e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762  23 QASPAGtppGRASPAQASPAQASPAG-----TPPGRASPAQASPA-GTPPGRASPGRASPAQAsPAQASP-ARASPALAS 95
Cdd:PRK14959 370 SLRPSG---GGASAPSGSAAEGPASGgaatiPTPGTQGPQGTAPAaGMTPSSAAPATPAPSAA-PSPRVPwDDAPPAPPR 445
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 332164762  96 LSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPAT 143
Cdd:PRK14959 446 SGIPPRPAPRMPEASPVPGAPDSV----ASASDAPPTLGDPSDTAEHT 489
PHA03377 PHA03377
EBNA-3C; Provisional
9-91 3.57e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 40.04  E-value: 3.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03377  530 AKPHRKVQDGFQRSGRRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPA 609

                  ...
gi 332164762   89 ASP 91
Cdd:PHA03377  610 SGP 612
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
9-155 3.74e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 3.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    9 ASPARTPSAGASPAQASPAGTPPGRASPAqaspaqaspagTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03307  260 PAPITLPTRIWEASGWNGPSSRPGPASSS-----------SSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSS 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 332164762   89 ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03307  329 TSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2-91 3.77e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.15  E-value: 3.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    2 ERDSHGNASPARtPSAGASPAQASPAG-----TPPGRASPAQASPA---QASPAGTPPGRA-SPAQASPAGTPPGRASPG 72
Cdd:PHA03307  291 PRERSPSPSPSS-PGSGPAPSSPRASSsssssRESSSSSTSSSSESsrgAAVSPGPSPSRSpSPSRPPPPADPSSPRKRP 369
                          90
                  ....*....|....*....
gi 332164762   73 RASPAQASPAQASPARASP 91
Cdd:PHA03307  370 RPSRAPSSPAASAGRPTRR 388
PHA03270 PHA03270
envelope glycoprotein C; Provisional
9-67 3.78e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165528 [Multi-domain]  Cd Length: 466  Bit Score: 39.53  E-value: 3.78e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   9 ASPART-PSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPG 67
Cdd:PHA03270  19 LCAGAGaPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA 78
PRK06975 PRK06975
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
17-77 3.84e-03

bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed


Pssm-ID: 235899 [Multi-domain]  Cd Length: 656  Bit Score: 39.70  E-value: 3.84e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 332164762  17 AGASPAQASPAgtpPGRASPAQASPaQASPAGTPPGRASPAQASPAGTPPG-RASPGRASPA 77
Cdd:PRK06975 268 AAAQPATAAPA---PSRMTDTNDSK-SVTSQPAAAAAAPAPPPNPPATPPEpPARRGRGSAA 325
PHA03381 PHA03381
tegument protein VP22; Provisional
2-90 3.91e-03

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 39.22  E-value: 3.91e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   2 ERDSHGNASPARTPSAGASPAQASPAGTPPgrasPAQASPAQASPAGTPPGRASPAQASpAGTPPGRASPGRASPAQAS- 80
Cdd:PHA03381  80 EDERPADPRPSRRPHAQPEASGPGPARGAR----GPAGSRGRGRRAESPSPRDPPNPKG-ASAPRGRKSACADSAALLDa 154
                         90
                 ....*....|
gi 332164762  81 PAQASPARAS 90
Cdd:PHA03381 155 PAPAAPKRQK 164
PDHac_trf_long TIGR01348
pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form; This model ...
25-87 4.40e-03

pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form; This model describes a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and per cent identity (UPGMA) trees. Members of this set include two or three copies of the lipoyl-binding domain. E. coli AceF is a member of this model, while mitochondrial and some other bacterial forms belong to a separate model. [Energy metabolism, Pyruvate dehydrogenase]


Pssm-ID: 273566 [Multi-domain]  Cd Length: 546  Bit Score: 39.47  E-value: 4.40e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 332164762   25 SPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRaSPAQASPaqASPA 87
Cdd:TIGR01348 191 VAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQ-NPAKVDH--AAPA 250
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
210-226 4.41e-03

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 34.87  E-value: 4.41e-03
                         10
                 ....*....|....*..
gi 332164762 210 RCDGVVDCKLKSDELGC 226
Cdd:cd00112   19 VCDGEDDCGDGSDEENC 35
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
7-91 4.48e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 39.69  E-value: 4.48e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   7 GN-ASPARTPSAGASPAQASPAGTPPGRASPAQASPaQASPAGTPPgrASPAQ-ASPAGTPP----GRASPGRASPAQAS 80
Cdd:PLN02217 564 GNpGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSP-PAGHLGSPP--ATPSKiVSPSTSPPashlGSPSTTPSSPESSI 640
                         90
                 ....*....|.
gi 332164762  81 PAQASPArASP 91
Cdd:PLN02217 641 KVASTET-ASP 650
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-91 6.17e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 39.37  E-value: 6.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    2 ERDSHGNASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRAS-PAQASPAGTPPGRASPGRASP---A 77
Cdd:pfam03154 153 DNESDSDSSAQQQILQTQPPVLQAQSGAAS-PPSPPPPGTTQAATAGPTPSAPSvPPQGSPATSQPPNQTQSTAAPhtlI 231
                          90
                  ....*....|....
gi 332164762   78 QASPAQASPARASP 91
Cdd:pfam03154 232 QQTPTLHPQRLPSP 245
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
1-91 6.41e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 38.72  E-value: 6.41e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   1 MERDSHGNASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRASPaQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03201   1 MKRARSRSPSPPRRPSPPRPTPPRSPDASPE-ETPPSPPGPGAEPPPGRAAGPAAP-RRRPRGCPAGVTFSSSAPPRPPL 78
                         90
                 ....*....|.
gi 332164762  81 PAQASPARASP 91
Cdd:PHA03201  79 GLDDAPAATPP 89
aceF PRK11854
pyruvate dehydrogenase dihydrolipoyltransacetylase; Validated
17-71 7.08e-03

pyruvate dehydrogenase dihydrolipoyltransacetylase; Validated


Pssm-ID: 236999 [Multi-domain]  Cd Length: 633  Bit Score: 38.83  E-value: 7.08e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 332164762  17 AGASPAQASPagtPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASP 71
Cdd:PRK11854 281 EGAAPAAAPA---KQEAAAPAPAAAKAEAPAAAPAAKAEGKSEFAENDAYVHATP 332
PRK10856 PRK10856
cytoskeleton protein RodZ;
6-91 7.19e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 38.47  E-value: 7.19e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAgtppgrASPAQASPAGTPPGRASPgrASPAQASPAQAS 85
Cdd:PRK10856 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPA------PAVDPQQNAVVAPSQANV--DTAATPAPAAPA 229

                 ....*.
gi 332164762  86 PARASP 91
Cdd:PRK10856 230 TPDGAA 235
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
23-90 7.40e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 38.77  E-value: 7.40e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332164762  23 QASPAGTPPGRAS----PAQASPAQASPAG-TPPGRASPAQASPAG--TPPGRASPGRASPAQASPAQASPARAS 90
Cdd:PRK14954 381 APSPAGSPDVKKKapepDLPQPDRHPGPAKpEAPGARPAELPSPASapTPEQQPPVARSAPLPPSPQASAPRNVA 455
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-85 8.36e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.15  E-value: 8.36e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   16 SAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQAS 85
Cdd:PHA03247  369 SAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSA 438
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
8-91 8.58e-03

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 38.63  E-value: 8.58e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   8 NASPARTPSAGAspAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK12373 207 NASKALAEDIGD--TVKRIDGTEVPLLAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSG 284

                 ....
gi 332164762  88 RASP 91
Cdd:PRK12373 285 TAAA 288
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1-155 9.12e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 9.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    1 MERDSHGNASPARTPSAGASPAQASPAGtppGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03307  768 LAEALALLEPAEPQRGAGSSPPVRAEAA---FRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAAR 844
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 332164762   81 PaqaSPARASPALASLSRSSSGRSSSARSASVTTSPTRvylvRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03307  845 P---PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEP----RARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVK 912
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1-156 9.12e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 9.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762    1 MERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRA---------SP 71
Cdd:PHA03307  227 SAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSpspspsspgSG 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332164762   72 GRASPAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRvylVRATPVGAVPIRSSPARSAPATRATRESPG 151
Cdd:PHA03307  307 PAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSP---SRPPPPADPSSPRKRPRPSRAPSSPAASAG 383

                  ....*
gi 332164762  152 TSLPK 156
Cdd:PHA03307  384 RPTRR 388
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
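
The E-value threshold above determines which hits appear in the list. As a rough illustration only, the following Python sketch parses the per-hit summary lines (the "Pssm-ID: ... E-value: ..." lines in this listing) and keeps those at or below a cutoff. The function names `parse_summary` and `hits_below` are hypothetical, the regular expression is fitted to the line layout seen on this page rather than to any documented format, and whether CD-Search treats the threshold as inclusive is an assumption.

```python
import re

# Pattern fitted to the summary lines as they appear in this listing
# (an assumption, not a documented format).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_below(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    parsed = (parse_summary(line) for line in lines)
    return [h for h in parsed if h is not None and h["e_value"] <= threshold]


# Two summary lines copied from the hit list above.
sample = [
    "Pssm-ID: 238060  Cd Length: 35  Bit Score: 34.87  E-value: 4.41e-03",
    "Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 9.12e-03",
]

for hit in hits_below(sample, threshold=0.005):
    print(hit["pssm_id"], hit["e_value"])
```

With the stricter cutoff of 0.005 used in the loop, only the LDLa hit (Pssm-ID 238060) from the two sample lines is retained. With the page's own threshold of 0.01 both are kept.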

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.