Conserved domains on [gi|116256363|ref|NP_001070731|]

transmembrane protease serine 13 isoform 1 [Homo sapiens]

Protein Classification

SRCR_2 and Tryp_SPc domain-containing protein (domain architecture ID 12173813)

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
325-554 3.35e-99

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 300.36  E-value: 3.35e-99
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   325 RIVGGALASDSKWPWQVSLHF-GTTHICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPEA--ASIAEII 401
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYgGGRHFCGGSLISPRWVLTAAHCV---RGSDPSNIRVRLGSHDLSSGEEGqvIKVSKVI 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   402 INSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRETDDKTSPFLREVQVNLIDFKKCN 481
Cdd:smart00020  78 IHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNATCR 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363   482 DYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCeQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWI 554
Cdd:smart00020 158 RAYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC-NDGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
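The paired rows above can be compared column by column. The following is a minimal sketch, not part of the CDD output, that counts identical residues in one block of the Tryp_SPc (smart00020) alignment. It assumes that "-" marks a gap and that lowercase letters mark residues left unaligned to the domain model, so both are skipped. The function and constant names are hypothetical.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical_columns, aligned_columns) for two alignment rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    matches = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # Skip gap columns ("-") and lowercase residues, which are assumed
        # here to be unaligned insertions relative to the domain model.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            matches += 1
    return matches, aligned


# First block of the smart00020 alignment, copied from the listing above.
QUERY_ROW = "RIVGGALASDSKWPWQVSLHF-GTTHICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPEA--ASIAEII"
SUBJECT_ROW = "RIVGGSEANIGSFPWQVSLQYgGGRHFCGGSLISPRWVLTAAHCV---RGSDPSNIRVRLGSHDLSSGEEGqvIKVSKVI"

matches, aligned = aligned_identity(QUERY_ROW, SUBJECT_ROW)
print(f"{matches} identical residues over {aligned} aligned columns")
```

The same function can be applied to each block of an alignment and the counts summed to estimate identity over the whole hit.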
SRCR_2 pfam15494
Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain ...
231-321 4.96e-29

Scavenger receptor cysteine-rich domain; SRCR_2 is a scavenger receptor cysteine-rich domain family found largely on vertebrate sequences up-stream of the trypsin-like transmembrane serine protease, Spinesin.


Pssm-ID: 464747  Cd Length: 99  Bit Score: 110.50  E-value: 4.96e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  231 WDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRD----FANSFSIL---RYNSTIQESL-HRS 302
Cdd:pfam15494   1 GENFLLQVYSSARPSWLPVCSDDWNPAYGRAACQQLGYLRLTHHKSVNLTDissnSSQSFMKLnssSLNTDLYEALqPRD 80
                          90
                  ....*....|....*....
gi 116256363  303 ECPSQRYISLQCSHCGLRA 321
Cdd:pfam15494  81 SCSSGSVVSLRCSECGLRS 99
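Each hit carries a one-line statistics summary (Pssm-ID, Cd Length, Bit Score, E-value, and an optional "[Multi-domain]" flag). A minimal sketch of how such a line might be parsed follows. The regular expression and the function name are illustrative assumptions based only on the lines shown in this listing, not an official format specification.

```python
import re

# Pattern inferred from the statistics lines in this listing (an assumption).
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line: str):
    """Parse one 'Pssm-ID: ...' line into a dict, or return None if it does not match."""
    m = STAT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


print(parse_stat_line("Pssm-ID: 464747  Cd Length: 99  Bit Score: 110.50  E-value: 4.96e-29"))
```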
PHA03378 super family cl33729
EBNA-3B; Provisional
9-148 3.14e-13

EBNA-3B; Provisional


The actual alignment was detected with superfamily member PHA03378:

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 72.79  E-value: 3.14e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03378 689 WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 768
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  89 ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPiRSSPARSAPATRATRE 148
Cdd:PHA03378 769 AAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMP-RAAPGQQGPTKQILRQ 827
 
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
326-557 5.54e-97

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. The alignment also contains inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 294.57  E-value: 5.54e-97
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 326 IVGGALASDSKWPWQVSLHFGT-THICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPE---AASIAEII 401
Cdd:cd00190    1 IVGGSEAKIGSFPWQVSLQYTGgRHFCGGSLISPRWVLTAAHCV---YSSAPSNYTVRLGSHDLSSNEGggqVIKVKKVI 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 402 INSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRETDdKTSPFLREVQVNLIDFKKCN 481
Cdd:cd00190   78 VHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGG-PLPDVLQEVNVPIVSNAECK 156
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116256363 482 DYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSK 557
Cdd:cd00190  157 RAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
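The description above notes that some family members are inactive because of substitutions in the catalytic triad. As a rough check, the query rows of this alignment can be scanned for the sequence motifs that usually surround the triad in trypsin-like proteases. The motif strings below (TAAHC for His, DIAL for Asp, GDSGGP for Ser) come from general knowledge of this protease family, not from this page, and all names in the sketch are hypothetical. A motif match is only suggestive; it does not establish catalytic activity.

```python
# Query rows of the cd00190 alignment (residues 326-557), copied from above.
QUERY_ROWS = (
    "IVGGALASDSKWPWQVSLHFGT-THICGGTLIDAQWVLTAAHCFfvtREKVLEGWKVYAGTSNLHQLPE---AASIAEII",
    "INSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRETDdKTSPFLREVQVNLIDFKKCN",
    "DYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWIYSK",
)
QUERY_START = 326  # residue number of the first query residue in the alignment

# Assumed motifs: (motif, offset of the catalytic residue within the motif).
TRIAD_MOTIFS = {
    "His": ("TAAHC", 3),
    "Asp": ("DIAL", 0),
    "Ser": ("GDSGGP", 2),
}


def ungap(rows):
    """Join alignment rows, drop gap characters, and normalise case."""
    return "".join(rows).replace("-", "").upper()


def find_triad(sequence: str, start: int) -> dict:
    """Map each triad residue to its protein position, or None if the motif is absent."""
    positions = {}
    for residue, (motif, offset) in TRIAD_MOTIFS.items():
        idx = sequence.find(motif)
        positions[residue] = None if idx < 0 else start + idx + offset
    return positions


sequence = ungap(QUERY_ROWS)
print(find_triad(sequence, QUERY_START))
```

A value of None for any residue would flag the sequence for closer inspection rather than prove that the enzyme is inactive.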
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
317-558 2.03e-70

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 227.22  E-value: 2.03e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 317 CGLRAMTGRIVGGALASDSKWPWQVSLHF---GTTHICGGTLIDAQWVLTAAHCFFvtrEKVLEGWKVYAGTSNLH-QLP 392
Cdd:COG5640   22 APAADAAPAIVGGTPATVGEYPWMVALQSsngPSGQFCGGTLIAPRWVLTAAHCVD---GDGPSDLRVVIGSTDLStSGG 98
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 393 EAASIAEIIINSNYTDEEDDYDIALMRLSKPLTLSAhihPACLPMHGQTFSLNETCWITGFGKTRETDDKTSPFLREVQV 472
Cdd:COG5640   99 TVVKVARIVVHPDYDPATPGNDIALLKLATPVPGVA---PAPLATSADAAAPGTPATVAGWGRTSEGPGSQSGTLRKADV 175
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 473 NLIDFKKCNdylVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRWYLAGVTSWGTGCGQRNKPGVYTKVTEVLP 552
Cdd:COG5640  176 PVVSDATCA---AYGGFDGGTMLCAGYPEGGKDACQGDSGGPLVVKDGGGWVLVGVVSWGGGPCAAGYPGVYTRVSAYRD 252

                 ....*.
gi 116256363 553 WIYSKM 558
Cdd:COG5640  253 WIKSTA 258
Trypsin pfam00089
Trypsin;
326-554 2.17e-70

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 225.40  E-value: 2.17e-70
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  326 IVGGALASDSKWPWQVSLHFGT-THICGGTLIDAQWVLTAAHCFfvtreKVLEGWKVYAGTSNLHQLPEA---ASIAEII 401
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSgKHFCGGSLISENWVLTAAHCV-----SGASDVKVVLGAHNIVLREGGeqkFDVEKII 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  402 INSNYTDEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTREtdDKTSPFLREVQVNLIDFKKCN 481
Cdd:pfam00089  76 VHPNYNPDTLDNDIALLKLESPVTLGDTVRPICLPDASSDLPVGTTCTVSGWGNTKT--LGPSDTLQEVTVPVVSRETCR 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363  482 DYlvYDSYLTPRMMCAGDlrGGRDSCQGDSGGPLVCEQNnrwYLAGVTSWGTGCGQRNKPGVYTKVTEVLPWI 554
Cdd:pfam00089 154 SA--YGGTVTDTMICAGA--GGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2-155 3.40e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.96  E-value: 3.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASP 81
Cdd:pfam17823  89 EHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAP 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   82 AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSA--------PATRATRESPGTS 153
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTAlaavgnssPAAGTVTAAVGTV 248

                  ..
gi 116256363  154 LP 155
Cdd:pfam17823 249 TP 250
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR ...
245-316 7.79e-06

Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.


Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 44.64  E-value: 7.79e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363   245 QWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHrDFANSFSILRYNSTI--QESlHRSECPSQRYISLQCSH 316
Cdd:smart00202  21 QWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAY-FGPGSGPIWLDNVRCsgTEA-SLSDCPHSGWGSHNCSH 92
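Several hits in this list cover overlapping stretches of the query (for example, SRCR_2 at 231-321 and SR at 245-316). The sketch below shows one simple way to reduce such a list to non-overlapping hits: a greedy pass that keeps the lowest E-value hit in each region. This is an illustrative heuristic with hypothetical names, not the procedure NCBI uses to build its concise view. The intervals and E-values are taken from the hits listed above.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    start: int
    end: int
    e_value: float


def overlap_length(a: Hit, b: Hit) -> int:
    """Number of query residues shared by two hits (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def best_non_overlapping(hits):
    """Greedily keep the lowest E-value hits that do not overlap a kept hit."""
    kept = []
    for hit in sorted(hits, key=lambda h: h.e_value):
        if all(overlap_length(hit, other) == 0 for other in kept):
            kept.append(hit)
    return kept


HITS = [
    Hit("smart00020", 325, 554, 3.35e-99),
    Hit("cd00190", 326, 557, 5.54e-97),
    Hit("pfam15494", 231, 321, 4.96e-29),
    Hit("PHA03378", 9, 148, 3.14e-13),
    Hit("smart00202", 245, 316, 7.79e-06),
]

for hit in best_non_overlapping(HITS):
    print(hit.accession, f"{hit.start}-{hit.end}", hit.e_value)
```

A stricter or looser rule (for instance, tolerating a few residues of overlap) would only require changing the comparison in `best_non_overlapping`.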
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
9-70 1.00e-04

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 44.89  E-value: 1.00e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGraspAQASPAQASPAGTPPGRA--SPAQASPAGTPPGRAS 70
Cdd:TIGR00601  85 APPAATPTSAPTPTPSPPASPASG----MSAAPASAVEEKSPSEESatATAPESPSTSVPSSGS 144
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
6-91 7.21e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 42.30  E-value: 7.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQAS-PAGTPP--GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA 82
Cdd:NF041121  14 QMGRAAAPPSPEGPAPTAASqPATPPPpaAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAG--AAPGAALPV 91

                 ....*....
gi 116256363  83 QASPARASP 91
Cdd:NF041121  92 RVPAPPALP 100
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
6-91 7.61e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 42.16  E-value: 7.61e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQA----SPAQAS--PAGTPPGRASPAQA-SPAGTPPGRASPGraSPAQ 78
Cdd:cd23959  159 HPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSaspfATATDTapSSGAPDGFPAEASApSPFAAPASAASFP--AAPV 236
                         90
                 ....*....|...
gi 116256363  79 ASPAQASPARASP 91
Cdd:cd23959  237 ANGEAATPTHACT 249
FimV COG3170
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
7-155 9.82e-04

Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];


Pssm-ID: 442403 [Multi-domain]  Cd Length: 508  Bit Score: 42.09  E-value: 9.82e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GNASPARTPSAGASPAQASPAGTPPGRASPAqASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA--QA 84
Cdd:COG3170  108 AYAAAAAAPAAAPAPAPAAPAAAAAAADQPA-AEAAPAASGEYYPVRPGDTLWSIAARPVRPSSG--VSLDQMMVAlyRA 184
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116256363  85 SPAraspalaslsRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:COG3170  185 NPD----------AFIDGNINRLKAGAVLRVPAAEEVAALSPAEARQEVQAQSADWAAYRARLAAAVEPAP 245
 
eMpr COG3591
V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, ...
346-534 1.09e-13

V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 442810 [Multi-domain]  Cd Length: 194  Bit Score: 69.71  E-value: 1.09e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 346 GTTHICGGTLIDAQWVLTAAHCFF-VTREKVLEGWKVYAGTSNLHqlPEAASIAEIIINSNYTDEED-DYDIALMRLSKP 423
Cdd:COG3591    9 GGGGVCTGTLIGPNLVLTAGHCVYdGAGGGWATNIVFVPGYNGGP--YGTATATRFRVPPGWVASGDaGYDYALLRLDEP 86
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363 424 LTLSAhihpACLPMHGQTFSL-NETCWITGFGKtretDDKTSPFLRevqvnlidfkkCNDYLVYDSyltprmmcAGDLRG 502
Cdd:COG3591   87 LGDTT----GWLGLAFNDAPLaGEPVTIIGYPG----DRPKDLSLD-----------CSGRVTGVQ--------GNRLSY 139
                        170       180       190
                 ....*....|....*....|....*....|..
gi 116256363 503 GRDSCQGDSGGPLVCEQNNRWYLAGVTSWGTG 534
Cdd:COG3591  140 DCDTTGGSSGSPVLDDSDGGGRVVGVHSAGGA 171
PHA03378 PHA03378
EBNA-3B; Provisional
8-91 8.20e-11

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 65.09  E-value: 8.20e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   8 NASPARTPSAgaSPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PHA03378 680 GANTMLPIQW--APGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP 757

                 ....
gi 116256363  88 RASP 91
Cdd:PHA03378 758 AAAP 761
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
15-142 1.35e-10

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 63.97  E-value: 1.35e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  15 PSAGASPAQASPAGTP--PGRASPAqASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPA 92
Cdd:PRK14951 366 PAAAAEAAAPAEKKTParPEAAAPA-AAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 116256363  93 LASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPA 142
Cdd:PRK14951 445 VALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAP---AAARLTPT 491
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
7-91 1.37e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 64.24  E-value: 1.37e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASP 86
Cdd:PRK07764 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471

                 ....*
gi 116256363  87 ARASP 91
Cdd:PRK07764 472 AAPEP 476
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-155 1.51e-10

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 64.10  E-value: 1.51e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAS---------PAGTPPGRASPGRASPAQA 79
Cdd:PRK07003 385 ARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAdrgddaadgDAPVPAKANARASADSRCD 464
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 116256363  80 SPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVR-ATPVGAVPIRSSPARSA-PATRATRESPGTSLP 155
Cdd:PRK07003 465 ERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPdARAPAAASREDAPAAAApPAPEARPPTPAAAAP 542
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
3-170 5.67e-10

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 62.20  E-value: 5.67e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PRK12323 420 AAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPW 499
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  83 QASP---ARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLPKFT- 158
Cdd:PRK12323 500 EELPpefASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDg 579
                        170
                 ....*....|...
gi 116256363 159 -WREGQKQLPLIG 170
Cdd:PRK12323 580 dWPALAARLPVRG 592
PHA03378 PHA03378
EBNA-3B; Provisional
4-150 7.75e-10

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 62.01  E-value: 7.75e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQ 83
Cdd:PHA03378 644 NVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGR 723
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116256363  84 ASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PHA03378 724 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRP 790
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
15-155 1.86e-09

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 60.38  E-value: 1.86e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  15 PSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPALA 94
Cdd:PRK07764 365 PSASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116256363  95 SLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PRK07764 445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
15-155 2.00e-09

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 60.63  E-value: 2.00e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  15 PSAGASPAQASPAGTPPGRASP---AQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:PRK07003 368 PGGGVPARVAGAVPAPGARAAAavgASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363  92 alASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPATRATRESPGTSLP 155
Cdd:PRK07003 448 --PVPAKANARASADSRCDERDAQPPADSGSASAPASDAP---PDAAFEPAPRAAAPSAATPAA 506
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
9-91 2.06e-09

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 60.11  E-value: 2.06e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQAS----PAQASPAGTPPGRASPAQASP----AGTPPGRASPGRASPAQAS 80
Cdd:PRK14951 372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPaaapAAAASAPAAPPAAAPPAPVAApaaaAPAAAPAAAPAAVALAPAP 451
                         90
                 ....*....|.
gi 116256363  81 PAQASPARASP 91
Cdd:PRK14951 452 PAQAAPETVAI 462
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
21-151 2.88e-09

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 59.73  E-value: 2.88e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  21 PAQASPAGTPPGRASPAQasPAQASPAGTPPGRASPAQAsPAGTPPGRASPGRASPAQASPAQASPARASPALASLSRSS 100
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPAR--PEAAAPAAAPVAQAAAAPA-PAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 116256363 101 SGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPA-TRATRESPG 151
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAaARLTPTEEG 494
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
5-150 5.52e-09

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 59.23  E-value: 5.52e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   5 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK07764 637 AEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116256363  85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYlvrATPVGAVPIRSSPARSAPATRATRESP 150
Cdd:PRK07764 717 PAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPA---GAPAQPPPPPAPAPAAAPAAAPPPSPP 779
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-172 6.55e-09

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 59.18  E-value: 6.55e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAgTPPGRASPAQASPAGTPPGRASPGRaSPAQASPAQASPAR 88
Cdd:PHA03247 2696 TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPAGPATPGGPARPAR-PPTTAGPPAPAPPA 2773
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   89 ---ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPAR-SAPATRATRESPGTSLPKFtwregQK 164
Cdd:PHA03247 2774 apaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGpLPPPTSAQPTAPPPPPGPP-----PP 2848

                  ....*...
gi 116256363  165 QLPLIGCV 172
Cdd:PHA03247 2849 SLPLGGSV 2856
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1-91 1.71e-08

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.30  E-value: 1.71e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   1 MERdSHGNASPARTPSAGASP-AQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQA 79
Cdd:PRK07764 381 LER-RLGVAGGAGAPAAAAPSaAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPA 459
                         90
                 ....*....|..
gi 116256363  80 SPAQASPARASP 91
Cdd:PRK07764 460 AAPSAQPAPAPA 471
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
5-170 2.43e-08

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 56.81  E-value: 2.43e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   5 SHGNASPArtpsagaSPAQASPAGTPPGRASPAQASPAQASPAGTPPgrasPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK12323 368 SGGGAGPA-------TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPA----AAPAAAAAARAVAAAPARRSPAPEALAAA 436
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  85 SPARAspalaSLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPirsSPARSAPAtRATRESPGTSLPkftWREGQK 164
Cdd:PRK12323 437 RQASA-----RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAA---APARAAPA-AAPAPADDDPPP---WEELPP 504

                 ....*.
gi 116256363 165 QLPLIG 170
Cdd:PRK12323 505 EFASPA 510
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
9-91 2.93e-08

Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 56.41  E-value: 2.93e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAS-PAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK07994 370 VPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQlLAARQQLQRAQGATKAKKSEPAAASRA 449

                 ....
gi 116256363  88 RASP 91
Cdd:PRK07994 450 RPVN 453
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
31-163 3.32e-08

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 56.26  E-value: 3.32e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  31 PGRASPAQASPAQASPAgtPPGRASPAQASPAGTPPGRAsPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSA 110
Cdd:PRK14951 366 PAAAAEAAAPAEKKTPA--RPEAAAPAAAPVAQAAAAPA-PAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|...
gi 116256363 111 SVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLPKFTWREGQ 163
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGD 495
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
4-91 3.65e-08

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 56.26  E-value: 3.65e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASP----AQASPAQASPAGTPPGRASPAQASPAG-TPPGRASPGRASPAQ 78
Cdd:PRK14951 396 QAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAApaaaAPAAAPAAAPAAVALAPAPPAQAAPETvAIPVRVAPEPAVASA 475
                         90
                 ....*....|...
gi 116256363  79 ASPAQASPARASP 91
Cdd:PRK14951 476 APAPAAAPAAARL 488
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-150 5.09e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 5.09e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    5 SHGNASPArTPSAGASPAQASPAGTPPGRASPAQ----ASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03247 2727 AARQASPA-LPAAPAPPAVPAGPATPGGPARPARppttAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPA 2805
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116256363   81 PAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGA--VP----IRSSPARSAPATRATRESP 150
Cdd:PHA03247 2806 DPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvAPggdvRRRPPSRSPAAKPAAPARP 2881
PHA03378 PHA03378
EBNA-3B; Provisional
3-87 8.61e-08

Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 55.46  E-value: 8.61e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPARTPSagASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASP---AGTPPGRASPG-----RA 74
Cdd:PHA03378 715 QRPAAATGRARPPA--AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPgapTPQPPPQAPPApqqrpRG 792
                         90
                 ....*....|...
gi 116256363  75 SPAQASPAQASPA 87
Cdd:PHA03378 793 APTPQPPPQAGPT 805
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-152 1.58e-07

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    4 DSHGNASPartPSAGASPAQASPAGTPPGRASPAQASPAQASPA---GTPPGRASP-AQASPAGTPPGRASPGRASPAQA 79
Cdd:PHA03247 2546 DDAGDPPP---PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRArrpDAPPQSARPrAPVDDRGDPRGPAPPSPLPPDTH 2622
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363   80 SPAQASPARaSPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGT 152
Cdd:PHA03247 2623 APDPPPPSP-SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
5-152 2.66e-07

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 2.66e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    5 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PHA03307  101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 116256363   85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAvPIRSSPARSAPATRATRESPGT 152
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPG-RSAADDAGASSSDSSSSESSGC 247
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-156 2.78e-07

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.64  E-value: 2.78e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    3 RDSHGNASPARTPSAGASPAQ-ASPAGTPPGRASPAQASPAQA----SPAGTPPGRASPAQASPAGTPPGRASPGRASPA 77
Cdd:PHA03307  141 VGSPGPPPAASPPAAGASPAAvASDAASSRQAALPLSSPEETArapsSPPAEPPPSTPPAAASPRPPRRSSPISASASSP 220
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   78 QASPA--QASPARASPALASLSRSSSGRSSSARSASVtTSPTRVYLVRATPVGAVPI----RSSPARSAPATRATRESPG 151
Cdd:PHA03307  221 APAPGrsAADDAGASSSDSSSSESSGCGWGPENECPL-PRPAPITLPTRIWEASGWNgpssRPGPASSSSSPRERSPSPS 299

                  ....*
gi 116256363  152 TSLPK 156
Cdd:PHA03307  300 PSSPG 304
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
7-91 4.35e-07

Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 52.86  E-value: 4.35e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GNASPARTPSAGASPAQASPA-GTPPGRASPAQASPAQASPAGTPPgrASPAQASPAGTPPGRASPgRASPAQASPAQAS 85
Cdd:PRK14971 367 DDASGGRGPKQHIKPVFTQPAaAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVD-PPAAVPVNPPSTA 443

                 ....*.
gi 116256363  86 PARASP 91
Cdd:PRK14971 444 PQAVRP 449
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
14-87 4.92e-07

Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 52.97  E-value: 4.92e-07
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363   14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAA 112
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-91 6.78e-07

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.30  E-value: 6.78e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASP-------------AQASPAGTPPGRASPGRAS 75
Cdd:PRK07764 613 ARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPdasdggdgwpakaGGAAPAAPPPAPAPAAPAA 692
                         90
                 ....*....|....*.
gi 116256363  76 PAQASPAQASPARASP 91
Cdd:PRK07764 693 PAGAAPAQPAPAPAAT 708
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3-82 7.52e-07

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 52.30  E-value: 7.52e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLP 742
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
14-153 9.99e-07

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 9.99e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGtPPGRASPAQASPAGTPPGrasPGRASPAQASPAQASPARASPAL 93
Cdd:PRK07764 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAA-PAAPAAPAAPAPAGAAAA---PAEASAAPAPGVAAPEHHPKHVA 658
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  94 ASLSRSSSGRSSSARSASVTTSPTrvylVRATPVGAVPIRSSPARSAPATRATRESPGTS 153
Cdd:PRK07764 659 VPDASDGGDGWPAKAGGAAPAAPP----PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
PHA03247 PHA03247
large tegument protein UL36; Provisional
7-155 1.96e-06

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 1.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    7 GNASPARTPSAGASPAQASpagTPPGRASPAQASPAQAS---PAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPaq 83
Cdd:PHA03247 2659 GRVSRPRRARRLGRAAQAS---SPPQRPRRRAARPTVGSltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP-- 2733
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 116256363   84 ASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRAtPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03247 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA-PAAGPPRRLTRPAVASLSESRESLPSPWDP 2804
PHA03247 PHA03247
large tegument protein UL36; Provisional
11-168 2.29e-06

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 2.29e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   11 PARTPSAGASPAQASPAGTPPGRASP-AQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARA 89
Cdd:PHA03247 2573 PAPRPSEPAVTSRARRPDAPPQSARPrAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERP 2652
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   90 SPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSS----PARSAPATRATRESPGTSLPKFTWREGQKQ 165
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLadppPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732

                  ...
gi 116256363  166 LPL 168
Cdd:PHA03247 2733 PAL 2735
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2-155 3.40e-06

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.96  E-value: 3.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASP 81
Cdd:pfam17823  89 EHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAP 168
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   82 AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSA--------PATRATRESPGTS 153
Cdd:pfam17823 169 HAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTAlaavgnssPAAGTVTAAVGTV 248

                  ..
gi 116256363  154 LP 155
Cdd:pfam17823 249 TP 250
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-88 4.70e-06

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.46  E-value: 4.70e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAqasPAQASPAR 88
Cdd:PRK07003 468 AQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPT---PAAAAPAA 544
SR smart00202
Scavenger receptor Cys-rich; The sea urchin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
245-316 7.79e-06

Pssm-ID: 214555 [Multi-domain]  Cd Length: 101  Bit Score: 44.64  E-value: 7.79e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363   245 QWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHrDFANSFSILRYNSTI--QESlHRSECPSQRYISLQCSH 316
Cdd:smart00202  21 QWGTVCDDGWDLRDANVVCRQLGFGGAVSASGSAY-FGPGSGPIWLDNVRCsgTEA-SLSDCPHSGWGSHNCSH 92
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
1-90 1.26e-05

Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 48.14  E-value: 1.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    1 MERDSHGNASPARTPSAGASPAQASPAG-TPPGRASPAQASPAQASPAGTPPGRASPA-----QASPA-GTPPGRASPGR 73
Cdd:pfam03546 382 AQEDSESSEEESDSEEAAATPAQVKASGkTPQAKANPAPTKASSAKGAASAPGKVVAAaaqakQGSPAkVKPPARTPQNS 461
                          90       100       110
                  ....*....|....*....|....*....|.
gi 116256363   74 ASPAQ--------------ASPAQASPARAS 90
Cdd:pfam03546 462 AISVRgqasvpavgkavatAAQAQKGPVGGP 492
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
1-91 1.63e-05

Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.75  E-value: 1.63e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   1 MERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PRK14959 386 AEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGA 465
                         90
                 ....*....|.
gi 116256363  81 PAQASPARASP 91
Cdd:PRK14959 466 PDSVASASDAP 476
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-167 1.74e-05

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 1.74e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   10 SPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRA-SPGRASPAQASPAQASPAR 88
Cdd:PHA03247 2608 PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRArRLGRAAQASSPPQRPRRRA 2687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   89 ASPALASLSRSSSGRSSSArsasvTTSPTRVYLVRATPVGAVPIRSSPARSA-PATRATRESP-GTSLPKFTWREGQKQL 166
Cdd:PHA03247 2688 ARPTVGSLTSLADPPPPPP-----TPEPAPHALVSATPLPPGPAAARQASPAlPAAPAPPAVPaGPATPGGPARPARPPT 2762

                  .
gi 116256363  167 P 167
Cdd:PHA03247 2763 T 2763
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-91 1.81e-05

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.86  E-value: 1.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    3 RDSHGNASPARTPSaGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PHA03307  339 AAVSPGPSPSRSPS-PSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPL 417

                  ....*....
gi 116256363   83 QASPARASP 91
Cdd:PHA03307  418 DAGAASGAF 426
PHA02682 PHA02682
ORF080 virion core protein; Provisional
8-91 2.09e-05

Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 46.39  E-value: 2.09e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   8 NASPARTPSaGASPAQASPAGTPPGRASPAQASPAQAsPAGTPPgraSPAQASPAGT----PPGRASPGRASPAQASPAQ 83
Cdd:PHA02682  69 NSACMQRPS-GQSPLAPSPACAAPAPACPACAPAAPA-PAVTCP---APAPACPPATaptcPPPAVCPAPARPAPACPPS 143

                 ....*...
gi 116256363  84 ASPARASP 91
Cdd:PHA02682 144 TRQCPPAP 151
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
6-147 2.09e-05

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 2.09e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQASPAGTPPgrASPAQASPAQASPAGTPPGRASPAQAS-PAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK07764 591 APGAAGGEGPPAPASSGPPEEAARPA--APAAPAAPAAPAPAGAAAAPAEASAAPaPGVAAPEHHPKHVAVPDASDGGDG 668
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363  85 SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATR 147
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-91 2.38e-05

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 2.38e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGT------PPGRASPaQASPAGTPPGRAspgrASPAQASPA 82
Cdd:PRK07764 429 PQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPApaaapePTAAPAP-APPAAPAPAAAP----AAPAAPAAP 503

                 ....*....
gi 116256363  83 QASPARASP 91
Cdd:PRK07764 504 AGADDAATL 512
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
2-90 2.47e-05

Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 46.99  E-value: 2.47e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASPArtpsagASPAQASPA-GTPPGRASPAQASPAQASPAGTPPGRA---SPAQASPAGTPPGRASPGRASPA 77
Cdd:pfam03546 238 SSDSEEEAPAA------ATPAQAKPAlKTPQTKASPRKGTPITPTSAKVPPVRVgtpAPWKAGTVTSPACASSPAVARGA 311
                          90
                  ....*....|...
gi 116256363   78 QASPAQASPARAS 90
Cdd:pfam03546 312 QRPEEDSSSSEES 324
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-91 2.82e-05

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 2.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQAS---------PAGTPPGRASPAQASPAGTPPGRASPGRASPAQ- 78
Cdd:PHA03247  378 ASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASvptpaptpvPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAp 457
                          90
                  ....*....|....
gi 116256363   79 -ASPAQASPARASP 91
Cdd:PHA03247  458 aTEPAPDDPDDATR 471
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
9-146 4.45e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 46.38  E-value: 4.45e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGT----PPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGrASPAQASPAQA 84
Cdd:PRK07003 420 ATRAEAPPAAPAPPATADRGDdaadGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPD-AAFEPAPRAAA 498
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 116256363  85 SPARASPALASLSRSSSGRSSSARSASVTTSPtrvylvRATPvgavpirSSPARSAPATRAT 146
Cdd:PRK07003 499 PSAATPAAVPDARAPAAASREDAPAAAAPPAP------EARP-------PTPAAAAPAARAG 547
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-91 6.49e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 6.49e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PRK07764 719 AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDR 798

                 ...
gi 116256363  89 ASP 91
Cdd:PRK07764 799 RDA 801
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
14-155 6.97e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 6.97e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   14 TPSAGA---SPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARAS 90
Cdd:PHA03307   72 PPGPGTeapANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAAS 151
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 116256363   91 PALASLSRSSSGRSSSARSASVTTSPTRVYLVRA--TPVGAVPIRSSPARSAPATRATRE--SPGTSLP 155
Cdd:PHA03307  152 PPAAGASPAAVASDAASSRQAALPLSSPEETARApsSPPAEPPPSTPPAAASPRPPRRSSpiSASASSP 220
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-155 7.09e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 7.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASP---AGTPPGRASPAQASPAGTPPGRASPGRASPAQASP---- 81
Cdd:PHA03247 2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADppaAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGpppp 2848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   82 --------------AQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVylvraTPVGAVPIRSSPARSAPATRATR 147
Cdd:PHA03247 2849 slplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL-----PPDQPERPPQPQAPPPPQPQPQP 2923

                  ....*...
gi 116256363  148 ESPGTSLP 155
Cdd:PHA03247 2924 PPPPQPQP 2931
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
12-91 7.72e-05

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 45.45  E-value: 7.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   12 ARTPSAGASPAQASPAgtPPGRASPAQA------------------------SPAQASPAG-TPPGRASPAQASPAGTPP 66
Cdd:pfam03546 351 ASAPTKGPSGQGTAPV--PPGKTGPAVAqvkaeaqedsesseeesdseeaaaTPAQVKASGkTPQAKANPAPTKASSAKG 428
                          90       100
                  ....*....|....*....|....*
gi 116256363   67 GRASPGRASPAQASPAQASPARASP 91
Cdd:pfam03546 429 AASAPGKVVAAAAQAKQGSPAKVKP 453
PHA03247 PHA03247
large tegument protein UL36; Provisional
3-86 8.69e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 8.69e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    3 RDSHGNASPARTPSAGASPaqasPAGTPPGRASPAQASPAQASPAGTPPGRASPAqASPAGTPPGRASPgrasPAQASPA 82
Cdd:PHA03247  397 RGPGGDDQTRPAAPVPASV----PTPAPTPVPASAPPPPATPLPSAEPGSDDGPA-PPPERQPPAPATE----PAPDDPD 467

                  ....
gi 116256363   83 QASP 86
Cdd:PHA03247  468 DATR 471
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-153 9.31e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 9.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   12 ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPaQASPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:PHA03247 2863 RRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPER-PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116256363   92 ALASLSRSSSGRSSSARSASV---TTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTS 153
Cdd:PHA03247 2942 PLAPTTDPAGAGEPSGAVPQPwlgALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
3-144 9.31e-05

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.94  E-value: 9.31e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPARTPSAGASPAQA-SPAGTPPGRASPAQASPAQA-SPAGTPPGRASPAQASpAGTPPGRASPGRASpAQAS 80
Cdd:PTZ00436 214 KKSAKAAAPAKAAAAPAKAAAPpAKAAAAPAKAAAAPAKAAAPpAKAAAPPAKAAAPPAK-AAAPPAKAAAPPAK-AAAP 291
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363  81 PAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPATR 144
Cdd:PTZ00436 292 PAKAAAAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKA----ATPPAKAAAPPAKAAAAPVGK 351
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
9-87 9.74e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 45.19  E-value: 9.74e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGaSPAQASPAGTPPGRASPAQasPAQASPAGTPPGRASPAQASPAGTPP-GRASPGRASPAQASPAQASPA 87
Cdd:PRK14950 372 TAAAPSPVRP-TPAPSTRPKAAAAANIPPK--EPVRETATPPPVPPRPVAPPVPHTPEsAPKLTRAAIPVDEKPKYTPPA 448
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
9-155 9.99e-05

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.94  E-value: 9.99e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASpAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASpAGTPPGRASpgrASPAQAS--PAQASP 86
Cdd:PTZ00436 209 AAPSGKKSAKAA-APAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAK-AAAPPAKAA---APPAKAAapPAKAAA 283
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 116256363  87 ARASPALASLSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PTZ00436 284 PPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKA----AAPPAKAAAPPAKAATPPAKAAAPPAKAAAAP 348
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
9-70 1.00e-04

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 44.89  E-value: 1.00e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGraspAQASPAQASPAGTPPGRA--SPAQASPAGTPPGRAS 70
Cdd:TIGR00601  85 APPAATPTSAPTPTPSPPASPASG----MSAAPASAVEEKSPSEESatATAPESPSTSVPSSGS 144
PHA03378 PHA03378
EBNA-3B; Provisional
3-83 1.12e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.06  E-value: 1.12e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPARTPSAG---ASPAQASPAGTPPGRASPAQASPAQaSPAGTPPGRASPaQASPAGTPPGRASPG------R 73
Cdd:PHA03378 735 RPPAAAPGRARPPAAApgrARPPAAAPGRARPPAAAPGAPTPQP-PPQAPPAPQQRP-RGAPTPQPPPQAGPTsmqlmpR 812
                         90
                 ....*....|
gi 116256363  74 ASPAQASPAQ 83
Cdd:PHA03378 813 AAPGQQGPTK 822
motB PRK05996
MotB family protein;
1-90 1.24e-04

MotB family protein;


Pssm-ID: 235665 [Multi-domain]  Cd Length: 423  Bit Score: 44.69  E-value: 1.24e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   1 MERDSHGNASP---ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPA 77
Cdd:PRK05996 190 VEVTTAGDLLPpgqAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIADAKKAATGEPAANAAKAAKPEPMPDD 269
                         90
                 ....*....|...
gi 116256363  78 QASPAQASPARAS 90
Cdd:PRK05996 270 QQKEAEQLQAAIA 282
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
12-155 1.41e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.17  E-value: 1.41e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  12 ARTPSAGASPAQASPAGTPPGRASPAQASPAQASPA----GTPPGRASPAQASpAGTPPGRASpgrASPAQAS--PAQAS 85
Cdd:PTZ00436 193 AAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAApakaAAPPAKAAAAPAK-AAAAPAKAA---APPAKAAapPAKAA 268
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363  86 PARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSP---ARSAPATRATRESPGTSLP 155
Cdd:PTZ00436 269 APPAKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAAPpakAAAPPAKAATPPAKAAAPP 341
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
14-86 1.58e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.42  E-value: 1.58e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGraSPAQASPAgTPPGRASPGRASPAQASPAQASP 86
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP--KEPVRETA-TPPPVPPRPVAPPVPHTPESAPK 430
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-156 1.73e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 1.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    4 DSHGNASPARTPSAGAS-PAQASPAGTPPGRASPAQASPAQASPA--GTPPGRASPAQASPAGTPPGRASPGRAS----- 75
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANePDPHPPPTVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSltsla 2699
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   76 -PAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSL 154
Cdd:PHA03247 2700 dPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779

                  ..
gi 116256363  155 PK 156
Cdd:PHA03247 2780 PR 2781
PHA03377 PHA03377
EBNA-3C; Provisional
19-155 1.92e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 44.66  E-value: 1.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   19 ASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARASPALASLSR 98
Cdd:PHA03377  550 ATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSV 629
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 116256363   99 SSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPgTSLP 155
Cdd:PHA03377  630 VRMFLRERLLEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQSTPPRP-SWLP 685
PHA03381 PHA03381
tegument protein VP22; Provisional
10-91 1.95e-04

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 43.46  E-value: 1.95e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  10 SPARTPSAGASPAQASPAG---------TPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASP-AQA 79
Cdd:PHA03381  39 EPADRARRGAGQARGRSQAerrfhhydeARADYPYYTGSSSEDERPADPRPSRRPHAQPEASGPGPARGARGPAGSrGRG 118
                         90
                 ....*....|..
gi 116256363  80 SPAQASPARASP 91
Cdd:PHA03381 119 RRAESPSPRDPP 130
PHA03377 PHA03377
EBNA-3C; Provisional
3-86 2.04e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 44.27  E-value: 2.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    3 RDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGrASPAQASPAGTPPGRASPGRASPAQASPA 82
Cdd:PHA03377  545 RRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPS-TGPRQQAKCKDGPPASGPHEKQPPSSAPR 623

                  ....
gi 116256363   83 QASP 86
Cdd:PHA03377  624 DMAP 627
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
4-155 2.18e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.07  E-value: 2.18e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQAspAGTPPGRAS------------- 70
Cdd:PRK07003 483 DAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPA--AAAPAARAGgaaaaldvlrnag 560
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  71 ------PGRASPAQASPAQASPARASPalaslsrsssgrsssarsasvttSPTRVYLVRATPVGavPIRSSPARSAPATR 144
Cdd:PRK07003 561 mrvssdRGARAAAAAKPAAAPAAAPKP-----------------------AAPRVAVQVPTPRA--RAATGDAPPNGAAR 615
                        170
                 ....*....|...
gi 116256363 145 ATR--ESPGTSLP 155
Cdd:PRK07003 616 AEQaaESRGAPPP 628
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
8-91 2.27e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 2.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    8 NASPARTPSAGASP-AQASPAGTPPGRASPAQASPAQASPAGTPPGRASP-AQASPAGTPPGRA-SPGRASPAQASPAQA 84
Cdd:PHA03307  287 SSSSPRERSPSPSPsSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSrGAAVSPGPSPSRSpSPSRPPPPADPSSPR 366

                  ....*..
gi 116256363   85 SPARASP 91
Cdd:PHA03307  367 KRPRPSR 373
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
8-77 2.29e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.11  E-value: 2.29e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    8 NASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPA 77
Cdd:PRK12270   57 APAAAPAAKAPAAPAPAPPAAAAP-AAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGAAA 125
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
3-90 2.43e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.90  E-value: 2.43e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   3 RDSHGNASPartPSAGASPAQASP-AGT--PPGRASPAQASPAQA-SPAGTPPGRASPAQA-SPA-------GTPPGRAS 70
Cdd:PRK14959 372 RPSGGGASA---PSGSAAEGPASGgAATipTPGTQGPQGTAPAAGmTPSSAAPATPAPSAApSPRvpwddapPAPPRSGI 448
                         90       100
                 ....*....|....*....|..
gi 116256363  71 PGRASPA--QASPAQASPARAS 90
Cdd:PRK14959 449 PPRPAPRmpEASPVPGAPDSVA 470
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
10-91 2.51e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 43.96  E-value: 2.51e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  10 SPARTPSAGASPAQASPAGTPPGRASPAQAspaqasPAGTPPGRASPAQASPAGTPPGraspgrasPAQASPAQASPARA 89
Cdd:PRK14965 381 APAPPSAAWGAPTPAAPAAPPPAAAPPVPP------AAPARPAAARPAPAPAPPAAAA--------PPARSADPAAAASA 446

                 ..
gi 116256363  90 SP 91
Cdd:PRK14965 447 GD 448
PRK10856 PRK10856
cytoskeleton protein RodZ;
4-91 2.98e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.09  E-value: 2.98e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPpgraSPAQASPAGT--PPGRASPGRASPAQASP 81
Cdd:PRK10856 166 TSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAP----SQANVDTAATpaPAAPATPDGAAPLPTDQ 241
                         90
                 ....*....|
gi 116256363  82 AQASPARASP 91
Cdd:PRK10856 242 AGVSTPAADP 251
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2-91 2.99e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.99  E-value: 2.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRAS-PAQASPAGTPPGRASPGRASP---A 77
Cdd:pfam03154 153 DNESDSDSSAQQQILQTQPPVLQAQSGAAS-PPSPPPPGTTQAATAGPTPSAPSvPPQGSPATSQPPNQTQSTAAPhtlI 231
                          90
                  ....*....|....
gi 116256363   78 QASPAQASPARASP 91
Cdd:pfam03154 232 QQTPTLHPQRLPSP 245
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
1-91 3.11e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 43.58  E-value: 3.11e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   1 MERDSHGNASPARTPSAGASPAQASPAGTPP----GRASPAQASPAQASPAGTPPGrasPAQASPAGTPPGRASPG---- 72
Cdd:PRK14965 377 LERGAPAPPSAAWGAPTPAAPAAPPPAAAPPvppaAPARPAAARPAPAPAPPAAAA---PPARSADPAAAASAGDRwraf 453
                         90       100
                 ....*....|....*....|....*
gi 116256363  73 -----RASPAQASP-AQASPARASP 91
Cdd:PRK14965 454 vafvkGKKPALGASlEQGSPLGVSA 478
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
2-91 3.50e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 3.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASPARtPSAGASPAQASPAG-----TPPGRASPAQASPA---QASPAGTPPGRA-SPAQASPAGTPPGRASPG 72
Cdd:PHA03307  291 PRERSPSPSPSS-PGSGPAPSSPRASSsssssRESSSSSTSSSSESsrgAAVSPGPSPSRSpSPSRPPPPADPSSPRKRP 369
                          90
                  ....*....|....*....
gi 116256363   73 RASPAQASPAQASPARASP 91
Cdd:PHA03307  370 RPSRAPSSPAASAGRPTRR 388
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
2-155 3.72e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine suggesting it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 41.86  E-value: 3.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    2 ERDSHGNASParTPSAGASPAQASPAGTPpgraSPAQASPAQASpaGTPPGRASPAQASPAGTPPgraspgrASPAQASP 81
Cdd:pfam09595  61 EQEHHENPPL--NEAAKEAPSESEDAPDI----DPNNQHPSQDR--SEAPPLEPAAKTKPSEHEP-------ANPPDASN 125
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363   82 AQASPARAspalaslsrsssgrsssarsasvTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:pfam09595 126 RLSPPDAS-----------------------TAAIREARTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANP 176
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
6-86 3.77e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 3.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    6 HGNASPARTPSAGasPAQASPAGTPPGRAS-PAQASPAQASPAGTPPGRASP-----------AQASPAGTPPGRASPGR 73
Cdd:pfam03154 178 SGAASPPSPPPPG--TTQAATAGPTPSAPSvPPQGSPATSQPPNQTQSTAAPhtliqqtptlhPQRLPSPHPPLQPMTQP 255
                          90
                  ....*....|...
gi 116256363   74 ASPAQASPaQASP 86
Cdd:pfam03154 256 PPPSQVSP-QPLP 267
PHA03381 PHA03381
tegument protein VP22; Provisional
2-90 4.16e-04

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 42.69  E-value: 4.16e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   2 ERDSHGNASPARTPSAGASPAQASPAGTPPgrasPAQASPAQASPAGTPPGRASPAQASpAGTPPGRASPGRASPAQAS- 80
Cdd:PHA03381  80 EDERPADPRPSRRPHAQPEASGPGPARGAR----GPAGSRGRGRRAESPSPRDPPNPKG-ASAPRGRKSACADSAALLDa 154
                         90
                 ....*....|
gi 116256363  81 PAQASPARAS 90
Cdd:PHA03381 155 PAPAAPKRQK 164
flhF PRK06995
flagellar biosynthesis protein FlhF;
11-91 4.60e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 43.03  E-value: 4.60e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  11 PARTPSAGASPAQASPAGTPP------GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQA 84
Cdd:PRK06995  66 PAAAPAAVSRPAAPAAEPAPWlvehakRLTAQREQLVARAAAPAAPEAQAPAAPAERAAAENAARRLARAAAAAPRPRVP 145

                 ....*..
gi 116256363  85 SPARASP 91
Cdd:PRK06995 146 ADAAAAV 152
PRK12495 PRK12495
hypothetical protein; Provisional
4-90 4.92e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 41.78  E-value: 4.92e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGAS----PAQASPAGTPPGRASPAQASPAQASPA---GTPPGRASPAQASPAGTPPGRAS--PGRA 74
Cdd:PRK12495  76 DDAGDGAEATAPSDAGSqaspDDDAQPAAEAEAADQSAPPEASSTSATdeaATDPPATAAARDGPTPDPTAQPAtpDERR 155
                         90
                 ....*....|....*.
gi 116256363  75 SPAQASPAQASPARAS 90
Cdd:PRK12495 156 SPRQRPPVSGEPPTPS 171
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
14-87 5.20e-04

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 42.48  E-value: 5.20e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 116256363  14 TPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRAsPAQASPAQASPA 87
Cdd:PRK12373 231 LAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPA-PKEAAKAAAAAA 303
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3-145 5.85e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 5.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    3 RDSHGNASPARTPSAG--ASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03307  271 EASGWNGPSSRPGPASssSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116256363   81 PaqaSPARASPALASLSRSSSGRSSSARSASVTTSPT-RVYLVRATPVGAVPIRSSPARSaPATRA 145
Cdd:PHA03307  351 P---SPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRpTRRRARAAVAGRARRRDATGRF-PAGRP 412
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
21-90 6.16e-04

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.19  E-value: 6.16e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116256363   21 PAQASPAGTPPGRASPAQASPAQASPAGTPPGR-ASPAQASPAGTPPGRASPgraSPAQASPAQASPARAS 90
Cdd:TIGR00601  77 PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMsAAPASAVEEKSPSEESAT---ATAPESPSTSVPSSGS 144
DUF1986 pfam09342
Domain of unknown function (DUF1986); This domain is found in serine proteases and is ...
337-436 7.08e-04

Domain of unknown function (DUF1986); This domain is found in serine proteases and is predicted to contain disulphide bonds.


Pssm-ID: 286432  Cd Length: 116  Bit Score: 39.45  E-value: 7.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  337 WPWQVSLHFGTTHICGGTLIDAQWVLTAAHCFFVTREK------VLEGWKVYAGTSNLHQLPEAASIAEIIINSNytdee 410
Cdd:pfam09342   1 WPWIAKVYLDGNMICSGVLIDASWVIVSGSCLRDTNLRhqyisvVLGGAKTLKSIEGPYEQIVRVDCRHDIPESE----- 75
                          90       100
                  ....*....|....*....|....*.
gi 116256363  411 ddydIALMRLSKPLTLSAHIHPACLP 436
Cdd:pfam09342  76 ----ISLLHLASPASFSNHVLPTFVP 97
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
6-91 7.21e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 42.30  E-value: 7.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQAS-PAGTPP--GRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA 82
Cdd:NF041121  14 QMGRAAAPPSPEGPAPTAASqPATPPPpaAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAG--AAPGAALPV 91

                 ....*....
gi 116256363  83 QASPARASP 91
Cdd:NF041121  92 RVPAPPALP 100
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
6-91 7.61e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 42.16  E-value: 7.61e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQA----SPAQAS--PAGTPPGRASPAQA-SPAGTPPGRASPGraSPAQ 78
Cdd:cd23959  159 HPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSaspfATATDTapSSGAPDGFPAEASApSPFAAPASAASFP--AAPV 236
                         90
                 ....*....|...
gi 116256363  79 ASPAQASPARASP 91
Cdd:cd23959  237 ANGEAATPTHACT 249
flhF PRK06995
flagellar biosynthesis protein FlhF;
9-91 7.80e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 42.26  E-value: 7.80e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAQASP-AGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK06995  42 ALADSDLAALAPPAAAAPaAAQPPPAAAPAAVSRPAAPAAEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPAAPAE 121

                 ....
gi 116256363  88 RASP 91
Cdd:PRK06995 122 RAAA 125
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
20-90 8.67e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 41.78  E-value: 8.67e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 116256363  20 SPAQASPAGTPPGRASPAQASPAQ----ASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQA-SPAQASPARAS 90
Cdd:cd23959  155 MFGQHPPPAKPLPAAAAAQQSSASpgevASPFASGTVSASPFATATDTAPSSGAPDGFPAEASApSPFAAPASAAS 230
FimV COG3170
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
7-155 9.82e-04

Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];


Pssm-ID: 442403 [Multi-domain]  Cd Length: 508  Bit Score: 42.09  E-value: 9.82e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GNASPARTPSAGASPAQASPAGTPPGRASPAqASPAQASPAGTPPGRASPAQASPAGTPPGRASPgrASPAQASPA--QA 84
Cdd:COG3170  108 AYAAAAAAPAAAPAPAPAAPAAAAAAADQPA-AEAAPAASGEYYPVRPGDTLWSIAARPVRPSSG--VSLDQMMVAlyRA 184
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116256363  85 SPAraspalaslsRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:COG3170  185 NPD----------AFIDGNINRLKAGAVLRVPAAEEVAALSPAEARQEVQAQSADWAAYRARLAAAVEPAP 245
PHA03377 PHA03377
EBNA-3C; Provisional
9-91 1.12e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 41.96  E-value: 1.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPAR 88
Cdd:PHA03377  530 AKPHRKVQDGFQRSGRRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPA 609

                  ...
gi 116256363   89 ASP 91
Cdd:PHA03377  610 SGP 612
PRK06975 PRK06975
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
17-77 1.26e-03

bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed


Pssm-ID: 235899 [Multi-domain]  Cd Length: 656  Bit Score: 41.63  E-value: 1.26e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 116256363  17 AGASPAQASPAgtpPGRASPAQASPaQASPAGTPPGRASPAQASPAGTPPG-RASPGRASPA 77
Cdd:PRK06975 268 AAAQPATAAPA---PSRMTDTNDSK-SVTSQPAAAAAAPAPPPNPPATPPEpPARRGRGSAA 325
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
9-155 1.42e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSagaSPAQASPAGTPPGRASPAqaspaqaSPAGTPPGRASPAQASPAGTPPgRASPGRASPAQASPAQASPAR 88
Cdd:PHA03307  260 PAPITLPT---RIWEASGWNGPSSRPGPA-------SSSSSPRERSPSPSPSSPGSGP-APSSPRASSSSSSSRESSSSS 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116256363   89 ASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03307  329 TSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
23-143 1.45e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 41.59  E-value: 1.45e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  23 QASPAGtppGRASPAQASPAQASPAG-----TPPGRASPAQASPA-GTPPGRASPGRASPAQAsPAQASP-ARASPALAS 95
Cdd:PRK14959 370 SLRPSG---GGASAPSGSAAEGPASGgaatiPTPGTQGPQGTAPAaGMTPSSAAPATPAPSAA-PSPRVPwDDAPPAPPR 445
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 116256363  96 LSRSSSGRSSSARSASVTTSPTRVylvrATPVGAVPIRSSPARSAPAT 143
Cdd:PRK14959 446 SGIPPRPAPRMPEASPVPGAPDSV----ASASDAPPTLGDPSDTAEHT 489
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1-155 1.47e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    1 MERDSHGNASPARTPSAGASPAQASPAGtppGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03307  768 LAEALALLEPAEPQRGAGSSPPVRAEAA---FRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAAR 844
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116256363   81 PaqaSPARASPALASLSRSSSGRSSSARSASVTTSPTRvylvRATPVGAVPIRSSPARSAPATRATRESPGTSLP 155
Cdd:PHA03307  845 P---PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEP----RARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVK 912
PDHac_trf_long TIGR01348
pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form; This model ...
25-87 1.70e-03

pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form; This model describes a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and per cent identity (UPGMA) trees. Members of this set include two or three copies of the lipoyl-binding domain. E. coli AceF is a member of this model, while mitochondrial and some other bacterial forms belong to a separate model. [Energy metabolism, Pyruvate dehydrogenase]


Pssm-ID: 273566 [Multi-domain]  Cd Length: 546  Bit Score: 41.01  E-value: 1.70e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 116256363   25 SPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRaSPAQASPaqASPA 87
Cdd:TIGR01348 191 VAGSTPATAPAPASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGTQ-NPAKVDH--AAPA 250
PHA03270 PHA03270
envelope glycoprotein C; Provisional
9-67 1.95e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165528 [Multi-domain]  Cd Length: 466  Bit Score: 41.07  E-value: 1.95e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPART-PSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPG 67
Cdd:PHA03270  19 LCAGAGaPRGAVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA 78
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
21-90 1.98e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 1.98e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  21 PAQASPAgtppgraspAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPARAS 90
Cdd:PRK07994 361 PAAPLPE---------PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQ 421
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
1-91 2.00e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 40.65  E-value: 2.00e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   1 MERDSHGNASPARTPSAGASPAQASPAGTPPgRASPAQASPAQASPAGTPPGRASPaQASPAGTPPGRASPGRASPAQAS 80
Cdd:PHA03201   1 MKRARSRSPSPPRRPSPPRPTPPRSPDASPE-ETPPSPPGPGAEPPPGRAAGPAAP-RRRPRGCPAGVTFSSSAPPRPPL 78
                         90
                 ....*....|.
gi 116256363  81 PAQASPARASP 91
Cdd:PHA03201  79 GLDDAPAATPP 89
PHA01929 PHA01929
putative scaffolding protein
22-91 2.01e-03

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 40.42  E-value: 2.01e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 116256363  22 AQASPAGTPPGRASP--AQASPAQASPAGTPPGRASPAQAsPAGTPPGRASPGRASPAQASPAQASPARASP 91
Cdd:PHA01929  17 ANVPPAAAPTPQPNPviQPQAPVQPGQPGAPQQLAIPTQQ-PQPVPTSAMTPHVVQQAPAQPAPAAPPAAGA 87
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1-156 2.35e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 2.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    1 MERDSHGNASPARTPSAGASPAQASPAGTPPGRASP--AQASPAQASPAGTPPGRASPAqaSPAGTPPGRA--------- 69
Cdd:PHA03307  227 SAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPitLPTRIWEASGWNGPSSRPGPA--SSSSSPRERSpspspsspg 304
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   70 SPGRASPAQASPAQASPARASPALASLSRSSSGRSSSARSASVTTSPTRvylVRATPVGAVPIRSSPARSAPATRATRES 149
Cdd:PHA03307  305 SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSP---SRPPPPADPSSPRKRPRPSRAPSSPAAS 381

                  ....*..
gi 116256363  150 PGTSLPK 156
Cdd:PHA03307  382 AGRPTRR 388
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-85 2.46e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 2.46e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   16 SAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQAS 85
Cdd:PHA03247  369 SAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSA 438
PRK12495 PRK12495
hypothetical protein; Provisional
9-89 2.65e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 39.47  E-value: 2.65e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   9 ASPARTPSAGASPAqASPAGTPPGRASPAQASPAQASPAG--TPPGRASPAQASPA-GTPPGRASPGRASPAQASPAQAS 85
Cdd:PRK12495 109 ADQSAPPEASSTSA-TDEAATDPPATAAARDGPTPDPTAQpaTPDERRSPRQRPPVsGEPPTPSTPDAHVAGTLQAARES 187

                 ....
gi 116256363  86 PARA 89
Cdd:PRK12495 188 LVET 191
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-160 2.73e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 2.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAgtPPGRASPA---QASPAQASP-----------------AGTPPGRASPAQASPA---GTP 65
Cdd:PHA03247 2492 AGAAPDPGGGGPPDPDAPP--APSRLAPAilpDEPVGEPVHprmltwirgleelasddAGDPPPPLPPAAPPAApdrSVP 2569
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   66 PGRASPGRASPAQAS-------PAQA-----------SPARASPALASLSRSSSGRSSSARSASVTTSPTRVYLVRATPV 127
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSrarrpdaPPQSarprapvddrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPP 2649
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 116256363  128 GAVPIRSSPARSAPATRATRES--PGTSLPKFTWR 160
Cdd:PHA03247 2650 ERPRDDPAPGRVSRPRRARRLGraAQASSPPQRPR 2684
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
7-91 3.34e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 40.46  E-value: 3.34e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GN-ASPARTPSAGASPAQASPAGTPPGRASPAQASPaQASPAGTPPgrASPAQ-ASPAGTPP----GRASPGRASPAQAS 80
Cdd:PLN02217 564 GNpGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSP-PAGHLGSPP--ATPSKiVSPSTSPPashlGSPSTTPSSPESSI 640
                         90
                 ....*....|.
gi 116256363  81 PAQASPArASP 91
Cdd:PLN02217 641 KVASTET-ASP 650
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
35-91 3.83e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.06  E-value: 3.83e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 116256363   35 SPAQASPAQASPA-GTPPGRASPAQASPAGTPPGRASPGRA---SPAQASPAQASPARASP 91
Cdd:pfam03546 245 APAAATPAQAKPAlKTPQTKASPRKGTPITPTSAKVPPVRVgtpAPWKAGTVTSPACASSP 305
PRK10856 PRK10856
cytoskeleton protein RodZ;
6-91 3.98e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 39.62  E-value: 3.98e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   6 HGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAgtppgrASPAQASPAGTPPGRASPgrASPAQASPAQAS 85
Cdd:PRK10856 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPA------PAVDPQQNAVVAPSQANV--DTAATPAPAAPA 229

                 ....*.
gi 116256363  86 PARASP 91
Cdd:PRK10856 230 TPDGAA 235
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
23-90 4.37e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 39.93  E-value: 4.37e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 116256363  23 QASPAGTPPGRAS----PAQASPAQASPAG-TPPGRASPAQASPAG--TPPGRASPGRASPAQASPAQASPARAS 90
Cdd:PRK14954 381 APSPAGSPDVKKKapepDLPQPDRHPGPAKpEAPGARPAELPSPASapTPEQQPPVARSAPLPPSPQASAPRNVA 455
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
11-90 4.41e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 38.13  E-value: 4.41e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363  11 PARTPSAGASpAQASPAGTPPGRASPAQASPAqasPAGTPPGRASPAQASP---AGTPPGRASPGRASPAQAspAQASPA 87
Cdd:cd21975   78 PLRGPSVEGS-SLESGDADMGSDSDVAPASGA---AASTSPESSSDAASSPsplSLLHPGEAGLEPERPRPR--VRRGVR 151

                 ...
gi 116256363  88 RAS 90
Cdd:cd21975  152 RRG 154
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
8-91 4.88e-03

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 39.40  E-value: 4.88e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   8 NASPARTPSAGAspAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASPA 87
Cdd:PRK12373 207 NASKALAEDIGD--TVKRIDGTEVPLLAPWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSG 284

                 ....
gi 116256363  88 RASP 91
Cdd:PRK12373 285 TAAA 288
PHA03247 PHA03247
large tegument protein UL36; Provisional
9-156 5.39e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 5.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    9 ASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASP--------AGTPPGRASPAQASPAG----------------- 63
Cdd:PHA03247  257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPdgvwgaalAGAPLALPAPPDPPPPApagdaeeeddedgamev 336
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   64 -----------------------TPP--------GRASPGRASPA----QASPAQASPARASPALASLSRSSSGRSSSAR 108
Cdd:PHA03247  337 vsplprprqhyplgfpkrrrptwTPPssledlsaGRHHPKRASLPtrkrRSARHAATPFARGPGGDDQTRPAAPVPASVP 416
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 116256363  109 SASVTTSPTRVYLVRATPVG----------AVPIRSSPARsaPATRATRESPGTSLPK 156
Cdd:PHA03247  417 TPAPTPVPASAPPPPATPLPsaepgsddgpAPPPERQPPA--PATEPAPDDPDDATRK 472
PRK06975 PRK06975
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
33-90 5.66e-03

bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed


Pssm-ID: 235899 [Multi-domain]  Cd Length: 656  Bit Score: 39.70  E-value: 5.66e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 116256363  33 RASPAQASPAQASPAGTPPgrasPAQASPAGTPPGRASPgrASPAQASPAQASPARAS 90
Cdd:PRK06975 269 AAQPATAAPAPSRMTDTND----SKSVTSQPAAAAAAPA--PPPNPPATPPEPPARRG 320
Rib_recp_KP_reg pfam05104
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ...
11-91 6.01e-03

Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.


Pssm-ID: 461548 [Multi-domain]  Cd Length: 140  Bit Score: 37.41  E-value: 6.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   11 PARTPSAGASPAQASPAGTPPGRASPAQASPAQASpagtppgraSPAQASPAGTPPGRASPGrASPAQASPAQASPARAS 90
Cdd:pfam05104  69 PDEAPSAALEPEPVPTPVPAPVEPEPAPPSESPAP---------SPKEKKKKEKKSAKVEPA-ETPEAVQPKPALEKEEP 138

                  .
gi 116256363   91 P 91
Cdd:pfam05104 139 P 139
motB PRK12799
flagellar motor protein MotB; Reviewed
4-149 8.90e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 38.54  E-value: 8.90e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   4 DSHGNASPARTPSAGASPAQASPAGTPPGRASPAQ-ASPAQASPAGTPPGRASPAQASpAGTPPGRASPGRASPAQASPA 82
Cdd:PRK12799 294 DTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAViPSSVTTQSATTTQASAVALSSA-GVLPSDVTLPGTVALPAAEPV 372
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116256363  83 QASPARASPALASLSRsssgrsssarsasvTTSPTRvylVRATPVGAVPirSSPARSAPATRATRES 149
Cdd:PRK12799 373 NMQPQPMSTTETQQSS--------------TGNITS---TANGPTTSLP--AAPASNIPVSPTSRDA 420
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
2-78 9.08e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 37.36  E-value: 9.08e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 116256363   2 ERDSHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPaqASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQ 78
Cdd:cd21975   85 EGSSLESGDADMGSDSDVAPASGAAASTSPESSSDAASSP--SPLSLLHPGEAGLEPERPRPRVRRGVRRRGVTPAA 159
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
5-91 9.69e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 39.00  E-value: 9.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363    5 SHGNASPARTPSAGASPAQASPAGTPPGRASPAQASPAQASPAGTPPGRASPAQASPAGTppGRASPGRASPAQASPAQ- 83
Cdd:PHA03307  331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRA--RAAVAGRARRRDATGRFp 408

                  ....*...
gi 116256363   84 ASPARASP 91
Cdd:PHA03307  409 AGRPRPSP 416
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
7-91 9.87e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 38.70  E-value: 9.87e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 116256363   7 GNASPARTPSAGASPAQASPAgtPPGRAspaqASPAQASPAGTPPGRASPAQASPAGTPPGRASPGRASPAQASPAQASP 86
Cdd:cd23959  157 GQHPPPAKPLPAAAAAQQSSA--SPGEV----ASPFASGTVSASPFATATDTAPSSGAPDGFPAEASAPSPFAAPASAAS 230

                 ....*
gi 116256363  87 ARASP 91
Cdd:cd23959  231 FPAAP 235
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
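
Each hit in the listing above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters state an E-value threshold of 0.01. The following is a minimal Python sketch, not part of the NCBI output, of how such a summary line could be parsed and checked against that threshold. The names parse_summary and passes_threshold are hypothetical helpers introduced here for illustration, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for one per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of a summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is reported when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


# Example using the rad23 (TIGR00601) summary line from the listing.
line = "Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 42.19  E-value: 6.16e-04"
hit = parse_summary(line)
print(hit, passes_threshold(hit))
```

Every hit shown in this listing has an E-value below 0.01, so all of them would pass this check.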

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.