NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|112382226|ref|NP_001036147|]
View 

arginine-glutamic acid dipeptide repeats protein isoform b [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
14-1011 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 998.51  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    14 GKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSN 93
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    94 KRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 173
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   174 SAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPvpHTHIQQAPALH 253
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   254 PQRPPSPHPPPHPSPHPPLQPltgsagQPSAPSHAQPPLHGQGPPGPHSLQAGP-LLQHPGPPQPFGLPPQASQGQAPLG 332
Cdd:pfam03154  239 PQRLPSPHPPLQPMTQPPPPS------QVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPG 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   333 TSPAAAYP-HTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSMNANLPP 411
Cdd:pfam03154  313 PSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPP 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   412 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQNLPPPPASHPPT-GLHQVAPQPPFAQHPFVPGGP 490
Cdd:pfam03154  393 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGP 472
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   491 PPITPPTCPSTSTPPAGPGTsaQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEPTVVDTP 570
Cdd:pfam03154  473 PPITPPSGPPTSTSSAMPGI--QPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   571 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAER 650
Cdd:pfam03154  551 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAER 630
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   651 AAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPL 730
Cdd:pfam03154  631 AAKASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPL 710
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   731 LAYHMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPTAGPHPF 810
Cdd:pfam03154  711 LAYHMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPF 790
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   811 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 890
Cdd:pfam03154  791 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 870
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   891 HQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQS 970
Cdd:pfam03154  871 HQGSGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGGLPPPMSAAHQLQAMHAQS 950
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|.
gi 112382226   971 AELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1011
Cdd:pfam03154  951 AELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
14-1011 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 998.51  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    14 GKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSN 93
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    94 KRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 173
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   174 SAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPvpHTHIQQAPALH 253
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   254 PQRPPSPHPPPHPSPHPPLQPltgsagQPSAPSHAQPPLHGQGPPGPHSLQAGP-LLQHPGPPQPFGLPPQASQGQAPLG 332
Cdd:pfam03154  239 PQRLPSPHPPLQPMTQPPPPS------QVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPG 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   333 TSPAAAYP-HTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSMNANLPP 411
Cdd:pfam03154  313 PSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPP 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   412 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQNLPPPPASHPPT-GLHQVAPQPPFAQHPFVPGGP 490
Cdd:pfam03154  393 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGP 472
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   491 PPITPPTCPSTSTPPAGPGTsaQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEPTVVDTP 570
Cdd:pfam03154  473 PPITPPSGPPTSTSSAMPGI--QPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   571 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAER 650
Cdd:pfam03154  551 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAER 630
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   651 AAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPL 730
Cdd:pfam03154  631 AAKASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPL 710
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   731 LAYHMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPTAGPHPF 810
Cdd:pfam03154  711 LAYHMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPF 790
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   811 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 890
Cdd:pfam03154  791 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 870
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   891 HQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQS 970
Cdd:pfam03154  871 HQGSGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGGLPPPMSAAHQLQAMHAQS 950
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|.
gi 112382226   971 AELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1011
Cdd:pfam03154  951 AELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
PHA03247 PHA03247
large tegument protein UL36; Provisional
22-402 1.67e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 1.67e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   22 RSRGSMSTLRSGRKKQPASPDGRTSPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREK 99
Cdd:PHA03247 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  100 VASDTEEADRTSSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 174
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  175 AQQQMLQAQPPALQAP-TGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPhthiqQAPALH 253
Cdd:PHA03247 2733 PALPAAPAPPAVPAGPaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-----WDPADP 2807
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  254 PQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGT 333
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  334 SPAAAYPHTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQA-HKHPPHLSGPSP 402
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApTTDPAGAGEPSG 2957
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
3-175 1.58e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.22  E-value: 1.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    3 KPVKEEDDGLSGKHSMRTRRSR------GSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVK 76
Cdd:NF033609  555 EPIPEDSDSDPGSDSGSDSSNSdsgsdsGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   77 KSAKKVKEEASSPLKSNKRQREKVASDTE---EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQ 152
Cdd:NF033609  635 DSASDSDSDSDSDSDSDSDSDSDSDSDSDsdsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDS 714
                         170       180
                  ....*....|....*....|...
gi 112382226  153 DNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  715 DSDSDSDS-DSDSDSDSDSDSDS 736
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
36-251 8.43e-06

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 49.63  E-value: 8.43e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   36 KQPASPDGRTSPINEDIRSSGRNSPSAASTSsnDSKAETVKKSAKKVKEEA--SSPLKSNKR---------QREKVASDT 104
Cdd:NF033838  246 KEAVEKNVATSEQDKPKRRAKRGVLGEPATP--DKKENDAKSSDSSVGEETlpSPSLKPEKKvaeaekkveEAKKKAKDQ 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  105 EEADR----TSSKKTKTQEISRPNSP-SEGE-----GESSDSRsvNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSS 174
Cdd:NF033838  324 KEEDRrnypTNTYKTLELEIAESDVKvKEAElelvkEEAKEPR--NEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEE 401
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 112382226  175 AQQQMlqaqppALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHI--QQAPA 251
Cdd:NF033838  402 AKRKA------AEEDKVKEKPAEQPQPAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSEEEYNRLtqQQPPK 474
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 9.00e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.44  E-value: 9.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  606 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 685
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  686 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 756
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 1.18e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  630 SASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 709
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  710 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 1.67e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 1.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  650 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 729
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES---SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  730 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
46-244 1.77e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 1.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   46 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE-EADRTSSKKTKTQEISRPN 124
Cdd:NF033609  704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 783
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  125 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDS-----SAQQQMLQAQPPALQAPTGVTPAPS 198
Cdd:NF033609  784 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSdsdsdSDSDSDSDSDSDSDSDSDSESDSNS 862
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 112382226  199 SAPPGTpqlptpgptpSATAVPPQGSPTASQAPNQPQAPTA--PVPHT 244
Cdd:NF033609  863 DSESGS----------NNNVVPPNSPKNGTNASNKNEAKDSkePLPDT 900
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
174-251 1.32e-03

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 42.68  E-value: 1.32e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 112382226  174 SAQQQMLQAQPPAlqAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPhthiQQAPA 251
Cdd:COG5373    31 EELEAELAEAAEA--ASAPAEPEPEAAAAATAAAPEAAPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEP----AAAPA 102
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
202-362 6.43e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.18  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   202 PGTPQLPTPGPTPSATAVPPQGSPtasqapnQPQAPTAPVPHTHIQQAPalhpqrppsphppphpsphpplqplTGSAGQ 281
Cdd:TIGR01628  380 PRMRQLPMGSPMGGAMGQPPYYGQ-------GPQQQFNGQPLGWPRMSM-------------------------MPTPMG 427
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   282 PSAPSHAQ--PPLHGQGPPGPHSLQAgpllQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLPASQSALQSqQPP 359
Cdd:TIGR01628  428 PGGPLRPNglAPMNAVRAPSRNAQNA----AQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLAS-ATP 502

                   ...
gi 112382226   360 REQ 362
Cdd:TIGR01628  503 QMQ 505
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
14-1011 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 998.51  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    14 GKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSN 93
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    94 KRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 173
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   174 SAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPvpHTHIQQAPALH 253
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   254 PQRPPSPHPPPHPSPHPPLQPltgsagQPSAPSHAQPPLHGQGPPGPHSLQAGP-LLQHPGPPQPFGLPPQASQGQAPLG 332
Cdd:pfam03154  239 PQRLPSPHPPLQPMTQPPPPS------QVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPG 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   333 TSPAAAYP-HTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSMNANLPP 411
Cdd:pfam03154  313 PSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPP 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   412 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQNLPPPPASHPPT-GLHQVAPQPPFAQHPFVPGGP 490
Cdd:pfam03154  393 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGP 472
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   491 PPITPPTCPSTSTPPAGPGTsaQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEPTVVDTP 570
Cdd:pfam03154  473 PPITPPSGPPTSTSSAMPGI--QPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTP 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   571 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAER 650
Cdd:pfam03154  551 SHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAER 630
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   651 AAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPL 730
Cdd:pfam03154  631 AAKASSSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPL 710
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   731 LAYHMPGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPTAGPHPF 810
Cdd:pfam03154  711 LAYHMPGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPF 790
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   811 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 890
Cdd:pfam03154  791 ASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPL 870
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   891 HQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQS 970
Cdd:pfam03154  871 HQGSGGPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVFGTPYPRDLPGGLPPPMSAAHQLQAMHAQS 950
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|.
gi 112382226   971 AELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1011
Cdd:pfam03154  951 AELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
PHA03247 PHA03247
large tegument protein UL36; Provisional
22-402 1.67e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 1.67e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   22 RSRGSMSTLRSGRKKQPASPDGRTSPINE--DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREK 99
Cdd:PHA03247 2576 RPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD 2655
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  100 VASDTEEADRTSSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEGSS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSS 174
Cdd:PHA03247 2656 PAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLTSLadpppPPPTPEPAPHALVSATPLPPGPAAARQAS 2732
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  175 AQQQMLQAQPPALQAP-TGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPhthiqQAPALH 253
Cdd:PHA03247 2733 PALPAAPAPPAVPAGPaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP-----WDPADP 2807
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  254 PQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGT 333
Cdd:PHA03247 2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  334 SPAAAYPHTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQA-HKHPPHLSGPSP 402
Cdd:PHA03247 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApTTDPAGAGEPSG 2957
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
184-362 1.17e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 55.76  E-value: 1.17e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  184 PPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPP 263
Cdd:PRK07764  591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  264 PHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQ--GPPGPHSLQA-GPLLQHPGPPQPFGLPPQASQGQAPLGTSPAA--A 338
Cdd:PRK07764  671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPApaPAATPPAGQAdDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDppD 750
                         170       180
                  ....*....|....*....|....
gi 112382226  339 YPHTSLQLPASQSALQSQQPPREQ 362
Cdd:PRK07764  751 PAGAPAQPPPPPAPAPAAAPAAAP 774
PHA03247 PHA03247
large tegument protein UL36; Provisional
121-457 3.75e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 3.75e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  121 SRPNSPSEGEGESSDSRSVNDEGSSdPKDIDQDNRSTSPSIPSpqdnesdsdsSAQQQMLQAQPPALQAPTGvTPAPSSA 200
Cdd:PHA03247 2633 PAANEPDPHPPPTVPPPERPRDDPA-PGRVSRPRRARRLGRAA----------QASSPPQRPRRRAARPTVG-SLTSLAD 2700
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  201 PPGTPQLPTPGPTPSATAVPPQGSPTASQAPNqPQAPTAPVPhthiqQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAG 280
Cdd:PHA03247 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQAS-PALPAAPAP-----PAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  281 QPSAPSHAQPPlhgqgPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLPASQSALQSQQPPR 360
Cdd:PHA03247 2775 PAAGPPRRLTR-----PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  361 EQPLPPAPLAMPHIKPPPTTPIPQLPAPQAH------KHPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPL 434
Cdd:PHA03247 2850 LPLGGSVAPGGDVRRRPPSRSPAAKPAAPARppvrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP 2929
                         330       340
                  ....*....|....*....|...
gi 112382226  435 QLMPQSQPLPSSPAQPPGLTQSQ 457
Cdd:PHA03247 2930 QPPPPPPPRPQPPLAPTTDPAGA 2952
PHA03247 PHA03247
large tegument protein UL36; Provisional
154-547 1.22e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 1.22e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  154 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTG------VTPAPSSAPPGTPQLPTPGPTPSATAV-PPQGSPT 226
Cdd:PHA03247 2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  227 ASQAPNQPQAPTAP--VPHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQ 304
Cdd:PHA03247 2645 TVPPPERPRDDPAPgrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG 2724
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  305 AGPLLQ-HPGPPQPFGLPPQASQGQAPLGTSPAAAyphtslqlPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIP 383
Cdd:PHA03247 2725 PAAARQaSPALPAAPAPPAVPAGPATPGGPARPAR--------PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRE 2796
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  384 QLPAPQAHKHPPHLSGPSPFSMNANLPPPPAlkplsslsthhppsahppplqLMPQSQPLPSSPAQPPG-----LTQSQN 458
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPAGP---------------------LPPPTSAQPTAPPPPPGppppsLPLGGS 2855
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  459 LPPPPASHPPTGLHQVAPQPPFAQHPFVPGGPPPITPPTCPSTSTPPAGPgtsAQPPCSGAAASGGSIAGGSSCPLPTVQ 538
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP---ERPPQPQAPPPPQPQPQPPPPPQPQPP 2932

                  ....*....
gi 112382226  539 IKEEALDDA 547
Cdd:PHA03247 2933 PPPPPRPQP 2941
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
3-175 1.58e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.22  E-value: 1.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    3 KPVKEEDDGLSGKHSMRTRRSR------GSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVK 76
Cdd:NF033609  555 EPIPEDSDSDPGSDSGSDSSNSdsgsdsGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 634
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   77 KSAKKVKEEASSPLKSNKRQREKVASDTE---EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQ 152
Cdd:NF033609  635 DSASDSDSDSDSDSDSDSDSDSDSDSDSDsdsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDS 714
                         170       180
                  ....*....|....*....|...
gi 112382226  153 DNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  715 DSDSDSDS-DSDSDSDSDSDSDS 736
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
184-336 3.63e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 51.19  E-value: 3.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   184 PPALQAPTGVTPAPSSAPPGTPQLP----------------TPGPTPSATAVPPQgSPTASQAPNQPQAPTAPVPHTHIQ 247
Cdd:pfam09770  166 APKKAAAPAPAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQ-PPAAPPAQQAQQQQQFPPQIQQQQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   248 QAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQA-----------------GPLLQ 310
Cdd:pfam09770  245 QPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQIlqnpnrlsaarvgypqnPQPGV 324
                          170       180
                   ....*....|....*....|....*.
gi 112382226   311 HPGPPQPFGLPPQASQGQAPLGTSPA 336
Cdd:pfam09770  325 QPAPAHQAHRQQGSFGRQAPIITHPQ 350
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
36-251 8.43e-06

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 49.63  E-value: 8.43e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   36 KQPASPDGRTSPINEDIRSSGRNSPSAASTSsnDSKAETVKKSAKKVKEEA--SSPLKSNKR---------QREKVASDT 104
Cdd:NF033838  246 KEAVEKNVATSEQDKPKRRAKRGVLGEPATP--DKKENDAKSSDSSVGEETlpSPSLKPEKKvaeaekkveEAKKKAKDQ 323
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  105 EEADR----TSSKKTKTQEISRPNSP-SEGE-----GESSDSRsvNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSS 174
Cdd:NF033838  324 KEEDRrnypTNTYKTLELEIAESDVKvKEAElelvkEEAKEPR--NEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEE 401
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 112382226  175 AQQQMlqaqppALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHI--QQAPA 251
Cdd:NF033838  402 AKRKA------AEEDKVKEKPAEQPQPAPAPQPEKPAPKPEKPAEQPKAEKPADQQAEEDYARRSEEEYNRLtqQQPPK 474
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
106-291 9.67e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 9.67e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  106 EADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPP 185
Cdd:PRK07764  598 EGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAA 677
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  186 ALQAPTGVTPAPSSAPPGTPQlPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPPH 265
Cdd:PRK07764  678 PAAPPPAPAPAAPAAPAGAAP-AQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPA 756
                         170       180
                  ....*....|....*....|....*.
gi 112382226  266 PSPHPPLQPLTGSAGQPSAPSHAQPP 291
Cdd:PRK07764  757 QPPPPPAPAPAAAPAAAPPPSPPSEE 782
PLN02967 PLN02967
kinase
4-133 5.54e-05

kinase


Pssm-ID: 215521 [Multi-domain]  Cd Length: 581  Bit Score: 46.96  E-value: 5.54e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    4 PVKEEDDGLSGKHSMRTRRSRgsmstlRSGRKKQPASPDGRTSPINEDIRssgrNSPSAASTSSNDSKAETVKKSA---K 80
Cdd:PLN02967   57 AVDEEPDENGAVSKKKPTRSV------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaasS 126
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 112382226   81 KVKEEASSPLKSNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGES 133
Cdd:PLN02967  127 DVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
169-308 5.87e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 5.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  169 SDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQ 248
Cdd:PRK07764  367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAG 446
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 112382226  249 APALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQP-SAPSHAQPPLHGQGPPGPHSLQAGPL 308
Cdd:PRK07764  447 NAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPaPAPPAAPAPAAAPAAPAAPAAPAGAD 507
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
156-242 5.92e-05

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 46.16  E-value: 5.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  156 STSPSIPSPQDNESDSDSSAQQQMLQAQPPALQaPTGVTPAPSSAPPGTPQLPTPGPTPSATAvPPQGSPTASQAPNQPQ 235
Cdd:PRK13042   17 TTGVITTTTQAANATTPSSTKVEAPQSTPPSTK-VEAPQSKPNATTPPSTKVEAPQQTPNATT-PSSTKVETPQSPTTKQ 94

                  ....*..
gi 112382226  236 APTAPVP 242
Cdd:PRK13042   95 VPTEINP 101
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
179-356 5.93e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 47.02  E-value: 5.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  179 MLQAQPPAlqAPTGVTPAPSSAP-PGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRP 257
Cdd:PRK14951  361 LLAFKPAA--AAEAAAPAEKKTPaRPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAP 438
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  258 PSPHPPPHPSPHPPLqpltgsagqPSAPSHAQPPLHGQgpPGPHSLQAGPllqHPGPPQPFGLPPQASQGQAPLGTSP-- 335
Cdd:PRK14951  439 AAAPAAVALAPAPPA---------QAAPETVAIPVRVA--PEPAVASAAP---APAAAPAAARLTPTEEGDVWHATVQql 504
                         170       180
                  ....*....|....*....|.
gi 112382226  336 AAAYPHTSLqlpASQSALQSQ 356
Cdd:PRK14951  505 AAAEAITAL---ARELALQSE 522
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
162-340 7.14e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.90  E-value: 7.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  162 PSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPV 241
Cdd:PRK07764  597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  242 PHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQPLtgSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPG-PPQPFGL 320
Cdd:PRK07764  677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPP--AGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDdPPDPAGA 754
                         170       180
                  ....*....|....*....|
gi 112382226  321 PPQASQGQAPLGTSPAAAYP 340
Cdd:PRK07764  755 PAQPPPPPAPAPAAAPAAAP 774
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
193-400 7.77e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.90  E-value: 7.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  193 VTPAPSSAPPGTPQLPTPGPTPSATAVP-PQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPPHPSPHPP 271
Cdd:PRK07764  588 VGPAPGAAGGEGPPAPASSGPPEEAARPaAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  272 LQPLTGSAGQPSAPSHAQPPLHGQGPPGphslQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAAyphtslqlpaSQS 351
Cdd:PRK07764  668 GWPAKAGGAAPAAPPPAPAPAAPAAPAG----AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA----------PSP 733
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 112382226  352 ALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP 400
Cdd:PRK07764  734 AADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
126-356 8.23e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 8.23e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  126 PSEGEGESSDSRSVNDEgSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQ--APTGVTPAPSSAPPG 203
Cdd:PRK12323  365 PGQSGGGAGPATAAAAP-VAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRspAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  204 TPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPS 283
Cdd:PRK12323  444 PGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVA 523
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  284 A----PSHAQPPlhGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLP----ASQSALQS 355
Cdd:PRK12323  524 EsipdPATADPD--DAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPvrglAQQLARQS 601

                  .
gi 112382226  356 Q 356
Cdd:PRK12323  602 E 602
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 9.00e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.44  E-value: 9.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  606 SASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 685
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  686 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 756
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 1.18e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  630 SASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 709
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  710 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
123-340 1.27e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 1.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  123 PNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAqPPALQAPTGVTPAPSSAPP 202
Cdd:PRK07764  591 APGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPA-PAGAAAAPAEASAAPAPGVAA-PEHHPKHVAVPDASDGGDG 668
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  203 GTPQLPTPGPTPSATAVPPQGSPT-ASQAPNQPQAPTAPVPHTHIQQAPAlhpqrppsphppPHPSPHPPLQPLTGSAGQ 281
Cdd:PRK07764  669 WPAKAGGAAPAAPPPAPAPAAPAApAGAAPAQPAPAPAATPPAGQADDPA------------AQPPQAAQGASAPSPAAD 736
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 112382226  282 PSAPSHAQPPLHGQGPPGPHSLQAGPllqHPGPPQPFGLPPQASQGQAPLGTSPAAAYP 340
Cdd:PRK07764  737 DPVPLPPEPDDPPDPAGAPAQPPPPP---APAPAAAPAAAPPPSPPSEEEEMAEDDAPS 792
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
26-175 1.67e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 1.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   26 SMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE 105
Cdd:NF033609  650 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 729
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 112382226  106 -EADRTSSKKTKTQEISRPNSPSEGEGES---SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 175
Cdd:NF033609  730 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
46-244 1.77e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.67  E-value: 1.77e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   46 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEASSPLKSNKRQREKVASDTE-EADRTSSKKTKTQEISRPN 124
Cdd:NF033609  704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 783
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  125 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDS-----SAQQQMLQAQPPALQAPTGVTPAPS 198
Cdd:NF033609  784 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSdsdsdSDSDSDSDSDSDSDSDSDSESDSNS 862
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 112382226  199 SAPPGTpqlptpgptpSATAVPPQGSPTASQAPNQPQAPTA--PVPHT 244
Cdd:NF033609  863 DSESGS----------NNNVVPPNSPKNGTNASNKNEAKDSkePLPDT 900
PRK10856 PRK10856
cytoskeleton protein RodZ;
132-236 2.81e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 44.25  E-value: 2.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  132 ESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPG 211
Cdd:PRK10856  149 QSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAP 228
                          90       100
                  ....*....|....*....|....*
gi 112382226  212 PTPSATAVPPQGSPTASQAPNQPQA 236
Cdd:PRK10856  229 ATPDGAAPLPTDQAGVSTPAADPNA 253
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1-154 3.50e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 44.65  E-value: 3.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    1 MFKPVKEEDDGLSGKHSMRTRRSRGSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAK 80
Cdd:PTZ00108 1236 KKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLE 1315
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 112382226   81 KVKEEASSPLKSNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEgESSDSRSVNDEGSSDPKDIDQDN 154
Cdd:PTZ00108 1316 GSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEVDDSEDEDDEDDEDDD 1388
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
206-459 4.11e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.26  E-value: 4.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   206 QLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPV-----------PHTHIQQAPALHPQRPPSPHPPPHPSPHPPLQP 274
Cdd:pfam09770  105 QQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepePIPDLQVDASLWGVAPKKAAAPAPAPQPAAQPA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   275 LTGSAG---------------------QPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPlgt 333
Cdd:pfam09770  185 SLPAPSrkmmsleeveaamraqakkpaQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGH--- 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   334 spaaayPHTSLQLPASQSALQSQQPPREQplppaplamphikpppttpipqlpaPQAHKHPPHLSGPSPFSMNANLPPPP 413
Cdd:pfam09770  262 ------PVTILQRPQSPQPDPAQPSIQPQ-------------------------AQQFHQQPPPVPVQPTQILQNPNRLS 310
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 112382226   414 ALKPLssLSTHHPPSAHPPPLQLMPQSQplPSSPAQPPGLTQSQNL 459
Cdd:pfam09770  311 AARVG--YPQNPQPGVQPAPAHQAHRQQ--GSFGRQAPIITHPQQL 352
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
181-405 5.23e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.10  E-value: 5.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  181 QAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSP 260
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  261 HPPPHPSPHPplqpltgsAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQasqgqaplgtsPAAAYP 340
Cdd:PRK12323  451 APAPAAAPAA--------AARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE-----------FASPAP 511
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 112382226  341 HTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSM 405
Cdd:PRK12323  512 AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
137-251 5.75e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.11  E-value: 5.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  137 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQqqmlqaqPPALQAPTGVTPAPSSAPPGTPQlPTPGPTP 214
Cdd:PRK12270   17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAP-------AAAPAAKAPAAPAPAPPAAAAPA-APPKPAA 88
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 112382226  215 SATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPA 251
Cdd:PRK12270   89 AAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPLRGAAA 125
PHA03378 PHA03378
EBNA-3B; Provisional
185-328 5.78e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 5.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  185 PALQAPTGV-TPAPSSAPPGTPQLPTPGPTPsatAVPPQGSPTASQAP---NQPQAPTAPVPHTHIQQAPALHPQRPPSP 260
Cdd:PHA03378  673 PYQPSPTGAnTMLPIQWAPGTMQPPPRAPTP---MRPPAAPPGRAQRPaaaTGRARPPAAAPGRARPPAAAPGRARPPAA 749
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 112382226  261 HPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPlhgQGPPGPHSL-QAGPLLQHP--GPPQPFGLPPQASQGQ 328
Cdd:PHA03378  750 APGRARPPAAAPGRARPPAAAPGAPTPQPPP---QAPPAPQQRpRGAPTPQPPpqAGPTSMQLMPRAAPGQ 817
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
137-234 7.95e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.23  E-value: 7.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  137 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPTGVTPAPSSAPPGTPQLPTPGPTPSA 216
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
                          90
                  ....*....|....*...
gi 112382226  217 TAVPPQGSPTASQAPNQP 234
Cdd:PRK14971  441 STAPQAVRPAQFKEEKKI 458
PRK10856 PRK10856
cytoskeleton protein RodZ;
175-284 1.23e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 42.32  E-value: 1.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  175 AQQQMLQA---QPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAP---TAPVPHTHIQQ 248
Cdd:PRK10856  138 AQQEEITTmadQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDpqqNAVVAPSQANV 217
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 112382226  249 APALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSA 284
Cdd:PRK10856  218 DTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNA 253
COG5373 COG5373
Uncharacterized membrane protein [Function unknown];
174-251 1.32e-03

Uncharacterized membrane protein [Function unknown];


Pssm-ID: 444140 [Multi-domain]  Cd Length: 854  Bit Score: 42.68  E-value: 1.32e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 112382226  174 SAQQQMLQAQPPAlqAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPhthiQQAPA 251
Cdd:COG5373    31 EELEAELAEAAEA--ASAPAEPEPEAAAAATAAAPEAAPAPVPEAPAAPPAAAEAPAPAAAAPPAEAEP----AAAPA 102
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
155-304 1.53e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 42.72  E-value: 1.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   155 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQP 234
Cdd:pfam09770  204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQP 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   235 -------QAPTAPVPHTHIQQAPALHPQRPPSPHPPPHPsphpplqpltGSAGQPSAPSHAQPPLHGQGPP---GPHSLQ 304
Cdd:pfam09770  284 qaqqfhqQPPPVPVQPTQILQNPNRLSAARVGYPQNPQP----------GVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
38-237 1.57e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 1.57e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   38 PASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAEtvKKSAKKVKEEASSPLKSNKRQREKVASDTEEADRTSSKKTKT 117
Cdd:PHA03307  190 PAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPG--RSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPT 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  118 QEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAP 197
Cdd:PHA03307  268 RIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSP 347
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 112382226  198 SSAPPgtpqlPTPGPTPSATAVPPQGSPTASQAPNQPQAP 237
Cdd:PHA03307  348 SRSPS-----PSRPPPPADPSSPRKRPRPSRAPSSPAASA 382
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
207-338 2.29e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 2.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  207 LPTPGPTPSATAVPPqGSPTASQAPNQPQAPTAPVPHthiqQAPALHPQRPPSPHPPPHPSPHPPLQPLTGSAGQPSAPS 286
Cdd:PRK07764  385 LGVAGGAGAPAAAAP-SAAAAAPAAAPAPAAAAPAAA----AAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPA 459
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 112382226  287 HAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQASQGQAPLGTSPAAA 338
Cdd:PRK07764  460 AAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAAT 511
SEEEED pfam14797
Serine-rich region of AP3B1, clathrin-adaptor complex; This short low-complexity, highly ...
63-174 2.52e-03

Serine-rich region of AP3B1, clathrin-adaptor complex; This short low-complexity, highly serine-rich region lies on clathrin-adaptor complex 3 beta-1 subunit proteins, between family Adaptin_N, pfam01602 and a C-terminal domain, AP3B1_C,pfam14796.


Pssm-ID: 434218 [Multi-domain]  Cd Length: 111  Bit Score: 38.76  E-value: 2.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226    63 ASTSSNDSKAETVKKSAKKVKEEASSplksnkrqrekvasdtEEADRTSSKKTKTQeisrpnSPSEGEGESSDSRSVNDE 142
Cdd:pfam14797   15 SSDSSSDSESESGSESEEEGKEGSSS----------------EDSSEDSSSEQESE------SGSESEKKRTAKRNSKAK 72
                           90       100       110
                   ....*....|....*....|....*....|..
gi 112382226   143 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSS 174
Cdd:pfam14797   73 GKSDSEDGEKKNEKSKTSDSSDTESSSSEESS 104
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
162-242 3.31e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.39  E-value: 3.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  162 PSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPV 241
Cdd:PRK07994  368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAAAS 447

                  .
gi 112382226  242 P 242
Cdd:PRK07994  448 R 448
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
185-300 3.36e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.39  E-value: 3.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  185 PALQAPTGVTPAPSSAPPGTPQLPtpgPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPP 264
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQAT---AAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK 437
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 112382226  265 HPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGP 300
Cdd:PRK07994  438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAP 473
PRK08581 PRK08581
amidase domain-containing protein;
49-209 3.58e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 41.31  E-value: 3.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   49 NEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKvkeeassplksNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSe 128
Cdd:PRK08581  136 YEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQ-----------DKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSN- 203
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  129 gegesSDSRSVNDEGSsdpKDIDQDNRSTSPS-IPSPQDNESDsDSSAQQQMLQAQppalqaptGVTPAPSSAPPGTPQL 207
Cdd:PRK08581  204 -----SQPASDDTANQ---KSSSKDNQSMSDSaLDSILDQYSE-DAKKTQKDYASQ--------SKKDKTETSNTKNPQL 266

                  ..
gi 112382226  208 PT 209
Cdd:PRK08581  267 PT 268
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
181-315 4.85e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 4.85e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  181 QAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPTPSATAVPPQGSPT--ASQAPNQPQAPTAPVPHTHIQQAPALHPQRPP 258
Cdd:PRK07994  374 SAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTsqLLAARQQLQRAQGATKAKKSEPAAASRARPVN 453
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 112382226  259 SPHPPPHPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPP 315
Cdd:PRK07994  454 SALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTP 510
PHA03264 PHA03264
envelope glycoprotein D; Provisional
144-246 6.13e-03

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 40.37  E-value: 6.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226  144 SSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQmlqAQPPALQAPTGVTPAPSSAPPGTPQLPTPGPT-PSATAVPPQ 222
Cdd:PHA03264  260 ESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVED---GAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPrPAPDADRPE 336
                          90       100
                  ....*....|....*....|....
gi 112382226  223 GSPTASQAPNQPQAPTAPVPHTHI 246
Cdd:PHA03264  337 GWPSLEAITFPPPTPATPAVPRAR 360
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
202-362 6.43e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.18  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   202 PGTPQLPTPGPTPSATAVPPQGSPtasqapnQPQAPTAPVPHTHIQQAPalhpqrppsphppphpsphpplqplTGSAGQ 281
Cdd:TIGR01628  380 PRMRQLPMGSPMGGAMGQPPYYGQ-------GPQQQFNGQPLGWPRMSM-------------------------MPTPMG 427
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 112382226   282 PSAPSHAQ--PPLHGQGPPGPHSLQAgpllQHPGPPQPFGLPPQASQGQAPLGTSPAAAYPHTSLQLPASQSALQSqQPP 359
Cdd:TIGR01628  428 PGGPLRPNglAPMNAVRAPSRNAQNA----AQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLAS-ATP 502

                   ...
gi 112382226   360 REQ 362
Cdd:TIGR01628  503 QMQ 505
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH