NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462613745|ref|XP_054213898|]
View 

protein piccolo isoform X4 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4493-4586 8.48e-48

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


:

Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.96  E-value: 8.48e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4493 FPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2462613745 4573 QSIISQQSGEAEIC 4586
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
587-648 1.61e-42

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


:

Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 150.57  E-value: 1.61e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGG 648
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
1057-1120 8.33e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


:

Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 148.68  E-value: 8.33e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 1057 STCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLG 1120
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
PHA03247 super family cl33720
large tegument protein UL36; Provisional
309-905 1.11e-25

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 118.50  E-value: 1.11e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEKSQPGPAKPPAQPSG------LTKPLA-QQPGTVKPPV----QPPGTTKPPAQPLGPAKPPAQQTG 377
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtsrARRPDApPQSARPRAPVddrgDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  378 SEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP---TQQVGTPKPLAQQPGLQSPAKAP--GPTKTPVQQPGPGKIPAq 452
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPtvGSLTSLADPPPPPPTPE- 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  453 qagPGKTSAQQTGPTKPPSQLPGPAKP-PPQQPGPAKPP--PQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKP 529
Cdd:PHA03247  2710 ---PAPHALVSATPLPPGPAAARQASPaLPAAPAPPAVPagPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  530 PSQQPGSAKPSAQQPS-----PAKPSAQQSTKPVSQTGSGkPLQPPTVSPSAKQPPSQGLPKTICPLCNTTelllhvpek 604
Cdd:PHA03247  2787 AVASLSESRESLPSPWdpadpPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSV--------- 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  605 anfntctecqttvcslcgfnpnphltevkewlclncqmkrALGGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQtSP 684
Cdd:PHA03247  2857 ----------------------------------------APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR-ST 2895
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  685 KKDAAPKQDLSKAPEPKKPPPLVKQPTLhgsPSAKAKQPPeadslskPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTT 764
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQP---PPPPQPQPP-------PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  765 DLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTgekvSPFDSKAIPRPASDSKIISHPGPSSESKGQK 844
Cdd:PHA03247  2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS----LALHEETDPPPVSLKQTLWPPDDTEDSDADS 3041
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  845 QVDPVQKKEEpkkaQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQ----SRRF 905
Cdd:PHA03247  3042 LFDSDSERSD----LEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSAnaalSRRY 3102
PTZ00121 super family cl31754
MAEBL; Provisional
1157-1700 2.30e-10

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.47  E-value: 2.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1157 VKKQEQEVKTEAEkvilEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASALQEKKPLPEEKKLiPEEEKIRSEEKKPLL 1236
Cdd:PTZ00121  1320 AKKKAEEAKKKAD----AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK-ADAAKKKAEEKKKAD 1394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1237 EEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQvqiAEEKLEGRVAPKTVQEGKQP-QTKMEGLPSGTPQSLPKEDDKTTK 1315
Cdd:PTZ00121  1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKK---AEEKKKADEAKKKAEEAKKAdEAKKKAEEAKKAEEAKKKAEEAKK 1471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1316 TIKEQPQPPCTAKPDQVEPGKEKTEKEDDKSDTSSSQQPKSPQGL-SDTGYSSDGI--------------------SSSL 1374
Cdd:PTZ00121  1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKkAEEAKKADEAkkaeeakkadeakkaeekkkADEL 1551
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1375 GEIPSLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSE--KKTQPHEVSPEQPKDQEKT 1452
Cdd:PTZ00121  1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEeaKKAEEAKIKAEELKKAEEE 1631
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1453 QSLSETLEITISEEEIKESQEERKDTFKK-DSQQDIPSSKDHKEKSEfvddiTTRREPYDSVEESSESENSPvPQRKRRT 1531
Cdd:PTZ00121  1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKiKAAEEAKKAEEDKKKAE-----EAKKAEEDEKKAAEALKKEA-EEAKKAE 1705
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1532 SVGSSSSDEYKQEDSQGSGEEEDFIRKQIIEMSADEDASGSED---DEFIRNQLKEISSSTESQKKEETKGKGKITAGKH 1608
Cdd:PTZ00121  1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEakkDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1609 RRltrksstsidEDAGRRHSWHDEDDEAFDESPELK------------YRETKSQESEELVVTGGGGLRRFKTIELNSTI 1676
Cdd:PTZ00121  1786 DE----------EDEKRRMEVDKKIKDIFDNFANIIeggkegnlvindSKEMEDSAIKEVADSKNMQLEEADAFEKHKFN 1855
                          570       580       590
                   ....*....|....*....|....*....|
gi 2462613745 1677 ADKYSAESSQ------KKTSLYFDEEPELE 1700
Cdd:PTZ00121  1856 KNNENGEDGNkeadfnKEKDLKEDDEEEIE 1885
kgd super family cl39092
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
947-1027 2.00e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


The actual alignment was detected with superfamily member PRK12270:

Pssm-ID: 476867 [Multi-domain]  Cd Length: 1228  Bit Score: 51.04  E-value: 2.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  947 STAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKsipvKKETKAPAAEKLEPKAEQAPTVKRTETEKK 1026
Cdd:PRK12270    43 APTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAA----AAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                   .
gi 2462613745 1027 P 1027
Cdd:PRK12270   119 P 119
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
32-411 1.01e-04

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.83  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   32 PSHTAIPAGMEADLSQLSEEERRQIAAVMSRAQGLPKGSVPPAAAESPSMHRKQELDSSHPPKQSGRPPDPGRPAQPGLS 111
Cdd:PRK07764   427 AAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  112 ksRTTDTFRSE-----QKLPGRSPSTISLKESKSR------TDLKEEHKSSMMPGFLSEVNALSAVSSVVnkfnpfdlis 180
Cdd:PRK07764   507 --DDAATLRERwpeilAAVPKRSRKTWAILLPEATvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTAL---------- 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  181 dseasQEETTKKQKVVQKEQGKPEGiikPPLQQQPPKPIPKQQGPGRDPLQQDGTPKSISSQQPEKikSQPPGTGKPIQG 260
Cdd:PRK07764   575 -----AEELGGDWQVEAVVGPAPGA---AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA--AAAPAEASAAPA 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  261 PTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPgheKSQPGPAKPPAQPSGLTK 340
Cdd:PRK07764   645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP---APAATPPAGQADDPAAQP 721
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  341 PLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQqtgsekPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP 411
Cdd:PRK07764   722 PQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP------AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
3461-3966 1.54e-04

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.16  E-value: 1.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3461 RRRRTKKSVDTSVQTDDEDQDEW---------DMPTRSRRKARVGKY--------GDSMTEADKTKPLSKVSSIAVQTVA 3523
Cdd:PRK10263   258 MGRQTDAALFSGKRMDDDEEITYtargvaadpDDVLFSGNRATQPEYdeydpllnGAPITEPVAVAAAATTATQSWAAPV 337
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3524 EISVQTEPVGTIRTPSIRARVDAKVEIIKHISAPEktykggslgCQTEADSDTQSPQYLSATSPPKDK-KRPTPLEIGYS 3602
Cdd:PRK10263   338 EPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPV---------IAPAPEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYY 408
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3603 SHLRADSTVQLAPSPPKSPKVLYSPISPLSPGKALESAFVPYEKPLPDDisPQKVLHPDMAKVPPASPKTAKMMQRSMSD 3682
Cdd:PRK10263   409 APAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA--PQSTYQTEQTYQQPAAQEPLYQQPQPVEQ 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3683 PKPLSPT--ADES--SRAPFQYTEGYTTKGSQTMTSSGA-----QKKVKRTLPN------------PPPEEIST------ 3735
Cdd:PRK10263   487 QPVVEPEpvVEETkpARPPLYYFEEVEEKRAREREQLAAwyqpiPEPVKEPEPIksslkapsvaavPPVEAAAAvsplas 566
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3736 ----GTQSTFSTMGTVSRRRICRTNTMARAKILQDIDRELDLVERESAKLRKKQAELD--------EEEKEIDAKLRYLE 3803
Cdd:PRK10263   567 gvkkATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGiklpsqraAEEKAREAQRNQYD 646
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3804 MGINRRK---EALLKEREKRERAYLQ-------------GVAEDRDYMSDSEVSSTRPTRIESQHGIERPRTAPQTEFSQ 3867
Cdd:PRK10263   647 SGDQYNDdeiDAMQQDELARQFAQTQqqrygeqyqhdvpVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDD 726
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3868 F-----------------IPPQTQTESQlvPPTSPYTQYQYSSPALPTQAPTSYTQ-QSHFEQQTLYHQQVSPYQTQPTF 3929
Cdd:PRK10263   727 FefspmkallddgpheplFTPIVEPVQQ--PQQPVAPQQQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVAPQPQY 804
                          570       580       590
                   ....*....|....*....|....*....|....*..
gi 2462613745 3930 QavatmsftpQVQPTPTPQPSYQLPSQMMVIQQKPRQ 3966
Cdd:PRK10263   805 Q---------QPQQPVAPQPQYQQPQQPVAPQPQYQQ 832
 
Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4493-4586 8.48e-48

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.96  E-value: 8.48e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4493 FPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2462613745 4573 QSIISQQSGEAEIC 4586
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
587-648 1.61e-42

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 150.57  E-value: 1.61e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGG 648
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
1057-1120 8.33e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 148.68  E-value: 8.33e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 1057 STCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLG 1120
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
1057-1115 1.55e-39

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 142.17  E-value: 1.55e-39
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1057 STCPLCK-TELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAI 1115
Cdd:pfam05715    1 TLCPLCKtTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
587-646 2.79e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 135.62  E-value: 2.79e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRAL 646
Cdd:pfam05715    1 TLCPLCKTTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
PHA03247 PHA03247
large tegument protein UL36; Provisional
309-905 1.11e-25

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 118.50  E-value: 1.11e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEKSQPGPAKPPAQPSG------LTKPLA-QQPGTVKPPV----QPPGTTKPPAQPLGPAKPPAQQTG 377
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtsrARRPDApPQSARPRAPVddrgDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  378 SEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP---TQQVGTPKPLAQQPGLQSPAKAP--GPTKTPVQQPGPGKIPAq 452
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPtvGSLTSLADPPPPPPTPE- 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  453 qagPGKTSAQQTGPTKPPSQLPGPAKP-PPQQPGPAKPP--PQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKP 529
Cdd:PHA03247  2710 ---PAPHALVSATPLPPGPAAARQASPaLPAAPAPPAVPagPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  530 PSQQPGSAKPSAQQPS-----PAKPSAQQSTKPVSQTGSGkPLQPPTVSPSAKQPPSQGLPKTICPLCNTTelllhvpek 604
Cdd:PHA03247  2787 AVASLSESRESLPSPWdpadpPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSV--------- 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  605 anfntctecqttvcslcgfnpnphltevkewlclncqmkrALGGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQtSP 684
Cdd:PHA03247  2857 ----------------------------------------APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR-ST 2895
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  685 KKDAAPKQDLSKAPEPKKPPPLVKQPTLhgsPSAKAKQPPeadslskPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTT 764
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQP---PPPPQPQPP-------PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  765 DLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTgekvSPFDSKAIPRPASDSKIISHPGPSSESKGQK 844
Cdd:PHA03247  2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS----LALHEETDPPPVSLKQTLWPPDDTEDSDADS 3041
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  845 QVDPVQKKEEpkkaQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQ----SRRF 905
Cdd:PHA03247  3042 LFDSDSERSD----LEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSAnaalSRRY 3102
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
226-575 9.73e-19

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 94.83  E-value: 9.73e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  226 GRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPP 305
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPP 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  306 IQQPTPGKPPAQQPGHEKSQP---GPAKPPAQPSGLTKPLAQQPGtvkpPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPS 382
Cdd:pfam03154  249 LQPMTQPPPPSQVSPQPLPQPslhGQMPPMPHSLQTGPSHMQHPV----PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQR 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  383 SEQPGPKALAQPPgvgKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKtpVQQPGPGKIPAQQAGPGKTSAQ 462
Cdd:pfam03154  325 IHTPPSQSQLQSQ---QPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPH--LSGPSPFQMNSNLPPPPALKPL 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTK-PPSQLPGPAKPPPQ----QPGPAKPP-----PQQPGSAKPPPQQPGSTKPPPQQP------GPAKPSPQQPGS 526
Cdd:pfam03154  400 SSLSTHhPPSAHPPPLQLMPQsqqlPPPPAQPPvltqsQSLPPPAASHPPTSGLHQVPSQSPfpqhpfVPGGPPPITPPS 479
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  527 TKPPS--------QQPGSAKPSAQQPSPAKPSA-----QQSTKPVSQTGSGKPLQPPTVSPS 575
Cdd:pfam03154  480 GPPTStssampgiQPPSSASVSSSGPVPAAVSCplppvQIKEEALDEAEEPESPPPPPRSPS 541
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
263-578 4.03e-11

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 69.32  E-value: 4.03e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  263 QTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGP--AKPPAQPSGLTK 340
Cdd:COG5180    156 QRSDPILAKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGAdhPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  341 PLAQQPGTVKPPVQPpgtTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAK---------PP 411
Cdd:COG5180    236 PPSTSEARSRPATVD---AQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPidvkgvasaPP 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  412 TQQVGTPKPLAQQPGLQ-------SPAKAPgPTKTPVQQPGPGKIPAQQAGPGK------TSAQQTGPTKPPSQLPGPAK 478
Cdd:COG5180    313 ATRPVRPPGGARDPGTPrpgqpteRPAGVP-EAASDAGQPPSAYPPAEEAVPGKpleqgaPRPGSSGGDGAPFQPPNGAP 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  479 PPPQQPGPAKPPPQQPG-SAKPPPQQPGSTKPPPQQPGpakpspQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQ----Q 553
Cdd:COG5180    392 QPGLGRRGAPGPPMGAGdLVQAALDGGGRETASLGGAA------GGAGQGPKADFVPGDAESVSGPAGLADQAGAaastA 465
                          330       340
                   ....*....|....*....|....*
gi 2462613745  554 STKPVSQTGSGKPLQPPTVSPSAKQ 578
Cdd:COG5180    466 MADFVAPVTDATPVDVADVLGVRPD 490
PDZ smart00228
Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF ...
4510-4586 4.09e-11

Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.


Pssm-ID: 214570 [Multi-domain]  Cd Length: 85  Bit Score: 62.01  E-value: 4.09e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  4510 SGNGLGIRIVGGKEIPGhsgeiGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEIC 4586
Cdd:smart00228   10 GGGGLGFSLVGGKDEGG-----GVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLT 80
PTZ00121 PTZ00121
MAEBL; Provisional
1157-1700 2.30e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.47  E-value: 2.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1157 VKKQEQEVKTEAEkvilEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASALQEKKPLPEEKKLiPEEEKIRSEEKKPLL 1236
Cdd:PTZ00121  1320 AKKKAEEAKKKAD----AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK-ADAAKKKAEEKKKAD 1394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1237 EEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQvqiAEEKLEGRVAPKTVQEGKQP-QTKMEGLPSGTPQSLPKEDDKTTK 1315
Cdd:PTZ00121  1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKK---AEEKKKADEAKKKAEEAKKAdEAKKKAEEAKKAEEAKKKAEEAKK 1471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1316 TIKEQPQPPCTAKPDQVEPGKEKTEKEDDKSDTSSSQQPKSPQGL-SDTGYSSDGI--------------------SSSL 1374
Cdd:PTZ00121  1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKkAEEAKKADEAkkaeeakkadeakkaeekkkADEL 1551
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1375 GEIPSLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSE--KKTQPHEVSPEQPKDQEKT 1452
Cdd:PTZ00121  1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEeaKKAEEAKIKAEELKKAEEE 1631
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1453 QSLSETLEITISEEEIKESQEERKDTFKK-DSQQDIPSSKDHKEKSEfvddiTTRREPYDSVEESSESENSPvPQRKRRT 1531
Cdd:PTZ00121  1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKiKAAEEAKKAEEDKKKAE-----EAKKAEEDEKKAAEALKKEA-EEAKKAE 1705
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1532 SVGSSSSDEYKQEDSQGSGEEEDFIRKQIIEMSADEDASGSED---DEFIRNQLKEISSSTESQKKEETKGKGKITAGKH 1608
Cdd:PTZ00121  1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEakkDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1609 RRltrksstsidEDAGRRHSWHDEDDEAFDESPELK------------YRETKSQESEELVVTGGGGLRRFKTIELNSTI 1676
Cdd:PTZ00121  1786 DE----------EDEKRRMEVDKKIKDIFDNFANIIeggkegnlvindSKEMEDSAIKEVADSKNMQLEEADAFEKHKFN 1855
                          570       580       590
                   ....*....|....*....|....*....|
gi 2462613745 1677 ADKYSAESSQ------KKTSLYFDEEPELE 1700
Cdd:PTZ00121  1856 KNNENGEDGNkeadfnKEKDLKEDDEEEIE 1885
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
337-587 3.26e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.33  E-value: 3.26e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  337 GLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSE-KPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQqv 415
Cdd:NF033839   278 GLTQDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEvKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVK-- 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  416 gtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPgKTSAQQTGPTKPPSQLPGPAKPPPQ-QPGPAKPPPQ-Q 493
Cdd:NF033839   356 --PQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKP-KPEVKPQPEKPKPEVKPQPEKPKPEvKPQPEKPKPEvK 432
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  494 PGSAKPPPQ---QPGSTKPP-PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTkPVSQTGSGKPLQP 569
Cdd:NF033839   433 PQPEKPKPEvkpQPEKPKPEvKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST-PNNLSKDKQPSNQ 511
                          250
                   ....*....|....*...
gi 2462613745  570 PTVSPSAKQPPSQGLPKT 587
Cdd:NF033839   512 ASTNEKATNKPKKSLPST 529
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
307-550 8.28e-09

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 61.71  E-value: 8.28e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSgltkplaQQPGTVKPPVQP-PGTTKPPAQPlGPAKPPAQQtgseKPSSEQ 385
Cdd:NF033839   304 QPEKKEVKPEPETPKPEVKPQLEKPKPEVK-------PQPEKPKPEVKPqLETPKPEVKP-QPEKPKPEV----KPQPEK 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGKTPAQQPGPAKPPTQqvgtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPgKTSAQQTG 465
Cdd:NF033839   372 PKPEVKPQPETPKPEVKPQPEKPKPEVK----PQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP-KPEVKPQP 446
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  466 PTKPPSQLPGPAKPPPQ-QPGPAKPPPQqpgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQP 544
Cdd:NF033839   447 EKPKPEVKPQPETPKPEvKPQPEKPKPE----VKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKATNKP 522

                   ....*.
gi 2462613745  545 SPAKPS 550
Cdd:NF033839   523 KKSLPS 528
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
322-583 2.48e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 59.92  E-value: 2.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  322 EKSQPGPAKP--------PAQPSGLTKPlaQQPGTVKPPVQPPGTTKP-----PAQPLGPAKPPAQQtGSEKPSSE--QP 386
Cdd:NF038329   118 EKGEPGPAGPagpageqgPRGDRGETGP--AGPAGPPGPQGERGEKGPagpqgEAGPQGPAGKDGEA-GAKGPAGEkgPQ 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGvGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPT-----KTPVQQPGPGKIPAQQAGPGKT-- 459
Cdd:NF038329   195 GPRGETGPAG-EQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTgedgpQGPDGPAGKDGPRGDRGEAGPDgp 273
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  460 --SAQQTGPTKPPSQ--------LPGPAKPPPQQPGPAKP-------PPQQPGSAKPPPQ--QPGstKPPPQQPG-PAKP 519
Cdd:NF038329   274 dgKDGERGPVGPAGKdgqngkdgLPGKDGKDGQNGKDGLPgkdgkdgQPGKDGLPGKDGKdgQPG--KPAPKTPEvPQKP 351
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  520 SpQQPGSTKPPsQQPGSAKPSAqqPSPAKPSAQQSTKPvsQTGSGKPL-QPPTVSPSAKQPPSQG 583
Cdd:NF038329   352 D-TAPHTPKTP-QIPGQSKDVT--PAPQNPSNRGLNKP--QTQGGNQLaKTPAAHDTHRQLPATG 410
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
453-557 2.72e-08

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 59.53  E-value: 2.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  453 QAGPGKTSAQQTGPTKPPSQLPGPakPPPQQPGPAKPPPQQPGSAKPPPqqPGSTKPPPQQPGPAKPSPQQPgSTKPPSQ 532
Cdd:NF040983    69 QIKKGDFKLKPVGDRTLPNKVPPP--PPPPPPPPPPPPTPPPPPPPPPP--PPPPSPPPPPPPSPPPSPPPP-TTTPPTR 143
                           90       100
                   ....*....|....*....|....*
gi 2462613745  533 QPgsakPSAQQPSPAKPSAQQSTKP 557
Cdd:NF040983   144 TT----PSTTTPTPSMHPIQPTQLP 164
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
458-563 8.58e-08

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 57.99  E-value: 8.58e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  458 KTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQpgSTKPPSQQP--G 535
Cdd:NF040983    83 RTLPNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST--TTPTPSMHPiqP 160
                           90       100
                   ....*....|....*....|....*...
gi 2462613745  536 SAKPSAQQPSPAKPSAQQSTKPVSQTGS 563
Cdd:NF040983   161 TQLPSIPNATPTSGSATNVTINFNSTGA 188
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
4503-4582 1.79e-07

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 51.51  E-value: 1.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4503 DSKDHTVSGNGLGIRIVGGKEipghSGEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGE 4582
Cdd:pfam00595    1 QVTLEKDGRGGLGFSLKGGSD----QGDPGIFVSEVLPGGAAEAGG-LKVGDRILSINGQDVENMTHEEAVLALKGSGGK 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
823-1389 4.15e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 4.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  823 RPAsDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMP-----------------KGSPTPP----GP 881
Cdd:PHA03247  2482 RPA-EARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwirgleelasddAGDPPPPlppaAP 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  882 RPTAGQTVPTPQQSPKPQEQSRRfslnlgsitdAPKSQPTTPQETVTGKlfgfgasifsqasnliSTAGQPGPHSQSGPG 961
Cdd:PHA03247  2561 PAAPDRSVPPPRPAPRPSEPAVT----------SRARRPDAPPQSARPR----------------APVDDRGDPRGPAPP 2614
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  962 APMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKaeqaptvKRTETEKKPP----PIKDSKSLT 1037
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRP-------RRARRLGRAAqassPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1038 AEPQKAVLpTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTecknqvcnlcGFNPTPHLTEIQEwlclncQTQRAISG 1117
Cdd:PHA03247  2688 ARPTVGSL-TSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR----------QASPALPAAPAPP------AVPAGPAT 2750
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1118 QLGDIRKM-PPAPSGPKASPMPVPTESSSQKTAVPPQVklvkkqeQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQE 1196
Cdd:PHA03247  2751 PGGPARPArPPTTAGPPAPAPPAAPAAGPPRRLTRPAV-------ASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1197 ESKLEKDKASALQEKKPLPEEkkliPEEEKIRSE---------EKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKS 1267
Cdd:PHA03247  2824 PAGPLPPPTSAQPTAPPPPPG----PPPPSLPLGgsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1268 QVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLPSGTPQSLPKEDDKTTKTIKEQPQPPCTAK--------PDQVEPGKEKT 1339
Cdd:PHA03247  2900 LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvPGRVAVPRFRV 2979
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1340 EKEDDKSDTSSSQQPkSPQGLSDTGYSSDGISSSLGEIPSLIPTDEKDIL 1389
Cdd:PHA03247  2980 PQPAPSREAPASSTP-PLTGHSLSRVSSWASSLALHEETDPPPVSLKQTL 3028
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
331-591 2.73e-06

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 53.39  E-value: 2.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  331 PPAQPSGLTKPLAQQPGTVKPPVQppgttKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQ--PGPA 408
Cdd:cd22540     39 PPAVEAAVTPPAPPQPTPRKLVPI-----KPAPLPLGPGKNSIGFLSAKGNIIQLQGSQLSSSAPGGQQVFAIQnpTMII 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  409 KPPTQQVGTPKPLAQQPGLQspakapgptkTPVQQPGPGKIpaqQAGPGKTSAQ----QTGPTKPPSQLPGPAKPPPQQP 484
Cdd:cd22540    114 KGSQTRSSTNQQYQISPQIQ----------AAGQINNSGQI---QIIPGTNQAIitpvQVLQQPQQAHKPVPIKPAPLQT 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  485 GPA-KPPPQQPGSA--KPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:cd22540    181 SNTnSASLQVPGNVikLQSGGNVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAEQVETVLIET 260
                          250       260       270
                   ....*....|....*....|....*....|
gi 2462613745  562 GSGKPLQPPTvSPSAKQPPSQGLPKTICPL 591
Cdd:cd22540    261 TADNIIQAGN-NLLIVQSPGTGQPAVLQQV 289
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
461-552 1.23e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 51.16  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  461 AQQTGPTKPPsqlpgPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPS 540
Cdd:NF041121    12 AAQMGRAAAP-----PSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPG 86
                           90
                   ....*....|....
gi 2462613745  541 AQQP--SPAKPSAQ 552
Cdd:NF041121    87 AALPvrVPAPPALP 100
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
947-1027 2.00e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 51.04  E-value: 2.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  947 STAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKsipvKKETKAPAAEKLEPKAEQAPTVKRTETEKK 1026
Cdd:PRK12270    43 APTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAA----AAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                   .
gi 2462613745 1027 P 1027
Cdd:PRK12270   119 P 119
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
443-566 2.04e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 50.96  E-value: 2.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGpaKPPPQQPGPAKPPPQQPGSAKPP--PQQPGSTKPPPQQPGPAKPS 520
Cdd:TIGR01628  379 QPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNG--QPLGWPRMSMMPTPMGPGGPLRPngLAPMNAVRAPSRNAQNAAQK 456
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKP 566
Cdd:TIGR01628  457 PPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATP 502
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
339-586 6.14e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 6.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  339 TKPLAQQPGTVKP--PVQPPGTTKPPAQPLGPaKPPAQQTGSEKpssEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVG 416
Cdd:NF033839   157 TKPETPQPENPEHqkPTTPAPDTKPSPQPEGK-KPSVPDINQEK---EKAKLAVATYMSKILDDIQKHHLQKEKHRQIVA 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  417 TPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQ-QPGPAKPPPQQPG 495
Cdd:NF033839   233 LIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGmQPSPQPEKKEVKP 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  496 SAKPPPQQpgsTKPPPQQPGP-AKPSPQQPGSTKPPsqQPGSAKPSAqQPSPAKPSAQQSTKPVSQTGSGKPlQPPTVSP 574
Cdd:NF033839   313 EPETPKPE---VKPQLEKPKPeVKPQPEKPKPEVKP--QLETPKPEV-KPQPEKPKPEVKPQPEKPKPEVKP-QPETPKP 385
                          250
                   ....*....|..
gi 2462613745  575 SAKQPPSQGLPK 586
Cdd:NF033839   386 EVKPQPEKPKPE 397
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
32-411 1.01e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.83  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   32 PSHTAIPAGMEADLSQLSEEERRQIAAVMSRAQGLPKGSVPPAAAESPSMHRKQELDSSHPPKQSGRPPDPGRPAQPGLS 111
Cdd:PRK07764   427 AAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  112 ksRTTDTFRSE-----QKLPGRSPSTISLKESKSR------TDLKEEHKSSMMPGFLSEVNALSAVSSVVnkfnpfdlis 180
Cdd:PRK07764   507 --DDAATLRERwpeilAAVPKRSRKTWAILLPEATvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTAL---------- 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  181 dseasQEETTKKQKVVQKEQGKPEGiikPPLQQQPPKPIPKQQGPGRDPLQQDGTPKSISSQQPEKikSQPPGTGKPIQG 260
Cdd:PRK07764   575 -----AEELGGDWQVEAVVGPAPGA---AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA--AAAPAEASAAPA 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  261 PTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPgheKSQPGPAKPPAQPSGLTK 340
Cdd:PRK07764   645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP---APAATPPAGQADDPAAQP 721
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  341 PLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQqtgsekPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP 411
Cdd:PRK07764   722 PQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP------AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK10263 PRK10263
DNA translocase FtsK; Provisional
3461-3966 1.54e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.16  E-value: 1.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3461 RRRRTKKSVDTSVQTDDEDQDEW---------DMPTRSRRKARVGKY--------GDSMTEADKTKPLSKVSSIAVQTVA 3523
Cdd:PRK10263   258 MGRQTDAALFSGKRMDDDEEITYtargvaadpDDVLFSGNRATQPEYdeydpllnGAPITEPVAVAAAATTATQSWAAPV 337
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3524 EISVQTEPVGTIRTPSIRARVDAKVEIIKHISAPEktykggslgCQTEADSDTQSPQYLSATSPPKDK-KRPTPLEIGYS 3602
Cdd:PRK10263   338 EPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPV---------IAPAPEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYY 408
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3603 SHLRADSTVQLAPSPPKSPKVLYSPISPLSPGKALESAFVPYEKPLPDDisPQKVLHPDMAKVPPASPKTAKMMQRSMSD 3682
Cdd:PRK10263   409 APAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA--PQSTYQTEQTYQQPAAQEPLYQQPQPVEQ 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3683 PKPLSPT--ADES--SRAPFQYTEGYTTKGSQTMTSSGA-----QKKVKRTLPN------------PPPEEIST------ 3735
Cdd:PRK10263   487 QPVVEPEpvVEETkpARPPLYYFEEVEEKRAREREQLAAwyqpiPEPVKEPEPIksslkapsvaavPPVEAAAAvsplas 566
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3736 ----GTQSTFSTMGTVSRRRICRTNTMARAKILQDIDRELDLVERESAKLRKKQAELD--------EEEKEIDAKLRYLE 3803
Cdd:PRK10263   567 gvkkATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGiklpsqraAEEKAREAQRNQYD 646
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3804 MGINRRK---EALLKEREKRERAYLQ-------------GVAEDRDYMSDSEVSSTRPTRIESQHGIERPRTAPQTEFSQ 3867
Cdd:PRK10263   647 SGDQYNDdeiDAMQQDELARQFAQTQqqrygeqyqhdvpVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDD 726
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3868 F-----------------IPPQTQTESQlvPPTSPYTQYQYSSPALPTQAPTSYTQ-QSHFEQQTLYHQQVSPYQTQPTF 3929
Cdd:PRK10263   727 FefspmkallddgpheplFTPIVEPVQQ--PQQPVAPQQQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVAPQPQY 804
                          570       580       590
                   ....*....|....*....|....*....|....*..
gi 2462613745 3930 QavatmsftpQVQPTPTPQPSYQLPSQMMVIQQKPRQ 3966
Cdd:PRK10263   805 Q---------QPQQPVAPQPQYQQPQQPVAPQPQYQQ 832
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
449-524 1.67e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 47.69  E-value: 1.67e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  449 IPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQP 524
Cdd:NF041121    15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALP 90
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
436-527 1.80e-04

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 47.20  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  436 PTKTPVQQPGPGKIPaqqagPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPqqpgSAKPPPQQPGSTKPPPQQPG 515
Cdd:NF040983    86 PNKVPPPPPPPPPPP-----PPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPP----TTTPPTRTTPSTTTPTPSMH 156
                           90
                   ....*....|....
gi 2462613745  516 PAKPS--PQQPGST 527
Cdd:NF040983   157 PIQPTqlPSIPNAT 170
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
716-1072 2.68e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.07  E-value: 2.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  716 PSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPK--------MVKPTTDLVSSSSATTKPDIPSSKVQSQA 787
Cdd:NF033839   159 PETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKaklavatyMSKILDDIQKHHLQKEKHRQIVALIKELD 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  788 EEKTTPPLKTDSAKPSQSFPPTGEKVSPfDSKAIPRPASDSKIISHPGPsseskgqkqvdPVQKKEEPKKAQTKMSPKPD 867
Cdd:NF033839   239 ELKKQALSEIDNVNTKVEIENTVHKIFA-DMDAVVTKFKKGLTQDTPKE-----------PGNKKPSAPKPGMQPSPQPE 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  868 AKPmPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIfsqasnlis 947
Cdd:NF033839   307 KKE-VKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEV--------- 376
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  948 tagQPGPHSQSGPGAPMKQAPAPSQPPTsqgPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKP 1027
Cdd:NF033839   377 ---KPQPETPKPEVKPQPEKPKPEVKPQ---PEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK 450
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745 1028 PPIKdsksltAEPQKAvlptKLEKSPKPESTCPLCKTELNIGSKD 1072
Cdd:NF033839   451 PEVK------PQPETP----KPEVKPQPEKPKPEVKPQPEKPKPD 485
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
651-1047 1.06e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  651 APVPSSPQPKLKTAPVTTTsavsKSSPQPQQTSPKK-DAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQ----PPE 725
Cdd:NF033839   161 TPQPENPEHQKPTTPAPDT----KPSPQPEGKKPSVpDINQEKEKAKLAVATYMSKILDDIQKHHLQKEKHRQivalIKE 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  726 ADSLSKPAPPKEPSV-----PSEQDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTtPPLKTDSA 800
Cdd:NF033839   237 LDELKKQALSEIDNVntkveIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEK-KEVKPEPE 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  801 KPSQSFPPTGEKVSPfdsKAIPRPAsdskiishpGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDaKPMPKGSPTPPG 880
Cdd:NF033839   316 TPKPEVKPQLEKPKP---EVKPQPE---------KPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPE-KPKPEVKPQPET 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  881 PRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAP--KSQPTTPQETVtgklfgfgasifsqasnlistagQPGPHSQS 958
Cdd:NF033839   383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKPKPEV-----------------------KPQPEKPK 439
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  959 GPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKDSKSLTA 1038
Cdd:NF033839   440 PEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKAT 519

                   ....*....
gi 2462613745 1039 EPQKAVLPT 1047
Cdd:NF033839   520 NKPKKSLPS 528
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
660-1055 1.06e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  660 KLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKapepkkppplvkqptlhgSPSAKAKQPPEADSLSKPAPPKePS 739
Cdd:NF033839   145 KDSSSSSSSGSSTKPETPQPENPEHQKPTTPAPDTKP------------------SPQPEGKKPSVPDINQEKEKAK-LA 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  740 VPSEQDKapVADDKPKQPKMVKPTTDLVSsssatTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKVSPFDSK 819
Cdd:NF033839   206 VATYMSK--ILDDIQKHHLQKEKHRQIVA-----LIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKG 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  820 AIPRPAS--DSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKpdaKPMPKGSPTPPGPRPTAGQTVPTPQQSPK 897
Cdd:NF033839   279 LTQDTPKepGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLE---KPKPEVKPQPEKPKPEVKPQLETPKPEVK 355
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  898 PQEQSRRFSLNLGSITDAP--KSQPTTPQETVTGKLFGFGASIfsqasnlistagQPGPHSQSGPGAPMKQAPAPSQPPT 975
Cdd:NF033839   356 PQPEKPKPEVKPQPEKPKPevKPQPETPKPEVKPQPEKPKPEV------------KPQPEKPKPEVKPQPEKPKPEVKPQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  976 SQGPPKSTGQAPPAPAKSIPVKKETKAPA--AEKLEPKAEQAPTVKRTETEKKPPPIK---DSKSLTAEPQKAVLPTKLE 1050
Cdd:NF033839   424 PEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPETPKPEVKPQPEKPKPEVKPQPEKpkpDNSKPQADDKKPSTPNNLS 503

                   ....*
gi 2462613745 1051 KSPKP 1055
Cdd:NF033839   504 KDKQP 508
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1154-1558 1.52e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.62  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1154 VKLVKKQeQEVKTE---AEKVILEKVKETLSMEKippmvttdQKQEESKLEKDKASALQEKKPLPEEKKLIPEEEKiRSE 1230
Cdd:NF033838    87 VALNKKL-SDIKTEylyELNVLKEKSEAELTSKT--------KKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEK-KAK 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1231 EKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEEKLEGRV--APKTVQEGKQPQTKMEGLPsgTPQSLPK 1308
Cdd:NF033838   157 DQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRDEEKIkqAKAKVESKKAEATRLEKIK--TDREKAE 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1309 EDDKTTKTIKEQpqppctakpDQVEPGKEKTEKEDDKSDT--SSSQQPKSPQGLSDTGYSSDgisSSLGE----IPSLIP 1382
Cdd:NF033838   235 EEAKRRADAKLK---------EAVEKNVATSEQDKPKRRAkrGVLGEPATPDKKENDAKSSD---SSVGEetlpSPSLKP 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1383 tdEKDILKGLKK-DSFSQESSPSSPSDLAKLESTVLSILEAQastLADEKSEKKTQPHEVSPE---QPKDQEKTQSLSET 1458
Cdd:NF033838   303 --EKKVAEAEKKvEEAKKKAKDQKEEDRRNYPTNTYKTLELE---IAESDVKVKEAELELVKEeakEPRNEEKIKQAKAK 377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1459 LEitiseeeIKESQEERKDTFKKDSQQdipSSKDHKEKSEFVDDIttrrepydSVEESSESENSPVPQRKRRTSVGSSSS 1538
Cdd:NF033838   378 VE-------SKKAEATRLEKIKTDRKK---AEEEAKRKAAEEDKV--------KEKPAEQPQPAPAPQPEKPAPKPEKPA 439
                          410       420
                   ....*....|....*....|
gi 2462613745 1539 DEYKQEDSQGSGEEEDFIRK 1558
Cdd:NF033838   440 EQPKAEKPADQQAEEDYARR 459
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1127-1363 2.60e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 2.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1127 PAPSGPKASPMPVPTESSSQKTAVPPQVKLVKKQEQEVKTEAEKVILEKVKETlsmEKIPPMVTTDQKQEESKLEKDKAS 1206
Cdd:NF033839   284 PKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQP---EKPKPEVKPQLETPKPEVKPQPEK 360
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1207 ALQEKKPLPEEKK-LIPEEEKIRSEEKKPLLEEKKP--TPEDKKLLPEAKtsaPEEQKHdllKSQVQIAEEKLEGRVAPK 1283
Cdd:NF033839   361 PKPEVKPQPEKPKpEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVK---PQPEKP---KPEVKPQPEKPKPEVKPQ 434
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1284 TVQEGKQPQTKMEG-LPSGTPQslPKEDDKTTKTIKEQPQPPCTAKPDQVEPGKEKTEKEDDK-SDTSSSQQPKSPQGLS 1361
Cdd:NF033839   435 PEKPKPEVKPQPEKpKPEVKPQ--PETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKpSTPNNLSKDKQPSNQA 512

                   ..
gi 2462613745 1362 DT 1363
Cdd:NF033839   513 ST 514
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1134-1261 3.51e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 42.91  E-value: 3.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1134 ASPMPVPTESSSQKTAVPPQVKLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASAL-QEKK 1212
Cdd:TIGR02794   25 HSVKPEPGGGAEIIQAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAaEKAA 104
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 1213 PLPEEKKLIPEEEKIRSEEKKP-LLEEKKPTPE---DKKLLPEAKTSAPEEQK 1261
Cdd:TIGR02794  105 KQAEQAAKQAEEKQKQAEEAKAkQAAEAKAKAEaeaERKAKEEAAKQAEEEAK 157
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
3759-3822 3.68e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.94  E-value: 3.68e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 3759 ARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDaklrylEMGINRRKEALLKEREKRER 3822
Cdd:pfam07946  265 TREEEIEKIKKAAEEERAEEAQEKKEEAKKKEREEKLA------KLSPEEQRKYEEKERKKEQR 322
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
382-517 5.97e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 40.54  E-value: 5.97e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   382 SSEQPGPKALAQPPGVGKTPAQQPGpakPPTQqvgtpkPLAQQPGLQSpakapgptKTPVQQPGPGKIPAQQAGPGKTSA 461
Cdd:smart00818   43 SQQHPPTHTLQPHHHIPVLPAQQPV---VPQQ------PLMPVPGQHS--------MTPTQHHQPNLPQPAQQPFQPQPL 105
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745   462 QQTGPTKPPSQLPGPAKPPPQQPGPAKPP--PQQPgsakPPPQQPGStkppPQQPGPA 517
Cdd:smart00818  106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPmfPMQP----LPPLLPDL----PLEAWPA 155
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1142-1309 7.07e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 42.33  E-value: 7.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1142 ESSSQKTAVPPQVKLVKKQEQEVKTEAEkvILEKVKETLSMEKIPPMVTT--------------DQKQEESKLEKDKASA 1207
Cdd:COG3064      3 EALEEKAAEAAAQERLEQAEAEKRAAAE--AEQKAKEEAEEERLAELEAKrqaeeeareakaeaEQRAAELAAEAAKKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1208 LQEKKPLPEEKKLIPEEEKIRSEEKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEEKLEGRVAPKTVQE 1287
Cdd:COG3064     81 EAEKAAAEAEKKAAAEKAKAAKEAEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAA 160
                          170       180
                   ....*....|....*....|..
gi 2462613745 1288 GKQPQTKMEGLPSGTPQSLPKE 1309
Cdd:COG3064    161 AAAAAAAAAAAARAAAGAAAAL 182
 
Name Accession Description Interval E-value
PDZ_RIM-like cd06714
PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ ...
4493-4586 8.48e-48

PDZ domain of Rab3-interacting molecule 1 (RIM), RIM2, piccolo and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RIM, RIM2, piccolo and related domains. RIM proteins and Gallus gallus protein piccolo (also called aczonin) are involved in neurotransmitter release at presynaptic active zones, the site of vesicle fusion. A protein complex containing RIM proteins positions synaptic vesicles containing synaptotagmin at the active zone. RIM proteins simultaneously activate docking and priming of synaptic vesicles and recruit Ca2+-channels to active zones, thereby connecting primed synaptic vesicles to Ca2+-channels. RIM binding to vesicular Rab proteins (Rab3 and Rab27 isoforms) mediates vesicle docking; RIM binding to Munc13 activates vesicle priming; RIM binding to the Ca2+-channel, both directly and indirectly via RIM-BP, recruits the Ca2+-channels. The RIM PDZ domain interacts with the C-termini of N- and P/Q-type voltage-gated Ca2+-channels. RIM1, RIM2 and piccolo also participate in regulated exocytosis through binding cAMP-GEFII (cAMP-binding protein-guanidine nucleotide exchange factor II). The piccolo PDZ domain binds cAMP-GEFII. RIM2 also plays a role in dendrite formation by melanocytes. Caenorhabditis elegans RIM (also known as unc-10) may be involved in the regulation of defecation and daumone response. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467198 [Multi-domain]  Cd Length: 95  Bit Score: 166.96  E-value: 8.48e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4493 FPHARIKITRDSKDHTVSGNGLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06714      2 FLIGRIILQRDPKDGSVSGNGLGLKVVGGKMTE--SGRLGAYVTKVKPGSVADTVGHLREGDEVLEWNGISLQGKTFEEV 79
                           90
                   ....*....|....
gi 2462613745 4573 QSIISQQSGEAEIC 4586
Cdd:cd06714     80 QDIISQSKGEVELV 93
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
587-648 1.61e-42

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 150.57  E-value: 1.61e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGG 648
Cdd:cd15774      1 TICPLCKTTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
1057-1120 8.33e-42

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 148.68  E-value: 8.33e-42
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 1057 STCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLG 1120
Cdd:cd15776      1 LLCPLCKTELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQLG 64
FYVE2_BSN_PCLO cd15772
FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein ...
1057-1120 2.37e-41

FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the second FYVE-related domain.


Pssm-ID: 277311 [Multi-domain]  Cd Length: 64  Bit Score: 147.48  E-value: 2.37e-41
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 1057 STCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLG 1120
Cdd:cd15772      1 VTCPLCKTELNVGSKEPPNYNTCTQCHTQVCNLCGFNPTPHLVEKKEWLCLNCQTQRLMSGGLG 64
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
1057-1115 1.55e-39

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 142.17  E-value: 1.55e-39
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1057 STCPLCK-TELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAI 1115
Cdd:pfam05715    1 TLCPLCKtTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
FYVE1_BSN_PCLO cd15771
FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein ...
587-648 1.21e-38

FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the first FYVE-related domain.


Pssm-ID: 277310 [Multi-domain]  Cd Length: 61  Bit Score: 139.37  E-value: 1.21e-38
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  587 TICPLCNTTELLLHVPeKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGG 648
Cdd:cd15771      1 TLCPLCNTTELTLHVP-KPNFNTCTQCHTTVCNQCGFNPNPHLTEVKEWLCLNCQMQRALGM 61
zf-piccolo pfam05715
Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. ...
587-646 2.79e-37

Piccolo Zn-finger; This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands.


Pssm-ID: 461722 [Multi-domain]  Cd Length: 60  Bit Score: 135.62  E-value: 2.79e-37
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRAL 646
Cdd:pfam05715    1 TLCPLCKTTELNVGSKEPPNYNTCTECKSQVCNLCGFNPTPHLTEKKEWLCLNCQTQRAL 60
FYVE2_BSN cd15775
FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger ...
1056-1120 8.42e-36

FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277314 [Multi-domain]  Cd Length: 65  Bit Score: 131.58  E-value: 8.42e-36
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745 1056 ESTCPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLG 1120
Cdd:cd15775      1 RVTCPLCKTELNVGSTEPPNYNTCTSCRTQVCNLCGFNPTPHLVEKNEWLCLNCQTQRLLEGSLG 65
FYVE1_BSN cd15773
FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger ...
587-647 1.82e-30

FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277312 [Multi-domain]  Cd Length: 64  Bit Score: 116.34  E-value: 1.82e-30
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  587 TICPLCNTTELLlHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALG 647
Cdd:cd15773      4 TLCPICNTTELT-SFPSQPNFNTCTQCHNKVCNQCGFNPNPHLTEVKEWLCLNCQMQRALG 63
FYVE_BSN_PCLO cd15751
FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon ...
1057-1114 1.17e-28

FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277290 [Multi-domain]  Cd Length: 62  Bit Score: 111.00  E-value: 1.17e-28
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 1057 STCPLCK-TELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRA 1114
Cdd:cd15751      1 SACPLCGtSELPLGSKSPPNYNTCTDCKNRVCNQCGFNSTPPVTKVKEWLCLNCQKKRA 59
FYVE1_PCLO cd15774
FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
1059-1117 1.29e-27

FYVE-related domain 1 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277313 [Multi-domain]  Cd Length: 62  Bit Score: 108.19  E-value: 1.29e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1059 CPLCK-TELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISG 1117
Cdd:cd15774      3 CPLCKtTELLLHTPEKANYNTCTQCQTTVCSLCGFNPNPHITEKKEWLCLNCQMQRALGG 62
FYVE1_BSN_PCLO cd15771
FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein ...
1059-1117 1.89e-27

FYVE-related domain 1 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the first FYVE-related domain.


Pssm-ID: 277310 [Multi-domain]  Cd Length: 61  Bit Score: 107.78  E-value: 1.89e-27
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 1059 CPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISG 1117
Cdd:cd15771      3 CPLCNTTELTLHVPKPNFNTCTQCHTTVCNQCGFNPNPHLTEVKEWLCLNCQMQRALGM 61
FYVE1_BSN cd15773
FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger ...
1055-1115 1.35e-26

FYVE-related domain 1 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the first FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277312 [Multi-domain]  Cd Length: 64  Bit Score: 105.16  E-value: 1.35e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 1055 PEST-CPLCKTELNIGSKDPPNFNTCTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAI 1115
Cdd:cd15773      1 PSSTlCPICNTTELTSFPSQPNFNTCTQCHNKVCNQCGFNPNPHLTEVKEWLCLNCQMQRAL 62
FYVE_BSN_PCLO cd15751
FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon ...
587-647 5.43e-26

FYVE-related domain found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277290 [Multi-domain]  Cd Length: 62  Bit Score: 103.30  E-value: 5.43e-26
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  587 TICPLCNTTELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALG 647
Cdd:cd15751      1 SACPLCGTSELPLGSKSPPNYNTCTDCKNRVCNQCGFNSTPPVTKVKEWLCLNCQKKRALG 61
PHA03247 PHA03247
large tegument protein UL36; Provisional
309-905 1.11e-25

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 118.50  E-value: 1.11e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEKSQPGPAKPPAQPSG------LTKPLA-QQPGTVKPPV----QPPGTTKPPAQPLGPAKPPAQQTG 377
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtsrARRPDApPQSARPRAPVddrgDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  378 SEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP---TQQVGTPKPLAQQPGLQSPAKAP--GPTKTPVQQPGPGKIPAq 452
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPtvGSLTSLADPPPPPPTPE- 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  453 qagPGKTSAQQTGPTKPPSQLPGPAKP-PPQQPGPAKPP--PQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKP 529
Cdd:PHA03247  2710 ---PAPHALVSATPLPPGPAAARQASPaLPAAPAPPAVPagPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  530 PSQQPGSAKPSAQQPS-----PAKPSAQQSTKPVSQTGSGkPLQPPTVSPSAKQPPSQGLPKTICPLCNTTelllhvpek 604
Cdd:PHA03247  2787 AVASLSESRESLPSPWdpadpPAAVLAPAAALPPAASPAG-PLPPPTSAQPTAPPPPPGPPPPSLPLGGSV--------- 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  605 anfntctecqttvcslcgfnpnphltevkewlclncqmkrALGGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQtSP 684
Cdd:PHA03247  2857 ----------------------------------------APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR-ST 2895
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  685 KKDAAPKQDLSKAPEPKKPPPLVKQPTLhgsPSAKAKQPPeadslskPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTT 764
Cdd:PHA03247  2896 ESFALPPDQPERPPQPQAPPPPQPQPQP---PPPPQPQPP-------PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG 2965
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  765 DLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTgekvSPFDSKAIPRPASDSKIISHPGPSSESKGQK 844
Cdd:PHA03247  2966 ALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS----LALHEETDPPPVSLKQTLWPPDDTEDSDADS 3041
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  845 QVDPVQKKEEpkkaQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQ----SRRF 905
Cdd:PHA03247  3042 LFDSDSERSD----LEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSAnaalSRRY 3102
PHA03247 PHA03247
large tegument protein UL36; Provisional
224-585 1.23e-25

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 118.12  E-value: 1.23e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  224 GPGRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTPqtdhaklPLQRDASRPQTKQADIVRGESVKPSLPSPSK 303
Cdd:PHA03247  2606 GDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP-------PPERPRDDPAPGRVSRPRRARRLGRAAQASS 2678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  304 PPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPsgltKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAkPPAQQTGSEKPSs 383
Cdd:PHA03247  2679 PPQRPRRRAARPTVGSLTSLADPPPPPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPA-PPAVPAGPATPG- 2752
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  384 eqpGPKALAQPPgvgkTPAQQPGPAkPPTQQVGTPKPLAQQPGLQSPAKA---------PGPTKTPVQQPGPGKIPAQQA 454
Cdd:PHA03247  2753 ---GPARPARPP----TTAGPPAPA-PPAAPAAGPPRRLTRPAVASLSESreslpspwdPADPPAAVLAPAAALPPAASP 2824
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  455 GPG---KTSAQQTGPTKPPSQLP------------------GPAKPPPQQPG------------PAKPPPQQPGSAKPPP 501
Cdd:PHA03247  2825 AGPlppPTSAQPTAPPPPPGPPPpslplggsvapggdvrrrPPSRSPAAKPAaparppvrrlarPAVSRSTESFALPPDQ 2904
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  502 QQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQP-P 580
Cdd:PHA03247  2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPaP 2984

                   ....*
gi 2462613745  581 SQGLP 585
Cdd:PHA03247  2985 SREAP 2989
FYVE2_BSN_PCLO cd15772
FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein ...
587-650 1.95e-25

FYVE-related domain 2 found in protein bassoon and piccolo; This family includes protein bassoon and piccolo. Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Both bassoon and piccolo contain two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. Their FYVE domain resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif. This model corresponds to the second FYVE-related domain.


Pssm-ID: 277311 [Multi-domain]  Cd Length: 64  Bit Score: 102.03  E-value: 1.95e-25
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  587 TICPLCNTtELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGGDL 650
Cdd:cd15772      1 VTCPLCKT-ELNVGSKEPPNYNTCTQCHTQVCNLCGFNPTPHLVEKKEWLCLNCQTQRLMSGGL 63
PHA03247 PHA03247
large tegument protein UL36; Provisional
225-585 7.73e-25

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 115.42  E-value: 7.73e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  225 PGRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQtpqtdhaklPLQRDASRPQTKQADivrgeSVKPSLPSPSKP 304
Cdd:PHA03247  2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRR---------RAARPTVGSLTSLAD-----PPPPPPTPEPAP 2712
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  305 PIQQPTPGKPPAQQPGHEKSQPGPAKP--PAQPSGltkplAQQPGTVKPPVQPPGTTKP--PAQPLGPAKPPAQQTGSEK 380
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALPAAPapPAVPAG-----PATPGGPARPARPPTTAGPpaPAPPAAPAAGPPRRLTRPA 2787
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  381 PSSEQPGPKALAQPPGVGKTPAQQPGP--AKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPG---PGKIPAQQAG 455
Cdd:PHA03247  2788 VASLSESRESLPSPWDPADPPAAVLAPaaALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGsvaPGGDVRRRPP 2867
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 PGKTSAQQTGPTKPPSQ-------------LPGPAKPPPQQPGPakPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQ 522
Cdd:PHA03247  2868 SRSPAAKPAAPARPPVRrlarpavsrstesFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAP 2945
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  523 QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPvsqtgsgkPLQPPTVSPSAKQPPSQGLP 585
Cdd:PHA03247  2946 TTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP--------QPAPSREAPASSTPPLTGHS 3000
FYVE2_BSN cd15775
FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger ...
586-651 1.45e-24

FYVE-related domain 2 found in protein bassoon; Protein bassoon, also termed zinc finger protein 231, is a core component of the presynaptic cytomatrix. It is a vertebrate-specific active zone scaffolding protein that plays a key role in structural organization and functional regulation of presynaptic release sites. Bassoon may modulate synaptic transmission efficiency by binding to presynaptic P/Q-type voltage-dependent calcium channel (VDCC) complexes and modify the channel function. As one of the most highly phosphorylated synaptic proteins, bassoon can interact with the small ubiquitous adaptor protein 14-3-3 in a phosphorylation-dependent manner, which modulates its anchoring to the presynaptic cytomatrix. Bassoon contains two N-terminal FYVE zinc fingers, a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277314 [Multi-domain]  Cd Length: 65  Bit Score: 99.61  E-value: 1.45e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  586 KTICPLCNTtELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGGDLA 651
Cdd:cd15775      1 RVTCPLCKT-ELNVGSTEPPNYNTCTSCRTQVCNLCGFNPTPHLVEKNEWLCLNCQTQRLLEGSLG 65
FYVE2_PCLO cd15776
FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a ...
589-650 3.38e-24

FYVE-related domain 2 found in protein piccolo; Protein piccolo, also termed aczonin, is a neuron-specific presynaptic active zone scaffolding protein that mainly interacts with a detergent-resistant cytoskeletal-like subcellular fraction and is involved in the organization of the interplay between neurotransmitter vesicles, the cytoskeleton, and the plasma membrane at synaptic active zones. It binds profilin, an actin-binding protein implicated in actin cytoskeletal dynamics. It also functions as a presynaptic low-affinity Ca2+ sensor and has been implicated in Ca2+ regulation of neurotransmitter release. Piccolo is a multi-domain protein containing two N-terminal FYVE zinc fingers, a polyproline tract, and a PDZ domain and two C-terminal C2 domains. This family corresponds to the second FYVE domain, which resembles a FYVE-related domain that is structurally similar to the canonical FYVE domains but lacks the three signature sequences: an N-terminal WxxD motif (x for any residue), the central basic R(R/K)HHCRxCG patch, and a C-terminal RVC motif.


Pssm-ID: 277315 [Multi-domain]  Cd Length: 64  Bit Score: 98.60  E-value: 3.38e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  589 CPLCNTtELLLHVPEKANFNTCTECQTTVCSLCGFNPNPHLTEVKEWLCLNCQMKRALGGDL 650
Cdd:cd15776      3 CPLCKT-ELNIGSKDPPNFNTCTECKKTVCNLCGFNPTPHLTEVKEWLCLNCQTQRAMSGQL 63
PHA03247 PHA03247
large tegument protein UL36; Provisional
311-1037 1.43e-23

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 111.57  E-value: 1.43e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  311 PGKPPAQQPGHEKSqPGPAKPPAQPSGLTKPLAQQPgtvkPPVQPPGTTKPPAQPLGPAKPPAQQT---GSEKPSSEQPG 387
Cdd:PHA03247  2475 PGAPVYRRPAEARF-PFAAGAAPDPGGGGPPDPDAP----PAPSRLAPAILPDEPVGEPVHPRMLTwirGLEELASDDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  388 PKAlaqPPGVGKTPAQQPGPAKPPTQqvgtPKPLAQQPGLQSPAKAPGptktpvqqpgpgkIPAQQAGPgktsAQQTGPT 467
Cdd:PHA03247  2550 DPP---PPLPPAAPPAAPDRSVPPPR----PAPRPSEPAVTSRARRPD-------------APPQSARP----RAPVDDR 2605
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  468 KPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAkPSPQQPGSTKPPSQQPGSAKPSAQQPSPA 547
Cdd:PHA03247  2606 GDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDD-PAPGRVSRPRRARRLGRAAQASSPPQRPR 2684
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  548 KPSAQQSTKPVsqTGSGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCTECQTTVcslcgfNPNP 627
Cdd:PHA03247  2685 RRAARPTVGSL--TSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPG------GPAR 2756
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  628 HLTevkewlclncqmKRALGGDLAPVP-----SSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPkqdlskapepkk 702
Cdd:PHA03247  2757 PAR------------PPTTAGPPAPAPpaapaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVL------------ 2812
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  703 ppPLVKQPTLHGSPSAKAKQPPEADSLSkPAPPKEPSVPSEQDKAPVAddkpkqpkmvkPTTDLVSSSSATTKPDIPSSK 782
Cdd:PHA03247  2813 --APAAALPPAASPAGPLPPPTSAQPTA-PPPPPGPPPPSLPLGGSVA-----------PGGDVRRRPPSRSPAAKPAAP 2878
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  783 VQSQAEEKTTPPLktdsAKPSQSFPptgekvSPFDSKAIPRPASdskiishPGPSSESKGQKQVDPVQKKEEPKKAQTKM 862
Cdd:PHA03247  2879 ARPPVRRLARPAV----SRSTESFA------LPPDQPERPPQPQ-------APPPPQPQPQPPPPPQPQPPPPPPPRPQP 2941
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  863 SPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLnlgsitDAPKSQPTTPQEtvtgklfgfgasifSQA 942
Cdd:PHA03247  2942 PLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR------EAPASSTPPLTG--------------HSL 3001
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  943 SNLISTAGQPGPHSQSGPG-APMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSI--PVKKETKAPAAEKLEPKAEQAPTVK 1019
Cdd:PHA03247  3002 SRVSSWASSLALHEETDPPpVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEAldPLPPEPHDPFAHEPDPATPEAGARE 3081
                          730
                   ....*....|....*...
gi 2462613745 1020 RTETEKKPPPIKDSKSLT 1037
Cdd:PHA03247  3082 SPSSQFGPPPLSANAALS 3099
PHA03378 PHA03378
EBNA-3B; Provisional
313-553 2.28e-23

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 110.16  E-value: 2.28e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  313 KPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKP-------PVQPPGTTkpPAQPLGPAKPPAQQTGSEKPSSEQ 385
Cdd:PHA03378   552 EPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPsyaqtpwPVPHPSQT--PEPPTTQSHIPETSAPRQWPMPLR 629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALA----------------QPPGVGKTPAQ----QPG--PAKPPTQQVGTPKPLAQQPG-LQSPAKAPGPTKTPVQ 442
Cdd:PHA03378   630 PIPMRPLrmqpitfnvlvfptphQPPQVEITPYKptwtQIGhiPYQPSPTGANTMLPIQWAPGtMQPPPRAPTPMRPPAA 709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGstKPPPQQPGPAKPSPQ 522
Cdd:PHA03378   710 PPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG--APTPQPPPQAPPAPQ 787
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2462613745  523 QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQ 553
Cdd:PHA03378   788 QRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQ 818
PHA03247 PHA03247
large tegument protein UL36; Provisional
233-747 3.15e-22

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 106.95  E-value: 3.15e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  233 DGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTPqtdhaklPLQRDASRPQTKQADIVRGESVKPSLPSPSKppiQQPTPG 312
Cdd:PHA03247  2590 DAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-------APDPPPPSPSPAANEPDPHPPPTVPPPERPR---DDPAPG 2659
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  313 KppAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQP-GTVKPPVQPPgttkPPAQPLGPAkPPAQQTGSEKPSSEQPGPKAL 391
Cdd:PHA03247  2660 R--VSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPP----PPPPTPEPA-PHALVSATPLPPGPAAARQAS 2732
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  392 AQPPGVGKTPAQQPGPAKPptqqvGTPKPLAQQPGLQSPAkAPGPTKTPVQQPgpgkiPAQQAGPGKTSAQQTGPTKPPS 471
Cdd:PHA03247  2733 PALPAAPAPPAVPAGPATP-----GGPARPARPPTTAGPP-APAPPAAPAAGP-----PRRLTRPAVASLSESRESLPSP 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  472 QLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTkPPPQQPGPAKPS--------PQQPGSTKPPSQQPgSAKPSAQQ 543
Cdd:PHA03247  2802 WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPT-APPPPPGPPPPSlplggsvaPGGDVRRRPPSRSP-AAKPAAPA 2879
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  544 PSP----AKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTTELLLHVPEKANFNTCTECQTTvcs 619
Cdd:PHA03247  2880 RPPvrrlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS--- 2956
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  620 lcGFNPNPHLTEVKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKdAAPKQDLSKAPE 699
Cdd:PHA03247  2957 --GAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP-VSLKQTLWPPDD 3033
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*...
gi 2462613745  700 PKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKA 747
Cdd:PHA03247  3034 TEDSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPEAGARE 3081
PHA03378 PHA03378
EBNA-3B; Provisional
310-579 6.99e-22

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 105.15  E-value: 6.99e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  310 TPGKPPAQQPGHEKSQPGPAKPPAQPSGLtKPLAQQPGTVKPPVQPPgTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPK 389
Cdd:PHA03378   605 TPEPPTTQSHIPETSAPRQWPMPLRPIPM-RPLRMQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPYQPSPTGAN 682
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  390 ALAQP---PGVGKTPAQQPGPAKPPTqqvGTPKPLaqqpglQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGP 466
Cdd:PHA03378   683 TMLPIqwaPGTMQPPPRAPTPMRPPA---APPGRA------QRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGR 753
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPSQLPGPAKPPPQQPGpaKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPG----------- 535
Cdd:PHA03378   754 ARPPAAAPGRARPPAAAPG--APTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPtkqilrqlltg 831
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 2462613745  536 ---SAKPSAQQPSPAKPSAQQSTKPVSQTGSG-KPLQPPTVSPSAKQP 579
Cdd:PHA03378   832 gvkRGRPSLKKPAALERQAAAGPTPSPGSGTSdKIVQAPVFYPPVLQP 879
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
226-575 9.73e-19

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 94.83  E-value: 9.73e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  226 GRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPP 305
Cdd:pfam03154  169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPP 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  306 IQQPTPGKPPAQQPGHEKSQP---GPAKPPAQPSGLTKPLAQQPGtvkpPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPS 382
Cdd:pfam03154  249 LQPMTQPPPPSQVSPQPLPQPslhGQMPPMPHSLQTGPSHMQHPV----PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQR 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  383 SEQPGPKALAQPPgvgKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKtpVQQPGPGKIPAQQAGPGKTSAQ 462
Cdd:pfam03154  325 IHTPPSQSQLQSQ---QPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPH--LSGPSPFQMNSNLPPPPALKPL 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTK-PPSQLPGPAKPPPQ----QPGPAKPP-----PQQPGSAKPPPQQPGSTKPPPQQP------GPAKPSPQQPGS 526
Cdd:pfam03154  400 SSLSTHhPPSAHPPPLQLMPQsqqlPPPPAQPPvltqsQSLPPPAASHPPTSGLHQVPSQSPfpqhpfVPGGPPPITPPS 479
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  527 TKPPS--------QQPGSAKPSAQQPSPAKPSA-----QQSTKPVSQTGSGKPLQPPTVSPS 575
Cdd:pfam03154  480 GPPTStssampgiQPPSSASVSSSGPVPAAVSCplppvQIKEEALDEAEEPESPPPPPRSPS 541
PHA03247 PHA03247
large tegument protein UL36; Provisional
446-1046 1.18e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 95.01  E-value: 1.18e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKiPAQQAGPGKTSAQQTGPTKPPSQLPGP---AKPPPQQPGPAKPPPQQPGSAKPPPQ-----------QPGSTKPPP 511
Cdd:PHA03247  2475 PGA-PVYRRPAEARFPFAAGAAPDPGGGGPPdpdAPPAPSRLAPAILPDEPVGEPVHPRMltwirgleelaSDDAGDPPP 2553
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  512 QQPgPAKPSPQQPGSTKPPSQQPGSAKPSAQQ--------PSPAKPSAqqstkPVSQTGSGKPLQPPTVSPSAKQPPSQG 583
Cdd:PHA03247  2554 PLP-PAAPPAAPDRSVPPPRPAPRPSEPAVTSrarrpdapPQSARPRA-----PVDDRGDPRGPAPPSPLPPDTHAPDPP 2627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  584 LPkticplcnttelllhvpekanfntctecqttvcslcgfNPNPHLTEvkewlclncqmkrALGGDLAPVPSSPQPKLKT 663
Cdd:PHA03247  2628 PP--------------------------------------SPSPAANE-------------PDPHPPPTVPPPERPRDDP 2656
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  664 APvtttSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPpplvkqptlhGSPSAKAKQPPEADSLSKPAPPKEPSVPSE 743
Cdd:PHA03247  2657 AP----GRVSRPRRARRLGRAAQASSPPQRPRRRAARPTV----------GSLTSLADPPPPPPTPEPAPHALVSATPLP 2722
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  744 QDKAPVADDKPKQPkmVKPTTDLVSSSSAT-TKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKVSP-FDSKAI 821
Cdd:PHA03247  2723 PGPAAARQASPALP--AAPAPPAVPAGPATpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSEsRESLPS 2800
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  822 PRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMP-KGSPTPPGP---RPTAGQTVPTPQQSPK 897
Cdd:PHA03247  2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlGGSVAPGGDvrrRPPSRSPAAKPAAPAR 2880
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  898 PqeQSRRFSLNLGSITDAPKSQPTTPQEtvtgklfgfgasifsqasNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQ 977
Cdd:PHA03247  2881 P--PVRRLARPAVSRSTESFALPPDQPE------------------RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  978 GPPKSTGQAPPAPAKSIPVkketKAPAAEKLEPKAEQAPtvkRTETEKKPPPIKDSKSLTAEPQKAVLP 1046
Cdd:PHA03247  2941 PPLAPTTDPAGAGEPSGAV----PQPWLGALVPGRVAVP---RFRVPQPAPSREAPASSTPPLTGHSLS 3002
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
226-585 4.03e-17

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 89.24  E-value: 4.03e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  226 GRDPLQQDGTPKSISSQQPEKikSQPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPP 305
Cdd:pfam03157  230 GQQPGQGQQPGQGQQGQQPGQ--PQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSGYYPTSQQQAGQ 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  306 IQQPTPGKPPAQQP----GHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKP 381
Cdd:pfam03157  308 LQQEQQLGQEQQDQqpgqGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTSQQQPQQGQQPEQGQ 387
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  382 SSEQPGPKALAQPPGVGKTPAQ-QPGPAKPPTQQVGTPKP-----------LAQQPG--LQSPAKAPGPTKTPVQ--QPG 445
Cdd:pfam03157  388 QGQQQGQGQQGQQPGQGQQPGQgQPGYYPTSPQQSGQGQPgyyptspqqsgQGQQPGqgQQPGQEQPGQGQQPGQgqQGQ 467
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGK-------TSAQQTGPTKPPSQL--PGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGP 516
Cdd:pfam03157  468 QPGQPEQGQQPGQgqpgyypTSPQQSGQGQQLGQWqqQGQGQPGYYPTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQP 547
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  517 AKPSPQQPGSTKPPSQQPGSAKPSAQqpsPAKPSAQQSTKPVSQTGSGKPLQPPT--------VSPSAKQPPSQGLP 585
Cdd:pfam03157  548 TQGQQGQQSGQGQQGQQPGQGQQGQQ---PGQGQQGQQPGQGQQPGQGQPGYYPTspqqsgqgQQPGQWQQPGQGQP 621
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
282-595 1.30e-16

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 87.90  E-value: 1.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  282 QTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLA--QQPGTVKPPVQPpgTT 359
Cdd:pfam03154  168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTliQQTPTLHPQRLP--SP 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  360 KPPAQPLGPAKPPAQQTGSEKPSSEQPGP-KALAQPPGVGKTPAQQPGPAKPptqqVGTPKPLAQQPGLQSPA-KAPGPT 437
Cdd:pfam03154  246 HPPLQPMTQPPPPSQVSPQPLPQPSLHGQmPPMPHSLQTGPSHMQHPVPPQP----FPLTPQSSQSQVPPGPSpAAPGQS 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  438 KTPVQQPgPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGP---------AKPPPQQPGSAKPPP------- 501
Cdd:pfam03154  322 QQRIHTP-PSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPqshkhpphlSGPSPFQMNSNLPPPpalkpls 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  502 ----QQPGSTKPPPQQPGPAK-----PSPQQPGSTKPPSQQPGSAK---PSAQQPSPAK-PSAQQSTKPVSQTGSGKPLQ 568
Cdd:pfam03154  401 slstHHPPSAHPPPLQLMPQSqqlppPPAQPPVLTQSQSLPPPAAShppTSGLHQVPSQsPFPQHPFVPGGPPPITPPSG 480
                          330       340       350
                   ....*....|....*....|....*....|
gi 2462613745  569 PPTVSPSAK---QPPSQGLPKTICPLCNTT 595
Cdd:pfam03154  481 PPTSTSSAMpgiQPPSSASVSSSGPVPAAV 510
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
307-559 2.00e-16

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 87.01  E-value: 2.00e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLA------QQPGTVkPPVQPP----GTT-KPPAQPlgpakPPAQQ 375
Cdd:pfam09770  105 QQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRtgyekyKEPEPI-PDLQVDaslwGVApKKAAAP-----APAPQ 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  376 TGSEKPSSEQPGPKALAqppgVGKTPAQQPGPAKPPTQQVgtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAG 455
Cdd:pfam09770  179 PAAQPASLPAPSRKMMS----LEEVEAAMRAQAKKPAQQP--APAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQ 252
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 PGktsaQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKP---PPQQPGPAKPSPQQPgstkPPSQ 532
Cdd:pfam09770  253 PQ----QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQIlqnPNRLSAARVGYPQNP----QPGV 324
                          250       260
                   ....*....|....*....|....*..
gi 2462613745  533 QPGSAKPSAQQPSPAKPSAQQSTKPVS 559
Cdd:pfam09770  325 QPAPAHQAHRQQGSFGRQAPIITHPQQ 351
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
187-586 2.60e-16

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 86.54  E-value: 2.60e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  187 EETTKKQKVVQKEQGKPEGIIKPPLQQQPPKPIPKQQGPGRDPlQQDGTPKSISSQQPEKikSQPPGTGKPIQGPTQTPQ 266
Cdd:pfam03157  177 QQPGQGQQLRQGQQGQQSGQGQPGYYPTSSQQPGQLQQTGQGQ-QGQQPERGQQGQQPGQ--GQQPGQGQQGQQPGQPQQ 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  267 TDHAKLPLQRDASRPQTKQADIVRGEsvkpslPSPSKPPIQQPTPGkppaqQPGHEKSQPGPAKPPAQPSGLTKPLAQQP 346
Cdd:pfam03157  254 LGQGQQGYYPISPQQPRQWQQSGQGQ------QGYYPTSLQQPGQG-----QSGYYPTSQQQAGQLQQEQQLGQEQQDQQ 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  347 GTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPG 426
Cdd:pfam03157  323 PGQGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTSQQQPQQGQQPEQGQQGQQQGQGQQGQQPG 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  427 L-QSPAKA-PGPTKTPVQQPGPGkipaqQAGPGKTSAQQTGPTKPPSQLPGPAKPPP---QQPGPAKP--PPQQPGSAKP 499
Cdd:pfam03157  403 QgQQPGQGqPGYYPTSPQQSGQG-----QPGYYPTSPQQSGQGQQPGQGQQPGQEQPgqgQQPGQGQQgqQPGQPEQGQQ 477
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  500 PPQ-QPGSTKPPPQQPGPAKPSPQ-------QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPT 571
Cdd:pfam03157  478 PGQgQPGYYPTSPQQSGQGQQLGQwqqqgqgQPGYYPTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSG 557
                          410
                   ....*....|....*
gi 2462613745  572 VSPSAKQPPSQGLPK 586
Cdd:pfam03157  558 QGQQGQQPGQGQQGQ 572
PHA03378 PHA03378
EBNA-3B; Provisional
307-581 6.38e-16

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 85.50  E-value: 6.38e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPA---------KPPAQ----PSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPA 373
Cdd:PHA03378   461 PLEGPTGPLSVQAPLEPWQPLPHpqvtpvilhQPPAQgvqaHGSMLDLLEKDDEDMEQRVMATLLPPSPPQPRAGRRAPC 540
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  374 QQTG-----SEKPSSEQPGPKALAQPPGVGKTP-------------------AQQPGPAKPPTQQVGTPKPLAQQPGLQS 429
Cdd:PHA03378   541 VYTEdldieSDEPASTEPVHDQLLPAPGLGPLQiqpltspttsqlassapsyAQTPWPVPHPSQTPEPPTTQSHIPETSA 620
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKtpvqqPGPGKIPAQQAGPGKTSAQQTgPTKPPSQLPGPAKPPPQQPG--PAKPPPQQPGSAKPPPQQPGST 507
Cdd:PHA03378   621 PRQWPMPLR-----PIPMRPLRMQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGhiPYQPSPTGANTMLPIQWAPGTM 694
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  508 KPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPS 581
Cdd:PHA03378   695 QPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPA 768
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
316-541 9.04e-16

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 84.54  E-value: 9.04e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  316 AQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKP-PVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQP 394
Cdd:PRK12323   362 AFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPaPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  395 PGVGKTPAQQPGPAKPPtqqVGTPKPLAQQPGlqsPAKAPGPTKTPVQQPGPGKIPAQQAGPgktsAQQTGPTKPPSqlP 474
Cdd:PRK12323   442 RGPGGAPAPAPAPAAAP---AAAARPAAAGPR---PVAAAAAAAPARAAPAAAPAPADDDPP----PWEELPPEFAS--P 509
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  475 GPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA 541
Cdd:PRK12323   510 APAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
PDZ_canonical cd00136
canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs ...
4497-4585 9.62e-16

canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain. PDZ domains usually bind to short specific peptide sequences located at the C-terminal end of their partner proteins known as PDZ binding motifs. These domains can also interact with internal peptide motifs and certain lipids, and can take part in a head-to-tail oligomerization with other PDZ domains. The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. The canonical PDZ domain contains six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467153 [Multi-domain]  Cd Length: 81  Bit Score: 74.89  E-value: 9.62e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSkdhtvsGNGLGIRIVGGKEIPGhsgeiGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd00136      1 TVTLEKDP------GGGLGFSIRGGKDGGG-----GIFVSRVEPGGPAARDGRLRVGDRILEVNGVSLEGLTHEEAVELL 69

                   ....*....
gi 2462613745 4577 SQQSGEAEI 4585
Cdd:cd00136     70 KSAGGEVTL 78
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
312-583 1.02e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 84.65  E-value: 1.02e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  312 GKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKAL 391
Cdd:PRK07764   392 GAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  392 AQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQ----------------------SPAKAPGPTKTPVQ------- 442
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrsrktWAILLPEATVLGVRgdtlvlg 551
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 -----------QPGPGKIPAQ--------------QAGPGKTSAQQTGPTKPPSQLPGPAKP-PPQQPGPAKPPPQQPGS 496
Cdd:PRK07764   552 fstgglarrfaSPGNAEVLVTalaeelggdwqveaVVGPAPGAAGGEGPPAPASSGPPEEAArPAAPAAPAAPAAPAPAG 631
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  497 AKPPPQQPGSTKPPPQQPGPAKPSPQ-------------QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGS 563
Cdd:PRK07764   632 AAAAPAEASAAPAPGVAAPEHHPKHVavpdasdggdgwpAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPA 711
                          330       340
                   ....*....|....*....|
gi 2462613745  564 GKPLQPPTVSPSAKQPPSQG 583
Cdd:PRK07764   712 GQADDPAAQPPQAAQGASAP 731
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
229-583 1.27e-15

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 84.23  E-value: 1.27e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  229 PLQQDGTPKSISSQQPEKikSQPPGTGKPIQGPTQTPQTDHAKLPLQrdasrpqtkqadivrGESVKPSLPSPSKPPIQQ 308
Cdd:pfam03157  116 PQQVSYYPGQASPQRPGQ--GQQPGQGQQWYYPTSPQQPGQWQQPGQ---------------GQQGYYPTSPQQSGQRQQ 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQqpGHEKSQPGPAKPPAQPSGLTKP-LAQQPGTVKPPVQPpGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPG 387
Cdd:pfam03157  179 PGQGQQLRQ--GQQGQQSGQGQPGYYPTSSQQPgQLQQTGQGQQGQQP-ERGQQGQQPGQGQQPGQGQQGQQPGQPQQLG 255
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  388 PKALAQPPGVGKTPAQ-------QPGPAKPPTQQVGTPK----PLAQQPGLQSPAKAPGPTKTPVQQPGPGKiPAQQAGP 456
Cdd:pfam03157  256 QGQQGYYPISPQQPRQwqqsgqgQQGYYPTSLQQPGQGQsgyyPTSQQQAGQLQQEQQLGQEQQDQQPGQGR-QGQQPGQ 334
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  457 GKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKP-----PPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKpPS 531
Cdd:pfam03157  335 GQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPgyyptSQQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGQQPGQGQ-PG 413
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  532 QQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKplQPPTVSPSAKQPPSQG 583
Cdd:pfam03157  414 YYPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQ--QPGQEQPGQGQQPGQG 463
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
307-585 1.90e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 83.88  E-value: 1.90e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKP-PAQQTGSEKPSSEQ 385
Cdd:PRK07764   396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAaPSAQPAPAPAAAPE 475
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGKTPAQQPGPAKPPTQQ--------------------------------------------------- 414
Cdd:PRK07764   476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAgaddaatlrerwpeilaavpkrsrktwaillpeatvlgvrgdtlvlgfstg 555
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  415 --------------------------------VGTPKPLAQQPGlqSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQ 462
Cdd:PRK07764   556 glarrfaspgnaevlvtalaeelggdwqveavVGPAPGAAGGEG--PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAA 633
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQ 542
Cdd:PRK07764   634 AAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQ 713
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  543 QPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLP 585
Cdd:PRK07764   714 ADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPA 756
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
307-587 2.37e-15

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 84.07  E-value: 2.37e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGheksqpGPAKPPAqpsgltkPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQP 386
Cdd:PHA03307   111 PSSPDPPPPTPPPA------SPPPSPA-------PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLS 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGvgkTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGP 466
Cdd:PHA03307   178 SPEETARAPS---SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENE 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKP--PPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQP 544
Cdd:PHA03307   255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSssSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE 334
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  545 SPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKT 587
Cdd:PHA03307   335 SSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
318-524 3.06e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 83.11  E-value: 3.06e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  318 QPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGtTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAqPPGV 397
Cdd:PRK07764   587 VVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAA-PAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  398 GKTPAQQPGPAKPPTQQVGTPKPLAQqpglqsPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPA 477
Cdd:PRK07764   665 GGDGWPAKAGGAAPAAPPPAPAPAAP------AAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDP 738
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 2462613745  478 KPPPQQPGPAKPPPQQPGSAkPPPQQPGSTKPPPQQPGPAKPSPQQP 524
Cdd:PRK07764   739 VPLPPEPDDPPDPAGAPAQP-PPPPAPAPAAAPAAAPPPSPPSEEEE 784
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
184-583 3.21e-15

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 83.07  E-value: 3.21e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  184 ASQEETTKKQKVVQKEQGKPEGiikpplqqqpPKPIPKQQGPGRDPLQQDGTPKSISSQQPekiksqppGTGKPIQGPTQ 263
Cdd:pfam03157  372 TSQQQPQQGQQPEQGQQGQQQG----------QGQQGQQPGQGQQPGQGQPGYYPTSPQQS--------GQGQPGYYPTS 433
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  264 TPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPiQQPTPGKP-----PAQQPGH-----EKSQPGPAKPPA 333
Cdd:pfam03157  434 PQQSGQGQQPGQGQQPGQEQPGQGQQPGQGQQGQQPGQPEQG-QQPGQGQPgyyptSPQQSGQgqqlgQWQQQGQGQPGY 512
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  334 QPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGvgktpaqQPGPAKPPTQ 413
Cdd:pfam03157  513 YPTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPG-------QGQQGQQPGQ 585
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  414 qvgtpkplAQQPGLQSPAKAPgptkTPVQQPGPGKIPA--QQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPP 491
Cdd:pfam03157  586 --------GQQPGQGQPGYYP----TSPQQSGQGQQPGqwQQPGQGQPGYYPTSSLQLGQGQQGYYPTSPQQPGQGQQPG 653
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  492 QQPGSAKpppQQPGSTKPPPQQPGpakpSPQQPGSTKPPSQ--QPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQP 569
Cdd:pfam03157  654 QWQQSGQ---GQQGYYPTSPQQSG----QAQQPGQGQQPGQwlQPGQGQQGYYPTSPQQPGQGQQLGQGQQSGQGQQGYY 726
                          410
                   ....*....|....
gi 2462613745  570 PTvSPSAKQPPSQG 583
Cdd:pfam03157  727 PT-SPGQGQQSGQG 739
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
255-585 4.45e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 82.73  E-value: 4.45e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  255 GKPIQGPTQTPQTDHaklPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPG-HEKSQPGPAKPPA 333
Cdd:PRK07764   384 RLGVAGGAGAPAAAA---PSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGnAPAGGAPSPPPAA 460
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  334 QPSGLTKPLAQQPG--TVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPG------VGKTPAQQP 405
Cdd:PRK07764   461 APSAQPAPAPAAAPepTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEILAAVPKrsrktwAILLPEATV 540
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  406 GPAKPPTQQVGTPKPLAQQ--------------------------------PGLQSPAKAPGPTK----TPVQQPGPGKI 449
Cdd:PRK07764   541 LGVRGDTLVLGFSTGGLARrfaspgnaevlvtalaeelggdwqveavvgpaPGAAGGEGPPAPASsgppEEAARPAAPAA 620
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  450 PAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQ---QPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGS 526
Cdd:PRK07764   621 PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASdggDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  527 TKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQP-PTVSPSAKQPPSQGLP 585
Cdd:PRK07764   701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPePDDPPDPAGAPAQPPP 760
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
356-553 5.88e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 82.34  E-value: 5.88e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  356 PGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPG 435
Cdd:PRK07764   588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  436 PTKTPVQQPGPGKiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPG 515
Cdd:PRK07764   668 GWPAKAGGAAPAA-PPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPD 746
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 2462613745  516 PAkPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQ 553
Cdd:PRK07764   747 DP-PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
186-585 1.19e-14

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 81.21  E-value: 1.19e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  186 QEETTKKQKVVQKEQGKPEGIIKPPLQQQPPKPIPKQQGPGRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQTP 265
Cdd:pfam09606   51 RDMSKKAAQQQQPQGGQGNGGMGGGQQGMPDPINALQNLAGQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRP 130
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  266 QTDHAKlplqrdASRPQTKQAdivrgesVKPSLPSPSKPPIQQPTPGKPPAQQPghekSQPGPAKPPAQPSGLTKPLAQQ 345
Cdd:pfam09606  131 QMPMGG------AGFPSQMSR-------VGRMQPGGQAGGMMQPSSGQPGSGTP----NQMGPNGGPGQGQAGGMNGGQQ 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  346 PGtvkPPVQPPGTTKPPAqPLGPAKPPAQqTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQ---QVGTPKPLA 422
Cdd:pfam09606  194 GP---MGGQMPPQMGVPG-MPGPADAGAQ-MGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQlgmGINQMQQMP 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  423 QQPGLQSPAKAPGPTKTPVQQpGPGKIPAQQAGPGKTSAQQTGpTKPPSQLPGPAKPP--PQQP----GPAKPPPQQPGS 496
Cdd:pfam09606  269 QGVGGGAGQGGPGQPMGPPGQ-QPGAMPNVMSIGDQNNYQQQQ-TRQQQQQQGGNHPAahQQQMnqsvGQGGQVVALGGL 346
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  497 AKPPPQQPGSTKP---PPQQPGPA--KPSPQQPGSTKPPSQQPGSAKPSAQQPSpaKPSAQQSTKPVSQTGSGKPLQPPT 571
Cdd:pfam09606  347 NHLETWNPGNFGGlgaNPMQRGQPgmMSSPSPVPGQQVRQVTPNQFMRQSPQPS--VPSPQGPGSQPPQSHPGGMIPSPA 424
                          410
                   ....*....|....
gi 2462613745  572 VSPSAKQPPSQGLP 585
Cdd:pfam09606  425 LIPSPSPQMSQQPA 438
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
343-585 1.39e-14

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 81.24  E-value: 1.39e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  343 AQQPgtvkPPVQPPGTTKPPAQPLGPAKPPAQQ---------TGSEKPSSEQPGPK--ALAQPPGVG-KTPAQQPGPAKP 410
Cdd:pfam09770  104 RQQP----AARAAQSSAQPPASSLPQYQYASQQsqqpskpvrTGYEKYKEPEPIPDlqVDASLWGVApKKAAAPAPAPQP 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  411 PTQQVGTPKP---------LAQQPGLQSPAKAPGPTKTPVQQPGPgkIPAQQAGPGktsaQQTGPTKPPSQLPGPAKPPP 481
Cdd:pfam09770  180 AAQPASLPAPsrkmmsleeVEAAMRAQAKKPAQQPAPAPAQPPAA--PPAQQAQQQ----QQFPPQIQQQQQPQQQPQQP 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  482 QQPGPAKPPPQQPgsakpppQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:pfam09770  254 QQHPGQGHPVTIL-------QRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGVQP 326
                          250       260
                   ....*....|....*....|....
gi 2462613745  562 GSGKPlQPPTVSPSAKQPPSQGLP 585
Cdd:pfam09770  327 APAHQ-AHRQQGSFGRQAPIITHP 349
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
346-546 3.35e-14

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 79.64  E-value: 3.35e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  346 PGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPtqqvgtpkPLAQQP 425
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPD--------ASDGGD 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  426 GLQSPAKAPGPTKTPVQQPGPGKiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPg 505
Cdd:PRK07764   668 GWPAKAGGAAPAAPPPAPAPAAP-AAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEP- 745
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 2462613745  506 stkPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSP 546
Cdd:PRK07764   746 ---DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
224-570 4.16e-14

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 79.28  E-value: 4.16e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  224 GPGRDPLQQDGTPKSISSQQPEKIKSQPPgtgkpiQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSK 303
Cdd:pfam09606  105 GPGGPMGQQMGGPGTASNLLASLGRPQMP------MGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNG 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  304 PPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPsGLTKPLAQQPGTVKPPVQPPgttkPPAQPLGPAKPPAQQtgsekpss 383
Cdd:pfam09606  179 GPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMP-GPADAGAQMGQQAQANGGMN----PQQMGGAPNQVAMQQ-------- 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  384 EQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTP----KPLAQQPGLQSPAKAPGPTKTPVQQPG-PGKIPAQQAGPGK 458
Cdd:pfam09606  246 QQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPgqpmGPPGQQPGAMPNVMSIGDQNNYQQQQTrQQQQQQGGNHPAA 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  459 TSAQQTGPTKPPSQ---LPGPAKPPPQQPG---PAKPPPQQPG------SAKPPP------QQPGSTKPPPQQPgpAKPS 520
Cdd:pfam09606  326 HQQQMNQSVGQGGQvvaLGGLNHLETWNPGnfgGLGANPMQRGqpgmmsSPSPVPgqqvrqVTPNQFMRQSPQP--SVPS 403
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPSA--QQPSPAKPSAQQSTKPVSQTGSGKPLQPP 570
Cdd:pfam09606  404 PQGPGSQPPQSHPGGMIPSPAliPSPSPQMSQQPAQQRTIGQDSPGGSLNTP 455
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
338-583 5.42e-14

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 78.76  E-value: 5.42e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  338 LTKPLAQQPGTVKPPVQPPGTTKPP-AQPLGPAKPPAQQTgsEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPtqqvg 416
Cdd:PRK12323   357 LLRMLAFRPGQSGGGAGPATAAAAPvAQPAPAAAAPAAAA--PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPA----- 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  417 tpkPLAQQPGLQSPAKAPGPTKTPVqqPGPGKIPAQQAGPgktsaqqtgPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGS 496
Cdd:PRK12323   430 ---PEALAAARQASARGPGGAPAPA--PAPAAAPAAAARP---------AAAGPRPVAAAAAAAPARAAPAAAPAPADDD 495
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  497 AKPPPQQPgstkPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPA-KPSAQQSTKPVSQTGSGKPLQPPTVSPS 575
Cdd:PRK12323   496 PPPWEELP----PEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLApAPAAAPAPRAAAATEPVVAPRPPRASAS 571

                   ....*...
gi 2462613745  576 AKQPPSQG 583
Cdd:PRK12323   572 GLPDMFDG 579
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
372-587 9.92e-14

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 77.99  E-value: 9.92e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  372 PAQQTGSEKPSSEQPGP-----KALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGP 446
Cdd:PRK12323   365 PGQSGGGAGPATAAAAPvaqpaPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  447 GKIPAQQAGPGKTSAQQTGPtkppsqlpgPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPG-PAKPSPQQPG 525
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARP---------AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPeFASPAPAQPD 515
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  526 STKPPSQQPGSAKPSAQQPSPAKPSaqQSTKPVSQTGSGKPLQPPTVSPSakQPPSQGLPKT 587
Cdd:PRK12323   516 AAPAGWVAESIPDPATADPDDAFET--LAPAPAAAPAPRAAAATEPVVAP--RPPRASASGL 573
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
366-561 3.48e-13

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 76.56  E-value: 3.48e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  366 LGPAKPPAQQTGSEKPSSEQPGPkALAQPPGVGKtPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPG 445
Cdd:PRK07764   588 VGPAPGAAGGEGPPAPASSGPPE-EAARPAAPAA-PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDG 665
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQqAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKP--PPQQPGSTKPPPQQPGPAKPSPQQ 523
Cdd:PRK07764   666 GDGWPAK-AGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAaqPPQAAQGASAPSPAADDPVPLPPE 744
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 2462613745  524 PGSTKPPSQQPGsakPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:PRK07764   745 PDDPPDPAGAPA---QPPPPPAPAPAAAPAAAPPPSPP 779
PRK10263 PRK10263
DNA translocase FtsK; Provisional
320-923 5.90e-13

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 75.89  E-value: 5.90e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  320 GHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVqpPGTTKPPAQPLGPAKP-PAQQTG----SEKPSSEQPGPKaLAQP 394
Cdd:PRK10263   313 GAPITEPVAVAAAATTATQSWAAPVEPVTQTPPV--ASVDVPPAQPTVAWQPvPGPQTGepviAPAPEGYPQQSQ-YAQP 389
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  395 PGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAkapgptkTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLP 474
Cdd:PRK10263   390 AVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPE-------QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTY 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  475 GPAKPPPqQPGPAKPPPQQPgsakPPPQQPGSTKPPP--QQPGPAKP--------------SPQQPGSTKPPSQQPgsak 538
Cdd:PRK10263   463 QTEQTYQ-QPAAQEPLYQQP----QPVEQQPVVEPEPvvEETKPARPplyyfeeveekrarEREQLAAWYQPIPEP---- 533
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  539 psAQQPSPAKPSAQQSTKPVSqtgsgkplqPPTVSPSAKQPPSQGLPKTIcpLCNTTELLLHVPEkanFNTCTecqttvc 618
Cdd:PRK10263   534 --VKEPEPIKSSLKAPSVAAV---------PPVEAAAAVSPLASGVKKAT--LATGAAATVAAPV---FSLAN------- 590
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  619 slcGFNPNPhltevkewlclncQMKRALGgdlapvPSSPQPKLKTAPvTTTSAVSKSSPQPQQTSPKKDAAPKQdlSKAP 698
Cdd:PRK10263   591 ---SGGPRP-------------QVKEGIG------PQLPRPKRIRVP-TRRELASYGIKLPSQRAAEEKAREAQ--RNQY 645
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  699 EPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVAD--DKPKQPKMVKPTTDLVSSSSATTKP 776
Cdd:PRK10263   646 DSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAElaRQFAQTQQQRYSGEQPAGANPFSLD 725
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  777 DIPSSKVQSQAEEKTTPPLKTDSAKPSQSfpPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPK 856
Cdd:PRK10263   726 DFEFSPMKALLDDGPHEPLFTPIVEPVQQ--PQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ 803
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  857 KAQTKMSPKPdaKPMPKGSPTPPGPRPTAGQtvptPQQSPKPQEQSRRFSLNLGSITDA-PKSQPTTP 923
Cdd:PRK10263   804 YQQPQQPVAP--QPQYQQPQQPVAPQPQYQQ----PQQPVAPQPQDTLLHPLLMRNGDSrPLHKPTTP 865
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
324-582 9.47e-13

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 74.89  E-value: 9.47e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  324 SQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGttkpPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKtpAQ 403
Cdd:PRK07003   366 GAPGGGVPARVAGAVPAPGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATAD--RG 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  404 QPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQ 483
Cdd:PRK07003   440 DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  484 PGPAKPPPQQPGSAKPppqQPGSTKPPPQQPGPAKP-----SPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPV 558
Cdd:PRK07003   520 DAPAAAAPPAPEARPP---TPAAAAPAARAGGAAAAldvlrNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQV 596
                          250       260
                   ....*....|....*....|....
gi 2462613745  559 SQTGSGKPLQPPTVSPSAKQPPSQ 582
Cdd:PRK07003   597 PTPRARAATGDAPPNGAARAEQAA 620
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
307-525 1.80e-12

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 74.25  E-value: 1.80e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAqqtgsekPSSEQP 386
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDA-------SDGGDG 668
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGVGKTPAQQPGPAKPPTqqvGTPKPLAQQPGLQSPAKAPGPtktpvQQPGPGKIPAQQAGPGKTSAQQTGP 466
Cdd:PRK07764   669 WPAKAGGAAPAAPPPAPAPAAPAAPA---GAAPAQPAPAPAATPPAGQAD-----DPAAQPPQAAQGASAPSPAADDPVP 740
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKP-PSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGstkPPPQQPGPAKPSPQQPG 525
Cdd:PRK07764   741 LPPePDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS---EEEEMAEDDAPSMDDED 797
PDZ1_GgSTXBP4-like cd06692
PDZ1 domain of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, ...
4512-4580 3.17e-12

PDZ1 domain of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Gallus gallus uncharacterized syntaxin-binding protein 4 (STXBP4) isoform X1, and related domains. Gallus gallus STXBP4 isoform X1 contains 2 PDZ domains (PDZ1 and PDZ2). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This STXBP4-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467179 [Multi-domain]  Cd Length: 88  Bit Score: 65.32  E-value: 3.17e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4512 NGLGIRIVGGkeIPGHSGE-IGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQS 4580
Cdd:cd06692      8 KGLGIKIIGG--YRENTGEeFGIFIKRILPGGLAATDGRLKEGDLILEVNGESLQGVTNERAVSILRSAS 75
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
285-581 3.30e-12

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 73.67  E-value: 3.30e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  285 QADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPgttkPPAQ 364
Cdd:PHA03307    47 SAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPP----PTPP 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  365 PLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQ-QVGTPKPLAQQPGLQSPAKAPGPTKTPVQQ 443
Cdd:PHA03307   123 PASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQaALPLSSPEETARAPSSPPAEPPPSTPPAAA 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  444 PGPG---KIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKP------PPQQPGSAKPPPQQPGSTKPPPQQP 514
Cdd:PHA03307   203 SPRPprrSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPenecplPRPAPITLPTRIWEASGWNGPSSRP 282
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  515 GPAK---------PSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPS 581
Cdd:PHA03307   283 GPASssssprersPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPP 358
PHA03379 PHA03379
EBNA-3A; Provisional
369-591 3.45e-12

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 73.17  E-value: 3.45e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  369 AKPPAQQTGSEKPSSEQPGPKA---LAQPPGVGktPAQQPgpaKPPTQQVGTPKPLAQQPGLqspakAPGPtktpVQQPG 445
Cdd:PHA03379   406 EKASEPTYGTPRPPVEKPRPEVpqsLETATSHG--SAQVP---EPPPVHDLEPGPLHDQHSM-----APCP----VAQLP 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGkiPAQQAGPGKtsaQQTGPTKPPSQLPGPAKPP------PQQPGPAKPPPQQPGSAKPPPQ------QPGSTKPPPQQ 513
Cdd:PHA03379   472 PG--PLQDLEPGD---QLPGVVQDGRPACAPVPAPagpivrPWEASLSQVPGVAFAPVMPQPMpvepvpVPTVALERPVC 546
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  514 PGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKpvSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPL 591
Cdd:PHA03379   547 PAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVR--DRLARLRAEAQPYQASVEVQPPQLTQVSPQQPM 622
PHA03247 PHA03247
large tegument protein UL36; Provisional
654-1073 3.54e-12

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.82  E-value: 3.54e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  654 PSSPQPKlKTAPVTTTSAVSKSSPqPQQTSPKKDAAPKQDLSkapepkkppplvkqptlhgSPSAKAKQPPeADSLSKPA 733
Cdd:PHA03247  2570 PPRPAPR-PSEPAVTSRARRPDAP-PQSARPRAPVDDRGDPR-------------------GPAPPSPLPP-DTHAPDPP 2627
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  734 PPKEPSVPSEQDKAPVADDKPKQPKMVKPTTDLVSSS---SATTKPDIPSSKVQSQAEEKTTPPLK--TDSAKPSQSfPP 808
Cdd:PHA03247  2628 PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTVGslTSLADPPPP-PP 2706
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  809 TGEKVSPFDSKAIPRP-ASDSKIISHPG----PSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRP 883
Cdd:PHA03247  2707 TPEPAPHALVSATPLPpGPAAARQASPAlpaaPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRP 2786
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  884 TAGQTVPTPQQSPKPQEQS--------RRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIFSQASNLI--------- 946
Cdd:PHA03247  2787 AVASLSESRESLPSPWDPAdppaavlaPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrp 2866
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  947 -STAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAE-----QAPTVKR 1020
Cdd:PHA03247  2867 pSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPppprpQPPLAPT 2946
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745 1021 TETEKKPPP---IKDSKSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDP 1073
Cdd:PHA03247  2947 TDPAGAGEPsgaVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
410-810 4.43e-12

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 73.28  E-value: 4.43e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  410 PPTQQVGTPKPLAQQPGLQspAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKP--PPQQPGPA 487
Cdd:PHA03307    39 SQGQLVSDSAELAAVTVVA--GAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPtpPGPSSPDP 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  488 KPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSqQPGSAKPSAQQPSPAKPSAQQSTKPVSqtgSGKPL 567
Cdd:PHA03307   117 PPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAASSRQAALPLSSPEETARAPS---SPPAE 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  568 QPPTVSPSAKQPPSQGLPKTICP--LCNTTELLLH----VPEKANFNTCTECQTTVCSLCGFNPNPHltevkewlclnCQ 641
Cdd:PHA03307   193 PPPSTPPAAASPRPPRRSSPISAsaSSPAPAPGRSaaddAGASSSDSSSSESSGCGWGPENECPLPR-----------PA 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  642 MKRALGGDLAPVPSSPQPKLKTaPVTTTSAVSKSSPQPQQTSPKKDAAPKQ--DLSKAPEPKKPPPLVKQPTLHGSPSAK 719
Cdd:PHA03307   262 PITLPTRIWEASGWNGPSSRPG-PASSSSSPRERSPSPSPSSPGSGPAPSSprASSSSSSSRESSSSSTSSSSESSRGAA 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  720 AKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPttdlvSSSSATTKPDIPSSKVQSQAEEKTT----PPL 795
Cdd:PHA03307   341 VSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAG-----RPTRRRARAAVAGRARRRDATGRFPagrpRPS 415
                          410
                   ....*....|....*
gi 2462613745  796 KTDSAKPSQSFPPTG 810
Cdd:PHA03307   416 PLDAGAASGAFYARY 430
PHA03247 PHA03247
large tegument protein UL36; Provisional
351-575 3.13e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 70.74  E-value: 3.13e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  351 PPVQPPGTTKPPAQPLGPAKPPAqqtgseKPSSEQPGPKAlAQPPGV------GKTPAQQPGPAKPPTQQVGTPKPLAQQ 424
Cdd:PHA03247   258 PPVVGEGADRAPETARGATGPPP------PPEAAAPNGAA-APPDGVwgaalaGAPLALPAPPDPPPPAPAGDAEEEDDE 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  425 PG---LQSPAKAPG------------PTKTP---VQQPGPGKIPAQQAGPGKTSAQQTGPTKPP--SQLPGPAKPPPQQP 484
Cdd:PHA03247   331 DGameVVSPLPRPRqhyplgfpkrrrPTWTPpssLEDLSAGRHHPKRASLPTRKRRSARHAATPfaRGPGGDDQTRPAAP 410
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  485 GPAKPPpqqpgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPakpsAQQSTKPVSQTGSG 564
Cdd:PHA03247   411 VPASVP------TPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDP----DDATRKALDALRER 480
                          250
                   ....*....|.
gi 2462613745  565 KPLQPPTVSPS 575
Cdd:PHA03247   481 RPPEPPGADLA 491
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
401-590 3.18e-11

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 69.90  E-value: 3.18e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  401 PAQQPGPAKPPTQQ---VGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTS--------AQQTGPTKP 469
Cdd:PRK12323   365 PGQSGGGAGPATAAaapVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSpapealaaARQASARGP 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  470 PSQlPGPAKPPPQQPGPAKPPPQQPgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:PRK12323   445 GGA-PAPAPAPAAAPAAAARPAAAG--PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 2462613745  550 SAQQSTKP-VSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICP 590
Cdd:PRK12323   522 VAESIPDPaTADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
225-589 3.33e-11

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 70.20  E-value: 3.33e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  225 PGRDPLQQDGTPKSISSQQPEKIKS-QPPGTGKPIQGPTQTPQTDHAklPLQRDASRPQTKQAdivRGESVKPSLPSPSK 303
Cdd:PHA03307    73 PGPGTEAPANESRSTPTWSLSTLAPaSPAREGSPTPPGPSSPDPPPP--TPPPASPPPSPAPD---LSEMLRPVGSPGPP 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  304 PPIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGS----- 378
Cdd:PHA03307   148 PAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPapgrs 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  379 ------EKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQ--QVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIP 450
Cdd:PHA03307   228 aaddagASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRiwEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  451 AQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGstkPPPQQPGPAKPSPQqpgstkPP 530
Cdd:PHA03307   308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP---SSPRKRPRPSRAPS------SP 378
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  531 SQQPGSAKPSAQQPSPAkPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTIC 589
Cdd:PHA03307   379 AASAGRPTRRRARAAVA-GRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPS 436
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
263-578 4.03e-11

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 69.32  E-value: 4.03e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  263 QTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGP--AKPPAQPSGLTK 340
Cdd:COG5180    156 QRSDPILAKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGAdhPRPEAASSPKVD 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  341 PLAQQPGTVKPPVQPpgtTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAK---------PP 411
Cdd:COG5180    236 PPSTSEARSRPATVD---AQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPidvkgvasaPP 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  412 TQQVGTPKPLAQQPGLQ-------SPAKAPgPTKTPVQQPGPGKIPAQQAGPGK------TSAQQTGPTKPPSQLPGPAK 478
Cdd:COG5180    313 ATRPVRPPGGARDPGTPrpgqpteRPAGVP-EAASDAGQPPSAYPPAEEAVPGKpleqgaPRPGSSGGDGAPFQPPNGAP 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  479 PPPQQPGPAKPPPQQPG-SAKPPPQQPGSTKPPPQQPGpakpspQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQ----Q 553
Cdd:COG5180    392 QPGLGRRGAPGPPMGAGdLVQAALDGGGRETASLGGAA------GGAGQGPKADFVPGDAESVSGPAGLADQAGAaastA 465
                          330       340
                   ....*....|....*....|....*
gi 2462613745  554 STKPVSQTGSGKPLQPPTVSPSAKQ 578
Cdd:COG5180    466 MADFVAPVTDATPVDVADVLGVRPD 490
PDZ smart00228
Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF ...
4510-4586 4.09e-11

Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.


Pssm-ID: 214570 [Multi-domain]  Cd Length: 85  Bit Score: 62.01  E-value: 4.09e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  4510 SGNGLGIRIVGGKEIPGhsgeiGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEIC 4586
Cdd:smart00228   10 GGGGLGFSLVGGKDEGG-----GVVVSSVVPGSPAAKAG-LRVGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLT 80
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
270-585 4.86e-11

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 68.94  E-value: 4.86e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  270 AKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPgKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTV 349
Cdd:COG5180    146 AGVALAAALLQRSDPILAKDPDGDSASTLPPPAEKLDKVLTE-PRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHP 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  350 KPPVQPPGTTKPPAQPLGPAKPPaqqTGSEKPSSEQPGPKALAQPPGVGKTPAQQPgPAKPPTQQVGTPKPLAQQPGLQS 429
Cdd:COG5180    225 RPEAASSPKVDPPSTSEARSRPA---TVDAQPEMRPPADAKERRRAAIGDTPAAEP-PGLPVLEAGSEPQSDAPEAETAR 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKTPVQqPGPGKIPAqqagpgktSAQQTGPTKPPSQLPGPAKPPPqqpgpAKPPPQQPGSAKPPPQQPGSTKP 509
Cdd:COG5180    301 PIDVKGVASAPPA-TRPVRPPG--------GARDPGTPRPGQPTERPAGVPE-----AASDAGQPPSAYPPAEEAVPGKP 366
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  510 PPQQpgpaKPSPQQPGSTKPPSQqPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLP 585
Cdd:COG5180    367 LEQG----APRPGSSGGDGAPFQ-PPNGAPQPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAGGAGQGPK 437
PRK10263 PRK10263
DNA translocase FtsK; Provisional
429-585 8.66e-11

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 68.96  E-value: 8.66e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  429 SPAKA---PGPTKtPVQQPG--PGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPgpAKPPPQQPGSAKPPPQQ 503
Cdd:PRK10263   730 SPMKAlldDGPHE-PLFTPIvePVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQP--QYQQPQQPVAPQPQYQQ 806
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  504 PGStkppPQQPGPAKPSPQQPGSTKPPSQQPgsakpsaQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVS-PS---AKQP 579
Cdd:PRK10263   807 PQQ----PVAPQPQYQQPQQPVAPQPQYQQP-------QQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPlPSldlLTPP 875

                   ....*.
gi 2462613745  580 PSQGLP 585
Cdd:PRK10263   876 PSEVEP 881
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
309-549 1.39e-10

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 67.95  E-value: 1.39e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGhekSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPpgtTKPPAQPLGPAKPPAQQTGSEKPSSEQPGP 388
Cdd:PRK07003   367 APGGGVPARVAG---AVPAPGARAAAAVGASAVPAVTAVTGAAGAAL---APKAAAAAAATRAEAPPAAPAPPATADRGD 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  389 KALAQPPGVgKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTK---TPVQQPGPGKIPAQQAGPGKTSAQQTG 465
Cdd:PRK07003   441 DAADGDAPV-PAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAfepAPRAAAPSAATPAAVPDARAPAAASRE 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  466 PTKPPSQLPGPAKPPPQqPGPAKPPPQQ-------------------------PGSAKPPPQQPGSTKPPPQQPGPAKPS 520
Cdd:PRK07003   520 DAPAAAAPPAPEARPPT-PAAAAPAARAggaaaaldvlrnagmrvssdrgaraAAAAKPAAAPAAAPKPAAPRVAVQVPT 598
                          250       260
                   ....*....|....*....|....*....
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:PRK07003   599 PRARAATGDAPPNGAARAEQAAESRGAPP 627
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
225-538 1.80e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 67.87  E-value: 1.80e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  225 PGRDPLQQDGTPKSISSQ--QPEKIKSQPPGTGKPIQ-GPTQTPQtdhaKLPLQRDASRPQTKQADIVRGESVKPSLPSP 301
Cdd:pfam03154  247 PPLQPMTQPPPPSQVSPQplPQPSLHGQMPPMPHSLQtGPSHMQH----PVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQ 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  302 SKPPiQQPTPGKPPAQQPGHEksQPGPAKPPAQPSglTKPlaqQPGTVKPPVQPPGTTKPPAQPLGPAK---------PP 372
Cdd:pfam03154  323 QRIH-TPPSQSQLQSQQPPRE--QPLPPAPLSMPH--IKP---PPTTPIPQLPNPQSHKHPPHLSGPSPfqmnsnlppPP 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  373 AQQTGSEKPSSEQPGpkalAQPPGVGKTPAQQ---PGPAKPP--TQQVGTPKPLAQQPGL----QSPAKAPGPTKtPVQQ 443
Cdd:pfam03154  395 ALKPLSSLSTHHPPS----AHPPPLQLMPQSQqlpPPPAQPPvlTQSQSLPPPAASHPPTsglhQVPSQSPFPQH-PFVP 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  444 PGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPakpspqQ 523
Cdd:pfam03154  470 GGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSP------E 543
                          330
                   ....*....|....*
gi 2462613745  524 PGSTKPPSQQPGSAK 538
Cdd:pfam03154  544 PTVVNTPSHASQSAR 558
PTZ00121 PTZ00121
MAEBL; Provisional
1157-1700 2.30e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.47  E-value: 2.30e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1157 VKKQEQEVKTEAEkvilEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASALQEKKPLPEEKKLiPEEEKIRSEEKKPLL 1236
Cdd:PTZ00121  1320 AKKKAEEAKKKAD----AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK-ADAAKKKAEEKKKAD 1394
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1237 EEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQvqiAEEKLEGRVAPKTVQEGKQP-QTKMEGLPSGTPQSLPKEDDKTTK 1315
Cdd:PTZ00121  1395 EAKKKAEEDKKKADELKKAAAAKKKADEAKKK---AEEKKKADEAKKKAEEAKKAdEAKKKAEEAKKAEEAKKKAEEAKK 1471
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1316 TIKEQPQPPCTAKPDQVEPGKEKTEKEDDKSDTSSSQQPKSPQGL-SDTGYSSDGI--------------------SSSL 1374
Cdd:PTZ00121  1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKkAEEAKKADEAkkaeeakkadeakkaeekkkADEL 1551
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1375 GEIPSLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSE--KKTQPHEVSPEQPKDQEKT 1452
Cdd:PTZ00121  1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEeaKKAEEAKIKAEELKKAEEE 1631
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1453 QSLSETLEITISEEEIKESQEERKDTFKK-DSQQDIPSSKDHKEKSEfvddiTTRREPYDSVEESSESENSPvPQRKRRT 1531
Cdd:PTZ00121  1632 KKKVEQLKKKEAEEKKKAEELKKAEEENKiKAAEEAKKAEEDKKKAE-----EAKKAEEDEKKAAEALKKEA-EEAKKAE 1705
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1532 SVGSSSSDEYKQEDSQGSGEEEDFIRKQIIEMSADEDASGSED---DEFIRNQLKEISSSTESQKKEETKGKGKITAGKH 1608
Cdd:PTZ00121  1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEakkDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1609 RRltrksstsidEDAGRRHSWHDEDDEAFDESPELK------------YRETKSQESEELVVTGGGGLRRFKTIELNSTI 1676
Cdd:PTZ00121  1786 DE----------EDEKRRMEVDKKIKDIFDNFANIIeggkegnlvindSKEMEDSAIKEVADSKNMQLEEADAFEKHKFN 1855
                          570       580       590
                   ....*....|....*....|....*....|
gi 2462613745 1677 ADKYSAESSQ------KKTSLYFDEEPELE 1700
Cdd:PTZ00121  1856 KNNENGEDGNkeadfnKEKDLKEDDEEEIE 1885
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
337-587 3.26e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.33  E-value: 3.26e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  337 GLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSE-KPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQqv 415
Cdd:NF033839   278 GLTQDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEvKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVK-- 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  416 gtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPgKTSAQQTGPTKPPSQLPGPAKPPPQ-QPGPAKPPPQ-Q 493
Cdd:NF033839   356 --PQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKP-KPEVKPQPEKPKPEVKPQPEKPKPEvKPQPEKPKPEvK 432
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  494 PGSAKPPPQ---QPGSTKPP-PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTkPVSQTGSGKPLQP 569
Cdd:NF033839   433 PQPEKPKPEvkpQPEKPKPEvKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPST-PNNLSKDKQPSNQ 511
                          250
                   ....*....|....*...
gi 2462613745  570 PTVSPSAKQPPSQGLPKT 587
Cdd:NF033839   512 ASTNEKATNKPKKSLPST 529
PHA03247 PHA03247
large tegument protein UL36; Provisional
311-522 3.29e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 3.29e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  311 PGKPPAQQPGHEKSqPGPAKPPAQPSGLTKPLAQQPGTVKPP----VQPPGTtkPPAQPLGPAKPPAQQTGSEKPSSEQP 386
Cdd:PHA03247   255 PAPPPVVGEGADRA-PETARGATGPPPPPEAAAPNGAAAPPDgvwgAALAGA--PLALPAPPDPPPPAPAGDAEEEDDED 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPpgVGKTPAQQPGPAKPPTQQVGTPKPLAQQ--PGLQSPAKAPGPT---------KTPVQQPGPGKIPAQQAG 455
Cdd:PHA03247   332 GAMEVVSP--LPRPRQHYPLGFPKRRRPTWTPPSSLEDlsAGRHHPKRASLPTrkrrsarhaATPFARGPGGDDQTRPAA 409
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  456 PGKTSAqqtgPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQ 522
Cdd:PHA03247   410 PVPASV----PTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK 472
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
307-587 3.55e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 66.71  E-value: 3.55e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQpghekSQPGPAKPPAQPSGLTKPLAQqpgtvkpPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPsseqp 386
Cdd:pfam03154  164 QQILQTQPPVLQ-----AQSGAASPPSPPPPGTTQAAT-------AGPTPSAPSVPPQGSPATSQPPNQTQSTAA----- 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 gPKALAQPpGVGKTPAQQPGPaKPPTQQVGTPKPlaqqPGLQSPAKAPGPTKTPVQQPGPGKIpaqqagpgktsaqQTGP 466
Cdd:pfam03154  227 -PHTLIQQ-TPTLHPQRLPSP-HPPLQPMTQPPP----PSQVSPQPLPQPSLHGQMPPMPHSL-------------QTGP 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPSQLPGPAKPPPQQPGPAKPPPQqPGSAKPPPQQPGSTKPPPQqPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSP 546
Cdd:pfam03154  287 SHMQHPVPPQPFPLTPQSSQSQVPPG-PSPAAPGQSQQRIHTPPSQ-SQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIP 364
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  547 AKPSAQQSTKPVSQTGSgKPLQ-------PPTVS----------PSAKQPPSQGLPKT 587
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGP-SPFQmnsnlppPPALKplsslsthhpPSAHPPPLQLMPQS 421
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
306-549 4.84e-10

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 63.08  E-value: 4.84e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  306 IQQPTPGKPPAQQPGhEKSQPGPAKPPAQPSGL-------TKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGS 378
Cdd:pfam15822   18 VSNPKPGQPPQGWPG-SNPWNNPSAPPAVPSGLppstapsTVPFGPAPTGMYPSIPLTGPSPGPPAPFPPSGPSCPPPGG 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  379 EKPSSEQPGPKAlaqppgvgktpaqqPGPAKPPTQQVgtpkplaqqPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGk 458
Cdd:pfam15822   97 PYPAPTVPGPGP--------------IGPYPTPNMPF---------PELPRPYGAPTDPAAAAPSGPWGSMSSGPWAPG- 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  459 TSAQQTGPTKP-PSQLPGPAKPPPQQPGPAKPPPQqpGSAKPPPQQPGSTKPPPQQPGPAK-PSPQQPGSTKPPSqqpgs 536
Cdd:pfam15822  153 MGGQYPAPNMPyPSPGPYPAVPPPQSPGAAPPVPW--GTVPPGPWGPPAPYPDPTGSYPMPgLYPTPNNPFQVPS----- 225
                          250
                   ....*....|...
gi 2462613745  537 aKPSAQQPSPAKP 549
Cdd:pfam15822  226 -GPSGAPPMPGGP 237
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
443-901 5.76e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 65.78  E-value: 5.76e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKIPAQQAGPGKTSAqqtgptkPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQ 522
Cdd:PRK07764   386 GVAGGAGAPAAAAPSAAAA-------APAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  523 QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTkpvsqtgsgkplQPPTVSPSAKQPPSQGLPKTICPLCNttELLLHVP 602
Cdd:PRK07764   459 AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA------------AAPAAPAAPAAPAGADDAATLRERWP--EILAAVP 524
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  603 EKANFntctecqttvcSLCGFNPNPHLTEVKE---WL------------------CLNCQMKRALGGDLAP-VPSSPQPK 660
Cdd:PRK07764   525 KRSRK-----------TWAILLPEATVLGVRGdtlVLgfstgglarrfaspgnaeVLVTALAEELGGDWQVeAVVGPAPG 593
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  661 LktAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDlskapepkkppplvkqptlhgSPSAKAKQPPEADSLSKPAPPKEPSV 740
Cdd:PRK07764   594 A--AGGEGPPAPASSGPPEEAARPAAPAAPAAP---------------------AAPAPAGAAAAPAEASAAPAPGVAAP 650
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  741 PSEQDKAPVADdkpkqpkmvkpTTDLVSSSSATTKPDIPSSKVQSQAEekTTPPLKTDSAKPSQSFPPTGEKVSPFDSKA 820
Cdd:PRK07764   651 EHHPKHVAVPD-----------ASDGGDGWPAKAGGAAPAAPPPAPAP--AAPAAPAGAAPAQPAPAPAATPPAGQADDP 717
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  821 IPRPASDSKIISHPGPSSEskgqkqvDPVQKKEEPkkaqtkmsPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQE 900
Cdd:PRK07764   718 AAQPPQAAQGASAPSPAAD-------DPVPLPPEP--------DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782

                   .
gi 2462613745  901 Q 901
Cdd:PRK07764   783 E 783
PDZ3_PDZD2-PDZ1_hPro-IL-16-like cd06759
PDZ domain 3 of PDZ domain containing 2 (PDZD2), PDZ domain 1 of human pro-interleukin-16 ...
4511-4582 6.53e-10

PDZ domain 3 of PDZ domain containing 2 (PDZD2), PDZ domain 1 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family also includes the first PDZ domain (PDZ1) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16; 1332 amino-acid protein). Precursor IL-16 is cleaved to produce pro-IL-16 and mature IL-16 (derived from the C-terminal 121 AA). Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467240 [Multi-domain]  Cd Length: 87  Bit Score: 58.44  E-value: 6.53e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4511 GNGLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEE-VQSIISQQSGE 4582
Cdd:cd06759     11 GKGLGFSIVGGRDSP--RGPMGIYVKTIFPGGAAAEDGRLKEGDEILEVNGESLQGLTHQEaIQKFKQIKKGL 81
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
430-872 6.79e-10

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 65.87  E-value: 6.79e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPA--KP-PPQQPGPAKPP-----PQQPGSAKPP- 500
Cdd:PTZ00449   510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVgkKPgPAKEHKPSKIPtlskkPEFPKDPKHPk 589
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  501 -PQQPGSTKPP--PQQPgPAKPSPQQPGSTKPP--SQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTV--S 573
Cdd:PTZ00449   590 dPEEPKKPKRPrsAQRP-TRPKSPKLPELLDIPksPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPpfD 668
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  574 PSAKQppsqglpkticplcnttELLLHVPEKAnfNTCTECQTTVCSLCGFNPNPHLTEVKEwlclncqmkralGGDLAPV 653
Cdd:PTZ00449   669 PKFKE-----------------KFYDDYLDAA--AKSKETKTTVVLDESFESILKETLPET------------PGTPFTT 717
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  654 PSSPQPKLKTAPVTTTSAVSKssPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLvkqptLHGSPSAKAKQPP-EADSLSKP 732
Cdd:PTZ00449   718 PRPLPPKLPRDEEFPFEPIGD--PDAEQPDDIEFFTPPEEERTFFHETPADTP-----LPDILAEEFKEEDiHAETGEPD 790
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  733 APPKEPSVPSEQDKAPVAdDKPKQPK-------MVKPTTDLVSSSSATTKPdiPSSKVQSQAEEKTTPPLKTDSAKPSQS 805
Cdd:PTZ00449   791 EAMKRPDSPSEHEDKPPG-DHPSLPKkrhrldgLALSTTDLESDAGRIAKD--ASGKIVKLKRSKSFDDLTTVEEAEEMG 867
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  806 fpPTGEKVSPFDSKAiprPASDSKiiSHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSP-KPDAKPMP 872
Cdd:PTZ00449   868 --AEARKIVVDDDGT---EADDED--THPPEEKHKSEVRRRRPPKKPSKPKKPSKPKKPkKPDSAFIP 928
PHA03377 PHA03377
EBNA-3C; Provisional
316-582 7.18e-10

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 65.84  E-value: 7.18e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  316 AQQPGHEKSQPGPAKPPAQPS--------GLTKPLAQQPgtvkPPVQPPGTTKPPAQPLGPAKPPA------------QQ 375
Cdd:PHA03377   443 AEQAQSTPERPGPSDQPSVPVepahltpvEHTTVILHQP----PQSPPTVAIKPAPPPSRRRRGACvvydddiievidVE 518
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  376 TGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAqqag 455
Cdd:PHA03377   519 TTEEEESVTQPAKPHRKVQDGFQRSGRRQKRATPPKVSPSDRGPPKASPPVMAPPSTGPRVMATPSTGPRDMAPPS---- 594
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 pgktsaqqtgpTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPP--------QQPGSTKPPPQQPGPAKPSPQQPGST 527
Cdd:PHA03377   595 -----------TGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVvrmflrerLLEQSTGPKPKSFWEMRAGRDGSGIQ 663
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  528 KPPSQQPGsakpSAQQPSPAKPSAQQS--TKPVSQTGSGKPLQPPTVSP-SAKQPPSQ 582
Cdd:PHA03377   664 QEPSSRRQ----PATQSTPPRPSWLPSvfVLPSVDAGRAQPSEESHLSSmSPTQPISH 717
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
251-530 9.74e-10

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 65.26  E-value: 9.74e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  251 PPGTGKPIQGPTQTPqtdhaklplqrdASRPQtkqADIVRGESVKPSLPSPSKPPIQQPTPgKPPAQQPGHEKSQPGPAK 330
Cdd:PRK07003   367 APGGGVPARVAGAVP------------APGAR---AAAAVGASAVPAVTAVTGAAGAALAP-KAAAAAAATRAEAPPAAP 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  331 PPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKP 410
Cdd:PRK07003   431 APPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDA 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  411 PTQQVgtpkplAQQPGLQSPAKAPGPTKTPvQQPGPGKIPAQQAGPG-----------KTSAQQTGPTkppsqlPGPAKP 479
Cdd:PRK07003   511 RAPAA------ASREDAPAAAAPPAPEARP-PTPAAAAPAARAGGAAaaldvlrnagmRVSSDRGARA------AAAAKP 577
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  480 PPQQPGPAKPPPQQPGSAKPPPQQPGSTkpPPQQPGPAKPSPQQPGSTKPP 530
Cdd:PRK07003   578 AAAPAAAPKPAAPRVAVQVPTPRARAAT--GDAPPNGAARAEQAAESRGAP 626
PHA03377 PHA03377
EBNA-3C; Provisional
251-549 1.23e-09

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 65.07  E-value: 1.23e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  251 PPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSP--SKPPIQQPTPGKPPA------------ 316
Cdd:PHA03377   582 TPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVVRMflRERLLEQSTGPKPKSfwemragrdgsg 661
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  317 -QQPGHEKSQPGPAKPPAQPSGLtkplaqqPGTVKPPVQPPGTTKPPAQ----PLGPAKPPAQqtgSEKPSSEQP-GPKA 390
Cdd:PHA03377   662 iQQEPSSRRQPATQSTPPRPSWL-------PSVFVLPSVDAGRAQPSEEshlsSMSPTQPISH---EEQPRYEDPdDPLD 731
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  391 LAQPPGVGKTPAQQ---PGPAKPPTQQvgtpkplAQQPGLQSPAKAPGPTKTpVQQPGPGKIPAQQ----AGPGKTSAQQ 463
Cdd:PHA03377   732 LSLHPDQAPPPSHQapySGHEEPQAQQ-------APYPGYWEPRPPQAPYLG-YQEPQAQGVQVSSypgyAGPWGLRAQH 803
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  464 T---GPTKPPSQLPGPAKP-PPQQPGPAKPPPQQPGSAKPPPQQpgSTKPPPQQPGPAKPSPQQPGSTKPPSQQP--GSA 537
Cdd:PHA03377   804 PryrHSWAYWSQYPGHGHPqGPWAPRPPHLPPQWDGSAGHGQDQ--VSQFPHLQSETGPPRLQLSQVPQLPYSQTlvSSS 881
                          330
                   ....*....|..
gi 2462613745  538 KPSAQQPSPAKP 549
Cdd:PHA03377   882 APSWSSPQPRAP 893
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
385-557 3.53e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 63.33  E-value: 3.53e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  385 QPGPKALAQPPGVGKT--PAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQ 462
Cdd:PRK07003   359 EPAVTGGGAPGGGVPArvAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADR 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPG--SAKPS 540
Cdd:PRK07003   439 GDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARapAAASR 518
                          170
                   ....*....|....*..
gi 2462613745  541 AQQPSPAKPSAQQSTKP 557
Cdd:PRK07003   519 EDAPAAAAPPAPEARPP 535
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
314-467 5.17e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 62.59  E-value: 5.17e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  314 PPAQQPGHEKSQPGPAKPPAQPSGltkPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQ 393
Cdd:PRK12323   434 AAARQASARGPGGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPA 510
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  394 PpgvgktpaQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPT 467
Cdd:PRK12323   511 P--------AQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
308-579 5.35e-09

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 62.64  E-value: 5.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  308 QPTPGKPPAQQPGHEKSQPGPAKPPAQpsgLTKPLAqqPGTVKPPVQPPgTTKPPAQPLGPAKPPAQQTGSEKPSSEQPG 387
Cdd:PLN03209   340 KPVPTKPVTPEAPSPPIEEEPPQPKAV---VPRPLS--PYTAYEDLKPP-TSPIPTPPSSSPASSKSVDAVAKPAEPDVV 413
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  388 PKAlAQPPGVgktPAQQPGPAkpPTQQVGTPKPLAQQPGLQSPAkAPGPTKtpvqqPGPGKIPAQQAgpgkTSAQQTGPT 467
Cdd:PLN03209   414 PSP-GSASNV---PEVEPAQV--EAKKTRPLSPYARYEDLKPPT-SPSPTA-----PTGVSPSVSST----SSVPAVPDT 477
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  468 KPPSQLPGPAKPPPQQPGPAKPPPQQpGSAKPPpqqpgsTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPA 547
Cdd:PLN03209   478 APATAATDAAAPPPANMRPLSPYAVY-DDLKPP------TSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
                          250       260       270
                   ....*....|....*....|....*....|..
gi 2462613745  548 KPsaqqstKPVSQTGSGKPLQPPTvSPSAKQP 579
Cdd:PLN03209   551 KP------RPLSPYTMYEDLKPPT-SPTPSPV 575
PDZ2-PTPN13_FRMPD2-like cd06792
PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and ...
4512-4578 5.46e-09

PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of human PTPN13, and related domains. PTPN13, also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1), negatively regulates FAS-mediated apoptosis and NGFR-mediated pro-apoptotic signaling, and may also regulate phosphoinositide 3-kinase (PI3K) signaling. It contains 5 PDZ domains; interaction partners of its second PDZ domain (PDZ2) include the Fas receptor (TNFRSF6) and thyroid receptor-interacting protein 6 (TRIP6). The second PDZ (PDZ2) domain, but not PDZ1 or PDZ3, of FRMPD2 binds to GluN2A and GluN2B, two subunits of N-methyl-d-aspartic acid (NMDA) receptors. Other binding partners of the FRMPDZ2 PDZ2 domain include NOD2, and catenin family members, delta catenin (CTNND2), armadillo repeat gene deleted in velo-cardio-facial syndrome (ARVCF) and p0071 (also known as plakophilin 4; PKP4). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467254 [Multi-domain]  Cd Length: 87  Bit Score: 56.07  E-value: 5.46e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4512 NGLGIRIVGGKEIPGHSGeiGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06792     12 GSLGISVTGGINTSVRHG--GIYVKSLVPGGAAEQDGRIQKGDRLLEVNGVSLEGVTHKQAVECLKN 76
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
445-595 5.57e-09

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 62.35  E-value: 5.57e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  445 GPGKIPAQQAGPGKTSAQQTGPTKPPSqlPGPAKPPPQQPGPAKPPpqQPGSAKPPPQQPGSTKPP-------PQQPGPA 517
Cdd:COG5164      5 GPGKTGPSDPGGVTTPAGSQGSTKPAQ--NQGSTRPAGNTGGTRPA--QNQGSTTPAGNTGGTRPAgnqgatgPAQNQGG 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  518 KPSPQQPGSTKPPSQQPGSaKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPLCNTT 595
Cdd:COG5164     81 TTPAQNQGGTRPAGNTGGT-TPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTT 157
PDZ2_PDZD2-like cd06758
PDZ domain 2 of PDZ domain containing 2 (PDZD2), and related domains; PDZ (PSD-95 ...
4505-4581 5.78e-09

PDZ domain 2 of PDZ domain containing 2 (PDZD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains, and is expressed at exceptionally high levels in the pancreas and certain cancer tissues such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467239 [Multi-domain]  Cd Length: 88  Bit Score: 55.82  E-value: 5.78e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745 4505 KDHTVSGN-GLGIRIVGGKEipGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06758      4 KMHLLKEKgGLGIQITGGKG--SKRGDIGIFVAGVEEGGSADRDGRLKKGDELLMINGQSLIGLSHQEAVAILRSSAS 79
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
465-591 6.43e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 62.42  E-value: 6.43e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  465 GPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAkPPPQQPGSTKPPPQQPGPAKPSP-QQPGSTKPPSQQPGSAKPSAQQ 543
Cdd:PRK14951   370 AEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPA-PAAAPAAAASAPAAPPAAAPPAPvAAPAAAAPAAAPAAAPAAVALA 448
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 2462613745  544 PSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPL 591
Cdd:PRK14951   449 PAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
421-549 6.48e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 62.42  E-value: 6.48e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  421 LAQQPGLQSPAKAPGPTKTPVQ--QPGPGKIPAQQAGPGKTSAqqtgPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAK 498
Cdd:PRK14951   362 LAFKPAAAAEAAAPAEKKTPARpeAAAPAAAPVAQAAAAPAPA----AAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAA 437
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  499 PPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:PRK14951   438 PAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARL 488
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
308-510 7.64e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 62.20  E-value: 7.64e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  308 QPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLtkPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQqtgsekpsseQPG 387
Cdd:PRK12323   403 PAAPAAAPAAAAAARAVAAAPARRSPAPEAL--AAARQASARGPGGAPAPAPAPAAAPAAAARPAAA----------GPR 470
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  388 PKALAQPPgvgktPAQQPGPAKPPTQQVGTPKPLAQQPGlqspakapgptktPVQQPGPGKIPAQQAGPGKTSAQQTGPT 467
Cdd:PRK12323   471 PVAAAAAA-----APARAAPAAAPAPADDDPPPWEELPP-------------EFASPAPAQPDAAPAGWVAESIPDPATA 532
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  468 KPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPP 510
Cdd:PRK12323   533 DPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
649-1010 8.04e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.50  E-value: 8.04e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  649 DLAPVPSSPQPklktAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSakakqPPEADS 728
Cdd:PHA03307    63 DRFEPPTGPPP----GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP-----PSPAPD 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  729 LSKPAPPKEPSVPSEQDKAPVADDKPKQPkmvkpTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPP 808
Cdd:PHA03307   134 LSEMLRPVGSPGPPPAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  809 TGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVdpvqKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTAGQT 888
Cdd:PHA03307   209 RSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSG----CGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGP 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  889 V----PTPQQSPKPQEqsrrfslnlgsitDAPKSQPTTPQETVTGKLFGfgasifSQASNLISTAgqPGPHSQSGPGAPm 964
Cdd:PHA03307   285 AssssSPRERSPSPSP-------------SSPGSGPAPSSPRASSSSSS------SRESSSSSTS--SSSESSRGAAVS- 342
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  965 kQAPAPSQPPTSQGPPKSTgqAPPAPAKSIPVKKETKAPAAEKLEP 1010
Cdd:PHA03307   343 -PGPSPSRSPSPSRPPPPA--DPSSPRKRPRPSRAPSSPAASAGRP 385
PDZ7_MUPP1-PD6_PATJ-like cd06671
PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated ...
4494-4576 8.09e-09

PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 7 of MUPP1 and PDZ domain 6 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ7 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467159 [Multi-domain]  Cd Length: 96  Bit Score: 55.79  E-value: 8.09e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4494 PHARIKITRDSkdhtvsGNGLGIRIVGGKEIPGH--SGEI--GAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTY 4569
Cdd:cd06671      1 PPRRVELWREP------GKSLGISIVGGRVMGSRlsNGEEirGIFIKHVLEDSPAGRNGTLKTGDRILEVNGVDLRNATH 74

                   ....*..
gi 2462613745 4570 EEVQSII 4576
Cdd:cd06671     75 EEAVEAI 81
PRK10263 PRK10263
DNA translocase FtsK; Provisional
241-776 8.14e-09

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 62.41  E-value: 8.14e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  241 SQQPEkIKSQP---PGTGKPIQGP---TQTPQTDHAKL------PLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQ 308
Cdd:PRK10263   353 PAQPT-VAWQPvpgPQTGEPVIAPapeGYPQQSQYAQPavqynePLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEKSQPGPAKPPAQP-------SGLTKPLAQQPGTVKPPVQPPGTT---KPPAQPLGPAKPPAQ--QT 376
Cdd:PRK10263   432 PYYAPAPEQPVAGNAWQAEEQQSTFAPqstyqteQTYQQPAAQEPLYQQPQPVEQQPVvepEPVVEETKPARPPLYyfEE 511
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  377 GSEKPSSEQPGPKALAQPPgvgKTPAQQPGPAKPPTQqvgtpkplaqqpgLQSPAKAPGPTKTPVQQPGPGKIpaQQAGP 456
Cdd:PRK10263   512 VEEKRAREREQLAAWYQPI---PEPVKEPEPIKSSLK-------------APSVAAVPPVEAAAAVSPLASGV--KKATL 573
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  457 GKTSAqqTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGST---KPPPQQPGPAKPSPQQPGSTKPPSQQ 533
Cdd:PRK10263   574 ATGAA--ATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASygiKLPSQRAAEEKAREAQRNQYDSGDQY 651
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  534 PGSAKPSAQQPSPAKP-SAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTicplcNTTELLLHVPEKANFNTCTE 612
Cdd:PRK10263   652 NDDEIDAMQQDELARQfAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQT-----QQQRYSGEQPAGANPFSLDD 726
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  613 CQttvcslcgFNPNPHLTEVKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVtttsavsksSPQPQQTSPKKDAAPKQ 692
Cdd:PRK10263   727 FE--------FSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPV---------APQPQYQQPQQPVAPQP 789
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  693 DLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKP-APPKEPSVPSEQDKA--PV----ADDKPKQ-PKMVKPTT 764
Cdd:PRK10263   790 QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQyQQPQQPVAPQPQDTLlhPLlmrnGDSRPLHkPTTPLPSL 869
                          570
                   ....*....|..
gi 2462613745  765 DLVSSSSATTKP 776
Cdd:PRK10263   870 DLLTPPPSEVEP 881
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
307-550 8.28e-09

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 61.71  E-value: 8.28e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSgltkplaQQPGTVKPPVQP-PGTTKPPAQPlGPAKPPAQQtgseKPSSEQ 385
Cdd:NF033839   304 QPEKKEVKPEPETPKPEVKPQLEKPKPEVK-------PQPEKPKPEVKPqLETPKPEVKP-QPEKPKPEV----KPQPEK 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGKTPAQQPGPAKPPTQqvgtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPgKTSAQQTG 465
Cdd:NF033839   372 PKPEVKPQPETPKPEVKPQPEKPKPEVK----PQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKP-KPEVKPQP 446
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  466 PTKPPSQLPGPAKPPPQ-QPGPAKPPPQqpgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQP 544
Cdd:NF033839   447 EKPKPEVKPQPETPKPEvKPQPEKPKPE----VKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKATNKP 522

                   ....*.
gi 2462613745  545 SPAKPS 550
Cdd:NF033839   523 KKSLPS 528
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
663-992 1.41e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 61.63  E-value: 1.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  663 TAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAPPKEPSVPS 742
Cdd:PTZ00449   522 KAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPR 601
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  743 EQDKaPVADDKPKQPKMvkptTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQsfPPTGEKVsPFDSK--- 819
Cdd:PTZ00449   602 SAQR-PTRPKSPKLPEL----LDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPK--PPKSPKP-PFDPKfke 673
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  820 ----AIPRPASDSKIISHPGPSSESkgQKQVDPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQS 895
Cdd:PTZ00449   674 kfydDYLDAAAKSKETKTTVVLDES--FESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFT 751
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  896 PkPQEQSRRFSlnlgsitdapKSQPTTPQETVTGKLFgfgasifsqasnlistaGQPGPHSQSG-PGAPMKQAPAPsqpp 974
Cdd:PTZ00449   752 P-PEEERTFFH----------ETPADTPLPDILAEEF-----------------KEEDIHAETGePDEAMKRPDSP---- 799
                          330
                   ....*....|....*...
gi 2462613745  975 tSQGPPKSTGQAPPAPAK 992
Cdd:PTZ00449   800 -SEHEDKPPGDHPSLPKK 816
PDZ_SYNJ2BP-like cd06709
PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 ...
4498-4572 1.41e-08

PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SYNJ2BP, and related domains. SYNJ2BP (also known as mitochondrial outer membrane protein 25, OMP25) regulates endocytosis of activin type 2 receptor kinases through the Ral/RALBP1-dependent pathway and may be involved in suppression of activin-induced signal transduction. Binding partners of the SYNJ2BP PDZ domain include activin type II receptors (ActR-II), and SYNJ2. SYNJ2BP interacts with the PDZ binding motif of the Notch Delta-like ligand 1 (DLL1) and DLL4, promoting Delta-Notch signaling, and inhibiting sprouting angiogenesis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SYNJ2BP-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467193 [Multi-domain]  Cd Length: 86  Bit Score: 54.61  E-value: 1.41e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745 4498 IKITRDskdhtvsGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06709      3 ITLKRG-------PSGLGFNIVGGTDQPYIPNDSGIYVAKIKEDGAAAIDGRLQEGDKILEINGQSLENLTHQDA 70
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
312-583 1.71e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 61.24  E-value: 1.71e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  312 GKPPAQQPGHEKSQPGPAKPpAQPSGLtkplaqqPGTVKPPvQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKAL 391
Cdd:PTZ00449   547 GKPGETKEGEVGKKPGPAKE-HKPSKI-------PTLSKKP-EFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPEL 617
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  392 AQPPGVGKTPAQQPGPAKPPTQQvgTPKPLAQQPGLQSPaKAPGPTKTPvQQPGPGKIPAQQAGPGKTSAQQTGPTKPPS 471
Cdd:PTZ00449   618 LDIPKSPKRPESPKSPKRPPPPQ--RPSSPERPEGPKII-KSPKPPKSP-KPPFDPKFKEKFYDDYLDAAAKSKETKTTV 693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  472 QLPGPAKPPPQQPGPAKP-PPQQPGSAKPP--PQQPGSTKPPPQQPGPAKPSPQQPgSTKPPSQQPGSAKPSAQQPSPAK 548
Cdd:PTZ00449   694 VLDESFESILKETLPETPgTPFTTPRPLPPklPRDEEFPFEPIGDPDAEQPDDIEF-FTPPEEERTFFHETPADTPLPDI 772
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 2462613745  549 PSAQQSTKPV-SQTGS-GKPLQPPTvSPSAKQPPSQG 583
Cdd:PTZ00449   773 LAEEFKEEDIhAETGEpDEAMKRPD-SPSEHEDKPPG 808
PDZ_GOPC-like cd06800
PDZ domain of Golgi-associated PDZ and coiled-coil motif-containing protein (GOPC), and ...
4513-4585 2.26e-08

PDZ domain of Golgi-associated PDZ and coiled-coil motif-containing protein (GOPC), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of GOPC and related domains. GOPC, also known as PIST (PDZ domain protein interacting specifically with TC10), FIG (fused in glioblastoma), and CAL (CFTR-associated ligand), regulates the trafficking of a wide array of proteins, including small GTPases, receptors, and cell surface molecules such as cadherin 23 and CFTR. It may regulate CFTR chloride currents and acid-sensing ASIC3 currents by modulating cell surface expression of both channels, and may play a role in autophagy. Interaction partners of the GOPC PDZ domains include: FZD5, FZD8, ASIC3, CFTR, MUC3, ARFRP1, Ggamma13, neuroligin, and Stargazin. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GOPC-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467261 [Multi-domain]  Cd Length: 83  Bit Score: 53.91  E-value: 2.26e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4513 GLGIRIVGGKEipgHSGEIgaYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEI 4585
Cdd:cd06800     12 GLGISITGGKE---HGVPI--LISEIHEGQPADRCGGLYVGDAILSVNGIDLRDAKHKEAVTILSQQRGEITL 79
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
322-583 2.48e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 59.92  E-value: 2.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  322 EKSQPGPAKP--------PAQPSGLTKPlaQQPGTVKPPVQPPGTTKP-----PAQPLGPAKPPAQQtGSEKPSSE--QP 386
Cdd:NF038329   118 EKGEPGPAGPagpageqgPRGDRGETGP--AGPAGPPGPQGERGEKGPagpqgEAGPQGPAGKDGEA-GAKGPAGEkgPQ 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGvGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPT-----KTPVQQPGPGKIPAQQAGPGKT-- 459
Cdd:NF038329   195 GPRGETGPAG-EQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTgedgpQGPDGPAGKDGPRGDRGEAGPDgp 273
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  460 --SAQQTGPTKPPSQ--------LPGPAKPPPQQPGPAKP-------PPQQPGSAKPPPQ--QPGstKPPPQQPG-PAKP 519
Cdd:NF038329   274 dgKDGERGPVGPAGKdgqngkdgLPGKDGKDGQNGKDGLPgkdgkdgQPGKDGLPGKDGKdgQPG--KPAPKTPEvPQKP 351
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  520 SpQQPGSTKPPsQQPGSAKPSAqqPSPAKPSAQQSTKPvsQTGSGKPL-QPPTVSPSAKQPPSQG 583
Cdd:NF038329   352 D-TAPHTPKTP-QIPGQSKDVT--PAPQNPSNRGLNKP--QTQGGNQLaKTPAAHDTHRQLPATG 410
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
453-557 2.72e-08

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 59.53  E-value: 2.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  453 QAGPGKTSAQQTGPTKPPSQLPGPakPPPQQPGPAKPPPQQPGSAKPPPqqPGSTKPPPQQPGPAKPSPQQPgSTKPPSQ 532
Cdd:NF040983    69 QIKKGDFKLKPVGDRTLPNKVPPP--PPPPPPPPPPPPTPPPPPPPPPP--PPPPSPPPPPPPSPPPSPPPP-TTTPPTR 143
                           90       100
                   ....*....|....*....|....*
gi 2462613745  533 QPgsakPSAQQPSPAKPSAQQSTKP 557
Cdd:NF040983   144 TT----PSTTTPTPSMHPIQPTQLP 164
PDZ3_Dlg1-2-4-like cd06795
PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4511-4571 3.25e-08

PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Drosophila Dlg1, human Dlg1, 2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197; SAP-97), Dlg2 (also known as channel-associated protein of synapse-110; postsynaptic density protein 93, PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95; synapse-associated protein 90, SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling, regulating surface expression of NMDA receptors in dorsal horn neurons of the spinal cord; it interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. The Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development; postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467257 [Multi-domain]  Cd Length: 91  Bit Score: 53.90  E-value: 3.25e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745 4511 GNGLGIRIVGGKEipghsGEiGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEE 4571
Cdd:cd06795     11 STGLGFNIVGGED-----GE-GIFISFILAGGPADLSGELRRGDQILSVNGVDLRNATHEQ 65
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
354-604 3.72e-08

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 60.09  E-value: 3.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  354 QPPgttKPPAQPLGPAKPPAQQTGSEK---PSSEQPGPKALAQPPGVGKTPA-QQPGPAKPptqqvgtpkplaqqpglQS 429
Cdd:PTZ00449   509 EPP---EGPEASGLPPKAPGDKEGEEGeheDSKESDEPKEGGKPGETKEGEVgKKPGPAKE-----------------HK 568
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKTPvQQPGPGKIPAQQAGPGKTSAQQT--GPTKPPS-QLPGPAKPPPQQPGPAKP-PPQQPgsakPPPQQPG 505
Cdd:PTZ00449   569 PSKIPTLSKKP-EFPKDPKHPKDPEEPKKPKRPRSaqRPTRPKSpKLPELLDIPKSPKRPESPkSPKRP----PPPQRPS 643
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  506 StkppPQQP-GPAKP-SPQQPGSTKPPSqQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQg 583
Cdd:PTZ00449   644 S----PERPeGPKIIkSPKPPKSPKPPF-DPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTT- 717
                          250       260
                   ....*....|....*....|.
gi 2462613745  584 lPKTICPLCNTTELLLHVPEK 604
Cdd:PTZ00449   718 -PRPLPPKLPRDEEFPFEPIG 737
PHA03377 PHA03377
EBNA-3C; Provisional
312-580 3.79e-08

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 60.07  E-value: 3.79e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  312 GKPPAQQPGHEKSQPGP-------AKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPP-AQPLGPAKPPAQQTGSEKPSS 383
Cdd:PHA03377   561 GPPKASPPVMAPPSTGPrvmatpsTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPsSAPRDMAPSVVRMFLRERLLE 640
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  384 EQPGPK--------------ALAQPPGVGKTPAQQPGPAKP---------PTQQVGTPKPL----------------AQQ 424
Cdd:PHA03377   641 QSTGPKpksfwemragrdgsGIQQEPSSRRQPATQSTPPRPswlpsvfvlPSVDAGRAQPSeeshlssmsptqpishEEQ 720
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  425 PGLQSPAKAPGPTKTPVQQPGPgkiPAQQAGPGKTSAQQTGPTKPPSQLPgpakPPPQQP--GPAKPPPQQ------PGS 496
Cdd:PHA03377   721 PRYEDPDDPLDLSLHPDQAPPP---SHQAPYSGHEEPQAQQAPYPGYWEP----RPPQAPylGYQEPQAQGvqvssyPGY 793
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  497 AKPPPQQPG------STKPPPQQPGPAKP-SPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQP 569
Cdd:PHA03377   794 AGPWGLRAQhpryrhSWAYWSQYPGHGHPqGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPY 873
                          330
                   ....*....|.
gi 2462613745  570 PTVSPSAKQPP 580
Cdd:PHA03377   874 SQTLVSSSAPS 884
PDZ_MPP-like cd06726
PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 ...
4535-4581 4.39e-08

PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP1-7 (also known as MAGUK p55 subfamily members 1-7), and related domains. MPPs comprise a subfamily of a larger group of multidomain proteins, namely, membrane-associated guanylate kinases (MAGUKs). MPPs form diverse protein complexes at the cell membranes, which are involved in a wide range of cellular processes, including establishing proper cell structure, polarity and cell adhesion. MPPs have only one PDZ domain. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467208 [Multi-domain]  Cd Length: 80  Bit Score: 53.04  E-value: 4.39e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462613745 4535 IAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06726     26 VARILHGGMAHRSGLLHVGDEILEINGIPVSGKTVDELQKLLSSLSG 72
PDZ1_LNX1_2-like cd06677
PDZ domain 1 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4498-4578 5.69e-08

PDZ domain 1 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467165 [Multi-domain]  Cd Length: 89  Bit Score: 53.02  E-value: 5.69e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4498 IKITRDSKDhtvsgNGLGIRIVGGKEIPghsgEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIS 4577
Cdd:cd06677      6 IEIHRSDPY-----EELGISIVGGNDTP----LINIVIQEVYRDGVIARDGRLLPGDQILEVNGVDISNVTHSQARSVLR 76

                   .
gi 2462613745 4578 Q 4578
Cdd:cd06677     77 Q 77
PDZ1_MUPP1-like cd06689
PDZ domain 1 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4479-4585 6.87e-08

PDZ domain 1 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467176 [Multi-domain]  Cd Length: 102  Bit Score: 53.40  E-value: 6.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4479 EQIIQM--NGKTMHYIfphariKITRDSkdhtvsGNGLGIRIVGGKEipGHSGEIGAYIAKILPGGSAEQTGKLMEGMQV 4556
Cdd:cd06689      3 EQAIQSmaQGRQVEYI------ELEKPE------SGGLGFSVVGLKS--ENRGELGIFVQEIQPGSVAARDGRLKENDQI 68
                           90       100       110
                   ....*....|....*....|....*....|
gi 2462613745 4557 LEWNGIPL-TSKTYEEVQSIISQQSGEAEI 4585
Cdd:cd06689     69 LAINGQPLdQSISHQQAIAILQQAKGSVEL 98
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
458-563 8.58e-08

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 57.99  E-value: 8.58e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  458 KTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQpgSTKPPSQQP--G 535
Cdd:NF040983    83 RTLPNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST--TTPTPSMHPiqP 160
                           90       100
                   ....*....|....*....|....*...
gi 2462613745  536 SAKPSAQQPSPAKPSAQQSTKPVSQTGS 563
Cdd:NF040983   161 TQLPSIPNATPTSGSATNVTINFNSTGA 188
PHA03377 PHA03377
EBNA-3C; Provisional
352-550 1.01e-07

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 58.53  E-value: 1.01e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  352 PVQPPGTTKPPAQPLGPAKPPAQQTgsEKPSSEQPGPKALAQPPGVGKTPaqqPGPAKPP-TQQVGTPKPLAQQPGLQSP 430
Cdd:PHA03377   412 PWRKPRTLPWPTPKTHPVKRTLVKT--SGRSDEAEQAQSTPERPGPSDQP---SVPVEPAhLTPVEHTTVILHQPPQSPP 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  431 AKAPGPTKTPVQQPGPGKI-----------------------PAQQAGPGKTSAQQTGPTKPPSQLPgPAKPPPQQPGPA 487
Cdd:PHA03377   487 TVAIKPAPPPSRRRRGACVvydddiievidvetteeeesvtqPAKPHRKVQDGFQRSGRRQKRATPP-KVSPSDRGPPKA 565
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  488 KPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPS 550
Cdd:PHA03377   566 SPPVMAPPSTGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPS 628
PDZ2_Par3-like cd23058
PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
4484-4575 1.05e-07

PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP, Drosophila Bazooka) and related domains. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par3 and the Par complex include Par3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467271 [Multi-domain]  Cd Length: 93  Bit Score: 52.64  E-value: 1.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4484 MNGKTMHyifpharIKITRDSkdhtvsgNGLGIRIVGGKEIPGHSGEIgaYIAKILPGGSAEQTGKLMEGMQVLEWNGIP 4563
Cdd:cd23058      1 KIGKKLH-------IQLKKGP-------EGLGFSITSRDNPTGGSGPI--YIKNILPKGAAIQDGRLKAGDRLLEVNGVD 64
                           90
                   ....*....|..
gi 2462613745 4564 LTSKTYEEVQSI 4575
Cdd:cd23058     65 VTGKTQEEVVSL 76
PDZ2_Scribble-like cd06703
PDZ domain 2 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4511-4580 1.12e-07

PDZ domain 2 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467187 [Multi-domain]  Cd Length: 92  Bit Score: 52.26  E-value: 1.12e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745 4511 GNGLGIRIVGGKE-IPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQS 4580
Cdd:cd06703     11 GKGLGFSIAGGKGsTPFRDGDEGIFISRITEGGAADRDGKLQVGDRVLSINGVDVTEARHDQAVALLTSSS 81
dnaA PRK14086
chromosomal replication initiator protein DnaA;
351-585 1.12e-07

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 58.30  E-value: 1.12e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  351 PPVQPPGTTKPPAQPLGPAKPPAQQTgsEKPSSEQPGPKALaqpPGVGKTPAQQPGPAKPPTQQVGTPKPLAqqpglqsp 430
Cdd:PRK14086    80 RPIRIAITVDPSAGEPAPPPPHARRT--SEPELPRPGRRPY---EGYGGPRADDRPPGLPRQDQLPTARPAY-------- 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  431 akapgPTKTPVQQPGPGKIPAQQAGPgktsaQQtgptkPPSQLPGPAKPPPqqpgPAKPPPQQPGSAKPPPQQPGSTKPP 510
Cdd:PRK14086   147 -----PAYQQRPEPGAWPRAADDYGW-----QQ-----QRLGFPPRAPYAS----PASYAPEQERDREPYDAGRPEYDQR 207
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  511 PQQPGPAKPSPQQPG--STKPPSQQPGS-AKPSAQQPSPAKPSAQQstkpvsqtgsgKPLQPPTVSPSAKQPPSQGLP 585
Cdd:PRK14086   208 RRDYDHPRPDWDRPRrdRTDRPEPPPGAgHVHRGGPGPPERDDAPV-----------VPIRPSAPGPLAAQPAPAPGP 274
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
386-517 1.14e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 58.19  E-value: 1.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGKTPAQQPGP---AKPPTQQVGTPKPLAQQPGLQSPAKAPgptkTPVQQPGPGKIPAQQAGPGKTSAQ 462
Cdd:PRK14951   366 PAAAAEAAAPAEKKTPARPEAAapaAAPVAQAAAAPAPAAAPAAAASAPAAP----PAAAPPAPVAAPAAAAPAAAPAAA 441
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  463 QTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPA 517
Cdd:PRK14951   442 PAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEGDV 496
PDZ_MPP3-MPP4-MPP7-like cd06799
PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; ...
4516-4581 1.27e-07

PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP3, MPP4, and MPP7, and related domains. MPP3 (also known as MAGUK p55 subfamily member 3, erythrocyte membrane protein p55, or EMP55), MPP4 (also known as MAGUK p55 subfamily member 4 or Discs large homolog 6), and MPP7 (also known as MAGUK p55 subfamily member 7) are membrane-associated guanylate kinase (MAGUK)-like proteins. MPP3 is part of a cell adhesion protein complex including tumor suppressor CADM1 and actin-binding protein 4.1B. Participation in the Crumbs cell polarity complex has also been demonstrated for MPP7 in epithelial cells, and for MPP3 and MPP4 in the retina. MPP4 is needed for proper localization of plasma membrane calcium ATPases and maintenance of calcium homeostasis at the rod photoreceptor synaptic terminals. Binding partners of the MPP3 PDZ domain include nectin-3, serotonin 5-hydroxytryptamine, 5-HT(2C) receptor, and a cell adhesion protein, TSLC1 (tumor suppressor in lung cancer 1); fragments of MPP4 having the PDZ domain bind CRB (PDZ-SH3-GUK) and GABA transporter GAT1 (PDZ-SH3). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467260 [Multi-domain]  Cd Length: 81  Bit Score: 51.86  E-value: 1.27e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4516 IRIVGGKEIPG-------HSGEIgaYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06799      3 VRLVKNNEPLGatikrdeKTGAI--VVARIMRGGAADRSGLIHVGDELREVNGISVEGKDPEEVIQILANSQG 73
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
329-585 1.66e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 58.26  E-value: 1.66e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  329 AKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPA---KPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQP 405
Cdd:PHA03307    22 PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACdrfEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  406 GPAKP--PTQQVGTPKPLAQQPGlqSPAKAPGPTKTPVQQPGPGKIPAQQ---AGPGKTSAQQTGPTKPPSQLPGPAKPP 480
Cdd:PHA03307   102 REGSPtpPGPSSPDPPPPTPPPA--SPPPSPAPDLSEMLRPVGSPGPPPAaspPAAGASPAAVASDAASSRQAALPLSSP 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  481 PQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQ 560
Cdd:PHA03307   180 EETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPR 259
                          250       260
                   ....*....|....*....|....*
gi 2462613745  561 TGSGKPLQPPTVSPSAKQPPSQGLP 585
Cdd:PHA03307   260 PAPITLPTRIWEASGWNGPSSRPGP 284
PDZ2_Dlg1-2-4-like cd06724
PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4511-4580 1.69e-07

PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Dlg1, human Dlg1,2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197 or SAP-97), Dlg2 (also known as channel-associated protein of synapse-110, postsynaptic density protein 93, or PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95, synapse-associated protein 90, or SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling. It regulates surface expression of NMDA receptors in dorsal horn neurons of the spinal cord, and it also interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467207 [Multi-domain]  Cd Length: 85  Bit Score: 51.50  E-value: 1.69e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4511 GNGLGIRIVGGK---EIPGHSGeigAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQS 4580
Cdd:cd06724      8 PKGLGFSIAGGVgnqHIPGDNG---IYVTKIIEGGAAQKDGRLQVGDKLLAVNDVSLEEVTHEEAVAALKNTS 77
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
4503-4582 1.79e-07

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 51.51  E-value: 1.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4503 DSKDHTVSGNGLGIRIVGGKEipghSGEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGE 4582
Cdd:pfam00595    1 QVTLEKDGRGGLGFSLKGGSD----QGDPGIFVSEVLPGGAAEAGG-LKVGDRILSINGQDVENMTHEEAVLALKGSGGK 75
PDZ3_MAGI-1_3-like cd06733
PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
4512-4572 1.88e-07

PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as beta-strands A, -B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467215 [Multi-domain]  Cd Length: 85  Bit Score: 51.46  E-value: 1.88e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745 4512 NGLGIRIVGGKEiPGHSGEIGAyiakILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06733     11 TGFGFRILGGTE-EGSQVSIGA----IVPGGAADLDGRLRTGDELLSVDGVNVVGASHHKV 66
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
490-1049 2.24e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 57.39  E-value: 2.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  490 PPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAkpsAQQPSPAKPSaqqstkpvsqtgsgKPLQP 569
Cdd:PTZ00449   510 PPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV---GKKPGPAKEH--------------KPSKI 572
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  570 PTVSpsaKQPPSQGLPKTicplcnttelllhvpekanfntctecqttvcslcgfnpnphltevkewlclncqmkralggd 649
Cdd:PTZ00449   573 PTLS---KKPEFPKDPKH-------------------------------------------------------------- 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  650 lapvPSSPQ-PKLKTAPVTTTSAVSKSSPQPQQTS--PKKDAAPKQDLSKAPEPKKPPPLvkqptlhgSPsakaKQPPEA 726
Cdd:PTZ00449   588 ----PKDPEePKKPKRPRSAQRPTRPKSPKLPELLdiPKSPKRPESPKSPKRPPPPQRPS--------SP----ERPEGP 651
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  727 DSLSKPAPPKEPSVPSEQD-KAPVADD---KPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSakP 802
Cdd:PTZ00449   652 KIIKSPKPPKSPKPPFDPKfKEKFYDDyldAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRD--E 729
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  803 SQSFPPTGEKVSPF--DSKAIPRPASDSKIISHPGPSSESKGqkqVDPVQKKEEPKKAQTKMSPKPDAKPMP--KGSPTP 878
Cdd:PTZ00449   730 EFPFEPIGDPDAEQpdDIEFFTPPEEERTFFHETPADTPLPD---ILAEEFKEEDIHAETGEPDEAMKRPDSpsEHEDKP 806
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  879 PGPRPTAgqtvptpqqsPKPQEQSRRFSLNlgsitdapksqpTTPQETVTGKLFgfgasifsqasnlistagqpgpHSQS 958
Cdd:PTZ00449   807 PGDHPSL----------PKKRHRLDGLALS------------TTDLESDAGRIA----------------------KDAS 842
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  959 GPGAPMKQAPAPSQPPTSQGPPKStgqapPAPAKSIPVKKE-TKAPAAEKLEPKAEQAPTVKRTETEKKP--PPIKDSKS 1035
Cdd:PTZ00449   843 GKIVKLKRSKSFDDLTTVEEAEEM-----GAEARKIVVDDDgTEADDEDTHPPEEKHKSEVRRRRPPKKPskPKKPSKPK 917
                          570
                   ....*....|....
gi 2462613745 1036 LTAEPQKAVLPTKL 1049
Cdd:PTZ00449   918 KPKKPDSAFIPSII 931
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
391-535 2.36e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 57.03  E-value: 2.36e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  391 LAQPPGVGkTPAQQPGPAKPPTQqvgtpkplaqqPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPP 470
Cdd:PRK14951   362 LAFKPAAA-AEAAAPAEKKTPAR-----------PEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP 429
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  471 SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPG 535
Cdd:PRK14951   430 AAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PHA03379 PHA03379
EBNA-3A; Provisional
234-582 2.42e-07

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 57.38  E-value: 2.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  234 GTPKS-ISSQQPEKIKS--QPPGTGkPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPT 310
Cdd:PHA03379   414 GTPRPpVEKPRPEVPQSleTATSHG-SAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVVQDGR 492
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  311 PGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPL-------GPAKPPAQQTGSEK--P 381
Cdd:PHA03379   493 PACAPVPAPAGPIVRPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVALERPVCPAppliamqGPGETSGIVRVRERwrP 572
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  382 SSEQPGP-------------------------KALAQPPGVGKTPAQQP--GPAKPPTQQVgtpkPLAQQPGLQSPAKAP 434
Cdd:PHA03379   573 APWTPNPprspsqmsvrdrlarlraeaqpyqaSVEVQPPQLTQVSPQQPmeYPLEPEQQMF----PGSPFSQVADVMRAG 648
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  435 GptkTPVQQPGPGKIPAQQagpgktSAQQTGPTKPPSQLPGPAKP-PPQQPGPAKPPPQQP-----GSAKPPPQQPGStk 508
Cdd:PHA03379   649 G---VPAMQPQYFDLPLQQ------PISQGAPLAPLRASMGPVPPvPATQPQYFDIPLTEPinqgaSAAHFLPQQPME-- 717
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  509 pPPQQPgpakPSPQQPGSTKPPSQQPGSAKPSA-----QQP----SPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQP 579
Cdd:PHA03379   718 -GPLVP----ERWMFQGATLSQSVRPGVAQSQYfdlplTQPinhgAPAAHFLHQPPMEGPWVPEQWMFQGAPPSQGTDVV 792

                   ...
gi 2462613745  580 PSQ 582
Cdd:PHA03379   793 QHQ 795
SSDP pfam04503
Single-stranded DNA binding protein, SSDP; This is a family of eukaryotic single-stranded DNA ...
355-555 2.56e-07

Single-stranded DNA binding protein, SSDP; This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.


Pssm-ID: 461334 [Multi-domain]  Cd Length: 293  Bit Score: 55.73  E-value: 2.56e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  355 PPGTTKPpAQPLGPA--KPPAQQTGSEKPSSEQPGPKALAQPPGVGKTP---AQQPGPAKP----PTQQVGTPKPLAQQP 425
Cdd:pfam04503   29 PPGDGMP-GGPMPPGffQSPPSHPSSQPSPHAQPPPHNPATMMGPHSQPfmgPRYPGGPRPsvrmPQQGNDFNGPPGQQP 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  426 GLqspakaPGPTKTPVQQPGPGKIPAQQAGPGkTSAQQTGPTKPPSQLPGPAKPPPQQP-GPAKPPPQQPG-SAKPPPQQ 503
Cdd:pfam04503  108 MM------PNSMDPTRPGGHPNMGGPMQRMNP-PRGPGMGPMGPQSYGPGMRGPPPNSTdGPGGMPPMNMGpGGRRPWPQ 180
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  504 PGSTKPPPQqpgpakpSPQQPGSTKPPsqqPGSAKPSAqqPSPAKPSAQQST 555
Cdd:pfam04503  181 PNASNPLPY-------SSSSPGSYGGP---PGGGGPPG--PTPIMPSPQDST 220
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
425-557 3.22e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 56.71  E-value: 3.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  425 PGLQSPAKAPGPTKT---PVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPP 501
Cdd:PRK14971   370 SGGRGPKQHIKPVFTqpaAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRP 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  502 QQPGSTKPPPqqpgpakpsPQQPGSTKPPSQQPgsAKPSAQQPSPAKPSAQQSTKP 557
Cdd:PRK14971   450 AQFKEEKKIP---------VSKVSSLGPSTLRP--IQEKAEQATGNIKEAPTGTQK 494
PDZ4_Scribble-like cd06701
PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4511-4585 3.77e-07

PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467185 [Multi-domain]  Cd Length: 98  Bit Score: 51.07  E-value: 3.77e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4511 GNGLGIRIVGGkeIPGHSG------EIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAE 4584
Cdd:cd06701     14 GEKLGISIRGG--AKGHAGnpldptDEGIFISKINPDGAAARDGRLKVGQRILEVNGQSLLGATHQEAVRILRSVGDTLT 91

                   .
gi 2462613745 4585 I 4585
Cdd:cd06701     92 L 92
PDZ3_Scribble-like cd06702
PDZ domain 3 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4507-4577 3.86e-07

PDZ domain 3 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467186 [Multi-domain]  Cd Length: 89  Bit Score: 50.72  E-value: 3.86e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745 4507 HTVSGNG-LGIRIVGGKEIPGH---SGEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEE-VQSIIS 4577
Cdd:cd06702      4 HLVKAGGpLGLSIVGGSDHSSHpfgVDEPGIFISKVIPDGAAAKSG-LRIGDRILSVNGKDLRHATHQEaVSALLS 78
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
411-534 3.91e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 53.12  E-value: 3.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  411 PTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQqagPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPP 490
Cdd:pfam15240   38 QSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQ---GGKQKPQGPPPQGGPRPPPGKPQGPPPQGGNQQQG 114
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2462613745  491 PQQPGSAKPPPQQPGStkPPPQQPGPAKPSPQQPGSTKPPSQQP 534
Cdd:pfam15240  115 PPPPGKPQGPPPQGGG--PPPQGGNQQGPPPPPPGNPQGPPQRP 156
PHA03247 PHA03247
large tegument protein UL36; Provisional
823-1389 4.15e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 4.15e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  823 RPAsDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMP-----------------KGSPTPP----GP 881
Cdd:PHA03247  2482 RPA-EARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwirgleelasddAGDPPPPlppaAP 2560
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  882 RPTAGQTVPTPQQSPKPQEQSRRfslnlgsitdAPKSQPTTPQETVTGKlfgfgasifsqasnliSTAGQPGPHSQSGPG 961
Cdd:PHA03247  2561 PAAPDRSVPPPRPAPRPSEPAVT----------SRARRPDAPPQSARPR----------------APVDDRGDPRGPAPP 2614
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  962 APMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKaeqaptvKRTETEKKPP----PIKDSKSLT 1037
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRP-------RRARRLGRAAqassPPQRPRRRA 2687
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1038 AEPQKAVLpTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCTecknqvcnlcGFNPTPHLTEIQEwlclncQTQRAISG 1117
Cdd:PHA03247  2688 ARPTVGSL-TSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR----------QASPALPAAPAPP------AVPAGPAT 2750
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1118 QLGDIRKM-PPAPSGPKASPMPVPTESSSQKTAVPPQVklvkkqeQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQE 1196
Cdd:PHA03247  2751 PGGPARPArPPTTAGPPAPAPPAAPAAGPPRRLTRPAV-------ASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS 2823
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1197 ESKLEKDKASALQEKKPLPEEkkliPEEEKIRSE---------EKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKS 1267
Cdd:PHA03247  2824 PAGPLPPPTSAQPTAPPPPPG----PPPPSLPLGgsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1268 QVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLPSGTPQSLPKEDDKTTKTIKEQPQPPCTAK--------PDQVEPGKEKT 1339
Cdd:PHA03247  2900 LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvPGRVAVPRFRV 2979
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1340 EKEDDKSDTSSSQQPkSPQGLSDTGYSSDGISSSLGEIPSLIPTDEKDIL 1389
Cdd:PHA03247  2980 PQPAPSREAPASSTP-PLTGHSLSRVSSWASSLALHEETDPPPVSLKQTL 3028
PDZ1_PTPN13_FRMPD2-like cd06694
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ ...
4497-4581 4.66e-07

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], FRMPD2 (also known as PDZ domain-containing protein 4; PDZ domain-containing protein 5C), and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467180 [Multi-domain]  Cd Length: 92  Bit Score: 50.47  E-value: 4.66e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSKdhtvsgNGLGIRIVGGkEIPGhSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTY------- 4569
Cdd:cd06694      4 IVTLKKDPQ------KGLGFTIVGG-ENSG-SLDLGIFVKSIIPGGPADKDGRIKPGDRIIAINGQSLEGKTHhaaveii 75
                           90
                   ....*....|....*.
gi 2462613745 4570 ----EEVQSIISQQSG 4581
Cdd:cd06694     76 qnapDKVELIISQPKS 91
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
332-505 6.95e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 55.49  E-value: 6.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  332 PAQPSGLT----KPLAQQPGTvKPPVQPPGTTKPPAQPlgPAKPPAQqtgseKPSSEQPGPKALAQPPgvgktPAQQPGP 407
Cdd:PRK14951   348 PDEYAALTmvllRLLAFKPAA-AAEAAAPAEKKTPARP--EAAAPAA-----APVAQAAAAPAPAAAP-----AAAASAP 414
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  408 AKPPTQQVGTPkplAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQqagpgktsaqqtgptKPPSQLPGPAKPPPQQPGPA 487
Cdd:PRK14951   415 AAPPAAAPPAP---VAAPAAAAPAAAPAAAPAAVALAPAPPAQAA---------------PETVAIPVRVAPEPAVASAA 476
                          170
                   ....*....|....*...
gi 2462613745  488 KPPPQQPGSAKPPPQQPG 505
Cdd:PRK14951   477 PAPAAAPAAARLTPTEEG 494
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
647-1029 7.78e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 55.76  E-value: 7.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  647 GGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEA 726
Cdd:PRK07764   403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAP 482
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  727 DSLSKPAPPKEPSVPSE----QDKAPVADDKPKQPKMVkpttDLVSSSSATTKPDIpSSKVQSQAEEKTTPPLKTDSakp 802
Cdd:PRK07764   483 APPAAPAPAAAPAAPAApaapAGADDAATLRERWPEIL----AAVPKRSRKTWAIL-LPEATVLGVRGDTLVLGFST--- 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  803 sqsfPPTGEKVSPFDSKAIPRPA------SDSKIISHPGPSSESKG-QKQVDPVQKKEEPKKAQTKMSPKPDAKPMPKGS 875
Cdd:PRK07764   555 ----GGLARRFASPGNAEVLVTAlaeelgGDWQVEAVVGPAPGAAGgEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPA 630
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  876 PTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVTGklfgfGASIFSQASnlistAGQPGPH 955
Cdd:PRK07764   631 GAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA-----PAPAAPAAP-----AGAAPAQ 700
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  956 SQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPP 1029
Cdd:PRK07764   701 PAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAP 774
PDZ2_LNX1_2-like cd06678
PDZ domain 2 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4507-4582 8.74e-07

PDZ domain 2 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467166 [Multi-domain]  Cd Length: 82  Bit Score: 49.55  E-value: 8.74e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745 4507 HTVSGNGLGIRIVGGKEIPGhsgeigAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIsQQSGE 4582
Cdd:cd06678      6 NKRDGEQLGIKLVRKKDEPG------VFILDLLEGGLAARDGRLKSDDRVLAINGQDLRHGTPEQAAQII-QASGE 74
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
374-518 9.85e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 51.96  E-value: 9.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  374 QQTGSEKPSSEQPGPKALAQPPGvGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPaKAPGPTKTPVQQPGPGKIPAQQ 453
Cdd:pfam15240   30 SLISEEEGQSQQGGQGPQGPPPG-GFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKP-QGPPPQGGPRPPPGKPQGPPPQ 107
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  454 AGPgktsaQQTGPTKPPSQL--PGPAKPPPQQPG-PAKPPPQQPGSAKPPPQQPgstKPPPQQPGPAK 518
Cdd:pfam15240  108 GGN-----QQQGPPPPGKPQgpPPQGGGPPPQGGnQQGPPPPPPGNPQGPPQRP---PQPGNPQGPPQ 167
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
438-580 9.94e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 55.26  E-value: 9.94e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  438 KTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSA-----KPPPQQPGSTKPP-- 510
Cdd:PRK07994   362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQllaarQQLQRAQGATKAKks 441
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  511 -PQQPGPAKP--SPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTgsgKPLQPPTVSPSAKQPP 580
Cdd:PRK07994   442 ePAAASRARPvnSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVA---TPKALKKALEHEKTPE 511
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
446-565 1.03e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 55.17  E-value: 1.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGP---GKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQ 522
Cdd:PRK14971   371 GGRGPKQHIKPvftQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPA 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  523 QPGSTKPPS-QQPGSAKPSAQQpsPAKPSAQQSTK--PVSQTGSGK 565
Cdd:PRK14971   451 QFKEEKKIPvSKVSSLGPSTLR--PIQEKAEQATGniKEAPTGTQK 494
PDZ_syntrophin-like cd06801
PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
4511-4572 1.04e-06

PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of syntrophins (including alpha-1-syntrophin, beta-1-syntrophin, beta-2-syntrophin, gamma-1-syntrophin, and gamma-2-syntrophin), and related domains. Syntrophins play a role in recruiting various signaling molecules into signaling complexes and help provide appropriate spatiotemporal regulation of signaling pathways. They function in cytoskeletal organization and maintenance; as components of the dystrophin-glycoprotein complex (DGC), they help maintain structural integrity of skeletal muscle fibers. They link voltage-gated sodium channels to the actin cytoskeleton and the extracellular matrix, and control the localization and activity of the actin reorganizing proteins such as PI3K, PI(3,4)P2 and TAPP1. Through association with various cytoskeletal proteins within the cells, they are involved in processes such as regulation of focal adhesions, myogenesis, calcium homeostasis, and cell migration. They also have roles in synapse formation and in the organization of utrophin, acetylcholine receptor, and acetylcholinesterase at the neuromuscular synapse. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This syntrophin-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467262 [Multi-domain]  Cd Length: 83  Bit Score: 49.49  E-value: 1.04e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 4511 GNGLGIRIVGGKEipgHSGEIgaYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06801     10 VGGLGISIKGGAE---HKMPI--LISKIFKGQAADQTGQLFVGDAILSVNGENLEDATHDEA 66
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
307-527 1.10e-06

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 54.05  E-value: 1.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAqqpgTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQP 386
Cdd:pfam15279   81 KSASPASTRSESVSPGPSSSASPSSSPTSSNSSKPLI----SVASSSKLLAPKPHEPPSLPPPPLPPKKGRRHRPGLHPP 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGVgKTPAQQPGPakPPTQQVGTPKPLAQQPGL-QSPAKAPGPTKTPVQQPGPGKIPAQQAGP-------GK 458
Cdd:pfam15279  157 LGRPPGSPPMS-MTPRGLLGK--PQQHPPPSPLPAFMEPSSmPPPFLRPPPSIPQPNSPLSNPMLPGIGPPpkpprnlGP 233
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  459 TSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQ--QPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGST 527
Cdd:pfam15279  234 PSNPMHRPPFSPHHPPPPPTPPGPPPGLPPPPPRgfTPPFGPPFPPVNMMPNPPEMNFGLPSLAPLVPPVT 304
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
425-585 1.39e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.86  E-value: 1.39e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  425 PGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPqQPGPAKPPPQQPGSAKPPPQQP 504
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAE-APPAAPAPPATADRGDDAADGD 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  505 GSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSG-KPLQPPTVSPSAKQPPSQG 583
Cdd:PRK07003   447 APVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAvPDARAPAAASREDAPAAAA 526

                   ..
gi 2462613745  584 LP 585
Cdd:PRK07003   527 PP 528
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
349-554 1.59e-06

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 54.39  E-value: 1.59e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  349 VKPPVQPPGTTKPPAQPLG-----PAKPPAQQTGSEKPSSEQPGPKALAQPPGVgktpaqQPGPAKPPTQQVGTPKPLAQ 423
Cdd:pfam15685   31 VEPEVTPSSPAMRLAQGLGvwfpgSSAPPGLLVPPEPQASPSPLPLTLELPLPV------TPPPEEAAAAAVSTAPPPAV 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  424 QPGLQSPAKAPGPTKTPVQQPgPGKIPAQQAGPGKTSAQQTGPTKPPSQLpgpAKPPPQQPGPAKPPPQQPGSAKPPPQQ 503
Cdd:pfam15685  105 GSLLPAPSKWRKPTGTAVARI-RGLLEASHRGQGDPLSLRPLLPLLPRQL---IEKDPAPGAPAPPPPTPLEPRKPPPLP 180
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  504 PGSTKPPPQQPGPA------KPSPQQP---GSTKPPSQQPGSAKPSAQQPSPAKPSAQQS 554
Cdd:pfam15685  181 PSDRQPPNRGITPAlatsatSPTDSQAkhiAEGKTAGGACGGAPPQAGEGEMARFAASES 240
PDZ2_MUPP1-like cd06667
PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4511-4579 1.77e-06

PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467155 [Multi-domain]  Cd Length: 80  Bit Score: 48.43  E-value: 1.77e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 4511 GNGLGIRIVGGKEIpghsgeiGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQ 4579
Cdd:cd06667      9 GSGLGFGIVGGKST-------GVVVKTILPGGVADRDGRLRSGDHILQIGDTNLRGMGSEQVAQVLRQC 70
PRK10263 PRK10263
DNA translocase FtsK; Provisional
424-991 1.88e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.71  E-value: 1.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  424 QPGLQSPAkAPGPtKTPVQQPgpgkIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAkpPPQQPGsakpPPQQ 503
Cdd:PRK10263   338 EPVTQTPP-VASV-DVPPAQP----TVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNE--PLQQPV----QPQQ 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  504 PGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQ-STKPVSQTGSGKPlQPPTVSPSAKQPPSQ 582
Cdd:PRK10263   406 PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTfAPQSTYQTEQTYQ-QPAAQEPLYQQPQPV 484
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  583 GLPKTICPLCNTTELLLHVPekanfntctecqttvcslcgfnPNPHLTEVKEwlcLNCQMKRALGGDLAPVPSSPQPKLK 662
Cdd:PRK10263   485 EQQPVVEPEPVVEETKPARP----------------------PLYYFEEVEE---KRAREREQLAAWYQPIPEPVKEPEP 539
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  663 TAPVTTTSAVSKSSP-QPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAK-----QPPEADSLSKPAPP- 735
Cdd:PRK10263   540 IKSSLKAPSVAAVPPvEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKegigpQLPRPKRIRVPTRRe 619
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  736 ------KEPSVPSEQDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEE--KTTPPLKTDSAKPSQSfp 807
Cdd:PRK10263   620 lasygiKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQyqHDVPVNAEDADAAAEA-- 697
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  808 ptgEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPVQkkEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTagQ 887
Cdd:PRK10263   698 ---ELARQFAQTQQQRYSGEQPAGANPFSLDDFEFSPMKALLD--DGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQ--Q 770
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  888 TVPTPQQSPKPQEQSrrfslnlgsitdAPKSQPTTPQETVtgklfgfgasifsqasnlistAGQPGPHSQSGPGAPMKQA 967
Cdd:PRK10263   771 PVAPQPQYQQPQQPV------------APQPQYQQPQQPV---------------------APQPQYQQPQQPVAPQPQY 817
                          570       580
                   ....*....|....*....|....
gi 2462613745  968 PAPSQPPTSQgPPKSTGQAPPAPA 991
Cdd:PRK10263   818 QQPQQPVAPQ-PQYQQPQQPVAPQ 840
PDZ12_MUPP1-like cd06675
PDZ domain 12 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 10 of protein-associated tight ...
4497-4581 2.00e-06

PDZ domain 12 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 10 of protein-associated tight junction (PATJ, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 12 of MUPP1, PDZ domain 10 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like PDZ12 family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467163 [Multi-domain]  Cd Length: 86  Bit Score: 48.51  E-value: 2.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSKDhtvsgnGLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06675      2 TVEIKRGPQD------SLGISIAGGVGSP--LGDVPVFIAMIQPNGVAAQTGKLKVGDRIVSINGQSTDGLTHSEAVNLL 73

                   ....*
gi 2462613745 4577 SQQSG 4581
Cdd:cd06675     74 KNASG 78
PDZ_Lin-7-like cd06796
PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
4513-4581 2.03e-06

PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Lin-7 (also known as LIN-7 or LIN7), and related domains. Lin-7 targets and organize protein complexes to epithelial and synaptic plasma membranes. There are three mammalian Lin-7 homologs: Lin-7A (protein lin-7 homolog A, also known as mammalian lin-seven protein 1 (MALS-1), vertebrate lin-7 homolog 1 (Veli-1), tax interaction protein 33); Lin-7B (also known as MALS-2, Veli-2); and Lin-7C (also known as MALS-3, Veli-3). Lin-7 is involved in localization of the Let-23 growth factor receptor to the basolateral membrane of epithelial cells, in tight junction localization of insulin receptor substrate p53 (IRSp53), in retaining gamma-aminobutyric (GABA) transporter (BGT-1) at the basolateral surface of epithelial cells, and in regulating recruitment of neurotransmitter receptors to the postsynaptic density (PSD). The Lin7 PDZ domain binds Let-23, BGT and beta-catenin, and NMDA (N-methyl-D-aspartate) receptor NR2B. Lin-7 also binds to the PDZ binding motif located in the C-terminal tail of Rhotekin, an effector protein for small GTPase Rho. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Lin-7-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467258 [Multi-domain]  Cd Length: 86  Bit Score: 48.59  E-value: 2.03e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 4513 GLGIRIVGGKEipgHSGEIgaYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06796     13 GLGFNVMGGKE---QNSPI--YISRIIPGGVADRHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQG 76
PDZ13_MUPP1-like cd06676
PDZ domain 13 of multi-PDZ-domain protein 1 (MUPP1) and related domains; PDZ (PSD-95 ...
4513-4581 2.05e-06

PDZ domain 13 of multi-PDZ-domain protein 1 (MUPP1) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 13 of MUPP1. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, PDZ9, and PDZ13. This MuPP1-like PDZ13 domain is therefore absent from PATJ. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ13 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467164 [Multi-domain]  Cd Length: 83  Bit Score: 48.49  E-value: 2.05e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 4513 GLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06676     10 GLGFSIVGGFGSP--HGDLPIYVKTVFEKGAAAEDGRLKRGDQILAVNGESLEGVTHEEAVNILKKTKG 76
PDZ11_MUPP1-PDZ9_PATJ-like cd06674
PDZ domain 11 of MUPP1 of multi-PDZ-domain protein 1 (MUPP1), domain 9 of PATJ ...
4510-4582 2.24e-06

PDZ domain 11 of MUPP1 of multi-PDZ-domain protein 1 (MUPP1), domain 9 of PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 11 of MUPP1, PDZ domain 9 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ11 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467162 [Multi-domain]  Cd Length: 87  Bit Score: 48.43  E-value: 2.24e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4510 SGNGLGIRIVGGKEipghsgEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGE 4582
Cdd:cd06674     12 PGRGLGLSIVGKRN------DTGVFVSDIVKGGAADADGRLMQGDQILSVNGEDVRNASQEAAAALLKCAQGK 78
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
446-585 2.32e-06

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 53.40  E-value: 2.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPppQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPG 525
Cdd:pfam16014   32 PPVTVAVEALPGQNSEQQTASASPPSQHPAQAIP--TILAPAAPPSQPSVVLSTLPAAMAVTPPIPASMANVVAPPTQPA 109
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  526 STKPPSQQPGSAKPSAQQPSPAKP-SAQQSTKPVSQTGSGKPLqpPTVSPSAKQPPSQGLP 585
Cdd:pfam16014  110 ASSTAACAVSSVLPEIKIKQEAEPmDTSQSVPPLTPTSISPAL--TSLANNLSVPAGDLLP 168
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
463-584 2.38e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 54.01  E-value: 2.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTKPPSQLPG-PAKPPPQQPGPAKPPPQQPGsakpPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAK--P 539
Cdd:PRK14971   364 QKGDDASGGRGPKqHIKPVFTQPAAAPQPSAAAA----ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPvnP 439
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2462613745  540 SAQQPSPAKPSAQQSTK--PVSQTGSgkpLQPPTVSPsaKQPPSQGL 584
Cdd:PRK14971   440 PSTAPQAVRPAQFKEEKkiPVSKVSS---LGPSTLRP--IQEKAEQA 481
PDZ1_FRMPD2-like cd23071
PDZ domain 1 of FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ ...
4497-4581 2.60e-06

PDZ domain 1 of FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of FRMPD2 (also known as PDZ domain-containing protein 4, and related domains. FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467284 [Multi-domain]  Cd Length: 92  Bit Score: 48.64  E-value: 2.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSKdhtvsgNGLGIRIVGGKEIpgHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTY------- 4569
Cdd:cd23071      4 CVTLKRDPK------RGFGFVIVGGENT--GKLDLGIFIASIIPGGPAEKDGRIKPGGRLISLNNISLEGVTFntavkil 75
                           90
                   ....*....|....*.
gi 2462613745 4570 ----EEVQSIISQQSG 4581
Cdd:cd23071     76 qnspDEVELIISQPKD 91
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
331-591 2.73e-06

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 53.39  E-value: 2.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  331 PPAQPSGLTKPLAQQPGTVKPPVQppgttKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQ--PGPA 408
Cdd:cd22540     39 PPAVEAAVTPPAPPQPTPRKLVPI-----KPAPLPLGPGKNSIGFLSAKGNIIQLQGSQLSSSAPGGQQVFAIQnpTMII 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  409 KPPTQQVGTPKPLAQQPGLQspakapgptkTPVQQPGPGKIpaqQAGPGKTSAQ----QTGPTKPPSQLPGPAKPPPQQP 484
Cdd:cd22540    114 KGSQTRSSTNQQYQISPQIQ----------AAGQINNSGQI---QIIPGTNQAIitpvQVLQQPQQAHKPVPIKPAPLQT 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  485 GPA-KPPPQQPGSA--KPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:cd22540    181 SNTnSASLQVPGNVikLQSGGNVALTLPVNNLVGTQDGATQLQLAAAPSKPSKKIRKKSAQAAQPAVTVAEQVETVLIET 260
                          250       260       270
                   ....*....|....*....|....*....|
gi 2462613745  562 GSGKPLQPPTvSPSAKQPPSQGLPKTICPL 591
Cdd:cd22540    261 TADNIIQAGN-NLLIVQSPGTGQPAVLQQV 289
dnaA PRK14086
chromosomal replication initiator protein DnaA;
326-542 3.00e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 53.68  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  326 PGPAKPPAQPSgltkplAQQPGTVKPPVQP-PGTTKPPAQPLGPAKPPaqqtGSEKPSSeQPGPKALAQPPgvgktpaqQ 404
Cdd:PRK14086    94 EPAPPPPHARR------TSEPELPRPGRRPyEGYGGPRADDRPPGLPR----QDQLPTA-RPAYPAYQQRP--------E 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  405 PGPAKPPTQQVGtpkplAQQPGLQSPAKAPgptKTPVQQPGPGkiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQP 484
Cdd:PRK14086   155 PGAWPRAADDYG-----WQQQRLGFPPRAP---YASPASYAPE--QERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRD 224
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  485 GPAKPPPqQPGSAKPPPQQPGSTKPPPQQPGPAKPSpqQPGS-TKPPSQQPGSAKPSAQ 542
Cdd:PRK14086   225 RTDRPEP-PPGAGHVHRGGPGPPERDDAPVVPIRPS--APGPlAAQPAPAPGPGEPTAR 280
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
390-568 3.00e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 53.72  E-value: 3.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  390 ALA-QPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPgpgkiPAQQAGPGKTSAQQTGPTK 468
Cdd:PRK07994   356 MLAfHPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAP-----AVPLPETTSQLLAARQQLQ 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPGPAKPPPQQPGPAKP--PPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPgSTKPPSQQPGSAKPSAQQ--- 543
Cdd:PRK07994   431 RAQGATKAKKSEPAAASRARPvnSALERLASVRPAPSALEKAPAKKEAYRWKATNPVE-VKKEPVATPKALKKALEHekt 509
                          170       180       190
                   ....*....|....*....|....*....|.
gi 2462613745  544 PSPAKPSAQQSTKP------VSQTGSGKPLQ 568
Cdd:PRK07994   510 PELAAKLAAEAIERdpwaalVSQLGLPGLVE 540
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
250-474 3.09e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 53.70  E-value: 3.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  250 QPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQAdivrGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGPA 329
Cdd:PRK07003   399 VTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAT----ADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADS 474
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  330 KPPAQPSGLTKPLAQ-QPGTvkPPVQPPGTTKPPAQPlgPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQ---- 404
Cdd:PRK07003   475 GSASAPASDAPPDAAfEPAP--RAAAPSAATPAAVPD--ARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAggaa 550
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  405 ---------------------PGPAKPPTQQVGTPKPLAQQPGLQSPakAPGPTKTPVQQPGPGKIPAQQAgpgktsAQQ 463
Cdd:PRK07003   551 aaldvlrnagmrvssdrgaraAAAAKPAAAPAAAPKPAAPRVAVQVP--TPRARAATGDAPPNGAARAEQA------AES 622
                          250
                   ....*....|.
gi 2462613745  464 TGPTKPPSQLP 474
Cdd:PRK07003   623 RGAPPPWEDIP 633
PHA03160 PHA03160
hypothetical protein; Provisional
446-560 3.58e-06

hypothetical protein; Provisional


Pssm-ID: 165431  Cd Length: 499  Bit Score: 53.17  E-value: 3.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAG------PGKTSAQQTGPTKppsqLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPgPAKP 519
Cdd:PHA03160   376 PNRIIPHHFSnpysfdPGHAPFFRYAPYG----APKNDHHLLPPLACSQQLPMQPLHVQQAPMQAPHVAPPPMQP-PHVQ 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2462613745  520 SPQQPGSTKPPSQQpgSAKPSAQQPSPAKPSAQQStkPVSQ 560
Cdd:PHA03160   451 QPRVLPSTDGASNE--APKPSAQEPVHIDASFAQD--PVSK 487
PDZ3_PTPN13_FRMPD2-like cd06695
PDZ domain 3 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), FERM and PDZ ...
4510-4588 3.77e-06

PDZ domain 3 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], FRMPD2 (also known as PDZ domain-containing protein 4; PDZ domain-containing protein 5C), and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467181 [Multi-domain]  Cd Length: 90  Bit Score: 48.02  E-value: 3.77e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4510 SGNGLGIRIVGGK-EIPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAE--IC 4586
Cdd:cd06695      9 GSSGLGFSFLGGEnNSPEDPFSGLVRIKKLFPGQPAAESGLIQEGDVILAVNGEPLKGLSYQEVLSLLRGAPPEVTllLC 88

                   ..
gi 2462613745 4587 GP 4588
Cdd:cd06695     89 RP 90
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
673-1061 3.80e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.62  E-value: 3.80e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  673 SKSSPQPQQTSPKKDAAPKQDLskapepkKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAP-PKEPSVPSEqdKAPVAD 751
Cdd:pfam03154  145 SPSIPSPQDNESDSDSSAQQQI-------LQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPtPSAPSVPPQ--GSPATS 215
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  752 DKPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTdSAKPSQS------FPPTGEKVS--------PFD 817
Cdd:pfam03154  216 QPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV-SPQPLPQpslhgqMPPMPHSLQtgpshmqhPVP 294
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  818 SKAIPRPASDSKIISHPGPSSESKGQKQvdpvQKKEEPKKAQTKMSPKPdakpmPKGSPTPPGPRPTAG----QTVPTPQ 893
Cdd:pfam03154  295 PQPFPLTPQSSQSQVPPGPSPAAPGQSQ----QRIHTPPSQSQLQSQQP-----PREQPLPPAPLSMPHikppPTTPIPQ 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  894 QsPKPQEQSRRFSLNLGSITDAPKSQPTTPqetvtgklfgfgasifsqASNLISTAGQPGPHSQSGPGAPMKQAPAPSQP 973
Cdd:pfam03154  366 L-PNPQSHKHPPHLSGPSPFQMNSNLPPPP------------------ALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPP 426
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  974 PTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKDSKSLTAEPQKAVLPTKLEKSP 1053
Cdd:pfam03154  427 PPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPV 506

                   ....*...
gi 2462613745 1054 KPESTCPL 1061
Cdd:pfam03154  507 PAAVSCPL 514
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
373-494 3.89e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 53.24  E-value: 3.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  373 AQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQS-PAKAPGPTKTPVQQPGPGKIPA 451
Cdd:PRK14971   360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSaTQPAGTPPTVSVDPPAAVPVNP 439
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  452 QQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQpGPAKPPPQQP 494
Cdd:PRK14971   440 PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTL-RPIQEKAEQA 481
PDZ10_MUPP1-PDZ8_PATJ-like cd06673
PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated ...
4513-4578 4.50e-06

PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 10 of MUPP1, PDZ domain 8 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ10 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467161 [Multi-domain]  Cd Length: 86  Bit Score: 47.67  E-value: 4.50e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4513 GLGIRIVGGKEIPghsgeIGA-YIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06673     14 GLGLSIVGGSDTL-----LGAiIIHEVYEDGAAAKDGRLWAGDQILEVNGEDLRKATHDEAINVLRQ 75
PDZ4_MUPP1-like cd06668
PDZ domain 4 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4498-4576 4.59e-06

PDZ domain 4 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467156 [Multi-domain]  Cd Length: 88  Bit Score: 47.68  E-value: 4.59e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 4498 IKITRDSKDHTVSGNGLGIRIVGGKEIPGHSgeigaYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06668      2 IVVAQLSKFSESSGLGISLEGTVDVEVRGHH-----YIRSILPEGPVGRNGKLFSGDELLEVNGIQLLGLSHKEVVSIL 75
PDZ4_PDZD2-PDZ2_hPro-IL-16-like cd06760
PDZ domain 4 of PDZ domain containing 2 (PDZD2), PDZ domain 2 of human pro-interleukin-16 ...
4507-4585 4.74e-06

PDZ domain 4 of PDZ domain containing 2 (PDZD2), PDZ domain 2 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family also includes the second PDZ domain (PDZ2) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16; 1332 amino-acid protein). Precursor IL-16 is cleaved to produce pro-IL-16 and mature IL-16 (derived from the C-terminal 121 AA). Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467241 [Multi-domain]  Cd Length: 90  Bit Score: 47.65  E-value: 4.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4507 HTVSGNGLGIRIVGgkeIPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ-QSGEAEI 4585
Cdd:cd06760     10 NKEPGVGLGIGLCC---LPLENDIPGIFIHHLSPGSVAHMDGRLRRGDQILEINGTSLRNVTLNEAYAILSQcKPGPVTL 86
PDZ3_ZO1-like_domain cd06729
PDZ domain 3 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ ...
4511-4582 4.77e-06

PDZ domain 3 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of ZO-1, -2, -3 and related domains. Zonula occludens proteins (ZO-1, ZO-2, ZO-3) are multi-PDZ domain proteins involved in the maintenance and biogenesis of multi-protein networks at the cytoplasmic surface of intercellular contacts in epithelial and endothelial cells. They have three N-terminal PDZ domains, PDZ1-3, followed by a Src homology-3 (SH3) domain and a guanylate kinase (GuK)-like domain. Among protein-protein interactions for all ZO proteins is the binding of the first PDZ domain (PDZ1) to the C-termini of claudins , and the homo- and hetero-dimerization of ZO-proteins via their second PDZ domain (PDZ2), which takes place by symmetrical domain swapping of the first two beta-strands of PDZ2. At the cell level, ZO-1 and ZO-2 are involved in polarity maintenance, gene transcription, cell proliferation, and tumor cell metastasis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This ZO family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467211 [Multi-domain]  Cd Length: 82  Bit Score: 47.56  E-value: 4.77e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4511 GNGLGIRIVGGKEIpghsgeiGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEE-VQSIISQQSGE 4582
Cdd:cd06729     10 GGSVGLRLAGGNDV-------GIFVAGVQEGSPAEKQG-LQEGDQILKVNGVDFRNLTREEaVLFLLDLPKGE 74
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
455-538 5.08e-06

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 52.97  E-value: 5.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  455 GPGKTSAQQTGPTKP----PSQLPGPAKPPPQQPGPAKPPPQQPgSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPP 530
Cdd:PRK12270    37 GPGSTAAPTAAAAAAaaaaSAPAAAPAAKAPAAPAPAPPAAAAP-AAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
                           90
                   ....*....|
gi 2462613745  531 SQQP--GSAK 538
Cdd:PRK12270   116 EVTPlrGAAA 125
PDZ_AFDN-like cd06789
PDZ domain of afadin (AFDN), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95) ...
4511-4578 5.23e-06

PDZ domain of afadin (AFDN), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of afadin (AFDN, also known as ALL1-fused gene from chromosome 6 protein (AF6) and MLLT4), and related domains. AFDN belongs to the adhesion system, probably together with the E-cadherin-catenin system, that plays a role in the organization of homotypic, interneuronal, and heterotypic cell-cell adherens junctions. The AFDN PDZ domain interaction partners include poliovirus receptor-related protein PRR2/nectin, the junctional adhesion molecule (JAM), the breakpoint-cluster-region protein (BCR), connexin36 (Cx36), and a subset of Eph-related receptor tyrosine kinases; it can also bind low molecular weight ligands, in competition with a natural peptide ligand. Other AFDN-binding proteins have been identified. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This AFDN family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467251 [Multi-domain]  Cd Length: 89  Bit Score: 47.67  E-value: 5.23e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745 4511 GNGLGIRIVGGKEIpgHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06789     12 GNGMGLSIVAAKGA--GQDKLGIYIKSVVKGGAADLDGRLQAGDQLLSVDGHSLVGLSQERAAELMTK 77
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
313-446 5.83e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 52.79  E-value: 5.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  313 KPPAqqpGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQ---PPGTTKPPAQPLGPAKP-PAQQTGSEKPSSEQPGP 388
Cdd:PRK14951   365 KPAA---AAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPaaaPAAAASAPAAPPAAAPPaPVAAPAAAAPAAAPAAA 441
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  389 KALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPglqsPAKAPGPTKTPVQQPGP 446
Cdd:PRK14951   442 PAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA----PAAAPAAARLTPTEEGD 495
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
335-550 6.70e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 52.27  E-value: 6.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  335 PSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPlgPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQ 414
Cdd:PRK14948   361 PSAFISEIANASAPANPTPAPNPSPPPAPIQ--PSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPS 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  415 VGTpKPLAQQ--PGLQSPAkapgpTKTPVQQPG------PGKIPAQQAG----------PGKTSA--------------Q 462
Cdd:PRK14948   439 LNL-EELWQQilAKLELPS-----TRMLLSQQAelvsldSNRAVIAVSPnwlgmvqsrkPLLEQAfakvlgrsiklnleS 512
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPgstkPPPQQPGPAKPSPQQPGSTKPPSQQPgsakPSAQ 542
Cdd:PRK14948   513 QSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPP----PPPTATQASSNAPAQIPADSSPPPPI----PEEP 584

                   ....*...
gi 2462613745  543 QPSPAKPS 550
Cdd:PRK14948   585 TPSPTKDS 592
PRK11633 PRK11633
cell division protein DedD; Provisional
469-560 6.84e-06

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 50.39  E-value: 6.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPGPAKPPP-----QQPGPAKPPPQQPgSAKPPPQQPGSTKPPPQQPGPAKPSPQqpgsTKPPSQQPGSAKPSAQQ 543
Cdd:PRK11633    57 PAATQALPTQPPEgaaeaVRAGDAAAPSLDP-ATVAPPNTPVEPEPAPVEPPKPKPVEK----PKPKPKPQQKVEAPPAP 131
                           90
                   ....*....|....*..
gi 2462613745  544 PSPAKPSAQQSTKPVSQ 560
Cdd:PRK11633   132 KPEPKPVVEEKAAPTGK 148
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
712-1043 7.31e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.48  E-value: 7.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  712 LHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSkvqsqaEEKT 791
Cdd:PHA03307    42 QLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGP------SSPD 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  792 TPPLKTDSAKP--------SQSFPPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESkgqkqVDPVQKKEEPkkAQTKMS 863
Cdd:PHA03307   116 PPPPTPPPASPppspapdlSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQA-----ALPLSSPEET--ARAPSS 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  864 PKPDA---KPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIFS 940
Cdd:PHA03307   189 PPAEPppsTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTR 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  941 QASNLISTAGQPGPHSQSGPGAPMKQAPAPSqPPTSQGPPKSTGqaPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKR 1020
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSSSPRERSPSPS-PSSPGSGPAPSS--PRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP 345
                          330       340
                   ....*....|....*....|...
gi 2462613745 1021 TETEkKPPPIKDSKSLTAEPQKA 1043
Cdd:PHA03307   346 SPSR-SPSPSRPPPPADPSSPRK 367
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
391-579 7.37e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 52.54  E-value: 7.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  391 LAQPPGVGKTPAqqPGPAKPPTQQVGTPKPLAQqpglqsPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPP 470
Cdd:PRK07003   356 LAFEPAVTGGGA--PGGGVPARVAGAVPAPGAR------AAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPP 427
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  471 SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGStKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPS 550
Cdd:PRK07003   428 AAPAPPATADRGDDAADGDAPVPAKANARASADSRC-DERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAA 506
                          170       180
                   ....*....|....*....|....*....
gi 2462613745  551 AQQSTKPVSQTGSGKPLQPPTVSPSAKQP 579
Cdd:PRK07003   507 VPDARAPAAASREDAPAAAAPPAPEARPP 535
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
448-684 7.66e-06

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 52.24  E-value: 7.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  448 KIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPP-----PQQPGSA----KPPpqqpgsTKPPPQQPGPAK 518
Cdd:PLN03209   322 KIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKavvprPLSPYTAyedlKPP------TSPIPTPPSSSP 395
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  519 PSPQQPGSTKPPSQ---QPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTvSPSAKQPPSQGLPKTicplcnTT 595
Cdd:PLN03209   396 ASSKSVDAVAKPAEpdvVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPPT-SPSPTAPTGVSPSVS------ST 468
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  596 ELLLHVPEKANFNTCTECQTTVCSlcgfNPNPH-LTEVKEWLCLNCQMKRALGGDLAPVPSSPQPKLKTAPVTTTSAVSK 674
Cdd:PLN03209   469 SSVPAVPDTAPATAATDAAAPPPA----NMRPLsPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADE 544
                          250
                   ....*....|...
gi 2462613745  675 ---SSPQPQQTSP 684
Cdd:PLN03209   545 qhhAQPKPRPLSP 557
PTZ00121 PTZ00121
MAEBL; Provisional
1153-1712 7.76e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.84  E-value: 7.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1153 QVKLVKKQEQEVKTEAEKVILEKVKETLSMEKIPP--MVTTDQKQEESKL-EKDKASAL---QEKKPLPE----EKKLIP 1222
Cdd:PTZ00121  1225 KAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEarMAHFARRQAAIKAeEARKADELkkaEEKKKADEakkaEEKKKA 1304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1223 EEEKIRSEEKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHD-LLKSQVQIAEEKLEGRVAPKTVQEGKQPQTKMEGLPSG 1301
Cdd:PTZ00121  1305 DEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAeAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1302 TPQSLPKEDDKTTKTIKEQPQppctaKPDQVEPGKEKTEKEDD-KSDTSSSQQPKSPQGLSDTGYSSDGISSSlgeipsl 1380
Cdd:PTZ00121  1385 KKAEEKKKADEAKKKAEEDKK-----KADELKKAAAAKKKADEaKKKAEEKKKADEAKKKAEEAKKADEAKKK------- 1452
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1381 ipTDEKDILKGLKKdsfsqESSPSSPSDLAKLESTvlsilEAQASTLADEKSEKKTQPHEVSPEQPKDQEKTQSLSETLE 1460
Cdd:PTZ00121  1453 --AEEAKKAEEAKK-----KAEEAKKADEAKKKAE-----EAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1461 ITISEEEIKESQEERKDTFKKDSQ----QDIPSSKDHKEKSEFVDDITTRREPYDSVEESSESENSPVPQRKRRTSVGSS 1536
Cdd:PTZ00121  1521 AKKADEAKKAEEAKKADEAKKAEEkkkaDELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKL 1600
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1537 SSDEYKQEDSQGSGEEEDFIRKqiiemsadEDASGSEDDEFIRNQLKEiSSSTESQKKEETKGKGKITAGKHRRLTRKSs 1616
Cdd:PTZ00121  1601 YEEEKKMKAEEAKKAEEAKIKA--------EELKKAEEEKKKVEQLKK-KEAEEKKKAEELKKAEEENKIKAAEEAKKA- 1670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1617 tsiDEDAGRRHSWHDEDDEAFDESPELKYRETKSQESEELVVTGGGGLRRFKTI----ELNSTIAD--KYSAESSQKKT- 1689
Cdd:PTZ00121  1671 ---EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELkkaeEENKIKAEeaKKEAEEDKKKAe 1747
                          570       580
                   ....*....|....*....|...
gi 2462613745 1690 SLYFDEEPELEMESLTDSPEDRS 1712
Cdd:PTZ00121  1748 EAKKDEEEKKKIAHLKKEEEKKA 1770
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
718-1053 8.17e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.38  E-value: 8.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  718 AKAKQPP--EADSLSKPAPPKEPSVPSEQDKAPvaDDKPKQPKMVKPttdlvssSSATTKPDIPSSKVQSQAEEKTTPPL 795
Cdd:PTZ00449   492 SKKKLAPieEEDSDKHDEPPEGPEASGLPPKAP--GDKEGEEGEHED-------SKESDEPKEGGKPGETKEGEVGKKPG 562
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  796 KTDSAKPSQSFPPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPvQKKEEPKKAQTKMSPKPDAKPMPKGS 875
Cdd:PTZ00449   563 PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLP-ELLDIPKSPKRPESPKSPKRPPPPQR 641
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  876 PTPPG-PRPTAGQTVPTPQQSPK-PQEQSRRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIFSQASNLISTAGQP- 952
Cdd:PTZ00449   642 PSSPErPEGPKIIKSPKPPKSPKpPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPl 721
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  953 ---GPHSQSGPGAPMKQAPAPSQPPTSQGPP--------KSTGQAPPAPA------KSIPVKKETKAPaaeklepkaEQA 1015
Cdd:PTZ00449   722 ppkLPRDEEFPFEPIGDPDAEQPDDIEFFTPpeeertffHETPADTPLPDilaeefKEEDIHAETGEP---------DEA 792
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 2462613745 1016 PTVKRTETEKKPPPIKDSKSLTAEPQK----AVLPTKLEKSP 1053
Cdd:PTZ00449   793 MKRPDSPSEHEDKPPGDHPSLPKKRHRldglALSTTDLESDA 834
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
468-582 8.53e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 52.18  E-value: 8.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  468 KPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQ--QPGSAKPSAQQPS 545
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAarQQLQRAQGATKAK 439
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462613745  546 PAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQ 582
Cdd:PRK07994   440 KSEPAAASRARPVNSALERLASVRPAPSALEKAPAKK 476
PHA01929 PHA01929
putative scaffolding protein
439-575 8.79e-06

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 51.21  E-value: 8.79e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  439 TPVQQPGPGKIPAQQAgpgktsaqqtgpTKPPSQLPGPAKPPPQQPGPAKPP--PQQPGSAKPPPQQPGSTKPPPQQPGP 516
Cdd:PHA01929     2 TQNEQQLPPGLAGLVA------------NVPPAAAPTPQPNPVIQPQAPVQPgqPGAPQQLAIPTQQPQPVPTSAMTPHV 69
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  517 AKPSPQQPGSTKPPSqqPGSAKPSAQQPSPakPSAQQSTKPVSQTGSGKPLQPPTVSPS 575
Cdd:PHA01929    70 VQQAPAQPAPAAPPA--AGAALPEALEVPP--PPAFTPNGEIVGTLAGNLEGDPQLAPS 124
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
343-546 1.05e-05

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 51.91  E-value: 1.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  343 AQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQP---------GPKALAQPPGVgktPAQQPGPAkpPTQ 413
Cdd:PRK12727    57 TARSDTPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRvasaaedmiAAMALRQPVSV---PRQAPAAA--PVR 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  414 QVGTPKPLAQQpglqspaKAPGPTKTPVQQPGPG--KIPAQQAGPGKTSAQQTGPTKPPSQLPGPAK-PPPQQPGPAKPP 490
Cdd:PRK12727   132 AASIPSPAAQA-------LAHAAAVRTAPRQEHAlsAVPEQLFADFLTTAPVPRAPVQAPVVAAPAPvPAIAAALAAHAA 204
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  491 PQQPGSAKPPPQQPGSTKPPPqqpgPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSP 546
Cdd:PRK12727   205 YAQDDDEQLDDDGFDLDDALP----QILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
491-588 1.16e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 51.70  E-value: 1.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  491 PQQPGSAKPPPQQPGSTKPPPQQPGpakpSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPV--SQTGSGKPLQ 568
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVFTQPA----AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTvsVDPPAAVPVN 438
                           90       100
                   ....*....|....*....|
gi 2462613745  569 PPTVSPSAKQPPSQGLPKTI 588
Cdd:PRK14971   439 PPSTAPQAVRPAQFKEEKKI 458
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
461-552 1.23e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 51.16  E-value: 1.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  461 AQQTGPTKPPsqlpgPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPS 540
Cdd:NF041121    12 AAQMGRAAAP-----PSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPG 86
                           90
                   ....*....|....
gi 2462613745  541 AQQP--SPAKPSAQ 552
Cdd:NF041121    87 AALPvrVPAPPALP 100
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
652-1016 1.28e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 1.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  652 PVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVkqpTLHgSPSAKAKQPPEADSLSK 731
Cdd:pfam03154  180 AASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTP---TLH-PQRLPSPHPPLQPMTQP 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  732 PAPPKEPSVPSEQDKA---------PVADDKPKQPKMVKPTT-DLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAK 801
Cdd:pfam03154  256 PPPSQVSPQPLPQPSLhgqmppmphSLQTGPSHMQHPVPPQPfPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  802 PSQsfpPTGEKVSPFDSKAIP--RPASDSKIISHPGPSSESKGQKQVDPvqkkeEPKKAQTKMSPKPDAKPMPKGSP-TP 878
Cdd:pfam03154  336 SQQ---PPREQPLPPAPLSMPhiKPPPTTPIPQLPNPQSHKHPPHLSGP-----SPFQMNSNLPPPPALKPLSSLSThHP 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  879 PGPRPTAGQTVPTPQQSPKPQEQSRrfslnlgSITDAPKSQPTTPQETVTGKLFGfGASIFSQASNLISTAGQPGPHSQS 958
Cdd:pfam03154  408 PSAHPPPLQLMPQSQQLPPPPAQPP-------VLTQSQSLPPPAASHPPTSGLHQ-VPSQSPFPQHPFVPGGPPPITPPS 479
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  959 GPGAPMKQAPAPSQPPTSqGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAP 1016
Cdd:pfam03154  480 GPPTSTSSAMPGIQPPSS-ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPP 536
PHA03418 PHA03418
hypothetical E4 protein; Provisional
338-516 1.32e-05

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 49.74  E-value: 1.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  338 LTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPS-SEQPGPKALAQPPGVGKTPAQQPGPakpptqqvg 416
Cdd:PHA03418    31 LCLPLLPAPHHPNPQEDPDKNPSPPPDPPLTPRPPAQPNGHNKPPvTKQPGGEGTEEDHQAPLAADADDDP--------- 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  417 tpkplaqQPGLQSPAKAPGPTKTPvQQPGPGKIPAQQAGPgktsaqQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGS 496
Cdd:PHA03418   102 -------RPGKRSKADEHGPAPGR-AALAPFKLDLDQDPL------HGDPDPPPGATGGQGEEPPEGGEESQPPLGEGEG 167
                          170       180
                   ....*....|....*....|...
gi 2462613745  497 A---KPPPqqpgstKPPPQQPGP 516
Cdd:PHA03418   168 AvegHPPP------LPPAPEPKP 184
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
269-422 1.40e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 51.40  E-value: 1.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  269 HAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKP-PAQQPgheksQPGPAKPPAQPSGLTKPLAQQPG 347
Cdd:PRK07994   360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASaPQQAP-----AVPLPETTSQLLAARQQLQRAQG 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  348 TVKPPVQPPGTTKPP-----AQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLA 422
Cdd:PRK07994   435 ATKAKKSEPAAASRArpvnsALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAA 514
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
464-544 1.61e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 51.43  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  464 TGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQ 543
Cdd:PRK12270    39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                   .
gi 2462613745  544 P 544
Cdd:PRK12270   119 P 119
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
485-590 1.67e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 51.25  E-value: 1.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  485 GPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSqtgsg 564
Cdd:PRK14951   369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA----- 443
                           90       100
                   ....*....|....*....|....*.
gi 2462613745  565 kPLQPPTVSPSAKQPPSQGLPKTICP 590
Cdd:PRK14951   444 -AVALAPAPPAQAAPETVAIPVRVAP 468
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
327-487 1.69e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 51.25  E-value: 1.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  327 GPAKPPAQPSGLTKPLAQQPGTVKPPvqppgttkPPAQPLGPAKPPAQQTgsekpsSEQPGPKALAQPPGVGKTPAQQPG 406
Cdd:PRK14951   370 AEAAAPAEKKTPARPEAAAPAAAPVA--------QAAAAPAPAAAPAAAA------SAPAAPPAAAPPAPVAAPAAAAPA 435
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  407 PAKPPtqqvgTPKPLAQQPGLQSPAkAPGPTKTPVQ-QPGPgkiPAQQAGPgktsaqqtgptkPPSQLPGPAKPPPQQPG 485
Cdd:PRK14951   436 AAPAA-----APAAVALAPAPPAQA-APETVAIPVRvAPEP---AVASAAP------------APAAAPAAARLTPTEEG 494

                   ..
gi 2462613745  486 PA 487
Cdd:PRK14951   495 DV 496
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
235-565 1.69e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.46  E-value: 1.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  235 TPKSISSQQPEKIKSQPPGTGKPIQGPTQT---PQTDHAKLPLQRDASR-PQTKQADIVrgesvkpSLPSPSKPPIQQPT 310
Cdd:pfam05109  416 THKVIFSKAPESTTTSPTLNTTGFAAPNTTtglPSSTHVPTNLTAPASTgPTVSTADVT-------SPTPAGTTSGASPV 488
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  311 PGKPPAQQPGHEKSQPGPAKPPaqpSGLTKPL----AQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQP 386
Cdd:pfam05109  489 TPSPSPRDNGTESKAPDMTSPT---SAVTTPTpnatSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTP 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAqpPGVGKTPaqqpgpakpPTQQVGTPKPLAQQP--GLQSPAKAP------GPTKTPV-QQPGPGKIPAQQAGPG 457
Cdd:pfam05109  566 TPNATI--PTLGKTS---------PTSAVTTPTPNATSPtvGETSPQANTtnhtlgGTSSTPVvTSPPKNATSAVTTGQH 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  458 K-TSAQQTGPTKPPSQLPGPAKPPPQ----------------------QPGPAKPPPQQPGSAKPPPQqPGSTKpppQQP 514
Cdd:pfam05109  635 NiTSSSTSSMSLRPSSISETLSPSTSdnstshmplltsahptggenitQVTPASTSTHHVSTSSPAPR-PGTTS---QAS 710
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  515 GPAKPSPqqpgSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGK 565
Cdd:pfam05109  711 GPGNSST----STKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGK 757
PRK11633 PRK11633
cell division protein DedD; Provisional
440-534 1.77e-05

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 49.23  E-value: 1.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  440 PVQQPGPGKIPaqqagPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPaKPPPQQPgsAKPPPQQPgsTKPPPQQPGPAKP 519
Cdd:PRK11633    58 AATQALPTQPP-----EGAAEAVRAGDAAAPSLDPATVAPPNTPVEP-EPAPVEP--PKPKPVEK--PKPKPKPQQKVEA 127
                           90
                   ....*....|....*
gi 2462613745  520 SPQQPGSTKPPSQQP 534
Cdd:PRK11633   128 PPAPKPEPKPVVEEK 142
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
456-548 1.79e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 50.96  E-value: 1.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 PGKTSAQQTGPTKPPSQ-LPGPAKPPPqqPGPAKPPPQQPGSAKPPPQQPgstkPPPQQPGPAKPSPQQPGSTKPPSQQP 534
Cdd:PRK14950   364 PAPQPAKPTAAAPSPVRpTPAPSTRPK--AAAAANIPPKEPVRETATPPP----VPPRPVAPPVPHTPESAPKLTRAAIP 437
                           90
                   ....*....|....
gi 2462613745  535 GSAKPSAQQPSPAK 548
Cdd:PRK14950   438 VDEKPKYTPPAPPK 451
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
329-434 1.80e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 50.93  E-value: 1.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  329 AKPPAQPSGLTKPL------AQQPGTVKPPVQPPGTTKPPAQPlgPAKPPA-QQTGSEKPSSEQPGPKALAQPPGVGK-- 399
Cdd:PRK14971   369 ASGGRGPKQHIKPVftqpaaAPQPSAAAAASPSPSQSSAAAQP--SAPQSAtQPAGTPPTVSVDPPAAVPVNPPSTAPqa 446
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 2462613745  400 -TPAQQPGPAKPPTQQVGTPKPLAQQPgLQSPAKAP 434
Cdd:PRK14971   447 vRPAQFKEEKKIPVSKVSSLGPSTLRP-IQEKAEQA 481
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
328-516 1.89e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  328 PAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGP-AKPPAQQTGSEKPSSEQPGPKALAQPPGvgktPAQQPG 406
Cdd:PHA03307   765 PAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSgPAADAASRTASKRKSRSHTPDGGSESSG----PARPPG 840
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  407 PAkpptqqvGTPKPLAQQPGLQSPAKAPGPtktpvqqpgpgkipaqqAGPGKTSAQQTGPTKPPSQLPGPAKPPPQqpgP 486
Cdd:PHA03307   841 AA-------ARPPPARSSESSKSKPAAAGG-----------------RARGKNGRRRPRPPEPRARPGAAAPPKAA---A 893
                          170       180       190
                   ....*....|....*....|....*....|
gi 2462613745  487 AKPPPQQPGSAKPPPQQPGSTKPPPQQPGP 516
Cdd:PHA03307   894 AAPPAGAPAPRPRPAPRVKLGPMPPGGPDP 923
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
947-1027 2.00e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 51.04  E-value: 2.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  947 STAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKsipvKKETKAPAAEKLEPKAEQAPTVKRTETEKK 1026
Cdd:PRK12270    43 APTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAA----AAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                   .
gi 2462613745 1027 P 1027
Cdd:PRK12270   119 P 119
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
472-579 2.03e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 50.93  E-value: 2.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  472 QLP--GPAKPPPQQPG-PAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAK 548
Cdd:PRK14971   361 QLTqkGDDASGGRGPKqHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462613745  549 PSAQQSTKPvsqtGSGKPLQPPTVS------PSAKQP 579
Cdd:PRK14971   441 STAPQAVRP----AQFKEEKKIPVSkvsslgPSTLRP 473
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
326-587 2.04e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 50.70  E-value: 2.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  326 PGPAKPPAQPSGltkplAQQPGTVKPPVQPPGTTKPPAQPlGPAKPPAQQTGSEKPSSEQPGPKalaqppgvgktPAQQP 405
Cdd:PLN03209   324 PSQRVPPKESDA-----ADGPKPVPTKPVTPEAPSPPIEE-EPPQPKAVVPRPLSPYTAYEDLK-----------PPTSP 386
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  406 GPAkPPTQQVGTPKPL--AQQPGLQSPAKAPGPTKTpVQQPGPGKIPAQQAGPGKTSAQQTGpTKPPSqLPGPAKPPPQQ 483
Cdd:PLN03209   387 IPT-PPSSSPASSKSVdaVAKPAEPDVVPSPGSASN-VPEVEPAQVEAKKTRPLSPYARYED-LKPPT-SPSPTAPTGVS 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  484 PGPAKPP--PQQPGSAKP----PPQQPGSTKPPPQQP--------GPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:PLN03209   463 PSVSSTSsvPAVPDTAPAtaatDAAAPPPANMRPLSPyavyddlkPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALA 542
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 2462613745  550 SAQQSTKPvsqtgSGKPLQPPTVSPSAKqPPSQGLPKT 587
Cdd:PLN03209   543 DEQHHAQP-----KPRPLSPYTMYEDLK-PPTSPTPSP 574
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
443-566 2.04e-05

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 50.96  E-value: 2.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGpaKPPPQQPGPAKPPPQQPGSAKPP--PQQPGSTKPPPQQPGPAKPS 520
Cdd:TIGR01628  379 QPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNG--QPLGWPRMSMMPTPMGPGGPLRPngLAPMNAVRAPSRNAQNAAQK 456
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKP 566
Cdd:TIGR01628  457 PPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATP 502
PDZ_MPP5-like cd06798
PDZ domain of membrane palmitoylated protein 5 (MPP5), Drosophila Stardust, and related ...
4535-4581 2.25e-05

PDZ domain of membrane palmitoylated protein 5 (MPP5), Drosophila Stardust, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP5, Drosophila Stardust, and related domains. MPP5 (also known as MAGUK p55 subfamily member 1, protein associated with Lin-7 1 or PALS1) and Drosophila Stardust are membrane-associated guanylate kinase (MAGUK)-like proteins that serve as signaling and scaffolding proteins, linking different proteins critical to the formation and maintenance of tight junctions (TJ) and apical-basal polarity. Apical-basal polarity determinants cluster in complexes; in particular, the Crumbs complex (Crb, MPP5, and PATJ) and the PAR/aPKC-complex (PAR-3, PAR-6, aPKC) determine the apical plasma membrane domain. Within the Crumbs complex, Crb is stabilized in the plasma membrane by MPP5, which in turn recruits PATJ and Lin-7 to the complex. MPP5 also links the Crumbs complex with the PAR/aPKC-complex. The Drosophila homolog of the Crumbs complex is the (CRB)-Stardust (Sdt)-Discs Lost (Dlt) complex. MPP5 also acts as an interaction partner for SARS-CoV envelope protein E, which results in delayed formation of TJs and dysregulation of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP5-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467259 [Multi-domain]  Cd Length: 79  Bit Score: 45.41  E-value: 2.25e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462613745 4535 IAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06798     25 ISRIVKGGAAEKSGLLHEGDEILEINGIEIRGKDVNEVCDLLADMHG 71
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
462-580 2.37e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 47.73  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  462 QQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKP--PPQQPGPAKPSPQQPGSTKPPSQQPGSAKP 539
Cdd:pfam15240   38 QSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPqgPPPQGGPRPPPGKPQGPPPQGGNQQQGPPP 117
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  540 SAQQPSPAKPSAQQSTKPVSQTG-----SGKPLQPPTVSPSAKQPP 580
Cdd:pfam15240  118 PGKPQGPPPQGGGPPPQGGNQQGpppppPGNPQGPPQRPPQPGNPQ 163
PDZ3_Par3-like cd23059
PDZ domain 3 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
4513-4576 2.48e-05

PDZ domain 3 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP, Drosophila Bazooka) and related domains. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par-3 and the Par complex include Par3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467272 [Multi-domain]  Cd Length: 103  Bit Score: 46.12  E-value: 2.48e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745 4513 GLGIRIVG--GKEIPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd23059     17 GLGVSVKGktSKEDNGGKADLGIFIKSIIHGGAASKDGRLRVNDQLIAVNGESLLGLTNSEAMETL 82
PRK11901 PRK11901
hypothetical protein; Reviewed
367-540 2.50e-05

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 49.68  E-value: 2.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  367 GPAKPPAQQT----GSEK-----PSSEQPGPkalAQPPGVGKTPAQqPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPgpt 437
Cdd:PRK11901    60 SPTEHESQQSsnnaGAEKnidlsGSSSLSSG---NQSSPSAANNTS-DGHDASGVKNTAPPQDISAPPISPTPTQAA--- 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  438 ktPVQQPG-------PGKIP---AQQAG------PGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPP 501
Cdd:PRK11901   133 --PPQTPNgqqrielPGNISdalSQQQGqvnaasQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAV 210
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 2462613745  502 qqpgstKPPPQQPGPAKPSPQ-QPGSTKPPSQQPGSAKPS 540
Cdd:PRK11901   211 ------NHHKTATVAVPPATSgKPKSGAASARALSSAPAS 244
PDZ2_DLG5-like cd06765
PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
4516-4577 2.72e-05

PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PSZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467246 [Multi-domain]  Cd Length: 77  Bit Score: 45.03  E-value: 2.72e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 4516 IRIVGGKEiPGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIS 4577
Cdd:cd06765      2 INLSGQKD-SGISLENGVFISRIVPGSPAAKEGSLTVGDRIIAINGIALDNKSLSECEALLR 62
PRK11633 PRK11633
cell division protein DedD; Provisional
420-530 2.72e-05

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 48.85  E-value: 2.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  420 PLAQQPGLQSPAKA-PGPTKTPVQQPGPGKIPAQQAGPGKTSAQQtgptkpPSQLPGPAKPPPQQPGPAKPPPQQPgsaK 498
Cdd:PRK11633    42 PLVPKPGDRDEPDMmPAATQALPTQPPEGAAEAVRAGDAAAPSLD------PATVAPPNTPVEPEPAPVEPPKPKP---V 112
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462613745  499 PPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPP 530
Cdd:PRK11633   113 EKPKPKPKPQQKVEAPPAPKPEPKPVVEEKAA 144
dnaA PRK14086
chromosomal replication initiator protein DnaA;
393-566 3.16e-05

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 50.21  E-value: 3.16e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  393 QPPGVGKTPAQQPGPAKPP---------TQQVGTPKPLAQQPGLQSPAKAPG-PTKTPVQQPGPGKIPAQQAGPgktSAQ 462
Cdd:PRK14086    94 EPAPPPPHARRTSEPELPRpgrrpyegyGGPRADDRPPGLPRQDQLPTARPAyPAYQQRPEPGAWPRAADDYGW---QQQ 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  463 QTGPTkPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKP---PP-----QQPGPAKPSPQQPGSTKPPSQQP 534
Cdd:PRK14086   171 RLGFP-PRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPdwdRPrrdrtDRPEPPPGAGHVHRGGPGPPERD 249
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2462613745  535 GSAKPSAQQPSPAKPSAQQSTKPvsqtGSGKP 566
Cdd:PRK14086   250 DAPVVPIRPSAPGPLAAQPAPAP----GPGEP 277
PHA03378 PHA03378
EBNA-3B; Provisional
654-1043 3.21e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 50.45  E-value: 3.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  654 PSSPQPKL-KTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKP 732
Cdd:PHA03378   527 PSPPQPRAgRRAPCVYTEDLDIESDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTP 606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  733 APPKEPSVPSEqDKAPVADDKPKQPKMVKP------TTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSF 806
Cdd:PHA03378   607 EPPTTQSHIPE-TSAPRQWPMPLRPIPMRPlrmqpiTFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTML 685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  807 PPTGEKVSPFDSKAIPRPASDSKIisHPGPSSESKGQKQVDPV-----QKKEEPKKAQTKMSP---KPDAKPMPKGSPT- 877
Cdd:PHA03378   686 PIQWAPGTMQPPPRAPTPMRPPAA--PPGRAQRPAAATGRARPpaaapGRARPPAAAPGRARPpaaAPGRARPPAAAPGr 763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  878 --PPGPRPTAGQTVPTPQQSPKPQEQSRrfslnlGSITDAPKSQ-PTTPQETVTGKLFGFGASIFSQASNLISTAGQPGP 954
Cdd:PHA03378   764 arPPAAAPGAPTPQPPPQAPPAPQQRPR------GAPTPQPPPQaGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGR 837
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  955 HSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAP---PAPAKSIPVKKETKAPAAEKLEpKAEQAPTVKRTET-------- 1023
Cdd:PHA03378   838 PSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPvfyPPVLQPIQVMRQLGSVRAAAAS-TVTQAPTEYTGERrgvgpmhp 916
                          410       420
                   ....*....|....*....|
gi 2462613745 1024 EKKPPPIKDSKSLTAEPQKA 1043
Cdd:PHA03378   917 TDIPPSKRAKTDAYVESQPP 936
Drf_FH1 pfam06346
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ...
464-585 3.32e-05

Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.


Pssm-ID: 461881 [Multi-domain]  Cd Length: 157  Bit Score: 47.17  E-value: 3.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  464 TGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSA--KPPPQQPGSTKPPPQQPGPAKPS----PQQPGSTKPPSQQPGSA 537
Cdd:pfam06346   10 SSTIPLPPGACIPTPPPLPGGGGPPPPPPLPGSAaiPPPPPLPGGTSIPPPPPLPGAASipppPPLPGSTGIPPPPPLPG 89
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  538 KPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAK----QPPSQGLP 585
Cdd:pfam06346   90 GAGIPPPPPPLPGGAGVPPPPPPLPGGPGIPPPPPFPGGPgippPPPGMGMP 141
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
475-557 3.35e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 50.27  E-value: 3.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  475 GPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQS 554
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                   ...
gi 2462613745  555 TKP 557
Cdd:PRK12270   117 VTP 119
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
459-599 3.93e-05

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 49.15  E-value: 3.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  459 TSAQQTGPTKPPsqlPGPAKPPPqQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQ------QPGPAKPSPQQPGSTKPPSQ 532
Cdd:pfam07174   36 VAHADPEPAPPP---PSTATAPP-APPPPPPAPAAPAPPPPPAAPNAPNAPPPPadpnapPPPPADPNAPPPPAVDPNAP 111
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  533 QPGSAKPSAQQPSPAKPSAQQSTKpVSQTGSGKPLqpptVSPSAKQPPSQGLPKticPLCNTTELLL 599
Cdd:pfam07174  112 EPGRIDNAVGGFSYVVPAGWVESD-ATHLDYGSAL----LSKTTGQPPEGGQPP---PVANDTRVVL 170
PDZ1_FL-whirlin cd06740
PDZ domain 1 of the full-length isoform of whirlin and related domains; PDZ (PSD-95 ...
4497-4576 4.58e-05

PDZ domain 1 of the full-length isoform of whirlin and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of the full-length isoform of whirlin and related domains. Whirlin is an essential protein for developmental pathways in photoreceptor cells of the retina and hair cells of the inner ear. The full-length whirlin isoform has two harmonin N-like domains, three PDZ domains, a proline-rich region, and a PDZ-binding motif. Whirlin isoforms may form different complexes at the periciliary membrane complex (PMC) in photoreceptors, and the stereociliary tip and base in inner ear hair cells. It interacts with ADGRV1 and usherin at the PMC; with SANS and RpgrORF15 at the connecting cilium in photoreceptors; with EPS8, MYO15A, p55, and CASK proteins at the stereociliary tip of inner ear hair cells; and with ADGRV1, usherin, and PDZD7 at the stereociliary base in inner ear hair cells. Mutations in the gene encoding whirlin (WHRN; also known as USH2D and DFNB31), have been found to cause either USH2 subtype (USH2D) or autosomal recessive non-syndromic deafness type 31 (DFNB31). Whirlin is the key protein in the USH2 complex (whirlin, usherin and GPR98) which recruits other USH2 causative proteins at the periciliary membrane in photoreceptors and the ankle link of the stereocilia in hair cells. Whirlin's interaction with espin, another stereociliary protein, may be important for the architecture of the USH2 complex. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This whirlin family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467222 [Multi-domain]  Cd Length: 82  Bit Score: 44.66  E-value: 4.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSKDHTvsgnGLGIRIVGGKEipgHSgeIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06740      2 RQVTLKRSKSHE----GLGFSIRGGAE---HG--VGIYVSLVEPGSLAEKEG-LRVGDQILRVNDVSFEKVTHAEAVKIL 71
PDZ3_LNX1_2-like cd06679
PDZ domain 3 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4514-4585 4.89e-05

PDZ domain 3 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2) and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467167 [Multi-domain]  Cd Length: 88  Bit Score: 44.55  E-value: 4.89e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 4514 LGIRIVGGKEipGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGEAEI 4585
Cdd:cd06679     13 LGISVAGGRG--SRRGDLPIYVTNVQPDGCLGRDGRIKKGDVLLSINGISLTNLSHSEAVAVLKASAASSSI 82
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
450-580 5.34e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 49.48  E-value: 5.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  450 PAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGStkPPPQQPGPAKPSPQQPGSTKP 529
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPE--TTSQLLAARQQLQRAQGATKA 438
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  530 PSQQPGSAKPSAQQPSPAKPSAQQStkPVSQTGSGKPLQPPTVSPSAKQPP 580
Cdd:PRK07994   439 KKSEPAAASRARPVNSALERLASVR--PAPSALEKAPAKKEAYRWKATNPV 487
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
468-549 5.69e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 49.89  E-value: 5.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  468 KPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQ-PGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSP 546
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAApPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                   ...
gi 2462613745  547 AKP 549
Cdd:PRK12270   117 VTP 119
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
450-585 5.75e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.46  E-value: 5.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  450 PAQQAGPGKTSAQqtgPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPP---------PQQPGSTKPPPQQPGPAKPS 520
Cdd:PRK07003   360 PAVTGGGAPGGGV---PARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGaalapkaaaAAAATRAEAPPAAPAPPATA 436
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  521 PQ-QPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTV---SPSAKQPPSQGLP 585
Cdd:PRK07003   437 DRgDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAfepAPRAAAPSAATPA 505
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
428-518 5.88e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 49.50  E-value: 5.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  428 QSPAKAPGPTK--TPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPqqpgsAKPPPQQPG 505
Cdd:PRK12270    36 YGPGSTAAPTAaaAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPA-----AAAAAAPAA 110
                           90
                   ....*....|....*
gi 2462613745  506 STKPPPQQP--GPAK 518
Cdd:PRK12270   111 AAVEDEVTPlrGAAA 125
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
339-586 6.14e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 49.38  E-value: 6.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  339 TKPLAQQPGTVKP--PVQPPGTTKPPAQPLGPaKPPAQQTGSEKpssEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVG 416
Cdd:NF033839   157 TKPETPQPENPEHqkPTTPAPDTKPSPQPEGK-KPSVPDINQEK---EKAKLAVATYMSKILDDIQKHHLQKEKHRQIVA 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  417 TPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQ-QPGPAKPPPQQPG 495
Cdd:NF033839   233 LIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGmQPSPQPEKKEVKP 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  496 SAKPPPQQpgsTKPPPQQPGP-AKPSPQQPGSTKPPsqQPGSAKPSAqQPSPAKPSAQQSTKPVSQTGSGKPlQPPTVSP 574
Cdd:NF033839   313 EPETPKPE---VKPQLEKPKPeVKPQPEKPKPEVKP--QLETPKPEV-KPQPEKPKPEVKPQPEKPKPEVKP-QPETPKP 385
                          250
                   ....*....|..
gi 2462613745  575 SAKQPPSQGLPK 586
Cdd:NF033839   386 EVKPQPEKPKPE 397
PDZ1_hSTXBP4-PDZ2_GgSTXBP4-like cd06698
PDZ1 domain of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus ...
4513-4576 6.24e-05

PDZ1 domain of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus uncharacterized STXBP4 isoform X1, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of human syntaxin-binding protein 4 (STXBP4), PDZ2 domain of Gallus gallus uncharacterized STXBP4 isoform X1, and related domains. Human STXBP4 (also known as Synip) includes a single PDZ domain, a coiled-coil domain, and a WW domain (named for its two conserved tryptophans); Gallus gallus STXBP4 isoform X1 contains 2 PDZ domains (PDZ1 and PDZ2). Human STXBP4 plays a role in the translocation of transport vesicles from the cytoplasm to the plasma membrane: insulin induces the dissociation of the STXBP4 and STX4 complex liberating STX4 to interact with Vamp2, and to form the SNARE complex thereby promoting vesicle fusion. It may also play a role in the regulation of insulin release by pancreatic beta cells after stimulation by glucose. Human STXBP4 is also known to physically associate with a prominent isoform of TP63 (deltaNp63alpha 9) whose overexpression promotes squamous cell carcinoma development, and in doing so prevents degradation of this isoform by the Cdc20-APC/C complex, Itch, and RACK1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This STXBP4-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467184 [Multi-domain]  Cd Length: 89  Bit Score: 44.60  E-value: 6.24e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 4513 GLGIRIVGGKEipgHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06698     12 GLGLSIVGGIN---RPEGPMVFIQEVIPGGDCYKDGRLRPGDQLVSINKESLIGVTLEEAKSIL 72
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
381-530 6.45e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.40  E-value: 6.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  381 PSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPT---KTPVQQPGPGKIPAQQAGPG 457
Cdd:PHA03307   760 NPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTAskrKSRSHTPDGGSESSGPARPP 839
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  458 KTSAQQTGPTKPPSQLPGPAKPPPQQPG-------PAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAkPSPQQPGSTKPP 530
Cdd:PHA03307   840 GAAARPPPARSSESSKSKPAAAGGRARGkngrrrpRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPR-PAPRVKLGPMPP 918
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
481-587 6.50e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.39  E-value: 6.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  481 PQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQpgpAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQ 560
Cdd:PRK14971   363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAA---AAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNP 439
                           90       100
                   ....*....|....*....|....*..
gi 2462613745  561 TGSGKPLQPPTVSPSAKQPPSQGLPKT 587
Cdd:PRK14971   440 PSTAPQAVRPAQFKEEKKIPVSKVSSL 466
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
442-522 7.14e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 49.50  E-value: 7.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  442 QQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSP 521
Cdd:PRK12270    36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                   .
gi 2462613745  522 Q 522
Cdd:PRK12270   116 E 116
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
398-566 7.43e-05

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 48.64  E-value: 7.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  398 GKTPAQQPGPakpptqQVG--TPKPLAqqpGLQSPAKAPGPTKTpvqqpGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPG 475
Cdd:PRK12373   172 GKGPVVKPGP------QIGryASEPAG---GLTSLTEEAGKARY-----NASKALAEDIGDTVKRIDGTEVPLLAPWQGD 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  476 PAKPPPQQPGPAKPPPQQPGSAKPPPqqpgstKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQST 555
Cdd:PRK12373   238 AAPVPPSEAARPKSADAETNAALKTP------ATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAAAKPALEDKP 311
                          170
                   ....*....|.
gi 2462613745  556 KPVSQTGSGKP 566
Cdd:PRK12373   312 RPLGIARPGGA 322
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
443-566 7.43e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 49.17  E-value: 7.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPpqQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQ 522
Cdd:PRK14954   375 RNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELP--SPASAPTPEQQPPVARSAPLPPSPQASAPR 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  523 QPGSTKpPSQQPGS---------AKPSAQQPspakpsAQQSTKPVSQTGSGKP 566
Cdd:PRK14954   453 NVASGK-PGVDLGSwqgkfmnftRNGSRKQP------VQASSSDAAQTGVFEG 498
PDZ3_DLG5-like cd06767
PDZ domain 3 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
4513-4579 7.66e-05

PDZ domain 3 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467248 [Multi-domain]  Cd Length: 82  Bit Score: 43.85  E-value: 7.66e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4513 GLGIRIVGGKeipghSGeiGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQ 4579
Cdd:cd06767     14 PLGISIVSGE-----NG--GIFVSSVTEGSLAHQAG-LEYGDQLLEVNGINLRNATEQQAALILRQC 72
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
484-576 8.07e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.04  E-value: 8.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  484 PGPAkPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGS 563
Cdd:PRK14950   362 PVPA-PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
                           90
                   ....*....|...
gi 2462613745  564 GKPLQPPTVSPSA 576
Cdd:PRK14950   441 KPKYTPPAPPKEE 453
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
496-758 8.11e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 48.77  E-value: 8.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  496 SAKPPPQQPGSTKPPpqqpgpaKPSPQQPGSTKPPSqqpgsakpSAQQPSPAKPSAQqSTKPVSQTGSGKPLQPPTvSPS 575
Cdd:PLN03209   325 SQRVPPKESDAADGP-------KPVPTKPVTPEAPS--------PPIEEEPPQPKAV-VPRPLSPYTAYEDLKPPT-SPI 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  576 AKQPPSQglPKTICPLCNTTelllhVPEKANFNTCTECQTTVcslcgfnpnphltEVKEWLCLNCQMKRALG-----GDL 650
Cdd:PLN03209   388 PTPPSSS--PASSKSVDAVA-----KPAEPDVVPSPGSASNV-------------PEVEPAQVEAKKTRPLSpyaryEDL 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  651 APvPSSPQPKLKTA---PVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTlhgSPSAKAKQPPEAD 727
Cdd:PLN03209   448 KP-PTSPSPTAPTGvspSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPT---SPSPAAPVGKVAP 523
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2462613745  728 SLSKPAPPKEPSVPseqDKAPVADDKPKQPK 758
Cdd:PLN03209   524 SSTNEVVKVGNSAP---PTALADEQHHAQPK 551
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
442-550 8.27e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 46.18  E-value: 8.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  442 QQPGPGKIPAQQ--AGPGKTSAQQTGPTKPPSQlPGPAKP--PPQQPGPAKPPPQQPGsakPPPQQPGSTKPPPQQPGPA 517
Cdd:pfam15240   47 QGPPPGGFPPQPpaSDDPPGPPPPGGPQQPPPQ-GGKQKPqgPPPQGGPRPPPGKPQG---PPPQGGNQQQGPPPPGKPQ 122
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 2462613745  518 KPSPQQPGSTKPP--SQQPGSAKPSAQQPSPAKPS 550
Cdd:pfam15240  123 GPPPQGGGPPPQGgnQQGPPPPPPGNPQGPPQRPP 157
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
366-570 8.62e-05

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 48.27  E-value: 8.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  366 LGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQP-GPAKPPTQQVGTPKPLAQQPGLQSPAKAPgptktpVQQP 444
Cdd:pfam15279   91 ESVSPGPSSSASPSSSPTSSNSSKPLISVASSSKLLAPKPhEPPSLPPPPLPPKKGRRHRPGLHPPLGRP------PGSP 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  445 GPGKIPAQQAGPGKTsaQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPP-------------P 511
Cdd:pfam15279  165 PMSMTPRGLLGKPQQ--HPPPSPLPAFMEPSSMPPPFLRPPPSIPQPNSPLSNPMLPGIGPPPKPPrnlgppsnpmhrpP 242
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  512 QQPGPAKPSPQQPGSTkPPSQQPGsakPSAQQPSPAKPSAQQSTKPVSQ-----TGSGKPLQPP 570
Cdd:pfam15279  243 FSPHHPPPPPTPPGPP-PGLPPPP---PRGFTPPFGPPFPPVNMMPNPPemnfgLPSLAPLVPP 302
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
445-529 9.04e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 49.12  E-value: 9.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  445 GPGKIPAQQAGPGKTSAQQTGPTKPPsqlPGPAKPPPQQPGPAKPPPQQ-PGSAKPPPQQPGSTKPPPQQPGPAKPSPQQ 523
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAP---AAKAPAAPAPAPPAAAAPAApPKPAAAAAAAAAPAAPPAAAAAAAPAAAAV 113

                   ....*.
gi 2462613745  524 PGSTKP 529
Cdd:PRK12270   114 EDEVTP 119
PRK10905 PRK10905
cell division protein DamX; Validated
367-587 9.14e-05

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 48.01  E-value: 9.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  367 GPAKPPAQQTGSEKPS---SEQPGPKALAQPPGVGKTPAQQPGPAKPptQQVGTPkPLAQQPGL-QSPAKAPGPTKTPVQ 442
Cdd:PRK10905    22 APSTSSSDQTASGEKSidlAGNATDQANGVQPAPGTTSAEQTAGNTQ--QDVSLP-PISSTPTQgQTPVATDGQQRVEVQ 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  443 QPGPGKI--PAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPgPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPs 520
Cdd:PRK10905    99 GDLNNALtqPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRQT-AKTQTAERPATTRPARKQAVIEPKKPQATAKTEP- 176
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  521 pqqpgstKPPSQQPGSAKPSAqQPSPAKPSAQQSTKPVSQTGSGKPLQppTVSPSAKQPPSQGLPKT 587
Cdd:PRK10905   177 -------KPVAQTPKRTEPAA-PVASTKAPAATSTPAPKETATTAPVQ--TASPAQTTATPAAGGKT 233
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
391-514 1.01e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 48.62  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  391 LAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPP 470
Cdd:PRK14971   359 LAQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVN 438
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745  471 SQLPGPAKPPPQQPGPAKP-PPQQPGSAKPPPQQPgsTKPPPQQP 514
Cdd:PRK14971   439 PPSTAPQAVRPAQFKEEKKiPVSKVSSLGPSTLRP--IQEKAEQA 481
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
32-411 1.01e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.83  E-value: 1.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   32 PSHTAIPAGMEADLSQLSEEERRQIAAVMSRAQGLPKGSVPPAAAESPSMHRKQELDSSHPPKQSGRPPDPGRPAQPGLS 111
Cdd:PRK07764   427 AAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  112 ksRTTDTFRSE-----QKLPGRSPSTISLKESKSR------TDLKEEHKSSMMPGFLSEVNALSAVSSVVnkfnpfdlis 180
Cdd:PRK07764   507 --DDAATLRERwpeilAAVPKRSRKTWAILLPEATvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTAL---------- 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  181 dseasQEETTKKQKVVQKEQGKPEGiikPPLQQQPPKPIPKQQGPGRDPLQQDGTPKSISSQQPEKikSQPPGTGKPIQG 260
Cdd:PRK07764   575 -----AEELGGDWQVEAVVGPAPGA---AGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA--AAAPAEASAAPA 644
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  261 PTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPgheKSQPGPAKPPAQPSGLTK 340
Cdd:PRK07764   645 PGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP---APAATPPAGQADDPAAQP 721
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  341 PLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQqtgsekPSSEQPGPKALAQPPGVGKTPAQQPGPAKPP 411
Cdd:PRK07764   722 PQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP------AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PDZ2_FL-whirlin cd06741
PDZ domain 2 of the full-length isoform of whirlin and related domains; PDZ (PSD-95 ...
4510-4576 1.03e-04

PDZ domain 2 of the full-length isoform of whirlin and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of the full-length isoform of whirlin and related domains. Whirlin is an essential protein for developmental pathways in photoreceptor cells of the retina and hair cells of the inner ear. The full-length whirlin isoform has two harmonin N-like domains, three PDZ domains, a proline-rich region, and a PDZ-binding motif. Whirlin isoforms may form different complexes at the periciliary membrane complex (PMC) in photoreceptors, and the stereociliary tip and base in inner ear hair cells. It interacts with ADGRV1 and usherin at the PMC; with SANS and RpgrORF15 at the connecting cilium in photoreceptors; with EPS8, MYO15A, p55, and CASK proteins at the stereociliary tip of inner ear hair cells; and with ADGRV1, usherin, and PDZD7 at the stereociliary base in inner ear hair cells. Mutations in the gene encoding whirlin (WHRN; also known as USH2D and DFNB31), have been found to cause either USH2 subtype (USH2D) or autosomal recessive non-syndromic deafness type 31 (DFNB31). Whirlin is the key protein in the USH2 complex (whirlin, usherin and GPR98) which recruits other USH2 causative proteins at the periciliary membrane in photoreceptors and the ankle link of the stereocilia in hair cells. Whirlin's interaction with espin, another stereociliary protein, may be important for the architecture of the USH2 complex. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This whirlin family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467223 [Multi-domain]  Cd Length: 84  Bit Score: 43.79  E-value: 1.03e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4510 SGNGLGIRIVGGKEIpghsgEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06741     10 DGQSLGLMIRGGAEY-----GLGIYVTGVDPGSVAENAG-LKVGDQILEVNGRSFLDITHDEAVKIL 70
Drf_FH1 pfam06346
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ...
395-541 1.03e-04

Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.


Pssm-ID: 461881 [Multi-domain]  Cd Length: 157  Bit Score: 45.63  E-value: 1.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  395 PGVGKTPAQQPGPAKPPtqqvgtPKPLAQQPGLQSPAKAPGPTKTPVQQPGPG--KIPAQQAGPGKTSaqqtgpTKPPSQ 472
Cdd:pfam06346    7 PGDSSTIPLPPGACIPT------PPPLPGGGGPPPPPPLPGSAAIPPPPPLPGgtSIPPPPPLPGAAS------IPPPPP 74
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  473 LPG-----PAKPPPQQPGPAKPPPQQPGSAK-PPPQQP----GSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA 541
Cdd:pfam06346   75 LPGstgipPPPPLPGGAGIPPPPPPLPGGAGvPPPPPPlpggPGIPPPPPFPGGPGIPPPPPGMGMPPPPPFGFGVPAA 153
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
369-484 1.04e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 48.62  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  369 AKPPAQQTGSEKPSSEQPgpkALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQS-PAKAPGPTKTPVQQPGPG 447
Cdd:PRK14971   369 ASGGRGPKQHIKPVFTQP---AAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTvSVDPPAAVPVNPPSTAPQ 445
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462613745  448 KIPAQQAGPGKTSAQQTGPTKPPSQLpGPAKPPPQQP 484
Cdd:PRK14971   446 AVRPAQFKEEKKIPVSKVSSLGPSTL-RPIQEKAEQA 481
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
426-528 1.08e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 48.52  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  426 GLQSPAKAPGP-TKTPVQQPGPGKIPAQQAGP--GKTSAQQTGPTKPPSQLPGPAKP-------PPQQPGPAKPPPQQPG 495
Cdd:PRK14959   379 SAPSGSAAEGPaSGGAATIPTPGTQGPQGTAPaaGMTPSSAAPATPAPSAAPSPRVPwddappaPPRSGIPPRPAPRMPE 458
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462613745  496 sAKPPPQQPGSTKP----PPQQPGPAKPSPQQPGSTK 528
Cdd:PRK14959   459 -ASPVPGAPDSVASasdaPPTLGDPSDTAEHTPSGPR 494
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
478-586 1.14e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 45.80  E-value: 1.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  478 KPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA--QQPSPAKPSAQQST 555
Cdd:pfam15240   45 GPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPPQGGPRPPPGKPQGPPPQGgnQQQGPPPPGKPQGP 124
                           90       100       110
                   ....*....|....*....|....*....|....
gi 2462613745  556 KPVSQTGS---GKPLQPPTVSPSAKQPPSQGLPK 586
Cdd:pfam15240  125 PPQGGGPPpqgGNQQGPPPPPPGNPQGPPQRPPQ 158
PDZ1_ZO1-like cd06727
PDZ domain 1 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ ...
4507-4585 1.17e-04

PDZ domain 1 of Zonula Occludens-1 (ZO-1), homologs ZO-2 and ZO-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of ZO-1, -2, -3 and related domains. Zonula occludens proteins (ZO-1, ZO-2, ZO-3) are multi-PDZ domain proteins involved in the maintenance and biogenesis of multi-protein networks at the cytoplasmic surface of intercellular contacts in epithelial and endothelial cells. They have three N-terminal PDZ domains, PDZ1-3, followed by a Src homology-3 (SH3) domain and a guanylate kinase (GuK)-like domain. Among protein-protein interactions for all ZO proteins is the binding of the first PDZ domain (PDZ1) to the C-termini of claudins, and the homo- and hetero-dimerization of ZO-proteins via their second PDZ domain (PDZ2), which takes place by symmetrical domain swapping of the first two beta-strands of PDZ2. At the cell level, ZO-1 and ZO-2 are involved in polarity maintenance, gene transcription, cell proliferation, and tumor cell metastasis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This ZO family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467209 [Multi-domain]  Cd Length: 87  Bit Score: 43.80  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4507 HTV-----SGNGLGIRIVGGKEIPG-HSGEIGAYIAKILPGGSAEqtGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQS 4580
Cdd:cd06727      1 HTVtlhraPGFGFGIAVSGGRDNPHfQSGDTSIVISDVLKGGPAE--GKLQENDRVVSVNGVSMENVEHSFAVQILRKCG 78

                   ....*
gi 2462613745 4581 GEAEI 4585
Cdd:cd06727     79 KTANI 83
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
450-519 1.38e-04

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 47.23  E-value: 1.38e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  450 PAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGP-AKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKP 519
Cdd:pfam07174   43 PAPPPPSTATAPPAPPPPPPAPAAPAPPPPPAAPNAPnAPPPPADPNAPPPPPADPNAPPPPAVDPNAPEP 113
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
485-569 1.41e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 48.35  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  485 GPAKPPPQQPGSAKPPPQQPGsTKPPPQQPGPAKPSPQQPGSTkPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSG 564
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASA-PAAAPAAKAPAAPAPAPPAAA-APAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVE 114

                   ....*
gi 2462613745  565 KPLQP 569
Cdd:PRK12270   115 DEVTP 119
motB PRK05996
MotB family protein;
380-560 1.49e-04

MotB family protein;


Pssm-ID: 235665 [Multi-domain]  Cd Length: 423  Bit Score: 47.77  E-value: 1.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  380 KPSSEQPGPKALaQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQqpGLQSPAKAPGPTK--------TP-------VQQP 444
Cdd:PRK05996    72 KLTDRKPSEKGL-KDPVDGAEGEQKPGKSKFEEDQRVEGSSAVT--GDDTTRTSGDQTNyseadlfrNPyavlaeiAQEV 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  445 GPgkiPAQQAGPGKTSAQQTGPT---------KPP-------SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTK 508
Cdd:PRK05996   149 GQ---QANVSAKGDGGAAQSGPAtgadggeayRDPfdpdfwsKQVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAA 225
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  509 PPPqQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQ 560
Cdd:PRK05996   226 PLP-QAQPKKAATEEELIADAKKAATGEPAANAAKAAKPEPMPDDQQKEAEQ 276
PRK10263 PRK10263
DNA translocase FtsK; Provisional
3461-3966 1.54e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.16  E-value: 1.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3461 RRRRTKKSVDTSVQTDDEDQDEW---------DMPTRSRRKARVGKY--------GDSMTEADKTKPLSKVSSIAVQTVA 3523
Cdd:PRK10263   258 MGRQTDAALFSGKRMDDDEEITYtargvaadpDDVLFSGNRATQPEYdeydpllnGAPITEPVAVAAAATTATQSWAAPV 337
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3524 EISVQTEPVGTIRTPSIRARVDAKVEIIKHISAPEktykggslgCQTEADSDTQSPQYLSATSPPKDK-KRPTPLEIGYS 3602
Cdd:PRK10263   338 EPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPV---------IAPAPEGYPQQSQYAQPAVQYNEPlQQPVQPQQPYY 408
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3603 SHLRADSTVQLAPSPPKSPKVLYSPISPLSPGKALESAFVPYEKPLPDDisPQKVLHPDMAKVPPASPKTAKMMQRSMSD 3682
Cdd:PRK10263   409 APAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFA--PQSTYQTEQTYQQPAAQEPLYQQPQPVEQ 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3683 PKPLSPT--ADES--SRAPFQYTEGYTTKGSQTMTSSGA-----QKKVKRTLPN------------PPPEEIST------ 3735
Cdd:PRK10263   487 QPVVEPEpvVEETkpARPPLYYFEEVEEKRAREREQLAAwyqpiPEPVKEPEPIksslkapsvaavPPVEAAAAvsplas 566
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3736 ----GTQSTFSTMGTVSRRRICRTNTMARAKILQDIDRELDLVERESAKLRKKQAELD--------EEEKEIDAKLRYLE 3803
Cdd:PRK10263   567 gvkkATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGiklpsqraAEEKAREAQRNQYD 646
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3804 MGINRRK---EALLKEREKRERAYLQ-------------GVAEDRDYMSDSEVSSTRPTRIESQHGIERPRTAPQTEFSQ 3867
Cdd:PRK10263   647 SGDQYNDdeiDAMQQDELARQFAQTQqqrygeqyqhdvpVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDD 726
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3868 F-----------------IPPQTQTESQlvPPTSPYTQYQYSSPALPTQAPTSYTQ-QSHFEQQTLYHQQVSPYQTQPTF 3929
Cdd:PRK10263   727 FefspmkallddgpheplFTPIVEPVQQ--PQQPVAPQQQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVAPQPQY 804
                          570       580       590
                   ....*....|....*....|....*....|....*..
gi 2462613745 3930 QavatmsftpQVQPTPTPQPSYQLPSQMMVIQQKPRQ 3966
Cdd:PRK10263   805 Q---------QPQQPVAPQPQYQQPQQPVAPQPQYQQ 832
PHA03269 PHA03269
envelope glycoprotein C; Provisional
467-580 1.57e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 47.80  E-value: 1.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPgSTKPPPQ---QPGPAkPSPQQPGSTKPpsqQPGSAKPSAQQ 543
Cdd:PHA03269    33 TSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDL-AQAPTPAaseKFDPA-PAPHQAASRAP---DPAVAPQLAAA 107
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 2462613745  544 PSPakpsaqqstkpvsqtgsgKPLQPPTVSPSAKQPP 580
Cdd:PHA03269   108 PKP------------------DAAEAFTSAAQAHEAP 126
PHA03269 PHA03269
envelope glycoprotein C; Provisional
412-544 1.58e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 47.80  E-value: 1.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  412 TQQVGTPKPLAQ---QPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQqtGPTKPPSQLPGPAKPPPQQPGPAK 488
Cdd:PHA03269    19 IANLNTNIPIPElhtSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQ--APTPAASEKFDPAPAPHQAASRAP 96
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  489 PPPQQPGSAKPPPQQPGStkpPPQQPGPAKPSP-QQPGSTKPPSQQPGSAKPSAQQP 544
Cdd:PHA03269    97 DPAVAPQLAAAPKPDAAE---AFTSAAQAHEAPaDAGTSAASKKPDPAAHTQHSPPP 150
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
465-556 1.59e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 48.35  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  465 GPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPspqqpgstkPPSQQPGSAKPSAQQP 544
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAA---------AAAAPAAPPAAAAAAA 107
                           90
                   ....*....|..
gi 2462613745  545 SPAKPSAQQSTK 556
Cdd:PRK12270   108 PAAAAVEDEVTP 119
PDZ_Radil-like cd06690
PDZ domain of Ras-associating and dilute domain-containing protein (Radil) and related domains; ...
4511-4582 1.60e-04

PDZ domain of Ras-associating and dilute domain-containing protein (Radil) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Radil (also known as protein KIAA1849) and related domains. Radil is required for cell adhesion and migration of neural crest precursors during development. Radil is a component of a Rasip1-Radil-ARHGAP29 complex at endothelial cell-cell junctions. Rap1, via its effectors Radil and Rasip1 and their binding partner ArhGAP29, controls the endothelial barrier by decreasing Rho-mediated radial tension on cell-cell junctions. ArhGAP29 binds the Radil PDZ domain. The Radil PDZ domain also binds kinesin family protein 14 (KIF14); KIF14 negatively regulates Rap1-mediated inside-out integrin activation by tethering Radil on microtubules. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Radil-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467177 [Multi-domain]  Cd Length: 88  Bit Score: 43.43  E-value: 1.60e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 4511 GNGLGIRIVGGKEIPGHSGeiGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIsQQSGE 4582
Cdd:cd06690     12 PKGLGLGLIDGLHTPLRSP--GIYIRTLVPDSPAARDGRLRLGDRILAVNGTSLVGADYQSAMDLI-RTSGD 80
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
449-524 1.67e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 47.69  E-value: 1.67e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  449 IPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQP 524
Cdd:NF041121    15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALP 90
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
422-552 1.69e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 1.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  422 AQQPGLQSPAKAPGPT---KTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQlPGPAKPPPQQPGPAKPPPQQPGSAK 498
Cdd:TIGR01628  378 LQPRMRQLPMGSPMGGamgQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGP-LRPNGLAPMNAVRAPSRNAQNAAQK 456
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  499 PPPqqPGSTKPPPQQPGPAKPSPQQPGSTKppsQQPGSAKPSAQQPSPAKPSAQ 552
Cdd:TIGR01628  457 PPM--QPVMYPPNYQSLPLSQDLPQPQSTA---SQGGQNKKLAQVLASATPQMQ 505
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
375-542 1.73e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 47.88  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  375 QTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPaKPPTQQVGTPKPLAQQPGLQSPAKAPGPtktpvqqpgpgkiPAQQA 454
Cdd:TIGR01628  379 QPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNG-QPLGWPRMSMMPTPMGPGGPLRPNGLAP-------------MNAVR 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  455 GPGKTsaqqtgptkppsqlpgpakpppQQPGPAKPPPQqpgsakPPPQQPGSTKPPPQQPGPAKPS-PQQPGSTKPPSQQ 533
Cdd:TIGR01628  445 APSRN----------------------AQNAAQKPPMQ------PVMYPPNYQSLPLSQDLPQPQStASQGGQNKKLAQV 496

                   ....*....
gi 2462613745  534 PGSAKPSAQ 542
Cdd:TIGR01628  497 LASATPQMQ 505
Drf_FH1 pfam06346
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ...
469-585 1.74e-04

Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.


Pssm-ID: 461881 [Multi-domain]  Cd Length: 157  Bit Score: 45.25  E-value: 1.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPGPAKPPPQQPG---PAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPA----KPSPQQPGSTKPPSQQPGSAKPSA 541
Cdd:pfam06346    2 PPPPLPGDSSTIPLPPGaciPTPPPLPGGGGPPPPPPLPGSAAIPPPPPLPGgtsiPPPPPLPGAASIPPPPPLPGSTGI 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745  542 QQPSPAKPSAQQSTKPVSQTGS-GKPLQPPTVSPSAKQPPSQGLP 585
Cdd:pfam06346   82 PPPPPLPGGAGIPPPPPPLPGGaGVPPPPPPLPGGPGIPPPPPFP 126
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
362-593 1.80e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondria pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2, is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 47.56  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  362 PAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKP-PTQQVGTPKPLAqqpgLQSPAKAPGPTKTP 440
Cdd:cd23959     53 QEEPLYGAVSPEGENPFDGPGLVTASTVSDCYVGNANFYEVDMSDAFAMaPDESLGPFRAAR----VPNPFSASSSTQRE 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  441 VQQPGPGKIPaqqagpgkTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAkpppqqPGSTKPPPQQPGPAKPS 520
Cdd:cd23959    129 THKTAQVAPP--------KAEPQTAPVTPFGQLPMFGQHPPPAKPLPAAAAAQQSSA------SPGEVASPFASGTVSAS 194
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  521 P-QQPGSTKPPSQQPGSAKPSAQQPSPAKPSAqqstkpvsqtgSGKPLQPPTVSPSAKQPPSQGlpktiCPLCN 593
Cdd:cd23959    195 PfATATDTAPSSGAPDGFPAEASAPSPFAAPA-----------SAASFPAAPVANGEAATPTHA-----CTICG 252
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
436-527 1.80e-04

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 47.20  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  436 PTKTPVQQPGPGKIPaqqagPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPqqpgSAKPPPQQPGSTKPPPQQPG 515
Cdd:NF040983    86 PNKVPPPPPPPPPPP-----PPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPP----TTTPPTRTTPSTTTPTPSMH 156
                           90
                   ....*....|....
gi 2462613745  516 PAKPS--PQQPGST 527
Cdd:NF040983   157 PIQPTqlPSIPNAT 170
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
290-471 1.82e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  290 RGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHekSQPGPAKPPAQPSGLTKPLAQQPGTVKPPV------------QPPG 357
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAP--APAGAAAAPAEASAAPAPGVAAPEHHPKHVavpdasdggdgwPAKA 673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  358 TTKPPAQPLGPAKPPAQQTGSEKPSSeQPGPKALAQPPGVgktPAQQPGPAKPPTQQVGTPK-----------------P 420
Cdd:PRK07764   674 GGAAPAAPPPAPAPAAPAAPAGAAPA-QPAPAPAATPPAG---QADDPAAQPPQAAQGASAPspaaddpvplppepddpP 749
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  421 LAQQPGLQSPAKAPGPtktPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPS 471
Cdd:PRK07764   750 DPAGAPAQPPPPPAPA---PAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
225-490 1.82e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 47.75  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  225 PGRDPLQQDGTPKSISSQQPEKIKSQPPGTGKPIQGPTQT---PQTDHAKLPLQRDASRPQTKQADIVrGESVKPSLPSP 301
Cdd:COG5180    246 PATVDAQPEMRPPADAKERRRAAIGDTPAAEPPGLPVLEAgsePQSDAPEAETARPIDVKGVASAPPA-TRPVRPPGGAR 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  302 SKPPIQQPTPGKPPAQQP--GHEKSQPGPAKPPAQPSGLTKPLAQ------QPGTVKPPVQPPGTTKPPAQPLGPAKPPA 373
Cdd:COG5180    325 DPGTPRPGQPTERPAGVPeaASDAGQPPSAYPPAEEAVPGKPLEQgaprpgSSGGDGAPFQPPNGAPQPGLGRRGAPGPP 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  374 Q------QTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPptqqVGTPKPLAQQPGlQSPAKAPGPTKTPVQQPGPG 447
Cdd:COG5180    405 MgagdlvQAALDGGGRETASLGGAAGGAGQGPKADFVPGDAES----VSGPAGLADQAG-AAASTAMADFVAPVTDATPV 479
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  448 KIPAQQAGPGKtsAQQTGPTKPPSQLPGPAKPPPQQPGPAKPP 490
Cdd:COG5180    480 DVADVLGVRPD--AILGGNVAPASGLDAETRIIEAEGAPATED 520
dnaA PRK14086
chromosomal replication initiator protein DnaA;
281-499 1.87e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.90  E-value: 1.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  281 PQTKQADIVRGESVKPSLPSPSKPPIQQP-TPGKPPAQQPGH---------EKSQPGPAKPPAQPsglTKPLAQQPGTVK 350
Cdd:PRK14086    80 RPIRIAITVDPSAGEPAPPPPHARRTSEPeLPRPGRRPYEGYggpraddrpPGLPRQDQLPTARP---AYPAYQQRPEPG 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  351 PPVQPPGTTKPPAQPLGPakPPAQQTGSekPSSEQPGPKALAQPPGvgktpaQQPGPAKPPTQQVGTPKPLAQQPGLQSp 430
Cdd:PRK14086   157 AWPRAADDYGWQQQRLGF--PPRAPYAS--PASYAPEQERDREPYD------AGRPEYDQRRRDYDHPRPDWDRPRRDR- 225
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  431 akapgpTKTPVQQPGPGKIPaqQAGPGKTSAQQTGPTKPPSQLPGpakPPPQQPGPAKPPPQQPGSAKP 499
Cdd:PRK14086   226 ------TDRPEPPPGAGHVH--RGGPGPPERDDAPVVPIRPSAPG---PLAAQPAPAPGPGEPTARLNP 283
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
314-502 1.97e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 47.68  E-value: 1.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  314 PPAQQPgheksqpgPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPL-GPAKPPAQQTGSEK-PSSEQPGPKAL 391
Cdd:PRK12727    70 APAPQA--------PTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALrQPVSVPRQAPAAAPvRAASIPSPAAQ 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  392 AQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPgLQSPAKAPGPTKTPVQQPG-PGKIPAQQAGPGKTSAQQTGPTKPP 470
Cdd:PRK12727   142 ALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAP-VPRAPVQAPVVAAPAPVPAiAAALAAHAAYAQDDDEQLDDDGFDL 220
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 2462613745  471 SQLPGPAKPPPQQPGPAKPPPQQP-----GSAKPPPQ 502
Cdd:PRK12727   221 DDALPQILPPAALPPIVVAPAAPAalaavAAAAPAPQ 257
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
747-1032 2.01e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.92  E-value: 2.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  747 APVADDKPKQPKMVkPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPLKTdSAKPSQSFPPTGEKVSPFDSKAIPRPAS 826
Cdd:PRK07003   367 APGGGVPARVAGAV-PAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA-AAAATRAEAPPAAPAPPATADRGDDAAD 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  827 DSKIISHPGPSSESKGQKQVDPVQkkEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQqsPKPQEQSRRFS 906
Cdd:PRK07003   445 GDAPVPAKANARASADSRCDERDA--QPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPD--ARAPAAASRED 520
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  907 LNLGSITDAPKSQPTTPQETVTGKLFGfGASifsQASNLISTAGQpgpHSQSGPGAPMKQAPAPSQPPTSQGPPkstgqA 986
Cdd:PRK07003   521 APAAAAPPAPEARPPTPAAAAPAARAG-GAA---AALDVLRNAGM---RVSSDRGARAAAAAKPAAAPAAAPKP-----A 588
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  987 PPAPAKSIPVkkeTKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKD 1032
Cdd:PRK07003   589 APRVAVQVPT---PRARAATGDAPPNGAARAEQAAESRGAPPPWED 631
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
481-586 2.02e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.55  E-value: 2.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  481 PQQPGPAkpPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQqpgsakPSAQQPSPAKPSAQQSTKPVSQ 560
Cdd:PRK07994   361 PAAPLPE--PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQA------PAVPLPETTSQLLAARQQLQRA 432
                           90       100
                   ....*....|....*....|....*.
gi 2462613745  561 TGSGKPLQPPTVSPSAKQPPSQGLPK 586
Cdd:PRK07994   433 QGATKAKKSEPAAASRARPVNSALER 458
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
436-557 2.07e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.75  E-value: 2.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  436 PTKTPVQQPGPGkipAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPA---KPPPQQPG----SAKPPPQQPGSTK 508
Cdd:PRK14959   363 PRLMPVESLRPS---GGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAagmTPSSAAPAtpapSAAPSPRVPWDDA 439
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  509 PP--PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKP 557
Cdd:PRK14959   440 PPapPRSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTP 490
PHA03160 PHA03160
hypothetical protein; Provisional
326-405 2.08e-04

hypothetical protein; Provisional


Pssm-ID: 165431  Cd Length: 499  Bit Score: 47.39  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  326 PGPAKPPAQPSGLTKPLAQQPGTVK-PPVQPPGTTKPPAQPLGPAKPPAQQTgSEKPSSEQPGPKAlAQPPGVGKTPAQQ 404
Cdd:PHA03160   406 PKNDHHLLPPLACSQQLPMQPLHVQqAPMQAPHVAPPPMQPPHVQQPRVLPS-TDGASNEAPKPSA-QEPVHIDASFAQD 483

                   .
gi 2462613745  405 P 405
Cdd:PHA03160   484 P 484
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1211-1554 2.09e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 47.73  E-value: 2.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1211 KKPLPEE-KKL----IPEEEKIRSEEKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLL-KSQVQIAEEKLEgrvapKT 1284
Cdd:PTZ00108  1033 KKDLVKElKKLgyvrFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEELGAAVSYDYLLsMPIWSLTKEKVE-----KL 1107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1285 VQEGKQPQTKMEGLPSGTPQSLPKED-DKTTKTIKEQPQPpctakpDQVEPGKEKTEKEDDKSDTSSSQQPKSPQGLSDT 1363
Cdd:PTZ00108  1108 NAELEKKEKELEKLKNTTPKDMWLEDlDKFEEALEEQEEV------EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKK 1181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1364 G--------YSSDGISSSLGEIPSLIPTDEKDILKGLKKDSFSQESSPSSPSDLAKLESTVLSILEAQASTLADEKSEKK 1435
Cdd:PTZ00108  1182 KkssadkskKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFS 1261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1436 TQPHEVSPEQPKDQEKTQSLSE-------TLEITISEEEIKESQEERKDTFKKDSQQDIPSSKDHKEKSEFVDDITTRRE 1508
Cdd:PTZ00108  1262 SDDLSKEGKPKNAPKRVSAVQYsppppskRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRV 1341
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 1509 PYdsveesseSENSPVPQRKRRTSVGSSSSDE--------YKQEDSQGSGEEED 1554
Cdd:PTZ00108  1342 KQ--------ASASQSSRLLRRPRKKKSDSSSeddddsevDDSEDEDDEDDEDD 1387
PDZ2_PDZD7-like cd10834
PDZ domain 2 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related ...
4497-4580 2.11e-04

PDZ domain 2 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of the long isoform 1 of PDZD7, and related domains. PDZD7 is critical for the organization of Usher syndrome type 2 (USH2) complex. Usher syndrome is the leading cause of hereditary sensory deaf-blindness in humans; USH2 is the most common sub-type. Formation of the USH2 complex is based upon heterodimerization between PDZD7 and whirlin (another PDZ domain-containing protein) and a subsequent dynamic interplay between USH2 proteins via their multiple PDZ domains. The PDZD7 PDZ2 domain binds GPR98 (also known as VLGR1) and usherin (USH2A). PDZD7 and whirlin form heterodimers through their multiple PDZ domains; whirlin and PDZD7 interact with usherin and GPR98 to form an interdependent ankle link complex. PDZD7 also interacts with myosin VIIa. PDZD7 also forms homodimers through its PDZ2 domain. Various isoforms of PDZD7 produced by alternative splicing have been identified; this subgroup includes the second PDZ domain of the canonical isoform of PDZD7- isoform 1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD7-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467270 [Multi-domain]  Cd Length: 85  Bit Score: 42.76  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4497 RIKITRDSKDHTVsgngLGIRIVGGKEIPghsgeIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd10834      2 RIVHLYTTSDDYC----LGFNIRGGSEYG-----LGIYVSKVDPGGLAEQNG-IKVGDQILAVNGVSFEDITHSKAVEVL 71

                   ....
gi 2462613745 4577 SQQS 4580
Cdd:cd10834     72 KSQT 75
PHA03291 PHA03291
envelope glycoprotein I; Provisional
394-516 2.26e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 47.26  E-value: 2.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  394 PPGVGKTPAQQPgPAKPPTQQvgTPKPLAQQPGLqsPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQL 473
Cdd:PHA03291   160 PLGLAAFPAEGT-LAAPPLGE--GSADGSCDPAL--PLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTT 234
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 2462613745  474 PGPAKPPPQQPGPAKpPPQQPGSAKPPPQQPGSTKPPPQQPGP 516
Cdd:PHA03291   235 IPAPSTTIAAPQAGT-TPEAEGTPAPPTPGGGEAPPANATPAP 276
PDZ1_Dlg1-2-4-like cd06723
PDZ domain 1 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
4511-4550 2.34e-04

PDZ domain 1 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Dlg1, human Dlg1,2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197 or SAP-97), Dlg2 (also known as channel-associated protein of synapse-110, postsynaptic density protein 93, or PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95, synapse-associated protein 90, or SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling. It regulates surface expression of NMDA receptors in dorsal horn neurons of the spinal cord, and it also interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467206 [Multi-domain]  Cd Length: 89  Bit Score: 42.69  E-value: 2.34e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2462613745 4511 GNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAEQTGKL 4550
Cdd:cd06723     10 NSGLGFSIAGGTDNPHIGDDPSIYITKIIPGGAAAADGRL 49
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
915-1061 2.36e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.55  E-value: 2.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  915 APKSQPTTPQETVTgklfgfgASIFSQASNLISTAGQPGPHSQSGP---GAPMKQAPAPSQPPTSQ----GPPKSTGQAP 987
Cdd:PRK07994   363 APLPEPEVPPQSAA-------PAASAQATAAPTAAVAPPQAPAVPPppaSAPQQAPAVPLPETTSQllaaRQQLQRAQGA 435
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  988 PAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKK---PPPIKDSKSLTAEPQKAVLPTKLEKSPKPESTCPL 1061
Cdd:PRK07994   436 TKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAkkeAYRWKATNPVEVKKEPVATPKALKKALEHEKTPEL 512
PDZ7_PDZD2-PDZ4_hPro-IL-16-like cd06763
PDZ domain 7 of PDZ domain containing 2 (PDZD2), PDZ domain 4 of human pro-interleukin-16 ...
4513-4576 2.42e-04

PDZ domain 7 of PDZ domain containing 2 (PDZD2), PDZ domain 4 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 7 of PDZD2, also known as KIAA0300, PIN-1, PAPIN, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family include the PDZ domain of the secreted mature form of human interleukin-16 (IL-16); this is the fourth PDZ domain (PDZ4) of human pro-interleukin-16 (isoform 1, also known as nPro-Il-16). Precursor IL-16 is cleaved to produce pro-IL-16 and C-terminal mature IL-16. Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ7 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467244 [Multi-domain]  Cd Length: 86  Bit Score: 42.60  E-value: 2.42e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 4513 GLGIRIVGGKEIPghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06763     12 GLGFSLEGGKGSP--LGDRPLTIKRIFKGGAAEQSGVLQVGDEILQINGTSLQGLTRFEAWNII 73
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
444-524 2.48e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.58  E-value: 2.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  444 PGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPgPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQ 523
Cdd:PRK12270    40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAP-AAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118

                   .
gi 2462613745  524 P 524
Cdd:PRK12270   119 P 119
Med25_SD1 pfam11235
Mediator complex subunit 25 synapsin 1; The overall function of the full-length Med25 is ...
418-548 2.55e-04

Mediator complex subunit 25 synapsin 1; The overall function of the full-length Med25 is efficiently to coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties, the N-terminal, VWA, domain, this SD1 - synapsin 1 - domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645 and a C-terminal NR box-containing domain (646-650) from 646-747. This The function of the SD domains is unclear.


Pssm-ID: 463244 [Multi-domain]  Cd Length: 157  Bit Score: 44.77  E-value: 2.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  418 PKPL-AQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKP----PSQLPG------PAKPPPQQPGP 486
Cdd:pfam11235    9 PGPLqSKQPVPLPPAAPSGATLSAAPQQPLPPVPPQYQVPGNLSAAQVAAQNAveaaKNQKAGlgprfsPITPLQQAAPG 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745  487 AKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGS-------AKPSAQQPSPAK 548
Cdd:pfam11235   89 VGPPFSQAPAPQLPPGPPGAPKPVPPASQPSLVSTVAPGSGLAPTAQPGApsmagtvAPGGVSGPSPAQ 157
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
716-1072 2.68e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.07  E-value: 2.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  716 PSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPK--------MVKPTTDLVSSSSATTKPDIPSSKVQSQA 787
Cdd:NF033839   159 PETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKaklavatyMSKILDDIQKHHLQKEKHRQIVALIKELD 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  788 EEKTTPPLKTDSAKPSQSFPPTGEKVSPfDSKAIPRPASDSKIISHPGPsseskgqkqvdPVQKKEEPKKAQTKMSPKPD 867
Cdd:NF033839   239 ELKKQALSEIDNVNTKVEIENTVHKIFA-DMDAVVTKFKKGLTQDTPKE-----------PGNKKPSAPKPGMQPSPQPE 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  868 AKPmPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVTGKLFGFGASIfsqasnlis 947
Cdd:NF033839   307 KKE-VKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEV--------- 376
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  948 tagQPGPHSQSGPGAPMKQAPAPSQPPTsqgPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKP 1027
Cdd:NF033839   377 ---KPQPETPKPEVKPQPEKPKPEVKPQ---PEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPK 450
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745 1028 PPIKdsksltAEPQKAvlptKLEKSPKPESTCPLCKTELNIGSKD 1072
Cdd:NF033839   451 PEVK------PQPETP----KPEVKPQPEKPKPEVKPQPEKPKPD 485
PDZ2_GRIP1-2-like cd06681
PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
4495-4583 2.70e-04

PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467169 [Multi-domain]  Cd Length: 89  Bit Score: 42.61  E-value: 2.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4495 HARIKITRDskdhtvsGNGLGIRIVGGKeipgHSGEIGAY---IAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEE 4571
Cdd:cd06681      2 TVEVTLEKE-------GNSFGFVIRGGA----HEDRNKSRpltVTHVRPGGPADREGTIKPGDRLLSVDGISLHGATHAE 70
                           90
                   ....*....|..
gi 2462613745 4572 VQSIISQQSGEA 4583
Cdd:cd06681     71 AMSILKQCGQEA 82
PDZ1_harmonin cd06737
PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic ...
4510-4580 2.73e-04

PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of harmonin isoforms a, b, and c, and related domains. Harmonin (also known as Usher Type 1C, PDZ-73 and AIE-75) is a key organizer of the Usher (USH) protein interactome. USH syndrome is the leading cause of hereditary sensory deaf-blindness in humans; three clinically distinct types of USH have been identified, type 1 to 3. The gene encoding harmonin (USH1C) is the causative gene for the USH type 1C phenotype. There are at least 10 alternatively spliced isoforms of harmonin, which are divided into three subclasses (a, b, and c). All isoforms contain the first two PDZ domains and the first coiled-coil domain. The a and b isoforms all have a third PDZ domain. The different PDZ domains are responsible for interactions with all known Usher syndrome type 1 proteins, and most Usher syndrome type 2 proteins. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This harmonin family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467219 [Multi-domain]  Cd Length: 85  Bit Score: 42.63  E-value: 2.73e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745 4510 SGNGLGIRIVGGKEipgHSgeIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQS 4580
Cdd:cd06737     11 GPESLGFSVRGGLE---HG--CGLFVSHVSPGSQADNKG-LRVGDEIVRINGYSISQCTHEEVINLIKTKK 75
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
316-502 2.77e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 44.64  E-value: 2.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  316 AQQPGHEKSQPGpakppaQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPP 395
Cdd:pfam15240   16 AQSSSEDVSQED------SPSLISEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPP 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  396 gvgktpaQQPGPAKPPTQQVGTPKPLAQQPglQSPAKAPGPTKTPVQQPGPGKIPAQQAGPgktsaqqtgptkPPsqlpg 475
Cdd:pfam15240   90 -------PQGGPRPPPGKPQGPPPQGGNQQ--QGPPPPGKPQGPPPQGGGPPPQGGNQQGP------------PP----- 143
                          170       180
                   ....*....|....*....|....*..
gi 2462613745  476 pakPPPQQPGPAKPPPQQPGSAKPPPQ 502
Cdd:pfam15240  144 ---PPPGNPQGPPQRPPQPGNPQGPPQ 167
PDZ3_FL-whirlin-like cd06742
PDZ domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of ...
4514-4578 2.82e-04

PDZ domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of whirlin, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of the full-length isoform of whirlin, PDZ domain 1 of the short isoform of whirlin, and related domains. Whirlin is an essential protein for developmental pathways in photoreceptor cells of the retina and hair cells of the inner ear. The full-length whirlin isoform has two harmonin N-like domains, three PDZ domains, a proline-rich region, and a PDZ-binding motif. Whirlin isoforms may form different complexes at the periciliary membrane complex (PMC) in photoreceptors, and the stereociliary tip and base in inner ear hair cells. It interacts with ADGRV1 and usherin at the PMC; with SANS and RpgrORF15 at the connecting cilium in photoreceptors; with EPS8, MYO15A, p55, and CASK proteins at the stereociliary tip of inner ear hair cells; and with ADGRV1, usherin, and PDZD7 at the stereociliary base in inner ear hair cells. Mutations in the gene encoding whirlin (WHRN; also known as USH2D and DFNB31), have been found to cause either USH2 subtype (USH2D) or autosomal recessive non-syndromic deafness type 31 (DFNB31). Whirlin is the key protein in the USH2 complex (whirlin, usherin and GPR98) which recruits other USH2 causative proteins at the periciliary membrane in photoreceptors and the ankle link of the stereocilia in hair cells. Whirlin's interaction with espin, another stereociliary protein, may be important for the architecture of the USH2 complex. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This whirlin family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467224 [Multi-domain]  Cd Length: 91  Bit Score: 42.73  E-value: 2.82e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745 4514 LGIRIVGGkeipGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06742     13 LGIAIEGG----ANTKQPLPRVINIQRGGSAHNCGGLKVGHVILEVNGTSLRGLEHREAARLIAE 73
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
446-587 3.05e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 46.98  E-value: 3.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGKtsaqqtGPTKPPSqlpGPAKPPPQQPGPAKPPPqqPGSAKPPPQQPGSTKPP----PQQPGPAK-PS 520
Cdd:PRK14959   363 PRLMPVESLRPSG------GGASAPS---GSAAEGPASGGAATIPT--PGTQGPQGTAPAAGMTPssaaPATPAPSAaPS 431
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPsaqQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKT 587
Cdd:PRK14959   432 PRVPWDDAPPAPPRSGIPP---RPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHTPSGPRT 495
Androgen_recep pfam02166
Androgen receptor;
482-589 3.06e-04

Androgen receptor;


Pssm-ID: 426632 [Multi-domain]  Cd Length: 501  Bit Score: 46.84  E-value: 3.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  482 QQPGPAKPppQQPGSAKPP----PQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKp 557
Cdd:pfam02166   30 QNPGPRHP--EAAGGAAPPgarlQHQQQQQQQVPQQPQQQESSPRQPQASVQPQQAGDDGSPPAHNRGPAGYLALEDDE- 106
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462613745  558 vsqtgsgkplQPptvSPSAKQPPSQGLPKTIC 589
Cdd:pfam02166  107 ----------QP---QPSQAQPAAECCPENGC 125
Totivirus_coat pfam05518
Totivirus coat protein;
365-523 3.10e-04

Totivirus coat protein;


Pssm-ID: 428505  Cd Length: 727  Bit Score: 47.06  E-value: 3.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  365 PLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGK-TPAQQPGPAKPPTQQVGTPKPlaqqpGLQSPAKAPGPTktpvqQ 443
Cdd:pfam05518  589 PTGLASGASNAEDPEVRRARTRGARALAQARTFGRaTVGEMIISGFPPVFKTALPRP-----DYNRGGEAGGPG-----V 658
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  444 PGPGkipaQQAGPGKTsaqqtgpTKPPSQLPG-PAKPPPQ-QPGPAKPPPQQPGSAKPPPqqPGSTKPPPQQPGPAKPSP 521
Cdd:pfam05518  659 PGPV----PVGMEAHT-------VRPSRVARGdPVRPTAHhAALRAPQAPRGPSSLIPSP--TAPPEPEPPGAEQADRAE 725

                   ..
gi 2462613745  522 QQ 523
Cdd:pfam05518  726 NQ 727
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
724-1058 3.28e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.22  E-value: 3.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  724 PEADSLSKPAPPKEPSVPSEQdKAPVADDKPKQPKMVKPTTDLVSSSSATT----KPDIPSSKVQSQAEEKTTPPLKTds 799
Cdd:pfam05109  432 PTLNTTGFAAPNTTTGLPSST-HVPTNLTAPASTGPTVSTADVTSPTPAGTtsgaSPVTPSPSPRDNGTESKAPDMTS-- 508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  800 akpsqsfpPTGEKVSPFDSKAIPRPAsdskiISHPGPSSESkgqkqvdPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPP 879
Cdd:pfam05109  509 --------PTSAVTTPTPNATSPTPA-----VTTPTPNATS-------PTLGKTSPTSAVTTPTPNATSPTPAVTTPTPN 568
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  880 GPRPTAGQTVPTPQ-QSPKPQE----------QSRRFSLNLGSITDAPKSqpTTPQETVTGKLFGFGASIFSQASNLIST 948
Cdd:pfam05109  569 ATIPTLGKTSPTSAvTTPTPNAtsptvgetspQANTTNHTLGGTSSTPVV--TSPPKNATSAVTTGQHNITSSSTSSMSL 646
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  949 AgqpgPHSQSGPGAPMKQAPAPSQPP--TSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETE-- 1024
Cdd:pfam05109  647 R----PSSISETLSPSTSDNSTSHMPllTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKpg 722
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 2462613745 1025 -----KKPPPIKDSKSLTAEPQKAVLPTKLEKSPKPEST 1058
Cdd:pfam05109  723 evnvtKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANST 761
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
354-522 3.30e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 3.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  354 QPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKA 433
Cdd:PHA03307   759 SNPSLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARP 838
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  434 PGPTKTPvqqpgpgkiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQ 513
Cdd:PHA03307   839 PGAAARP---------PPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAP 909

                   ....*....
gi 2462613745  514 PGPAKPSPQ 522
Cdd:PHA03307   910 RVKLGPMPP 918
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
482-536 3.53e-04

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 45.40  E-value: 3.53e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  482 QQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGstkPPSQQPGS 536
Cdd:COG3416     94 QRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQPQYGQPA---AGPSGGGG 145
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
470-573 3.57e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 46.72  E-value: 3.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  470 PSQLPGPAKPPP-QQPGPAKPPPQQP-----GSAKPPPQQPGStkpPPQQPGPAKPSPQQpgsTKPPSQQPGSAKPSaQQ 543
Cdd:TIGR01628  388 GSPMGGAMGQPPyYGQGPQQQFNGQPlgwprMSMMPTPMGPGG---PLRPNGLAPMNAVR---APSRNAQNAAQKPP-MQ 460
                           90       100       110
                   ....*....|....*....|....*....|
gi 2462613745  544 PSPAKPSAQQstKPVSQtgsGKPLQPPTVS 573
Cdd:TIGR01628  461 PVMYPPNYQS--LPLSQ---DLPQPQSTAS 485
Androgen_recep pfam02166
Androgen receptor;
379-520 3.73e-04

Androgen receptor;


Pssm-ID: 426632 [Multi-domain]  Cd Length: 501  Bit Score: 46.46  E-value: 3.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  379 EKPSSEQPGPKALAQPPGVGKTPAQQpgpakpptQQVGTPKPLAQQpglQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGK 458
Cdd:pfam02166   30 QNPGPRHPEAAGGAAPPGARLQHQQQ--------QQQQVPQQPQQQ---ESSPRQPQASVQPQQAGDDGSPPAHNRGPAG 98
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  459 TSAQQTGPTKPPSQLPGPAKPPPQQpgPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPS 520
Cdd:pfam02166   99 YLALEDDEQPQPSQAQPAAECCPEN--GCVPEPGAAAAAGKGLPQQAVAPAAPDDDDSAAPS 158
PDZ4_GRIP1-2-like cd06686
PDZ domain 4 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
4512-4581 3.74e-04

PDZ domain 4 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467174 [Multi-domain]  Cd Length: 99  Bit Score: 42.72  E-value: 3.74e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 4512 NGLGIRIVGGkeipGHSGEIGAY---IAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06686     18 KGFGIQLQGG----VFATETLSSpplISFIEPDSPAERCGVLQVGDRVLSINGIPTEDRTLEEANQLLRDSAS 86
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
350-523 3.79e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 3.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  350 KPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPgvgktPAQQPGPAKPPTQQVGTPKPLAQQPGLQS 429
Cdd:PRK07994   362 AAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAP-----QQAPAVPLPETTSQLLAARQQLQRAQGAT 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKTPVQQPGPgkiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKppPQQPGPAKPPPQ-QPGSAKPPPQQPgstK 508
Cdd:PRK07994   437 KAKKSEPAAASRARPVN---SALERLASVRPAPSALEKAPAKKEAYRWK--ATNPVEVKKEPVaTPKALKKALEHE---K 508
                          170
                   ....*....|....*
gi 2462613745  509 PPPQQPGPAKPSPQQ 523
Cdd:PRK07994   509 TPELAAKLAAEAIER 523
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
484-561 3.91e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.81  E-value: 3.91e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  484 PGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:PRK12270    40 STAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
388-543 3.99e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 3.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  388 PKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPlaQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQ-QTGP 466
Cdd:PRK07994   361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAP--PQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQgATKA 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPSQLPGPAKP--PPQQPGPAKPPPQQPGSAKPP--------PQQPGSTKPPPQ-QPGPAKPSPQQPgstKPPSQQPG 535
Cdd:PRK07994   439 KKSEPAAASRARPvnSALERLASVRPAPSALEKAPAkkeayrwkATNPVEVKKEPVaTPKALKKALEHE---KTPELAAK 515

                   ....*...
gi 2462613745  536 SAKPSAQQ 543
Cdd:PRK07994   516 LAAEAIER 523
PRK11901 PRK11901
hypothetical protein; Reviewed
455-582 4.01e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 45.83  E-value: 4.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  455 GPGKTSAQQTGPTKPP---------SQLPGPAKPPPQQPGPAKPPPQQPGSAkpPPQQPGSTKPPPQQPGPAKPSPQQPg 525
Cdd:PRK11901    60 SPTEHESQQSSNNAGAeknidlsgsSSLSSGNQSSPSAANNTSDGHDASGVK--NTAPPQDISAPPISPTPTQAAPPQT- 136
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  526 stkPPSQQ----PGSAkPSA--QQPSPAKPSAQQSTKPvsqtGSGKPLQPPTVSPSAKQPPSQ 582
Cdd:PRK11901   137 ---PNGQQrielPGNI-SDAlsQQQGQVNAASQNAQGN----TSTLPTAPATVAPSKGAKVPA 191
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
437-532 4.10e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 46.73  E-value: 4.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  437 TKTPVQQPGPGKIPAQQAGPGKTsAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPqqPGSAKPPPQQPgstKPPPQQPGP 516
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTP-APSTRPKAAAAANIPPKEPVRETATPPPVPP--RPVAPPVPHTP---ESAPKLTRA 434
                           90
                   ....*....|....*.
gi 2462613745  517 AKPSPQQPGSTKPPSQ 532
Cdd:PRK14950   435 AIPVDEKPKYTPPAPP 450
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
495-559 4.12e-04

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 46.44  E-value: 4.12e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  495 GSAKPPPQQPGstkPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVS 559
Cdd:PRK01297    10 GKGEAEQPAPA---PPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPAS 71
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
508-591 4.78e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.81  E-value: 4.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  508 KPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKT 587
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                   ....
gi 2462613745  588 ICPL 591
Cdd:PRK12270   117 VTPL 120
FimV COG3170
Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];
309-550 4.80e-04

Type IV pilus assembly protein FimV [Cell motility, Extracellular structures];


Pssm-ID: 442403 [Multi-domain]  Cd Length: 508  Bit Score: 46.33  E-value: 4.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEkSQPGPAKPPAQPSGLTKPLAQQPGTVKP---PVQPPGT-TKPPAQPLGPAKPPAQQTGSEK---- 380
Cdd:COG3170    106 PPAYAAAAAAPAAA-PAPAPAAPAAAAAAADQPAAEAAPAASGeyyPVRPGDTlWSIAARPVRPSSGVSLDQMMVAlyra 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  381 -PSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKT 459
Cdd:COG3170    185 nPDAFIDGNINRLKAGAVLRVPAAEEVAALSPAEARQEVQAQSADWAAYRARLAAAVEPAPAAAAPAAPPAAAAAAGPVP 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  460 SAQQTG--PTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSA 537
Cdd:COG3170    265 AAAEDTlsPEVTAAAAAEEADALPEAAAELAERLAALEAQLAELQRLLALKNPAPAAAVSAPAAAAAAATVEAAAPAAAA 344
                          250
                   ....*....|...
gi 2462613745  538 KPSAQQPSPAKPS 550
Cdd:COG3170    345 QPAAAAPAPALDN 357
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
309-390 4.85e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 46.34  E-value: 4.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  309 PTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQ-QPGTVKPPVQPP---GTTKPPAQPLGPAKPPAQQTGSEKPSSE 384
Cdd:PRK14950   366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEpVRETATPPPVPPrpvAPPVPHTPESAPKLTRAAIPVDEKPKYT 445

                   ....*.
gi 2462613745  385 QPGPKA 390
Cdd:PRK14950   446 PPAPPK 451
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
650-898 5.05e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 46.46  E-value: 5.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  650 LAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPL-----VKQPTLHGSPSAKAKQPP 724
Cdd:PLN03209   320 LAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAyedlkPPTSPIPTPPSSSPASSK 399
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  725 EADSLSKPAPPKEPSVPSEQDKAPVADDKPKQPKMVKPTtdlvssSSATTKPDI-PSSKVQSQAEEKTTPPLKTDSAKPS 803
Cdd:PLN03209   400 SVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL------SPYARYEDLkPPTSPSPTAPTGVSPSVSSTSSVPA 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  804 --QSFPPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPkKAQTKMSPKPDAKPMPKGSPTPPGP 881
Cdd:PLN03209   474 vpDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTN-EVVKVGNSAPPTALADEQHHAQPKP 552
                          250       260
                   ....*....|....*....|
gi 2462613745  882 RPTAGQTV---PTPQQSPKP 898
Cdd:PLN03209   553 RPLSPYTMyedLKPPTSPTP 572
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
308-453 5.41e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.40  E-value: 5.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  308 QPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPG 387
Cdd:PRK07994   365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPA 444
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  388 PKALAQPpgvgKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQ 453
Cdd:PRK07994   445 AASRARP----VNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEH 506
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
461-549 5.89e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 45.89  E-value: 5.89e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  461 AQQTGPTKPPSQLPGPAKPPPQQPGP--AKPPPQQPGSAKPPPQQPgstKPPPQQPgPAKPSPQQPGSTKPPSQQPGSAK 538
Cdd:PRK14965   376 ALERGAPAPPSAAWGAPTPAAPAAPPpaAAPPVPPAAPARPAAARP---APAPAPP-AAAAPPARSADPAAAASAGDRWR 451
                           90
                   ....*....|.
gi 2462613745  539 PSAQQPSPAKP 549
Cdd:PRK14965   452 AFVAFVKGKKP 462
PHA03269 PHA03269
envelope glycoprotein C; Provisional
437-568 5.98e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 45.87  E-value: 5.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  437 TKTPVQQPGPGKIPAQQAGPGKTSAqqTGPTKPPSQLPGPAKPPP----QQPGPAKPPPQQPGSAKPPPQQPGSTKPPpq 512
Cdd:PHA03269    33 TSAATQKPDPAPAPHQAASRAPDPA--VAPTSAASRKPDLAQAPTpaasEKFDPAPAPHQAASRAPDPAVAPQLAAAP-- 108
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  513 QPGPAKPSPQQPGSTKPPSQQPGSAkpSAQQPSPAKpSAQQSTKPVSQTGSGKPLQ 568
Cdd:PHA03269   109 KPDAAEAFTSAAQAHEAPADAGTSA--ASKKPDPAA-HTQHSPPPFAYTRSMEHIA 161
PDZ_PDLIM-like cd06753
PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density ...
4515-4582 5.99e-04

PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of PDZ-LIM family proteins including PDLIM1-7, and related domains. PDZ-LIM family proteins (also known as Zasp PDZ domain proteins) are involved in the rearrangement of the actin cytoskeleton; they mediate association with the cytoskeleton through alpha-actinin as well as with other proteins involved in signal transduction pathways. Members of this family include PDLIM1 (also known as C-terminal LIM domain protein 1, elfin, LIM domain protein CLP-36), PDLIM2 (also known as PDZ-LIM protein mystique), PDLIM3 (also known as actinin-associated LIM protein, alpha-actinin-2-associated LIM protein, ALP), PDLIM4 (also known as LIM protein RIL, Reversion-induced LIM protein), PDLIM5 (also known as enigma homolog, ENH, enigma-like PDZ and LIM domains protein), PDLIM6 (also known as LIM domain-binding protein 3, ZASP, Cypher, Oracle), and PDLIM7 (also known as PDZ and LIM domain protein 7, LIM mineralization protein, LMP; protein enigma). PDLIM1 has been shown to negatively regulate NF-kappaB-mediated signaling in the cytoplasm. PDLIM7 negatively regulates p53 through binding murine double minute 2 (MDM2). The PDZ domains of PDZ-LIM family proteins PDLIM1, 2, 3, 5, 6, 7 have been shown to bind actin. Other PDZ-LIM family PDZ domain binding partners include thyroid receptor interacting protein-6 (PDLIM4-PDZ), the LIM domain of PDLIM4 (PDLIM4-PDZ), tropomyosin (PDLIM7-PDZ), myotilin and calsarcin 1 (PDLIM6-PDZ), and proteins from the myotilin and FATZ (calsarcin/myozenin) families (PDLIM1, 3, 4, 6 PDZ domains). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDLIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467235 [Multi-domain]  Cd Length: 79  Bit Score: 41.36  E-value: 5.99e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4515 GIRIVGGKE--IPghsgeigAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSGE 4582
Cdd:cd06753     11 GFRLQGGKDfnQP-------LTISRVTPGGKAAQAN-LRPGDVILAINGESTEGMTHLEAQNKIKAATGS 72
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
447-578 6.05e-04

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 45.56  E-value: 6.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  447 GKIPAQQAGP--GKTSAQqtgPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQP 524
Cdd:PRK12373   172 GKGPVVKPGPqiGRYASE---PAGGLTSLTEEAGKARYNASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPVPPSEAAR 248
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  525 GSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQ 578
Cdd:PRK12373   249 PKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEAAKAAAAA 302
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
397-488 6.06e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 45.96  E-value: 6.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  397 VGKTPAQQPGPAKPPTQQVGTPKPLAQQPgLQSPAKAPGPTKTPVQQPG-PGKIPAQQAGPGKTSAQQTGPTKPPSQLPG 475
Cdd:PRK14950   360 LVPVPAPQPAKPTAAAPSPVRPTPAPSTR-PKAAAAANIPPKEPVRETAtPPPVPPRPVAPPVPHTPESAPKLTRAAIPV 438
                           90
                   ....*....|...
gi 2462613745  476 PAKPPPQQPGPAK 488
Cdd:PRK14950   439 DEKPKYTPPAPPK 451
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
342-468 6.44e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 6.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  342 LAQQPGTVKPPVQP-----PGTTKPPAQPLGPAKP-PAQQTGSEKPSSEQPGPKALAQPPGVGKT-PAQQPGPAKPPTQQ 414
Cdd:PRK14971   362 LTQKGDDASGGRGPkqhikPVFTQPAAAPQPSAAAaASPSPSQSSAAAQPSAPQSATQPAGTPPTvSVDPPAAVPVNPPS 441
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  415 VGTPK-PLAQQPGLQSPAKAPGPTKTP-VQQPgpgKIPAQQAGPGKTSAQQTGPTK 468
Cdd:PRK14971   442 TAPQAvRPAQFKEEKKIPVSKVSSLGPsTLRP---IQEKAEQATGNIKEAPTGTQK 494
motB PRK12799
flagellar motor protein MotB; Reviewed
447-561 6.45e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 45.86  E-value: 6.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  447 GKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQ---PGPAKPSPQQ 523
Cdd:PRK12799   297 GTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTvalPAAEPVNMQP 376
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2462613745  524 PGSTKPPSQQP--GSAKPSAQQPSPAKPSAQQSTKPVSQT 561
Cdd:PRK12799   377 QPMSTTETQQSstGNITSTANGPTTSLPAAPASNIPVSPT 416
PDZ2-PDZRN4-like cd06716
PDZ domain 2 of PDZ domain-containing RING finger protein 4 (PDZRN4), PDZRN3-B, and related ...
4530-4580 6.95e-04

PDZ domain 2 of PDZ domain-containing RING finger protein 4 (PDZRN4), PDZRN3-B, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PDZRN4, PDZRN3-B, and related domains. PDZRN4 (also known as ligand of numb protein X 4, and SEMACAP3-like protein) contains an N-terminal RING domain and two tandem repeat PDZ domains. It is involved in the progression of cancer, including human liver cancer and breast cancer, and may contribute to the tumorigenesis of rectal adenocarcinoma. Danio rerio PDZRN3-B may participate in neurogenesis: the first PDZ domain of Danio rerio Pdzrn3 interacts with Kidins220 (Kinase D-interacting substrate 220 kD, also named Ankyrin Repeat-Rich Membrane Spanning), a crucial mediator of signal transduction in neural tissues. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZRN4-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467200 [Multi-domain]  Cd Length: 88  Bit Score: 41.49  E-value: 6.95e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745 4530 EIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKtyEEVQSIISQQS 4580
Cdd:cd06716     30 DTGIYVSEVDPNSIAAKDGRIREGDQILQINGVDVQNR--EEAIALLSEEE 78
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
328-581 6.95e-04

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 45.83  E-value: 6.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  328 PAKPPAQPSGLTKPLAQQPGTVKPPV-QPPGTTKPPAQPLGPAKPPAQQTGSEKPS-SEQPGPKAL--AQPPGVGKTPAQ 403
Cdd:pfam03546  184 PAATQAKPSGKILQVRPASGPAKGAApAPPQKAGPVATQVKAERSKEDSESSEESSdSEEEAPAAAtpAQAKPALKTPQT 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  404 Q----------PGPAKPPTQQVGTPKP-----LAQQPGLQSPAKAPGPTKTPVQQ-------------PGPGKIPAQQAG 455
Cdd:pfam03546  264 KasprkgtpitPTSAKVPPVRVGTPAPwkagtVTSPACASSPAVARGAQRPEEDSssseeseseeetaPAAAVGQAKSVG 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 PGKTSAQQTGPTKPPSQlPGPAKPPPQQPGPAKPPPQqpgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSqqpg 535
Cdd:pfam03546  344 KGLQGKAASAPTKGPSG-QGTAPVPPGKTGPAVAQVK----AEAQEDSESSEEESDSEEAAATPAQVKASGKTPQA---- 414
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 2462613745  536 SAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPS 581
Cdd:pfam03546  415 KANPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKPPARTPQN 460
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
863-1061 7.17e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 7.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  863 SPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEqsrrfslnlgsitdAPKSQPTTPQETVTGKlfgfgASIFSQA 942
Cdd:PRK12323   372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA--------------PAAAAAARAVAAAPAR-----RSPAPEA 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  943 SNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKkeTKAPAAEKLEPKAEQAPTVKRTE 1022
Cdd:PRK12323   433 LAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAA--APAPADDDPPPWEELPPEFASPA 510
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 2462613745 1023 -TEKKPPPIKDSKSLTAEPQKAVLPTKLEKSPKPESTCPL 1061
Cdd:PRK12323   511 pAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPA 550
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
415-581 7.71e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 7.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  415 VGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPpsqlpgpaKPPPQQPGPAKPPPqqP 494
Cdd:PHA03307   772 LALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTP--------DGGSESSGPARPPG--A 841
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  495 GSAKPPPQQPGSTKPPPQQPGPAKP--SPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQStkpvsqtgSGKPLQPPTV 572
Cdd:PHA03307   842 AARPPPARSSESSKSKPAAAGGRARgkNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAP--------RPRPAPRVKL 913

                   ....*....
gi 2462613745  573 SPSAKQPPS 581
Cdd:PHA03307   914 GPMPPGGPD 922
PHA03247 PHA03247
large tegument protein UL36; Provisional
308-505 8.34e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 8.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  308 QPTPGKPPAQQPGHeksqpgPAKPPAQPSGL-TKPLAQQPGTVKPPVQPPgtTKPPAQ---------------------- 364
Cdd:PHA03247   273 RGATGPPPPPEAAA------PNGAAAPPDGVwGAALAGAPLALPAPPDPP--PPAPAGdaeeeddedgamevvsplprpr 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  365 ---PLGPAK-------PPAQQ---TGSEKPSSEQPGPKA-----------LAQPPGVGKTPAQQ-PGPAKPPTqqvgtpk 419
Cdd:PHA03247   345 qhyPLGFPKrrrptwtPPSSLedlSAGRHHPKRASLPTRkrrsarhaatpFARGPGGDDQTRPAaPVPASVPT------- 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  420 plaqqpglqspakaPGPTKTPVQQPGPGKIPAQQAGPGKTSaqqtGPTKPPSQLPGPAKPPPQQPGPAKPPPQ--QPGSA 497
Cdd:PHA03247   418 --------------PAPTPVPASAPPPPATPLPSAEPGSDD----GPAPPPERQPPAPATEPAPDDPDDATRKalDALRE 479

                   ....*...
gi 2462613745  498 KPPPQQPG 505
Cdd:PHA03247   480 RRPPEPPG 487
PDZ3_MUPP1-like cd06791
PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
4511-4572 8.74e-04

PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467253 [Multi-domain]  Cd Length: 89  Bit Score: 41.06  E-value: 8.74e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745 4511 GNGLGIRIVG--GKeipGHSGEI-GAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06791     11 EQGLGITIAGyvGE---KASGELsGIFVKSIIPGSAADQDGRIQVNDQIIAVDGVNLQGFTNQEA 72
PHA03247 PHA03247
large tegument protein UL36; Provisional
386-581 8.84e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 8.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQP-GLQSPAKAPGPTKTPVQQPGPgkiPAQQAGPGKTSAQQT 464
Cdd:PHA03247   255 PAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPdGVWGAALAGAPLALPAPPDPP---PPAPAGDAEEEDDED 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  465 GPTKPPSQLPGP-AKPP---PQQPGPAKPPPQQ----PGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPG---STKPPSQQ 533
Cdd:PHA03247   332 GAMEVVSPLPRPrQHYPlgfPKRRRPTWTPPSSledlSAGRHHPKRASLPTRKRRSARHAATPFARGPGgddQTRPAAPV 411
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462613745  534 PGSAKPSAQQPSPA-KPSAQQSTKPVSQTGSGK-PLQPPTVSPSAKQPPS 581
Cdd:PHA03247   412 PASVPTPAPTPVPAsAPPPPATPLPSAEPGSDDgPAPPPERQPPAPATEP 461
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
863-1032 8.93e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 45.63  E-value: 8.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  863 SPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQsrrfslnlgsiTDAPKSQPTTPQETVTGKLfgfgasifSQA 942
Cdd:PRK07994   365 LPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPP-----------ASAPQQAPAVPLPETTSQL--------LAA 425
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  943 SNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTE 1022
Cdd:PRK07994   426 RQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALE 505
                          170
                   ....*....|
gi 2462613745 1023 TEKKPPPIKD 1032
Cdd:PRK07994   506 HEKTPELAAK 515
PHA02682 PHA02682
ORF080 virion core protein; Provisional
348-519 9.03e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 44.47  E-value: 9.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  348 TVKPPVQPPgttkPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPgVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGL 427
Cdd:PHA02682    33 TIPAPAAPC----PPDADVDPLDKYSVKEAGRYYQSRLKANSACMQRP-SGQSPLAPSPACAAPAPACPACAPAAPAPAV 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  428 QSPAKAPG-PTKTPVQQPGPGKIPAqqagpgktsAQQTGPTKPPSQLPGPAKPPPQQPGPAkpPPQQPG---SAKPPPQQ 503
Cdd:PHA02682   108 TCPAPAPAcPPATAPTCPPPAVCPA---------PARPAPACPPSTRQCPPAPPLPTPKPA--PAAKPIflhNQLPPPDY 176
                          170
                   ....*....|....*.
gi 2462613745  504 PGSTKPPPQQPGPAKP 519
Cdd:PHA02682   177 PAASCPTIETAPAASP 192
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
244-437 9.64e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 9.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  244 PEKIKSQPPGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPpiqqPTPGKPPAQQPghek 323
Cdd:PRK12323   397 PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAA----PAAAARPAAAG---- 468
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  324 SQPGPAKPPAQPSGLTKPLAQQPGTVKPPV---QPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPgvGKT 400
Cdd:PRK12323   469 PRPVAAAAAAAPARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAP--APA 546
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 2462613745  401 PAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPT 437
Cdd:PRK12323   547 AAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
713-896 1.04e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 45.49  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  713 HGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQD---------------------------KAPVADDKPKQPKMVKPTtd 765
Cdd:PRK14949   577 VQSAQSAAEAQPSSQSLSPISAVTTAAASLADDdildavlaardsllsdldalspkegdgKKSSADRKPKTPPSRAPP-- 654
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  766 lVSSSSATTKPDiPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKVSPfDSKAIPRPASDSKIISHPGPSSESKGQKQ 845
Cdd:PRK14949   655 -ASLSKPASSPD-ASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPP-VPDPYDRPPWEEAPEVASANDGPNNAAEG 731
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  846 VDPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRPTAGQTVPTPQQSP 896
Cdd:PRK14949   732 NLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTE 782
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
651-1047 1.06e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  651 APVPSSPQPKLKTAPVTTTsavsKSSPQPQQTSPKK-DAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQ----PPE 725
Cdd:NF033839   161 TPQPENPEHQKPTTPAPDT----KPSPQPEGKKPSVpDINQEKEKAKLAVATYMSKILDDIQKHHLQKEKHRQivalIKE 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  726 ADSLSKPAPPKEPSV-----PSEQDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTtPPLKTDSA 800
Cdd:NF033839   237 LDELKKQALSEIDNVntkveIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEK-KEVKPEPE 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  801 KPSQSFPPTGEKVSPfdsKAIPRPAsdskiishpGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDaKPMPKGSPTPPG 880
Cdd:NF033839   316 TPKPEVKPQLEKPKP---EVKPQPE---------KPKPEVKPQLETPKPEVKPQPEKPKPEVKPQPE-KPKPEVKPQPET 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  881 PRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAP--KSQPTTPQETVtgklfgfgasifsqasnlistagQPGPHSQS 958
Cdd:NF033839   383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPevKPQPEKPKPEV-----------------------KPQPEKPK 439
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  959 GPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPIKDSKSLTA 1038
Cdd:NF033839   440 PEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKPSTPNNLSKDKQPSNQASTNEKAT 519

                   ....*....
gi 2462613745 1039 EPQKAVLPT 1047
Cdd:NF033839   520 NKPKKSLPS 528
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
660-1055 1.06e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  660 KLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKapepkkppplvkqptlhgSPSAKAKQPPEADSLSKPAPPKePS 739
Cdd:NF033839   145 KDSSSSSSSGSSTKPETPQPENPEHQKPTTPAPDTKP------------------SPQPEGKKPSVPDINQEKEKAK-LA 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  740 VPSEQDKapVADDKPKQPKMVKPTTDLVSsssatTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKVSPFDSK 819
Cdd:NF033839   206 VATYMSK--ILDDIQKHHLQKEKHRQIVA-----LIKELDELKKQALSEIDNVNTKVEIENTVHKIFADMDAVVTKFKKG 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  820 AIPRPAS--DSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKpdaKPMPKGSPTPPGPRPTAGQTVPTPQQSPK 897
Cdd:NF033839   279 LTQDTPKepGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLE---KPKPEVKPQPEKPKPEVKPQLETPKPEVK 355
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  898 PQEQSRRFSLNLGSITDAP--KSQPTTPQETVTGKLFGFGASIfsqasnlistagQPGPHSQSGPGAPMKQAPAPSQPPT 975
Cdd:NF033839   356 PQPEKPKPEVKPQPEKPKPevKPQPETPKPEVKPQPEKPKPEV------------KPQPEKPKPEVKPQPEKPKPEVKPQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  976 SQGPPKSTGQAPPAPAKSIPVKKETKAPA--AEKLEPKAEQAPTVKRTETEKKPPPIK---DSKSLTAEPQKAVLPTKLE 1050
Cdd:NF033839   424 PEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQPETPKPEVKPQPEKPKPEVKPQPEKpkpDNSKPQADDKKPSTPNNLS 503

                   ....*
gi 2462613745 1051 KSPKP 1055
Cdd:NF033839   504 KDKQP 508
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
331-538 1.09e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 45.34  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  331 PPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPA-- 408
Cdd:PRK14948   361 PSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSln 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  409 -KPPTQQV-------GTPKPLAQQPGL----------------------------QSPAKAPG-PTKTPVQQPGPGkipa 451
Cdd:PRK14948   441 lEELWQQIlaklelpSTRMLLSQQAELvsldsnraviavspnwlgmvqsrkplleQAFAKVLGrSIKLNLESQSGS---- 516
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  452 qQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPgsTKPPPQQPGPAKPSPQQPGSTKPPS 531
Cdd:PRK14948   517 -ASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPA--QIPADSSPPPPIPEEPTPSPTKDSS 593

                   ....*..
gi 2462613745  532 QQPGSAK 538
Cdd:PRK14948   594 PEEIDKA 600
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
226-489 1.12e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.45  E-value: 1.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  226 GRDPLQQDGTPKSISSQQPEKiKSQPPGTGKPIQGPTQTPQTDHAKLPLQ-RDASRPQTKQADIvrgesvkpslpsPSKP 304
Cdd:PTZ00449   539 ESDEPKEGGKPGETKEGEVGK-KPGPAKEHKPSKIPTLSKKPEFPKDPKHpKDPEEPKKPKRPR------------SAQR 605
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  305 PIQQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKPLA----QQPGTVKPPvQPPGTTKPPAQP---------LGPAKP 371
Cdd:PTZ00449   606 PTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSperpEGPKIIKSP-KPPKSPKPPFDPkfkekfyddYLDAAA 684
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  372 PAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKP--PTQQVGTPKPLaQQPGLQSPAKAPGPT----------KT 439
Cdd:PTZ00449   685 KSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPklPRDEEFPFEPI-GDPDAEQPDDIEFFTppeeertffhET 763
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  440 PVQQPGPGkIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPG--PAKP 489
Cdd:PTZ00449   764 PADTPLPD-ILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGdhPSLP 814
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
475-530 1.15e-03

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 44.90  E-value: 1.15e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  475 GPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGST--KPPPQQPGPAKPSPQQPGSTKPP 530
Cdd:PRK01297    10 GKGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKaaAPAAAAPRAEKPKKDKPRRERKP 67
PHA03369 PHA03369
capsid maturational protease; Provisional
460-776 1.18e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 44.99  E-value: 1.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  460 SAQQTGPTKPPSQLPGPAKPPPQQPgPAKPPPQQPGSAkppPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKP 539
Cdd:PHA03369   350 TASLTAPSRVLAAAAKVAVIAAPQT-HTGPADRQRPQR---PDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPG 425
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  540 SAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTV-SPSAKQppsQGLPKTICPLCNTTELLLHVPEKANFNTCTECQTTVC 618
Cdd:PHA03369   426 TSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHpQEHGHE---RKRKRGGELKEELIETLKLVKKLKEEQESLAKELEAT 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  619 SLCGFNPNPHLTEVKewlclNCQMKRAlGGDLAPVPSSPQ--PKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDL-S 695
Cdd:PHA03369   503 AHKSEIKKIAESEFK-----NAGAKTA-AANIEPNCSADAaaPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFtS 576
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  696 KAPEPKKPPPLVKQPTLHG---SPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKaPVADDKPKQPKMVKPTTDlvsSSSA 772
Cdd:PHA03369   577 TTLAAAAGQGSDTAEALAGaieTLLTQASAQPAGLSLPAPAVPVNASTPASTPP-PLAPQEPPQPGTSAPSLE---TSLP 652

                   ....
gi 2462613745  773 TTKP 776
Cdd:PHA03369   653 QQKP 656
DUF4813 pfam16072
Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. ...
446-562 1.18e-03

Domain of unknown function (DUF4813); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 345 and 672 amino acids in length.


Pssm-ID: 435117 [Multi-domain]  Cd Length: 288  Bit Score: 44.36  E-value: 1.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGKTSAQQTGptKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGStkpppqqpgPAKPSPQQPG 525
Cdd:pfam16072  146 PGSVTTTSAGSGTTVINAGG--QQPAAPAAPAYPVAPAAYPAQAPAAAPAPAPGAPQTPLA---------PLNPVAAAPA 214
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 2462613745  526 STKPPSQQP--GSAKPSAQQPSPAKPSAQQSTKPVSQTG 562
Cdd:pfam16072  215 AAAGAAAAPvvAAAAPAAAAPPPPAPAAPPADAAPPAPG 253
PHA03291 PHA03291
envelope glycoprotein I; Provisional
418-536 1.22e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 44.56  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  418 PKPLAQQPGLQSPAKAPGpTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGP--AKPPPQQPGPAKPPPQQPG 495
Cdd:PHA03291   160 PLGLAAFPAEGTLAAPPL-GEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASpeTTPTPSTTTSPPSTTIPAP 238
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2462613745  496 SAKPPPQQPGSTKPPPQQPGPAKPSpqqPGSTKPPSQQPGS 536
Cdd:PHA03291   239 STTIAAPQAGTTPEAEGTPAPPTPG---GGEAPPANATPAP 276
HC2 pfam07382
Histone H1-like nucleoprotein HC2; This family contains the bacterial histone H1-like ...
323-487 1.22e-03

Histone H1-like nucleoprotein HC2; This family contains the bacterial histone H1-like nucleoprotein HC2 (approximately 200 residues long), which seems to be found mostly in Chlamydia. HC2 functions in DNA condensation, although it has been suggested that it also has other roles.


Pssm-ID: 369339 [Multi-domain]  Cd Length: 187  Bit Score: 43.23  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  323 KSQPGPAKPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKppaqQTGSEKPSSEQPGPKALAQPPGVGKTPA 402
Cdd:pfam07382    5 QKKRSSKKTAAKKAAVRKPAAKKAAAKKTVVRKVAAKKPAARKTVAKK----TVAAKKPAAKKAAKKAVAKKVVAKKPVA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  403 QQPGPAKPPTQQVGTPKPLAQQ-PGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPP 481
Cdd:pfam07382   81 KKAVAKKATAKKVAAKKVVAKKtVAKKAAAKKPAAKKAVAKKAVARKPAAKKAVAKKAASTCHKNHKHTAACKRVASSSA 160

                   ....*.
gi 2462613745  482 QQPGPA 487
Cdd:pfam07382  161 TRAACG 166
PRK10263 PRK10263
DNA translocase FtsK; Provisional
651-1054 1.23e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 1.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  651 APVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSA------------ 718
Cdd:PRK10263   363 VPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQqpyyapapeqpv 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  719 --KAKQPPEADSLSKPAPPKEPSVPSEQdkaPVADDKPKQPKMVKPTTDLVSSSSAT--TKPDIPSSKVQSQAEEKTTPP 794
Cdd:PRK10263   443 agNAWQAEEQQSTFAPQSTYQTEQTYQQ---PAAQEPLYQQPQPVEQQPVVEPEPVVeeTKPARPPLYYFEEVEEKRARE 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  795 lKTDSAKPSQSFP-PTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVdPVQKKEEPKKAQTKMSPKPDAKPMP- 872
Cdd:PRK10263   520 -REQLAAWYQPIPePVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKA-TLATGAAATVAAPVFSLANSGGPRPq 597
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  873 -KGSPTPPGPRP------------TAGQTVPTPQQSPKPQEQSRRFSLNLGSIT----------DAPKSQPTTPQETVTG 929
Cdd:PRK10263   598 vKEGIGPQLPRPkrirvptrrelaSYGIKLPSQRAAEEKAREAQRNQYDSGDQYnddeidamqqDELARQFAQTQQQRYG 677
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  930 KLFGFGASIFSQASNLIS--------TAGQPGPHSQSGPG------------APMKQA--PAPSQPPTSQG-PPKSTGQA 986
Cdd:PRK10263   678 EQYQHDVPVNAEDADAAAeaelarqfAQTQQQRYSGEQPAganpfslddfefSPMKALldDGPHEPLFTPIvEPVQQPQQ 757
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  987 PPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPikdsKSLTAEPQKAVLPTKLEKSPK 1054
Cdd:PRK10263   758 PVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQ 821
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
495-549 1.27e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 43.86  E-value: 1.27e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  495 GSAKPPPQQPgstkpPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:COG3416     92 GGQRPPPAPQ-----PSQPGPQQQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPS 141
PTZ00144 PTZ00144
dihydrolipoamide succinyltransferase; Provisional
946-1028 1.35e-03

dihydrolipoamide succinyltransferase; Provisional


Pssm-ID: 240289 [Multi-domain]  Cd Length: 418  Bit Score: 44.67  E-value: 1.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  946 ISTAGQPGPHsqsgpgaPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEK 1025
Cdd:PTZ00144   118 IDTGGAPPAA-------APAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPRET 190

                   ...
gi 2462613745 1026 KPP 1028
Cdd:PTZ00144   191 RVP 193
Androgen_recep pfam02166
Androgen receptor;
456-591 1.39e-03

Androgen receptor;


Pssm-ID: 426632 [Multi-domain]  Cd Length: 501  Bit Score: 44.92  E-value: 1.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  456 PGKTSAQQTGPTKPP----SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPS 531
Cdd:pfam02166   32 PGPRHPEAAGGAAPPgarlQHQQQQQQQVPQQPQQQESSPRQPQASVQPQQAGDDGSPPAHNRGPAGYLALEDDEQPQPS 111
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  532 QqpgsakpsAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTICPL 591
Cdd:pfam02166  112 Q--------AQPAAECCPENGCVPEPGAAAAAGKGLPQQAVAPAAPDDDDSAAPSTLSLL 163
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
252-555 1.41e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 44.84  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  252 PGTGKPIQGPTQTPQTDHAKLPLQRDASRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPGPAKP 331
Cdd:COG3266     53 LLAGLLLLLIRLLSEAVDLGALASAALLLALASLALLGILLLALLALLLDLLLLADLLRAAALLLLKLLLLLLTLLLLVL 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  332 PAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTP-------AQQ 404
Cdd:COG3266    133 LLLLALLLALLLDLPLLTLLIVLPLLEEQLLLLALQDIQGTLQALGAVAALLGLRKAEEALALRAGSAAAdalalllLLL 212
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  405 PGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQP 484
Cdd:COG3266    213 ASALGEAVAAAAELAALALLAAGAAEVLTARLVLLLLIIGSALKAPSQASSASAPATTSLGEQQEVSLPPAVAAQPAAAA 292
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  485 GPAKPPPQQPgsAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQqpsPAKPSAQQST 555
Cdd:COG3266    293 AAQPSAVALP--AAPAAAAAAAAPAEAAAPQPTAAKPVVTETAAPAAPAPEAAAAAAA---PAAPAVAKKL 358
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
422-565 1.43e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.56  E-value: 1.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  422 AQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAG-PGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQ----PGS 496
Cdd:PTZ00436   208 AAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAaPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKaaapPAK 287
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745  497 AKPPPQQpGSTKPPPQQPGPAKPSPQQPGSTKPPSQqpGSAKPSAQQPSPAK---PSAQQSTKPVSQTGSGK 565
Cdd:PTZ00436   288 AAAPPAK-AAAAPAKAAAAPAKAAAAPAKAAAPPAK--AAAPPAKAATPPAKaaaPPAKAAAAPVGKKAGGK 356
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
417-528 1.43e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.80  E-value: 1.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  417 TPKPLAQQPGLQSPAKAPgptktpvQQPGPGkipaqQAGPGKTSAQQTGPTKPPSqlPGPAKPPPQQPGPAKPPPqqpgs 496
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSP-------VRPTPA-----PSTRPKAAAAANIPPKEPV--RETATPPPVPPRPVAPPV----- 421
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462613745  497 akPPPQQPGSTKPPPQQPGPAKPSPQQPGSTK 528
Cdd:PRK14950   422 --PHTPESAPKLTRAAIPVDEKPKYTPPAPPK 451
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1154-1558 1.52e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.62  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1154 VKLVKKQeQEVKTE---AEKVILEKVKETLSMEKippmvttdQKQEESKLEKDKASALQEKKPLPEEKKLIPEEEKiRSE 1230
Cdd:NF033838    87 VALNKKL-SDIKTEylyELNVLKEKSEAELTSKT--------KKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEK-KAK 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1231 EKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEEKLEGRV--APKTVQEGKQPQTKMEGLPsgTPQSLPK 1308
Cdd:NF033838   157 DQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELVKEEAKEPRDEEKIkqAKAKVESKKAEATRLEKIK--TDREKAE 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1309 EDDKTTKTIKEQpqppctakpDQVEPGKEKTEKEDDKSDT--SSSQQPKSPQGLSDTGYSSDgisSSLGE----IPSLIP 1382
Cdd:NF033838   235 EEAKRRADAKLK---------EAVEKNVATSEQDKPKRRAkrGVLGEPATPDKKENDAKSSD---SSVGEetlpSPSLKP 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1383 tdEKDILKGLKK-DSFSQESSPSSPSDLAKLESTVLSILEAQastLADEKSEKKTQPHEVSPE---QPKDQEKTQSLSET 1458
Cdd:NF033838   303 --EKKVAEAEKKvEEAKKKAKDQKEEDRRNYPTNTYKTLELE---IAESDVKVKEAELELVKEeakEPRNEEKIKQAKAK 377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1459 LEitiseeeIKESQEERKDTFKKDSQQdipSSKDHKEKSEFVDDIttrrepydSVEESSESENSPVPQRKRRTSVGSSSS 1538
Cdd:NF033838   378 VE-------SKKAEATRLEKIKTDRKK---AEEEAKRKAAEEDKV--------KEKPAEQPQPAPAPQPEKPAPKPEKPA 439
                          410       420
                   ....*....|....*....|
gi 2462613745 1539 DEYKQEDSQGSGEEEDFIRK 1558
Cdd:NF033838   440 EQPKAEKPADQQAEEDYARR 459
PRK10927 PRK10927
cell division protein FtsN;
368-553 1.52e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 44.29  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  368 PAKPPAQQTGSEKPSSEQPGPKALAQPPGVG--KTPAQQPGPAKPPTQQVGTPkpLAQQPglQSPAKAPGPTKTPVQQpg 445
Cdd:PRK10927    76 PPKPEERWRYIKELESRQPGVRAPTEPSAGGevKTPEQLTPEQRQLLEQMQAD--MRQQP--TQLVEVPWNEQTPEQR-- 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 pgkipaQQAGPGKTSAQQTGPTKPPSQlpgpakpPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPgPAKPSPQQPG 525
Cdd:PRK10927   150 ------QQTLQRQRQAQQLAEQQRLAQ-------QSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQ-PYQDLLQTPA 215
                          170       180
                   ....*....|....*....|....*...
gi 2462613745  526 STKPPSQQPGSAkPSAQQPSPAKPSAQQ 553
Cdd:PRK10927   216 HTTAQSKPQQAA-PVTRAADAPKPTAEK 242
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
471-519 1.57e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 43.47  E-value: 1.57e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2462613745  471 SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKP 519
Cdd:COG3416     93 GQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPS 141
PTZ00395 PTZ00395
Sec24-related protein; Provisional
451-587 1.61e-03

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 45.07  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  451 AQQAGPGKTSAQ--QTGPTKPPSQLPGPAKPPPQQPGPAKPP-PQQPGSAKPPPQQPGSTKP--------PPQQPGPAKP 519
Cdd:PTZ00395   397 AAQSNAAQSNAGfsNAGYSNPGNSNPGYNNAPNSNTPYNNPPnSNTPYSNPPNSNPPYSNLPysntpysnAPLSNAPPSS 476
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  520 SPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQstkpvSQTGSGKPLQPPTVSPSAKQPPSQGLPKT 587
Cdd:PTZ00395   477 AKDHHSAYHAAYQHRAANQPAANLPTANQPAANN-----FHGAAGNSVGNPFASRPFGSAPYGGNAAT 539
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
421-588 1.62e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 44.57  E-value: 1.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  421 LAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPG---KTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQP--- 494
Cdd:PRK14948   360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKtkqAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPpsl 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  495 ----------GSAKPPP--------------------------------------------------------QQPGSTK 508
Cdd:PRK14948   440 nleelwqqilAKLELPStrmllsqqaelvsldsnraviavspnwlgmvqsrkplleqafakvlgrsiklnlesQSGSASN 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  509 PPPQQPGPAKPSPQQPgsTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQPPSQGLPKTI 588
Cdd:PRK14948   520 TAKTPPPPQKSPPPPA--PTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSPTKDSSPEEI 597
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
460-525 1.66e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 43.47  E-value: 1.66e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  460 SAQQTGPTKPPSQL----PGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPG 525
Cdd:COG3416     73 QLQQQQPQSSGGFLsglfGGGQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPSG 142
PHA03247 PHA03247
large tegument protein UL36; Provisional
446-603 1.90e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 1.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGKTSAQQTGPTKPPsqlpgpAKPPPQQP-GPAKPPPQQPGSA--KPPPQQPGSTKPPPQQPGPAK---- 518
Cdd:PHA03247   255 PAPPPVVGEGADRAPETARGATGPP------PPPEAAAPnGAAAPPDGVWGAAlaGAPLALPAPPDPPPPAPAGDAeeed 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  519 ------------PSPQQ-----------PGSTKPPSQQPGSA----KPSAQQPSPAKPSAQQSTKPVSQtGSGKPLQPPT 571
Cdd:PHA03247   329 dedgamevvsplPRPRQhyplgfpkrrrPTWTPPSSLEDLSAgrhhPKRASLPTRKRRSARHAATPFAR-GPGGDDQTRP 407
                          170       180       190
                   ....*....|....*....|....*....|..
gi 2462613745  572 VSPSAKQPPSQGLPKTICPLCNTTELLLHVPE 603
Cdd:PHA03247   408 AAPVPASVPTPAPTPVPASAPPPPATPLPSAE 439
PDZ3_PDZD7-like cd06751
PDZ domain 3 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related ...
4514-4578 1.96e-03

PDZ domain 3 of the canonical isoform 1 of PDZ domain containing 7 (PDZD7), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of the long isoform 1 of PDZD7, and related domains. PDZD7 is critical for the organization of the Usher syndrome type 2 (USH2) complex. Usher syndrome is the leading cause of hereditary sensory deaf-blindness in humans; USH2 is the most common sub-type. Formation of the USH2 complex is based upon heterodimerization between PDZD7 and whirlin (another PDZ domain-containing protein) and a subsequent dynamic interplay between USH2 proteins via their multiple PDZ domains. The PDZD7 PDZ2 domain binds GPR98 (also known as VLGR1) and usherin (USH2A). PDZD7 and whirlin form heterodimers through their multiple PDZ domains; whirlin and PDZD7 interact with usherin and GPR98 to form an interdependent ankle link complex. PDZD7 also interacts with myosin VIIa and can also form homodimers through its PDZ2 domain. Various isoforms of PDZD7 produced by alternative splicing have been identified; this subgroup includes the third PDZ domain of the canonical isoform of PDZD7- isoform 1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD7-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467233 [Multi-domain]  Cd Length: 89  Bit Score: 40.11  E-value: 1.96e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745 4514 LGIRIVGGKEipgHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06751     13 LGISISGGIE---SKVQPVVKIEKIFPGGAAALSGNLKAGYELVSVDGESLQQVTHQQAVDIIRR 74
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
322-409 2.00e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.50  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  322 EKSQPGPAKPPAQPSGLTKPLAQQPgtvKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTP 401
Cdd:PRK12270    39 GSTAAPTAAAAAAAAAASAPAAAPA---AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
                           90
                   ....*....|
gi 2462613745  402 AQQP--GPAK 409
Cdd:PRK12270   116 EVTPlrGAAA 125
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
495-557 2.07e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 44.35  E-value: 2.07e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  495 GSAKPPPQQPGSTKPPPQQPGP--AKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKP 557
Cdd:PRK14965   380 GAPAPPSAAWGAPTPAAPAAPPpaAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAA 444
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
430-509 2.22e-03

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 43.38  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  430 PAKAPGPTKTPVQQPGPGKIPAQQAGPGktsaqqtgPTKPPSQLPGP-AKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTK 508
Cdd:pfam07174   41 PEPAPPPPSTATAPPAPPPPPPAPAAPA--------PPPPPAAPNAPnAPPPPADPNAPPPPPADPNAPPPPAVDPNAPE 112

                   .
gi 2462613745  509 P 509
Cdd:pfam07174  113 P 113
TYA pfam01021
Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a ...
425-577 2.26e-03

Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a fusion protein of TYA and TYB. The TYA protein is analogous to the gag protein of retroviruses. TYA a is cleaved to form 46kd protein which can form mature virion like particles. This entry corresponds to the capsid protein from Ty1 and Ty2 transposons.


Pssm-ID: 425992  Cd Length: 384  Bit Score: 43.79  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  425 PGLQSPAKAPGpTKTPVQQPGPGKIPAQQAGPGKTSAQqtgPTKPPSQLPGPAKPPPQQPGPAK-----PPPQQPGSAKP 499
Cdd:pfam01021   11 HTNQDPLDVSA-SKLQEYDKDSTKANSQQTTTPGSSAV---PENHHHASPQPASVPPPQNGPYSqqcmmTPNQANPSGWP 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  500 PPQQPGSTKPPPQQ-------PGPAKPSPQQPGSTKPPSqqpgsakpSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTV 572
Cdd:pfam01021   87 FYGHPSMMPYTPYQmspmyfpPGPQSQFPQYPSSVGTPL--------STPSPESGNTFTDSSSAKSDMTSTNKYVRPPPI 158

                   ....*
gi 2462613745  573 SPSAK 577
Cdd:pfam01021  159 LTSPN 163
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1146-1290 2.28e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 43.64  E-value: 2.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1146 QKTAVPPQVKLVKKQEQEVKTEaekvILEKVKEtlsmekippmvttdQKQEESKLEKDKASALQEKKPLPEEKKLIPEEE 1225
Cdd:PRK09510    70 QQKSAKRAEEQRKKKEQQQAEE----LQQKQAA--------------EQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1226 KIRSEEKKPLLEE--KKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEE---KLEGRVAPKTVQEGKQ 1290
Cdd:PRK09510   132 KQAEEAAAKAAAAakAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEakkKAEAEAAAKAAAEAKK 201
PDZ1_PTPN13-like cd23072
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related ...
4513-4578 2.37e-03

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467285 [Multi-domain]  Cd Length: 92  Bit Score: 40.17  E-value: 2.37e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4513 GLGIRIVGGkEIPGHSgEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTY-----------EEVQSIISQ 4578
Cdd:cd23072     14 GLGFQIVGG-EKSGRL-DLGIFISSITPGGPADLDGRLKPGDRLISVNDVSLEGLSHdaaveilqnapEDVTLVVSQ 88
DUF2076 pfam09849
Uncharacterized protein conserved in bacteria (DUF2076); This domain, found in various ...
451-541 2.44e-03

Uncharacterized protein conserved in bacteria (DUF2076); This domain, found in various hypothetical prokaryotic proteins, has no known function. The domain, however, is found in various periplasmic ligand-binding sensor proteins.


Pssm-ID: 430876  Cd Length: 263  Bit Score: 43.19  E-value: 2.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  451 AQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPakPPPqQPGSAKPPPQQPGSTKPPPQQPGpaKPSPQQPGSTKPP 530
Cdd:pfam09849   69 AQLQQLQQQQQQAQAQPSSGGFLGGLFGGGSQSRPP--PPP-QARPAWPAGQAPGQPQPYPGQPG--YAQQGQPQYGQPA 143
                           90
                   ....*....|.
gi 2462613745  531 SQQPGSAKPSA 541
Cdd:pfam09849  144 QPPRGPWGPGG 154
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
470-560 2.48e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.03  E-value: 2.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  470 PSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPpqQPGPAKPSPQQP---GSTKPPSQQPGSAKPSAQQPSP 546
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEP--VRETATPPPVPPrpvAPPVPHTPESAPKLTRAAIPVD 439
                           90
                   ....*....|....
gi 2462613745  547 AKPSAQQSTKPVSQ 560
Cdd:PRK14950   440 EKPKYTPPAPPKEE 453
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
487-583 2.53e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.03  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  487 AKPPPQQPGSAKPPpqqpgstKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQ--STKPVSQTGSG 564
Cdd:PRK14950   361 VPVPAPQPAKPTAA-------APSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPvpHTPESAPKLTR 433
                           90
                   ....*....|....*....
gi 2462613745  565 KPLqPPTVSPSAKQPPSQG 583
Cdd:PRK14950   434 AAI-PVDEKPKYTPPAPPK 451
PHA03418 PHA03418
hypothetical E4 protein; Provisional
476-580 2.55e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 42.80  E-value: 2.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  476 PAKPPPQQPGPAKPPPQQPgsaKPPPQQPGSTKPPPQQPGPAKPS-PQQPGS--TKPPSQQPGSAKpSAQQPSPAKPSAQ 552
Cdd:PHA03418    34 PLLPAPHHPNPQEDPDKNP---SPPPDPPLTPRPPAQPNGHNKPPvTKQPGGegTEEDHQAPLAAD-ADDDPRPGKRSKA 109
                           90       100       110
                   ....*....|....*....|....*....|....
gi 2462613745  553 QSTKPVSQTGSGKPLQ------PPTVSPSakQPP 580
Cdd:PHA03418   110 DEHGPAPGRAALAPFKldldqdPLHGDPD--PPP 141
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1127-1363 2.60e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 43.99  E-value: 2.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1127 PAPSGPKASPMPVPTESSSQKTAVPPQVKLVKKQEQEVKTEAEKVILEKVKETlsmEKIPPMVTTDQKQEESKLEKDKAS 1206
Cdd:NF033839   284 PKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQP---EKPKPEVKPQLETPKPEVKPQPEK 360
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1207 ALQEKKPLPEEKK-LIPEEEKIRSEEKKPLLEEKKP--TPEDKKLLPEAKtsaPEEQKHdllKSQVQIAEEKLEGRVAPK 1283
Cdd:NF033839   361 PKPEVKPQPEKPKpEVKPQPETPKPEVKPQPEKPKPevKPQPEKPKPEVK---PQPEKP---KPEVKPQPEKPKPEVKPQ 434
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1284 TVQEGKQPQTKMEG-LPSGTPQslPKEDDKTTKTIKEQPQPPCTAKPDQVEPGKEKTEKEDDK-SDTSSSQQPKSPQGLS 1361
Cdd:NF033839   435 PEKPKPEVKPQPEKpKPEVKPQ--PETPKPEVKPQPEKPKPEVKPQPEKPKPDNSKPQADDKKpSTPNNLSKDKQPSNQA 512

                   ..
gi 2462613745 1362 DT 1363
Cdd:NF033839   513 ST 514
PDZ4_INAD-like cd23065
PDZ domain 4 of inactivation-no-after-potential D (INAD), and related domains; PDZ (PSD-95 ...
4498-4585 2.62e-03

PDZ domain 4 of inactivation-no-after-potential D (INAD), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of INAD, and related domains. INAD assembles key enzymes of the Drosophila compound eye photo-transduction pathway into a supramolecular complex, supporting efficient and fast light signaling. It contains 5 PDZ domains arranged in tandem (PDZ1-PDZ5) which independently bind various proteins. INAD PDZ2 binds eye-specific protein kinase C, INAD PDZ3 binds transient receptor potential (TRP) channel, and INAD PDZ4,5 tandem binds NORPA (phospholipase Cbeta, PLCbeta). Mutations of the inaD gene that lead to disruption of each of these interactions impair fly photo signal transduction. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This INAD-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467278 [Multi-domain]  Cd Length: 82  Bit Score: 39.81  E-value: 2.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 4498 IKITRDSKDhtvsgngLGIRIVGGKEipghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIS 4577
Cdd:cd23065      2 IELKTDKSP-------LGVSVVGGKN----HVTTGCIITHIYPNSIVAADKRLKVFDQILDINGTKVHVMTTLKVHQLFH 70

                   ....*...
gi 2462613745 4578 QQSGEAEI 4585
Cdd:cd23065     71 KTYEKAVT 78
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
447-510 2.70e-03

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 43.75  E-value: 2.70e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  447 GKIPAQQAGPGKTSAQQTGPTKPPsqlpgpAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPP 510
Cdd:PRK01297    10 GKGEAEQPAPAPPSPAAAPAPPPP------AKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKP 67
Neisseria_TspB pfam05616
Neisseria meningitidis TspB protein; This family consists of several Neisseria meningitidis ...
395-515 2.78e-03

Neisseria meningitidis TspB protein; This family consists of several Neisseria meningitidis TspB virulence factor proteins.


Pssm-ID: 283306 [Multi-domain]  Cd Length: 517  Bit Score: 43.93  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  395 PGVGKTPAQQPGPA---KPPTQQVGTPKPLAQQPGlqspAKAPGPTKTPVQQ-PGPGKIPAQQAGPGKTSAQQTGPTKPP 470
Cdd:pfam05616  277 PGYSEKVEVAPGTKvnmGPVTDRNGNPVQVAATFG----RDAQGNTTADVQViPRPDLTPASAEAPHAQPLPEVSPAENP 352
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745  471 SQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPG 515
Cdd:pfam05616  353 ANNPDPDENPGTRPNPEPDPDLNPDANPDTDGQPGTRPDSPAVPD 397
COG3416 COG3416
Uncharacterized conserved protein, DUF2076 domain [Function unknown];
471-535 2.88e-03

Uncharacterized conserved protein, DUF2076 domain [Function unknown];


Pssm-ID: 442642 [Multi-domain]  Cd Length: 237  Bit Score: 42.70  E-value: 2.88e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745  471 SQLPGPAKPPPQQPGP---------AKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPG 535
Cdd:COG3416     69 AQLAQLQQQQPQSSGGflsglfgggQRPPPAPQPSQPGPQQQPAPPSGPWGQAAPQQPGYGQPQYGQPAAGPSG 142
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
452-527 2.89e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 43.96  E-value: 2.89e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  452 QQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPgpakPPPQQPGSAKPPPQQPGSTKPPPQQPgpaKPSPQQPGST 527
Cdd:PRK14965   378 ERGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPP----AAPARPAAARPAPAPAPPAAAAPPAR---SADPAAAASA 446
DUF4106 pfam13388
Protein of unknown function (DUF4106); This family of proteins are found in large numbers in ...
436-524 2.94e-03

Protein of unknown function (DUF4106); This family of proteins are found in large numbers in the Trichomonas vaginalis proteome. The function of this protein is unknown.


Pssm-ID: 404296  Cd Length: 431  Bit Score: 43.73  E-value: 2.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  436 PTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTkppsqlpgpAKPPPQQP---GPAKPPPQQPGSAKPPPQQPGSTKPPPQ 512
Cdd:pfam13388  173 PPNPPREAPAPGLPKTFTSSHGHRHRHAPKPT---------VQNPAQQPtvqNPAQQPTQQPTVQNPAQQQNPAQQPPPQ 243
                           90
                   ....*....|....*
gi 2462613745  513 ---QPGPAKPSPQQP 524
Cdd:pfam13388  244 paqQPTVQNPAQQQP 258
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
425-502 2.99e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.11  E-value: 2.99e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745  425 PGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQ 502
Cdd:PRK12270    39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
848-1055 3.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 3.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  848 PVQKKEEPKKAQTKMSPKP-DAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTpqet 926
Cdd:PRK12323   381 PVAQPAPAAAAPAAAAPAPaAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAA---- 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  927 vtgklfgfgasifSQASNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTgQAPPAPAKSIPVKKETKAPAAE 1006
Cdd:PRK12323   457 -------------APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWE-ELPPEFASPAPAQPDAAPAGWV 522
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 1007 K---LEPKAEQAPTVKRTETEKKPPPIKDSKSLTAEPQKAVLPTKLEKSPKP 1055
Cdd:PRK12323   523 AesiPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLP 574
PDZ6_PDZD2-PDZ3_hPro-IL-16-like cd06762
PDZ domain 6 of PDZ domain containing 2 (PDZD2), PDZ domain 3 of human pro-interleukin-16 ...
4507-4578 3.23e-03

PDZ domain 6 of PDZ domain containing 2 (PDZD2), PDZ domain 3 of human pro-interleukin-16 (isoform 1, 1332 AA), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 6 of PDZD2, also known as KIAA0300, PIN-1, activated in prostate cancer (AIPC) and PDZ domain-containing protein 3 (PDZK3). PDZD2 has seven PDZ domains. PDZD2 is expressed at exceptionally high levels in the pancreas and certain cancer tissues, such as prostate cancer. It promotes the proliferation of insulinoma cells and is upregulated during prostate tumorigenesis. In osteosarcoma (OS), the microRNA miR-363 acts as a tumor suppressor by inhibiting PDZD2. This family also includes the third PDZ domain (PDZ3) of human pro-interleukin-16 (isoform 1, also known as nPro-IL-16). Precursor IL-16 is cleaved to produce pro-IL-16 and C-terminal mature IL-16. Pro-IL-16 functions as a regulator of T cell growth; mature IL-16 is a CD4 ligand that induces chemotaxis and CD25 expression in CD4+ T cells. IL-16 bioactivity has been closely associated with the progression of several different cancers. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD2-like family PDZ6 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467243 [Multi-domain]  Cd Length: 86  Bit Score: 39.55  E-value: 3.23e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4507 HTVSGNGLGIRIVGG-----KEIPGHsgeigayiaKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQ 4578
Cdd:cd06762      7 HKEEGSGLGFSLAGGsdlenKSITVH---------RVFPSGLAAQEGTIQKGDRILSINGKSLKGVTHGDALSVLKQ 74
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
647-879 3.35e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 3.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  647 GGDLAPVPSSPQPKLKTAPVTTTSAVSKSSPQPQQTSPKKDAAPkqdlskapepKKPPPLVKQPTLHGSPSAKAKQPPEA 726
Cdd:PRK12323   369 GGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAA----------AAAARAVAAAPARRSPAPEALAAARQ 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  727 DSLSKPAPPKEPsvPSEQDKAPVADDKPkqpkmvkPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTPPlktdsakPSQSF 806
Cdd:PRK12323   439 ASARGPGGAPAP--APAPAAAPAAAARP-------AAAGPRPVAAAAAAAPARAAPAAAPAPADDDPP-------PWEEL 502
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  807 PPTGEKVSPFDSKAIPRPAsDSKIISHPGPSseskgqkQVDPVQKKEEPKKAQTKMsPKPDAKPMPKGSPTPP 879
Cdd:PRK12323   503 PPEFASPAPAQPDAAPAGW-VAESIPDPATA-------DPDDAFETLAPAPAAAPA-PRAAAATEPVVAPRPP 566
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1134-1261 3.51e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 42.91  E-value: 3.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1134 ASPMPVPTESSSQKTAVPPQVKLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEESKLEKDKASAL-QEKK 1212
Cdd:TIGR02794   25 HSVKPEPGGGAEIIQAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAaEKAA 104
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462613745 1213 PLPEEKKLIPEEEKIRSEEKKP-LLEEKKPTPE---DKKLLPEAKTSAPEEQK 1261
Cdd:TIGR02794  105 KQAEQAAKQAEEKQKQAEEAKAkQAAEAKAKAEaeaERKAKEEAAKQAEEEAK 157
PRK10819 PRK10819
transport protein TonB; Provisional
469-585 3.57e-03

transport protein TonB; Provisional


Pssm-ID: 236768 [Multi-domain]  Cd Length: 246  Bit Score: 42.36  E-value: 3.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPGPAKPPPQQPgPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPA-KPSPQqPGSTKPPSQQPGSAKPSAQQPSPA 547
Cdd:PRK10819    55 PADLEPPQAVQPPPEP-VVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKpKPKPK-PVKKVEEQPKREVKPVEPRPASPF 132
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 2462613745  548 KPSAQQstKPVSQTGSGKPLQPPTVSPSAKQPPSQGLP 585
Cdd:PRK10819   133 ENTAPA--RPTSSTATAAASKPVTSVSSGPRALSRNQP 168
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
238-579 3.60e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 43.52  E-value: 3.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  238 SISSQQPEKIKSQPPGTGKPIQgPTQTPQTDHAKLPLQRdaSRPQTKQADIVRGESVKPSLPSPSKPPIQQPTPGKPPAQ 317
Cdd:pfam03546  178 SEGEAPPAATQAKPSGKILQVR-PASGPAKGAAPAPPQK--AGPVATQVKAERSKEDSESSEESSDSEEEAPAAATPAQA 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  318 QPGHEKSQP--GPAK-PPAQPSGlTKPLAQQPGTVKP----PVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQ----P 386
Cdd:pfam03546  255 KPALKTPQTkaSPRKgTPITPTS-AKVPPVRVGTPAPwkagTVTSPACASSPAVARGAQRPEEDSSSSEESESEEetapA 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPlaqqPGLQSPAKAPGPTKTPvQQPGPGKIPAQQAGPGKTSAQQTGP 466
Cdd:pfam03546  334 AAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVP----PGKTGPAVAQVKAEAQ-EDSESSEEESDSEEAAATPAQVKAS 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  467 TKPPsqlPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPgPAKPSPQQPGSTKPPSQQPGSAKPSA----Q 542
Cdd:pfam03546  409 GKTP---QAKANPAPTKASSAKGAASAPGKVVAAAAQAKQGSPAKVKP-PARTPQNSAISVRGQASVPAVGKAVAtaaqA 484
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 2462613745  543 QPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAKQP 579
Cdd:pfam03546  485 QKGPVGGPQEEDSESSEEESDSEEEAPAQAKPSGKTP 521
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
497-585 3.61e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 3.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  497 AKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQPGsAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSA 576
Cdd:PRK14950   361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPP-KEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVD 439

                   ....*....
gi 2462613745  577 KQPPSQGLP 585
Cdd:PRK14950   440 EKPKYTPPA 448
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
308-400 3.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 3.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  308 QPTPGKPPAQQPGHEKSQP---GPAKPPAQPSGLTKPLAQQPGTVKP----PVQPPGTTK---PPAQPLGPAKPPAQQTG 377
Cdd:PRK14971   385 QPAAAPQPSAAAAASPSPSqssAAAQPSAPQSATQPAGTPPTVSVDPpaavPVNPPSTAPqavRPAQFKEEKKIPVSKVS 464
                           90       100
                   ....*....|....*....|...
gi 2462613745  378 SEKPSSEQPGPKALAQPPGVGKT 400
Cdd:PRK14971   465 SLGPSTLRPIQEKAEQATGNIKE 487
CCDC47 pfam07946
PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of ...
3759-3822 3.68e-03

PAT complex subunit CCDC47; This family represents CCDC47 proteins which are a component of the PAT complex, an endoplasmic reticulum (ER)-resident membrane multiprotein complex that facilitates multi-pass membrane proteins insertion into membranes. The PAT complex, formed by CCDC47 and Asterix proteins, acts as an intramembrane chaperone by directly interacting with nascent transmembrane domains (TMDs), releasing its substrates upon correct folding, and is needed for optimal biogenesis of multi-pass membrane proteins. CCDC47 is required to maintain the stability of Asterix. CCDC47 is associated with various membrane-associated processes and is component of a ribosome-associated ER translocon complex involved in multi-pass membrane protein transport into the ER membrane and biogenesis. It is also involved in the regulation of calcium ion homeostasis in the ER, being also required for proper protein degradation via the ERAD (ER-associated degradation) pathway.


Pssm-ID: 462322  Cd Length: 323  Bit Score: 42.94  E-value: 3.68e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 3759 ARAKILQDIDRELDLVERESAKLRKKQAELDEEEKEIDaklrylEMGINRRKEALLKEREKRER 3822
Cdd:pfam07946  265 TREEEIEKIKKAAEEERAEEAQEKKEEAKKKEREEKLA------KLSPEEQRKYEEKERKKEQR 322
PHA02682 PHA02682
ORF080 virion core protein; Provisional
458-581 3.82e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.54  E-value: 3.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  458 KTSAQQTGPTKPPSQLPGPAKPPPQQPGPAkPPPQQPGSAKP-PPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSqqPGS 536
Cdd:PHA02682    75 RPSGQSPLAPSPACAAPAPACPACAPAAPA-PAVTCPAPAPAcPPATAPTCPPPAVCPAPARPAPACPPSTRQCP--PAP 151
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462613745  537 AKPSAQQPSPAKPSAQQSTKPvsqtgsgKPLQPPTVSPSAKQPPS 581
Cdd:PHA02682   152 PLPTPKPAPAAKPIFLHNQLP-------PPDYPAASCPTIETAPA 189
PDZ_PDZD11-like cd06752
PDZ domain of PDZ domain-containing protein 11, and related domains; PDZ (PSD-95 (Postsynaptic ...
4510-4576 3.90e-03

PDZ domain of PDZ domain-containing protein 11, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of PDZD11, and related domains. PDZD11 (also known as ATPase-interacting PDZ protein, plasma membrane calcium ATPase-interacting single-PDZ protein, PMCA-interacting single-PDZ protein, PISP) is involved in the dynamic assembly of apical junctions (AJs). It is recruited by PLEKHA7 to AJs to promote the efficient junctional recruitment and stabilization of nectins, and the efficient early phases of assembly of AJs in epithelial cells. The PDZD11 PDZ domain binds nectin-1 and nectin-3. PDZD11 also binds to a PDZ binding motif located in the C-terminal tail of the human sodium-dependent multivitamin transporter, to the cytoplasmic tail of the Menkes copper ATPase ATP7A, and to the cytoplasmic tail of all plasma membrane Ca2+-ATPase b-splice variants. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDZD11-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467234 [Multi-domain]  Cd Length: 83  Bit Score: 39.22  E-value: 3.90e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 4510 SGNGLGIRIVGGKEipghsGEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEVQSII 4576
Cdd:cd06752      9 PGEQLGFNIRGGKA-----SGLGIFISKVIPDSDAHRLG-LKEGDQILSVNGVDFEDIEHSEAVKVL 69
PRK12757 PRK12757
cell division protein FtsN; Provisional
469-565 4.05e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 42.34  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPGPA--KPPPQQPGPAKPPPQQPGSAKPPPQQpgsTKPPPQQPGPAKPSPQQPGSTKPPSQQPgsAKPSAQQPSP 546
Cdd:PRK12757    98 QPTQLSEVPynEQTPQVPRSTVQIQQQAQQQQPPATT---AQPQPVTPPRQTTAPVQPQTPAPVRTQP--AAPVTQAVEA 172
                           90       100
                   ....*....|....*....|.
gi 2462613745  547 AKPSAQQST--KPVSQTGSGK 565
Cdd:PRK12757   173 PKVEAEKEKeqRWMVQCGSFK 193
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
651-1067 4.20e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 43.14  E-value: 4.20e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  651 APVPSSPQPKLKTAPVTTTSAVSKSSP-----------------QPQQTSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLH 713
Cdd:pfam03546   41 AKTPLQAKPSGKTPQVRAASAPAKESPrkgappvppgktgpaaaQAQAGKPEEDSESSSEESDSDGETPAAATLTTSPAQ 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  714 GSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVAddkpKQPKMVKPTTDLVSSSSATTKPDIPSSKVQSQAEEKTTP 793
Cdd:pfam03546  121 VKPLGKNSQVRPASTVGKGPSGKGANPAPPGKAGSAA----PLVQVGKKEEDSESSSEESDSEGEAPPAATQAKPSGKIL 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  794 PLKTDSAKPSQSFPPTGEKVSPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMPK 873
Cdd:pfam03546  197 QVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSEESSDSEEEAPAAATPAQAKPALKTPQTKASPRKGTPITPT 276
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  874 GSPTPP------GPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITDAPKSQPTTPQETVtgklfgfgasifsqasnlis 947
Cdd:pfam03546  277 SAKVPPvrvgtpAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAAAV-------------------- 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  948 tagqpGPHSQSGPGAPMKQAPAPSQPPTSQG----PPKSTGQAPP---APAKSIPVKKETKAPAAEKLEPKAEQAPTVKR 1020
Cdd:pfam03546  337 -----GQAKSVGKGLQGKAASAPTKGPSGQGtapvPPGKTGPAVAqvkAEAQEDSESSEEESDSEEAAATPAQVKASGKT 411
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 2462613745 1021 TETEKKPPPIKDS--KSLTAEPQKAVLPTKLEKSPKPESTCPLCKTELN 1067
Cdd:pfam03546  412 PQAKANPAPTKASsaKGAASAPGKVVAAAAQAKQGSPAKVKPPARTPQN 460
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
404-499 4.30e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 43.39  E-value: 4.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  404 QPGPAK-PPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAqqagpgktsaqqtgptkpPSQLPGPAKPPPQ 482
Cdd:PRK14954   381 APSPAGsPDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPT------------------PEQQPPVARSAPL 442
                           90
                   ....*....|....*..
gi 2462613745  483 QPGPAKPPPQQPGSAKP 499
Cdd:PRK14954   443 PPSPQASAPRNVASGKP 459
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
793-1064 4.57e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 4.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  793 PPLKTDSAKPSQSFPPTGEKVSPFDSKAiPRPASDSKIISHPGPSSESKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMP 872
Cdd:PTZ00449   521 PKAPGDKEGEEGEHEDSKESDEPKEGGK-PGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKR 599
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  873 KGSPTPPGPRPTAGQtvptPQQSPKPQEQSRRFSlnlgsiTDAPKSqPTTPQETVTgklfgfgasifsqasnlistagqp 952
Cdd:PTZ00449   600 PRSAQRPTRPKSPKL----PELLDIPKSPKRPES------PKSPKR-PPPPQRPSS------------------------ 644
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  953 gPHSQSGPGAPmkQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAA------EKLEPKAEQAPTVKRTETEKK 1026
Cdd:PTZ00449   645 -PERPEGPKII--KSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVldesfeSILKETLPETPGTPFTTPRPL 721
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 2462613745 1027 PPPIKDSKSLTAEPQKAvlPTKLEKSPKPESTCPLCKT 1064
Cdd:PTZ00449   722 PPKLPRDEEFPFEPIGD--PDAEQPDDIEFFTPPEEER 757
PHA02682 PHA02682
ORF080 virion core protein; Provisional
409-549 4.60e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.54  E-value: 4.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  409 KPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGkipaqqagpgktsaqqtgPTKPPSQLPgPAKPPPQQPGPAK 488
Cdd:PHA02682    75 RPSGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPA------------------PACPPATAP-TCPPPAVCPAPAR 135
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462613745  489 PPPQQPGSAKP-PPQQPGST-KPPPqqpgPAKPSPQQpgSTKPPSQQPGSAKPSAQQPSPAKP 549
Cdd:PHA02682   136 PAPACPPSTRQcPPAPPLPTpKPAP----AAKPIFLH--NQLPPPDYPAASCPTIETAPAASP 192
DUF3824 pfam12868
Domain of unknwon function (DUF3824); This is a repeating domain found in fungal proteins. It ...
474-553 4.83e-03

Domain of unknwon function (DUF3824); This is a repeating domain found in fungal proteins. It is proline-rich, and the function is not known.


Pssm-ID: 372351 [Multi-domain]  Cd Length: 145  Bit Score: 40.49  E-value: 4.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  474 PGPAKPPPQQPGPAKPPPQQPGSAKPPPqqPGSTKPPPQQPGPAK-PSPQQPGSTKPPsqqPGSAKPSAQQPSPAKPSAQ 552
Cdd:pfam12868   58 PSPYPPSPAGPYASQGQYYPETNYFPPP--PGSTPQPPVDPQPNApPPPYNPADYPPP---PGAAPPPQPYQYPPPPGPD 132

                   .
gi 2462613745  553 Q 553
Cdd:pfam12868  133 P 133
PRK11901 PRK11901
hypothetical protein; Reviewed
406-568 5.09e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.36  E-value: 5.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  406 GPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTgpTKPPsqlpgPAKPPPQQPG 485
Cdd:PRK11901    60 SPTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQD--ISAP-----PISPTPTQAA 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  486 PAKPPPQQ-----PG--------------SAKPPPQQPGS---TKPPPQQPGPAKPSPQQPGSTKPPSQQPGSAKPSAQQ 543
Cdd:PRK11901   133 PPQTPNGQqrielPGnisdalsqqqgqvnAASQNAQGNTStlpTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNH 212
                          170       180
                   ....*....|....*....|....*..
gi 2462613745  544 PS--PAKPSAQQSTKPVSQTGSGKPLQ 568
Cdd:PRK11901   213 HKtaTVAVPPATSGKPKSGAASARALS 239
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
498-584 5.17e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.34  E-value: 5.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  498 KPPPQQPGSTKPPPQQPGPAKPSPQQPgstKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVSPSAK 577
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPA---AKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAV 113

                   ....*..
gi 2462613745  578 QPPSQGL 584
Cdd:PRK12270   114 EDEVTPL 120
PDZ_Par6-like cd06718
PDZ domain of partitioning defective 6 (Par6), Drosophila Rho GTPase-activating protein 100F ...
4514-4577 5.23e-03

PDZ domain of partitioning defective 6 (Par6), Drosophila Rho GTPase-activating protein 100F (RhoGAP100F), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Par6 (also known as PAR6 or Par-6), RhoGAP100F, and related domains. Par6 is part of a conserved machinery that directs metazoan cell polarity, a process necessary for the function of diverse cell types. Par6 forms a cell polarity-regulatory complex with atypical protein kinase C (aPKC) and Par3. Par6 can also directly associate with PALS1 (proteins associated with Lin7, also known as Stardust) providing a link between the Par3/aPKC/Par6 complex and the PALS1-PATJ (protein-associated TJ) complex. Binding partners of the Par6-PDZ domain include Par3, PALS1/Stardust; leucine-rich repeat-containing protein netrin-G ligand-2 (NGL-2), human crumbs (CRB3) involve in the morphogenesis of the tight junctions in mammalian epithelial cells, and PAR-6 co-operates with the Par6 semi-CRIB domain to bind CDC42. CDC42 regulates the Par6 PDZ domain through an allosteric CRIB-PDZ transition. Drosophila RhoGAP100F, also known as synapse defective protein 1 homolog (syd-1 homolog), is a GTPase activator for the Rho-type GTPases by converting them to an inactive GDP-bound form. The RhoGAP100F-PDZ domain binds the neurexin C terminus to control synapse formation at the Drosophila neuromuscular junction. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par6-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467202 [Multi-domain]  Cd Length: 84  Bit Score: 38.70  E-value: 5.23e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462613745 4514 LGIRIVGGKEIPGHSGeigAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIS 4577
Cdd:cd06718     13 LGFYIRDGNGVERVPG---IFISRLVLGSLADSTGLLAVGDEILEVNGVEVTGKSLDDVTDMMV 73
PDZ2_harmonin cd06738
PDZ domain 2 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic ...
4507-4572 5.40e-03

PDZ domain 2 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of harmonin isoforms a, b, and c, and related domains. Harmonin (also known as Usher Type 1C, PDZ-73 and AIE-75) is a key organizer of the Usher (USH) protein interactome. USH syndrome is the leading cause of hereditary sensory deaf-blindness in humans; three clinically distinct types of USH have been identified, type 1 to 3. The gene encoding harmonin (USH1C) is the causative gene for the USH type 1C phenotype. There are at least 10 alternatively spliced isoforms of harmonin, which are divided into three subclasses (a, b, and c). All isoforms contain the first two PDZ domains and the first coiled-coil domain. The a and b isoforms all have a third PDZ domain. The different PDZ domains are responsible for interactions with all known Usher syndrome type 1 proteins, and most Usher syndrome type 2 proteins. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This harmonin family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467220 [Multi-domain]  Cd Length: 82  Bit Score: 38.84  E-value: 5.40e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745 4507 HTVSGNGLGIRIVGGkeiPGHSGeiGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKTYEEV 4572
Cdd:cd06738      8 SLVGTRGLGCSISSG---PTQKP--GIFISNVKPGSLAEEVG-LEVGDQIVEVNGTSFTNVDHKEA 67
PDZ1_Scribble-like cd06704
PDZ domain 1 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
4498-4573 5.48e-03

PDZ domain 1 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467188 [Multi-domain]  Cd Length: 87  Bit Score: 38.80  E-value: 5.48e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745 4498 IKITRDskdhtvsGNGLGIRIVGGK-EIPGHSGEIGAYIAKILPGGSAEQTGkLMEGMQVLEWNGIPLTSKT-YEEVQ 4573
Cdd:cd06704      3 ITIERQ-------TGGLGISIAGGKgSTPYKGDDEGIFISRVTEGGPAAKAG-VRVGDKLLEVNGVDLVDADhHEAVE 72
HC2 pfam07382
Histone H1-like nucleoprotein HC2; This family contains the bacterial histone H1-like ...
370-560 5.63e-03

Histone H1-like nucleoprotein HC2; This family contains the bacterial histone H1-like nucleoprotein HC2 (approximately 200 residues long), which seems to be found mostly in Chlamydia. HC2 functions in DNA condensation, although it has been suggested that it also has other roles.


Pssm-ID: 369339 [Multi-domain]  Cd Length: 187  Bit Score: 41.30  E-value: 5.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  370 KPPAQQTGSEKPSSEQPGP-KALAQPPGVGKTPAQQPGPAKP-PTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPG 447
Cdd:pfam07382    7 KRSSKKTAAKKAAVRKPAAkKAAAKKTVVRKVAAKKPAARKTvAKKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAK 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  448 KIPAQQAGPGKTSAQQTGPTKPPSQLPGpAKPPPQQPGPAKPPPQQPGSAKPPPQQpgstkppPQQPGPAKPSPQQPGSt 527
Cdd:pfam07382   87 KATAKKVAAKKVVAKKTVAKKAAAKKPA-AKKAVAKKAVARKPAAKKAVAKKAAST-------CHKNHKHTAACKRVAS- 157
                          170       180       190
                   ....*....|....*....|....*....|...
gi 2462613745  528 kPPSQQPGSAKPSAQQPSPAKPsaQQSTKPVSQ 560
Cdd:pfam07382  158 -SSATRAACGSKSRVNPAHGWR--QQLMKLVSR 187
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
382-517 5.97e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 40.54  E-value: 5.97e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   382 SSEQPGPKALAQPPGVGKTPAQQPGpakPPTQqvgtpkPLAQQPGLQSpakapgptKTPVQQPGPGKIPAQQAGPGKTSA 461
Cdd:smart00818   43 SQQHPPTHTLQPHHHIPVLPAQQPV---VPQQ------PLMPVPGQHS--------MTPTQHHQPNLPQPAQQPFQPQPL 105
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745   462 QQTGPTKPPSQLPGPAKPPPQQPGPAKPP--PQQPgsakPPPQQPGStkppPQQPGPA 517
Cdd:smart00818  106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPmfPMQP----LPPLLPDL----PLEAWPA 155
PHA03418 PHA03418
hypothetical E4 protein; Provisional
314-534 6.09e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 41.65  E-value: 6.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  314 PPAQQPGHEKSQPGPAKPPAqpsgltkplaqqpgtvkPPVQPPGTTKPPAQPLGPAKPPAqqtgsekpsSEQPGPKALAQ 393
Cdd:PHA03418    34 PLLPAPHHPNPQEDPDKNPS-----------------PPPDPPLTPRPPAQPNGHNKPPV---------TKQPGGEGTEE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  394 PPGVGKTPAQQPGPakpptqqvgtpkplaqQPGLQSPAKAPGPtktpvqqpgpgkipaqqaGPGKTSAQqtgptkpPSQL 473
Cdd:PHA03418    88 DHQAPLAADADDDP----------------RPGKRSKADEHGP------------------APGRAALA-------PFKL 126
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462613745  474 PGPAKPPPQQPgpaKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTKPPSQQP 534
Cdd:PHA03418   127 DLDQDPLHGDP---DPPPGATGGQGEEPPEGGEESQPPLGEGEGAVEGHPPPLPPAPEPKP 184
PRK10263 PRK10263
DNA translocase FtsK; Provisional
855-1060 6.27e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.15  E-value: 6.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  855 PKKAQTKMSPKPDAKPMPKgSPT---PPGPRPTAGQTVPTPQ-QSPKPQEQSRRFSLNlgsiTDAPKSQPTTPQETVTGK 930
Cdd:PRK10263   336 PVEPVTQTPPVASVDVPPA-QPTvawQPVPGPQTGEPVIAPApEGYPQQSQYAQPAVQ----YNEPLQQPVQPQQPYYAP 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  931 LFGFGASIFSQASNLISTAGQPGPHSQsgPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEP 1010
Cdd:PRK10263   411 AAEQPAQQPYYAPAPEQPAQQPYYAPA--PEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQP 488
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745 1011 KAEQAPTVKrtETEKKPPPI-----------KDSKSLTA------EPQKAVLPTKLEKSPKPESTCP 1060
Cdd:PRK10263   489 VVEPEPVVE--ETKPARPPLyyfeeveekraREREQLAAwyqpipEPVKEPEPIKSSLKAPSVAAVP 553
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
67-501 6.63e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 6.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   67 PKGSVPPAAAESPSMHRKQELDSSHPPKQSGRPPDPGRPAQPglSKSRTTDTFRSEQKlPGRSPSTISLKESKSRTDLKE 146
Cdd:PHA03307    88 PTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPS--PAPDLSEMLRPVGS-PGPPPAASPPAAGASPAAVAS 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  147 EHKSSMMPGFLSEVnALSAVSSVVNKFNPFDLISDSEAsqeettkkqkvvqkeqgkPEGIIKPPLQQQPPKPIPKQQGPG 226
Cdd:PHA03307   165 DAASSRQAALPLSS-PEETARAPSSPPAEPPPSTPPAA------------------ASPRPPRRSSPISASASSPAPAPG 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  227 RDplQQDGTPKSISSQQPekikSQPPGTGKPIQGPTQTPQTDHAKLPlqrdasrpqtkqadivrgesvkpslpspskPPI 306
Cdd:PHA03307   226 RS--AADDAGASSSDSSS----SESSGCGWGPENECPLPRPAPITLP------------------------------TRI 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSgltkplaqqpgtvkpPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQP 386
Cdd:PHA03307   270 WEASGWNGPSSRPGPASSSSSPRERSPSPS---------------PSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSE 334
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  387 GPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGP 466
Cdd:PHA03307   335 SSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRP 414
                          410       420       430
                   ....*....|....*....|....*....|....*
gi 2462613745  467 TKPPSQLPGPAKPPPQQPGPAKPPPqQPGSAKPPP 501
Cdd:PHA03307   415 SPLDAGAASGAFYARYPLLTPSGEP-WPGSPPPPP 448
PDZ4_LNX1_2-like cd06680
PDZ domain 4 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ ...
4514-4581 6.73e-03

PDZ domain 4 of human Ligand of Numb protein X 1 (LNX1) and LNX2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of LNX1 (also known as PDZ domain-containing RING finger protein 2, PDZRN2)and LNX2 (also known as PDZ domain-containing RING finger protein 1, PDZRN1), and related domains. LNX1 and LNX2 are Ring (Really Interesting New Gene) finger and PDZ domain-containing E3 ubiquitin ligases that bind to the cell fate determinant protein NUMB and mediate its ubiquitination. LNX1 can ubiquitinate a number of other ligands including PPFIA1, KLHL11, KIF7 and ERC2. LNX1 and LNX2 each have four PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This LNX family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467168 [Multi-domain]  Cd Length: 89  Bit Score: 38.87  E-value: 6.73e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745 4514 LGIRIVGGKEipGHSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIISQQSG 4581
Cdd:cd06680     13 LGFSIVGGYE--ESHGNQPFFVKSIVPGTPAYNDGRLKCGDIILAVNGVSTVGMSHAALVPLLKEQRG 78
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
457-521 6.75e-03

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 42.59  E-value: 6.75e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462613745  457 GKTSAQQTGPTkPPSQLPGPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPppQQPGPAKPSP 521
Cdd:PRK01297    10 GKGEAEQPAPA-PPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKP--RRERKPKPAS 71
PDZ5_MAGI-1_3-like cd06735
PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
4510-4581 6.96e-03

PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5, and belongs to this MAGI1,2,3-like family. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ5 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467217 [Multi-domain]  Cd Length: 84  Bit Score: 38.33  E-value: 6.96e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462613745 4510 SGNGLGIRIVGGKEipghSGEIGAYIAKILPGGSAEQTGKLMEGMQVLEWNGIPLTSKTYEEVQSIIsQQSG 4581
Cdd:cd06735      9 GPKGFGFSIRGGRE----YNNMPLYVLRLAEDGPAQRDGRLRVGDQILEINGESTQGMTHAQAIELI-RSGG 75
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
942-1007 7.00e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.96  E-value: 7.00e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462613745  942 ASNLISTAGQPGPHSQSGPGAPMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEK 1007
Cdd:PRK12270    46 AAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAA 111
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1142-1309 7.07e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 42.33  E-value: 7.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1142 ESSSQKTAVPPQVKLVKKQEQEVKTEAEkvILEKVKETLSMEKIPPMVTT--------------DQKQEESKLEKDKASA 1207
Cdd:COG3064      3 EALEEKAAEAAAQERLEQAEAEKRAAAE--AEQKAKEEAEEERLAELEAKrqaeeeareakaeaEQRAAELAAEAAKKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 1208 LQEKKPLPEEKKLIPEEEKIRSEEKKPLLEEKKPTPEDKKLLPEAKTSAPEEQKHDLLKSQVQIAEEKLEGRVAPKTVQE 1287
Cdd:COG3064     81 EAEKAAAEAEKKAAAEKAKAAKEAEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAA 160
                          170       180
                   ....*....|....*....|..
gi 2462613745 1288 GKQPQTKMEGLPSGTPQSLPKE 1309
Cdd:COG3064    161 AAAAAAAAAAAARAAAGAAAAL 182
PRK11633 PRK11633
cell division protein DedD; Provisional
401-512 7.25e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 41.14  E-value: 7.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  401 PAQQPGPAKPPtqqvgtpkplaqqPGLQSPAKAPGPTKTPVQqpgpgkiPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPP 480
Cdd:PRK11633    58 AATQALPTQPP-------------EGAAEAVRAGDAAAPSLD-------PATVAPPNTPVEPEPAPVEPPKPKPVEKPKP 117
                           90       100       110
                   ....*....|....*....|....*....|..
gi 2462613745  481 PQQPGPAKPPPQQPgsaKPPPQQPGSTKPPPQ 512
Cdd:PRK11633   118 KPKPQQKVEAPPAP---KPEPKPVVEEKAAPT 146
PHA03291 PHA03291
envelope glycoprotein I; Provisional
329-486 7.29e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 42.25  E-value: 7.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  329 AKPPAQPSgLTKPLAQQP------------GTVKPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPG-----PKAL 391
Cdd:PHA03291   113 TQSPAYAT-LTLDLARQPllrargaaravvGLYVLRVWVEGATNASLFPLGLAAFPAEGTLAAPPLGEGSAdgscdPALP 191
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  392 AQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPtktpvqqPGPGKIPAQQAGpgkTSAQQTGPTKPPS 471
Cdd:PHA03291   192 LSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIP-------APSTTIAAPQAG---TTPEAEGTPAPPT 261
                          170
                   ....*....|....*
gi 2462613745  472 QLPGPAKPPPQQPGP 486
Cdd:PHA03291   262 PGGGEAPPANATPAP 276
PHA03418 PHA03418
hypothetical E4 protein; Provisional
307-486 7.36e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 41.26  E-value: 7.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  307 QQPTPGKPPAQQPGHEKSQPGPAKPPAQPSGLTKP-LAQQPGTvkppvqpPGTTKPPAQPLGPakppaqqTGSEKPsseQ 385
Cdd:PHA03418    40 HHPNPQEDPDKNPSPPPDPPLTPRPPAQPNGHNKPpVTKQPGG-------EGTEEDHQAPLAA-------DADDDP---R 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  386 PGPKALAQPPGVGktpaqqPGPAKPPTQQVGtpkpLAQQPGLQSPAKAPGPTKTPVQQPGPGKiPAQQAGPGKTSAQQTG 465
Cdd:PHA03418   103 PGKRSKADEHGPA------PGRAALAPFKLD----LDQDPLHGDPDPPPGATGGQGEEPPEGG-EESQPPLGEGEGAVEG 171
                          170       180
                   ....*....|....*....|.
gi 2462613745  466 PTKPPsqlpgpakPPPQQPGP 486
Cdd:PHA03418   172 HPPPL--------PPAPEPKP 184
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
808-1060 7.55e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.61  E-value: 7.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  808 PTGEKVSPFDSKAIPRPASDSKIISHPGPsSESKGQKQVDPVQKKEEPKKAQTKMSPKP------DAKPMPKGSPTPPGP 881
Cdd:PLN03209   315 PMEELLAKIPSQRVPPKESDAADGPKPVP-TKPVTPEAPSPPIEEEPPQPKAVVPRPLSpytayeDLKPPTSPIPTPPSS 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  882 RPTAGQTVPTPQQSPKPQEQSrrfslNLGSITDAPKSQPTTPQETVTGKLfgfgaSIFSQASNLiSTAGQPGPHSQSGPG 961
Cdd:PLN03209   394 SPASSKSVDAVAKPAEPDVVP-----SPGSASNVPEVEPAQVEAKKTRPL-----SPYARYEDL-KPPTSPSPTAPTGVS 462
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  962 APMKQAPAPSQPPTSQGPPKSTGQAPPAPAKSIPVkketkAPAAEKLEPKAEQAPTVKRTETEKKPPPIKD-SKSLTAEP 1040
Cdd:PLN03209   463 PSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPL-----SPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEvVKVGNSAP 537
                          250       260
                   ....*....|....*....|
gi 2462613745 1041 QKAVLPTKLEKSPKPESTCP 1060
Cdd:PLN03209   538 PTALADEQHHAQPKPRPLSP 557
PROL5-SMR pfam15621
Proline-rich submaxillary gland androgen-regulated family; SMR is a family of proteins found ...
469-546 7.83e-03

Proline-rich submaxillary gland androgen-regulated family; SMR is a family of proteins found in eukaryotes. The family of SMR proteins is expressed in the submaxillary gland. SMR members may play a role in protection or detoxification.


Pssm-ID: 434817 [Multi-domain]  Cd Length: 103  Bit Score: 39.02  E-value: 7.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  469 PPSQLPgPAKPPPQQPG---PAKPPPQQPGSAKPPPqqpgstkPPPQQPGPAKPSPqqpgstkPPSQQPGSAKPSAQQPS 545
Cdd:pfam15621   31 PRRPLP-PSQPPPNGPQigpPPPPPPYGPGRIPPPP-------PPPFGPGRIPPPP-------PPPYGPGRILSQSFPPP 95

                   .
gi 2462613745  546 P 546
Cdd:pfam15621   96 P 96
PRK10263 PRK10263
DNA translocase FtsK; Provisional
3845-4003 7.94e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 7.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745 3845 PTRIESQHGIERPRTAPQTEFSQFIPPQTQTESQLVPPTSPYTQYQYSSPALPTQAPTSYTQQSHFEQQTlYHQQVSPYQ 3924
Cdd:PRK10263   747 PIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQPVA 825
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462613745 3925 TQPTFQavatmsfTPQVQPTPTPQPSYQLPSQMMVIQQKPRQTTLYLEPKItsnyEVIRNQPLMIAPVstdNTFAVSHL 4003
Cdd:PRK10263   826 PQPQYQ-------QPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSL----DLLTPPPSEVEPV---DTFALEQM 890
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
330-433 8.01e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 8.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  330 KPPAQPSGLTKPLAQQPGTVKPPVQPPGTTKPPAQPLGPAKPPAQQTgsekpsseqPGPKALAQPPgvgkTPAQQPGPAK 409
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAP---------PKPAAAAAAA----AAPAAPPAAA 103
                           90       100
                   ....*....|....*....|....
gi 2462613745  410 PPTQQVGTPKPLAQQPgLQSPAKA 433
Cdd:PRK12270   104 AAAAPAAAAVEDEVTP-LRGAAAA 126
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
668-883 8.16e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 42.73  E-value: 8.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  668 TTSAVSKSSPQPQQ----TSPKKDAAPKQDLSKAPEPKKPPPLVKQPTLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSE 743
Cdd:PTZ00108  1161 KTKGKASKLRKPKLkkkeKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV 1240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  744 QDKAPVADDKPKQPKMVKPTTDLVSSSSATTKPDIPSSkvqsQAEEKTTPPLKTDSAKPSQsfppTGEKVSPFDSKAIPR 823
Cdd:PTZ00108  1241 KRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRV----SAVQYSPPPPSKRPDGESN----GGSKPSSPTKKKVKK 1312
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  824 PasdskiishpgpssesKGQKQVDPVQKKEEPKKAQTKMSPKPDAKPMPKGSPTPPGPRP 883
Cdd:PTZ00108  1313 R----------------LEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRP 1356
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
452-549 8.22e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 40.16  E-value: 8.22e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   452 QQAGPGKTSAQQTGPTKPPSQLPgpaKPPPQQPG-PAKPPPQQPG--SAKPPPQQPGSTKPPPQQPGPAKPSPQQPGSTK 528
Cdd:smart00818   37 HQIIPVSQQHPPTHTLQPHHHIP---VLPAQQPVvPQQPLMPVPGqhSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQP 113
                            90       100
                    ....*....|....*....|.
gi 2462613745   529 PPSQQPGSAKPSAQQPSPAKP 549
Cdd:smart00818  114 MQPQPPVHPIPPLPPQPPLPP 134
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
385-530 9.03e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 40.16  E-value: 9.03e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745   385 QPGPKALAQPPGVGKTPaQQPGPAKPPTQQVGTPKPLAQQPGLQSpakapgptKTPVQQPGPGKIPAQQagpgktsaqqt 464
Cdd:smart00818   38 QIIPVSQQHPPTHTLQP-HHHIPVLPAQQPVVPQQPLMPVPGQHS--------MTPTQHHQPNLPQPAQ----------- 97
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462613745   465 gptkppsqlpgpakpPPQQPGPAKPPPQQPGSAKPPPQQPgstkPPPQQPGPAKPS--PQQPGSTKPP 530
Cdd:smart00818   98 ---------------QPFQPQPLQPPQPQQPMQPQPPVHP----IPPLPPQPPLPPmfPMQPLPPLLP 146
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
488-574 9.09e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 9.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  488 KPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPSPqqpgSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPL 567
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAP----APAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAA 112

                   ....*..
gi 2462613745  568 QPPTVSP 574
Cdd:PRK12270   113 VEDEVTP 119
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
350-440 9.24e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 9.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  350 KPPVQPPGTTKPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAqppgvgktPAQQPGPAKPPTQQVGTPKPLAQQPGLQS 429
Cdd:PRK12270    37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAA--------PAAPPKPAAAAAAAAAPAAPPAAAAAAAP 108
                           90
                   ....*....|.
gi 2462613745  430 PAKAPGPTKTP 440
Cdd:PRK12270   109 AAAAVEDEVTP 119
PHA03291 PHA03291
envelope glycoprotein I; Provisional
446-554 9.35e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.86  E-value: 9.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  446 PGKIPAQQAGPGKTSAQQTGPTKPPSQLPGPAKPPPQQPGPAKPPPQQPGS--AKPPPQQPGSTKPPPQQPGPAKPSPQQ 523
Cdd:PHA03291   167 PAEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASpeTTPTPSTTTSPPSTTIPAPSTTIAAPQ 246
                           90       100       110
                   ....*....|....*....|....*....|...
gi 2462613745  524 PGSTKPPSQQPGSAKPSAQQ--PSPAKPSAQQS 554
Cdd:PHA03291   247 AGTTPEAEGTPAPPTPGGGEapPANATPAPEAS 279
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
429-597 9.53e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.39  E-value: 9.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  429 SPAKAPGPTKTPVQQPGPGKIPAQQAGPGKTSAQQTGPtKPPSQlpgPAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTK 508
Cdd:PRK08691   359 APLAAASCDANAVIENTELQSPSAQTAEKETAAKKPQP-RPEAE---TAQTPVQTASAAAMPSEGKTAGPVSNQENNDVP 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  509 PPPQQPGPAK-PSPQQPGSTKPPSQQPGSAKPSAQQPSPAKPSAQQSTKPVSQTGSGKPLQPPTVS-----------PSA 576
Cdd:PRK08691   435 PWEDAPDEAQtAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDeavetetfaheAPA 514
                          170       180
                   ....*....|....*....|.
gi 2462613745  577 KQPPSQGLPKTICPLCNTTEL 597
Cdd:PRK08691   515 EPFYGYGFPDNDCPPEDGAEI 535
PRK01297 PRK01297
ATP-dependent RNA helicase RhlB; Provisional
360-472 9.58e-03

ATP-dependent RNA helicase RhlB; Provisional


Pssm-ID: 234938 [Multi-domain]  Cd Length: 475  Bit Score: 41.82  E-value: 9.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  360 KPPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQ---------------- 423
Cdd:PRK01297    11 KGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPKPASLWKledfvvepqegktrfh 90
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462613745  424 ----QPGLQSPAKAPG-PTKTPVQQ-------PGPGKIPAQQAGPGKTSA------QQTGPTKPPSQ 472
Cdd:PRK01297    91 dfnlAPELMHAIHDLGfPYCTPIQAqvlgytlAGHDAIGRAQTGTGKTAAflisiiNQLLQTPPPKE 157
Jun pfam03957
Jun-like transcription factor;
364-454 9.74e-03

Jun-like transcription factor;


Pssm-ID: 461108 [Multi-domain]  Cd Length: 231  Bit Score: 41.05  E-value: 9.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  364 QPLGPAKPPAQQTGSekPSSEQPGPKALAQPPGVGKTPAQQPGPAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQ 443
Cdd:pfam03957  125 QNQLPGATPAPQALA--AGGGGSGPGALAAGGIATEPPVYANLSSFNPAAAPASGAAPAQPPQPVSYAAEPPPFAVPVQH 202
                           90
                   ....*....|.
gi 2462613745  444 PGPGKIPAQQA 454
Cdd:pfam03957  203 PPPGRPPRLQA 213
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
361-546 9.78e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.00  E-value: 9.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  361 PPAQPLGPAKPPAQQTGSEKPSSEQPGPKALAQPPgvgktpaqQPGPAKPPTQqvgTPKPLAQQPGLQSPAKAPGPTKTP 440
Cdd:PRK08691   360 PLAAASCDANAVIENTELQSPSAQTAEKETAAKKP--------QPRPEAETAQ---TPVQTASAAAMPSEGKTAGPVSNQ 428
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  441 VQQPGPgkipaqqagPGKTSAQQTGPTKPPSQLPgpAKPPPQQPGPAKPPPQQPGSAKPPPQQPGSTKPPPQQPGPAKPS 520
Cdd:PRK08691   429 ENNDVP---------PWEDAPDEAQTAAGTAQTS--AKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQAT 497
                          170       180
                   ....*....|....*....|....*.
gi 2462613745  521 PQQPGSTKPPSQQPGSAKPSAQQPSP 546
Cdd:PRK08691   498 PNDEAVETETFAHEAPAEPFYGYGFP 523
PHA03264 PHA03264
envelope glycoprotein D; Provisional
410-519 9.82e-03

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 41.91  E-value: 9.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462613745  410 PPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPVQQPGPgkipAQQAGPGK-TSAQQTGPTKPPSQLPGPAKPPPQQPGPAk 488
Cdd:PHA03264   255 PPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGP----VEDGAPGReTGGEGEGPEPAGRDGAAGGEPKPGPPRPA- 329
                           90       100       110
                   ....*....|....*....|....*....|.
gi 2462613745  489 PPPQQPGSAkppPQQPGSTKPPPQQPGPAKP 519
Cdd:PHA03264   330 PDADRPEGW---PSLEAITFPPPTPATPAVP 357
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH