NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|767994867|ref|XP_011523143|]
View 

leucine-rich repeat-containing protein 37A2 isoform X1 [Homo sapiens]

Protein Classification

leucine-rich repeat domain-containing protein( domain architecture ID 13465530)

leucine-rich repeat (LRR) domain-containing protein may participate in protein-protein interactions

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
LRRC37AB_C pfam14914
LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of ...
1444-1589 2.69e-78

LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of the putative Leucine Rich Repeat Containing protein 37A or protein 37B (LRRC37A/B) found in eukaryotes. The Leucine Rich Repeats (LRR) lies in the central region. The gene that encodes this protein is found in the chromosomal position 17q11.2, and its microdeletion results in the disease, neurofibromatosis type-1 (NF1). The function of the protein, LRRC37B is unknown, however experimental data shows expression in the aorta, heart, skeletal muscle, liver and brain during gestation.


:

Pssm-ID: 464370  Cd Length: 147  Bit Score: 255.04  E-value: 2.69e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  1444 SPGDQFEIQLTQQLQSLIPNNNVRRLIAHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASKIEWDTDQWK 1523
Cdd:pfam14914    1 SPGDQFEIQLNQQLLSLIPNVDVRRLISHVIRTLKMDCSEPQMQLACAKLISRTGLLMKLLSEQQEAKVSKADWDTDQWK 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767994867  1524 IENYINESTEAQSEQKE-KSLELKKEVPGYGYTDKLILALIVTGILTILIILFCLIVICCHRRSLQE 1589
Cdd:pfam14914   81 NENYINESTEAQSKQKKqSSRELTKEVPGYGYNNKLILAISVTVVIMILIIILCLIEICSHRSASGE 147
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
560-629 8.26e-23

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


:

Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 93.58  E-value: 8.26e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767994867   560 EVELSPTMKETPTQP---PKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHST 629
Cdd:pfam15779    1 EVEPSPTQQETPTQPpesPKEVVAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
251-319 2.83e-15

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


:

Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 72.01  E-value: 2.83e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767994867   251 EEEPSSMQQEAPALPPESSMESL--TLPNHEVSVQPPGEDQAYY-HLPNITVKPADVEVTITSEPTNETESS 319
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVVaqPPVHHEVTVPTPGQGQAQHpTLPNVTVQPLDLELTITPEPTKEAEHS 72
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
206-704 3.33e-15

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 81.67  E-value: 3.33e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  206 PPGPSEQVGPSQFHLE----PETQNPETLEDIQSSSLQQEAPAQLPQLLEEEPssMQQEAPALPPESSMESLTLPNHEVS 281
Cdd:PRK10263  344 PPVASVDVPPAQPTVAwqpvPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEP--LQQPVQPQQPYYAPAAEQPAQQPYY 421
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  282 VQPPGEDQAYYHLPNITVKPA--------DVEVTITSEPTNETESSQAQ---QETPIQFPEEVEPSATQQEAP----IEP 346
Cdd:PRK10263  422 APAPEQPAQQPYYAPAPEQPVagnawqaeEQQSTFAPQSTYQTEQTYQQpaaQEPLYQQPQPVEQQPVVEPEPvveeTKP 501
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  347 PVPPMEHELSISEQQ-----------QPV-QPSESPREVESSPTQQETPGQPPEHHEVTVSP--PGHHQTHHLASPSVSV 412
Cdd:PRK10263  502 ARPPLYYFEEVEEKRarereqlaawyQPIpEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlaSGVKKATLATGAAATV 581
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  413 KPPDVQLTIAAEPSAEVGTSLVHQ-------EATTRLSGSGNDVEPPAIQhggppLLPESSEEAGPLAVQQETSFQSPEP 485
Cdd:PRK10263  582 AAPVFSLANSGGPRPQVKEGIGPQlprpkriRVPTRRELASYGIKLPSQR-----AAEEKAREAQRNQYDSGDQYNDDEI 656
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  486 INNENPSPTQQEAAAEHPQTAEEGESSLTHQ----EAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDLG-FTITP----- 555
Cdd:PRK10263  657 DAMQQDELARQFAQTQQQRYGEQYQHDVPVNaedaDAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDdFEFSPmkall 736
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  556 -ESKTEVELSPT-MKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEpTTEVGHSTPPKR 633
Cdd:PRK10263  737 dDGPHEPLFTPIvEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQPVAPQP 815
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867  634 TIVSPKHPEVTLPHPDQVQTQhshltrATVQPLDLgfTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSD 704
Cdd:PRK10263  816 QYQQPQQPVAPQPQYQQPQQP------VAPQPQDT--LLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSE 878
LRR_8 pfam13855
Leucine rich repeat;
892-952 2.78e-12

Leucine rich repeat;


:

Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.78e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867   892 EKLILRENNLTELHKDSFEGLLSLQYLNLSCNVITELSFGTFqawHGMQFLHKLILNHNPL 952
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAF---SGLPSLRYLDLSGNRL 61
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
689-739 3.37e-12

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


:

Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 63.15  E-value: 3.37e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767994867   689 TAPPPGHPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam15779   23 VAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
 
Name Accession Description Interval E-value
LRRC37AB_C pfam14914
LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of ...
1444-1589 2.69e-78

LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of the putative Leucine Rich Repeat Containing protein 37A or protein 37B (LRRC37A/B) found in eukaryotes. The Leucine Rich Repeats (LRR) lies in the central region. The gene that encodes this protein is found in the chromosomal position 17q11.2, and its microdeletion results in the disease, neurofibromatosis type-1 (NF1). The function of the protein, LRRC37B is unknown, however experimental data shows expression in the aorta, heart, skeletal muscle, liver and brain during gestation.


Pssm-ID: 464370  Cd Length: 147  Bit Score: 255.04  E-value: 2.69e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  1444 SPGDQFEIQLTQQLQSLIPNNNVRRLIAHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASKIEWDTDQWK 1523
Cdd:pfam14914    1 SPGDQFEIQLNQQLLSLIPNVDVRRLISHVIRTLKMDCSEPQMQLACAKLISRTGLLMKLLSEQQEAKVSKADWDTDQWK 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767994867  1524 IENYINESTEAQSEQKE-KSLELKKEVPGYGYTDKLILALIVTGILTILIILFCLIVICCHRRSLQE 1589
Cdd:pfam14914   81 NENYINESTEAQSKQKKqSSRELTKEVPGYGYNNKLILAISVTVVIMILIIILCLIEICSHRSASGE 147
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
560-629 8.26e-23

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 93.58  E-value: 8.26e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767994867   560 EVELSPTMKETPTQP---PKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHST 629
Cdd:pfam15779    1 EVEPSPTQQETPTQPpesPKEVVAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
251-319 2.83e-15

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 72.01  E-value: 2.83e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767994867   251 EEEPSSMQQEAPALPPESSMESL--TLPNHEVSVQPPGEDQAYY-HLPNITVKPADVEVTITSEPTNETESS 319
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVVaqPPVHHEVTVPTPGQGQAQHpTLPNVTVQPLDLELTITPEPTKEAEHS 72
PRK10263 PRK10263
DNA translocase FtsK; Provisional
206-704 3.33e-15

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 81.67  E-value: 3.33e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  206 PPGPSEQVGPSQFHLE----PETQNPETLEDIQSSSLQQEAPAQLPQLLEEEPssMQQEAPALPPESSMESLTLPNHEVS 281
Cdd:PRK10263  344 PPVASVDVPPAQPTVAwqpvPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEP--LQQPVQPQQPYYAPAAEQPAQQPYY 421
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  282 VQPPGEDQAYYHLPNITVKPA--------DVEVTITSEPTNETESSQAQ---QETPIQFPEEVEPSATQQEAP----IEP 346
Cdd:PRK10263  422 APAPEQPAQQPYYAPAPEQPVagnawqaeEQQSTFAPQSTYQTEQTYQQpaaQEPLYQQPQPVEQQPVVEPEPvveeTKP 501
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  347 PVPPMEHELSISEQQ-----------QPV-QPSESPREVESSPTQQETPGQPPEHHEVTVSP--PGHHQTHHLASPSVSV 412
Cdd:PRK10263  502 ARPPLYYFEEVEEKRarereqlaawyQPIpEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlaSGVKKATLATGAAATV 581
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  413 KPPDVQLTIAAEPSAEVGTSLVHQ-------EATTRLSGSGNDVEPPAIQhggppLLPESSEEAGPLAVQQETSFQSPEP 485
Cdd:PRK10263  582 AAPVFSLANSGGPRPQVKEGIGPQlprpkriRVPTRRELASYGIKLPSQR-----AAEEKAREAQRNQYDSGDQYNDDEI 656
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  486 INNENPSPTQQEAAAEHPQTAEEGESSLTHQ----EAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDLG-FTITP----- 555
Cdd:PRK10263  657 DAMQQDELARQFAQTQQQRYGEQYQHDVPVNaedaDAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDdFEFSPmkall 736
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  556 -ESKTEVELSPT-MKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEpTTEVGHSTPPKR 633
Cdd:PRK10263  737 dDGPHEPLFTPIvEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQPVAPQP 815
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867  634 TIVSPKHPEVTLPHPDQVQTQhshltrATVQPLDLgfTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSD 704
Cdd:PRK10263  816 QYQQPQQPVAPQPQYQQPQQP------VAPQPQDT--LLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSE 878
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
352-429 1.47e-14

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 70.08  E-value: 1.47e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867   352 EHELSISEQQQPVQPSESPREVEssptqqetpGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEV 429
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVV---------AQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEA 69
LRR_8 pfam13855
Leucine rich repeat;
892-952 2.78e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.78e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867   892 EKLILRENNLTELHKDSFEGLLSLQYLNLSCNVITELSFGTFqawHGMQFLHKLILNHNPL 952
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAF---SGLPSLRYLDLSGNRL 61
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
689-739 3.37e-12

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 63.15  E-value: 3.37e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767994867   689 TAPPPGHPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam15779   23 VAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
868-984 2.31e-11

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 68.04  E-value: 2.31e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  868 TILNFQGNYISYIDGNVWKAYSwTEKLILRENNLTELHkDSFEGLLSLQYLNLSCNVITELSfgtfQAWHGMQFLHKLIL 947
Cdd:COG4886   162 KSLDLSNNQLTDLPEELGNLTN-LKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQLTDLP----EPLANLTNLETLDL 235
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 767994867  948 NHNPLTTVedPYLFKLPALKYLDMG----TTLVPLTTLKNI 984
Cdd:COG4886   236 SNNQLTDL--PELGNLTNLEELDLSnnqlTDLPPLANLTNL 274
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
302-604 1.17e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 43.88  E-value: 1.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQ--PSESPREVESSPTQ 379
Cdd:COG5665   240 PSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTAKAQPQPPTKKQPAKepPSDTASGNPSAPSV 319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  380 QETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPdvqltiaAEPSAEVgTSLVHQEATTRLSGSgndVEPPAIQHGGP 459
Cdd:COG5665   320 LINSDSPTSEDPATASVPTTEETTAFTTPSSVPSTP-------AEKDTPA-TDLATPVSPTPPETS---VDKKVSPDSAT 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  460 PLLPESSEEAGPLAV-QQETSFQSPEPINNENPSPTQQEAAAEHPQTAEegesSLTHQEAPAQTPEFPNVVVAQPPEHSH 538
Cdd:COG5665   389 SSTKSEKEGGTASSPmPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPE----ADSAPESSVRTEASPSAGSDLEPENTT 464
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767994867  539 LTQAtvqpldlgftiTPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSP 604
Cdd:COG5665   465 LRDP-----------APNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQSIVGVLA 519
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
181-414 1.77e-03

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 42.37  E-value: 1.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   181 QNEYSSTDTPYPgslppELRVKSDEPPGPSEQVGPSQFHLEPETQNPETLedIQSSSLQQEAPAQLPQLLEEEPSSMQQE 260
Cdd:TIGR02223    3 QRDYVRRGRGAP-----QKKKKNRRLVRATVLIAAILILLFIGGSSGLYL--LTESKQANEPETLQPKNQTENGETAADL 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   261 APAlPPESSMESLTLPNHEVSVQPPGEDQAyyhlpnitVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQ 340
Cdd:TIGR02223   76 PPK-PEERWSYIEELEAREVLINDPEEPSN--------GGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVA 146
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767994867   341 EAPIEPPVPPMEHELSISEQQQ-PVQPSESPREVESSPTQQETPGQPPEHHEVTVSPpghHQTHHLASPSVSVKP 414
Cdd:TIGR02223  147 VEARKQTAEKKPQKARTAEAQKtPVETEKIASKVKEAKQKQKALPKQTAETQSNSKP---IETAPKADKADKTKP 218
 
Name Accession Description Interval E-value
LRRC37AB_C pfam14914
LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of ...
1444-1589 2.69e-78

LRRC37A/B like protein 1 C-terminal domain; This family represents the C-terminal domain of the putative Leucine Rich Repeat Containing protein 37A or protein 37B (LRRC37A/B) found in eukaryotes. The Leucine Rich Repeats (LRR) lies in the central region. The gene that encodes this protein is found in the chromosomal position 17q11.2, and its microdeletion results in the disease, neurofibromatosis type-1 (NF1). The function of the protein, LRRC37B is unknown, however experimental data shows expression in the aorta, heart, skeletal muscle, liver and brain during gestation.


Pssm-ID: 464370  Cd Length: 147  Bit Score: 255.04  E-value: 2.69e-78
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  1444 SPGDQFEIQLTQQLQSLIPNNNVRRLIAHVIRTLKMDCSGAHVQVTCAKLISRTGHLMKLLSGQQEVKASKIEWDTDQWK 1523
Cdd:pfam14914    1 SPGDQFEIQLNQQLLSLIPNVDVRRLISHVIRTLKMDCSEPQMQLACAKLISRTGLLMKLLSEQQEAKVSKADWDTDQWK 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767994867  1524 IENYINESTEAQSEQKE-KSLELKKEVPGYGYTDKLILALIVTGILTILIILFCLIVICCHRRSLQE 1589
Cdd:pfam14914   81 NENYINESTEAQSKQKKqSSRELTKEVPGYGYNNKLILAISVTVVIMILIIILCLIEICSHRSASGE 147
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
560-629 8.26e-23

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 93.58  E-value: 8.26e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767994867   560 EVELSPTMKETPTQP---PKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHST 629
Cdd:pfam15779    1 EVEPSPTQQETPTQPpesPKEVVAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
251-319 2.83e-15

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 72.01  E-value: 2.83e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 767994867   251 EEEPSSMQQEAPALPPESSMESL--TLPNHEVSVQPPGEDQAYY-HLPNITVKPADVEVTITSEPTNETESS 319
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVVaqPPVHHEVTVPTPGQGQAQHpTLPNVTVQPLDLELTITPEPTKEAEHS 72
PRK10263 PRK10263
DNA translocase FtsK; Provisional
206-704 3.33e-15

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 81.67  E-value: 3.33e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  206 PPGPSEQVGPSQFHLE----PETQNPETLEDIQSSSLQQEAPAQLPQLLEEEPssMQQEAPALPPESSMESLTLPNHEVS 281
Cdd:PRK10263  344 PPVASVDVPPAQPTVAwqpvPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEP--LQQPVQPQQPYYAPAAEQPAQQPYY 421
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  282 VQPPGEDQAYYHLPNITVKPA--------DVEVTITSEPTNETESSQAQ---QETPIQFPEEVEPSATQQEAP----IEP 346
Cdd:PRK10263  422 APAPEQPAQQPYYAPAPEQPVagnawqaeEQQSTFAPQSTYQTEQTYQQpaaQEPLYQQPQPVEQQPVVEPEPvveeTKP 501
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  347 PVPPMEHELSISEQQ-----------QPV-QPSESPREVESSPTQQETPGQPPEHHEVTVSP--PGHHQTHHLASPSVSV 412
Cdd:PRK10263  502 ARPPLYYFEEVEEKRarereqlaawyQPIpEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPlaSGVKKATLATGAAATV 581
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  413 KPPDVQLTIAAEPSAEVGTSLVHQ-------EATTRLSGSGNDVEPPAIQhggppLLPESSEEAGPLAVQQETSFQSPEP 485
Cdd:PRK10263  582 AAPVFSLANSGGPRPQVKEGIGPQlprpkriRVPTRRELASYGIKLPSQR-----AAEEKAREAQRNQYDSGDQYNDDEI 656
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  486 INNENPSPTQQEAAAEHPQTAEEGESSLTHQ----EAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDLG-FTITP----- 555
Cdd:PRK10263  657 DAMQQDELARQFAQTQQQRYGEQYQHDVPVNaedaDAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDdFEFSPmkall 736
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  556 -ESKTEVELSPT-MKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEpTTEVGHSTPPKR 633
Cdd:PRK10263  737 dDGPHEPLFTPIvEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ-YQQPQQPVAPQP 815
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867  634 TIVSPKHPEVTLPHPDQVQTQhshltrATVQPLDLgfTITPKSMTEVEPSTALMTTAPPPGHPEVTLPPSD 704
Cdd:PRK10263  816 QYQQPQQPVAPQPQYQQPQQP------VAPQPQDT--LLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSE 878
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
352-429 1.47e-14

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 70.08  E-value: 1.47e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867   352 EHELSISEQQQPVQPSESPREVEssptqqetpGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEV 429
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVV---------AQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEA 69
LRR_8 pfam13855
Leucine rich repeat;
892-952 2.78e-12

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 62.93  E-value: 2.78e-12
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 767994867   892 EKLILRENNLTELHKDSFEGLLSLQYLNLSCNVITELSFGTFqawHGMQFLHKLILNHNPL 952
Cdd:pfam13855    4 RSLDLSNNRLTSLDDGAFKGLSNLKVLDLSNNLLTTLSPGAF---SGLPSLRYLDLSGNRL 61
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
689-739 3.37e-12

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 63.15  E-value: 3.37e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 767994867   689 TAPPPGHPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam15779   23 VAQPPVHHEVTVPTPGQGQAQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
868-984 2.31e-11

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 68.04  E-value: 2.31e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  868 TILNFQGNYISYIDGNVWKAYSwTEKLILRENNLTELHkDSFEGLLSLQYLNLSCNVITELSfgtfQAWHGMQFLHKLIL 947
Cdd:COG4886   162 KSLDLSNNQLTDLPEELGNLTN-LKELDLSNNQITDLP-EPLGNLTNLEELDLSGNQLTDLP----EPLANLTNLETLDL 235
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 767994867  948 NHNPLTTVedPYLFKLPALKYLDMG----TTLVPLTTLKNI 984
Cdd:COG4886   236 SNNQLTDL--PELGNLTNLEELDLSnnqlTDLPPLANLTNL 274
PHA03247 PHA03247
large tegument protein UL36; Provisional
190-752 4.29e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.43  E-value: 4.29e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  190 PYPGSLPPELRVKS-------DEPPGPSEQVGPSQFHLEPETQNPETLEDIQSSSLQQEAPAQLPqlleeePSSMQQEAP 262
Cdd:PHA03247 2554 PLPPAAPPAAPDRSvppprpaPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLP------PDTHAPDPP 2627
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  263 alPPESSMESLTLPNHEVSVQPPGEDQAYYHLPNITVKPAdvEVTITSEPTNEteSSQAQQETPIQFPEEVEPSATQQEA 342
Cdd:PHA03247 2628 --PPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR--RARRLGRAAQA--SSPPQRPRRRAARPTVGSLTSLADP 2701
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  343 PIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQP-----PEHHEVTVSPPGHHQTHHLASPSVSVKPPDV 417
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  418 QLTIAAEPSAevgtslvhQEATTRLSGSGNDVEPPAIQHGGPPLLPESSEEAGPLAVQQetsfqSPEPINNENPSPTQQE 497
Cdd:PHA03247 2782 RLTRPAVASL--------SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPT-----SAQPTAPPPPPGPPPP 2848
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  498 AAAEHPQTAEEGESSlthQEAPAQTPefPNVVVAQP-PEHSHLTQATVQPLDLGFTITPESKtEVELSPTMKETPTQPPK 576
Cdd:PHA03247 2849 SLPLGGSVAPGGDVR---RRPPSRSP--AAKPAAPArPPVRRLARPAVSRSTESFALPPDQP-ERPPQPQAPPPPQPQPQ 2922
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  577 KVVPQLRVYQgvtNPTPGQDQAQHPVSPSVTVQLLDLGLTITPEPTTEVGHSTPPKRTIVSPKHPEVTLPHPDQVQTQHS 656
Cdd:PHA03247 2923 PPPPPQPQPP---PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  657 HLTRATVQPLDLGFTITPksmtevepstalmttAPPPGHPEVTL-PPSDKGQAQHSHLTQATVQPLDLEltiTTKPTTEV 735
Cdd:PHA03247 3000 SLSRVSSWASSLALHEET---------------DPPPVSLKQTLwPPDDTEDSDADSLFDSDSERSDLE---ALDPLPPE 3061
                         570
                  ....*....|....*..
gi 767994867  736 KPSPTTEETSTQPPDLG 752
Cdd:PHA03247 3062 PHDPFAHEPDPATPEAG 3078
LRRC37 pfam15779
Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, ...
508-565 1.24e-10

Leucine-rich repeat-containing protein 37 family; This domain family is found in eukaryotes, and is approximately 70 amino acids in length. The function of this protein is unknown but it is likely to be upregulated by androgen.


Pssm-ID: 434930 [Multi-domain]  Cd Length: 73  Bit Score: 58.91  E-value: 1.24e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767994867   508 EGESSLTHQEAPAQTPEFPNVVVAQPP---------------EHSHLTQATVQPLDLGFTITPESKTEVELSP 565
Cdd:pfam15779    1 EVEPSPTQQETPTQPPESPKEVVAQPPvhhevtvptpgqgqaQHPTLPNVTVQPLDLELTITPEPTKEAEHST 73
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
307-739 4.66e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.79  E-value: 4.66e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   307 TITSEPTNETES-SQAQQETPIQFPEEVEpsaTQQEAPIEPPVPPMEHELSISEQQQPVQPSESPREVE--SSPTQQETP 383
Cdd:pfam03154  147 SIPSPQDNESDSdSSAQQQILQTQPPVLQ---AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPatSQPPNQTQS 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   384 GQPPeHHEVTVSPPGHHQthHLASPsvsvKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVE--PPAIQHGGPP- 460
Cdd:pfam03154  224 TAAP-HTLIQQTPTLHPQ--RLPSP----HPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQtgPSHMQHPVPPq 296
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   461 ---LLPESSEEAGPLAVQQETSFQSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAQTPeFPNVVVAQPPEH- 536
Cdd:pfam03154  297 pfpLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTP-IPQLPNPQSHKHp 375
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   537 SHLTQATVQPLDLGFTITPESKTEVELSPTMKETPTQPPKKVVPQlrvyqgvtnptpGQDQAQHPVSPSVTVQLLDLGLT 616
Cdd:pfam03154  376 PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQ------------SQQLPPPPAQPPVLTQSQSLPPP 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   617 ITPEPTTEVGHSTPPKRTIvsPKHPEVTLPHPdqvqtqhshltraTVQPLDLGFTITPKSMTEVEP--STALMTTAPPPG 694
Cdd:pfam03154  444 AASHPPTSGLHQVPSQSPF--PQHPFVPGGPP-------------PITPPSGPPTSTSSAMPGIQPpsSASVSSSGPVPA 508
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 767994867   695 HPEVTLPPsdkgqaqhshlTQATVQPLDLELTITTKPTTEVKPSP 739
Cdd:pfam03154  509 AVSCPLPP-----------VQIKEEALDEAEEPESPPPPPRSPSP 542
rne PRK10811
ribonuclease E; Reviewed
223-593 1.66e-07

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 56.59  E-value: 1.66e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  223 ETQNPETLEDIQSSSLQQEAPAQLPQ-LLEEEPSSMQQEAPALPPEssmesltlpnHEVSVQPPGEDQAYYHLPNITVKP 301
Cdd:PRK10811  655 ESQQAEVTEKARTQDEQQQAPRRERQrRRNDEKRQAQQEAKALNVE----------EQSVQETEQEERVQQVQPRRKQRQ 724
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPV---PPMEHELSISEQQQ-----PVQPSESPREV 373
Cdd:PRK10811  725 LNQKVRIEQSVAEEAVAPVVEETVAAEPVVQEVPAPRTELVKVPLPVvaqTAPEQDEENNAENRdnngmPRRSRRSPRHL 804
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  374 ------------ESSPTQQETP----GQPPE--------HHEVTvsPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEV 429
Cdd:PRK10811  805 rvsgqrrrryrdERYPTQSPMPltvaCASPEmasgkvwiRYPVV--RPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPV 882
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  430 GTSLVHQEATTrlsgsgnDVEPPAIQHGGPPLLPESSEEAGPLAVQqetsfqspEPInNENPSPTQQEAAAEHPQTAEEG 509
Cdd:PRK10811  883 VSAPVVEAVAE-------VVEEPVVVAEPQPEEVVVVETTHPEVIA--------APV-TEQPQVITESDVAVAQEVAEHA 946
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  510 ESSLTHQEAPAQTPEFPNVVVAQPPEHSHLTQATVQPldlgfTITPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVT 589
Cdd:PRK10811  947 EPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPV-----VAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMT 1021

                  ....*
gi 767994867  590 N-PTP 593
Cdd:PRK10811 1022 RaPAP 1026
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
892-996 2.15e-07

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 55.32  E-value: 2.15e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  892 EKLILRENNLTELhKDSFEGLLSLQYLNLSCNVITEL--SFGTFQAwhgmqfLHKLILNHNPLTTVEDPyLFKLPALKYL 969
Cdd:COG4886   116 ESLDLSGNQLTDL-PEELANLTNLKELDLSNNQLTDLpePLGNLTN------LKSLDLSNNQLTDLPEE-LGNLTNLKEL 187
                          90       100
                  ....*....|....*....|....*..
gi 767994867  970 DMGTTlvPLTTLKNILMMTVELEKLIL 996
Cdd:COG4886   188 DLSNN--QITDLPEPLGNLTNLEELDL 212
PHA03247 PHA03247
large tegument protein UL36; Provisional
315-777 9.89e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 9.89e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  315 ETESSQAQQETPIqFPEEVEPSATQQEAPIEPPVPpmehelsiseqqQPVQPSESPRE----VESSPTQQETPGQPPEhh 390
Cdd:PHA03247 2542 ELASDDAGDPPPP-LPPAAPPAAPDRSVPPPRPAP------------RPSEPAVTSRArrpdAPPQSARPRAPVDDRG-- 2606
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  391 evtvSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVEPP--------AIQHGGPPLL 462
Cdd:PHA03247 2607 ----DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlgrAAQASSPPQR 2682
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  463 PEssEEAGPLAVQQETSFQSPEPINNE-NPSPTQQEAAAEHPQTAeegesslthQEAPAQTPEFPNVVVAQPPEHSHLTQ 541
Cdd:PHA03247 2683 PR--RRAARPTVGSLTSLADPPPPPPTpEPAPHALVSATPLPPGP---------AAARQASPALPAAPAPPAVPAGPATP 2751
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  542 ATVQPldlgftitpesktevelsPTMKETPTQPPKKVVPQLRvyqgvtnPTPGQDQAQHPVSPSVTVQLLDLGLTITPEP 621
Cdd:PHA03247 2752 GGPAR------------------PARPPTTAGPPAPAPPAAP-------AAGPPRRLTRPAVASLSESRESLPSPWDPAD 2806
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  622 TTEVghSTPPKRTIVSPKHPEVTLPHPDQVQTQHSHLTRATVQP-LDLGFTITPKS-MTEVEPSTALMTTAPPPGHPevt 699
Cdd:PHA03247 2807 PPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPsLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARP--- 2881
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867  700 lPPSDKGQAQHSHLTQATVQPLDlelTITTKPTTEVKPSPTTEETSTQPPDLGlaiiPEPTTETRHSTALEKTTAPRP 777
Cdd:PHA03247 2882 -PVRRLARPAVSRSTESFALPPD---QPERPPQPQAPPPPQPQPQPPPPPQPQ----PPPPPPPRPQPPLAPTTDPAG 2951
PHA03247 PHA03247
large tegument protein UL36; Provisional
68-535 2.37e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.02  E-value: 2.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   68 PRESPHAPTLPADPwdfDHLGPSASSEMPAPPQESTEnlvPFLDTWDSAGEQPLEPEQFLASqqdlkdklSPQERLPVSP 147
Cdd:PHA03247 2676 ASSPPQRPRRRAAR---PTVGSLTSLADPPPPPPTPE---PAPHALVSATPLPPGPAAARQA--------SPALPAAPAP 2741
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  148 KKLKKDPAqrwslaeIIGITRQLSTPQSqkqtlqneyssTDTPyPGSLPPELRVKSDEPPGPSEQVGPSQFHLE--PETQ 225
Cdd:PHA03247 2742 PAVPAGPA-------TPGGPARPARPPT-----------TAGP-PAPAPPAAPAAGPPRRLTRPAVASLSESREslPSPW 2802
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  226 NPETLEDIQSSSLQQEAPAQLPQLLEEEPSSMQQEAPALPPESSMESLTLpnhEVSVQPPGedqayyhlPNITVKPADVE 305
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL---GGSVAPGG--------DVRRRPPSRSP 2871
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  306 VTITSEPTNETESSQAQqetpiqfpeevePSATQQEAPIepPVPPmehelsiSEQQQPVQPSESPREVESSPTQQETPGQ 385
Cdd:PHA03247 2872 AAKPAAPARPPVRRLAR------------PAVSRSTESF--ALPP-------DQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  386 PPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTRLSGSgndvEPPAIQHGGPplLPES 465
Cdd:PHA03247 2931 PPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA----SSTPPLTGHS--LSRV 3004
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 767994867  466 SEEAGPLAVQQETsfqSPEPINNEN---PSPTQQEAAAEHPQTAEEGESSLthqEAPAQTPEFPNVVVAQPPE 535
Cdd:PHA03247 3005 SSWASSLALHEET---DPPPVSLKQtlwPPDDTEDSDADSLFDSDSERSDL---EALDPLPPEPHDPFAHEPD 3071
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
49-383 5.05e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 5.05e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867    49 LTSNPLGPPDSWSSHSSHfPRESPHAPtLPADPWDFdHLGPSaSSEMPAPPQestenlvPFLDTWDSAGEQ-PLEPEQFL 127
Cdd:pfam03154  249 LQPMTQPPPPSQVSPQPL-PQPSLHGQ-MPPMPHSL-QTGPS-HMQHPVPPQ-------PFPLTPQSSQSQvPPGPSPAA 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   128 ASQQDLKDKLSPQERLPVSPKKLKKDPAQRWSLAeIIGITRQLSTPQSQKQTLQneysSTDTPYPGSLPPELRVKSDEPP 207
Cdd:pfam03154  318 PGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLS-MPHIKPPPTTPIPQLPNPQ----SHKHPPHLSGPSPFQMNSNLPP 392
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   208 GPSEQVGPSQFHLEPETQNPETLEDIQSSSLQQEAPAQLPQLleeepssmqQEAPALPPESSMESLTLPNHEVSVQPPGE 287
Cdd:pfam03154  393 PPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL---------TQSQSLPPPAASHPPTSGLHQVPSQSPFP 463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   288 DQAYYHLPNITVKPAdvevtitSEPTNETESSQaqqeTPIQFPEEVEPSATQQ-EAPIEPPVPPME-HELSISEQQQPVQ 365
Cdd:pfam03154  464 QHPFVPGGPPPITPP-------SGPPTSTSSAM----PGIQPPSSASVSSSGPvPAAVSCPLPPVQiKEEALDEAEEPES 532
                          330
                   ....*....|....*...
gi 767994867   366 PSESPREVESSPTQQETP 383
Cdd:pfam03154  533 PPPPPRSPSPEPTVVNTP 550
LRR COG4886
Leucine-rich repeat (LRR) protein [Transcription];
900-996 6.61e-06

Leucine-rich repeat (LRR) protein [Transcription];


Pssm-ID: 443914 [Multi-domain]  Cd Length: 414  Bit Score: 50.70  E-value: 6.61e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  900 NLTEL---HKDSFEGLLSLQYLNLSCNVITELSFGTFQawhgMQFLHKLILNHNPLTTVEDPyLFKLPALKYLDMGTTlv 976
Cdd:COG4886    97 NLTELdlsGNEELSNLTNLESLDLSGNQLTDLPEELAN----LTNLKELDLSNNQLTDLPEP-LGNLTNLKSLDLSNN-- 169
                          90       100
                  ....*....|....*....|
gi 767994867  977 PLTTLKNILMMTVELEKLIL 996
Cdd:COG4886   170 QLTDLPEELGNLTNLKELDL 189
rne PRK10811
ribonuclease E; Reviewed
299-538 1.37e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 50.04  E-value: 1.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  299 VKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHElsiseqqqPVQPSESPREvesspt 378
Cdd:PRK10811  848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAE--------PQPEEVVVVE------ 913
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  379 qqetpgqppEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQEATTrlsgsgndveppaiqhgg 458
Cdd:PRK10811  914 ---------TTHPEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETA------------------ 966
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  459 PPLLPESSEEAGPLAVQQETsfqSPEPINNENPSPTQQEAAAEHPQTAEEGESSLTHQEAPAqtPEFpnvvVAQPPEHSH 538
Cdd:PRK10811  967 EVVVAEPEVVAQPAAPVVAE---VAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTRAPA--PEY----VPEAPRHSD 1037
LRR_8 pfam13855
Leucine rich repeat;
914-971 1.81e-05

Leucine rich repeat;


Pssm-ID: 404697 [Multi-domain]  Cd Length: 61  Bit Score: 43.67  E-value: 1.81e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867   914 SLQYLNLSCNVITELSFGTFQAWHGMQflhKLILNHNPLTTVEDPYLFKLPALKYLDM 971
Cdd:pfam13855    2 NLRSLDLSNNRLTSLDDGAFKGLSNLK---VLDLSNNLLTTLSPGAFSGLPSLRYLDL 56
rne PRK10811
ribonuclease E; Reviewed
215-418 4.43e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 48.50  E-value: 4.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  215 PSQFHLEPETQNPETLEDIQSSSLQQEAPAQlpqllEEEPSSMQQEAPALPPEssmesltlPNHEVSVQPPGEDQAYYHL 294
Cdd:PRK10811  850 PQDVQVEEQREAEEVQVQPVVAEVPVAAAVE-----PVVSAPVVEAVAEVVEE--------PVVVAEPQPEEVVVVETTH 916
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  295 PNITVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPpmehelsisEQQQPVQPSESPREVE 374
Cdd:PRK10811  917 PEVIAAPVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVA---------EPEVVAQPAAPVVAEV 987
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 767994867  375 SSPTQQETPGQPPEHHEVTVSPPGHHqtHHLASPSVSVKPPDVQ 418
Cdd:PRK10811  988 AAEVETVTAVEPEVAPAQVPEATVEH--NHATAPMTRAPAPEYV 1029
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
154-602 2.34e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 45.87  E-value: 2.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  154 PAQRWSLAEIIGITRQLSTPQSQKQTLQNEYSSTDTPYPGSLPPElrvksdEPPGPSEQVGPSQFHLEPETQNP-ETLED 232
Cdd:PRK14949  362 PVKRWQVDDPAEISLPEGQTPSALAAAVQAPHANEPQFVNAAPAE------KKTALTEQTTAQQQVQAANAEAVaEADAS 435
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  233 IQSSSLQQEAPAQLPQLLEEEPSSMQ---QEAPALPPESSMESLTLPNHEVSVQPPGEDQAYYHlPNITVKPADVEVTIT 309
Cdd:PRK14949  436 AEPADTVEQALDDESELLAALNAEQAvilSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVN-PSVTDTQVDDTSASN 514
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  310 SEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHElsISEQQQPVQPSESPREVESSPTQQETPGQPPEH 389
Cdd:PRK14949  515 NSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAF--SSESYNALSDDEQHSANVQSAQSAAEAQPSSQS 592
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  390 HEVTVSPPghhqthhlaSPSVSVKPPDV-QLTIAAEPS--AEVGTSLVHQEATTRLSGSGNDVEPPAIQHGGPPLLPESS 466
Cdd:PRK14949  593 LSPISAVT---------TAAASLADDDIlDAVLAARDSllSDLDALSPKEGDGKKSSADRKPKTPPSRAPPASLSKPASS 663
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  467 EEAGPLAVQQETSFQS-PEPINNENPSPTQ-QEAAAEHPQT----------AEEGESSLTHQEAPAQTPEFPNVV----- 529
Cdd:PRK14949  664 PDASQTSASFDLDPDFeLATHQSVPEAALAsGSAPAPPPVPdpydrppweeAPEVASANDGPNNAAEGNLSESVEdasns 743
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 767994867  530 ----VAQPPEHSHLTQATVQPldlgftitpesktevelsPTMKETPTQPPKKVVPQlrvyQGVTNPTPGQDQAQHPV 602
Cdd:PRK14949  744 elqaVEQQATHQPQVQAEAQS------------------PASTTALTQTSSEVQDT----ELNLVLLSSGSITGHPL 798
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
190-468 3.63e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 45.41  E-value: 3.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   190 PYPGSLPPELRVKSDEPPGPSEQVGPSQFHLEP-----ET-QNPETLEDIQ-SSSLQQEAPAQLPQLLEEEPSSMQQEAP 262
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKPvrtgyEKyKEPEPIPDLQvDASLWGVAPKKAAAPAPAPQPAAQPASL 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   263 ALPPESSMeSLtlpnHEVsvqppgEDQAYYHLPNITVKPADVEVTITSEPTnetessQAQQETPIQFPEEVEPSATQQEA 342
Cdd:pfam09770  187 PAPSRKMM-SL----EEV------EAAMRAQAKKPAQQPAPAPAQPPAAPP------AQQAQQQQQFPPQIQQQQQPQQQ 249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   343 PIEPPvPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPehhevtvsPPGHHQTHHLASPSVsvkPPDVQLTIA 422
Cdd:pfam09770  250 PQQPQ-QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPP--------PVPVQPTQILQNPNR---LSAARVGYP 317
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 767994867   423 AEPSAEVGTSLVHQEATTRLSGSGNdvePPAIQHggPPLLPESSEE 468
Cdd:pfam09770  318 QNPQPGVQPAPAHQAHRQQGSFGRQ---APIITH--PQQLAQLSEE 358
PHA03377 PHA03377
EBNA-3C; Provisional
295-748 4.79e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 45.04  E-value: 4.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  295 PNITVKPADVEVTITSEPTNETESSQAqqetpiqFPEEVEPSaTQQEAPIEPPVP-PMEHELSISEQQQPV--------- 364
Cdd:PHA03377  422 PTPKTHPVKRTLVKTSGRSDEAEQAQS-------TPERPGPS-DQPSVPVEPAHLtPVEHTTVILHQPPQSpptvaikpa 493
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  365 -QPSESPR--------------EVESS--PTQQETPGQPPEHHEVTVSPPGHHQTHHL---ASPSVSVKP--------PD 416
Cdd:PHA03377  494 pPPSRRRRgacvvydddiieviDVETTeeEESVTQPAKPHRKVQDGFQRSGRRQKRATppkVSPSDRGPPkasppvmaPP 573
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  417 VQLTIAAEPSAEVGTSLVHQEATTRLSGSGNDVEPPAIQHGGPPLLPESSEEAGPLAVQQETSFQSPEPINNENPSPTQQ 496
Cdd:PHA03377  574 STGPRVMATPSTGPRDMAPPSTGPRQQAKCKDGPPASGPHEKQPPSSAPRDMAPSVVRMFLRERLLEQSTGPKPKSFWEM 653
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  497 EAAAEHPQTAEEGESSL--THQEAPAQTPEFPNVVV--------AQPPEHSHL-----TQATVQPLDLGFTiTPESKTEV 561
Cdd:PHA03377  654 RAGRDGSGIQQEPSSRRqpATQSTPPRPSWLPSVFVlpsvdagrAQPSEESHLssmspTQPISHEEQPRYE-DPDDPLDL 732
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  562 ELSPtmkETPTQPPKKvvpqlrvyqgvtNPTPGQDQAQHPVSPSVTVQlldlglTITPEPTTEVGHSTPPKRTIVSPKHP 641
Cdd:PHA03377  733 SLHP---DQAPPPSHQ------------APYSGHEEPQAQQAPYPGYW------EPRPPQAPYLGYQEPQAQGVQVSSYP 791
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  642 EVTLPHPDQVQTQ-HSHLTRATVQPLDLGFTITPKSMTEVEPSTALMTTA---------PPPGHPEVTLPPSDKGQAQHS 711
Cdd:PHA03377  792 GYAGPWGLRAQHPrYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDGSAghgqdqvsqFPHLQSETGPPRLQLSQVPQL 871
                         490       500       510
                  ....*....|....*....|....*....|....*..
gi 767994867  712 HLTQATVQPLDLELTiTTKPTTEVKPSPTTEETSTQP 748
Cdd:PHA03377  872 PYSQTLVSSSAPSWS-SPQPRAPIRPIPTRFPPPPMP 907
PHA03378 PHA03378
EBNA-3B; Provisional
171-646 4.90e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.06  E-value: 4.90e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  171 STPQSQKQT--LQNEYSSTDTPYPGSLPPELRVKSdEPPGPSEQVGPSQFHLEP--ETQNPETLEDIQSSSLQQEAPAQL 246
Cdd:PHA03378  443 ATPHSQAPTvvLHRPPTQPLEGPTGPLSVQAPLEP-WQPLPHPQVTPVILHQPPaqGVQAHGSMLDLLEKDDEDMEQRVM 521
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  247 PQLLEEEPSsmQQEAPALPPESSMESLTLPNHEVSVQPPGEDQAyyhLPNITVKPADVEvTITSEPTNETES---SQAQQ 323
Cdd:PHA03378  522 ATLLPPSPP--QPRAGRRAPCVYTEDLDIESDEPASTEPVHDQL---LPAPGLGPLQIQ-PLTSPTTSQLASsapSYAQT 595
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  324 ETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPEHHEVTVSPPGH---- 399
Cdd:PHA03378  596 PWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHipyq 675
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  400 -HQTHHLASPSVS-----VKPPDVQLTIAAEPSAEVGTSLVHQEATTRL---SGSGNDVEPPAiqhGGPPLLPESSEEAG 470
Cdd:PHA03378  676 pSPTGANTMLPIQwapgtMQPPPRAPTPMRPPAAPPGRAQRPAAATGRArppAAAPGRARPPA---AAPGRARPPAAAPG 752
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  471 PLAVQQETSFQSPEPINNEN-PSPTQQEAAAEHPQTAEEGESslTHQEAPAQTPEFPNVVVAQPPEHSHLTQATVQPLDL 549
Cdd:PHA03378  753 RARPPAAAPGRARPPAAAPGaPTPQPPPQAPPAPQQRPRGAP--TPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLT 830
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  550 GFTIT--PESKTEVELSPTMKETPTQPPK-----KVVPQLRVYQGVTNPT--PGQDQAQHPVSPSvtvqlldlglTITPE 620
Cdd:PHA03378  831 GGVKRgrPSLKKPAALERQAAAGPTPSPGsgtsdKIVQAPVFYPPVLQPIqvMRQLGSVRAAAAS----------TVTQA 900
                         490       500
                  ....*....|....*....|....*.
gi 767994867  621 PTTEVGhstppKRTIVSPKHPEVTLP 646
Cdd:PHA03378  901 PTEYTG-----ERRGVGPMHPTDIPP 921
PRK14960 PRK14960
DNA polymerase III subunit gamma/tau;
301-510 1.07e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237868 [Multi-domain]  Cd Length: 702  Bit Score: 43.88  E-value: 1.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  301 PADVEVTITSEPTNETE---SSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPvQPSESPrEVESSP 377
Cdd:PRK14960  363 PNEILVSEPVQQNGQAEvglNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEP-EPEPEP-EPEPEP 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  378 tqqetpgQPPEHHEVTVSPPGHHQTHHLASPSVSvkppdVQLTIAAEPSAEVGTS-LVHQEATTRLsgsgNDVEPPAIQH 456
Cdd:PRK14960  441 -------EPQPNQDLMVFDPNHHELIGLESAVVQ-----ETVSVLEEDFIPVPEQkLVQVQAETQV----KQIEPEPAST 504
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867  457 GGPPLLPESSEEAGPLAVQQETSFQSPEPINNENP----SPTQQEAAAEHPQTAEEGE 510
Cdd:PRK14960  505 AEPIGLFEASSAEFSLAQDTSAYDLVSEPVIEQQSlvqaEIVETVAVVKEPNATDNSQ 562
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
302-604 1.17e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 43.88  E-value: 1.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  302 ADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQQQPVQ--PSESPREVESSPTQ 379
Cdd:COG5665   240 PSLLATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTAKAQPQPPTKKQPAKepPSDTASGNPSAPSV 319
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  380 QETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPdvqltiaAEPSAEVgTSLVHQEATTRLSGSgndVEPPAIQHGGP 459
Cdd:COG5665   320 LINSDSPTSEDPATASVPTTEETTAFTTPSSVPSTP-------AEKDTPA-TDLATPVSPTPPETS---VDKKVSPDSAT 388
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  460 PLLPESSEEAGPLAV-QQETSFQSPEPINNENPSPTQQEAAAEHPQTAEegesSLTHQEAPAQTPEFPNVVVAQPPEHSH 538
Cdd:COG5665   389 SSTKSEKEGGTASSPmPPNIAIGAKDDVDATDPSQEAKEYTKNAPMTPE----ADSAPESSVRTEASPSAGSDLEPENTT 464
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 767994867  539 LTQAtvqpldlgftiTPESKTEVELSPTMKETPTQPPKKVVPQLRVYQGVTNPTPGQDQAQHPVSP 604
Cdd:COG5665   465 LRDP-----------APNAIPPPEDPSTIGRLSSGDKLANETGPPVIRRDSTPSSTADQSIVGVLA 519
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
181-414 1.77e-03

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 42.37  E-value: 1.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   181 QNEYSSTDTPYPgslppELRVKSDEPPGPSEQVGPSQFHLEPETQNPETLedIQSSSLQQEAPAQLPQLLEEEPSSMQQE 260
Cdd:TIGR02223    3 QRDYVRRGRGAP-----QKKKKNRRLVRATVLIAAILILLFIGGSSGLYL--LTESKQANEPETLQPKNQTENGETAADL 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   261 APAlPPESSMESLTLPNHEVSVQPPGEDQAyyhlpnitVKPADVEVTITSEPTNETESSQAQQETPIQFPEEVEPSATQQ 340
Cdd:TIGR02223   76 PPK-PEERWSYIEELEAREVLINDPEEPSN--------GGGVEESAQLTAEQRQLLEQMQADMRAAEKVLATAPSEQTVA 146
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 767994867   341 EAPIEPPVPPMEHELSISEQQQ-PVQPSESPREVESSPTQQETPGQPPEHHEVTVSPpghHQTHHLASPSVSVKP 414
Cdd:TIGR02223  147 VEARKQTAEKKPQKARTAEAQKtPVETEKIASKVKEAKQKQKALPKQTAETQSNSKP---IETAPKADKADKTKP 218
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
471-777 4.13e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 4.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   471 PLAVQQETSFQSPEpinNENPSPTQQEAAAEHPQTAEEGESSLThqeaPAQTPEfPNVVVAQPPEHSHLTQATVQPLDLG 550
Cdd:pfam05109  449 PSSTHVPTNLTAPA---STGPTVSTADVTSPTPAGTTSGASPVT----PSPSPR-DNGTESKAPDMTSPTSAVTTPTPNA 520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   551 FTITPESKTEV--ELSPTM-KETPTQPPKKVVPQLRV-YQGVTNPTPGQD-QAQHPVSPSVTVQlldlglTITPEPTT-E 624
Cdd:pfam05109  521 TSPTPAVTTPTpnATSPTLgKTSPTSAVTTPTPNATSpTPAVTTPTPNATiPTLGKTSPTSAVT------TPTPNATSpT 594
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   625 VGHSTPPKRT-------------IVSPKHPEVTLPHPDQVQTQHSHLTRATVQPLDLGFTITPKSMTEVEPSTALMTTAP 691
Cdd:pfam05109  595 VGETSPQANTtnhtlggtsstpvVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAH 674
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   692 PPG---------------HPEVTLP---PSDKGQAQHSHLTQATVQPLDLELTITTKPTTEVKPSPTTEETSTQPPDLGL 753
Cdd:pfam05109  675 PTGgenitqvtpaststhHVSTSSPaprPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKTAVPTVTST 754
                          330       340
                   ....*....|....*....|....
gi 767994867   754 AIIPEPTTETRHSTALEKTTAPRP 777
Cdd:pfam05109  755 GGKANSTTGGKHTTGHGARTSTEP 778
PRK10263 PRK10263
DNA translocase FtsK; Provisional
260-799 4.21e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 4.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  260 EAPALPPESSMESLTLPNHEVSVQPPGEDqaYYHLPNitvkPADVEVTITSEPTNETESSQAQQetpiqfPEEVEPSATQ 339
Cdd:PRK10263  331 QSWAAPVEPVTQTPPVASVDVPPAQPTVA--WQPVPG----PQTGEPVIAPAPEGYPQQSQYAQ------PAVQYNEPLQ 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  340 QEAPIEPPVPPMEHELSISEQQQPVQPSESPREVESSPTQQETPGQPPEHHEVTVSPPGHHQTHHLASPSVSVKPPDVQl 419
Cdd:PRK10263  399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPL- 477
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  420 tiaaepsaEVGTSLVHQEATTRLSGSGNDVEPPAiqhggPPLLP-ESSEEAGPLAVQQETSFQSPEPINNENPSPTQQEA 498
Cdd:PRK10263  478 --------YQQPQPVEQQPVVEPEPVVEETKPAR-----PPLYYfEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSL 544
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  499 AAEHPQTAEEGESslthqeAPAQTPEFPNvvVAQPPEHSHLTQATVQP-LDLGFTITPESKTEVELSPTMKetptQPPKK 577
Cdd:PRK10263  545 KAPSVAAVPPVEA------AAAVSPLASG--VKKATLATGAAATVAAPvFSLANSGGPRPQVKEGIGPQLP----RPKRI 612
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  578 VVPQLRVYQGVTNPTPGQDQAQHPVSPSVTVQlLDLGLTITPEPTTEVGHSTPPKRTIVSPKH---PEVTLPHPDQV--- 651
Cdd:PRK10263  613 RVPTRRELASYGIKLPSQRAAEEKAREAQRNQ-YDSGDQYNDDEIDAMQQDELARQFAQTQQQrygEQYQHDVPVNAeda 691
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  652 ----------------QTQHSHLTRATVQPL---DLGFT--------------ITPKSMTEVEPSTALMTTAPPPGHPEV 698
Cdd:PRK10263  692 daaaeaelarqfaqtqQQRYSGEQPAGANPFsldDFEFSpmkallddgpheplFTPIVEPVQQPQQPVAPQQQYQQPQQP 771
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  699 TLPPSDKGQAQHSHLTQAtvQPLDLELTITTKPTTEVKPSPTTEETSTQPPDLGLA-----------IIPEPTTETRH-- 765
Cdd:PRK10263  772 VAPQPQYQQPQQPVAPQP--QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVApqpqyqqpqqpVAPQPQDTLLHpl 849
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|
gi 767994867  766 ------STALEKTTAPRPdrvqtlhrSLTEVTGPPTELEP 799
Cdd:PRK10263  850 lmrngdSRPLHKPTTPLP--------SLDLLTPPPSEVEP 881
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
174-458 4.81e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.83  E-value: 4.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   174 QSQKQTLQNEYSSTDTPYPGSLPPELRVKSDEPPGPSEQVGPSQFHLEPETQNPEtlediqssslqqeAPAQLPQLLEEE 253
Cdd:pfam05109  500 ESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPN-------------ATSPTPAVTTPT 566
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   254 PSSMqqeAPALPPESSMESLTLPNHEVSVQPPGEDQAYYHLPNITVKPADVEVTITSEPTNETESSQAQQEtpiQFPEEV 333
Cdd:pfam05109  567 PNAT---IPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQH---NITSSS 640
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   334 EPSATQQEAPIEPPVPPMEHELSISeqQQPVQPSESPREVESspTQQETPGQPPEHHEVTVSP-PGHHQTHHLASP---S 409
Cdd:pfam05109  641 TSSMSLRPSSISETLSPSTSDNSTS--HMPLLTSAHPTGGEN--ITQVTPASTSTHHVSTSSPaPRPGTTSQASGPgnsS 716
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 767994867   410 VSVKPPDVQLTIAAEPsaevgtslvhQEATTRLSGSGNDVEPPAIQHGG 458
Cdd:pfam05109  717 TSTKPGEVNVTKGTPP----------KNATSPQAPSGQKTAVPTVTSTG 755
PHA03369 PHA03369
capsid maturational protease; Provisional
221-392 5.89e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 41.52  E-value: 5.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  221 EPETQNPETLEDIQSSSLQQEApaqlpqlLEEEPSSMQQEAPALPPESSMESLTLPNHEVSVQPPGED-----QAYYHLP 295
Cdd:PHA03369  491 EQESLAKELEATAHKSEIKKIA-------ESEFKNAGAKTAAANIEPNCSADAAAPATKRARPETKTEleavvRFPYQIR 563
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  296 NITVKPADVEVTITSEPTNETESSQAQqETPIQFPEEVEPSATQQEAPIEPPVPPMEHELSISEQ-QQPVQPSESPREVE 374
Cdd:PHA03369  564 NMESPAFVHSFTSTTLAAAAGQGSDTA-EALAGAIETLLTQASAQPAGLSLPAPAVPVNASTPAStPPPLAPQEPPQPGT 642
                         170
                  ....*....|....*...
gi 767994867  375 SSPTqqeTPGQPPEHHEV 392
Cdd:PHA03369  643 SAPS---LETSLPQQKPV 657
LRR_4 pfam12799
Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a ...
892-930 7.03e-03

Leucine Rich repeats (2 copies); Leucine rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in protein-protein interactions. Each Leucine Rich Repeat is composed of a beta-alpha unit. These units form elongated non-globular structures. Leucine Rich Repeats are often flanked by cysteine rich domains.


Pssm-ID: 463713 [Multi-domain]  Cd Length: 44  Bit Score: 36.07  E-value: 7.03e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 767994867   892 EKLILRENNLTELhkDSFEGLLSLQYLNLS-CNVITELSF 930
Cdd:pfam12799    4 EVLDLSNNQITDI--PPLAKLPNLETLDLSgNNKITDLSD 41
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
618-831 7.34e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 40.80  E-value: 7.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   618 TPEPTTEvghSTPPKRTIVSPKhpevTLPHPDQVQTQHSHLTRATVQPLDLGFTITPKS---MTEVEPSTALMTTAPPpg 694
Cdd:pfam05539  178 TSWPTEV---SHPTYPSQVTPQ----SQPATQGHQTATANQRLSSTEPVGTQGTTTSSNpepQTEPPPSQRGPSGSPQ-- 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   695 HPEVTLPPSDKGQAQHSHLTQATVQPLDLELTITTKPTTevKPSPTTEETSTQPPdlglaiIPEPTTETRHSTALEKTTA 774
Cdd:pfam05539  249 HPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTA--TPPPTTKRQETGRP------TPRPTATTQSGSSPPHSSP 320
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   775 PRPDRVQTlhrslTEVTGPPTELEPAQDSLVQSESYTQNKALTAP---EEHKASTSTNIC 831
Cdd:pfam05539  321 PGVQANPT-----TQNLVDCKELDPPKPNSICYGVGIYNEALPRGcdiVVPLCSTYTIMC 375
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
310-523 8.94e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 40.83  E-value: 8.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   310 SEPTNEtESSQAQQETPI-QFPEEVEP----------SATQQEAPIE--PPVPPMEHELSISEQQQPVQPSESPREVESS 376
Cdd:pfam03546   24 SESSSE-EESDSEEETPAaKTPLQAKPsgktpqvraaSAPAKESPRKgaPPVPPGKTGPAAAQAQAGKPEEDSESSSEES 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867   377 PTQQETP-GQPPEHHEVTVSPPGhhQTHHLASPSVSVKPPDVQLTIAAEPSAEVGTSLVHQ--------EATTRLSGSGN 447
Cdd:pfam03546  103 DSDGETPaAATLTTSPAQVKPLG--KNSQVRPASTVGKGPSGKGANPAPPGKAGSAAPLVQvgkkeedsESSSEESDSEG 180
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 767994867   448 DVEPPAIQHGGPPLLPESSEEAGPlavqqeTSFQSPEPINNENPSPTQ--QEAAAEHPQTAEEgeSSLTHQEAPAQTP 523
Cdd:pfam03546  181 EAPPAATQAKPSGKILQVRPASGP------AKGAAPAPPQKAGPVATQvkAERSKEDSESSEE--SSDSEEEAPAAAT 250
PRK10263 PRK10263
DNA translocase FtsK; Provisional
438-703 9.53e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 40.84  E-value: 9.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  438 ATTRLSGSGNDVEPpaIQHGGPPLLPESSEEAGPLAVQQETSFQSPEPINNENPsptqqEAAAEHPQTAEEGESSLTHQE 517
Cdd:PRK10263  326 ATTATQSWAAPVEP--VTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAP-----EGYPQQSQYAQPAVQYNEPLQ 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  518 APAQTPEFPNVVVAQPPEHSHLTQATVQPLDLGFTITPESKTEVELSPTMKEtPTQPPKKVVPQLRVYQGVTNPTPG--Q 595
Cdd:PRK10263  399 QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAE-EQQSTFAPQSTYQTEQTYQQPAAQepL 477
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 767994867  596 DQAQHPVSPSVTVQlldlgltitPEPTTEvghSTPPKRTivspkhPEVTLPHPDQVQTQHSHLTRATVQPLdlgftitPK 675
Cdd:PRK10263  478 YQQPQPVEQQPVVE---------PEPVVE---ETKPARP------PLYYFEEVEEKRAREREQLAAWYQPI-------PE 532
                         250       260
                  ....*....|....*....|....*...
gi 767994867  676 SMTEVEPSTALMTTAPPPGHPEVTLPPS 703
Cdd:PRK10263  533 PVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH