| Name | Accession | Description | Interval | E-value |
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 402-1016 | 1.99e-23 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 106.32 E-value: 1.99e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 402 ASPATLAGATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQEL--KEKEKLDEkikepikeaseeplpmeteeedpkeep 479
Cdd:COG5104 3 AALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPKELlkGSEEDLDV--------------------------- 55
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 480 vkeikeepkeeemteeekaaqkakpvattpipgTPWCVVWTGDERVFFYNPTTRLSMWDRPDDligRADVDKIIQEpphK 559
Cdd:COG5104 56 ---------------------------------DPWKECRTADGKVYYYNSITRESRWKIPPE---RKKVEPIAEQ---K 96
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 560 KGLEDMKKLRHPAPTMLSIQKWQFSmsaiKEEQELMEemnedepikakkrkrmskksfmwiaraslfrrddnkdidsekE 639
Cdd:COG5104 97 HDERSMIGGNGNDMAITDHETSEPK----YLLGRLMS------------------------------------------Q 130
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 640 AAMEAEIKAARERAivpLEARMKQFKDMLLERGVSAFSTWEKELHKIVfDPRYLLL--NPKERKQVFDQYVKTRAEEERR 717
Cdd:COG5104 131 YGITSTKDAVYRLT---KEEAEKEFITMLKENQVDSTWPIFRAIEELR-DPRYWMVdtDPLWRKDLFKKYFENQEKDQRE 206
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 718 EKKNKIMQAKEDFKKMME-EAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSD 796
Cdd:COG5104 207 EEENKQRKYINEFCKMLAgNSHIKYYTDWFTFKSIFSKHPYYSSVVNEKTKRQTFQKYKDKLGCYEKYVGKHMGGTALGR 286
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 797 FFELLSNHHLDSQSRWSKVKDKVESDPRYKAvdSSSM----REDLFKQYIeKIAKNLdsekekelerqarieaslrerER 872
Cdd:COG5104 287 LEEVLRSLGSETFIIWLLNHYVFDSVVRYLK--NKEMkpldRKDILFSFI-RYVRRL---------------------EK 342
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 873 EVQKARSEQTKEIDReREQHKREeaiqNFKALLSDMVRSSDVS----WSDTRRTLRKDHRWESGSLLEREEKEKLFNEHI 948
Cdd:COG5104 343 ELLSAIEERKAAAAQ-NARHHRD----EFRTLLRKLYSEGKIYyrmkWKNAYPLIKDDPRFLNLLGRTGSSPLDLFFDFI 417
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 575501595 949 EALTKKKREHFRQLLDETSaITLTSTW--KEVKKIIKEDPRciKFSSSDRKKQREFEE---YIRDKYITAKAD 1016
Cdd:COG5104 418 VDLENMYGFARRSYERETR-TGQISPTdrRAVDEIFEAIAE--KKEEGEIKFDKVDKEdisLIVDGLIKQRNE 487
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 792-841 | 1.36e-13 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 65.94 E-value: 1.36e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 575501595 792 KIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQY 841
Cdd:pfam01846 1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 953-1008 | 9.29e-10 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 55.27 E-value: 9.29e-10
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 575501595 953 KKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCiKFSSSDRKKQREFEEYIRD 1008
Cdd:smart00441 1 EEAKEAFKELLKEHEVITPDTTWSEARKKLKNDPRY-KALLSESEREQLFEDHIEE 55
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 414-443 | 1.04e-08 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 51.76 E-value: 1.04e-08
10 20 30
....*....|....*....|....*....|
gi 575501595 414 SEWTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:cd00201 2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| WW | pfam00397 | WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... | 137-162 | 6.42e-08 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 49.43 E-value: 6.42e-08
10 20
....*....|....*....|....*.
gi 575501595 137 WVENKTPDGKVYYYNARTRESAWTKP 162
Cdd:pfam00397 5 WEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| WW | smart00456 | Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ... | 132-164 | 6.50e-08 |
|
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Pssm-ID: 197736 [Multi-domain] Cd Length: 33 Bit Score: 49.52 E-value: 6.50e-08
10 20 30
....*....|....*....|....*....|...
gi 575501595 132 PTEEIWVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:smart00456 1 PLPPGWEERKDPDGRPYYYNHETKETQWEKPRE 33
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 137-164 | 1.71e-07 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 48.29 E-value: 1.71e-07
10 20
....*....|....*....|....*...
gi 575501595 137 WVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:cd00201 4 WEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 124-173 | 1.29e-05 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 49.31 E-value: 1.29e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 575501595 124 APGAPALPPTEEIWVENKTPDGKVYYYNARTRESAWTKPDgvKVIQQSEL 173
Cdd:COG5104 4 ALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPK--ELLKGSEE 51
|
|
| PTZ00121 | PTZ00121 | MAEBL; Provisional | 708-1048 | 1.68e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 49.37 E-value: 1.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 708 VKTRAEEERR--EKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREAlfNEFVAAARKKEK 783
Cdd:PTZ00121 1423 AKKKAEEKKKadEAKKKAEEAKkaDEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA--KKKAEEAKKKAD 1500
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 784 EDSKTRGEKIKSDffELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKEKELERQARI 863
Cdd:PTZ00121 1501 EAKKAAEAKKKAD--EAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 864 EASLREREREVQKARSEQTKEIDREREQHKREEAiqnfKALLSDMVRSSDVSWSDTRRtlRKDHRWESGSLLEREEKEKL 943
Cdd:PTZ00121 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA----KKAEEAKIKAEELKKAEEEK--KKVEQLKKKEAEEKKKAEEL 1652
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 944 FNEHIEALTKKKREHFRQLLDETSAitltstwKEVKKIIKEDPRCIKFSSSDRKKQREFEEyIRDKYITAKADFRTLLKE 1023
Cdd:PTZ00121 1653 KKAEEENKIKAAEEAKKAEEDKKKA-------EEAKKAEEDEKKAAEALKKEAEEAKKAEE-LKKKEAEEKKKAEELKKA 1724
|
330 340
....*....|....*....|....*
gi 575501595 1024 TKFITYRSKKLIQESDQHLKDVEKI 1048
Cdd:PTZ00121 1725 EEENKIKAEEAKKEAEEDKKKAEEA 1749
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 258-346 | 1.51e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 1.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 258 VGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTtatsvAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTaTPVQTVPQP 337
Cdd:pfam05109 513 VTTPTPNATSPTPAVTTPTPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKT-SPTSAVTTP 586
|
....*....
gi 575501595 338 HPQTLPPAV 346
Cdd:pfam05109 587 TPNATSPTV 595
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 1013-1072 | 2.01e-04 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 40.13 E-value: 2.01e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 1013 AKADFRTLLKETKfITYRSkkliqesdqHLKDVEKILQNDKRYLVLDcVPEERRKLIVAY 1072
Cdd:pfam01846 2 AREAFKELLKEHK-ITPYS---------TWSEIKKKIENDPRYKALL-DGSEREELFEDY 50
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 1011-1075 | 2.10e-04 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 40.25 E-value: 2.10e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 575501595 1011 ITAKADFRTLLKETKFITYrskkliqesDQHLKDVEKILQNDKRYLVLDcVPEERRKLIVAYVDD 1075
Cdd:smart00441 1 EEAKEAFKELLKEHEVITP---------DTTWSEARKKLKNDPRYKALL-SESEREQLFEDHIEE 55
|
|
| half-pint | TIGR01645 | poly-U binding splicing factor, half-pint family; The proteins represented by this model ... | 299-463 | 4.44e-04 |
|
poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Pssm-ID: 130706 [Multi-domain] Cd Length: 612 Bit Score: 44.29 E-value: 4.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 299 STPTTQDQTPSSAVSVATPTVSVSapaPTATPVQTVPQPHPqTLPPAVPHSVPQPAAAIPAFppvmVPPFRVPLPGMPIP 378
Cdd:TIGR01645 322 AVLGPRAQSPATPSSSLPTDIGNK---AVVSSAKKEAEEVP-PLPQAAPAVVKPGPMEIPTP----VPPPGLAIPSLVAP 393
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 379 LPGVLPGMAPPIV------PMIHPQVAIAASP--ATLAGATAVSEwtEYKTADGKTYYYNNRTLESTWEKPQElKEKEKL 450
Cdd:TIGR01645 394 PGLVAPTEINPSFlasprkKMKREKLPVTFGAldDTLAWKEPSKE--DQTSEDGKMLAIMGEAAAALALEPKK-KKKEKE 470
|
170
....*....|...
gi 575501595 451 DEKIKEPIKEASE 463
Cdd:TIGR01645 471 GEELQPKLVMNSE 483
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 260-424 | 8.36e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.31 E-value: 8.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHP 339
Cdd:PHA03247 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 340 QTLPPAVPHSVPQPAAAIpafppvmvPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQVAIAASPATLAGATAVSEWTEY 419
Cdd:PHA03247 2783 LTRPAVASLSESRESLPS--------PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
....*
gi 575501595 420 KTADG 424
Cdd:PHA03247 2855 SVAPG 2859
|
|
|
| Name | Accession | Description | Interval | E-value |
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 402-1016 | 1.99e-23 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 106.32 E-value: 1.99e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 402 ASPATLAGATAVSEWTEYKTADGKTYYYNNRTLESTWEKPQEL--KEKEKLDEkikepikeaseeplpmeteeedpkeep 479
Cdd:COG5104 3 AALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPKELlkGSEEDLDV--------------------------- 55
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 480 vkeikeepkeeemteeekaaqkakpvattpipgTPWCVVWTGDERVFFYNPTTRLSMWDRPDDligRADVDKIIQEpphK 559
Cdd:COG5104 56 ---------------------------------DPWKECRTADGKVYYYNSITRESRWKIPPE---RKKVEPIAEQ---K 96
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 560 KGLEDMKKLRHPAPTMLSIQKWQFSmsaiKEEQELMEemnedepikakkrkrmskksfmwiaraslfrrddnkdidsekE 639
Cdd:COG5104 97 HDERSMIGGNGNDMAITDHETSEPK----YLLGRLMS------------------------------------------Q 130
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 640 AAMEAEIKAARERAivpLEARMKQFKDMLLERGVSAFSTWEKELHKIVfDPRYLLL--NPKERKQVFDQYVKTRAEEERR 717
Cdd:COG5104 131 YGITSTKDAVYRLT---KEEAEKEFITMLKENQVDSTWPIFRAIEELR-DPRYWMVdtDPLWRKDLFKKYFENQEKDQRE 206
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 718 EKKNKIMQAKEDFKKMME-EAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDSKTRGEKIKSD 796
Cdd:COG5104 207 EEENKQRKYINEFCKMLAgNSHIKYYTDWFTFKSIFSKHPYYSSVVNEKTKRQTFQKYKDKLGCYEKYVGKHMGGTALGR 286
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 797 FFELLSNHHLDSQSRWSKVKDKVESDPRYKAvdSSSM----REDLFKQYIeKIAKNLdsekekelerqarieaslrerER 872
Cdd:COG5104 287 LEEVLRSLGSETFIIWLLNHYVFDSVVRYLK--NKEMkpldRKDILFSFI-RYVRRL---------------------EK 342
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 873 EVQKARSEQTKEIDReREQHKREeaiqNFKALLSDMVRSSDVS----WSDTRRTLRKDHRWESGSLLEREEKEKLFNEHI 948
Cdd:COG5104 343 ELLSAIEERKAAAAQ-NARHHRD----EFRTLLRKLYSEGKIYyrmkWKNAYPLIKDDPRFLNLLGRTGSSPLDLFFDFI 417
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 575501595 949 EALTKKKREHFRQLLDETSaITLTSTW--KEVKKIIKEDPRciKFSSSDRKKQREFEE---YIRDKYITAKAD 1016
Cdd:COG5104 418 VDLENMYGFARRSYERETR-TGQISPTdrRAVDEIFEAIAE--KKEEGEIKFDKVDKEdisLIVDGLIKQRNE 487
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 792-841 | 1.36e-13 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 65.94 E-value: 1.36e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 575501595 792 KIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQY 841
Cdd:pfam01846 1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 725-774 | 1.39e-11 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 60.16 E-value: 1.39e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 575501595 725 QAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEF 774
Cdd:pfam01846 1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYKALLDGSEREELFEDY 50
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 953-1008 | 9.29e-10 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 55.27 E-value: 9.29e-10
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 575501595 953 KKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCiKFSSSDRKKQREFEEYIRD 1008
Cdd:smart00441 1 EEAKEAFKELLKEHEVITPDTTWSEARKKLKNDPRY-KALLSESEREQLFEDHIEE 55
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 660-707 | 9.51e-10 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 55.16 E-value: 9.51e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 575501595 660 RMKQFKDMLLERGVSAFSTWEKELHKIVFDPRYL-LLNPKERKQVFDQY 707
Cdd:pfam01846 2 AREAFKELLKEHKITPYSTWSEIKKKIENDPRYKaLLDGSEREELFEDY 50
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 791-844 | 2.01e-09 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 54.12 E-value: 2.01e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 575501595 791 EKIKSDFFELLSNHHLD-SQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEK 844
Cdd:smart00441 1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKALLSESEREQLFEDHIEE 55
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 896-947 | 3.23e-09 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 53.61 E-value: 3.23e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 575501595 896 EAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWEsgSLLEREEKEKLFNEH 947
Cdd:pfam01846 1 KAREAFKELLKEHKITPYSTWSEIKKKIENDPRYK--ALLDGSEREELFEDY 50
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 414-443 | 1.04e-08 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 51.76 E-value: 1.04e-08
10 20 30
....*....|....*....|....*....|
gi 575501595 414 SEWTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:cd00201 2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| WW | pfam00397 | WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... | 414-441 | 3.75e-08 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 50.20 E-value: 3.75e-08
10 20
....*....|....*....|....*...
gi 575501595 414 SEWTEYKTADGKTYYYNNRTLESTWEKP 441
Cdd:pfam00397 3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| WW | pfam00397 | WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... | 137-162 | 6.42e-08 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 49.43 E-value: 6.42e-08
10 20
....*....|....*....|....*.
gi 575501595 137 WVENKTPDGKVYYYNARTRESAWTKP 162
Cdd:pfam00397 5 WEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| WW | smart00456 | Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ... | 132-164 | 6.50e-08 |
|
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Pssm-ID: 197736 [Multi-domain] Cd Length: 33 Bit Score: 49.52 E-value: 6.50e-08
10 20 30
....*....|....*....|....*....|...
gi 575501595 132 PTEEIWVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:smart00456 1 PLPPGWEERKDPDGRPYYYNHETKETQWEKPRE 33
|
|
| WW | smart00456 | Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ... | 416-443 | 7.10e-08 |
|
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Pssm-ID: 197736 [Multi-domain] Cd Length: 33 Bit Score: 49.14 E-value: 7.10e-08
10 20
....*....|....*....|....*...
gi 575501595 416 WTEYKTADGKTYYYNNRTLESTWEKPQE 443
Cdd:smart00456 6 WEERKDPDGRPYYYNHETKETQWEKPRE 33
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 724-777 | 1.54e-07 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 49.11 E-value: 1.54e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 575501595 724 MQAKEDFKKMMEEAKFN-PRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAA 777
Cdd:smart00441 1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKALLSESEREQLFEDHIEE 55
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 137-164 | 1.71e-07 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 48.29 E-value: 1.71e-07
10 20
....*....|....*....|....*...
gi 575501595 137 WVENKTPDGKVYYYNARTRESAWTKPDG 164
Cdd:cd00201 4 WEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| FF | pfam01846 | FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... | 954-1005 | 3.20e-07 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 47.84 E-value: 3.20e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 575501595 954 KKREHFRQLLDETSaITLTSTWKEVKKIIKEDPRCIKFSSSDRKKQrEFEEY 1005
Cdd:pfam01846 1 KAREAFKELLKEHK-ITPYSTWSEIKKKIENDPRYKALLDGSEREE-LFEDY 50
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 895-950 | 3.04e-06 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 45.26 E-value: 3.04e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 575501595 895 EEAIQNFKALLSDMVRS-SDVSWSDTRRTLRKDHRWESgsLLEREEKEKLFNEHIEA 950
Cdd:smart00441 1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYKA--LLSESEREQLFEDHIEE 55
|
|
| FF | smart00441 | Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... | 658-709 | 7.65e-06 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 44.10 E-value: 7.65e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 575501595 658 EARMKQFKDMLLERGVS-AFSTWEKELHKIVFDPRY-LLLNPKERKQVFDQYVK 709
Cdd:smart00441 1 EEAKEAFKELLKEHEVItPDTTWSEARKKLKNDPRYkALLSESEREQLFEDHIE 54
|
|
| WW | smart00456 | Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds ... | 514-542 | 1.14e-05 |
|
Domain with 2 conserved Trp (W) residues; Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Pssm-ID: 197736 [Multi-domain] Cd Length: 33 Bit Score: 42.97 E-value: 1.14e-05
10 20
....*....|....*....|....*....
gi 575501595 514 PWCVVWTGDERVFFYNPTTRLSMWDRPDD 542
Cdd:smart00456 5 GWEERKDPDGRPYYYNHETKETQWEKPRE 33
|
|
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 124-173 | 1.29e-05 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 49.31 E-value: 1.29e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 575501595 124 APGAPALPPTEEIWVENKTPDGKVYYYNARTRESAWTKPDgvKVIQQSEL 173
Cdd:COG5104 4 ALLGMASGEARSEWEELKAPDGRIYYYNKRTGKSSWEKPK--ELLKGSEE 51
|
|
| DUF5401 | pfam17380 | Family of unknown function (DUF5401); This is a family of unknown function found in ... | 699-949 | 1.65e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 48.97 E-value: 1.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 699 ERKQVfDQYVKTRAEEERREKKNKIMQAKEdfKKMMEEAKFNPRATFSEFAAKHAKDSRFkAIEKMKDREALFNEfvaaa 778
Cdd:pfam17380 286 ERQQQ-EKFEKMEQERLRQEKEEKAREVER--RRKLEEAEKARQAEMDRQAAIYAEQERM-AMERERELERIRQE----- 356
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 779 rKKEKEDSKTRGEKIKSDFFEL--LSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMRE-DLFKQYIEKIaknldsEKEK 855
Cdd:pfam17380 357 -ERKRELERIRQEEIAMEISRMreLERLQMERQQKNERVRQELEAARKVKILEEERQRKiQQQKVEMEQI------RAEQ 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 856 ELERQARIEASLREREREVQKARSEQ---TKEIDREREQhkrEEAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWESG 932
Cdd:pfam17380 430 EEARQREVRRLEEERAREMERVRLEEqerQQQVERLRQQ---EEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQ 506
|
250
....*....|....*..
gi 575501595 933 SLLEREEKEKLFNEHIE 949
Cdd:pfam17380 507 AMIEEERKRKLLEKEME 523
|
|
| PTZ00121 | PTZ00121 | MAEBL; Provisional | 708-1048 | 1.68e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 49.37 E-value: 1.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 708 VKTRAEEERR--EKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREAlfNEFVAAARKKEK 783
Cdd:PTZ00121 1423 AKKKAEEKKKadEAKKKAEEAKkaDEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA--KKKAEEAKKKAD 1500
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 784 EDSKTRGEKIKSDffELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKEKELERQARI 863
Cdd:PTZ00121 1501 EAKKAAEAKKKAD--EAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 864 EASLREREREVQKARSEQTKEIDREREQHKREEAiqnfKALLSDMVRSSDVSWSDTRRtlRKDHRWESGSLLEREEKEKL 943
Cdd:PTZ00121 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEA----KKAEEAKIKAEELKKAEEEK--KKVEQLKKKEAEEKKKAEEL 1652
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 944 FNEHIEALTKKKREHFRQLLDETSAitltstwKEVKKIIKEDPRCIKFSSSDRKKQREFEEyIRDKYITAKADFRTLLKE 1023
Cdd:PTZ00121 1653 KKAEEENKIKAAEEAKKAEEDKKKA-------EEAKKAEEDEKKAAEALKKEAEEAKKAEE-LKKKEAEEKKKAEELKKA 1724
|
330 340
....*....|....*....|....*
gi 575501595 1024 TKFITYRSKKLIQESDQHLKDVEKI 1048
Cdd:PTZ00121 1725 EEENKIKAEEAKKEAEEDKKKAEEA 1749
|
|
| WW | cd00201 | Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; ... | 513-542 | 1.98e-05 |
|
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Pssm-ID: 238122 [Multi-domain] Cd Length: 31 Bit Score: 42.52 E-value: 1.98e-05
10 20 30
....*....|....*....|....*....|
gi 575501595 513 TPWCVVWTGDERVFFYNPTTRLSMWDRPDD 542
Cdd:cd00201 2 PGWEERWDPDGRVYYYNHNTKETQWEDPRE 31
|
|
| PRP40 | COG5104 | Splicing factor [RNA processing and modification]; | 137-172 | 8.73e-05 |
|
Splicing factor [RNA processing and modification];
Pssm-ID: 227435 [Multi-domain] Cd Length: 590 Bit Score: 46.61 E-value: 8.73e-05
10 20 30
....*....|....*....|....*....|....*.
gi 575501595 137 WVENKTPDGKVYYYNARTRESAWTKPDGVKVIQQSE 172
Cdd:COG5104 58 WKECRTADGKVYYYNSITRESRWKIPPERKKVEPIA 93
|
|
| PTZ00121 | PTZ00121 | MAEBL; Provisional | 711-980 | 9.57e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 46.67 E-value: 9.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 711 RAEEERR--EKKNKIMQAK--EDFKKMMEEAKFNPRATFSEFAAKHAKDSRFKAIEKMKDREALFNEFVAAARKKEKEDS 786
Cdd:PTZ00121 1297 KAEEKKKadEAKKKAEEAKkaDEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA 1376
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 787 KTRGEKIKSDFFELLSNHHLDSQSRwskvKDKVESDPRYKAVDSSSMREDLfKQYIEKIAKNLDSEKEKELERQARieaS 866
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAE----EDKKKADELKKAAAAKKKADEA-KKKAEEKKKADEAKKKAEEAKKAD---E 1448
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 867 LREREREVQKARSEQTKEIDREREQHKREEAIQNFKAllSDMVRSSDVSWSDTRRTLRKDHRWESGSLLEREEKEKLFNE 946
Cdd:PTZ00121 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
|
250 260 270
....*....|....*....|....*....|....
gi 575501595 947 HIEALTKKKREHFRQLLDETSAITLTSTwKEVKK 980
Cdd:PTZ00121 1527 AKKAEEAKKADEAKKAEEKKKADELKKA-EELKK 1559
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
587-1001 |
1.32e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 46.29 E-value: 1.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 587 AIKEEQELMEEMNEDEPIKAKKRKRMskksfmwIARASLFRRDDNKDIDSEKEAAMEAEIKAARERAIVPLEARMKQFKD 666
Cdd:PTZ00121 1333 AAKKKAEEAKKAAEAAKAEAEAAADE-------AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 667 MLLE-RGVSAFSTWEKELHKIVFDPRYL--LLNPKERKQVFDQyVKTRAEEERR--EKKNKIMQAK--EDFKKMMEEAKF 739
Cdd:PTZ00121 1406 KADElKKAAAAKKKADEAKKKAEEKKKAdeAKKKAEEAKKADE-AKKKAEEAKKaeEAKKKAEEAKkaDEAKKKAEEAKK 1484
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 740 NPRA-TFSEFAAKHAKDSRFKAIEKMKDREAL-------FNEFVAAARKKEKEDSKTRGEKIKSDffELLSNHHLDSQSR 811
Cdd:PTZ00121 1485 ADEAkKKAEEAKKKADEAKKAAEAKKKADEAKkaeeakkADEAKKAEEAKKADEAKKAEEKKKAD--ELKKAEELKKAEE 1562
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 812 WSKVKD-KVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSEKEKELErQARIEASLREREREVQKARSEQTK-EIDRER 889
Cdd:PTZ00121 1563 KKKAEEaKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAKIKAEELKKAEEEKKKvEQLKKK 1641
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 890 EQHKREEAIQNFKALLSDMVRSSDVSwsdtRRTLRKDHRWESGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDETsai 969
Cdd:PTZ00121 1642 EAEEKKKAEELKKAEEENKIKAAEEA----KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE--- 1714
|
410 420 430
....*....|....*....|....*....|..
gi 575501595 970 tlTSTWKEVKKiiKEDPRCIKFSSSDRKKQRE 1001
Cdd:PTZ00121 1715 --KKKAEELKK--AEEENKIKAEEAKKEAEED 1742
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
258-346 |
1.51e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 1.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 258 VGAPTPTTSSPAPAVSTSTPTSTPSSTTATTTtatsvAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTaTPVQTVPQP 337
Cdd:pfam05109 513 VTTPTPNATSPTPAVTTPTPNATSPTLGKTSP-----TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKT-SPTSAVTTP 586
|
....*....
gi 575501595 338 HPQTLPPAV 346
Cdd:pfam05109 587 TPNATSPTV 595
|
|
| FF |
pfam01846 |
FF domain; This domain has been predicted to be involved in protein-protein interaction. This ... |
1013-1072 |
2.01e-04 |
|
FF domain; This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
Pssm-ID: 426471 [Multi-domain] Cd Length: 50 Bit Score: 40.13 E-value: 2.01e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 1013 AKADFRTLLKETKfITYRSkkliqesdqHLKDVEKILQNDKRYLVLDcVPEERRKLIVAY 1072
Cdd:pfam01846 2 AREAFKELLKEHK-ITPYS---------TWSEIKKKIENDPRYKALL-DGSEREELFEDY 50
|
|
| FF |
smart00441 |
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often ... |
1011-1075 |
2.10e-04 |
|
Contains two conserved F residues; A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
Pssm-ID: 128718 [Multi-domain] Cd Length: 55 Bit Score: 40.25 E-value: 2.10e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 575501595 1011 ITAKADFRTLLKETKFITYrskkliqesDQHLKDVEKILQNDKRYLVLDcVPEERRKLIVAYVDD 1075
Cdd:smart00441 1 EEAKEAFKELLKEHEVITP---------DTTWSEARKKLKNDPRYKALL-SESEREQLFEDHIEE 55
|
|
| WW |
pfam00397 |
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds ... |
513-540 |
2.15e-04 |
|
WW domain; The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Pssm-ID: 459800 [Multi-domain] Cd Length: 30 Bit Score: 39.41 E-value: 2.15e-04
10 20
....*....|....*....|....*...
gi 575501595 513 TPWCVVWTGDERVFFYNPTTRLSMWDRP 540
Cdd:pfam00397 3 PGWEERWDPDGRVYYYNHETGETQWEKP 30
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
294-423 |
2.29e-04 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 44.47 E-value: 2.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 294 VAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHPQTLP-PAVPHSVPQpaaaipafppvmvPPFRVPL 372
Cdd:PHA02682 82 LAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPaPARPAPACP-------------PSTRQCP 148
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 575501595 373 PGMPIPLPGVLPGMAPPIV-PMIHPQVAIAASPATLAGATAVSEWTEYKTAD 423
Cdd:PHA02682 149 PAPPLPTPKPAPAAKPIFLhNQLPPPDYPAASCPTIETAPAASPVLEPRIPD 200
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
695-1067 |
3.88e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 44.75 E-value: 3.88e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 695 LNPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDFKKMMEEAKfnpratFSEFAAKHAKDSRfKAIEKMKDREALFNEf 774
Cdd:PTZ00121 1072 LKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEEAR------KAEEAKKKAEDAR-KAEEARKAEDARKAE- 1143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 775 vaAARKKEkEDSKTRGEKIKSDFFELLSNHHLDSQSRWSKVKDKVE---SDPRYKAVDSSSMREDLFKQYIEKIAKNLDS 851
Cdd:PTZ00121 1144 --EARKAE-DAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEvrkAEELRKAEDARKAEAARKAEEERKAEEARKA 1220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 852 EKEKELERQARIEaSLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsdtrrTLRK-DHRWE 930
Cdd:PTZ00121 1221 EDAKKAEAVKKAE-EAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKAD--------ELKKaEEKKK 1291
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 931 SGSLLEREEKEKLFNEHIEALTKKKREHFRQLLDET--SAITLTSTWKEVKKIIKEDPRCIKFSSSDRKKQREFEEYIRD 1008
Cdd:PTZ00121 1292 ADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAkkKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 575501595 1009 KYITAKADFRTLLK--ETKFITYRSKKLIQESDQHLKDVEKILQNDKRYLVLDCVPEERRK 1067
Cdd:PTZ00121 1372 KKEEAKKKADAAKKkaEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK 1432
|
|
| half-pint |
TIGR01645 |
poly-U binding splicing factor, half-pint family; The proteins represented by this model ... |
299-463 |
4.44e-04 |
|
poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Pssm-ID: 130706 [Multi-domain] Cd Length: 612 Bit Score: 44.29 E-value: 4.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 299 STPTTQDQTPSSAVSVATPTVSVSapaPTATPVQTVPQPHPqTLPPAVPHSVPQPAAAIPAFppvmVPPFRVPLPGMPIP 378
Cdd:TIGR01645 322 AVLGPRAQSPATPSSSLPTDIGNK---AVVSSAKKEAEEVP-PLPQAAPAVVKPGPMEIPTP----VPPPGLAIPSLVAP 393
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 379 LPGVLPGMAPPIV------PMIHPQVAIAASP--ATLAGATAVSEwtEYKTADGKTYYYNNRTLESTWEKPQElKEKEKL 450
Cdd:TIGR01645 394 PGLVAPTEINPSFlasprkKMKREKLPVTFGAldDTLAWKEPSKE--DQTSEDGKMLAIMGEAAAALALEPKK-KKKEKE 470
|
170
....*....|...
gi 575501595 451 DEKIKEPIKEASE 463
Cdd:TIGR01645 471 GEELQPKLVMNSE 483
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
714-1068 |
7.25e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 43.90 E-value: 7.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 714 EERREKKNKIMQAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSRF-----KAIEKMKDREALFNEFVAAARKKEKEDSKT 788
Cdd:PRK03918 175 KRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELreeleKLEKEVKELEELKEEIEELEKELESLEGSK 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 789 RGEKIKsdffelLSNhhldSQSRWSKVKDKVEsDPRYKAVDSSSMREDLfKQYIEkiaknLDSEKEKELERQARIE---A 865
Cdd:PRK03918 255 RKLEEK------IRE----LEERIEELKKEIE-ELEEKVKELKELKEKA-EEYIK-----LSEFYEEYLDELREIEkrlS 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 866 SLREREREVQKARSEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsDTRRTLRKDHRWESGslLEREEKEKLFN 945
Cdd:PRK03918 318 RLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE----EAKAKKEELERLKKR--LTGLTPEKLEK 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 946 EhIEALTKKKREHFRQLLdetsaiTLTSTWKEVKKIIKEDPRCIKFSSSDRKK----QREFEEYIRDKYITA-KADFRTL 1020
Cdd:PRK03918 392 E-LEELEKAKEEIEEEIS------KITARIGELKKEIKELKKAIEELKKAKGKcpvcGRELTEEHRKELLEEyTAELKRI 464
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 575501595 1021 LKETKFITYRSKKLIQEsdqhLKDVEKILQNDKRYLVLDCVPEERRKL 1068
Cdd:PRK03918 465 EKELKEIEEKERKLRKE----LRELEKVLKKESELIKLKELAEQLKEL 508
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
260-345 |
9.97e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.02 E-value: 9.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVS-VSAPAPTATPVQTVPQPH 338
Cdd:pfam17823 165 ASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGtALAAVGNSSPAAGTVTAA 244
|
....*..
gi 575501595 339 PQTLPPA 345
Cdd:pfam17823 245 VGTVTPA 251
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
696-1088 |
1.32e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 43.04 E-value: 1.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 696 NPKERKQVFDQYVKTRAEEERREKKNKIMQAKEDfkkmmeeakfnpratfsefaakhakdsrfKAIEKMKDREALFNEFV 775
Cdd:pfam02463 151 KPERRLEIEEEAAGSRLKRKKKEALKKLIEETEN-----------------------------LAELIIDLEELKLQELK 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 776 AAARKKEKEDSKTRGEKIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKavDSSSMREDLFKQYIEKIAKNLDSEKEK 855
Cdd:pfam02463 202 LKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEE--IESSKQEIEKEEEKLAQVLKENKEEEK 279
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 856 ELERQARIEASLREREREVQKAR--SEQTKEIDREREQHKREEAIQNFKALLSDMVRSSDvswsdtRRTLRKDHRWESGS 933
Cdd:pfam02463 280 EKKLQEEELKLLAKEEEELKSELlkLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEE------LEKELKELEIKREA 353
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 934 LLEREE----KEKLFNEHIEALTKKKREHFRQLLDETSAITLTSTWKEVKKIIkedprcikfsSSDRKKQREFEEYIRDK 1009
Cdd:pfam02463 354 EEEEEEelekLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKE----------AQLLLELARQLEDLLKE 423
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 575501595 1010 YITAKADFrtLLKETKFITYRSKKLIQESDqHLKDVEKILQNDKRYLVLDCVPEERRKLIVAYVDDLDRRGPPPPPTAS 1088
Cdd:pfam02463 424 EKKEELEI--LEEEEESIELKQGKLTEEKE-ELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERS 499
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
711-993 |
2.00e-03 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 42.25 E-value: 2.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 711 RAEEERREKKNKIMQAKEDFKKMMEEAKFNPRATFSEFAAKHAKDSrfKAIEKMKDREALFNEFVAAARKKEKEDSKTRG 790
Cdd:COG5185 257 KLVEQNTDLRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYT--KSIDIKKATESLEEQLAAAEAEQELEESKRET 334
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 791 EKIKSDFFELLSNHHLDSQSRWSKVKDKVESDPRYKAVDSSSMREDLFKQYIEKIAKNLDSekekelerqarIEASLRER 870
Cdd:COG5185 335 ETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVELSKSSEELDSFKDTIESTKESLDE-----------IPQNQRGY 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 871 EREVQKARSEQTKEIDREREQHKR---------EEAIQNFKALLSDMVRSSDVSWSDTRRTLRKDHRWESGSLLEREEKE 941
Cdd:COG5185 404 AQEILATLEDTLKAADRQIEELQRqieqatssnEEVSKLLNELISELNKVMREADEESQSRLEEAYDEINRSVRSKKEDL 483
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 575501595 942 --------------KLFNEHIEALTKKKREHFRQLLDETSAITLTSTWKEVKKIIKEDPRCIKFSS 993
Cdd:COG5185 484 neeltqiesrvstlKATLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILALENLIPASE 549
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
298-414 |
2.01e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.87 E-value: 2.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 298 VSTPTTQDQTPSSAVSVATPTVSvSAPAPTatpvqtvPQPHPQTLPPAVPHSVPQPAAAIPAFPPVMVP-PFRVPLPGMP 376
Cdd:pfam17823 296 AAPMGAQAQGPIIQVSTDQPVHN-TAGEPT-------PSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKePSASPVPVLH 367
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 575501595 377 ---IP---------LPGVLP---GMAPPIVPMIHPQVAIAASPATL-AGATAVS 414
Cdd:pfam17823 368 tsmIPeveatspttQPSPLLptqGAAGPGILLAPEQVATEATAGTAsAGPTPRS 421
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
260-397 |
3.35e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 3.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAP------------APT 327
Cdd:pfam03154 176 AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQrlpsphpplqpmTQP 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 328 ATPVQTVPQPHP---------------QTLPPAVPHSV------------------------------PQPAAAIPAFPP 362
Cdd:pfam03154 256 PPPSQVSPQPLPqpslhgqmppmphslQTGPSHMQHPVppqpfpltpqssqsqvppgpspaapgqsqqRIHTPPSQSQLQ 335
|
170 180 190
....*....|....*....|....*....|....*
gi 575501595 363 VMVPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQ 397
Cdd:pfam03154 336 SQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQ 370
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
834-965 |
5.89e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 40.67 E-value: 5.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 834 REDLFKQYIEKIAKNLDSEKEKELERQARIEAsLREREREVQKARSEQ--------TKEIDR-EREQHKREEAIQNFKAL 904
Cdd:COG4913 289 RLELLEAELEELRAELARLEAELERLEARLDA-LREELDELEAQIRGNggdrleqlEREIERlERELEERERRRARLEAL 367
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 575501595 905 LSDMvrssDVSWSDTRRTLRKDHRwESGSLLER--EEKEKLFNEHIEALTKKK--REHFRQLLDE 965
Cdd:COG4913 368 LAAL----GLPLPASAEEFAALRA-EAAALLEAleEELEALEEALAEAEAALRdlRRELRELEAE 427
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
842-1051 |
8.18e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 40.44 E-value: 8.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 842 IEKIAKNLDSEKEKELERQARIEASLREREREVQKARSEQT---KEIDREREQ-HKREEAIQNFKALLSD-MVRSSDVSW 916
Cdd:TIGR02169 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKeleARIEELEEDlHKLEEALNDLEARLSHsRIPEIQAEL 800
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 917 SDTRRTLRkdhRWESG-SLLEREEKEKLFNEHIEaltKKKREHFRQLLDEtsaitLTSTWKEVKKIIKEDPRCIKFSSSD 995
Cdd:TIGR02169 801 SKLEEEVS---RIEARlREIEQKLNRLTLEKEYL---EKEIQELQEQRID-----LKEQIKSIEKEIENLNGKKEELEEE 869
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 575501595 996 RKKQREFEEYIRDKYITAKADFRTLLKETKFITYRSKKL---IQESDQHLKDVEKILQN 1051
Cdd:TIGR02169 870 LEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELeaqIEKKRKRLSELKAKLEA 928
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
260-424 |
8.36e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.31 E-value: 8.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 260 APTPTTSSPAPAVSTSTPTSTPSSTTATTTTATSVAQTVSTPTTQDQTPSSAVSVATPTVSVSAPAPTATPVQTVPQPHP 339
Cdd:PHA03247 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 340 QTLPPAVPHSVPQPAAAIpafppvmvPPFRVPLPGMPIPLPGVLPGMAPPIVPMIHPQVAIAASPATLAGATAVSEWTEY 419
Cdd:PHA03247 2783 LTRPAVASLSESRESLPS--------PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
|
....*
gi 575501595 420 KTADG 424
Cdd:PHA03247 2855 SVAPG 2859
|
|
| PLN02316 |
PLN02316 |
synthase/transferase |
831-916 |
9.40e-03 |
|
synthase/transferase
Pssm-ID: 215180 [Multi-domain] Cd Length: 1036 Bit Score: 40.24 E-value: 9.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 575501595 831 SSMREDLFKQYiekiaknLDSEKEKELERQARIEASlREREREVQKARSEQTKEIDREREQHKREEAIQNFKA--LLSDM 908
Cdd:PLN02316 239 GGMDEHSFEDF-------LLEEKRRELEKLAKEEAE-RERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLqnLLKKA 310
|
....*...
gi 575501595 909 VRSSDVSW 916
Cdd:PLN02316 311 SRSADNVW 318
|
|
|