|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
236-309 |
3.71e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 3.71e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 236 MVHILTSVVGSKCEVQVKNGGVYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 309
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
377-438 |
9.31e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.31e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 377 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 438
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
897-1214 |
4.98e-05 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 4.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 897 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 970
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 971 PNMPQQRQEQHHQSTMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1050
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1051 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLA------QQYAHPNATLHPHPPHPQPSA-TPTGQQQSQHGGSHPAPSP 1123
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPPVRRLARPAvSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1124 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAPQTVFTIHPSHVQPAYTTPPHMAHVPQYK 1203
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|.
gi 1958670811 1204 PTTNSSCKSAL 1214
Cdd:PHA03247 2991 SSTPPLTGHSL 3001
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
534-717 |
1.18e-03 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 534 RYQSGPNSLPPRAAtptRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSAEGPPRMSPKAQRH---------PRNHRV 604
Cdd:PHA03247 2609 RGPAPPSPLPPDTH---APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgraaqasspPQRPRR 2685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 605 SAGRGSMSSGLEFVSHNPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSPSGPV-LASPQAGIT 683
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPArPARPPTTAG 2765
|
170 180 190
....*....|....*....|....*....|....*
gi 1958670811 684 PAEAVSMPVPAASPTPASPASNRA-LTPSIEAKDS 717
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVAsLSESRESLPS 2800
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
878-893 |
6.39e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.39e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
236-309 |
3.71e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 3.71e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 236 MVHILTSVVGSKCEVQVKNGGVYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 309
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
377-438 |
9.31e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.31e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 377 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 438
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
897-1214 |
4.98e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 4.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 897 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 970
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 971 PNMPQQRQEQHHQSTMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1050
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1051 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLA------QQYAHPNATLHPHPPHPQPSA-TPTGQQQSQHGGSHPAPSP 1123
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPPVRRLARPAvSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1124 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAPQTVFTIHPSHVQPAYTTPPHMAHVPQYK 1203
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|.
gi 1958670811 1204 PTTNSSCKSAL 1214
Cdd:PHA03247 2991 SSTPPLTGHSL 3001
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
869-1037 |
9.76e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 46.72 E-value: 9.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 869 ERKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 948
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 949 LypiPMTPMPVNQaktyrAGKVPNMPQQRQEQHHQSTMMHPASAAGPPIVATPPAAYSTqYVAYSPQQFPNQPLVQHVPH 1028
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST-ASQGGQNKKLAQVLASATPQ 503
|
170
....*....|....*
gi 1958670811 1029 YQSQ------HPHVY 1037
Cdd:TIGR01628 504 MQKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
978-1201 |
6.48e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.26 E-value: 6.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 978 QEQHHQSTMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 1051
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1052 AHAQPGLVSSSAAQFG----------------AHEQTHAMYVSTGSLAQQYAHPNATLHPHPPHPQPSATPTGQQQSQHG 1115
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1116 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQ----SSFPAAPQTVFTIHPSHVQPAYT 1191
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNrlsaARVGYPQNPQPGVQPAPAHQAHR 334
|
250
....*....|
gi 1958670811 1192 TPPHMAHVPQ 1201
Cdd:pfam09770 335 QQGSFGRQAP 344
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
534-717 |
1.18e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 534 RYQSGPNSLPPRAAtptRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSAEGPPRMSPKAQRH---------PRNHRV 604
Cdd:PHA03247 2609 RGPAPPSPLPPDTH---APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgraaqasspPQRPRR 2685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 605 SAGRGSMSSGLEFVSHNPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSPSGPV-LASPQAGIT 683
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPArPARPPTTAG 2765
|
170 180 190
....*....|....*....|....*....|....*
gi 1958670811 684 PAEAVSMPVPAASPTPASPASNRA-LTPSIEAKDS 717
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVAsLSESRESLPS 2800
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
240-307 |
1.49e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins also exist in archaea and bacteria, where they form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 38.00 E-value: 1.49e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 240 LTSVVGSKCEVQVKNGGVYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 307
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
878-893 |
6.39e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.39e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
833-958 |
9.19e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.45 E-value: 9.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 833 SPSVLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREERKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSPRPQAQ 911
Cdd:PRK10263 746 TPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQPQQPVA 825
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1958670811 912 PSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 958
Cdd:PRK10263 826 PQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
236-309 |
3.71e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 3.71e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 236 MVHILTSVVGSKCEVQVKNGGVYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 309
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
377-438 |
9.31e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 9.31e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 377 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 438
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
897-1214 |
4.98e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 4.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 897 PKPSTTPTSPRPQAQPSP------SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAGKV 970
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgsltsLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 971 PNMPQQRQEQHHQSTMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 1050
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1051 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLA------QQYAHPNATLHPHPPHPQPSA-TPTGQQQSQHGGSHPAPSP 1123
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRrrppsrSPAAKPAAPARPPVRRLARPAvSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1124 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAPQTVFTIHPSHVQPAYTTPPHMAHVPQYK 1203
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
330
....*....|.
gi 1958670811 1204 PTTNSSCKSAL 1214
Cdd:PHA03247 2991 SSTPPLTGHSL 3001
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
869-1037 |
9.76e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 46.72 E-value: 9.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 869 ERKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 948
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 949 LypiPMTPMPVNQaktyrAGKVPNMPQQRQEQHHQSTMMHPASAAGPPIVATPPAAYSTqYVAYSPQQFPNQPLVQHVPH 1028
Cdd:TIGR01628 433 R---PNGLAPMNA-----VRAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQST-ASQGGQNKKLAQVLASATPQ 503
|
170
....*....|....*
gi 1958670811 1029 YQSQ------HPHVY 1037
Cdd:TIGR01628 504 MQKQvlgerlFPLVE 518
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
978-1201 |
6.48e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.26 E-value: 6.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 978 QEQHHQSTMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 1051
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1052 AHAQPGLVSSSAAQFG----------------AHEQTHAMYVSTGSLAQQYAHPNATLHPHPPHPQPSATPTGQQQSQHG 1115
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1116 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQ----SSFPAAPQTVFTIHPSHVQPAYT 1191
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNrlsaARVGYPQNPQPGVQPAPAHQAHR 334
|
250
....*....|
gi 1958670811 1192 TPPHMAHVPQ 1201
Cdd:pfam09770 335 QQGSFGRQAP 344
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
895-1039 |
1.04e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.49 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 895 SQPKPSTTPtsPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-IPMTPMPVNQAKTYRAGKVPNM 973
Cdd:pfam09770 205 AQAKKPAQQ--PAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQgHPVTILQRPQSPQPDPAQPSIQ 282
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958670811 974 PQQRQEQHHQ-STMMHPASAAGPPIVATPPAAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSP 1039
Cdd:pfam09770 283 PQAQQFHQQPpPVPVQPTQILQNPNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPIITHP 349
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
534-717 |
1.18e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 534 RYQSGPNSLPPRAAtptRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSAEGPPRMSPKAQRH---------PRNHRV 604
Cdd:PHA03247 2609 RGPAPPSPLPPDTH---APDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgraaqasspPQRPRR 2685
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 605 SAGRGSMSSGLEFVSHNPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSPSGPV-LASPQAGIT 683
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPArPARPPTTAG 2765
|
170 180 190
....*....|....*....|....*....|....*
gi 1958670811 684 PAEAVSMPVPAASPTPASPASNRA-LTPSIEAKDS 717
Cdd:PHA03247 2766 PPAPAPPAAPAAGPPRRLTRPAVAsLSESRESLPS 2800
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
240-307 |
1.49e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins also exist in archaea and bacteria, where they form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 38.00 E-value: 1.49e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958670811 240 LTSVVGSKCEVQVKNGGVYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 307
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
895-1059 |
1.99e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 895 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipmtPMPVNQAKTYRAGKVPNM 973
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL------QQPVQPQQPYYAPAAEQP 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 974 PQQRQEqhhqstmmhpASAAGPPIVATPPAAYSTQYVAYSPQQFPNQ-PLVQHVPHYQSQHPHVySPVIQgNARMMAPPA 1052
Cdd:PRK10263 416 AQQPYY----------APAPEQPAQQPYYAPAPEQPVAGNAWQAEEQqSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQP 483
|
....*..
gi 1958670811 1053 HAQPGLV 1059
Cdd:PRK10263 484 VEQQPVV 490
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
536-714 |
2.69e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.17 E-value: 2.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 536 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSAEGPPRMSPKAQRHPRNHRVSAGRGSM 611
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 612 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSP--------SGPVLASPQAGIT 683
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1958670811 684 PAEAVSMPVPAAS------------PTPASPASNRALTPSIEA 714
Cdd:PRK12323 520 GWVAESIPDPATAdpddafetlapaPAAAPAPRAAAATEPVVA 562
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
875-1121 |
5.88e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 40.79 E-value: 5.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 875 EQVRKSTLNPNAKEFNP--RSFSQPKPSTTPTSPRPQAQPSPSMVG---HQQPAPVytqpvcfaPNM-----MYPVPVSP 944
Cdd:pfam09770 98 EQVRFNRQQPAARAAQSsaQPPASSLPQYQYASQQSQQPSKPVRTGyekYKEPEPI--------PDLqvdasLWGVAPKK 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 945 GVQPLYPIPMTPMPVNQAKTYRagKVPNMPQ---QRQEQHHQSTMMHPASAAGPPivATPPAAYSTQYVAYSPQQFPNQP 1021
Cdd:pfam09770 170 AAAPAPAPQPAAQPASLPAPSR--KMMSLEEveaAMRAQAKKPAQQPAPAPAQPP--AAPPAQQAQQQQQFPPQIQQQQQ 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1022 LVQHVPHYQSQHPHVYSPVIQgnARMMAPPAhaQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQ------YAHPNATLHP 1095
Cdd:pfam09770 246 PQQQPQQPQQHPGQGHPVTIL--QRPQSPQP--DPAQPSIQPQAQQFHQQPPPVPVQPTQILQNpnrlsaARVGYPQNPQ 321
|
250 260
....*....|....*....|....*.
gi 1958670811 1096 HPphpQPSATPTGQQQSQHGGSHPAP 1121
Cdd:pfam09770 322 PG---VQPAPAHQAHRQQGSFGRQAP 344
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
896-1034 |
6.18e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 6.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 896 QPKPSTTPTSPRPQAQPSPSMVGHQQPapvYTQPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAKTYRAGKVPNMPQ 975
Cdd:PRK10263 370 EPVIAPAPEGYPQQSQYAQPAVQYNEP---LQQPVQPQQPYYAPAAEQPAQQPYYA-PAPEQPAQQPYYAPAPEQPVAGN 445
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1958670811 976 QRQEQHHQSTmMHPASAAGPPIVATPPAAYSTQYVAysPQQFPNQPLVQHVPHYQSQHP 1034
Cdd:PRK10263 446 AWQAEEQQST-FAPQSTYQTEQTYQQPAAQEPLYQQ--PQPVEQQPVVEPEPVVEETKP 501
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
878-893 |
6.39e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.28 E-value: 6.39e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
535-1107 |
7.71e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.69 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 535 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSAEGPPRMSpkaqrhprnhrvsagrgSMSSG 614
Cdd:PHA03247 2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 615 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSPSGPvlaspqAGITPAEAV 688
Cdd:PHA03247 2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD------RGDPRGPAP 2613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 689 SMPVPAASPTPASPASNRaltpsieakdsrlqdqrqnSPAGNKENIKASETSPSFSKAENKGVSPVISEHRKqiddlkkf 768
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSP-------------------SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------- 2666
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 769 kndfrlqpsSTSESMDQLLSKNREGEKSRDLmkdKTEASAKDSFIDSGSSSCTSSSSKTNSPSASPSVLSNAEHKRG-PE 847
Cdd:PHA03247 2667 ---------ARRLGRAAQASSPPQRPRRRAA---RPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAsPA 2734
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 848 VTSQGVQTSSPA--CKQEKDDREERKDTTEQVRKSTlnPNAKEFNPRSFSQPKPSTTPTSPRPQAQPS---PSMVGHQQP 922
Cdd:PHA03247 2735 LPAAPAPPAVPAgpATPGGPARPARPPTTAGPPAPA--PPAAPAAGPPRRLTRPAVASLSESRESLPSpwdPADPPAAVL 2812
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 923 APVYTQPVCFAPNMmyPVPVSPGVQPLYPiPMTPMPVNQAKTYRAGKVPNMPQQRQEQhHQSTMMHPASAAGPPI--VAT 1000
Cdd:PHA03247 2813 APAAALPPAASPAG--PLPPPTSAQPTAP-PPPPGPPPPSLPLGGSVAPGGDVRRRPP-SRSPAAKPAAPARPPVrrLAR 2888
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 1001 PPAAYSTQYVAYSP--QQFPNQPLVQHVPHYQSQHPHVYSPVIQgnarmMAPPAHAQPGLVSSSAAQFGAHEQTHAMYVS 1078
Cdd:PHA03247 2889 PAVSRSTESFALPPdqPERPPQPQAPPPPQPQPQPPPPPQPQPP-----PPPPPRPQPPLAPTTDPAGAGEPSGAVPQPW 2963
|
570 580 590
....*....|....*....|....*....|
gi 1958670811 1079 TGSLAQ-QYAHPNATLHPHPPHPQPSATPT 1107
Cdd:PHA03247 2964 LGALVPgRVAVPRFRVPQPAPSREAPASST 2993
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
833-958 |
9.19e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.45 E-value: 9.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 833 SPSVLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREERKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSPRPQAQ 911
Cdd:PRK10263 746 TPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQPQQPVA 825
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1958670811 912 PSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 958
Cdd:PRK10263 826 PQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
622-704 |
9.73e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 40.35 E-value: 9.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958670811 622 PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQNSAGNSPSGPVLASPQAGITPAEAVSMPVPAASPTPAS 701
Cdd:PRK07764 417 PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAA 496
|
...
gi 1958670811 702 PAS 704
Cdd:PRK07764 497 PAA 499
|
|
|