|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
108-181 |
4.58e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 4.58e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 108 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 181
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
249-310 |
8.68e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.68e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 249 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 310
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
769-1085 |
1.72e-07 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 1.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 769 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 840
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 841 PNMPQQRQDQHHQSAMMHPAS-AAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 919
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 920 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 980
Cdd:PHA03247 2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 981 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1060
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
|
330 340
....*....|....*....|....*
gi 1622996232 1061 aqqtvftihPSHVQPAYTNPPHMAH 1085
Cdd:PHA03247 2984 ---------PSREAPASSTPPLTGH 2999
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
408-599 |
9.42e-04 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 43.33 E-value: 9.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1622996232 563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
750-765 |
7.40e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains. :
Pssm-ID: 429316 Cd Length: 17 Bit Score: 34.89 E-value: 7.40e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
108-181 |
4.58e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 4.58e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 108 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 181
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
249-310 |
8.68e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.68e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 249 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 310
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
769-1085 |
1.72e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 1.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 769 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 840
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 841 PNMPQQRQDQHHQSAMMHPAS-AAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 919
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 920 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 980
Cdd:PHA03247 2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 981 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1060
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
|
330 340
....*....|....*....|....*
gi 1622996232 1061 aqqtvftihPSHVQPAYTNPPHMAH 1085
Cdd:PHA03247 2984 ---------PSREAPASSTPPLTGH 2999
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
770-942 |
1.30e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.65 E-value: 1.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 770 KPSTTPTSPRPQAQPSPSMVGHQ--------------QPTPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAK 835
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQP----APAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 836 TYRAVPNMPQQRQDQHHQ-SAMMHPASAAGPPiaaTPPAYSTQyvaySPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQ 911
Cdd:pfam09770 245 QPQQQPQQPQQHPGQGHPvTILQRPQSPQPDP---AQPSIQPQ----AQQFHQQPPPVPVQPTQILQNPNRLSAArvgYP 317
|
170 180 190
....*....|....*....|....*....|.
gi 1622996232 912 GNARMMAPPTHAQPGLVSSSATQYGAHEQTH 942
Cdd:pfam09770 318 QNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
777-890 |
5.97e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.11 E-value: 5.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 777 SPRPQAQPSPS-MVGHQQPTPVYTQpvcfAPNMMYP-VPVSPGVQPLYPIPMTPMPVNQAKTYravPNMPQQRQDQHHQS 854
Cdd:TIGR01628 379 QPRMRQLPMGSpMGGAMGQPPYYGQ----GPQQQFNgQPLGWPRMSMMPTPMGPGGPLRPNGL---APMNAVRAPSRNAQ 451
|
90 100 110
....*....|....*....|....*....|....*.
gi 1622996232 855 AMMHPASAagPPIAATPPAYSTQyvaySPQQFPNQP 890
Cdd:TIGR01628 452 NAAQKPPM--QPVMYPPNYQSLP----LSQDLPQPQ 481
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
408-599 |
9.42e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.33 E-value: 9.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1622996232 563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
750-765 |
7.40e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 34.89 E-value: 7.40e-03
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
112-179 |
8.15e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 8.15e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 112 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 179
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
108-181 |
4.58e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 88.38 E-value: 4.58e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 108 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESILFKCSDFVVVQFK 181
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
249-310 |
8.68e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.68e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 249 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 310
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
769-1085 |
1.72e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.10 E-value: 1.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 769 PKPSTTPTSPRPQAQPSP--SMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 840
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 841 PNMPQQRQDQHHQSAMMHPAS-AAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 919
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 920 PTHAQP-------GLVSSSATQYGA------------HEQTHAMYACPKLPYNKETsPSFYFAISTGSLAQQyahPNATL 980
Cdd:PHA03247 2831 PTSAQPtapppppGPPPPSLPLGGSvapggdvrrrppSRSPAAKPAAPARPPVRRL-ARPAVSRSTESFALP---PDQPE 2906
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 981 HPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSfpa 1060
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA--- 2983
|
330 340
....*....|....*....|....*
gi 1622996232 1061 aqqtvftihPSHVQPAYTNPPHMAH 1085
Cdd:PHA03247 2984 ---------PSREAPASSTPPLTGH 2999
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
770-1132 |
3.14e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 3.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 770 KPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 849
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 850 qhhQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQP---- 925
Cdd:PHA03247 2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 926 ---GLVSSSATQYGAHEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTPHPQPSATPTG---QQQS 999
Cdd:PHA03247 2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAalpPAAS 2823
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 1000 QHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqnSFPAAQQTVFTIHPSHVQPAYTN 1079
Cdd:PHA03247 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALP 2901
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1622996232 1080 PPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSAlQPIPVS 1132
Cdd:PHA03247 2902 PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAG 2953
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
770-942 |
1.30e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 49.65 E-value: 1.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 770 KPSTTPTSPRPQAQPSPSMVGHQ--------------QPTPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAK 835
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPAPSRkmmsleeveaamraQAKKPAQQP----APAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 836 TYRAVPNMPQQRQDQHHQ-SAMMHPASAAGPPiaaTPPAYSTQyvaySPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQ 911
Cdd:pfam09770 245 QPQQQPQQPQQHPGQGHPvTILQRPQSPQPDP---AQPSIQPQ----AQQFHQQPPPVPVQPTQILQNPNRLSAArvgYP 317
|
170 180 190
....*....|....*....|....*....|.
gi 1622996232 912 GNARMMAPPTHAQPGLVSSSATQYGAHEQTH 942
Cdd:pfam09770 318 QNPQPGVQPAPAHQAHRQQGSFGRQAPIITH 348
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
858-1112 |
4.68e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.72 E-value: 4.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 858 HPASAAGPPIAATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPTHAQPGLVSSSA 932
Cdd:pfam09770 106 QPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPAPAPQPAAQPA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 933 TQYGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNATLHPHTPHPQPSATPTGQQQSQHGGSH 1005
Cdd:pfam09770 185 SLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHP 257
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 1006 PAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqNSFPAAQQTVFTIHPSHVQPAytnPPHMAH 1085
Cdd:pfam09770 258 GQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAHQAH 333
|
250 260
....*....|....*....|....*..
gi 1622996232 1086 vPQAHVQSGMVPSHpTAHAPMMLMTTQ 1112
Cdd:pfam09770 334 -RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
777-890 |
5.97e-05 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 47.11 E-value: 5.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 777 SPRPQAQPSPS-MVGHQQPTPVYTQpvcfAPNMMYP-VPVSPGVQPLYPIPMTPMPVNQAKTYravPNMPQQRQDQHHQS 854
Cdd:TIGR01628 379 QPRMRQLPMGSpMGGAMGQPPYYGQ----GPQQQFNgQPLGWPRMSMMPTPMGPGGPLRPNGL---APMNAVRAPSRNAQ 451
|
90 100 110
....*....|....*....|....*....|....*.
gi 1622996232 855 AMMHPASAagPPIAATPPAYSTQyvaySPQQFPNQP 890
Cdd:TIGR01628 452 NAAQKPPM--QPVMYPPNYQSLP----LSQDLPQPQ 481
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
767-1055 |
6.98e-05 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 47.39 E-value: 6.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 767 SQPKPSTTPTSPRPQAQPSPsmvGHQQPTPVYT-QPVCFAPNMMYPVPVSPGVQPLYPiPMTPMPVNQAKTYRAVPNMPQ 845
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPLQQ-PVQPQQPYYAPAAEQPAQQPY 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 846 QRQDQHHQSAMMHPASAAGPPIAATPpaystqyvaysPQQFPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPTHAQP 925
Cdd:PRK10263 421 YAPAPEQPAQQPYYAPAPEQPVAGNA-----------WQAEEQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQQP 481
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 926 GLVsssatqygahEQTHAMYACPKLPYNKETSPSFYFaisTGSLAQQYAHPNATLHP-HTPHPQPSATPTGQQQSQHGGS 1004
Cdd:PRK10263 482 QPV----------EQQPVVEPEPVVEETKPARPPLYY---FEEVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPS 548
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1622996232 1005 HPAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1055
Cdd:PRK10263 549 VAAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
752-1105 |
2.09e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.44 E-value: 2.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 752 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 831
Cdd:PHA03378 583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 832 NQAKTYRAVPNMPQQRQDQHHqsAMMHPASAAgPPIAATPPAYSTQyvAYSPQQFPnqplvqhVPHYQSQHPHVYSPVIQ 911
Cdd:PHA03378 661 PYKPTWTQIGHIPYQPSPTGA--NTMLPIQWA-PGTMQPPPRAPTP--MRPPAAPP-------GRAQRPAAATGRARPPA 728
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 912 GNARMMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKlpynketspsfyfaistgslaqqyahPNATLHPHTPHPQPSA 991
Cdd:PHA03378 729 AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP--------------------------PAAAPGAPTPQPPPQA 782
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 992 TPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPTPP 1043
Cdd:PHA03378 783 PPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPSPG 859
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622996232 1044 SMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQAHVQSGMVPSHPTAHAP 1105
Cdd:PHA03378 860 SGTSDKIVQAPVFYPPVLQPIQVMRQLGSVRAAAASTVTQAPTEYTGERRGVGPMHPTDIPP 921
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
768-1008 |
2.18e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.41 E-value: 2.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 768 QPKPSTTPTSPRPQAQPSPSMVGHQQPTPVYTQPVCFA-PNMMYPVPVSP--------GVQPLYPIPMTPMPVNQAKTYR 838
Cdd:pfam09770 106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyEKYKEPEPIPDlqvdaslwGVAPKKAAAPAPAPQPAAQPAS 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 839 AVPNMPQQRQDQHHQSAMM---HPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvysPVIQGNAR 915
Cdd:pfam09770 186 LPAPSRKMMSLEEVEAAMRaqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQP----QQHPGQGH 261
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 916 MMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPKLPynketSPSfyfaistgslaQQYAHPN------ATLHPHTPHPQP 989
Cdd:pfam09770 262 PVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPV-----QPT-----------QILQNPNrlsaarVGYPQNPQPGVQ 325
|
250
....*....|....*....
gi 1622996232 990 SATPTGQQQSQHGGSHPAP 1008
Cdd:pfam09770 326 PAPAHQAHRQQGSFGRQAP 344
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
763-1135 |
4.49e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.59 E-value: 4.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 763 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPTPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 842
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 843 MPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGNAR 915
Cdd:PRK07764 472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGDTL 548
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 916 MMApptHAQPGLVSSSATQYGA-------HEQTHAMYAcpklpYNKETSPSFYFAISTGSLAQQYAHPNatlhPHTPHPQ 988
Cdd:PRK07764 549 VLG---FSTGGLARRFASPGNAevlvtalAEELGGDWQ-----VEAVVGPAPGAAGGEGPPAPASSGPP----EEAARPA 616
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 989 PSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSF 1058
Cdd:PRK07764 617 APAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGA 696
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 1059 PAAQQTVFTIHPSHVQPAYTNPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQPIPV 1131
Cdd:PRK07764 697 APAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAAPPP 776
|
....
gi 1622996232 1132 STTA 1135
Cdd:PRK07764 777 SPPS 780
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
750-906 |
6.06e-04 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 44.03 E-value: 6.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 750 RKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPTPVYTQPVCFAPNMMYPVPVSPGVQ--PLYPIPMT 827
Cdd:TIGR01628 367 RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPlrPNGLAPMN 441
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 828 PMpvnqaktyRAVPNMPQQRQDQHHQSAMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQ------ 901
Cdd:TIGR01628 442 AV--------RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQKQvlgerl 513
|
....*
gi 1622996232 902 HPHVY 906
Cdd:TIGR01628 514 FPLVE 518
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
775-1138 |
9.12e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.77 E-value: 9.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 775 PTSPRPQAQPSPSmVGHQQPTPVYTqpvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQdqhhQS 854
Cdd:PHA03247 2551 PPPPLPPAAPPAA-PDRSVPPPRPA-----------PRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPA----PP 2614
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 855 AMMHPASAAGPPIAATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPTHAQPGLVSSSATQ 934
Cdd:PHA03247 2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 935 YGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNATLHPHTP----HPQPSATPTGQQQSQHGGSHPAPSP 1010
Cdd:PHA03247 2695 LTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAP 2771
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 1011 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNPPHMAHVPQAH 1090
Cdd:PHA03247 2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1622996232 1091 VQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1138
Cdd:PHA03247 2852 LGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
408-599 |
9.42e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 43.33 E-value: 9.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 408 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSI 483
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 484 SSGLEFVSHNPPSEAATPPVARTSPSGGTWSSV-VSGVPRLSPKTHRPRSPRQNSIGNTPSGPVLASPQAGIIPTEAVAM 562
Cdd:PRK12323 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAAAApARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
|
170 180 190
....*....|....*....|....*....|....*..
gi 1622996232 563 PIPAASPTPASPASNRAVTPSSEAKDSRLQDQRQNSP 599
Cdd:PRK12323 527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAP 563
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
411-1011 |
2.25e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 411 PNSLPPRAATPTRPPSRPPSRPSRPPSHPSAHGSPAPV-STMPK--RMSSEGPPRMSPKAQRHPRNHRVSAGRGSISSGL 487
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPqSARPRapVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 488 EFVSHNPPSEAATPPVARTSPSGGTWS-----SVVSGVPRLSPKTHRPRSPRQNSigntPSGPV--LASPQAGIIPTEAV 560
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSrprraRRLGRAAQASSPPQRPRRRAARP----TVGSLtsLADPPPPPPTPEPA 2711
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 561 AMPIPAASPTPASPASNRAVTPSSEAKD-SRLQDQRQNSPAGNKENIKPNETSPSFSKAENKGisPVVSEHRKQIddlkk 639
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAAPaPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA--PAAGPPRRLT----- 2784
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 640 fkndfRLQPSSTSESMDQLlnknregeksrdlikdkiePSAKDSFIENSSSNCTSGSSKPNSPSISPSILSNTEHKRGPE 719
Cdd:PHA03247 2785 -----RPAVASLSESRESL-------------------PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP 2840
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 720 VTSQGVQTSSPACkqekddkeekkdaaeqvrkSTLNPNAkEFNPRSFSQPKPSTTPTSPRPQ----AQPSPSmvghqQPT 795
Cdd:PHA03247 2841 PPPGPPPPSLPLG-------------------GSVAPGG-DVRRRPPSRSPAAKPAAPARPPvrrlARPAVS-----RST 2895
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 796 PVYTQPvcfapnmmyPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDqhhqsammhpasAAGPPIAATPPAYS 875
Cdd:PHA03247 2896 ESFALP---------PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ------------PPLAPTTDPAGAGE 2954
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 876 TQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPTHAQPGLVSSSATQYGAHEQTHAMYACPK----LP 951
Cdd:PHA03247 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKqtlwPP 3031
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622996232 952 YNKETSPSFYFAISTGSLAQQYA---HPNATLHPHTPHPQPSATPTGQQQSQHggSHPAPSPV 1011
Cdd:PHA03247 3032 DDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQFGPPPL 3092
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
711-1081 |
3.05e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 3.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 711 NTEHKRGPEVTSQGVQTSSPACKQEKDDKEEKKDAAEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 790
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 791 HQQPTPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 851
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 852 hQSAMMHPASAAGPPIAAT------PPAYSTQyVAYSPQQFPNQPLVQhvPHYQSQHPHVYSPVIQGNARM---MAPPTH 922
Cdd:pfam03154 286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPA-APGQSQQRIHTPPSQ--SQLQSQQPPREQPLPPAPLSMphiKPPPTT 361
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 923 AQPGLVSSSATQYGAHEQTHAMYacpKLPYNKETSPSFYFAISTGSLAQQYAHPNA-TLHPHT-PHPQPSATPTGQQQSQ 1000
Cdd:pfam03154 362 PIPQLPNPQSHKHPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622996232 1001 hggSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQNSFPAAQQTVFTIHPSHVQPAYTNP 1080
Cdd:pfam03154 439 ---SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515
|
.
gi 1622996232 1081 P 1081
Cdd:pfam03154 516 P 516
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
750-765 |
7.40e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 34.89 E-value: 7.40e-03
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
112-179 |
8.15e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.07 E-value: 8.15e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622996232 112 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESILFKCSDFVVVQ 179
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|