NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1908918786|ref|NP_001374113|]
View 

ataxin-2-like protein isoform 20 [Homo sapiens]

Protein Classification

SM-ATX and LsmAD domain-containing protein( domain architecture ID 13860551)

SM-ATX and LsmAD domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.62e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


:

Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.62e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.54e-17

LsmAD domain; This domain is found associated with Lsm domain.


:

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.54e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 super family cl33720
large tegument protein UL36; Provisional
398-943 3.63e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  851 QAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLP-P 929
Cdd:PHA03247  2912 QA---------PPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPqP 2982
                          570
                   ....*....|....
gi 1908918786  930 GPSAQSPQSSFPQP 943
Cdd:PHA03247  2983 APSREAPASSTPPL 2996
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
844-1054 9.03e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.02  E-value: 9.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  844 SAEQPTPQAlyatvhqsYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLgsgQPQQNLYHPGALTGT 923
Cdd:pfam09770  103 NRQQPAARA--------AQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPIPDL---QVDASLWGVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  924 PPSLPPGPSAQSPQSSF-------------------PQPAAVYAIHHQQLPHGftnmahvtQAHVQTGITAAPPPHPGAP 984
Cdd:pfam09770  172 APAPAPQPAAQPASLPApsrkmmsleeveaamraqaKKPAQQPAPAPAQPPAA--------PPAQQAQQQQQFPPQIQQQ 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  985 HPPQVMLLHPPQSHGGPPQGAV---PQSGVPALSASTPSPYPYIGH----PQGEQPGQA---PGFPGGADDRIPPLPPPG 1054
Cdd:pfam09770  244 QQPQQQPQQPQQHPGQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHqqppPVPVQPTQIlqnPNRLSAARVGYPQNPQPG 323
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.62e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.62e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.54e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.54e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-943 3.63e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  851 QAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLP-P 929
Cdd:PHA03247  2912 QA---------PPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPqP 2982
                          570
                   ....*....|....
gi 1908918786  930 GPSAQSPQSSFPQP 943
Cdd:PHA03247  2983 APSREAPASSTPPL 2996
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
791-1029 3.63e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 54.27  E-value: 3.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  791 NPQQFPGQPAMMQPMAHYPSQPVFAPMLQSNPRMLTSG----SHPQAI----VSSS------TPQYPSAEQPTPQALYAT 856
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyekyKEPEPIpdlqVDASlwgvapKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  857 VHQSYPHHAT------QLHAHQPQPATTPTGS---QPQSQHAAPSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSL 927
Cdd:pfam09770  186 LPAPSRKMMSleeveaAMRAQAKKPAQQPAPApaqPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTI 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  928 PPGPSAQSPQSSFPQPAAVYAIHHQQLPhgfTNMAHVTQAhvqtgitaappphpGAPHPPQVMLLHPPQSHGGPPQGAVP 1007
Cdd:pfam09770  266 LQRPQSPQPDPAQPSIQPQAQQFHQQPP---PVPVQPTQI--------------LQNPNRLSAARVGYPQNPQPGVQPAP 328
                          250       260
                   ....*....|....*....|..
gi 1908918786 1008 QSGVPALSASTPSPYPYIGHPQ 1029
Cdd:pfam09770  329 AHQAHRQQGSFGRQAPIITHPQ 350
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
815-952 6.93e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.18  E-value: 6.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  815 APMLQSNPRMLTSGSHPQaivssstPQYpsaEQPTPQALYATVHQSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPV 894
Cdd:TIGR01628  380 PRMRQLPMGSPMGGAMGQ-------PPY---YGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRN 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  895 QQHQAGQAPhlgsgqpqqnlyhpgaltgtPPSLPPGPSAQSPQSSFPQPAAVYAIHHQ 952
Cdd:TIGR01628  450 AQNAAQKPP--------------------MQPVMYPPNYQSLPLSQDLPQPQSTASQG 487
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
844-1054 9.03e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.02  E-value: 9.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  844 SAEQPTPQAlyatvhqsYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLgsgQPQQNLYHPGALTGT 923
Cdd:pfam09770  103 NRQQPAARA--------AQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPIPDL---QVDASLWGVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  924 PPSLPPGPSAQSPQSSF-------------------PQPAAVYAIHHQQLPHGftnmahvtQAHVQTGITAAPPPHPGAP 984
Cdd:pfam09770  172 APAPAPQPAAQPASLPApsrkmmsleeveaamraqaKKPAQQPAPAPAQPPAA--------PPAQQAQQQQQFPPQIQQQ 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  985 HPPQVMLLHPPQSHGGPPQGAV---PQSGVPALSASTPSPYPYIGH----PQGEQPGQA---PGFPGGADDRIPPLPPPG 1054
Cdd:pfam09770  244 QQPQQQPQQPQQHPGQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHqqppPVPVQPTQIlqnPNRLSAARVGYPQNPQPG 323
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.62e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.62e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.54e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.54e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-943 3.63e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  398 ARGINGGPSRMSPKAQRPLRGAKTLSSPSNR--PSGETSVPPPPAVGRMYPPRSPKSAAPAPisASCPEPPIGSAV-PTS 474
Cdd:PHA03247  2456 ARTILGAPFSLSLLLGELFPGAPVYRRPAEArfPFAAGAAPDPGGGGPPDPDAPPAPSRLAP--AILPDEPVGEPVhPRM 2533
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  475 SASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPG-RTLEPQELARiaGKVPGLQNEQKRFQLEELRKFGAQFK 553
Cdd:PHA03247  2534 LTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPRPApRPSEPAVTSR--ARRPDAPPQSARPRAPVDDRGDPRGP 2611
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  554 LQPSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVS------------------DKEDKPPLAPS 615
Cdd:PHA03247  2612 APPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrarrlgraaqassppQRPRRRAARPT 2691
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  616 GGTEGPEQPPPPCPSQTGSPPVGLIKGEDKDEGP-VAEQVKKSTLNPNAKEFNPTKPLL--SVNKSTSTPTSPGPRTHST 692
Cdd:PHA03247  2692 VGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPaAARQASPALPAAPAPPAVPAGPATpgGPARPARPPTTAGPPAPAP 2771
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  693 PSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaa 772
Cdd:PHA03247  2772 PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG------ 2844
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  773 aagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyPSQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2845 ----------PPPPSLPLGGSVAPGGDVRRRPP---SRSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQP 2911
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  851 QAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLP-P 929
Cdd:PHA03247  2912 QA---------PPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPqP 2982
                          570
                   ....*....|....
gi 1908918786  930 GPSAQSPQSSFPQP 943
Cdd:PHA03247  2983 APSREAPASSTPPL 2996
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
791-1029 3.63e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 54.27  E-value: 3.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  791 NPQQFPGQPAMMQPMAHYPSQPVFAPMLQSNPRMLTSG----SHPQAI----VSSS------TPQYPSAEQPTPQALYAT 856
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyekyKEPEPIpdlqVDASlwgvapKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  857 VHQSYPHHAT------QLHAHQPQPATTPTGS---QPQSQHAAPSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSL 927
Cdd:pfam09770  186 LPAPSRKMMSleeveaAMRAQAKKPAQQPAPApaqPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTI 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  928 PPGPSAQSPQSSFPQPAAVYAIHHQQLPhgfTNMAHVTQAhvqtgitaappphpGAPHPPQVMLLHPPQSHGGPPQGAVP 1007
Cdd:pfam09770  266 LQRPQSPQPDPAQPSIQPQAQQFHQQPP---PVPVQPTQI--------------LQNPNRLSAARVGYPQNPQPGVQPAP 328
                          250       260
                   ....*....|....*....|..
gi 1908918786 1008 QSGVPALSASTPSPYPYIGHPQ 1029
Cdd:pfam09770  329 AHQAHRQQGSFGRQAPIITHPQ 350
PHA03247 PHA03247
large tegument protein UL36; Provisional
336-850 1.74e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 1.74e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  336 GRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPGP-----------GSEARGINGG 404
Cdd:PHA03247  2513 SRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVPPPrpaprpsepavTSRARRPDAP 2592
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  405 PSRMSPKAQRPLRGAKTLSSPSNRPSGETSVPPPPAVGRmyPPRSPKSAAPAPISASCPEPPIGSAVPTSSASIPVTSSV 484
Cdd:PHA03247  2593 PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  485 SDPGVGSISPASPKISLAPTDVKELST----KEPGRTLEPQELARIAGKVPGLQNEQKRfqleelrkfgaqfklqpSSSP 560
Cdd:PHA03247  2671 GRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEPAPHALVSATPLPPGPAAAR-----------------QASP 2733
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  561 ENSLDPFPPrilkeePKGKEKEVDGllTSEPMGSPVSSKTESVSdkedKPPLAPSGGTEGPEQPPPPCPSQTGSPpvGLI 640
Cdd:PHA03247  2734 ALPAAPAPP------AVPAGPATPG--GPARPARPPTTAGPPAP----APPAAPAAGPPRRLTRPAVASLSESRE--SLP 2799
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  641 KGEDKDEGPVAEQVKKSTLNPNAKEFNPTKPllsvnKSTSTPTSPGPRthSTPSIPVLTAGqsGLYSPQyisyipqihmG 720
Cdd:PHA03247  2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPP-----PTSAQPTAPPPP--PGPPPPSLPLG--GSVAPG----------G 2860
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  721 PAVQAPQMYPYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQQFPGQP- 799
Cdd:PHA03247  2861 DVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPq 2940
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1908918786  800 AMMQPMAHYPSQPV---FAPMLQSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTP 850
Cdd:PHA03247  2941 PPLAPTTDPAGAGEpsgAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
829-992 2.94e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 51.58  E-value: 2.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  829 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQL--HAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLg 906
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHpgQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPV- 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  907 SGQPQQNLYHPgaltgtppSLPPGPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphP 986
Cdd:pfam09770  296 PVQPTQILQNP--------NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------P 349

                   ....*.
gi 1908918786  987 PQVMLL 992
Cdd:pfam09770  350 QQLAQL 355
PRK10263 PRK10263
DNA translocase FtsK; Provisional
809-940 4.08e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 51.24  E-value: 4.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  809 PSQPVFAPMLQSNPRMLTSGSHPQAIVSSSTP-----QYPSAEQPT-PQALYATVHQSYPHHATQLHAHQPQpATTPTGS 882
Cdd:PRK10263   740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPvapqpQYQQPQQPVaPQPQYQQPQQPVAPQPQYQQPQQPV-APQPQYQ 818
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1908918786  883 QPQSQHAAPSPVQQHQAGQAPhlgsgQPQQNLYHPGALTG----------TP-PS---LPPGPSAQSPQSSF 940
Cdd:PRK10263   819 QPQQPVAPQPQYQQPQQPVAP-----QPQDTLLHPLLMRNgdsrplhkptTPlPSldlLTPPPSEVEPVDTF 885
PHA03247 PHA03247
large tegument protein UL36; Provisional
668-1055 6.34e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 6.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  668 PTKPLLSVNKSTSTPTSPGPrTHSTPSIPVLTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQ-QGKYRG 746
Cdd:PHA03247  2595 SARPRAPVDDRGDPRGPAPP-SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRaRRLGRA 2673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  747 AKGSLPPQRSdqHQPASAPPMMQAAAAAGPPLVAATPYssyiPYNPQQFPGQPAMMQPMAHYPSQPVfAPMLQSNPRMLT 826
Cdd:PHA03247  2674 AQASSPPQRP--RRRAARPTVGSLTSLADPPPPPPTPE----PAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPA 2746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  827 SGSHPQAIVSSSTPQYPSA-EQPTPQALYATVhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPhl 905
Cdd:PHA03247  2747 GPATPGGPARPARPPTTAGpPAPAPPAAPAAG----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP-- 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  906 gSGQPQQNLYHPGALTGTPPSLPPGPSAQS------------------PQSSFPQPAAVYAIHHQQLPHGFTNMAHVTQA 967
Cdd:PHA03247  2821 -AASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggsvapggdvrrrppSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  968 HVQTGITAAPPPHPGAPHPPQVMLLHPPQSHGGPPQGAVPQSGVPALSASTPSPYPYIGHPQGEQPGQAPGFPGGADDRI 1047
Cdd:PHA03247  2900 LPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV 2979

                   ....*...
gi 1908918786 1048 PPLPPPGE 1055
Cdd:PHA03247  2980 PQPAPSRE 2987
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
652-956 1.86e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 48.88  E-value: 1.86e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  652 EQVKKSTLNPNAKEFNPTKPLLSVNKSTSTPTSPGPrthSTPSIPVLTagqsGLYSPQYISYIPQIH-------MGPAVQ 724
Cdd:pfam09770   98 EQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQS---QQPSKPVRT----GYEKYKEPEPIPDLQvdaslwgVAPKKA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  725 APQMYPYPVSNSVPGQQGKYR---------------GAKGSLPPQRSDQHQPASAPPMMQAAaaagpplvaatpyssyiP 789
Cdd:pfam09770  171 AAPAPAPQPAAQPASLPAPSRkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQ-----------------Q 233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  790 YNPQQFPGQPAMMQPMAHYPSQPvfapmlqsnprmlTSGSHPQAIVssstpQYPSAEQPTPqalyatvhqsyphhatqlh 869
Cdd:pfam09770  234 QQFPPQIQQQQQPQQQPQQPQQH-------------PGQGHPVTIL-----QRPQSPQPDP------------------- 276
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  870 aHQPQPattPTGSQPQSQHAAPSPVQQHQAGQAPHLGSgqPQQNLYHPGALTGTPPslPPGPSAQSPQSSFPQPAAVYAi 949
Cdd:pfam09770  277 -AQPSI---QPQAQQFHQQPPPVPVQPTQILQNPNRLS--AARVGYPQNPQPGVQP--APAHQAHRQQGSFGRQAPIIT- 347

                   ....*..
gi 1908918786  950 HHQQLPH 956
Cdd:pfam09770  348 HPQQLAQ 354
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
374-696 2.88e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  374 PGLSSLPPRGPHHLDNSS----PGPGSEARGINGGPSRMSP----KAQRPLRGAKTLSSP--SNRPSGETSVPPPPavgR 443
Cdd:PTZ00449   514 PEASGLPPKAPGDKEGEEgeheDSKESDEPKEGGKPGETKEgevgKKPGPAKEHKPSKIPtlSKKPEFPKDPKHPK---D 590
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  444 MYPPRSPKS--AAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPgVGSISPASPKISLAPTDVKE-LSTKEPGRTLEP 520
Cdd:PTZ00449   591 PEEPKKPKRprSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRP-PPPQRPSSPERPEGPKIIKSpKPPKSPKPPFDP 669
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  521 ---QELARIAGKVPGLQNEQKRFQL--EELRKFGAQFKLQPSSSPENSLDPFPPRIlkeepkgkekEVDGLLTSEPMGSP 595
Cdd:PTZ00449   670 kfkEKFYDDYLDAAAKSKETKTTVVldESFESILKETLPETPGTPFTTPRPLPPKL----------PRDEEFPFEPIGDP 739
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  596 VSsktESVSDKE-DKPPLAPSggtegpeqpPPPCPSQTGSPPVGLIKGEDKDEGPVAEqvkksTLNPNAKEFNPTKPlls 674
Cdd:PTZ00449   740 DA---EQPDDIEfFTPPEEER---------TFFHETPADTPLPDILAEEFKEEDIHAE-----TGEPDEAMKRPDSP--- 799
                          330       340
                   ....*....|....*....|..
gi 1908918786  675 vnkSTSTPTSPGprTHstPSIP 696
Cdd:PTZ00449   800 ---SEHEDKPPG--DH--PSLP 814
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
870-946 3.77e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.38  E-value: 3.77e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918786  870 AHQPQPATTPTGSQPQSqHAAPSPVQQHQAGQAphlgSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSSFPQPAAV 946
Cdd:PRK14971   389 APQPSAAAAASPSPSQS-SAAAQPSAPQSATQP----AGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPV 460
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
734-1053 5.00e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 5.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  734 SNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQQFPGQPAMMQPMAHYPSQPV 813
Cdd:pfam03154  145 SPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQST 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  814 FAPM------LQSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAHQPQP-ATTPTGSQPQS 886
Cdd:pfam03154  225 AAPHtliqqtPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPvPPQPFPLTPQS 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  887 QHA----APSPVQQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSS----FPQPAAVYAIHHQQLPHGF 958
Cdd:pfam03154  305 SQSqvppGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTpipqLPNPQSHKHPPHLSGPSPF 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  959 TNMAHVTQAHVQTGITAAPPPHPGAPHPPQVMLLhpPQSHGGPPQGAVP-----QSGVPALSASTPSPYPYIGHPQGEQP 1033
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLM--PQSQQLPPPPAQPpvltqSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          330       340
                   ....*....|....*....|
gi 1908918786 1034 GQAPGFPGGADDRIPPLPPP 1053
Cdd:pfam03154  463 PQHPFVPGGPPPITPPSGPP 482
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
409-552 3.11e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 41.30  E-value: 3.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  409 SPKAQRPLRGAKTLSSPSNRPSGETSVPPPPAVGRMYPPrsPKSAAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPG 488
Cdd:PRK14971   370 SGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAA--AQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAV 447
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1908918786  489 VGSISPASPKISLAPTDVKELSTKEPGRtlEPQELARIAGKVPGLQNEQKRFQLEELRKFGAQF 552
Cdd:PRK14971   448 RPAQFKEEKKIPVSKVSSLGPSTLRPIQ--EKAEQATGNIKEAPTGTQKEIFTEEDLQYYWQEF 509
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
359-528 4.14e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.37  E-value: 4.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  359 GPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPGPGSEARGINGGPSRMSPKAQRPLRGAKTLSSPS-NRPSGETSVPP 437
Cdd:PRK07003   366 GAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPaTADRGDDAADG 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  438 PPAVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRT 517
Cdd:PRK07003   446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAA 525
                          170
                   ....*....|.
gi 1908918786  518 LEPQELARIAG 528
Cdd:PRK07003   526 APPAPEARPPT 536
PHA03247 PHA03247
large tegument protein UL36; Provisional
332-532 4.70e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 4.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  332 RQGSGRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHH-------------LDNSSPGPGSEA 398
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRraarptvgsltslADPPPPPPTPEP 2710
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  399 RGINGGPSRMSPKAQRPLRGAK--TLSSPSNRPSGETSVPP-----------PPAVGRMYPPRSPKSAAP--APISASCP 463
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATPggparparpptTAGPPAPAPPAAPAAGPPrrLTRPAVAS 2790
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1908918786  464 EPPIGSAVPTSSASIPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRTLEPQELARIAGKVPG 532
Cdd:PHA03247  2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
394-555 6.77e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 6.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  394 PGSEARGINGGPSRMSPKAQRPLRGAKTLSSPsnRPSGETSVPPPPAVGRMYPPRSP-----KSAAPAPISASCPEPPIG 468
Cdd:PRK14951   374 APAEKKTPARPEAAAPAAAPVAQAAAAPAPAA--APAAAASAPAAPPAAAPPAPVAApaaaaPAAAPAAAPAAVALAPAP 451
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  469 SAVPTSSASIPVTSSVSDPGVGS----ISPASPKISLAPTD--------VKELSTKEPGRTLePQELAriagkvpgLQNE 536
Cdd:PRK14951   452 PAQAAPETVAIPVRVAPEPAVASaapaPAAAPAAARLTPTEegdvwhatVQQLAAAEAITAL-ARELA--------LQSE 522
                          170       180
                   ....*....|....*....|....*...
gi 1908918786  537 ---------QKRFQLEELRKFGAQFKLQ 555
Cdd:PRK14951   523 lvardgdqwLLRVERESLNQPGARERLR 550
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
815-952 6.93e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.18  E-value: 6.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  815 APMLQSNPRMLTSGSHPQaivssstPQYpsaEQPTPQALYATVHQSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPV 894
Cdd:TIGR01628  380 PRMRQLPMGSPMGGAMGQ-------PPY---YGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRN 449
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  895 QQHQAGQAPhlgsgqpqqnlyhpgaltgtPPSLPPGPSAQSPQSSFPQPAAVYAIHHQ 952
Cdd:TIGR01628  450 AQNAAQKPP--------------------MQPVMYPPNYQSLPLSQDLPQPQSTASQG 487
PRK10927 PRK10927
cell division protein FtsN;
841-945 8.45e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 39.66  E-value: 8.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  841 QYPSAEQpTPQALYATVH-QSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQhQAGQAPHLGSGQPQQNLYHPGA 919
Cdd:PRK10927   138 EVPWNEQ-TPEQRQQTLQrQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQA-QPRQSKPASTQQPYQDLLQTPA 215
                           90       100
                   ....*....|....*....|....*..
gi 1908918786  920 LTGTPPSLP-PGPSAQSPQSsfPQPAA 945
Cdd:PRK10927   216 HTTAQSKPQqAAPVTRAADA--PKPTA 240
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
844-1054 9.03e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.02  E-value: 9.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  844 SAEQPTPQAlyatvhqsYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQQHQAGQAPHLgsgQPQQNLYHPGALTGT 923
Cdd:pfam09770  103 NRQQPAARA--------AQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEPIPDL---QVDASLWGVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  924 PPSLPPGPSAQSPQSSF-------------------PQPAAVYAIHHQQLPHGftnmahvtQAHVQTGITAAPPPHPGAP 984
Cdd:pfam09770  172 APAPAPQPAAQPASLPApsrkmmsleeveaamraqaKKPAQQPAPAPAQPPAA--------PPAQQAQQQQQFPPQIQQQ 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  985 HPPQVMLLHPPQSHGGPPQGAV---PQSGVPALSASTPSPYPYIGH----PQGEQPGQA---PGFPGGADDRIPPLPPPG 1054
Cdd:pfam09770  244 QQPQQQPQQPQQHPGQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHqqppPVPVQPTQIlqnPNRLSAARVGYPQNPQPG 323
PHA03369 PHA03369
capsid maturational protease; Provisional
396-709 9.39e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 39.98  E-value: 9.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  396 SEARGINGGPSRMSPKAQRPLRGAKTLSSPSnRPSGETSVPPPPAVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTss 475
Cdd:PHA03369   355 APSRVLAAAAKVAVIAAPQTHTGPADRQRPQ-RPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPE-- 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  476 asiPVTSSVSDPGVGSISPASPKISLAPTDVKELSTKEPGRTLEP--QELARIAGKVPGL--QNEQKRFQLEELRKFGAQ 551
Cdd:PHA03369   432 ---PVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElkEELIETLKLVKKLkeEQESLAKELEATAHKSEI 508
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918786  552 fklqpSSSPENSLDPFPPRILKEEPKGKEKEVDGLLTSEPMGSPVSSKTESVSDKEDKPPLAPSGGTEGPEQPPPPCPSQ 631
Cdd:PHA03369   509 -----KKIAESEFKNAGAKTAAANIEPNCSADAAAPATKRARPETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAA 583
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918786  632 TGSPpvgliKGEDKDEGPVAEQVKKSTLNPnaKEFNPTKPLLSVNKSTSTPTSPgPRTHSTPSIPVLTAGQSGLYSPQ 709
Cdd:PHA03369   584 GQGS-----DTAEALAGAIETLLTQASAQP--AGLSLPAPAVPVNASTPASTPP-PLAPQEPPQPGTSAPSLETSLPQ 653
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH