NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1958788272|ref|XP_038934580|]
View 

adenomatous polyposis coli protein 2 isoform X7 [Rattus norvegicus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1770-2102 1.71e-96

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


:

Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 315.66  E-value: 1.71e-96
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1770 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1849
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1850 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTG------GPLPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1921
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPgrlpgsGGRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1922 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1996
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1997 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2074
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1958788272 2075 RPETVKRYASLPHISVSRRPDSAVSVPT 2102
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
362-435 6.01e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


:

Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 6.01e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788272  362 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 435
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.89e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


:

Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 5.89e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788272    6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-211 1.26e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


:

Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.61  E-value: 1.26e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 203
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788272  204 MEERFGTS 211
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 super family cl25003
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
701-947 5.36e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


The actual alignment was detected with superfamily member pfam16629:

Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  701 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 780
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  781 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 843
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  844 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 919
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788272  920 AAHTSLSNDSLNSGSTSDGYCTREHMTP 947
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
619-658 1.44e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.44e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788272  619 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 658
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1604-1625 7.81e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


:

Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.81e-05
                           10        20
                   ....*....|....*....|..
gi 1958788272 1604 SPRAEEELLQRCISLAMPRRRT 1625
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1510-1935 9.64e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 9.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1510 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1586
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1587 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1655
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1656 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1735
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1736 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1815
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1816 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1894
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788272 1895 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1935
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2047-2286 3.75e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.72  E-value: 3.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2047 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2125
Cdd:pfam17823  106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2126 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2174
Cdd:pfam17823  181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2175 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2250
Cdd:pfam17823  261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958788272 2251 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2286
Cdd:pfam17823  331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1383-1405 6.94e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.94e-04
                           10        20
                   ....*....|....*....|...
gi 1958788272 1383 DDSGTDSAEGTPVNFSSAASLSD 1405
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
660-700 3.22e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


:

Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.05  E-value: 3.22e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958788272  660 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 700
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1259-1280 3.71e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.71e-03
                           10        20
                   ....*....|....*....|..
gi 1958788272 1259 SVRFTVEKPDENFSCASSLSAL 1280
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1147-1170 5.95e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


:

Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.95e-03
                           10        20
                   ....*....|....*....|....
gi 1958788272 1147 SSSSENCVQETPLVLSRCSSVSSL 1170
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1770-2102 1.71e-96

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 315.66  E-value: 1.71e-96
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1770 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1849
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1850 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTG------GPLPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1921
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPgrlpgsGGRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1922 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1996
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1997 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2074
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1958788272 2075 RPETVKRYASLPHISVSRRPDSAVSVPT 2102
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
362-435 6.01e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 6.01e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788272  362 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 435
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.89e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 5.89e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788272    6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-211 1.26e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.61  E-value: 1.26e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 203
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788272  204 MEERFGTS 211
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
701-947 5.36e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  701 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 780
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  781 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 843
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  844 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 919
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788272  920 AAHTSLSNDSLNSGSTSDGYCTREHMTP 947
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
619-658 1.44e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.44e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788272  619 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 658
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1781-2287 1.66e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 1.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1781 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1860
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1861 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1933
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1934 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2013
Cdd:PHA03247  2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2014 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2093
Cdd:PHA03247  2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2094 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2173
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2174 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2253
Cdd:PHA03247  2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1958788272 2254 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2287
Cdd:PHA03247  2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
618-658 2.30e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.30e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1958788272   618 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 658
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1604-1625 7.81e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.81e-05
                           10        20
                   ....*....|....*....|..
gi 1958788272 1604 SPRAEEELLQRCISLAMPRRRT 1625
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 PHA03247
large tegument protein UL36; Provisional
1510-1935 9.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 9.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1510 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1586
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1587 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1655
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1656 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1735
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1736 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1815
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1816 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1894
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788272 1895 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1935
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2047-2286 3.75e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.72  E-value: 3.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2047 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2125
Cdd:pfam17823  106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2126 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2174
Cdd:pfam17823  181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2175 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2250
Cdd:pfam17823  261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958788272 2251 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2286
Cdd:pfam17823  331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1383-1405 6.94e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.94e-04
                           10        20
                   ....*....|....*....|...
gi 1958788272 1383 DDSGTDSAEGTPVNFSSAASLSD 1405
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
660-700 3.22e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.05  E-value: 3.22e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958788272  660 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 700
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
2-86 3.59e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.20  E-value: 3.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272    2 ASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSGQTEVLEQLKAL 79
Cdd:COG4372     27 AALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAELAQAQEELESL 106

                   ....*..
gi 1958788272   80 QTDISSL 86
Cdd:COG4372    107 QEEAEEL 113
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1259-1280 3.71e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.71e-03
                           10        20
                   ....*....|....*....|..
gi 1958788272 1259 SVRFTVEKPDENFSCASSLSAL 1280
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
6-241 4.13e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 4.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272    6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ--EARVLVSSGQTEVLEQLKALQTDI 83
Cdd:TIGR02168  267 EKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEleAQLEELESKLDELAEELAELEEKL 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272   84 SSLYNLKFHAPALGPEPAAQTPEGSpvHGPAPSKDSFGELSRATIRLLEELDQERCFLLSEIekeekeklwyySQLQGLS 163
Cdd:TIGR02168  347 EELKEELESLEAELEELEAELEELE--SRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-----------ARLERLE 413
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788272  164 KRLDELpHVDTPVPRSQQFSMQMDLIRQQLEFEAQHIRSLMEERfgtsDEMVQRAQIRASRLEQIDKELLEAQDRVQQ 241
Cdd:TIGR02168  414 DRRERL-QQEIEELLKKLEEAELKELQAELEELEEELEELQEEL----ERLEEALEELREELEEAEQALDAAERELAQ 486
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1147-1170 5.95e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.95e-03
                           10        20
                   ....*....|....*....|....
gi 1958788272 1147 SSSSENCVQETPLVLSRCSSVSSL 1170
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
 
Name Accession Description Interval E-value
APC_basic pfam05956
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ...
1770-2102 1.71e-96

APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.


Pssm-ID: 428690 [Multi-domain]  Cd Length: 336  Bit Score: 315.66  E-value: 1.71e-96
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1770 AMLRGRTVIYTASpASRAQSKGISGPCSAPKKMgtsgttqpETATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPP 1849
Cdd:pfam05956    1 VVFRGRTVIYMPG-VKESQPSTSPPPKKTPPKT--------DAPAKNPNLGQQRSRSLHRLGKPSELADLSPPKRSATPP 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1850 ARLTKTPSSSSSQTSPASQSLPRRSPLATPTG------GPLPGPGGSPVPKSPARALLAKQ--HKTQKSPVRIPFMQRPA 1921
Cdd:pfam05956   72 ARISKAPSSGSSRDSTPSRPPQKKLTSPSQSPgrlpgsGGRNKLSPLPKTKSPARASTKKSgsHKTQKSPVRIPFMQTPT 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1922 R-----RVPPPLARPSPEPgsRGRAGAEGTPGARGSRLGLVRVASTRSSGSESSDRSgFRRQLTFIKESPGLLRRRRSEL 1996
Cdd:pfam05956  152 KqtglpRNPSPLVTNQPEP--RSESASKGLRSLPGKRLDLVRMSSARSSGSESDRSG-FLRQLTFIKESPSLLLRRRLEL 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1997 SSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASPRQPLAA--QRVPQAKPGLAPRAPRRTSSESPSRLPVRATPG 2074
Cdd:pfam05956  229 SASESLSPSSQPASPRRSRPGLPAVFLCSSRCQELKGWRKQPPNPnsRAEPSDRPLTRRRPPRRTSSESPSRLPVRNGTW 308
                          330       340
                   ....*....|....*....|....*...
gi 1958788272 2075 RPETVKRYASLPHISVSRRPDSAVSVPT 2102
Cdd:pfam05956  309 KRETFKRYSSLPHINVWRRTGSSSSILS 336
APC_rep pfam18797
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ...
362-435 6.01e-36

Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.


Pssm-ID: 465870  Cd Length: 74  Bit Score: 131.52  E-value: 6.01e-36
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1958788272  362 SQPDQGLARKEMRVLHVLEQIRAYCETCWDWLQARDSGTESGTGDTPVPIEPQICQAACAVMKLSFDEEYRRAM 435
Cdd:pfam18797    1 SQPDDKRGRREMRVLHLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
APC_N_CC pfam16689
Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil ...
6-57 5.89e-26

Coiled-coil N-terminus of APC, dimerization domain; APC_N_CC is the N-terminal, coiled-coil dimerization domain of the adenomatosis polyposis coli (APC) tumour-repressor proteins. It plays a key role in the regulation of cellular levels of the oncogene product beta-catenin. Coiled-coil regions are binding repeats that in this case bind to the armadillo repeat region of beta-catenin.


Pssm-ID: 435517  Cd Length: 52  Bit Score: 101.99  E-value: 5.89e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1958788272    6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKL 57
Cdd:pfam16689    1 ASYDQLLRQVEALKLENTTLRQELRDNSSHLSKLETEASNMKEVLKHLQGSI 52
Suppressor_APC pfam11414
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ...
124-211 1.26e-17

Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.


Pssm-ID: 463275  Cd Length: 82  Bit Score: 79.61  E-value: 1.26e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  124 SRATIRLLEELDQERCFLLSEIEKEEKEKLWYYSQLQGLSKRLDELPHVDTpvprsqQFSMQMDLIRQQLEFEAQHIRSL 203
Cdd:pfam11414    1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGT------YFDYGSDAQQERLEFLLARIQEV 74

                   ....*...
gi 1958788272  204 MEERFGTS 211
Cdd:pfam11414   75 NRCLGGLI 82
Arm_APC_u3 pfam16629
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ...
701-947 5.36e-17

Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.


Pssm-ID: 435476  Cd Length: 293  Bit Score: 83.87  E-value: 5.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  701 HRPAKYQAAAMaVSPGTCVPSLYVRKQRALEAELDTRHLVHALGHLEKQGlPEAETTSKKplpplRHLDGLVQDYASDSG 780
Cdd:pfam16629    1 NRPAKYKDANI-MSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLS-PKASHRNKQ-----RHKQNVYSEYVLDSG 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  781 CFDDddapsLAAAATTAEPASPAVMSMFLGGPFLQGQAlARTPPARQGGLEAEKE-----------------TGGEAAVA 843
Cdd:pfam16629   74 RHDD-----SVCRSDNFNTGNVTVLSPYLNTTVLPSSS-SRDSRGNAESSRSEKDrsldrergaglsnfhpaTENSGNSS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272  844 AKAKAKLALAVARIDRLVEDISALHTSSDDSFSLSSGDP---GQEAPREGRAQSCSPCRGTEG-GRREAGSRAHPLLRLK 919
Cdd:pfam16629  148 KRIGMQISTTAAQIAKVMEEVSSMHISQEDRSSGSTSDMhcmQDDRNSIRRSSTAHPHSNVYSfNKSESSNRPCPMPYMK 227
                          250       260
                   ....*....|....*....|....*...
gi 1958788272  920 AAHTSLSNDSLNSGSTSDGYCTREHMTP 947
Cdd:pfam16629  228 MEYKRASNDSLNSVSSSDGYGKRGQMKP 255
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
619-658 1.44e-06

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 46.68  E-value: 1.44e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1958788272  619 EDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 658
Cdd:pfam00514    2 PENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
PHA03247 PHA03247
large tegument protein UL36; Provisional
1781-2287 1.66e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 1.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1781 ASPASRAQSKGISGPCSAPKKMGTSGTTQpETATKTPsPEQQRSRSlhrPGKISELAALSHPPrSATPPArlTKTPSSSS 1860
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAP-PQSARPRA---PVDDRGDPRGPAPP-SPLPPD--THAPDPPP 2628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1861 SQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPKSPARALLAKQHKTQKSPvripfMQRPARRVPPP-------LARPSP 1933
Cdd:PHA03247  2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-----PQRPRRRAARPtvgsltsLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1934 EPGSrgragAEGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTFIKESPGLLRRRRSELSSADSTVSTSQTASPCR 2013
Cdd:PHA03247  2704 PPPT-----PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2014 GRPALPAVFLCSSrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPETvkryaslphisvsrr 2093
Cdd:PHA03247  2779 PPRRLTRPAVASL--SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPP--------------- 2841
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2094 PDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRrikdedvPHILRSTLPATALPlRGSSPEDSPAGTPHRKtsdav 2173
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP-------ARPPVRRLARPAVS-RSTESFALPPDQPERP----- 2908
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2174 vQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPPASAPFTHEGlSVVTGGFPTSRHGSPSRAARV 2253
Cdd:PHA03247  2909 -PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPGRVAVPRFRVPQPAPSR 2986
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1958788272 2254 PPFNYVPSPMV------VATMTSDSAVEKAPVTSPASLLE 2287
Cdd:PHA03247  2987 EAPASSTPPLTghslsrVSSWASSLALHEETDPPPVSLKQ 3026
ARM smart00185
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ...
618-658 2.30e-06

Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.


Pssm-ID: 214547 [Multi-domain]  Cd Length: 41  Bit Score: 46.27  E-value: 2.30e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1958788272   618 REDYRQVLRDHNCLQTLLQHLTSHSLTIVSNACGTLWNLSA 658
Cdd:smart00185    1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1918-2252 8.67e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 8.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1918 QRPARRVPPPLARPSPEPGSRGRAGAeGTPGARGSRLGLVRVASTRSSGSESSDRSGFRRQLTfikESPGLLRRRRSELS 1997
Cdd:PHA03307    80 PANESRSTPTWSLSTLAPASPAREGS-PTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPV---GSPGPPPAASPPAA 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1998 SADSTVSTSQTASPCRGRPALPAVflcssrcDELRASPRQPLAAQRVPQAKPGLAPRAPRRTSSESPSRLPVRATPGRPE 2077
Cdd:PHA03307   156 GASPAAVASDAASSRQAALPLSSP-------EETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSA 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2078 TVKRYASLPHISVSRRPDSAVSvpttQANATRRGSDGEARpLPRVAAPGTTWrriKDEDVPHILRStlPATALPLRGSSP 2157
Cdd:PHA03307   229 ADDAGASSSDSSSSESSGCGWG----PENECPLPRPAPIT-LPTRIWEASGW---NGPSSRPGPAS--SSSSPRERSPSP 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2158 EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAPLGSDVDGPVLAKPP----ASAPFTHEGLS 2233
Cdd:PHA03307   299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSsprkRPRPSRAPSSP 378
                          330
                   ....*....|....*....
gi 1958788272 2234 VVTGGFPTSRHGSPSRAAR 2252
Cdd:PHA03307   379 AASAGRPTRRRARAAVAGR 397
SAMP pfam05924
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ...
1604-1625 7.81e-05

SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.


Pssm-ID: 461782  Cd Length: 22  Bit Score: 41.42  E-value: 7.81e-05
                           10        20
                   ....*....|....*....|..
gi 1958788272 1604 SPRAEEELLQRCISLAMPRRRT 1625
Cdd:pfam05924    1 SPDDEDDLLQECINSAMPKKRR 22
PHA03247 PHA03247
large tegument protein UL36; Provisional
1510-1935 9.64e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 9.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1510 DSDEEPPATAPPTRRASAIPRALKREKPAGRKETP---TRATQPATLPVRAQPRLIVDETPPCYSLTSSASSLSEPEASE 1586
Cdd:PHA03247  2546 DDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPavtSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPD 2625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1587 QPACHPR---VEEQGSKQDSSPRAEEELLQRCISLAMPRRRTQVPS--------SRRRKPRAVRSDIRPteLPQKCREEV 1655
Cdd:PHA03247  2626 PPPPSPSpaaNEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraaqasspPQRPRRRAARPTVGS--LTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1656 PGSDPASdldsVEWQAIQEGANSIVTWLHQAAAKASLEASSESDSLLSLVSGLSASSTLQPSKLRKGRKPVAEAGGAWRP 1735
Cdd:PHA03247  2704 PPPTPEP----APHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGP 2779
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1736 EKRGTTSTKGSGSPRFPSGPEKAKGTQKTMAgesAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGTTQPETATK 1815
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1816 TPSPEQQRSRSLHRPgkISELAALSHPP-RSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPVPK 1894
Cdd:PHA03247  2857 APGGDVRRRPPSRSP--AAKPAAPARPPvRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788272 1895 SPARALLAKQHKTQKSPVRIPFMQRPA---------------RRVPPPL-ARPSPEP 1935
Cdd:PHA03247  2935 PPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgrvavprFRVPQPApSREAPAS 2991
PHA03247 PHA03247
large tegument protein UL36; Provisional
1734-2208 3.26e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 3.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1734 RPEKRGTTSTKGSGSPRFPSGPEKAKGtqKTMAGESAMLRGRTVIYTASPASRAQSKGISGPCSAPKKMGTSGT-TQPET 1812
Cdd:PHA03247  2572 RPAPRPSEPAVTSRARRPDAPPQSARP--RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPpTVPPP 2649
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1813 ATKTPSPEQQRSRSLHRPGKISELAALSHPPRSATPPARLTKTPSSSSSQTSPASQSLPRRSPLATPTGGPLPGPGGSPV 1892
Cdd:PHA03247  2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAAR 2729
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1893 PKSPARALLAKQHKTQKSPVrIPFMQRPARRVPPPLARPSPEPGsrgrAGAEGTPGARGSRLGLVRVASTRSSGSESsdr 1972
Cdd:PHA03247  2730 QASPALPAAPAPPAVPAGPA-TPGGPARPARPPTTAGPPAPAPP----AAPAAGPPRRLTRPAVASLSESRESLPSP--- 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1973 sgfrrqltfikespgllrrrrSELSSADSTVSTSQTASPCRGRPALPAVFLCSSrcdeLRASPRQPLAAQRVPQAKPG-L 2051
Cdd:PHA03247  2802 ---------------------WDPADPPAAVLAPAAALPPAASPAGPLPPPTSA----QPTAPPPPPGPPPPSLPLGGsV 2856
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2052 APRAPRRTSSESPSRLPVRATPGRPetvkRYASLPHISVSRRPDSAVSVPTTQANATRRGSDGEARPLPRVAAPGTTWRR 2131
Cdd:PHA03247  2857 APGGDVRRRPPSRSPAAKPAAPARP----PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1958788272 2132 IKDEDVPhilrstlPATALPLRGSSPEDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPISGPVAP 2208
Cdd:PHA03247  2933 PPPPPRP-------QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLS 3002
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
2047-2286 3.75e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.72  E-value: 3.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2047 AKPGLAPRAPRRTSSESPSRLPVRATPGRPETVKRYASLPHISVSRRPDSAVS-VPTTQANATRRGSdgearPLPRVAAP 2125
Cdd:pfam17823  106 AADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPrAAIAAASAPHAAS-----PAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2126 GTTW--RRIKDEDVPHILRSTLPATALPLRGSSPEDSPAGTPHRKTSDAVV----------------------------- 2174
Cdd:pfam17823  181 STTAasSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVgnsspaagtvtaavgtvtpaalatlaaaa 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2175 QTEDVATSKTNS----STSPSLESRDPPQAPISGPVAPLGSDVDGPVLAkppasapFTHEGLSVVTGGFPTSrhgSPSRA 2250
Cdd:pfam17823  261 GTVASAAGTINMgdphARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQ-------VSTDQPVHNTAGEPTP---SPSNT 330
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1958788272 2251 ARVPPFNYVPSPMVVATMTSDSAVEKAPVTSPASLL 2286
Cdd:pfam17823  331 TLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
PHA03247 PHA03247
large tegument protein UL36; Provisional
1985-2217 5.08e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 5.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 1985 SPGLLRRRRSELSSADSTVSTSQTASPCRGRPALPAVFLCSSRCDELRASP-RQPLAAQRVPQAKPGLAPRAPR------ 2057
Cdd:PHA03247   256 APPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPlALPAPPDPPPPAPAGDAEEEDDedgame 335
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2058 ------RTSSESPSRLPVRATPgrpeTVKRYASLPHISVSRRPDSAVSVPTTQANATR-------RGSDGEARPLPRVAA 2124
Cdd:PHA03247   336 vvsplpRPRQHYPLGFPKRRRP----TWTPPSSLEDLSAGRHHPKRASLPTRKRRSARhaatpfaRGPGGDDQTRPAAPV 411
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272 2125 PGTTwrriKDEDVPHILRSTLPATALPLRGSSP--EDSPAGTPHRKTSDAVVQTEDVATSKTNSSTSPSLESRDPPQAPI 2202
Cdd:PHA03247   412 PASV----PTPAPTPVPASAPPPPATPLPSAEPgsDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPG 487
                          250
                   ....*....|....*..
gi 1958788272 2203 SGPVAPLGS--DVDGPV 2217
Cdd:PHA03247   488 ADLAELLGRhpDTAGTV 504
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1383-1405 6.94e-04

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 38.90  E-value: 6.94e-04
                           10        20
                   ....*....|....*....|...
gi 1958788272 1383 DDSGTDSAEGTPVNFSSAASLSD 1405
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSS 23
Arm pfam00514
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ...
660-700 3.22e-03

Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.


Pssm-ID: 425727 [Multi-domain]  Cd Length: 41  Bit Score: 37.05  E-value: 3.22e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1958788272  660 SPRDQELLWDLGAVGMLRNLVHSKHKMIAMGSAAALRNLLA 700
Cdd:pfam00514    1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
2-86 3.59e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.20  E-value: 3.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272    2 ASSVASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQEARVL--VSSGQTEVLEQLKAL 79
Cdd:COG4372     27 AALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLqaAQAELAQAQEELESL 106

                   ....*..
gi 1958788272   80 QTDISSL 86
Cdd:COG4372    107 QEEAEEL 113
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1259-1280 3.71e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.59  E-value: 3.71e-03
                           10        20
                   ....*....|....*....|..
gi 1958788272 1259 SVRFTVEKPDENFSCASSLSAL 1280
Cdd:pfam05923    3 PKRYCVEGTPANFSRASSLSSL 24
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
6-241 4.13e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 4.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272    6 ASYEQLVRQVEALKAENTHLRQELRDNSSHLSKLETETSGMKEVLKHLQGKLEQ--EARVLVSSGQTEVLEQLKALQTDI 83
Cdd:TIGR02168  267 EKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEEleAQLEELESKLDELAEELAELEEKL 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958788272   84 SSLYNLKFHAPALGPEPAAQTPEGSpvHGPAPSKDSFGELSRATIRLLEELDQERCFLLSEIekeekeklwyySQLQGLS 163
Cdd:TIGR02168  347 EELKEELESLEAELEELEAELEELE--SRLEELEEQLETLRSKVAQLELQIASLNNEIERLE-----------ARLERLE 413
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1958788272  164 KRLDELpHVDTPVPRSQQFSMQMDLIRQQLEFEAQHIRSLMEERfgtsDEMVQRAQIRASRLEQIDKELLEAQDRVQQ 241
Cdd:TIGR02168  414 DRRERL-QQEIEELLKKLEEAELKELQAELEELEEELEELQEEL----ERLEEALEELREELEEAEQALDAAERELAQ 486
APC_r pfam05923
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ...
1147-1170 5.95e-03

APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.


Pssm-ID: 461781  Cd Length: 24  Bit Score: 36.20  E-value: 5.95e-03
                           10        20
                   ....*....|....*....|....
gi 1958788272 1147 SSSSENCVQETPLVLSRCSSVSSL 1170
Cdd:pfam05923    1 DSPKRYCVEGTPANFSRASSLSSL 24
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH