NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|4504191|ref|NP_000170|]
View 

DNA mismatch repair protein Msh6 isoform 1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MutS super family cl33816
DNA mismatch repair ATPase MutS [Replication, recombination and repair];
407-1360 1.04e-164

DNA mismatch repair ATPase MutS [Replication, recombination and repair];


The actual alignment was detected with superfamily member COG0249:

Pssm-ID: 440019 [Multi-domain]  Cd Length: 861  Bit Score: 515.38  E-value: 1.04e-164
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNwAHS-------GFPEIAFGRYSDSLVQKGYK 479
Cdd:COG0249    8 TPMMQQYLEIKAQYPDALLFFRMGDFYELFFEDAEKASRLLDITLTKRG-KGAgepipmaGVPYHAAEGYLAKLVKAGYK 86
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   480 VARVEQTETPEmmEARcrkmahiskydRVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLKEKEEdssghtrAYGVC 559
Cdd:COG0249   87 VAICEQVEDPA--EAK-----------GLVKREVVRVVTPGTLTEDALLDAKRNNY---LAAVARDKG-------RYGLA 143
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   560 FVDTSLGKFFIGQFSDDrhcSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEglIPGSQFwDASKTLRTLLE 639
Cdd:COG0249  144 WLDISTGEFLVTELDGE---EALLDELARLAPAEILVPEDLPDPEELLELLRERGAAVTR--LPDWAF-DPDAARRRLLE 217
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   640 eeYFREKLSDGIGVmlpqvlkgmtsesdsigltpgEKSELALSALGGCVFYLKKCLIdQELLSMANFEEYiplDSDtvst 719
Cdd:COG0249  218 --QFGVASLDGFGL---------------------EDLPAAIAAAGALLAYLEETQK-GALPHLRRLRRY---EED---- 266
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   720 trsgaiftkayQRMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLM 799
Cdd:COG0249  267 -----------DYLILDAATRRNLELTET-LRGGRKGSLLSVLDRTVTAMGSRLLRRWLLRPLRDRAAIEARLDAVEELL 334
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   800 VVPDKISEVVELLKKLPDLERLLSKI-HNVGSPlksqnhpdsRaimyeettyskkkiiDFLSALEGFKVMCKIIGIMEEV 878
Cdd:COG0249  335 EDPLLREELRELLKGVYDLERLLSRIaLGRANP---------R---------------DLAALRDSLAALPELKELLAEL 390
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   879 ADGFKSKILKQVislqtknpeGRFPDLTVELNRwdtAFDHE---KARKTGLItpKAGFDSDYDQaLADIREN-EQSLLEY 954
Cdd:COG0249  391 DSPLLAELAEAL---------DPLEDLAELLER---AIVDEpplLIRDGGVI--REGYDAELDE-LRELSENgKEWLAEL 455
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   955 LEKQRNRIGCRTIvywGIGRNR---YQLEIpenftTR----NLPEEYELKSTKKGCKRYWT---KTIEKKLANlinAEER 1024
Cdd:COG0249  456 EARERERTGIKSL---KVGYNKvfgYYIEV-----TKanadKVPDDYIRKQTLKNAERYITpelKELEDKILS---AEER 524
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1025 RDV-----------SLKDCMRRLfynfdknykdwQSAVECIAVLDVLLCLA------NYsrggdgpmCRPVIllpeDTPP 1087
Cdd:COG0249  525 ALAleyelfeelreEVAAHIERL-----------QALARALAELDVLASLAevavenNY--------VRPEL----DDSP 581
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1088 FLELKGSRHPCITKTFFGDDFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRL 1167
Cdd:COG0249  582 GIEIEGGRHPVVEQALPGEPFVPNDCDLDPDRR--------ILLITGPNMAGKSTYMRQVALIVLLAQIGSFVPAESARI 653
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1168 TPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFST 1247
Cdd:COG0249  654 GIVDRIFTRVGASDDLARGQSTFMVEMTETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAVAEYLHDKIRARTLFAT 733
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1248 HYH---SLVEDYSQ----NVAVRlghmacmvEnecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQ------ 1314
Cdd:COG0249  734 HYHeltELAEKLPGvknyHVAVK--------E------WGGDIVFLHKVVPGPADRSYGIHVAKLAGLPASVIErareil 799
                        970       980       990      1000      1010
                 ....*....|....*....|....*....|....*....|....*....|
gi 4504191  1315 ----KGHRKAREFEKMNQsLRLFrevclasersTVDAEAVHKLLTLIKEL 1360
Cdd:COG0249  800 aeleKGEAAAAGKAAPDQ-LSLF----------AAADPEPSPVLEELKAL 838
PWWP_MSH6 cd05837
PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called ...
90-193 1.37e-55

PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called G/T mismatch-binding protein (GTBP or GTMBP), MutS protein homolog 6, or MutS-alpha 160 kDa subunit (p160), is a mismatch repair protein homologous to bacterial MutS. It is a component of the post-replicative DNA mismatch repair system (MMR). It heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches thereby initiating DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, it forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. MSH6 contains a PWWP domain, but its role in MSH6 remains unclear. MSH6 orthologs found in Saccharomyces cerevisiae, Caenorhabditis elegans, and Arabidopsis thaliana lack the PWWP domain. PWWP domains typically recognize DNA and histone methylated lysines.


:

Pssm-ID: 438962  Cd Length: 103  Bit Score: 187.88  E-value: 1.37e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGtFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYSAK 169
Cdd:cd05837    1 FSPGDLVWAKLEGYPWWPSLVCNHPTTG-FHKKFGKKGEVHVQFFDDPPSRAWVKAKNVKPFTGSDDKEFQKGGMFFSKD 79
                         90       100
                 ....*....|....*....|....
gi 4504191   170 PEILRAMQRADEALNKDKIKRLEL 193
Cdd:cd05837   80 PKWKKAVKEADKALKLSVEERLKL 103
 
Name Accession Description Interval E-value
MutS COG0249
DNA mismatch repair ATPase MutS [Replication, recombination and repair];
407-1360 1.04e-164

DNA mismatch repair ATPase MutS [Replication, recombination and repair];


Pssm-ID: 440019 [Multi-domain]  Cd Length: 861  Bit Score: 515.38  E-value: 1.04e-164
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNwAHS-------GFPEIAFGRYSDSLVQKGYK 479
Cdd:COG0249    8 TPMMQQYLEIKAQYPDALLFFRMGDFYELFFEDAEKASRLLDITLTKRG-KGAgepipmaGVPYHAAEGYLAKLVKAGYK 86
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   480 VARVEQTETPEmmEARcrkmahiskydRVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLKEKEEdssghtrAYGVC 559
Cdd:COG0249   87 VAICEQVEDPA--EAK-----------GLVKREVVRVVTPGTLTEDALLDAKRNNY---LAAVARDKG-------RYGLA 143
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   560 FVDTSLGKFFIGQFSDDrhcSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEglIPGSQFwDASKTLRTLLE 639
Cdd:COG0249  144 WLDISTGEFLVTELDGE---EALLDELARLAPAEILVPEDLPDPEELLELLRERGAAVTR--LPDWAF-DPDAARRRLLE 217
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   640 eeYFREKLSDGIGVmlpqvlkgmtsesdsigltpgEKSELALSALGGCVFYLKKCLIdQELLSMANFEEYiplDSDtvst 719
Cdd:COG0249  218 --QFGVASLDGFGL---------------------EDLPAAIAAAGALLAYLEETQK-GALPHLRRLRRY---EED---- 266
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   720 trsgaiftkayQRMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLM 799
Cdd:COG0249  267 -----------DYLILDAATRRNLELTET-LRGGRKGSLLSVLDRTVTAMGSRLLRRWLLRPLRDRAAIEARLDAVEELL 334
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   800 VVPDKISEVVELLKKLPDLERLLSKI-HNVGSPlksqnhpdsRaimyeettyskkkiiDFLSALEGFKVMCKIIGIMEEV 878
Cdd:COG0249  335 EDPLLREELRELLKGVYDLERLLSRIaLGRANP---------R---------------DLAALRDSLAALPELKELLAEL 390
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   879 ADGFKSKILKQVislqtknpeGRFPDLTVELNRwdtAFDHE---KARKTGLItpKAGFDSDYDQaLADIREN-EQSLLEY 954
Cdd:COG0249  391 DSPLLAELAEAL---------DPLEDLAELLER---AIVDEpplLIRDGGVI--REGYDAELDE-LRELSENgKEWLAEL 455
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   955 LEKQRNRIGCRTIvywGIGRNR---YQLEIpenftTR----NLPEEYELKSTKKGCKRYWT---KTIEKKLANlinAEER 1024
Cdd:COG0249  456 EARERERTGIKSL---KVGYNKvfgYYIEV-----TKanadKVPDDYIRKQTLKNAERYITpelKELEDKILS---AEER 524
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1025 RDV-----------SLKDCMRRLfynfdknykdwQSAVECIAVLDVLLCLA------NYsrggdgpmCRPVIllpeDTPP 1087
Cdd:COG0249  525 ALAleyelfeelreEVAAHIERL-----------QALARALAELDVLASLAevavenNY--------VRPEL----DDSP 581
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1088 FLELKGSRHPCITKTFFGDDFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRL 1167
Cdd:COG0249  582 GIEIEGGRHPVVEQALPGEPFVPNDCDLDPDRR--------ILLITGPNMAGKSTYMRQVALIVLLAQIGSFVPAESARI 653
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1168 TPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFST 1247
Cdd:COG0249  654 GIVDRIFTRVGASDDLARGQSTFMVEMTETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAVAEYLHDKIRARTLFAT 733
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1248 HYH---SLVEDYSQ----NVAVRlghmacmvEnecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQ------ 1314
Cdd:COG0249  734 HYHeltELAEKLPGvknyHVAVK--------E------WGGDIVFLHKVVPGPADRSYGIHVAKLAGLPASVIErareil 799
                        970       980       990      1000      1010
                 ....*....|....*....|....*....|....*....|....*....|
gi 4504191  1315 ----KGHRKAREFEKMNQsLRLFrevclasersTVDAEAVHKLLTLIKEL 1360
Cdd:COG0249  800 aeleKGEAAAAGKAAPDQ-LSLF----------AAADPEPSPVLEELKAL 838
PRK05399 PRK05399
DNA mismatch repair protein MutS; Provisional
407-1360 1.09e-161

DNA mismatch repair protein MutS; Provisional


Pssm-ID: 235444 [Multi-domain]  Cd Length: 854  Bit Score: 506.94  E-value: 1.09e-161
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNwAHS-------GFPEIAFGRYSDSLVQKGYK 479
Cdd:PRK05399    9 TPMMQQYLEIKAQYPDALLFFRMGDFYELFFEDAKKASRLLDITLTKRG-KSAgepipmaGVPYHAAEGYLAKLVKKGYK 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    480 VARVEQTETPEMmearcrkmahiSKydRVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLKEKEEdssghtrAYGVC 559
Cdd:PRK05399   88 VAICEQVEDPAT-----------AK--GPVKREVVRIVTPGTVTDEALLDEKQNNY---LAAIAQDGG-------GYGLA 144
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    560 FVDTSLGKFFIGQFSDDRhcsrFRTLVAHYPPVQVLFEKgNLSKETKTILKSSLSCslqeglIPGSQFwDASKTLRTLLE 639
Cdd:PRK05399  145 YLDLSTGEFRVTELDEEE----LLAELARLNPAEILVPE-DFSEDELLLLRRGLRR------RPPWEF-DLDTAEKRLLE 212
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    640 eeYFREKLSDGIGVMLPqvlkgmtsesdsigltpgekseLALSALGGCVFYLKKCLIdQEL-----LSMANFEEYiplds 714
Cdd:PRK05399  213 --QFGVASLDGFGVDLP----------------------LAIRAAGALLQYLKETQK-RSLphlrsPKRYEESDY----- 262
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    715 dtvsttrsgaiftkayqrMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDA 794
Cdd:PRK05399  263 ------------------LILDAATRRNLELTEN-LRGGRKNSLLSVLDRTVTAMGGRLLRRWLHRPLRDREAIEARLDA 323
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    795 IEDLMVVPDKISEVVELLKKLPDLERLLSKI-HNVGSPlksqnhpdsRaimyeettyskkkiiDFLSALEGFKVMCKIIG 873
Cdd:PRK05399  324 VEELLEDPLLREDLRELLKGVYDLERLLSRIaLGRANP---------R---------------DLAALRDSLEALPELKE 379
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    874 IMEEVADGFKSKILKQVislqtknpeGRFPDLTVELNRwdtAFDHE---KARKTGLItpKAGFDSDYDQaLADIREN-EQ 949
Cdd:PRK05399  380 LLAELDSPLLAELAEQL---------DPLEELADLLER---AIVEEpplLIRDGGVI--ADGYDAELDE-LRALSDNgKD 444
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    950 SLLEYLEKQRNRIGCRTIvywGIGRNR---YQLEIPeNFTTRNLPEEYELKSTKKGCKRYWT---KTIEKKLANlinAEE 1023
Cdd:PRK05399  445 WLAELEARERERTGISSL---KVGYNKvfgYYIEVT-KANLDKVPEDYIRRQTLKNAERYITpelKELEDKILS---AEE 517
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1024 RRdVSL-KDCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLA------NYsrggdgpmCRPVIllpeDTPPFLELKGSRH 1096
Cdd:PRK05399  518 KA-LALeYELFEELREEVAEHIERLQKLAKALAELDVLASLAevaeenNY--------VRPEF----TDDPGIDIEEGRH 584
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1097 PCITKTFFGDDFIPNDILIgceeeeqeNGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTR 1176
Cdd:PRK05399  585 PVVEQVLGGEPFVPNDCDL--------DEERRLLLITGPNMAGKSTYMRQVALIVLLAQIGSFVPAESARIGIVDRIFTR 656
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1177 LGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEdy 1256
Cdd:PRK05399  657 IGASDDLASGRSTFMVEMTETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAVAEYLHDKIGAKTLFATHYHELTE-- 734
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1257 sqnVAVRLGHMACM----VEnecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEKMNQSLRl 1332
Cdd:PRK05399  735 ---LEEKLPGVKNVhvavKE------HGGDIVFLHKVVPGAADKSYGIHVAKLAGLPASVIKRAREILAQLESASEKAK- 804
                         970       980
                  ....*....|....*....|....*...
gi 4504191   1333 fREVCLASERSTVDAEAVHKLLTLIKEL 1360
Cdd:PRK05399  805 -AASAEEDQLSLFAEPEESPLLEALKAL 831
mutS1 TIGR01070
DNA mismatch repair protein MutS; [DNA metabolism, DNA replication, recombination, and repair]
407-1325 6.75e-124

DNA mismatch repair protein MutS; [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273427 [Multi-domain]  Cd Length: 840  Bit Score: 405.69  E-value: 6.75e-124
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNWAH------SGFPEIAFGRYSDSLVQKGYKV 480
Cdd:TIGR01070    2 TPMMQQYLKLKAEHPDALLFFRMGDFYELFYEDAKKAAQLLDISLTSRGQSAdepipmAGIPYHAVEAYLEKLVKQGESV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     481 ARVEQTETPEMMEArcrkmahiskydrVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLkekeedsSGHTRAYGVCF 560
Cdd:TIGR01070   82 AICEQIEDPKTAKG-------------PVEREVVQLITPGTVSDEALLPERQDNL---LAAI-------AQESNGFGLAT 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     561 VDTSLGKFFIGQFSDdrhcsrFRTLVAH---YPPVQVLFeKGNLSKETkTILKSSLSCSlqeglipgsqfwdaskTLRTL 637
Cdd:TIGR01070  139 LDLTTGEFKVTELAD------KETLYAElqrLNPAEVLL-AEDLSEME-AIELREFRKD----------------TAVMS 194
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     638 LEEEYFREKLSdGIGVmlpqvlkgmtsesdsigltpgEKSELALSALGGCVFYLKKCLiDQELLSMANFEEYIPLDSdtv 717
Cdd:TIGR01070  195 LEAQFGTEDLG-GLGL---------------------RNAPLGLTAAGCLLQYAKRTQ-RTALPHLQPVRLYELQDF--- 248
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     718 sttrsgaiftkayqrMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIED 797
Cdd:TIGR01070  249 ---------------MQLDAATRRNLELTEN-LRGGKQNTLFSVLDETKTAMGSRLLKRWLHRPLRDREVLEARQDTVEV 312
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     798 LMVVPDKISEVVELLKKLPDLERLLSKIhnvgsPLKSQNHPDsraimyeettyskkkiidfLSAL-EGFKVMCKIIGIME 876
Cdd:TIGR01070  313 LLRHFFLREGLRPLLKEVGDLERLAARV-----ALGNARPRD-------------------LARLrTSLEQLPELRALLE 368
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     877 EVADGFKSKILKQVislqtknpeGRFPDLTVELNRWDTAFDHEKARKTGLItpKAGFDSDYDQALADIRENEQSLLEYLE 956
Cdd:TIGR01070  369 ELEGPTLQALAAQI---------DDFSELLELLEAALIENPPLVVRDGGLI--REGYDEELDELRAASREGTDYLARLEA 437
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     957 KQRNRIGCRTIvywGIGRNR---YQLEIPeNFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCM 1033
Cdd:TIGR01070  438 RERERTGIPTL---KVGYNAvfgYYIEVT-RGQLHLVPAHYRRRQTLKNAERYITPELKEKEDKVLEAEGKILALEKELF 513
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1034 RRLFYNFDKNYKDWQSAVECIAVLDVLLCLA------NYSRggdgpmcrpvillPE-DTPPFLELKGSRHPCITKTFfGD 1106
Cdd:TIGR01070  514 EELRELLKKYLEALQEAARALAELDVLANLAevaetlHYTR-------------PRfGDDPQLRIREGRHPVVEQVL-RT 579
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1107 DFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSG 1186
Cdd:TIGR01070  580 PFVPNDLEMAHNRR--------MLLITGPNMGGKSTYMRQTALIALLAQIGSFVPAESAELPLFDRIFTRIGASDDLASG 651
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1187 ESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGH 1266
Cdd:TIGR01070  652 RSTFMVEMTEAANILHNATENSLVLFDEIGRGTSTYDGLALAWAIAEYLHEHIRAKTLFATHYFELTALEESLPGLKNVH 731
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191    1267 MACMVENecedpsqETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEK 1325
Cdd:TIGR01070  732 VAALEHN-------GTIVFLHQVLPGPASKSYGLAVAALAGLPKEVIARARQILTQLEA 783
ABC_MSH6_euk cd03286
ATP-binding cassette domain of eukaryotic MutS6 homolog; The MutS protein initiates DNA ...
1091-1313 8.17e-106

ATP-binding cassette domain of eukaryotic MutS6 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213253 [Multi-domain]  Cd Length: 218  Bit Score: 334.01  E-value: 8.17e-106
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1091 LKGSRHPCITKTFfGDDFIPNDILIGCEEeeqengkAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPI 1170
Cdd:cd03286    2 FEELRHPCLNAST-ASSFVPNDVDLGATS-------PRILVLTGPNMGGKSTLLRTVCLAVIMAQMGMDVPAKSMRLSLV 73
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1171 DRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYH 1250
Cdd:cd03286   74 DRIFTRIGARDDIMKGESTFMVELSETANILRHATPDSLVILDELGRGTSTHDGYAIAHAVLEYLVKKVKCLTLFSTHYH 153
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 4504191  1251 SLVEDYSQNVAVRLGHMACMVENEcEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVI 1313
Cdd:cd03286  154 SLCDEFHEHGGVRLGHMACAVKNE-SDPTIRDITFLYKLVAGICPKSYGLYVALMAGIPDGVV 215
MUTSac smart00534
ATPase domain of DNA mismatch repair MUTS family;
1129-1320 5.73e-94

ATPase domain of DNA mismatch repair MUTS family;


Pssm-ID: 197777 [Multi-domain]  Cd Length: 185  Bit Score: 299.86  E-value: 5.73e-94
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     1129 CVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHS 1208
Cdd:smart00534    1 VVIITGPNMGGKSTYLRQVALIVIMAQIGSFVPAESAELPVFDRIFTRIGASDSLAQGLSTFMVEMKETANILKNATKNS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     1209 LVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENecedpsqETITFLYK 1288
Cdd:smart00534   81 LVLLDELGRGTSTYDGLAIAAAILEYLLEKIGARTLFATHYHELTKLADNHPGVRNLHMSALEET-------ENITFLYK 153
                           170       180       190
                    ....*....|....*....|....*....|..
gi 4504191     1289 FIKGACPKSYGFNAARLANLPEEVIQKGHRKA 1320
Cdd:smart00534  154 LKPGVAGKSYGIEVAKLAGLPKEVIERAKRIL 185
MutS_V pfam00488
MutS domain V; This domain is found in proteins of the MutS family (DNA mismatch repair ...
1130-1324 1.18e-86

MutS domain V; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam05190. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain V of Thermus aquaticus MutS as characterized in, which contains a Walker A motif, and is structurally similar to the ATPase domain of ABC transporters.


Pssm-ID: 425714 [Multi-domain]  Cd Length: 188  Bit Score: 279.85  E-value: 1.18e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1130 VLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSL 1209
Cdd:pfam00488    1 LIITGPNMGGKSTYLRQVALIVLMAQIGSFVPAESAEIGIVDRIFTRIGASDDLAKGRSTFMVEMLETANILHNATDKSL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1210 VLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENecedpsqETITFLYKF 1289
Cdd:pfam00488   81 VILDELGRGTSTYDGLAIAWAVAEHLAEKIKARTLFATHYHELTKLAEKLPAVKNLHMAAVEDD-------DDIVFLYKV 153
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 4504191    1290 IKGACPKSYGFNAARLANLPEEVIQKGHRKAREFE 1324
Cdd:pfam00488  154 QPGAADKSYGIHVAELAGLPESVVERAREILAELE 188
PWWP_MSH6 cd05837
PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called ...
90-193 1.37e-55

PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called G/T mismatch-binding protein (GTBP or GTMBP), MutS protein homolog 6, or MutS-alpha 160 kDa subunit (p160), is a mismatch repair protein homologous to bacterial MutS. It is a component of the post-replicative DNA mismatch repair system (MMR). It heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches thereby initiating DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, it forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. MSH6 contains a PWWP domain, but its role in MSH6 remains unclear. MSH6 orthologs found in Saccharomyces cerevisiae, Caenorhabditis elegans, and Arabidopsis thaliana lack the PWWP domain. PWWP domains typically recognize DNA and histone methylated lysines.


Pssm-ID: 438962  Cd Length: 103  Bit Score: 187.88  E-value: 1.37e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGtFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYSAK 169
Cdd:cd05837    1 FSPGDLVWAKLEGYPWWPSLVCNHPTTG-FHKKFGKKGEVHVQFFDDPPSRAWVKAKNVKPFTGSDDKEFQKGGMFFSKD 79
                         90       100
                 ....*....|....*....|....
gi 4504191   170 PEILRAMQRADEALNKDKIKRLEL 193
Cdd:cd05837   80 PKWKKAVKEADKALKLSVEERLKL 103
PWWP pfam00855
PWWP domain; The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The domain ...
93-184 3.92e-25

PWWP domain; The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The domain binds to Histone-4 methylated at lysine-20, H4K20me, suggesting that it is methyl-lysine recognition motif. Removal of two conserved aromatic residues in a hydrophobic cavity created by this domain within the full-length protein, Pdp1, abolishes the interaction o f the protein with H4K20me3. In fission yeast, Set9 is the sole enzyme that catalyzes all three states of H4K20me, and Set9-mediated H4K20me is required for efficient recruitment of checkpoint protein Crb2 to sites of DNA damage. The methylation of H4K20 is involved in a diverse array of cellular processes, such as organizing higher-order chromatin, maintaining genome stability, and regulating cell-cycle progression.


Pssm-ID: 459964 [Multi-domain]  Cd Length: 92  Bit Score: 100.58  E-value: 3.92e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      93 GDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVR-VHVQFFDDSPTrGWVSKRLLKPYTGSKSKEAQKGGHFYSAKPE 171
Cdd:pfam00855    1 GDLVWAKLKGYPWWPARVVDPEELPENVLKPKKKDGeYLVRFFGDSEF-AWVKPKDLKPFDEGDEFEYLKKKKKKKKKKA 79
                           90
                   ....*....|...
gi 4504191     172 ILRAMQRADEALN 184
Cdd:pfam00855   80 FKKALEEAEEALK 92
PWWP smart00293
domain with conserved PWWP motif; conservation of Pro-Trp-Trp-Pro residues
90-152 3.35e-22

domain with conserved PWWP motif; conservation of Pro-Trp-Trp-Pro residues


Pssm-ID: 214603 [Multi-domain]  Cd Length: 63  Bit Score: 91.25  E-value: 3.35e-22
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4504191       90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTrGWVSKRLLKPYT 152
Cdd:smart00293    1 FKPGDLVWAKMKGFPWWPALVISPKMTPDNImKRKSDENLYPVLFFGDKDT-AWIPSSKLFPLT 63
 
Name Accession Description Interval E-value
MutS COG0249
DNA mismatch repair ATPase MutS [Replication, recombination and repair];
407-1360 1.04e-164

DNA mismatch repair ATPase MutS [Replication, recombination and repair];


Pssm-ID: 440019 [Multi-domain]  Cd Length: 861  Bit Score: 515.38  E-value: 1.04e-164
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNwAHS-------GFPEIAFGRYSDSLVQKGYK 479
Cdd:COG0249    8 TPMMQQYLEIKAQYPDALLFFRMGDFYELFFEDAEKASRLLDITLTKRG-KGAgepipmaGVPYHAAEGYLAKLVKAGYK 86
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   480 VARVEQTETPEmmEARcrkmahiskydRVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLKEKEEdssghtrAYGVC 559
Cdd:COG0249   87 VAICEQVEDPA--EAK-----------GLVKREVVRVVTPGTLTEDALLDAKRNNY---LAAVARDKG-------RYGLA 143
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   560 FVDTSLGKFFIGQFSDDrhcSRFRTLVAHYPPVQVLFEKGNLSKETKTILKSSLSCSLQEglIPGSQFwDASKTLRTLLE 639
Cdd:COG0249  144 WLDISTGEFLVTELDGE---EALLDELARLAPAEILVPEDLPDPEELLELLRERGAAVTR--LPDWAF-DPDAARRRLLE 217
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   640 eeYFREKLSDGIGVmlpqvlkgmtsesdsigltpgEKSELALSALGGCVFYLKKCLIdQELLSMANFEEYiplDSDtvst 719
Cdd:COG0249  218 --QFGVASLDGFGL---------------------EDLPAAIAAAGALLAYLEETQK-GALPHLRRLRRY---EED---- 266
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   720 trsgaiftkayQRMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLM 799
Cdd:COG0249  267 -----------DYLILDAATRRNLELTET-LRGGRKGSLLSVLDRTVTAMGSRLLRRWLLRPLRDRAAIEARLDAVEELL 334
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   800 VVPDKISEVVELLKKLPDLERLLSKI-HNVGSPlksqnhpdsRaimyeettyskkkiiDFLSALEGFKVMCKIIGIMEEV 878
Cdd:COG0249  335 EDPLLREELRELLKGVYDLERLLSRIaLGRANP---------R---------------DLAALRDSLAALPELKELLAEL 390
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   879 ADGFKSKILKQVislqtknpeGRFPDLTVELNRwdtAFDHE---KARKTGLItpKAGFDSDYDQaLADIREN-EQSLLEY 954
Cdd:COG0249  391 DSPLLAELAEAL---------DPLEDLAELLER---AIVDEpplLIRDGGVI--REGYDAELDE-LRELSENgKEWLAEL 455
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   955 LEKQRNRIGCRTIvywGIGRNR---YQLEIpenftTR----NLPEEYELKSTKKGCKRYWT---KTIEKKLANlinAEER 1024
Cdd:COG0249  456 EARERERTGIKSL---KVGYNKvfgYYIEV-----TKanadKVPDDYIRKQTLKNAERYITpelKELEDKILS---AEER 524
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1025 RDV-----------SLKDCMRRLfynfdknykdwQSAVECIAVLDVLLCLA------NYsrggdgpmCRPVIllpeDTPP 1087
Cdd:COG0249  525 ALAleyelfeelreEVAAHIERL-----------QALARALAELDVLASLAevavenNY--------VRPEL----DDSP 581
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1088 FLELKGSRHPCITKTFFGDDFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRL 1167
Cdd:COG0249  582 GIEIEGGRHPVVEQALPGEPFVPNDCDLDPDRR--------ILLITGPNMAGKSTYMRQVALIVLLAQIGSFVPAESARI 653
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1168 TPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFST 1247
Cdd:COG0249  654 GIVDRIFTRVGASDDLARGQSTFMVEMTETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAVAEYLHDKIRARTLFAT 733
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1248 HYH---SLVEDYSQ----NVAVRlghmacmvEnecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQ------ 1314
Cdd:COG0249  734 HYHeltELAEKLPGvknyHVAVK--------E------WGGDIVFLHKVVPGPADRSYGIHVAKLAGLPASVIErareil 799
                        970       980       990      1000      1010
                 ....*....|....*....|....*....|....*....|....*....|
gi 4504191  1315 ----KGHRKAREFEKMNQsLRLFrevclasersTVDAEAVHKLLTLIKEL 1360
Cdd:COG0249  800 aeleKGEAAAAGKAAPDQ-LSLF----------AAADPEPSPVLEELKAL 838
PRK05399 PRK05399
DNA mismatch repair protein MutS; Provisional
407-1360 1.09e-161

DNA mismatch repair protein MutS; Provisional


Pssm-ID: 235444 [Multi-domain]  Cd Length: 854  Bit Score: 506.94  E-value: 1.09e-161
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNwAHS-------GFPEIAFGRYSDSLVQKGYK 479
Cdd:PRK05399    9 TPMMQQYLEIKAQYPDALLFFRMGDFYELFFEDAKKASRLLDITLTKRG-KSAgepipmaGVPYHAAEGYLAKLVKKGYK 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    480 VARVEQTETPEMmearcrkmahiSKydRVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLKEKEEdssghtrAYGVC 559
Cdd:PRK05399   88 VAICEQVEDPAT-----------AK--GPVKREVVRIVTPGTVTDEALLDEKQNNY---LAAIAQDGG-------GYGLA 144
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    560 FVDTSLGKFFIGQFSDDRhcsrFRTLVAHYPPVQVLFEKgNLSKETKTILKSSLSCslqeglIPGSQFwDASKTLRTLLE 639
Cdd:PRK05399  145 YLDLSTGEFRVTELDEEE----LLAELARLNPAEILVPE-DFSEDELLLLRRGLRR------RPPWEF-DLDTAEKRLLE 212
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    640 eeYFREKLSDGIGVMLPqvlkgmtsesdsigltpgekseLALSALGGCVFYLKKCLIdQEL-----LSMANFEEYiplds 714
Cdd:PRK05399  213 --QFGVASLDGFGVDLP----------------------LAIRAAGALLQYLKETQK-RSLphlrsPKRYEESDY----- 262
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    715 dtvsttrsgaiftkayqrMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDA 794
Cdd:PRK05399  263 ------------------LILDAATRRNLELTEN-LRGGRKNSLLSVLDRTVTAMGGRLLRRWLHRPLRDREAIEARLDA 323
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    795 IEDLMVVPDKISEVVELLKKLPDLERLLSKI-HNVGSPlksqnhpdsRaimyeettyskkkiiDFLSALEGFKVMCKIIG 873
Cdd:PRK05399  324 VEELLEDPLLREDLRELLKGVYDLERLLSRIaLGRANP---------R---------------DLAALRDSLEALPELKE 379
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    874 IMEEVADGFKSKILKQVislqtknpeGRFPDLTVELNRwdtAFDHE---KARKTGLItpKAGFDSDYDQaLADIREN-EQ 949
Cdd:PRK05399  380 LLAELDSPLLAELAEQL---------DPLEELADLLER---AIVEEpplLIRDGGVI--ADGYDAELDE-LRALSDNgKD 444
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    950 SLLEYLEKQRNRIGCRTIvywGIGRNR---YQLEIPeNFTTRNLPEEYELKSTKKGCKRYWT---KTIEKKLANlinAEE 1023
Cdd:PRK05399  445 WLAELEARERERTGISSL---KVGYNKvfgYYIEVT-KANLDKVPEDYIRRQTLKNAERYITpelKELEDKILS---AEE 517
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1024 RRdVSL-KDCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLA------NYsrggdgpmCRPVIllpeDTPPFLELKGSRH 1096
Cdd:PRK05399  518 KA-LALeYELFEELREEVAEHIERLQKLAKALAELDVLASLAevaeenNY--------VRPEF----TDDPGIDIEEGRH 584
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1097 PCITKTFFGDDFIPNDILIgceeeeqeNGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTR 1176
Cdd:PRK05399  585 PVVEQVLGGEPFVPNDCDL--------DEERRLLLITGPNMAGKSTYMRQVALIVLLAQIGSFVPAESARIGIVDRIFTR 656
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1177 LGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEdy 1256
Cdd:PRK05399  657 IGASDDLASGRSTFMVEMTETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAVAEYLHDKIGAKTLFATHYHELTE-- 734
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1257 sqnVAVRLGHMACM----VEnecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEKMNQSLRl 1332
Cdd:PRK05399  735 ---LEEKLPGVKNVhvavKE------HGGDIVFLHKVVPGAADKSYGIHVAKLAGLPASVIKRAREILAQLESASEKAK- 804
                         970       980
                  ....*....|....*....|....*...
gi 4504191   1333 fREVCLASERSTVDAEAVHKLLTLIKEL 1360
Cdd:PRK05399  805 -AASAEEDQLSLFAEPEESPLLEALKAL 831
mutS1 TIGR01070
DNA mismatch repair protein MutS; [DNA metabolism, DNA replication, recombination, and repair]
407-1325 6.75e-124

DNA mismatch repair protein MutS; [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273427 [Multi-domain]  Cd Length: 840  Bit Score: 405.69  E-value: 6.75e-124
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGNWAH------SGFPEIAFGRYSDSLVQKGYKV 480
Cdd:TIGR01070    2 TPMMQQYLKLKAEHPDALLFFRMGDFYELFYEDAKKAAQLLDISLTSRGQSAdepipmAGIPYHAVEAYLEKLVKQGESV 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     481 ARVEQTETPEMMEArcrkmahiskydrVVRREICRIITKGTQTYSVLEGDPSENYskyLLSLkekeedsSGHTRAYGVCF 560
Cdd:TIGR01070   82 AICEQIEDPKTAKG-------------PVEREVVQLITPGTVSDEALLPERQDNL---LAAI-------AQESNGFGLAT 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     561 VDTSLGKFFIGQFSDdrhcsrFRTLVAH---YPPVQVLFeKGNLSKETkTILKSSLSCSlqeglipgsqfwdaskTLRTL 637
Cdd:TIGR01070  139 LDLTTGEFKVTELAD------KETLYAElqrLNPAEVLL-AEDLSEME-AIELREFRKD----------------TAVMS 194
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     638 LEEEYFREKLSdGIGVmlpqvlkgmtsesdsigltpgEKSELALSALGGCVFYLKKCLiDQELLSMANFEEYIPLDSdtv 717
Cdd:TIGR01070  195 LEAQFGTEDLG-GLGL---------------------RNAPLGLTAAGCLLQYAKRTQ-RTALPHLQPVRLYELQDF--- 248
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     718 sttrsgaiftkayqrMVLDAVTLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIED 797
Cdd:TIGR01070  249 ---------------MQLDAATRRNLELTEN-LRGGKQNTLFSVLDETKTAMGSRLLKRWLHRPLRDREVLEARQDTVEV 312
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     798 LMVVPDKISEVVELLKKLPDLERLLSKIhnvgsPLKSQNHPDsraimyeettyskkkiidfLSAL-EGFKVMCKIIGIME 876
Cdd:TIGR01070  313 LLRHFFLREGLRPLLKEVGDLERLAARV-----ALGNARPRD-------------------LARLrTSLEQLPELRALLE 368
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     877 EVADGFKSKILKQVislqtknpeGRFPDLTVELNRWDTAFDHEKARKTGLItpKAGFDSDYDQALADIRENEQSLLEYLE 956
Cdd:TIGR01070  369 ELEGPTLQALAAQI---------DDFSELLELLEAALIENPPLVVRDGGLI--REGYDEELDELRAASREGTDYLARLEA 437
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     957 KQRNRIGCRTIvywGIGRNR---YQLEIPeNFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCM 1033
Cdd:TIGR01070  438 RERERTGIPTL---KVGYNAvfgYYIEVT-RGQLHLVPAHYRRRQTLKNAERYITPELKEKEDKVLEAEGKILALEKELF 513
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1034 RRLFYNFDKNYKDWQSAVECIAVLDVLLCLA------NYSRggdgpmcrpvillPE-DTPPFLELKGSRHPCITKTFfGD 1106
Cdd:TIGR01070  514 EELRELLKKYLEALQEAARALAELDVLANLAevaetlHYTR-------------PRfGDDPQLRIREGRHPVVEQVL-RT 579
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1107 DFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSG 1186
Cdd:TIGR01070  580 PFVPNDLEMAHNRR--------MLLITGPNMGGKSTYMRQTALIALLAQIGSFVPAESAELPLFDRIFTRIGASDDLASG 651
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1187 ESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGH 1266
Cdd:TIGR01070  652 RSTFMVEMTEAANILHNATENSLVLFDEIGRGTSTYDGLALAWAIAEYLHEHIRAKTLFATHYFELTALEESLPGLKNVH 731
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191    1267 MACMVENecedpsqETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFEK 1325
Cdd:TIGR01070  732 VAALEHN-------GTIVFLHQVLPGPASKSYGLAVAALAGLPKEVIARARQILTQLEA 783
ABC_MSH6_euk cd03286
ATP-binding cassette domain of eukaryotic MutS6 homolog; The MutS protein initiates DNA ...
1091-1313 8.17e-106

ATP-binding cassette domain of eukaryotic MutS6 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213253 [Multi-domain]  Cd Length: 218  Bit Score: 334.01  E-value: 8.17e-106
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1091 LKGSRHPCITKTFfGDDFIPNDILIGCEEeeqengkAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPI 1170
Cdd:cd03286    2 FEELRHPCLNAST-ASSFVPNDVDLGATS-------PRILVLTGPNMGGKSTLLRTVCLAVIMAQMGMDVPAKSMRLSLV 73
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1171 DRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYH 1250
Cdd:cd03286   74 DRIFTRIGARDDIMKGESTFMVELSETANILRHATPDSLVILDELGRGTSTHDGYAIAHAVLEYLVKKVKCLTLFSTHYH 153
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 4504191  1251 SLVEDYSQNVAVRLGHMACMVENEcEDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVI 1313
Cdd:cd03286  154 SLCDEFHEHGGVRLGHMACAVKNE-SDPTIRDITFLYKLVAGICPKSYGLYVALMAGIPDGVV 215
MUTSac smart00534
ATPase domain of DNA mismatch repair MUTS family;
1129-1320 5.73e-94

ATPase domain of DNA mismatch repair MUTS family;


Pssm-ID: 197777 [Multi-domain]  Cd Length: 185  Bit Score: 299.86  E-value: 5.73e-94
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     1129 CVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHS 1208
Cdd:smart00534    1 VVIITGPNMGGKSTYLRQVALIVIMAQIGSFVPAESAELPVFDRIFTRIGASDSLAQGLSTFMVEMKETANILKNATKNS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     1209 LVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENecedpsqETITFLYK 1288
Cdd:smart00534   81 LVLLDELGRGTSTYDGLAIAAAILEYLLEKIGARTLFATHYHELTKLADNHPGVRNLHMSALEET-------ENITFLYK 153
                           170       180       190
                    ....*....|....*....|....*....|..
gi 4504191     1289 FIKGACPKSYGFNAARLANLPEEVIQKGHRKA 1320
Cdd:smart00534  154 LKPGVAGKSYGIEVAKLAGLPKEVIERAKRIL 185
MutS_V pfam00488
MutS domain V; This domain is found in proteins of the MutS family (DNA mismatch repair ...
1130-1324 1.18e-86

MutS domain V; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam05190. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain V of Thermus aquaticus MutS as characterized in, which contains a Walker A motif, and is structurally similar to the ATPase domain of ABC transporters.


Pssm-ID: 425714 [Multi-domain]  Cd Length: 188  Bit Score: 279.85  E-value: 1.18e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1130 VLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSL 1209
Cdd:pfam00488    1 LIITGPNMGGKSTYLRQVALIVLMAQIGSFVPAESAEIGIVDRIFTRIGASDDLAKGRSTFMVEMLETANILHNATDKSL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1210 VLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHYHSLVEDYSQNVAVRLGHMACMVENecedpsqETITFLYKF 1289
Cdd:pfam00488   81 VILDELGRGTSTYDGLAIAWAVAEHLAEKIKARTLFATHYHELTKLAEKLPAVKNLHMAAVEDD-------DDIVFLYKV 153
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 4504191    1290 IKGACPKSYGFNAARLANLPEEVIQKGHRKAREFE 1324
Cdd:pfam00488  154 QPGAADKSYGIHVAELAGLPESVVERAREILAELE 188
ABC_MutS1 cd03284
ATP-binding cassette domain of MutS1 homolog; The MutS protein initiates DNA mismatch repair ...
1090-1315 8.95e-83

ATP-binding cassette domain of MutS1 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213251 [Multi-domain]  Cd Length: 216  Bit Score: 269.91  E-value: 8.95e-83
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCITKTFFGDDFIPNDILIGCEEeeqengkaYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTP 1169
Cdd:cd03284    1 EIEGGRHPVVEQVLDNEPFVPNDTELDPER--------QILLITGPNMAGKSTYLRQVALIALLAQIGSFVPASKAEIGV 72
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1170 IDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTHY 1249
Cdd:cd03284   73 VDRIFTRIGASDDLAGGRSTFMVEMVETANILNNATERSLVLLDEIGRGTSTYDGLSIAWAIVEYLHEKIGAKTLFATHY 152
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 4504191  1250 HSLVEDYSQNVAVRLGHMAcmVENEcedpsQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQK 1315
Cdd:cd03284  153 HELTELEGKLPRVKNFHVA--VKEK-----GGGVVFLHKIVEGAADKSYGIEVARLAGLPEEVIER 211
MUTSd smart00533
DNA-binding domain of DNA mismatch repair MUTS family;
755-1099 6.56e-78

DNA-binding domain of DNA mismatch repair MUTS family;


Pssm-ID: 214710 [Multi-domain]  Cd Length: 308  Bit Score: 259.92  E-value: 6.56e-78
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      755 EGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLMVVPDKISEVVELLKKLPDLERLLSKIHnvgsplks 834
Cdd:smart00533    1 KGSLFELLNHTKTPMGKRLLRRWLLQPLLDLKEINERLDAVEELVENPELRQKLRQLLKRIPDLERLLSRIE-------- 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      835 qnhpdsraimyeETTYSKKKIIDFLSALEGFKVMCKIIGIMEEVADGFkskiLKQVISlqtkNPEGRFPDLTVELNRWDT 914
Cdd:smart00533   73 ------------RGRASPRDLLRLYDSLEGLKEIRQLLESLDGPLLGL----LLKVIL----EPLLELLELLLELLNDDD 132
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      915 AFDHEKArktglITPKAGFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVYWGIGRNRYQLEIPENFTTrNLPEEY 994
Cdd:smart00533  133 PLEVNDG-----GLIKDGFDPELDELREKLEELEEELEELLKKEREELGIDSLKLGYNKVHGYYIEVTKSEAK-KVPKDF 206
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      995 ELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSRggDGPM 1074
Cdd:smart00533  207 IRRSSLKNTERFTTPELKELENELLEAKEEIERLEKEILRELLEKVLEYLEELRALAEALAELDVLLSLATLAA--EGNY 284
                           330       340
                    ....*....|....*....|....*
gi 4504191     1075 CRPVIllpeDTPPFLELKGSRHPCI 1099
Cdd:smart00533  285 VRPEF----VDSGELEIKNGRHPVL 305
ABC_MutS_homologs cd03243
ATP-binding cassette domain of MutS homologs; The MutS protein initiates DNA mismatch repair ...
1090-1308 5.07e-76

ATP-binding cassette domain of MutS homologs; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213210 [Multi-domain]  Cd Length: 202  Bit Score: 250.24  E-value: 5.07e-76
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCITKTFFGDDFIPNDILIgceeeeqENGKayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTP 1169
Cdd:cd03243    1 EIKGGRHPVLLALTKGETFVPNDINL-------GSGR--LLLITGPNMGGKSTYLRSIGLAVLLAQIGCFVPAESASIPL 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1170 IDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAEtIKCRTLFSTHY 1249
Cdd:cd03243   72 VDRIFTRIGAEDSISDGRSTFMAELLELKEILSLATPRSLVLIDELGRGTSTAEGLAIAYAVLEHLLE-KGCRTLFATHF 150
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191  1250 HSLVEDYSQNVAVRLGHMACMVENECedpsqetITFLYKFIKGACPKSYGFNAARLANL 1308
Cdd:cd03243  151 HELADLPEQVPGVKNLHMEELITTGG-------LTFTYKLIDGICDPSYALQIAELAGL 202
ABC_MSH2_euk cd03285
ATP-binding cassette domain of eukaryotic MutS2 homolog; The MutS protein initiates DNA ...
1091-1324 3.16e-73

ATP-binding cassette domain of eukaryotic MutS2 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213252 [Multi-domain]  Cd Length: 222  Bit Score: 243.05  E-value: 3.16e-73
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1091 LKGSRHPCITKTffgDD--FIPNDIligceeeEQENGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLT 1168
Cdd:cd03285    2 LKEARHPCVEAQ---DDvaFIPNDV-------TLTRGKSRFLIITGPNMGGKSTYIRQIGVIVLMAQIGCFVPCDSADIP 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1169 PIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTH 1248
Cdd:cd03285   72 IVDCILARVGASDSQLKGVSTFMAEMLETAAILKSATENSLIIIDELGRGTSTYDGFGLAWAIAEYIATQIKCFCLFATH 151
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 4504191  1249 YHSLVEDYSQNVAVRLGHMACMVeneceDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKAREFE 1324
Cdd:cd03285  152 FHELTALADEVPNVKNLHVTALT-----DDASRTLTMLYKVEKGACDQSFGIHVAELANFPKEVIEMAKQKALELE 222
ABC_MSH3_euk cd03287
ATP-binding cassette domain of eukaryotic MutS3 homolog; The MutS protein initiates DNA ...
1089-1315 2.84e-69

ATP-binding cassette domain of eukaryotic MutS3 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213254 [Multi-domain]  Cd Length: 222  Bit Score: 231.99  E-value: 2.84e-69
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1089 LELKGSRHPCItKTFFGDDFIPNDILIGCEEEeqengkaYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLT 1168
Cdd:cd03287    1 ILIKEGRHPMI-ESLLDKSFVPNDIHLSAEGG-------YCQIITGPNMGGKSSYIRQVALITIMAQIGSFVPASSATLS 72
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1169 PIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETIKCRTLFSTH 1248
Cdd:cd03287   73 IFDSVLTRMGASDSIQHGMSTFMVELSETSHILSNCTSRSLVILDELGRGTSTHDGIAIAYATLHYLLEEKKCLVLFVTH 152
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191  1249 YHSLVEDYSQNV-AVRLGHMACMVENEC-EDPSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQK 1315
Cdd:cd03287  153 YPSLGEILRRFEgSIRNYHMSYLESQKDfETSDSQSITFLYKLVRGLASRSFGLNVARLAGLPKSIISR 221
PWWP_MSH6 cd05837
PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called ...
90-193 1.37e-55

PWWP domain found in DNA mismatch repair protein MSH6 and similar proteins; MSH6, also called G/T mismatch-binding protein (GTBP or GTMBP), MutS protein homolog 6, or MutS-alpha 160 kDa subunit (p160), is a mismatch repair protein homologous to bacterial MutS. It is a component of the post-replicative DNA mismatch repair system (MMR). It heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches thereby initiating DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, it forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. MSH6 contains a PWWP domain, but its role in MSH6 remains unclear. MSH6 orthologs found in Saccharomyces cerevisiae, Caenorhabditis elegans, and Arabidopsis thaliana lack the PWWP domain. PWWP domains typically recognize DNA and histone methylated lysines.


Pssm-ID: 438962  Cd Length: 103  Bit Score: 187.88  E-value: 1.37e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGtFIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYSAK 169
Cdd:cd05837    1 FSPGDLVWAKLEGYPWWPSLVCNHPTTG-FHKKFGKKGEVHVQFFDDPPSRAWVKAKNVKPFTGSDDKEFQKGGMFFSKD 79
                         90       100
                 ....*....|....*....|....
gi 4504191   170 PEILRAMQRADEALNKDKIKRLEL 193
Cdd:cd05837   80 PKWKKAVKEADKALKLSVEERLKL 103
MutS_III pfam05192
MutS domain III; This domain is found in proteins of the MutS family (DNA mismatch repair ...
739-1064 2.09e-53

MutS domain III; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam05188, pfam01624 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain III, which is central to the structure of Thermus aquaticus MutS as characterized in.


Pssm-ID: 461579 [Multi-domain]  Cd Length: 291  Bit Score: 189.15  E-value: 2.09e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     739 TLNNLEIFLNgTNGSTEGTLLERVDTCHTPFGKRLLKQWLCAPLCNHYAINDRLDAIEDLMVVPDKISEVVELLKKLPDL 818
Cdd:pfam05192    2 TLRNLELTEN-LRGGKEGSLLGLLDRTKTPMGSRLLRQWLLQPLTDLEEINERLDAVEELLENSELREDLRELLRRLPDL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     819 ERLLSKIHnvgsplksqnhpdsraimyeettYSKKKIIDFLSALEGFKVMCKIIGIMEEVADGFKSKI--LKQVISlQTK 896
Cdd:pfam05192   81 ERLLSRIA-----------------------LGKATPRDLLALLDSLEKLPLLKELLLEEKSALLGELasLAELLE-EAI 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     897 NPEGrfPDLTVELNRWDTAFDHEKARKTGLITpkagfDSDYDQALADIRENEQSLLEYLEKQRNRigcrtiVYWGIGRNR 976
Cdd:pfam05192  137 DEEP--PALLRDGGVIRDGYDEELDELRDLLL-----DGKRLLAKLEARERERTGIKSLKVLYNK------VFGYYLLLV 203
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     977 YQLEIPENFTTRNLPEEYELKSTKKGCKRYWTKTIEKKLANLINAEERRDVSLKDCMRRLFYNFDKNYKDWQSAVECIAV 1056
Cdd:pfam05192  204 EYYIEVSKSQKDKVPDDYIRIQTTKNAERYITPELKELERKILQAEERLLALEKELFEELLEEVLEYIEVLRRAAEALAE 283

                   ....*...
gi 4504191    1057 LDVLLCLA 1064
Cdd:pfam05192  284 LDVLLSLA 291
ABC_MSH5_euk cd03281
ATP-binding cassette domain of eukaryotic MutS5 homolog; The MutS protein initiates DNA ...
1090-1308 2.55e-49

ATP-binding cassette domain of eukaryotic MutS5 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213248 [Multi-domain]  Cd Length: 213  Bit Score: 174.41  E-value: 2.55e-49
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCITKtfFGDDFIPNDILIGceeeeqeNGKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTP 1169
Cdd:cd03281    1 EIQGGRHPLLEL--FVDSFVPNDTEIG-------GGGPSIMVITGPNSSGKSVYLKQVALIVFLAHIGSFVPADSATIGL 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1170 IDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAE-TIKC-RTLFST 1247
Cdd:cd03281   72 VDKIFTRMSSRESVSSGQSAFMIDLYQVSKALRLATRRSLVLIDEFGKGTDTEDGAGLLIATIEHLLKrGPECpRVIVST 151
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 4504191  1248 HYHSLVEDYS--QNVAVRLGHMACMVENECEDPSqETITFLYKFIKGACPKSYGFNAARLANL 1308
Cdd:cd03281  152 HFHELFNRSLlpERLKIKFLTMEVLLNPTSTSPN-EDITYLYRLVPGLADTSFAIHCAKLAGI 213
ABC_MSH4_euk cd03282
ATP-binding cassette domain of eukaryotic MutS4 homolog; The MutS protein initiates DNA ...
1093-1308 3.02e-41

ATP-binding cassette domain of eukaryotic MutS4 homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213249 [Multi-domain]  Cd Length: 204  Bit Score: 151.00  E-value: 3.02e-41
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1093 GSRHPCITKTffGDDFIPNDILIgCEEEEQENgkaycvLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDR 1172
Cdd:cd03282    4 DSRHPILDRD--KKNFIPNDIYL-TRGSSRFH------IITGPNMSGKSTYLKQIALLAIMAQIGCFVPAEYATLPIFNR 74
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1173 VFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAETiKCRTLFSTHYHSL 1252
Cdd:cd03282   75 LLSRLSNDDSMERNLSTFASEMSETAYILDYADGDSLVLIDELGRGTSSADGFAISLAILECLIKK-ESTVFFATHFRDI 153
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 4504191  1253 VEDYSQNVAVRLGHMACMVENECEdpsqetITFLYKFIKGA-CPKSYGFNAARLANL 1308
Cdd:cd03282  154 AAILGNKSCVVHLHMKAQSINSNG------IEMAYKLVLGLyRIVDDGIRFVRVLAL 204
MutS_I pfam01624
MutS domain I; This domain is found in proteins of the MutS family (DNA mismatch repair ...
407-525 1.31e-39

MutS domain I; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam05188, pfam05192 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with globular domain I, which is involved in DNA binding, in Thermus aquaticus MutS as characterized in.


Pssm-ID: 426350 [Multi-domain]  Cd Length: 113  Bit Score: 142.72  E-value: 1.31e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     407 TPGMRKWWQIKSQNFDLVICYKVGKFYELYHMDALIGVSELGLVFMKGN------WAHSGFPEIAFGRYSDSLVQKGYKV 480
Cdd:pfam01624    1 TPMMRQYLELKSKYPDAVLFFRVGDFYELFGEDAEIAARELGITLTVRKggsgkrIPMAGVPEHAFERYARRLVNKGYKV 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 4504191     481 ARVEQTETPEMMEArcrkmahiskydrVVRREICRIITKGTQTYS 525
Cdd:pfam01624   81 AICEQTETPAEAKG-------------VVKREVVRVVTPGTLTDD 112
ABC_MutS2 cd03280
ATP-binding cassette domain of MutS2; MutS2 homologs in bacteria and eukaryotes. The MutS ...
1090-1304 5.42e-34

ATP-binding cassette domain of MutS2; MutS2 homologs in bacteria and eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213247 [Multi-domain]  Cd Length: 200  Bit Score: 129.68  E-value: 5.42e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCITKTffGDDFIPNDILIGceEEEQengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPA-EVCRLT 1168
Cdd:cd03280    1 RLREARHPLLPLQ--GEKVVPLDIQLG--ENKR------VLVITGPNAGGKTVTLKTLGLLTLMAQSGLPIPAaEGSSLP 70
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1169 PIDRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAEtIKCRTLFSTH 1248
Cdd:cd03280   71 VFENIFADIGDEQSIEQSLSTFSSHMKNIARILQHADPDSLVLLDELGSGTDPVEGAALAIAILEELLE-RGALVIATTH 149
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 4504191  1249 YHSLvedysqnvaVRLGHMACMVENECEDPSQETITFLYKFIKGACPKSYGFNAAR 1304
Cdd:cd03280  150 YGEL---------KAYAYKREGVENASMEFDPETLKPTYRLLIGVPGRSNALEIAR 196
MutS2 COG1193
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
1048-1330 4.71e-31

dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];


Pssm-ID: 440806 [Multi-domain]  Cd Length: 784  Bit Score: 132.19  E-value: 4.71e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1048 QSAVECIAVLDVLLCLANYSRGGDGpmCRPVIllpeDTPPFLELKGSRHPCITKtffgDDFIPNDILIGceeeeqENGKA 1127
Cdd:COG1193  264 LENLEILAELDFIFAKARYALELKA--VKPEL----NDEGYIKLKKARHPLLDL----KKVVPIDIELG------EDFRT 327
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1128 ycVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPA-EVCRLTPIDRVFTRLGasDR--IMSGESTFFVELSETASILMHA 1204
Cdd:COG1193  328 --LVITGPNTGGKTVTLKTVGLLTLMAQSGLPIPAaEGSELPVFDNIFADIG--DEqsIEQSLSTFSSHMTNIVEILEKA 403
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1205 TAHSLVLVDELGRGTatfD---GTAIANAVVKELAEtIKCRTLFSTHYHSL-----VEDYSQNVAVRLghmacmvenece 1276
Cdd:COG1193  404 DENSLVLLDELGAGT---DpqeGAALAIAILEELLE-RGARVVATTHYSELkayayNTEGVENASVEF------------ 467
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191  1277 DPsqETITFLYKFIKGACPKSYGFN-AARLaNLPEEVIQKGHRK----AREFEKMNQSL 1330
Cdd:COG1193  468 DV--ETLSPTYRLLIGVPGRSNAFEiARRL-GLPEEIIERARELlgeeSIDVEKLIEEL 523
mutS2 TIGR01069
MutS2 family protein; Function of MutS2 is unknown. It should not be considered a DNA mismatch ...
1011-1330 6.20e-29

MutS2 family protein; Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function. [DNA metabolism, Other]


Pssm-ID: 130141 [Multi-domain]  Cd Length: 771  Bit Score: 125.32  E-value: 6.20e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1011 IEKKLANLINAEERRDVSLkdcMRRLFYNFDKNYKDWQSAVECIAVLDVLLCLANYSR--GGDGPMCrpvillpeDTPPF 1088
Cdd:TIGR01069  227 LNNKLAQLKNEEECEIEKI---LRTLSEKVQEYLLELKFLFKEFDFLDSLQARARYAKavKGEFPMP--------SFTGK 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1089 LELKGSRHPCITKtffgDDFIPNDILIGCEEEeqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLT 1168
Cdd:TIGR01069  296 IILENARHPLLKE----PKVVPFTLNLKFEKR--------VLAITGPNTGGKTVTLKTLGLLALMFQSGIPIPANEHSEI 363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1169 PI-DRVFTRLGASDRIMSGESTFFVELSETASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAEtIKCRTLFST 1247
Cdd:TIGR01069  364 PYfEEIFADIGDEQSIEQNLSTFSGHMKNISAILSKTTENSLVLFDELGAGTDPDEGSALAISILEYLLK-QNAQVLITT 442
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    1248 HYHSL-----VEDYSQNVAVRLghmacmvenecedpSQETITFLYKFIKGACPKSYGFNAARLANLPEEVIQKGHRKARE 1322
Cdd:TIGR01069  443 HYKELkalmyNNEGVENASVLF--------------DEETLSPTYKLLKGIPGESYAFEIAQRYGIPHFIIEQAKTFYGE 508

                   ....*....
gi 4504191    1323 F-EKMNQSL 1330
Cdd:TIGR01069  509 FkEEINVLI 517
ABC_MutS-like cd03283
ATP-binding cassette domain of MutS-like homolog; The MutS protein initiates DNA mismatch ...
1090-1299 1.34e-28

ATP-binding cassette domain of MutS-like homolog; The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.


Pssm-ID: 213250 [Multi-domain]  Cd Length: 199  Bit Score: 114.32  E-value: 1.34e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCITKtffgDDFIPNDILIgceeeEQENGkaycVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLtP 1169
Cdd:cd03283    1 EAKNLGHPLIGR----EKRVANDIDM-----EKKNG----ILITGSNMSGKSTFLRTIGVNVILAQAGAPVCASSFEL-P 66
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1170 IDRVFTRLGASDRIMSGESTFFVELSETASILMHATA--HSLVLVDELGRGTATFDGTAIANAVVKELAETikcRTLF-- 1245
Cdd:cd03283   67 PVKIFTSIRVSDDLRDGISYFYAELRRLKEIVEKAKKgePVLFLLDEIFKGTNSRERQAASAAVLKFLKNK---NTIGii 143
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....
gi 4504191  1246 STHYHSLVEDYSQNVAVRLGHMACMVENecedpsqETITFLYKFIKGACPKSYG 1299
Cdd:cd03283  144 STHDLELADLLDLDSAVRNYHFREDIDD-------NKLIFDYKLKPGVSPTRNA 190
PWWP_NSD_rpt1 cd20144
first PWWP domain found in nuclear receptor-binding SET domain-containing (NSD) proteins; The ...
92-183 1.01e-25

first PWWP domain found in nuclear receptor-binding SET domain-containing (NSD) proteins; The nuclear receptor binding SET domain (NSD) protein family consists of three HMTases, NSD1, NSD2/MMSET/WHSC1, and NSD3/WHSC1L1 that are critical in maintaining the chromatin integrity. Reducing NSD activity through specific lysine-HMTase inhibitors appears promising in suppressing cancer growth. NSD proteins have specific mono- and dimethylase activities for H3K36, and play nonredundant roles during development. NSD1 plays a role in several pathologies, including but not limited to Sotos and Weaver syndromes, acute myeloid leukemia, breast cancer, neuroblastoma, and glioblastoma formation. NSD2 is involved in cancer cell proliferation, survival, and tumor growth, through mediating constitutive NF-kappaB signaling via the cytokine autocrine loop. NSD3 is amplified in human breast cancer cell lines. Moreover, translocation resulting in NUP98 fusion to NSD3 leads to the development of acute myeloid leukemia. NSD proteins contain a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-proline (PWWP) domains, five plant homeodomain (PHD) fingers, and an NSD-specific Cys-His rich domain (C5HCH). This model corresponds to the first PWWP domain. This family also includes Drosophila melanogaster maternal-effect sterile 4 (dMes4) that may act as a histone-lysine N-methyltransferase required for wing morphogenesis. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438972  Cd Length: 114  Bit Score: 103.16  E-value: 1.01e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    92 PGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVR----VHVQFFDDSPTRGWVSKRLLKPYTG-------------- 153
Cdd:cd20144    1 VGDLVWAKVSGHPWWPCMVTYDPESGLYTKIKGSGGRtyrqYHVQFFGDNGERGWVSEKSLMPFEGkekfeelvkelkkk 80
                         90       100       110
                 ....*....|....*....|....*....|...
gi 4504191   154 --SKSKEAQKGGHFY-SAKPEILRAMQRADEAL 183
Cdd:cd20144   81 akKKSKKAKLEKKVKpSRRKKWEIAVEEAEEAL 113
PWWP pfam00855
PWWP domain; The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The domain ...
93-184 3.92e-25

PWWP domain; The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The domain binds to Histone-4 methylated at lysine-20, H4K20me, suggesting that it is methyl-lysine recognition motif. Removal of two conserved aromatic residues in a hydrophobic cavity created by this domain within the full-length protein, Pdp1, abolishes the interaction o f the protein with H4K20me3. In fission yeast, Set9 is the sole enzyme that catalyzes all three states of H4K20me, and Set9-mediated H4K20me is required for efficient recruitment of checkpoint protein Crb2 to sites of DNA damage. The methylation of H4K20 is involved in a diverse array of cellular processes, such as organizing higher-order chromatin, maintaining genome stability, and regulating cell-cycle progression.


Pssm-ID: 459964 [Multi-domain]  Cd Length: 92  Bit Score: 100.58  E-value: 3.92e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191      93 GDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVR-VHVQFFDDSPTrGWVSKRLLKPYTGSKSKEAQKGGHFYSAKPE 171
Cdd:pfam00855    1 GDLVWAKLKGYPWWPARVVDPEELPENVLKPKKKDGeYLVRFFGDSEF-AWVKPKDLKPFDEGDEFEYLKKKKKKKKKKA 79
                           90
                   ....*....|...
gi 4504191     172 ILRAMQRADEALN 184
Cdd:pfam00855   80 FKKALEEAEEALK 92
PWWP_ZCWPW1 cd20145
PWWP domain found in zinc finger CW-type PWWP domain protein 1 (ZCWPW1) and similar proteins; ...
90-192 6.21e-25

PWWP domain found in zinc finger CW-type PWWP domain protein 1 (ZCWPW1) and similar proteins; ZCWPW1 is a histone H3K4me3 reader. It is associated with late-onset Alzheimer's disease (LOAD). In addition to the PWWP domain, ZCWPW1 contains a zinc finger CW (zf-CW) domain that is a histone modification reader for the histone H3 tail with trimethylated K4. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438973  Cd Length: 115  Bit Score: 100.70  E-value: 6.21e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVRV---HVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFY 166
Cdd:cd20145    6 YTPGSLVWAKMPGYPWWPAMVEDDPDTEEFFWLDEESDIPtkyHVTFFDKPVSRAWVRASSIKPFTDNSNEPNLTKKKGK 85
                         90       100
                 ....*....|....*....|....*.
gi 4504191   167 SAKPEILRAMQRADEALNKDKIKRLE 192
Cdd:cd20145   86 KYKKRLNEAVEMAREALKLSIKERLK 111
ABC_Class2 cd03227
ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems ...
1090-1273 6.23e-25

ATP-binding cassette domain of non-transporter proteins; ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterized by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins.


Pssm-ID: 213194 [Multi-domain]  Cd Length: 162  Bit Score: 102.44  E-value: 6.23e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1090 ELKGSRHPCItktffgddFIPNDILIGceeeeqengKAYCVLVTGPNMGGKSTLMRQAGLLAVMAQM----------GCY 1159
Cdd:cd03227    1 KIVLGRFPSY--------FVPNDVTFG---------EGSLTIITGPNGSGKSTILDAIGLALGGAQSatrrrsgvkaGCI 63
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1160 VPAEVCRLtpidrVFTRLGASdrimSGEStffvELSETASILMHATAH--SLVLVDELGRGTATFDGTAIANAVVKELAE 1237
Cdd:cd03227   64 VAAVSAEL-----IFTRLQLS----GGEK----ELSALALILALASLKprPLYILDEIDRGLDPRDGQALAEAILEHLVK 130
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 4504191  1238 tiKCRTLFSTHYHSLVEDysqnvAVRLGHMACMVEN 1273
Cdd:cd03227  131 --GAQVIVITHLPELAEL-----ADKLIHIKKVITG 159
PWWP smart00293
domain with conserved PWWP motif; conservation of Pro-Trp-Trp-Pro residues
90-152 3.35e-22

domain with conserved PWWP motif; conservation of Pro-Trp-Trp-Pro residues


Pssm-ID: 214603 [Multi-domain]  Cd Length: 63  Bit Score: 91.25  E-value: 3.35e-22
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4504191       90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTrGWVSKRLLKPYT 152
Cdd:smart00293    1 FKPGDLVWAKMKGFPWWPALVISPKMTPDNImKRKSDENLYPVLFFGDKDT-AWIPSSKLFPLT 63
PWWP cd05162
PWWP (Pro-Trp-Trp-Pro) domain; The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, ...
93-182 7.57e-21

PWWP (Pro-Trp-Trp-Pro) domain; The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids and is composed of a five-stranded antiparallel beta-barrel followed by a helical region. It is found in numerous proteins that are involved in cell division, growth, and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. PWWP domains specifically recognize DNA and histone methylated lysines at the level of the nucleosome. Based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.


Pssm-ID: 438958 [Multi-domain]  Cd Length: 86  Bit Score: 87.94  E-value: 7.57e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    93 GDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVRVHVQFFDDSpTRGWVSKRLLKPYTGSKSKEAQKGGhfySAKPEI 172
Cdd:cd05162    1 GDLVWAKLKGYPWWPARVVDPEELPEEVGKKKKKGGVLVQFFGDN-DYAWVKSKNIKPFEEGFKKEFKKKK---KKSKKF 76
                         90
                 ....*....|
gi 4504191   173 LRAMQRADEA 182
Cdd:cd05162   77 KKAVEEAEEA 86
MutS_IV pfam05190
MutS family domain IV; This domain is found in proteins of the MutS family (DNA mismatch ...
932-1024 1.27e-19

MutS family domain IV; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam00488. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds in part with globular domain IV, which is involved in DNA binding, in Thermus aquaticus MutS as characterized in.


Pssm-ID: 398730 [Multi-domain]  Cd Length: 92  Bit Score: 84.97  E-value: 1.27e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     932 GFDSDYDQALADIRENEQSLLEYLEKQRNRIGCRTIVywgIGRNR---YQLEIPENFTTrNLPEEYELKSTKKGCKRYWT 1008
Cdd:pfam05190    1 GFDEELDELRDLLDELEKELEELEKKEREKLGIKSLK---VGYNKvfgYYIEVTRSEAK-KVPSNYIRRQTLKNGVRFTT 76
                           90
                   ....*....|....*.
gi 4504191    1009 KTIEKKLANLINAEER 1024
Cdd:pfam05190   77 PELKKLEDELLEAEEE 92
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
1042-1328 3.84e-18

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 90.66  E-value: 3.84e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1042 KNYKDWQSAVECIAVLDVLLCLANYSRGGDGPMcrpvILLPEDTPpfLELKGSRHPCITKtffgDDFIPNDILIGCEEEe 1121
Cdd:PRK00409  260 KNLDFLKFLNKIFDELDFIFARARYAKALKATF----PLFNDEGK--IDLRQARHPLLDG----EKVVPKDISLGFDKT- 328
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1122 qengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPA----EVCRLtpiDRVFTRLGASDRIMSGESTFFVELSET 1197
Cdd:PRK00409  329 -------VLVITGPNTGGKTVTLKTLGLAALMAKSGLPIPAnepsEIPVF---KEIFADIGDEQSIEQSLSTFSGHMTNI 398
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191   1198 ASILMHATAHSLVLVDELGRGTATFDGTAIANAVVKELAEtIKCRTLFSTHY---------HSLVEdysqNVAVrlghma 1268
Cdd:PRK00409  399 VRILEKADKNSLVLFDELGAGTDPDEGAALAISILEYLRK-RGAKIIATTHYkelkalmynREGVE----NASV------ 467
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 4504191   1269 cmvenECEDpsqETITFLYKFIKGACPKSYGFNAARLANLPEEVIQkghrKAREF-----EKMNQ 1328
Cdd:PRK00409  468 -----EFDE---ETLRPTYRLLIGIPGKSNAFEIAKRLGLPENIIE----EAKKLigedkEKLNE 520
PWWP_NSD2_rpt1 cd20162
first PWWP domain found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; ...
90-158 6.23e-18

first PWWP domain found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2, also called histone-lysine N-methyltransferase NSD2, multiple myeloma SET domain-containing protein (MMSET), or protein trithorax-5 Wolf-Hirschhorn syndrome candidate 1 protein (WHSC1), is overexpressed frequently in the t(4;14) translocation in 15% to 20% of multiple myeloma. It plays important roles in cancer cell proliferation, survival, and tumor growth, through mediating constitutive NF-kappaB signaling via the cytokine autocrine loop. It also enhances androgen receptor (AR)-mediated transcription. The principal chromatin-regulatory activity of NSD2 is dimethylation of histone H3 at lysine 36 (H3K36me2). NSD2 contains a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-prolin motif (PWWP) domains, a high mobility group (HMG) box, five PHD (plant-homeodomain) zinc finger motifs, and an NSD-specific Cys-His rich domain (C5HCH). The SET domain is responsible for histone methyltransferase activity. The PWWP, HMG, and PHD domains mediate chromatin interaction and recognition of histone marks. This model corresponds to the first PWWP domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438990  Cd Length: 128  Bit Score: 81.50  E-value: 6.23e-18
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKG--KSVR-VHVQFFDDSPTRGWVSKRLLKPYTGSKSKE 158
Cdd:cd20162    1 YNVGDLVWSKVSGYPWWPCMVSADPLLHSHTKLKGqkKSARqYHVQFFGDAPERAWIFEKSLVPFEGEGQFE 72
PWWP_ZCWPW2 cd20146
PWWP domain found in zinc finger CW-type PWWP domain protein 2 (ZCWPW2) and similar proteins; ...
90-192 2.13e-15

PWWP domain found in zinc finger CW-type PWWP domain protein 2 (ZCWPW2) and similar proteins; ZCWPW2 is a histone H3K4me3 reader. In addition to the PWWP domain, ZCWPW2 contains a zinc finger CW (zf-CW) domain that is a histone modification reader for the histone H3 tail with trimethylated K4. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438974  Cd Length: 113  Bit Score: 73.48  E-value: 2.13e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIR--EKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQKGGHFYS 167
Cdd:cd20146    9 LPLGSLVWAKMTGYPRWPAILTPDPICGEYVDydEDGEVEKYHVEFLGKPHSHAWISAKSVEPYNSNTKTPKCKTKKSKK 88
                         90       100
                 ....*....|....*....|....*
gi 4504191   168 AKPEILRAMQRADEALNKDKIKRLE 192
Cdd:cd20146   89 RKKSYESALEEAERLLKLTCEERLE 113
PWWP_NSD3_rpt1 cd20163
first PWWP domain found in nuclear SET domain-containing protein 3 (NSD3) and similar proteins; ...
90-192 1.81e-13

first PWWP domain found in nuclear SET domain-containing protein 3 (NSD3) and similar proteins; NSD3, also called histone-lysine N-methyltransferase NSD3, protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, or Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1-like protein 1, or WHSC1L1), is a lysine methyltransferase encoded by gene NSD3, which is amplified in human breast cancer cell lines. Moreover, translocation resulting in NUP98 fusion to NSD3 leads to the development of acute myeloid leukemia. NSD3 contains a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-prolin motif (PWWP) domains, five plant-homeodomain (PHD) zinc finger motifs, and an NSD-specific Cys-His rich domain (C5HCH). The SET domain is responsible for histone methyltransferase activity. The PWWP and PHD domains are involved in protein-protein interactions. This model corresponds to the first PWWP domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438991  Cd Length: 130  Bit Score: 68.81  E-value: 1.81e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVR-VHVQFFDDSPTRGWVSKRLLKPYTGSKS-----KEAQKGG 163
Cdd:cd20163    1 FQVGDLVWSKVGTYPWWPCMVSSDPQLEVHTKINTRGAReYHVQFFSSQPERAWVHEKRVREYKGHKQyeellAEATKQA 80
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 4504191   164 HFYSAKPEILRAM-QR-----------ADEALNKDKIKRLE 192
Cdd:cd20163   81 SNHSEKQKIRKPRpQReraqwdigiahAEKALKMTREERIE 121
PWWP_BRPF cd05839
PWWP domain found in the bromodomain and PHD finger-containing (BRPF) protein family; The BRPF ...
90-150 8.54e-11

PWWP domain found in the bromodomain and PHD finger-containing (BRPF) protein family; The BRPF family of proteins includes BRPF1, BRD1/BRPF2, and BRPF3. They are scaffold proteins that form monocytic leukemic zinc-finger protein (MOZ)/MOZ-related factor (MORF) H3 histone acetyltransferase (HAT) complexes with other regulatory subunits, such as inhibitor of growth 5 (ING5) and Esa1-associated factor 6 ortholog (EAF6). BRPF proteins have multiple domains, including a plant homeodomain (PHD) zinc finger followed by a non-canonical extended PHD (ePHD) finger, C2HC5HC2H, a bromodomain, and a proline-tryptophan-tryptophan-proline (PWWP) domain. The PHD finger binds to lysine 4 of histone H3 (K4H3), the bromodomain interacts with acetylated lysines on N-terminal tails of histones and other proteins, and the PWWP domain shows histone-binding and chromatin association properties.


Pssm-ID: 438964  Cd Length: 106  Bit Score: 59.97  E-value: 8.54e-11
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYN--HPFDGTFI---------REKGKSVRVHVQFFDDSPTRGWVSKRLLKP 150
Cdd:cd05839    1 LEPGDLVWAKCRGYPWYPAEIVDpkDPKEGNGVpipvppdrvLKKSNEKLYLVLFFDAKRTWGWLPRNKLRP 72
PWWP_MUM1-like cd06080
PWWP domain found in mutated melanoma-associated antigen 1 (MUM-1) and similar proteins; MUM-1, ...
90-185 1.44e-10

PWWP domain found in mutated melanoma-associated antigen 1 (MUM-1) and similar proteins; MUM-1, also called PWWP domain-containing DNA repair factor 3A (PWWP3A), or protein expandere (EXPAND1), is a melanoma-associated antigen (MAA) that belongs to the mutated or aberrantly expressed type of MAAs, along with antigens such as CDK4, beta-catenin, gp100-in4, p15, and N-acetylglucosaminyltransferase V. It is highly expressed in several types of human cancers and involved in the DNA damage response pathway by contributing to the maintenance of chromatin architecture. MUM-1 is recruited to the vicinity of DNA breaks by TP53BP1 and plays an accessory role to facilitate damage-induced chromatin changes and promoting chromatin relaxation. It is required for efficient DNA repair and cell survival following DNA damage. This subfamily also includes mutated melanoma-associated antigen 1-like protein 1 (MUM1L1), also called MUM1-like protein 1, or PWWP domain-containing DNA repair factor 3B (PWWP3B). Its biological function remains unclear. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438967  Cd Length: 90  Bit Score: 58.80  E-value: 1.44e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNhpfdgtfIREKGKsvRVHVQFFDDSPTRGW--VSKRLLKPYTGSKSKEAQKGGHFYS 167
Cdd:cd06080    1 FSKGDIVWAKYRKYPYWPAVVKS-------VYKKPK--KASVLFLELPPEKKGikVSLKKLKPFDCKEKEELLEEGKESP 71
                         90
                 ....*....|....*...
gi 4504191   168 AKPEILRAMQRADEALNK 185
Cdd:cd06080   72 YSEDFKEAVELAEDYLIK 89
MutS_II pfam05188
MutS domain II; This domain is found in proteins of the MutS family (DNA mismatch repair ...
538-692 1.84e-10

MutS domain II; This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam01624, pfam05192 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. This domain corresponds to domain II in Thermus aquaticus MutS as characterized in, and has similarity resembles RNAse-H-like domains (see pfam00075).


Pssm-ID: 398728 [Multi-domain]  Cd Length: 133  Bit Score: 60.06  E-value: 1.84e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191     538 YLLSLKEKEEDSsghtraYGVCFVDTSLGKFFIGQFSDdrhCSRFRTLVAHYPPVQVLFEKgNLSKETKTILKSSLSCSL 617
Cdd:pfam05188    2 YLAAISRGDGNR------YGLAFLDLSTGEFGVSEFED---FEELLAELSRLSPKELLLPE-SLSSSTVAESQKLLELRL 71
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 4504191     618 qeGLIPGSQFWDASKTLRTLLEEEYFREklsdgigvmlpqVLKGMTSesdsigltpgEKSELALSALGGCVFYLK 692
Cdd:pfam05188   72 --RVGRRPTWLFELEHAYEDLNEDFGVE------------DLDGFGL----------EELPLALCAAGALISYLK 122
PWWP_NSD1_rpt1 cd20161
first PWWP domain found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and ...
90-188 2.15e-10

first PWWP domain found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1, also called H3 Lysine-36 and H4 Lysine-20 specific histone-lysine N-methyltransferase, androgen receptor coactivator 267 kDa protein, androgen receptor-associated protein of 267 kDa, H3-K36-HMTase H4-K20-HMTase, Lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein, is a lysine methyltransferase that preferentially methylates H3 on Lysine36 (H3-K36) and H4 on Lysine20 (H4-K20), which is primarily associated with active transcription. It plays a role in several pathologies, including but not limited to Sotos and Weaver syndromes, acute myeloid leukemia, breast cancer, neuroblastoma, and glioblastoma formation. It can alter transcription by interacting with the protein NSD1-interacting zinc finger protein 1 (NIZP1). It also mitigates caspase-1 activation by listeriolysin o (LLO) in macrophages, and requires functional LLO for the regulation of IL-1beta secretion. Moreover, NSD1 regulates RNA polymerase II (RNAP II) recruitment to bone morphogenetic protein 4 (BMP4). NSD1 contains a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-proline (PWWP) domains, five plant homeodomain (PHD) fingers, and an NSD-specific Cys-His rich domain (C5HCH). The SET domain is responsible for histone methyltransferase activity. The PWWP and PHD domains are involved in protein-protein interactions. The model corresponds to the first PWWP domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438989  Cd Length: 116  Bit Score: 59.41  E-value: 2.15e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKGKSVR----VHVQFFDDSPTRGWVSKRLLKPYtgskskeaqKGGHF 165
Cdd:cd20161    4 YEVGDLVWAKFSRRPWWPCRICADPLLDTHSKMKVPSRRpcrqYYVETLGELTEKAWVAAKAVVPF---------EGRHQ 74
                         90       100
                 ....*....|....*....|...
gi 4504191   166 YSAKPeILRAMQRADEALNKDKI 188
Cdd:cd20161   75 FEELP-VLRRRGKQKEKDYKHKI 96
ABC_ATPase cd00267
ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large ...
1104-1262 3.49e-10

ATP-binding cassette transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins.


Pssm-ID: 213179 [Multi-domain]  Cd Length: 157  Bit Score: 59.95  E-value: 3.49e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191  1104 FGDDFIPNDILIGCEEEEqengkayCVLVTGPNMGGKSTLMRQAGLLAVMAQMGCYVPAEVCRLTPIDRVFTRLGASDRI 1183
Cdd:cd00267    9 YGGRTALDNVSLTLKAGE-------IVALVGPNGSGKSTLLRAIAGLLKPTSGEILIDGKDIAKLPLEELRRRIGYVPQL 81
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 4504191  1184 MSGEStffvELSETASILMHATahSLVLVDELGRGTATFDGTAIANAvVKELAETiKCRTLFSTHYHSLVEDYSQNVAV 1262
Cdd:cd00267   82 SGGQR----QRVALARALLLNP--DLLLLDEPTSGLDPASRERLLEL-LRELAEE-GRTVIIVTHDPELAELAADRVIV 152
PWWP_GLYR1 cd05836
PWWP domain found in glyoxylate reductase 1 (GLYR1) and similar proteins; GLYR1, also called ...
90-172 8.38e-10

PWWP domain found in glyoxylate reductase 1 (GLYR1) and similar proteins; GLYR1, also called 3-hydroxyisobutyrate dehydrogenase-like protein, cytokine-like nuclear factor N-PAC, nuclear protein NP60, or nuclear protein of 60 kDa, is a putative oxidoreductase that is recruited on chromatin and promotes KDM1B demethylase activity. It recognizes and binds trimethylated 'Lys-36' of histone H3 (H3K36me3). GLYR1 enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha. In addition to the PWWP domain, GLYR1 also contains an AT-hook and a C-terminal NAD-binding domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438961 [Multi-domain]  Cd Length: 86  Bit Score: 56.84  E-value: 8.38e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDgtfIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQ--KGGHFYS 167
Cdd:cd05836    1 FKIGDLVWAKMKGFPPWPGKIVNPPPD---LKKPPRKKKMHCVYFFGSENYAWIEDENIKPYEEFKEEMLKskKSAGFKD 77

                 ....*
gi 4504191   168 AKPEI 172
Cdd:cd05836   78 AVEAI 82
PWWP_HRP cd05834
PWWP domain found in hepatoma-derived growth factor (HDGF)-related protein (HRP) family; The ...
90-172 4.14e-09

PWWP domain found in hepatoma-derived growth factor (HDGF)-related protein (HRP) family; The HRP family includes hepatoma-derived growth factor (HDGF), and HDGF-related proteins (HRPs). HDGF, also called high mobility group protein 1-like 2 (HMG-1L2), is a heparin-binding protein that acts as a transcriptional repressor with mitogenic activity for fibroblasts. It is a prognostic factor in several types of cancer. HDGFL1 is also called PWWP domain-containing protein 1 (PWWP1). Its biological function remains unclear. HDGFL2, also called HDGF-related protein 2 (HRP-2), or hepatoma-derived growth factor 2 (HDGF-2), is involved in cellular growth control, through the regulation of cyclin D1 expression. HDGFL3, also called HDGF-related protein 3 (HRP-3), enhances DNA synthesis and may play a role in cell proliferation. The family also includes PC4 and SFRS1-interacting protein (PSIP) and similar proteins. PSIP, also called CLL-associated antigen KW-7, dense fine speckles 70 kDa protein (DFS 70), lens epithelium-derived growth factor (LEDGF), or transcriptional coactivator p75/p52, acts as a transcriptional coactivator involved in neuroepithelial stem cell differentiation and neurogenesis. Members of the HRP family contains a PWWP domain, which is necessary for DNA binding.


Pssm-ID: 438959 [Multi-domain]  Cd Length: 82  Bit Score: 54.48  E-value: 4.14e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTFIREKgksvrVHVQFFDDSPTrGWVSKRLLKPYTGSKSK--EAQKGGHFYS 167
Cdd:cd05834    1 FKPGDLVFAKVKGYPPWPARIDEIPEGAKIPKNK-----YPVFFYGTHET-AFLKPKDLFPYEENKEKygKPRKRKGFNE 74

                 ....*
gi 4504191   168 AKPEI 172
Cdd:cd05834   75 GLWEI 79
PWWP_ScIOC4-like cd05840
PWWP domain found in Saccharomyces cerevisiae ISWI one complex protein 4 (ScIOC4) and similar ...
93-194 8.30e-09

PWWP domain found in Saccharomyces cerevisiae ISWI one complex protein 4 (ScIOC4) and similar proteins; ScIOC4 functions as a component of the ISW1B complex, which acts in remodeling the chromatin by catalyzing an ATP-dependent alteration in the structure of nucleosomal DNA. The ISW1B complex acts within coding regions to control the amount of RNA polymerase II released into productive elongation and to coordinate elongation with termination and pre-mRNA processing. The family also includes Schizosaccharomyces pombe PWWP domain-containing proteins 1 and 2 (SpPDP1 and SpPDP2). SpPDP1 associates with Set9 to regulate its chromatin localization and methyltransferase activity towards H4K20. Members of this family contain a PWWP domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438965  Cd Length: 94  Bit Score: 54.23  E-value: 8.30e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    93 GDLVWAKMEGYPWWPCLVYNH------PFDGTFIREKGKSVRVHVQFFDDsPTRGWVSKRLLKPYTgskskeaqkgghfy 166
Cdd:cd05840    1 GDLVLAKVKGYPPWPAMVLPEellpknVLKAKKRKPKSKKTVYPVQFFPD-NEYYWVSPSSLKPLT-------------- 65
                         90       100
                 ....*....|....*....|....*...
gi 4504191   167 sakPEILRAMQRADEALNKDKIKRLELA 194
Cdd:cd05840   66 ---KEEIDKFLSKSKRKNKDLIEAYEVA 90
PWWP_BS69 cd20159
PWWP domain found in protein BS69 and similar proteins; Protein BS69, also called zinc finger ...
88-161 2.39e-07

PWWP domain found in protein BS69 and similar proteins; Protein BS69, also called zinc finger MYND domain-containing protein 11 (ZMYND11 or ZMY11), is a ubiquitously expressed nuclear protein acting as a transcriptional co-repressor in association with various transcription factors. It was originally identified as an adenovirus 5 E1A-binding protein that inhibits E1A transactivation, as well as c-Myb transcription. It also mediates repression, at least in part, through interaction with the co-repressor N-CoR. Moreover, it interacts with Toll-interleukin 1 receptor domain (TIR)-containing adaptor molecule-1 (TICAM-1, also named TRIF) to facilitate NF-kappaB activation and type I IFN induction. It associates with PIAS1, a SUMO E3 enzyme, and Ubc9, a SUMO E2 enzyme, and plays an inhibitory role in muscle and neuronal differentiation. Moreover, BS69 regulates Epstein-Barr virus (EBV) latent membrane protein 1 (LMP1)/C-terminal activation region 2 (CTAR2)-mediated NF-kappaB activation by interfering with the complex formation between TNFR-associated death domain protein (TRADD) and LMP1/CTAR2. It also cooperates with tumor necrosis factor receptor (TNFR)-associated factor 3 (TRAF3) in the regulation of EBV-derived LMP1/CTAR1-induced NF-kappaB activation. Furthermore, BS69 is involved in the p53-p21Cip1-mediated senescence pathway. BS69 contains a plant homeodomain (PHD) finger, a bromodomain, a proline-tryptophan-tryptophan-proline (PWWP) domain, and a MYeloid translocation protein 8, Nervy and DEAF-1 (MYND) domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438987  Cd Length: 85  Bit Score: 49.52  E-value: 2.39e-07
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 4504191    88 CDFsPGDLVWAKMEGYPWWPCLVynhpfdgtfIREKGKsvRVHVQFFDDSPTRGWVSKRLLKPYTGSKSKEAQK 161
Cdd:cd20159    3 CRP-PHELVWAKQKGFPYWPAKV---------IQKEDN--QYDVRFFGGHHQRAWIPKENIKPITTSPKQLKVK 64
PWWP_AtATX3-like cd20143
PWWP domain found in Arabidopsis thaliana histone-lysine N-methyltransferase trithorax-like ...
93-181 3.87e-07

PWWP domain found in Arabidopsis thaliana histone-lysine N-methyltransferase trithorax-like protein ATX3, ATX4, ATX5, and similar proteins; The family includes A. thaliana ATX3 (also called protein SET domain group 14, or trithorax-homolog protein 3), ATX4 (also called protein SET domain group 16, or trithorax-homolog protein 4) and ATX5 (also called protein SET domain group 29, or trithorax-homolog protein 5), which belong to the histone-lysine methyltransferase family. They show distinct phylogenetic origins from the family of ATX1 and ATX2. They are multi-domain containing protein that consists of an N-terminal PWWP domain, a canonical plant homeodomain (PHD) domain, a non-canonical extended PHD (ePHD) domain, and a C-terminal SET domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438971 [Multi-domain]  Cd Length: 100  Bit Score: 49.68  E-value: 3.87e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    93 GDLVWAKMEGYPWWPCLVYnHPFdgtfirEKGKSVRVH-------VQFFDDSPTR--GWVSKRLLKPYT-------GSKS 156
Cdd:cd20143    3 GDLVWAKVGTHPFWPARVV-EPA------EQAEEVRRRcvpgslcVYFFGPGGSRdyGWVRRSMIFPFTddlarfqTQKI 75
                         90       100
                 ....*....|....*....|....*
gi 4504191   157 KEAQKGGHFYSAKPEILRAMQRADE 181
Cdd:cd20143   76 KNKKRPQEFQEALEEAKLADAGFEE 100
PWWP_AtATX1-like cd20142
PWWP domain found in Arabidopsis thaliana histone-lysine N-methyltransferase trithorax-like ...
91-134 7.32e-07

PWWP domain found in Arabidopsis thaliana histone-lysine N-methyltransferase trithorax-like proteins ATX1, ATX2, and similar proteins; This family includes A. thaliana ATX1 and ATX2, which are sister paralogs originating from a segmental chromosomal duplication. They are plant counterparts of the Drosophila melanogaster trithorax (TRX) and mammalian mixed-lineage leukemia (MLL1) proteins. ATX1, also called protein SET domain group 27, or trithorax-homolog protein 1 (TRX-homolog protein 1), is a methyltransferase that trimethylates histone H3 at lysine 4 (H3K4me3). It also acts as a histone modifier and as a positive effector of gene expression. ATX1 regulates transcription from diverse classes of genes implicated in biotic and abiotic stress responses. It is involved in dehydration stress signaling in both abscisic acid (ABA)-dependent and ABA-independent pathways. ATX2, also called protein SET domain group 30, or trithorax-homolog protein 2 (TRX-homolog protein 2), is involved in dimethylating histone H3 at lysine 4 (H3K4me2). Both ATX1 and ATX2 are multi-domain containing proteins that consist of an N-terminal PWWP domain, FYRN- and FYRC (DAST, domain associated with SET in trithorax) domains, a canonical plant homeodomain (PHD) domain, a non-canonical extended PHD (ePHD) domain, and a C-terminal SET domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438970 [Multi-domain]  Cd Length: 97  Bit Score: 48.50  E-value: 7.32e-07
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*..
gi 4504191    91 SPGDLVWAKMEGYPWWPCLVYN--HPFD-GTFIREKGKSVRVHVQFF 134
Cdd:cd20142    1 SPGDVVWAKVKGYPMWPALVIDeeHAERcGLEANRPGKKGTVPVQFF 47
PWWP_PWWP2A cd20152
PWWP domain found in PWWP domain-containing protein 2A (PWWP2A); PWWP2A is a H2A.Z-specific ...
68-151 2.50e-06

PWWP domain found in PWWP domain-containing protein 2A (PWWP2A); PWWP2A is a H2A.Z-specific chromatin binding protein which may play an important role in the neural crest stem cell migration and differentiation during early development. It is also required for proper mitosis progression. PWWP2A and its paralog PWWP2B form a stable complex with NuRD subunits MTA1/2/3:HDAC1/2:RBBP4/7, but not with MBD2/3, p66alpha/beta, and CHD3/4. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438980  Cd Length: 122  Bit Score: 48.08  E-value: 2.50e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    68 KAKNLNGGLRRSVAPAAPTSCdfsPGDLVWAKMEGYPWWPC----LVYNHPFDGTFIREKGKsvrvhVQFFdDSPTRGWV 143
Cdd:cd20152    1 KGKVFSKNVSKCVTPDGRTIC---VGDIVWAKIYGFPWWPArilaITVSRKDNGLLVRQEAR-----ISWF-GSPTTSFL 71

                 ....*...
gi 4504191   144 SKRLLKPY 151
Cdd:cd20152   72 ALSQLAPF 79
PWWP_PSIP cd20151
PWWP domain found in PC4 and SFRS1-interacting protein (PSIP); PSIP, also called ...
90-160 4.10e-06

PWWP domain found in PC4 and SFRS1-interacting protein (PSIP); PSIP, also called CLL-associated antigen KW-7, dense fine speckles 70 kDa protein (DFS 70), lens epithelium-derived growth factor (LEDGF), or transcriptional coactivator p75/p52, acts as a transcriptional coactivator involved in neuroepithelial stem cell differentiation and neurogenesis. It contains a PWWP domain, which is necessary for DNA binding.


Pssm-ID: 438979 [Multi-domain]  Cd Length: 88  Bit Score: 46.52  E-value: 4.10e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPfDGTFireKGKSVRVHVQFFDDSPTrGWVSKRLLKPYTGSKSKEAQ 160
Cdd:cd20151    1 FKPGDLIFAKMKGYPHWPARVDEVP-DGAV---KPPTNKLPIFFFGTHET-AFLGPKDIFPYSENKEKYGK 66
PWWP_HDGF cd20148
PWWP domain found in Hepatoma-derived growth factor (HDGF); HDGF, also called high mobility ...
90-157 1.35e-05

PWWP domain found in Hepatoma-derived growth factor (HDGF); HDGF, also called high mobility group protein 1-like 2 (HMG-1L2), is a heparin-binding protein that acts as a transcriptional repressor with mitogenic activity for fibroblasts. It contains a PWWP domain, which is necessary for DNA binding.


Pssm-ID: 438976 [Multi-domain]  Cd Length: 87  Bit Score: 45.01  E-value: 1.35e-05
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 4504191    90 FSPGDLVWAKMEGYPWWPCLVYNHPFDGTfireKGKSVRVHVQFFDDSPTRGWVSKRLLkPYTGSKSK 157
Cdd:cd20148    1 YKCGDLVFAKMKGYPHWPARIDEMPEAAV----KSTANKYQVFFFGTHETAFLGPKDLF-PYEESKEK 63
PWWP_DNMT3 cd05835
PWWP domain found in the DNA (cytosine-5)-methyltransferase 3 (DNMT3) family; The DNMT3 family ...
92-113 1.41e-05

PWWP domain found in the DNA (cytosine-5)-methyltransferase 3 (DNMT3) family; The DNMT3 family includes DNMT3A and DNMT3B, which are required for genome-wide de novo methylation and is essential for the establishment of DNA methylation patterns during development. DNMT3A, also called DNA methyltransferase HsaIIIA, DNA MTase HsaIIIA, or M.HsaIIIA, modifies DNA in a non-processive manner and also methylates non-CpG sites. It may preferentially methylate DNA linker between 2 nucleosomal cores and is inhibited by histone H1. DNMT3A is recruited to trimethylated 'Lys-36' of histone H3 (H3K36me3) sites. DNMT3B, also called DNA methyltransferase HsaIIIB, DNA MTase HsaIIIB, or M.HsaIIIB, may preferentially methylate nucleosomal DNA within the nucleosome core region. DNMT3B may function as a transcriptional co-repressor by associating with CBX4 and independently of DNA methylation. Members of this family contains a PWWP domain that is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by DNMT3B is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of DNMT3B has been identified in patients with ICF (immunodeficiency, centromeric instability, and facial anomalie) syndrome , a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA.


Pssm-ID: 438960 [Multi-domain]  Cd Length: 89  Bit Score: 44.94  E-value: 1.41e-05
                         10        20
                 ....*....|....*....|..
gi 4504191    92 PGDLVWAKMEGYPWWPCLVYNH 113
Cdd:cd05835    2 IGDLVWAKLKGSPWWPGIVVSH 23
PWWP_NSD_rpt2 cd05838
second PWWP domain found in nuclear receptor-binding SET domain-containing (NSD) proteins; The ...
93-182 1.87e-05

second PWWP domain found in nuclear receptor-binding SET domain-containing (NSD) proteins; The nuclear receptor binding SET domain (NSD) protein family consists of three HMTases, NSD1, NSD2/MMSET/WHSC1, and NSD3/WHSC1L1, that are critical in maintaining chromatin integrity. Reducing NSD activity through specific lysine-HMTase inhibitors appears promising in suppressing cancer growth. NSD proteins have specific mono- and dimethylase activities for H3K36, and they play nonredundant roles during development. NSD1 plays a role in several pathologies, including but not limited to Sotos and Weaver syndromes, acute myeloid leukemia, breast cancer, neuroblastoma, and glioblastoma formation. NSD2 is involved in cancer cell proliferation, survival, and tumor growth, by mediating constitutive NF-kappaB signaling via the cytokine autocrine loop. NSD3 is amplified in human breast cancer cell lines. Moreover, translocation resulting in NUP98 fusion to NSD3 leads to the development of acute myeloid leukemia. NSD proteins contain a catalytic suppressor of variegation, enhancer of zeste and trithorax (SET) domain, two proline-tryptophan-tryptophan-proline (PWWP) domains, five plant homeodomain (PHD) fingers, and an NSD-specific Cys-His rich domain (C5HCH). This model corresponds to the second PWWP domain. The family also includes Drosophila melanogaster maternal-effect sterile 4 (dMes4) that may act as a histone-lysine N-methyltransferase required for wing morphogenesis. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438963 [Multi-domain]  Cd Length: 96  Bit Score: 44.54  E-value: 1.87e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    93 GDLVWAKMEGYPWWPCLVYNH-----------PFDGTFIrekgksvrvhVQFF--DDSptrGWVSKRLLKPYTgsKSKEA 159
Cdd:cd05838    3 GDIVWVKLGNYRWWPAEILHPrevpdniqslpHPPGEFP----------VRFFgsHDY---YWVHRGRVFLFE--EGDKG 67
                         90       100
                 ....*....|....*....|...
gi 4504191   160 QKGGHFYSAKPEILRAMQRADEA 182
Cdd:cd05838   68 SKEKSKKSLDKSFKRALKEANEA 90
PWWP_PRKCBP1 cd20160
PWWP domain found in protein kinase C-binding protein 1 (PRKCBP1) and similar proteins; ...
95-143 6.02e-05

PWWP domain found in protein kinase C-binding protein 1 (PRKCBP1) and similar proteins; PRKCBP1, also called cutaneous T-cell lymphoma-associated antigen se14-3 (CTCL-associated antigen se14-3), Rack7, or zinc finger MYND domain-containing protein 8 (ZMYND8), is a novel receptor for activated C-kinase (RACK)-like protein that may play an important role in the activation and regulation of PKC-beta I, and the PKC signaling cascade. It also has been identified as a formin homology-2-domain containing protein 1 (FHOD1)-binding protein that may be involved in FHOD1-regulated actin polymerization and transcription. Moreover, PRKCBP1 may function as a REST co-repressor 2 (RCOR2) interacting factor. They form a RCOR2/ZMYND8 complex which might be involved in the regulation of neural differentiation. PRKCBP1 contains a plant homeodomain (PHD) finger, a bromodomain, and a proline-tryptophan-tryptophan-proline (PWWP) domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438988  Cd Length: 91  Bit Score: 42.94  E-value: 6.02e-05
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*....
gi 4504191    95 LVWAKMEGYPWWPCLVYnhpfdgtfireKGKSVRVHVQFFDDSPtRGWV 143
Cdd:cd20160    9 LVWAKLKGFPFWPAKAL-----------RVNNGQVDVRFFGAHD-RAWV 45
PWWP_PWWP2 cd20140
PWWP domain found in the PWWP domain-containing protein 2 (PWWP2) family; The PWWP2 family ...
93-110 6.49e-05

PWWP domain found in the PWWP domain-containing protein 2 (PWWP2) family; The PWWP2 family includes PWWP2A and its paralog PWWP2B. PWWP2A is a H2A.Z-specific chromatin binding protein which may play an important role in the neural crest stem cell migration and differentiation during early development. It is also required for proper mitosis progression. PWWP2A and PWWP2B form a stable complex with NuRD subunits MTA1/2/3:HDAC1/2:RBBP4/7, but not with MBD2/3, p66alpha/beta, and CHD3/4. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438968  Cd Length: 92  Bit Score: 43.02  E-value: 6.49e-05
                         10
                 ....*....|....*...
gi 4504191    93 GDLVWAKMEGYPWWPCLV 110
Cdd:cd20140    7 GDIVWGKIHGFPWWPGRI 24
PWWP_HDGFL2 cd20149
PWWP domain found in Hepatoma-derived growth factor-related protein 2 (HDGFL2); HDGFL2, also ...
90-107 1.46e-04

PWWP domain found in Hepatoma-derived growth factor-related protein 2 (HDGFL2); HDGFL2, also called HDGF-related protein 2 (HRP-2), or Hepatoma-derived growth factor 2 (HDGF-2), is involved in cellular growth control, through the regulation of cyclin D1 expression. It contains a PWWP domain, which is necessary for DNA binding.


Pssm-ID: 438977 [Multi-domain]  Cd Length: 84  Bit Score: 41.81  E-value: 1.46e-04
                         10
                 ....*....|....*...
gi 4504191    90 FSPGDLVWAKMEGYPWWP 107
Cdd:cd20149    1 FKPGDLVFAKMKGYPHWP 18
PWWP_HDGFL3 cd20150
PWWP domain found in Hepatoma-derived growth factor-related protein 3 (HDGFL3); HDGFL3, also ...
85-157 1.41e-03

PWWP domain found in Hepatoma-derived growth factor-related protein 3 (HDGFL3); HDGFL3, also called HDGF-related protein 3 (HRP-3), is the only hepatoma-derived growth factor (HDGF)-related protein (HRP) family member whose expression is almost restricted to the nervous tissue. It enhances DNA synthesis and may play a role in cell proliferation. It contains a PWWP domain, which is necessary for DNA binding.


Pssm-ID: 438978 [Multi-domain]  Cd Length: 93  Bit Score: 39.27  E-value: 1.41e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 4504191    85 PTSCDFSPGDLVWAKMEGYPWWPCLVYNHPfDGTFireKGKSVRVHVQFFDDSPTrGWVSKRLLKPYTGSKSK 157
Cdd:cd20150    1 PRPREYKAGDLVFAKMKGYPHWPARIDELP-EGAV---KPPANKYPIFFFGTHET-AFLGPKDLFPYKEYKDK 68
PWWP_BS69-like cd05841
PWWP domain found in protein BS69, protein kinase C-binding protein 1 (PRKCBP1) and similar ...
95-144 2.06e-03

PWWP domain found in protein BS69, protein kinase C-binding protein 1 (PRKCBP1) and similar proteins; Protein BS69, also called zinc finger MYND domain-containing protein 11 (ZMYND11 or ZMY11), is a ubiquitously expressed nuclear protein acting as a transcriptional co-repressor in association with various transcription factors. It was originally identified as an adenovirus 5 E1A-binding protein that inhibits E1A transactivation, as well as c-Myb transcription. BS69 is a multi-domain protein, containing bromo, plant homeodomain (PHD), proline-tryptophan-tryptophan-proline (PWWP), and MYeloid translocation protein 8, Nervy and DEAF-1 (MYND) domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. PRKCBP1, also called cutaneous T-cell lymphoma-associated antigen se14-3 (CTCL-associated antigen se14-3), Rack7, or zinc finger MYND domain-containing protein 8 (ZMYND8), is a novel receptor for activated C-kinase (RACK)-like protein that may play an important role in the activation and regulation of PKC-beta I, and the PKC signaling cascade. It also has been identified as a formin homology-2-domain containing protein 1 (FHOD1)-binding protein that may be involved in FHOD1-regulated actin polymerization and transcription. Moreover, PRKCBP1 may function as a REST co-repressor 2 (RCOR2) interacting factor. They form a RCOR2/ZMYND8 complex which might be involved in the regulation of neural differentiation. PRKCBP1 contains a PHD finger, a bromodomain, and a PWWP domain. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438966  Cd Length: 89  Bit Score: 38.53  E-value: 2.06e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|
gi 4504191    95 LVWAKMEGYPWWPclvynhpfdGTFIREKGKSVRvhVQFFDDSpTRGWVS 144
Cdd:cd05841    9 LVWVKLDGFPFWP---------AKVMGTKDGQVD--VRFFGDY-DRAWLP 46
PWWP_HULK cd20147
PWWP domain found in Arabidopsis thaliana protein HUA2-LIKE (HULK) family; The HULK family ...
93-172 2.36e-03

PWWP domain found in Arabidopsis thaliana protein HUA2-LIKE (HULK) family; The HULK family includes HUA2-like proteins 1-3 (HULK1-3), which are probable transcription factors that act with partial redundancy with each other. They may play diverse and essential roles in the control of plant development, physiology and flowering time. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438975 [Multi-domain]  Cd Length: 92  Bit Score: 38.62  E-value: 2.36e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 4504191    93 GDLVWAKMEGYPWWPCLVyNHPFDGTFIREKGKsvrVHVQFFdDSPTRGWVSKRLLKPYTG-------SKSKEAQKGGHF 165
Cdd:cd20147    1 GDLVLAKVKGFPAWPAQV-SEPEDWGSAPDPKK---VFVHFF-GTQQIGFCNPGELSEFTEeikqsllARTLKKKKGSDF 75

                 ....*..
gi 4504191   166 YSAKPEI 172
Cdd:cd20147   76 SRAVKEI 82
PWWP_MBD5 cd20141
PWWP domain found in methyl-CpG-binding domain protein 5 (MBD5) and similar proteins; MBD5, is ...
90-136 2.71e-03

PWWP domain found in methyl-CpG-binding domain protein 5 (MBD5) and similar proteins; MBD5, is a methyl-CpG-binding protein that binds to heterochromatin. It does not interact with either methylated or unmethylated DNA. MBD5 acts as a transcriptional regulator responsible for 2q23.1 deletion syndrome. It belongs to the MBD family proteins, which play central roles in transcriptional regulation and development. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438969  Cd Length: 92  Bit Score: 38.45  E-value: 2.71e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|.
gi 4504191    90 FSPGDLVWAKMEGYPWWPclvynhpfdGTFIRE----KGKSVRVHVQFFDD 136
Cdd:cd20141    1 FNVGDLVWGQIRGFPSWP---------GKLVSEndvgKTNEGKVWVSWFGD 42
PWWP_DNMT3B cd20155
PWWP domain found in DNA (cytosine-5)-methyltransferase 3B (DNMT3B); DNMT3B, also called DNA ...
93-136 3.81e-03

PWWP domain found in DNA (cytosine-5)-methyltransferase 3B (DNMT3B); DNMT3B, also called DNA methyltransferase HsaIIIB, DNA MTase HsaIIIB, or M.HsaIIIB, is required for genome-wide de novo methylation and is essential for the establishment of DNA methylation patterns during development. It may preferentially methylate nucleosomal DNA within the nucleosome core region. DNMT3B may function as a transcriptional co-repressor by associating with CBX4 and independently of DNA methylation. In tumorigenesis, DNA methylation by DNMT3B is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of DNMT3B has been identified in patients with ICF (immunodeficiency, centromeric instability, and facial anomalies) syndrome, a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA.


Pssm-ID: 438983  Cd Length: 117  Bit Score: 38.70  E-value: 3.81e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 4504191    93 GDLVWAKMEGYPWWPCLVYNHpfdgtfiREKGKSVRVH----VQFFDD 136
Cdd:cd20155    3 GELVWGKIKGFSWWPAMVVSW-------RATGKRQASSgmrwLQWFGD 43
PWWP_PWWP2B cd20153
PWWP domain found in PWWP domain-containing protein 2B (PWWP2B); PWWP2B is a paralog of PWWP2A, ...
91-110 5.80e-03

PWWP domain found in PWWP domain-containing protein 2B (PWWP2B); PWWP2B is a paralog of PWWP2A, a H2A.Z-specific chromatin binding protein which may play an important role in the neural crest stem cell migration and differentiation during early development. PWWP2A and PWWP2B form a stable complex with NuRD subunits MTA1/2/3:HDAC1/2:RBBP4/7, but not with MBD2/3, p66alpha/beta, and CHD3/4. The PWWP domain specifically recognizes DNA and histone methylated lysines.


Pssm-ID: 438981  Cd Length: 116  Bit Score: 38.01  E-value: 5.80e-03
                         10        20
                 ....*....|....*....|
gi 4504191    91 SPGDLVWAKMEGYPWWPCLV 110
Cdd:cd20153   15 SVGDIVWGKIHGFPWWPARV 34
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH