dCas9-BFP-DNMT3A [Cloning vector p188_pCCL-PGK-dCas9-BFP-DNMT3A]
Protein Classification
List of domain hits
Name | Accession | Description | Interval | E-value
cas_Csn1 | TIGR01865 | CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1; CRISPR loci appear to be mobile ... | 4-1050 | 0e+00
CRISPR subtype II/NMENI RNA-guided endonuclease Cas9/Csn1; CRISPR loci appear to be mobile elements with a wide host range. This model represents a protein found only in CRISPR-containing species, near other CRISPR-associated proteins (cas), as part of the NMENI subtype of CRISPR/Cas locus. The species range so far for this protein is animal pathogens and commensals only.
Pssm-ID: 273840
Cd Length: 805
Bit Score: 895.63
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 4 K Y SI GL A IG TN SVGWA VIT D E YKVP SK K FKVL G N tdrhsi KK N LI GA L L FDS GETA E - AT RL K R T ARRR YT RRK N R ICY L 82
Cdd:TIGR01865 1 E Y IL GL D IG IA SVGWA IVE D D YKVP AA K RLID G G ------ VR N FT GA E L PKT GETA A l DR RL A R G ARRR IR RRK H R LLR L 74
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 83 QE I FS N E MAKV D DS FF H RLE E SFLVEEDK KH erhpifgnivdevayhekyp TIYHLRK KLVDSTD K A D L rl I YLAL A H M I 162
Cdd:TIGR01865 75 QE L FS R E GSLT D FD FF S RLE N SFLVEEDK RN -------------------- TIYHLRK AALENKL K P D E -- L YLAL L H I I 132
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 163 K F RGHFLIE gdlnpdnsdvdklfiqlvqtynqlfeenpinasgvdakailsarlsksrrlenliaqlpgekknglfgnli 242
Cdd:TIGR01865 133 K H RGHFLIE ----------------------------------------------------------------------- 141
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 243 alslgltpnfksnfdlaedaklqlskdty DD D L D nllaqigdqyadlflaaknlsdaillsdilr VNTEI T K A P LSA S MI 322
Cdd:TIGR01865 142 ----------------------------- GN D F D ------------------------------- TANKE T G A L LSA V MI 161
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 323 K RY D EH HQ DL TL LK A L VRQQL P E KYKEIF FD qskngyagyidggasqeefykfikpilekmdgteellvklnre DL LR K Q 402
Cdd:TIGR01865 162 N RY L EH EA DL RT LK E L ILKKF P K KYKEIF SE ------------------------------------------- TF LR N Q 198
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 403 R T F D NGSIP H Q IH L G EL H AI L R R Q EDF YPF lkdnreki E K I LTFRIPYY V GPLA R G N S R FA wmtrkseetitpwnfee V V 482
Cdd:TIGR01865 199 R G F Y NGSIP R Q LL L E EL E AI F R K Q REY YPF -------- I K L LTFRIPYY I GPLA E G K S E FA ----------------- F V 253
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 483 DK G ASA QS FIE R MT NFDKN LP N EK VL PKHSLL Y E Y FTV Y NEL TK V KYVTEGMRKPAF LS G E Q K KAIV DLLFK TNRKVTV K 562
Cdd:TIGR01865 254 DK P ASA EN FIE K MT GKCTY LP E EK RA PKHSLL A E K FTV L NEL NN V RIIILEQGETKI LS K E E K QELL DLLFK KKKLTYK K 333
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 563 QL K EDYFKKIEC F DSVEIS G VEDR --- FN A SL G TYH D L L K IIK DKD F LDN EE N EDI L ED IV LT LTL FE DREMI EE RL KT Y 639
Cdd:TIGR01865 334 LR K LLGLSEDAI F KGLRYE G LDNA eka FN I SL K TYH K L R K ALG DKD L LDN PK N PKD L DE IV KI LTL YK DREMI KK RL EL Y 413
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 640 AHLFDDKVM K Q L K R RRY TGWGRLS R K LIN GIR DKQSGKTIL D FLKSDGFA NRNFMQ L I H D DS L TF K ED I Q KA qvsgqgds 719
Cdd:TIGR01865 414 KDVLNEEQV K K L V R LHF TGWGRLS L K ALR GIR PLMEQGKRY D EAILELGG NRNFMQ N I N D SQ L LP K IN I T KA -------- 485
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 720 lhehi ANLAGS P AI K KGI LQ TV KVV D ELVK VM G rh K P EN IVIEMARE N Q T T QK G QK NS R ER M K RI E EG IKE LG S ---- Q I 795
Cdd:TIGR01865 486 ----- KDEILN P VV K RAL LQ AR KVV N ELVK KY G -- P P DK IVIEMARE E Q G T NF G KR NS K ER Y K KN E DK IKE FA S algk E I 558
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 796 LKE H P V EN TQLQNE KL Y LYY L QNG RD MY VDQ E L DI NR L --- S D Y DV D A I V PQS FLK DDSI D NKVL TRSDK N RG K S D NV P S 872
Cdd:TIGR01865 559 LKE E P T EN SSKNIL KL R LYY Q QNG KC MY TGK E I DI DD L fdl S Y Y EI D H I L PQS RSF DDSI S NKVL VLASE N QE K G D QT P Y 638
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 873 E - E V VKK MKNY W RQLLNAK LI TQ RK F D N LT K AERGGLS EL DKAGFI K R Q L VE TR Q IT KH VA QI L DS R M N TKYD endkl I R 951
Cdd:TIGR01865 639 E a E I VKK DSAF W NKFEAYV LI SK RK S D K LT R AERGGLS DD DKAGFI D R N L ND TR Y IT RV VA NY L KD R F N FHLK ----- K R 713
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 952 E VKV I TLK SK L V S DF RK DFQF YK V REINNYHHAHDAY L NAV VGT AL I KK YPK LE S EF V Y GD Y KVY D V RK MIA kseqeig K 1031
Cdd:TIGR01865 714 K VKV V TLK GQ L T S QL RK KWGL YK K REINNYHHAHDAY I NAV STN AL V KK FSQ LE P EF R Y KE Y HNF D G RK KKK ------- S 786
1050
....*....|....*....
gi 1377672160 1032 AT A K YFFY SN I M N FFK TEI 1050
Cdd:TIGR01865 787 AT D K KVKF SN P M E FFK QKV 805
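The alignment rows above can also be read programmatically. What follows is a minimal sketch, not part of any NCBI tool: it assumes that the spaces inside each residue string are display residue from the page's highlighting, that lowercase letters mark unaligned residues, and that dashes mark gaps. The helper names `parse_row` and `percent_identity` are hypothetical. The two sample rows are the final block of the cas_Csn1 alignment.

```python
def parse_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    tokens = line.split()
    # Query rows carry two label tokens ("gi", "<id>"); Cdd rows carry one.
    n_label = 2 if tokens[0] == "gi" else 1
    label = " ".join(tokens[:n_label])
    start = int(tokens[n_label])
    end = int(tokens[-1])
    # Assumption: spaces inside the residue string are display-only, so join them.
    residues = "".join(tokens[n_label + 1:-1])
    return label, start, residues, end


def percent_identity(query, subject):
    """Percent of aligned (uppercase, ungapped) columns with identical residues."""
    aligned = [(q, s) for q, s in zip(query, subject)
               if q.isupper() and s.isupper()]
    if not aligned:
        return 0.0
    same = sum(q == s for q, s in aligned)
    return 100.0 * same / len(aligned)


query_row = "gi 1377672160 1032 AT A K YFFY SN I M N FFK TEI 1050"
subject_row = "Cdd:TIGR01865 787 AT D K KVKF SN P M E FFK QKV 805"

_, q_start, q_seq, q_end = parse_row(query_row)
_, s_start, s_seq, s_end = parse_row(subject_row)
print(q_seq, s_seq, round(percent_identity(q_seq, s_seq), 1))
```

Identity is computed per block here; to score a whole hit, the residue strings of all blocks would be concatenated before calling `percent_identity`.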
Cas9_PI super family | cl24973 | PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at ... | 1102-1358 | 1.28e-46
PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at the C-terminus of the bacterial type II CRISPR system endonuclease Cas9. This domain adopts a novel protein fold that is unique to the Cas9 family. In the DNA-bound structure it is positioned to recognize the PAM (protospacer-adjacent motif) sequence on the DNA strand that is not complementary to the crRNA. See family CRISPR-DR2, Rfam:RF01315. Cas9 carries two nuclease domains, HNH and RuvC, which cleave the DNA strands that are complementary and non-complementary, respectively, to the 20-nucleotide guide sequence in crRNAs.
The actual alignment was detected with superfamily member pfam16595.
Pssm-ID: 435449
Cd Length: 264
Bit Score: 169.04
E-value: 1.28e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1102 T GG FSKES ILP -- K RNSDK LI AR KKD --- W D PK KYGG FD S P T V AY SV LV VAKVE KGK S K KL ksvke LL G ITIMERSSF E K 1176
Cdd:pfam16595 1 K GG LFNQT ILP ah K KKGKG LI PL KKD erg L D VE KYGG YS S L T A AY FS LV EYTGK KGK R K RT ----- IE G VPLYLAAKI E E 75
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1177 N PI -- DF LE A K GYKEVK K DLII K LP K Y SL FE l EN G RKRM L ASAG E --- L QKGNE L A L PSKYVNFLYLASHYE K LKGSPED 1251
Cdd:pfam16595 76 N KD ll EY LE E K LGLKEP K IILP K IK K N SL IK - ID G FRML L TGKT E nrl L KNAVQ L V L SNDDEKYIKKIEKFV K KNKDDII 154
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1252 N E QKQ L FV E QHKHYL DE IIEQISEFSK r VILADANLD K VLSAYN K HRDKPIR E QAENI I HLFT LT NLGA - P A AF K YFDTT 1330
Cdd:pfam16595 155 E E KDG L TE E KNIKLY DE LLDKMKNTIY - YKRPSNQGE K LEKLKE K FIKLSLE E KCKVL I EILK LT HANP t S A DL K LIGGS 233
250 260 270
....*....|....*....|....*....|.
gi 1377672160 1331 IDRK R YTSTKEVLD A --- T LI H QS I TGLYE T 1358
Cdd:pfam16595 234 KHAG R IKISNNISK A sni K LI N QS V TGLYE K 264
GFP super family | cl08319 | Green fluorescent protein; | 1421-1631 | 1.78e-43
Green fluorescent protein;
The actual alignment was detected with superfamily member pfam01353.
Pssm-ID: 426217
Cd Length: 211
Bit Score: 158.12
E-value: 1.78e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1421 M HMK L Y MEG T V DN H H F KCTSE G E G K P YE G TQTMRI K VVE G g P LPF AFDI LA TSFL Y gs KTFINHTQ G i PDF F KQSFPE G - 1499
Cdd:pfam01353 1 M THD L H MEG S V NG H E F DIVGG G N G N P ND G SLETKV K STK G - A LPF SPYL LA PHL* Y -- YQYLPFPD G - TSP F QAAVEN G g 76
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1500 FTWE R VTTY EDGGVLT ATQDTSLQD G CLIYNVKIR G VN F TSN GPVM Q K KTL GW EAFT E TL - YPA D GG L E G RNDMA LKL VG 1578
Cdd:pfam01353 77 YQVH R TFKF EDGGVLT IVFTYTYEG G HIKGEFTFQ G SG F PPD GPVM T K SLT GW DPSV E KM i PRN D KT L V G DINWS LKL TD 156
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1377672160 1579 G SHLI A NIK T T Y RSK KP - AKN LK M P GVYY V DYRL ER IKEANNETY VEQ HEVA V A 1631
Cdd:pfam01353 157 G KRYR A QVV T N Y TFA KP v PAG LK L P PPHF V FRKI ER TGSKTEINL VEQ QKAF V D 210
Dcm super family | cl43082 | DNA-cytosine methylase [Replication, recombination and repair]; | 1682-1960 | 1.63e-18
DNA-cytosine methylase [Replication, recombination and repair];
The actual alignment was detected with superfamily member COG0270.
Pssm-ID: 440040 [Multi-domain]
Cd Length: 277
Bit Score: 87.94
E-value: 1.63e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1682 R K PIR V LS LF D G I --- AT G L lvl KDL G IQ V dr YI A S E VCE D SITVGMVRHQGKIMYV GD V R SVTQKHIQ ew GPF DL V IGG 1758
Cdd:COG0270 1 S K KLT V ID LF A G A ggl SL G F --- EKA G FE V -- VF A V E IDP D ACETYRANFPEAKVIE GD I R DIDPEELI -- PDV DL L IGG 73
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1759 S PC NDL S IVNP a RKGL YEGT G R LFFEF Y R LLHDA RPK egddrpf FWLF ENV VAMGV SDK ---- RD I SRF LE S ----- NPV 1829
Cdd:COG0270 74 P PC QPF S VAGK - RKGL EDPR G T LFFEF I R IVEEL RPK ------- AFVL ENV PGLLS SDK gktf EE I LKE LE E lgyrv DYK 145
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1830 MID A KEVSAA - H R A R Y F ---- WGN L PGMNR P LASTVNDKLELQEC LE H ---- GRIAKF S K vr TIT TRSN sikqgkdq HFP 1900
Cdd:COG0270 146 VLN A ADYGVP q N R E R V F ivgf RKD L DLFEF P EPTHLKPYVTVGDA LE D lpda HEARYL S E -- TIT AGYG -------- GGG 215
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1377672160 1901 V F MNEK E DI l WC T -- E ME R VF GFP VHYTDVSNMSRLA RQ rl L G RSWSV P VIRHLFAPLKEYF 1960
Cdd:COG0270 216 R F LHPG E PR - RL T vr E AA R LQ GFP DDFKFPGSKTQAY RQ -- I G NAVPP P LAEAIAKAILKAL 274
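The four concise hits above can be tabulated to see how they tile the fusion protein. The sketch below is illustrative only: the `DomainHit` record and the `gaps_between` helper are hypothetical names, and the intervals and E-values are copied from the hit list above. The "uncovered" ranges it reports are simply residues not covered by a listed concise hit; the source says nothing about what those regions contain.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    name: str
    accession: str
    start: int  # first query residue of the hit (1-based, inclusive)
    end: int    # last query residue of the hit (inclusive)
    evalue: float


# Values copied from the concise hit list above.
HITS = [
    DomainHit("cas_Csn1", "TIGR01865", 4, 1050, 0.0),
    DomainHit("Cas9_PI super family", "cl24973", 1102, 1358, 1.28e-46),
    DomainHit("GFP super family", "cl08319", 1421, 1631, 1.78e-43),
    DomainHit("Dcm super family", "cl43082", 1682, 1960, 1.63e-18),
]


def gaps_between(hits):
    """Return (start, end) residue ranges left uncovered between consecutive hits."""
    ordered = sorted(hits, key=lambda h: h.start)
    gaps = []
    for left, right in zip(ordered, ordered[1:]):
        if right.start > left.end + 1:
            gaps.append((left.end + 1, right.start - 1))
    return gaps


for hit in HITS:
    print(f"{hit.name:22s} {hit.accession:10s} {hit.start:5d}-{hit.end:<5d} "
          f"length={hit.end - hit.start + 1}")
print("uncovered between hits:", gaps_between(HITS))
```

Sorting by start position keeps the helper usable for the longer standard hit list as well, although overlapping hits there would need merging first.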
Csn1 | cd09643 | CRISPR/Cas system-associated protein Cas9; CRISPR (Clustered Regularly Interspaced Short ... | 4-1049 | 0e+00
CRISPR/Cas system-associated protein Cas9; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA. Cas9 is a very large protein containing a McrA/HNH-nuclease-related domain and a RuvC-like nuclease domain; signature gene for type II
Pssm-ID: 187774 [Multi-domain]
Cd Length: 799
Bit Score: 880.99
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 4 K Y SI GL A IG TN SVGWA VIT D E YKVP S KK FK vlgntdr HSIK K NLI GA L LF DS GETA E - AT RL K R T ARRR YT RRK N R ICY L 82
Cdd:cd09643 1 E Y IL GL D IG IA SVGWA IVE D D YKVP A KK MI ------- DCGV K IFT GA E LF KT GETA A l DR RL A R G ARRR IR RRK H R LLR L 73
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 83 QE I F SN E MAKV D DS FF H RLE E SFL veedkkherhpifgnivdev A YH EK YPTIYHLRK KLVDSTD K A D L rl I YLAL A H M I 162
Cdd:cd09643 74 QE L F AR E GSLT D FD FF S RLE D SFL -------------------- E YH KN YPTIYHLRK AALENKL K P D E -- L YLAL L H I I 131
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 163 K F RGHFLIEGD LNPDN sdvdklfiqlvqtynqlfeenpinasgvdakailsarlsksrrlenliaqlpgekknglfgnli 242
Cdd:cd09643 132 K H RGHFLIEGD EDTTA ---------------------------------------------------------------- 147
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 243 alslgltpnfksnfdlaedaklqlskdtydddldnllaqigdqyadlflaaknlsdaillsdilrvn TEI T K A P LSASMI 322
Cdd:cd09643 148 ------------------------------------------------------------------- DKE T G A L LSASMI 160
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 323 KRYDEH HQ DL TL LK A L VRQQLPE KYKEIF F D qskngyagyidggasqeefykfikpilekmdgteellvklnr E DL LR K Q 402
Cdd:cd09643 161 KRYDEH KA DL RK LK E L IKKEFFK KYKEIF G D ------------------------------------------ E TF LR N Q 198
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 403 R T F D NGSIP H Q IH L G EL H AI L R R Q EDF YPF lkdnreki EKILTFRIPYY V GPLA R G N S R FAW M TR KSEE titpwnfeevv 482
Cdd:cd09643 199 R G F Y NGSIP R Q LL L E EL E AI F R K Q REY YPF -------- EKILTFRIPYY I GPLA E G K S E FAW L TR PALS ----------- 259
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 483 dkgasa QS FIE R MT NFDKN LP N EK VL PKHSLL Y E Y FTV Y NEL TKVKYVT E g MRKPAF LS G E Q K KAIV DLLFK T N RKVTVK 562
Cdd:cd09643 260 ------ EA FIE K MT GKCTY LP E EK RA PKHSLL A E K FTV L NEL NNLRIIE E - QGETKI LS K E E K QELL DLLFK K N KLTYKQ 332
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 563 QL K EDYF K KI E C F DSVEIS G -- V E DR FN A SL G TYHDL L K IIKDKDFL D N E E NE D IL ED IV LT LTL FE DREMIE ER L KT Y A 640
Cdd:cd09643 333 KR K LLGL K EE E I F KGLRYE G lk A E KN FN I SL K TYHDL R K ALGKEFLK D L E L NE K IL DE IV KI LTL YK DREMIE KI L EL Y K 412
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 641 H L FDDKVM K Q L KR R RY TGWGRLS R K LIN GIR DKQSGKTIL D FLKSDGFA N R N fm Q L I HD D S L T F KED I Q KAQV sgqgdsl 720
Cdd:cd09643 413 D L LNEEQL K K L LK R HF TGWGRLS L K ALR GIR PLMEQGKRY D EAILELGG N H N -- Q K I NS D E L K F LPI I K KAQV ------- 483
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 721 hehi ANLAGS P AI K KGI LQ TV KVV D ELVK VM G rh K P EN IVIEMAREN Q t T Q KG Q KN SRE R M K RI E EG IKE LG S --- Q I LK 797
Cdd:cd09643 484 ---- KDEILN P VV K RAL LQ AR KVV N ELVK KY G -- P P DK IVIEMAREN G - T N KG T KN RKK R Q K KN E DN IKE AA S ale Q K LK 556
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 798 E H P VENTQLQNE KL Y LYY L QNG RD MY VDQ E L DI NR L --- S D Y DV D A I V PQS FLK DDSI D NKVL TRSDK N RG K S D NV P S EE 874
Cdd:cd09643 557 E L P LDIKSKNIL KL R LYY Q QNG KC MY TGK E I DI DD L fdl S Y Y EI D H I L PQS RSF DDSI S NKVL VLASE N QE K G D QT P Y EE 636
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 875 V V K KM KNY W RQ L LN AKLI T QR --- K F D N L t KA E R g G L S ELD KAGFI K R Q L VE TR Q IT KH VA QI L DS R M N TKYD endkl I R 951
Cdd:cd09643 637 I V S KM SAF W NK L EA AKLI S QR gds K K D R L - LL E K - G I S DDE KAGFI D R N L ND TR Y IT RV VA NY L KD R F N FHLK ----- K R 709
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 952 E VKV I TLK SK L V S DF RK DFQF YK V REINNYHHAHDAY L NAVV GT AL I KK YPK LE sef V Y GD YK VY D VR K MIA K SEQ E I gk 1031
Cdd:cd09643 710 K VKV V TLK GQ L T S QL RK KWGL YK K REINNYHHAHDAY I NAVV TN AL V KK FSQ LE --- R Y KE YK RF D SE K GNK K TLD E N -- 784
1050
....*....|....*...
gi 1377672160 1032 ata K Y FF YS N I MNFFK T E 1049
Cdd:cd09643 785 --- K K FF FA N P MNFFK Q E 799
Cas9_REC | pfam16592 | REC lobe of CRISPR-associated endonuclease Cas9; The REC lobe of Cas9 - the CRISPR-associated ... | 181-710 | 0e+00
REC lobe of CRISPR-associated endonuclease Cas9; The REC lobe of the CRISPR-associated endonuclease Cas9 includes the REC1 and REC2 domains. REC1 forms an elongated, alpha-helical structure consisting of 25 alpha helices and two beta-sheets, whereas REC2, inserted within REC1, adopts a six-helix bundle structure. The REC lobe and the NUC lobe of Cas9 fold to present a positively charged groove at their interface, which accommodates the negatively charged sgRNA:target DNA heteroduplex. The CRISPR (clustered regularly interspaced short palindromic repeat)-Cas system occurs naturally in bacteria as a defence against invasion by phages or other mobile genetic elements. Cas9 is targeted to specific genomic locations by single guide RNAs (sgRNAs), where it complexes with invading DNA in order to cleave it and render it inactive.
Pssm-ID: 435447
Cd Length: 539
Bit Score: 576.32
E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 181 V DKL F IQ L VQTYNQLF E ENPINASG V DAKA IL SA - RL SK SRR L EN L I A QL P G EK - KNGL F GNLIA L S LG LTPN F KSN F D L 258
Cdd:pfam16592 1 V EES F QD L LNILYEQL E NLELETQN V EIEK IL KK t KI SK KAK L DE L L A LP P N EK n SKKI F AEILK L I LG NKAD F TKI F E L 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 259 ------ AEDA KL QL S KDT YD DDLDN L LA Q I GD QY A DLF L AA K NLS D AIL LSDIL R V N T EIT KA P LS AS M IK RYD E H HQ DL 332
Cdd:pfam16592 81 ekfvee PKKI KL SF S DSN YD EKIEE L EN Q L GD EK A EII L IL K KIY D WVV LSDIL T V S T DNG KA Y LS EA M VN RYD K H KE DL 160
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 333 TL LK ALVR Q Q L P EKY KEI F FDQS K N GY AG YI D ---- G GA S Q E E FYK F IK PILE K MDGT E E -- L L V K LNR E DL L R KQRT FD 406
Cdd:pfam16592 161 AQ LK KVIK Q N L S EKY NDM F RKEK K K GY SA YI N gknn G KT S K E D FYK Y IK KLIN K VETS E A qy I L S K IDN E NF L P KQRT KS 240
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 407 NGSIP H Q I HL G EL HA I LRR Q EDF YPFLK D N R EKI E K I LTFRIPYYVGPLA RGN S R FAWM T RK SEET I T PWNFE EV VD KGA 486
Cdd:pfam16592 241 NGSIP Y Q V HL Q EL KK I IKN Q AEY YPFLK E N Q EKI L K L LTFRIPYYVGPLA EKK S K FAWM K RK EQGK I Y PWNFE QK VD IDK 320
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 487 S A QS FI E RMTN FDKN LP N EKVLPK H SLLY EY FTV Y NEL T K V K YVT E gmrkpa FL S G E Q K KA I VDL LFK T N R KVT V K Q LK E 566
Cdd:pfam16592 321 T A EA FI T RMTN YCTY LP D EKVLPK N SLLY SK FTV L NEL N K I K ING E ------ KI S V E L K QD I FNG LFK K N K KVT K K K LK D 394
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 567 DYF K KIEC F DS VEI S G V -- E DR FN A SL G TY H DL L KI I kd K DFLDN EE NEDI L EDI VLT LTLFEDR EMIEE RL - K T Y AH L F 643
Cdd:pfam16592 395 WLV K EGYN F KA VEI K G F dk E NN FN N SL T TY I DL A KI F -- G DFLDN PD NEDI I EDI IYW LTLFEDR KILKR RL q K K Y SN L L 472
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 644 DD K VM KQ LKRRR Y T GWGRLS RK L I NGIR DKQS --- G KTI L D F L KS D gfa NRN F MQLI H D DS L T FKE D I Q K 710
Cdd:pfam16592 473 TE K QI KQ ILKLK Y K GWGRLS KE L L NGIR GADR qge I KTI I D L L WN D --- NRN L MQLI N D ER L S FKE E I E K 539
Cas9 | COG3513 | CRISPR-Cas system type-II protein Cas9 [Defense mechanisms]; CRISPR-Cas system type-II protein ... | 3-1130 | 9.57e-125
CRISPR-Cas system type-II protein Cas9 [Defense mechanisms]; CRISPR-Cas system type-II protein Cas9 is part of the Pathway/BioSystem: CRISPR-Cas system
Pssm-ID: 442735 [Multi-domain]
Cd Length: 812
Bit Score: 413.97
E-value: 9.57e-125
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 3 K KY SI GL AI G T NSVGWAV ITDEYKV pskkfkvlgntdr HSIKKNLI G ALL FD S GE T ------- A E A T R LK R T ARRR YT RR 75
Cdd:COG3513 2 D KY IL GL DL G I NSVGWAV LELDEDG ------------- EPGEIIDA G VRI FD D GE D pksgesl A A A R R EA R G ARRR RR RR 68
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 76 K N R ICY L QEIFSN E M akvddsffhrlees F L VEE D KKHERHP ifgnivdevayhek YPTI Y H LR K K LV D st D K ADLRLIY 155
Cdd:COG3513 69 K H R LRR L KRLLVE E G -------------- L L PAD D AERKALL -------------- PLNP Y E LR A K AL D -- E K LSPEELG 118
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 156 L AL A H MIKF RG H fliegd LNPDNS D VDKL fiqlvqtynqlfeenpinasgv DAKAILS A RLSKSR RLE NLI A QLP GE kkn 235
Cdd:COG3513 119 R AL F H LAQR RG F ------ KSNRKT D SKDN ---------------------- ESGKVKD A IKELRE RLE AKG A RTV GE --- 167
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 236 glfgnlialslgltpnfksnfdlaedaklqlskdtydddldnllaqigdqyadl F L A aknlsdaillsdilrvnteitka 315
Cdd:COG3513 168 ------------------------------------------------------ Y L Y ----------------------- 170
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 316 plsasmi K R YD E HHQ dltllkalvrqqlpekykeiffdqskngyagyidggasqeefykfikpilekmdgteellvklnr 395
Cdd:COG3513 171 ------- R R LQ E NGK ----------------------------------------------------------------- 178
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 396 edl L R KQRTFDNGS IP HQIHLG E LH AI LRR Q ED F Y P F L KDN -- R EKIEK I LT F RI P YYV G plargnsrfawmtrkseeti 473
Cdd:COG3513 179 --- V R NRKGDYDFY IP REDLED E FE AI WAA Q AE F G P A L LTE el R DELLE I IF F QR P LKS G -------------------- 235
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 474 tpwnfeevvdkgasaqsfi ERMTNFDKNL P N EK VL PK H S L L YEY F TVYNE L TKVKY V TE G m RKPAF L SG E QKKA I V DLL F 553
Cdd:COG3513 236 ------------------- KKLVGKCTFE P D EK RA PK A S P L FQR F RILQK L NNLRI V DD G - GEERP L TL E ERQK I I DLL E 295
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 554 K t NR K V T V K Q L KEDY fk KIE cf D S V EIS G VEDRFN ----- ASLG TY HD L L KI IKDKDF ld NE ENED IL E DIV LT LTLF E D 628
Cdd:COG3513 296 N - KK K L T F K K L RKLL -- GLP -- D G V IFK G FNYEDD drakl KGDK TY AK L A KI FGKAWL -- NE FDPE IL D DIV EA LTLF K D 368
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 629 R E MIE E R LK TYAH L f D DKVMKQ L KRRR - YT G W G R LS R K LING I rdkqsgkti L DF L KSD gfanrnfmqlihdds L TFK E D 707
Cdd:COG3513 369 D E ELK E W LK KLYG L - D EEQAEA L ANLP l PD G Y G N LS L K ALRK I --------- L PL L EEG --------------- L DYD E A 423
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 708 IQK A QVSGQGDSLH -------- E HIANLAGS P AIKKGIL Q TV KVV DE L VKVM G rh KP EN I V IE M AR ENQTTQ K GQ K NSRE 779
Cdd:COG3513 424 VKA A GYDHSSLEIL drlppige E KRKGSIRN P VVHRALN Q LR KVV NA L IRKY G -- KP DE I H IE L AR DLKKSK K ER K EIQK 501
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 780 R MKRI E EGIKELGSQ I LK E HPV E NTQLQNE K LY L YYL QNGR DM Y VDQELD I NR L S D -- YDV D A I V P Q S FLK DDS ID NKVL 857
Cdd:COG3513 502 R QREN E KAREKAREE I AE E GGG E PSRRDIL K YR L WEE QNGR CP Y TGKPIS I SD L L D gs VEI D H I L P R S RTL DDS FN NKVL 581
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 858 TRS D K NR G K SDNV P S E EVVK ---- K MKNYWRQLL N A KLI T Q R K FDNLT K A E RG gls EL D KA GFI K RQL VE TR Q I TKHV A Q 933
Cdd:COG3513 582 CLA D A NR E K GNRT P Y E ALGG deae K WEEILARVE N L KLI P Q K K KKRFL K K E LD --- RD D DE GFI A RQL ND TR Y I SRLA A E 658
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 934 I L D S RMNTKY dendkl IREV KV ITLKSK L VSDF R KDFQFY K V ------- REINNY HHA H DA YLN A VVGTA L IKKYP K LES 1006
Cdd:COG3513 659 Y L K S LYPFED ------ KGKR KV RVVPGQ L TAML R RAWGLN K I lsddgek NRDDHR HHA I DA LVI A CTTQG L LQRLA K ASR 732
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1007 E FVY gdykvydvrkmiakseqeig KAT A KYF F YSNIMN F FKT eitlangeirkrplietngetgeivwdkgr DFAT V RKV 1086
Cdd:COG3513 733 E RED -------------------- AEK A EEH F PPPWDG F RQD ------------------------------ VAEA V DEI 762
1130 1140 1150 1160
....*....|....*....|....*....|....*....|....*
gi 1377672160 1087 L smpqvn IVKKTEVQ - TG GFS KE S I LPKRN s D K LIA RK K d WDPK K 1130
Cdd:COG3513 763 F ------ VSHAPRRK v TG QLH KE T I YSTGE - G K VVL RK P - LTSL K 799
Cas9_PI | pfam16595 | PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at ... | 1102-1358 | 1.28e-46
PAM-interacting domain of CRISPR-associated endonuclease Cas9; Cas9_PI is a family found at the C-terminus of the bacterial type II CRISPR system endonuclease Cas9. This domain adopts a novel protein fold that is unique to the Cas9 family. In the DNA-bound structure it is positioned to recognize the PAM (protospacer-adjacent motif) sequence on the DNA strand that is not complementary to the crRNA. See family CRISPR-DR2, Rfam:RF01315. Cas9 carries two nuclease domains, HNH and RuvC, which cleave the DNA strands that are complementary and non-complementary, respectively, to the 20-nucleotide guide sequence in crRNAs.
Pssm-ID: 435449
Cd Length: 264
Bit Score: 169.04
E-value: 1.28e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1102 T GG FSKES ILP -- K RNSDK LI AR KKD --- W D PK KYGG FD S P T V AY SV LV VAKVE KGK S K KL ksvke LL G ITIMERSSF E K 1176
Cdd:pfam16595 1 K GG LFNQT ILP ah K KKGKG LI PL KKD erg L D VE KYGG YS S L T A AY FS LV EYTGK KGK R K RT ----- IE G VPLYLAAKI E E 75
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1177 N PI -- DF LE A K GYKEVK K DLII K LP K Y SL FE l EN G RKRM L ASAG E --- L QKGNE L A L PSKYVNFLYLASHYE K LKGSPED 1251
Cdd:pfam16595 76 N KD ll EY LE E K LGLKEP K IILP K IK K N SL IK - ID G FRML L TGKT E nrl L KNAVQ L V L SNDDEKYIKKIEKFV K KNKDDII 154
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1252 N E QKQ L FV E QHKHYL DE IIEQISEFSK r VILADANLD K VLSAYN K HRDKPIR E QAENI I HLFT LT NLGA - P A AF K YFDTT 1330
Cdd:pfam16595 155 E E KDG L TE E KNIKLY DE LLDKMKNTIY - YKRPSNQGE K LEKLKE K FIKLSLE E KCKVL I EILK LT HANP t S A DL K LIGGS 233
250 260 270
....*....|....*....|....*....|.
gi 1377672160 1331 IDRK R YTSTKEVLD A --- T LI H QS I TGLYE T 1358
Cdd:pfam16595 234 KHAG R IKISNNISK A sni K LI N QS V TGLYE K 264
GFP
pfam01353
Green fluorescent protein;
1421-1631
1.78e-43
Green fluorescent protein;
Pssm-ID: 426217
Cd Length: 211
Bit Score: 158.12
E-value: 1.78e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1421 M HMK L Y MEG T V DN H H F KCTSE G E G K P YE G TQTMRI K VVE G g P LPF AFDI LA TSFL Y gs KTFINHTQ G i PDF F KQSFPE G - 1499
Cdd:pfam01353 1 M THD L H MEG S V NG H E F DIVGG G N G N P ND G SLETKV K STK G - A LPF SPYL LA PHL* Y -- YQYLPFPD G - TSP F QAAVEN G g 76
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1500 FTWE R VTTY EDGGVLT ATQDTSLQD G CLIYNVKIR G VN F TSN GPVM Q K KTL GW EAFT E TL - YPA D GG L E G RNDMA LKL VG 1578
Cdd:pfam01353 77 YQVH R TFKF EDGGVLT IVFTYTYEG G HIKGEFTFQ G SG F PPD GPVM T K SLT GW DPSV E KM i PRN D KT L V G DINWS LKL TD 156
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1377672160 1579 G SHLI A NIK T T Y RSK KP - AKN LK M P GVYY V DYRL ER IKEANNETY VEQ HEVA V A 1631
Cdd:pfam01353 157 G KRYR A QVV T N Y TFA KP v PAG LK L P PPHF V FRKI ER TGSKTEINL VEQ QKAF V D 210
Dcm
COG0270
DNA-cytosine methylase [Replication, recombination and repair];
1682-1960
1.63e-18
DNA-cytosine methylase [Replication, recombination and repair];
Pssm-ID: 440040 [Multi-domain]
Cd Length: 277
Bit Score: 87.94
E-value: 1.63e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1682 R K PIR V LS LF D G I --- AT G L lvl KDL G IQ V dr YI A S E VCE D SITVGMVRHQGKIMYV GD V R SVTQKHIQ ew GPF DL V IGG 1758
Cdd:COG0270 1 S K KLT V ID LF A G A ggl SL G F --- EKA G FE V -- VF A V E IDP D ACETYRANFPEAKVIE GD I R DIDPEELI -- PDV DL L IGG 73
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1759 S PC NDL S IVNP a RKGL YEGT G R LFFEF Y R LLHDA RPK egddrpf FWLF ENV VAMGV SDK ---- RD I SRF LE S ----- NPV 1829
Cdd:COG0270 74 P PC QPF S VAGK - RKGL EDPR G T LFFEF I R IVEEL RPK ------- AFVL ENV PGLLS SDK gktf EE I LKE LE E lgyrv DYK 145
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1830 MID A KEVSAA - H R A R Y F ---- WGN L PGMNR P LASTVNDKLELQEC LE H ---- GRIAKF S K vr TIT TRSN sikqgkdq HFP 1900
Cdd:COG0270 146 VLN A ADYGVP q N R E R V F ivgf RKD L DLFEF P EPTHLKPYVTVGDA LE D lpda HEARYL S E -- TIT AGYG -------- GGG 215
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1377672160 1901 V F MNEK E DI l WC T -- E ME R VF GFP VHYTDVSNMSRLA RQ rl L G RSWSV P VIRHLFAPLKEYF 1960
Cdd:COG0270 216 R F LHPG E PR - RL T vr E AA R LQ GFP DDFKFPGSKTQAY RQ -- I G NAVPP P LAEAIAKAILKAL 274
Cyt_C5_DNA_methylase
cd00315
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many ...
1685-1958
1.57e-11
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
Pssm-ID: 238192 [Multi-domain]
Cd Length: 275
Bit Score: 66.87
E-value: 1.57e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1685 I RV LS LF D GI ATGL L V L KDL G IQV dr YI A S E VCEDSITVGMVRH q GKIMYV GD VRSVTQ K HIQE wg PF DL VI GG S PC NDL 1764
Cdd:cd00315 1 L RV ID LF A GI GGFR L G L EKA G FEI -- VA A N E IDKSAAETYEANF - PNKLIE GD ITKIDE K DFIP -- DI DL LT GG F PC QPF 75
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1765 SI VN p A RKG LYEGT G R LFFE FY R L L HDAR PK egddrpf FW L F ENV VAMGVS D KR ---- D I SRF LE SN ----- PVMID A KE 1835
Cdd:cd00315 76 SI AG - K RKG FEDTR G T LFFE II R I L KEKK PK ------- YF L L ENV KGLLTH D NG ntlk V I LNT LE EL gynvy WKLLN A SD 147
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1836 VSAAH - R A R Y F W - G NLPGMNRPLA S TV ---- ND K LE L QEC L ehg RI AKFSKV - R T I T TRSNSIKQGKDQHF P VFMNEKED 1908
Cdd:cd00315 148 YGVPQ n R E R V F I i G IRKDLILNFF S PF pkps EK K KT L KDI L --- RI RDPDEP s P T L T ASYGKGTGSVHPTA P DMIGKESN 224
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1377672160 1909 I LWC T -- E ME R VF GFP VHY t DVSNM S RLARQ R LL G R S WS VPV IRHLFAPL KE 1958
Cdd:cd00315 225 I RRL T pr E CA R LQ GFP DDF - EFPGK S VTQAY R QI G N S VP VPV AEAIAKAI KE 275
dcm
TIGR00675
DNA-methyltransferase (dcm); All proteins in this family for which functions are known are ...
1736-1950
1.36e-07
DNA-methyltransferase (dcm); All proteins in this family for which functions are known are DNA-cytosine methyltransferases. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273211 [Multi-domain]
Cd Length: 315
Bit Score: 55.41
E-value: 1.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1736 GD VRSVTQKH I QE wgp FD LVI GG S PC NDL SI v NPA RKG LYEGT G R LFFE FY R L L HDAR PK egddrpf F W L F ENV VAMGVS 1815
Cdd:TIGR00675 47 GD ITKISPSD I PD --- FD ILL GG F PC QPF SI - AGK RKG FEDTR G T LFFE IV R I L KEKK PK ------- F F L L ENV KGLVSH 115
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1816 DK - R DISRFL E S -------- NPVMID AK EVSAA - H R A R ------- Y F WGN L P g MNR P LAST V ND K LELQEC L EHG ----- 1873
Cdd:TIGR00675 116 DK g R TFKVII E T leelgykv YYKVLN AK DFGVP q N R E R iyivgfr D F DDK L N - FEF P KPIY V AK K KRIGDL L DLS vdlee 194
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1874 ---------------------------------- R IA K F S KV RT ITT R SNSIKQ G KDQ ------ HFP V FMNEKEDI L WCT 1913
Cdd:TIGR00675 195 kyylseekknglllllenmrkkegtgeqigsfyn R ES K S S II RT LSA R GYTFVK G GKS vlivph KST V VHPGRIRR L TPR 274
250 260 270
....*....|....*....|....*....|....*..
gi 1377672160 1914 E ME R VF GFP VHYTD vs NM S RLARQRLL G RSWS VPVI R 1950
Cdd:TIGR00675 275 E CA R LQ GFP DDFKF -- PV S DSQLYKQA G NAVV VPVI E 309
DNA_methylase
pfam00145
C-5 cytosine-specific DNA methylase;
1685-1809
1.11e-06
C-5 cytosine-specific DNA methylase;
Pssm-ID: 395093 [Multi-domain]
Cd Length: 324
Bit Score: 52.70
E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1685 IRVLS LF D GI ATGL L V L KDL G IQV dr YI A S E VCEDSITVGMVRHQGKI my V GD VRSVTQ K H I QE wgp F D LVI GG S PC N D L 1764
Cdd:pfam00145 1 FKFID LF A GI GGFR L G L EQA G FEC -- VA A N E IDKSAAKTYEANFPKVP -- I GD ITLIDI K D I PD --- I D ILT GG F PC Q D F 73
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1377672160 1765 SI VN p AR KG LYEGT G R LFFE FY R LLHDAR PK egddrpf FW L F ENV 1809
Cdd:pfam00145 74 SI AG - KQ KG FEDTR G T LFFE II R IIKEKK PK ------- AF L L ENV 110
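The intervals reported for the hits above can be compared with a few lines of Python. This is a minimal sketch and not part of the CD-Search output: the `Hit` type and the `span` and `overlap` helpers are hypothetical names introduced for illustration, and the values are copied from the hit records listed above, assuming the intervals are 1-based and inclusive query coordinates.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One domain hit as listed in the results (hypothetical container)."""
    name: str
    accession: str
    interval: str   # "start-end" on the query, assumed 1-based inclusive
    evalue: float


def bounds(interval: str) -> tuple[int, int]:
    """Split an interval string such as '1102-1358' into (start, end)."""
    start, end = (int(part) for part in interval.split("-"))
    return start, end


def span(interval: str) -> int:
    """Number of query residues covered by an inclusive interval."""
    start, end = bounds(interval)
    return end - start + 1


def overlap(a: str, b: str) -> int:
    """Number of query residues shared by two inclusive intervals."""
    a_start, a_end = bounds(a)
    b_start, b_end = bounds(b)
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


# Values copied from the hit records above.
HITS = [
    Hit("Cas9_PI", "pfam16595", "1102-1358", 1.28e-46),
    Hit("GFP", "pfam01353", "1421-1631", 1.78e-43),
    Hit("Dcm", "COG0270", "1682-1960", 1.63e-18),
    Hit("Cyt_C5_DNA_methylase", "cd00315", "1685-1958", 1.57e-11),
    Hit("dcm", "TIGR00675", "1736-1950", 1.36e-07),
    Hit("DNA_methylase", "pfam00145", "1685-1809", 1.11e-06),
]

if __name__ == "__main__":
    for hit in HITS:
        print(f"{hit.name:22s} {hit.accession:10s} {span(hit.interval):4d} residues")
```

Under these assumptions, the four methylase models (Dcm, Cyt_C5_DNA_methylase, dcm, DNA_methylase) all fall inside the 1682-1960 region of the query and overlap one another, whereas the Cas9_PI and GFP intervals do not overlap.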
GFP
pfam01353
Green fluorescent protein;
1421-1631
1.78e-43
Green fluorescent protein;
Pssm-ID: 426217
Cd Length: 211
Bit Score: 158.12
E-value: 1.78e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1421 M HMK L Y MEG T V DN H H F KCTSE G E G K P YE G TQTMRI K VVE G g P LPF AFDI LA TSFL Y gs KTFINHTQ G i PDF F KQSFPE G - 1499
Cdd:pfam01353 1 M THD L H MEG S V NG H E F DIVGG G N G N P ND G SLETKV K STK G - A LPF SPYL LA PHL* Y -- YQYLPFPD G - TSP F QAAVEN G g 76
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1500 FTWE R VTTY EDGGVLT ATQDTSLQD G CLIYNVKIR G VN F TSN GPVM Q K KTL GW EAFT E TL - YPA D GG L E G RNDMA LKL VG 1578
Cdd:pfam01353 77 YQVH R TFKF EDGGVLT IVFTYTYEG G HIKGEFTFQ G SG F PPD GPVM T K SLT GW DPSV E KM i PRN D KT L V G DINWS LKL TD 156
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1377672160 1579 G SHLI A NIK T T Y RSK KP - AKN LK M P GVYY V DYRL ER IKEANNETY VEQ HEVA V A 1631
Cdd:pfam01353 157 G KRYR A QVV T N Y TFA KP v PAG LK L P PPHF V FRKI ER TGSKTEINL VEQ QKAF V D 210
Dcm
COG0270
DNA-cytosine methylase [Replication, recombination and repair];
1682-1960
1.63e-18
DNA-cytosine methylase [Replication, recombination and repair];
Pssm-ID: 440040 [Multi-domain]
Cd Length: 277
Bit Score: 87.94
E-value: 1.63e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1682 R K PIR V LS LF D G I --- AT G L lvl KDL G IQ V dr YI A S E VCE D SITVGMVRHQGKIMYV GD V R SVTQKHIQ ew GPF DL V IGG 1758
Cdd:COG0270 1 S K KLT V ID LF A G A ggl SL G F --- EKA G FE V -- VF A V E IDP D ACETYRANFPEAKVIE GD I R DIDPEELI -- PDV DL L IGG 73
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1759 S PC NDL S IVNP a RKGL YEGT G R LFFEF Y R LLHDA RPK egddrpf FWLF ENV VAMGV SDK ---- RD I SRF LE S ----- NPV 1829
Cdd:COG0270 74 P PC QPF S VAGK - RKGL EDPR G T LFFEF I R IVEEL RPK ------- AFVL ENV PGLLS SDK gktf EE I LKE LE E lgyrv DYK 145
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1830 MID A KEVSAA - H R A R Y F ---- WGN L PGMNR P LASTVNDKLELQEC LE H ---- GRIAKF S K vr TIT TRSN sikqgkdq HFP 1900
Cdd:COG0270 146 VLN A ADYGVP q N R E R V F ivgf RKD L DLFEF P EPTHLKPYVTVGDA LE D lpda HEARYL S E -- TIT AGYG -------- GGG 215
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1377672160 1901 V F MNEK E DI l WC T -- E ME R VF GFP VHYTDVSNMSRLA RQ rl L G RSWSV P VIRHLFAPLKEYF 1960
Cdd:COG0270 216 R F LHPG E PR - RL T vr E AA R LQ GFP DDFKFPGSKTQAY RQ -- I G NAVPP P LAEAIAKAILKAL 274
HNH_4
pfam13395
HNH endonuclease; This HNH nuclease domain is found in CRISPR-related proteins.
821-871
3.50e-12
HNH endonuclease; This HNH nuclease domain is found in CRISPR-related proteins.
Pssm-ID: 433172 [Multi-domain]
Cd Length: 55
Bit Score: 63.03
E-value: 3.50e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1377672160 821 DM Y VDQELD I NR L SD --- YD V D A I V P Q S FLK DDS ID NKVL TRSDK N RG K SDNV P 871
Cdd:pfam13395 1 CP Y TGEQIS I DD L FS ekn YD I D H I L P Y S RSF DDS FS NKVL VLRSA N QE K GNRT P 54
Cyt_C5_DNA_methylase
cd00315
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many ...
1685-1958
1.57e-11
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
Pssm-ID: 238192 [Multi-domain]
Cd Length: 275
Bit Score: 66.87
E-value: 1.57e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1685 I RV LS LF D GI ATGL L V L KDL G IQV dr YI A S E VCEDSITVGMVRH q GKIMYV GD VRSVTQ K HIQE wg PF DL VI GG S PC NDL 1764
Cdd:cd00315 1 L RV ID LF A GI GGFR L G L EKA G FEI -- VA A N E IDKSAAETYEANF - PNKLIE GD ITKIDE K DFIP -- DI DL LT GG F PC QPF 75
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1765 SI VN p A RKG LYEGT G R LFFE FY R L L HDAR PK egddrpf FW L F ENV VAMGVS D KR ---- D I SRF LE SN ----- PVMID A KE 1835
Cdd:cd00315 76 SI AG - K RKG FEDTR G T LFFE II R I L KEKK PK ------- YF L L ENV KGLLTH D NG ntlk V I LNT LE EL gynvy WKLLN A SD 147
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1836 VSAAH - R A R Y F W - G NLPGMNRPLA S TV ---- ND K LE L QEC L ehg RI AKFSKV - R T I T TRSNSIKQGKDQHF P VFMNEKED 1908
Cdd:cd00315 148 YGVPQ n R E R V F I i G IRKDLILNFF S PF pkps EK K KT L KDI L --- RI RDPDEP s P T L T ASYGKGTGSVHPTA P DMIGKESN 224
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1377672160 1909 I LWC T -- E ME R VF GFP VHY t DVSNM S RLARQ R LL G R S WS VPV IRHLFAPL KE 1958
Cdd:cd00315 225 I RRL T pr E CA R LQ GFP DDF - EFPGK S VTQAY R QI G N S VP VPV AEAIAKAI KE 275
dcm
TIGR00675
DNA-methyltransferase (dcm); All proteins in this family for which functions are known are ...
1736-1950
1.36e-07
DNA-methyltransferase (dcm); All proteins in this family for which functions are known are DNA-cytosine methyltransferases. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273211 [Multi-domain]
Cd Length: 315
Bit Score: 55.41
E-value: 1.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1736 GD VRSVTQKH I QE wgp FD LVI GG S PC NDL SI v NPA RKG LYEGT G R LFFE FY R L L HDAR PK egddrpf F W L F ENV VAMGVS 1815
Cdd:TIGR00675 47 GD ITKISPSD I PD --- FD ILL GG F PC QPF SI - AGK RKG FEDTR G T LFFE IV R I L KEKK PK ------- F F L L ENV KGLVSH 115
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1816 DK - R DISRFL E S -------- NPVMID AK EVSAA - H R A R ------- Y F WGN L P g MNR P LAST V ND K LELQEC L EHG ----- 1873
Cdd:TIGR00675 116 DK g R TFKVII E T leelgykv YYKVLN AK DFGVP q N R E R iyivgfr D F DDK L N - FEF P KPIY V AK K KRIGDL L DLS vdlee 194
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1874 ---------------------------------- R IA K F S KV RT ITT R SNSIKQ G KDQ ------ HFP V FMNEKEDI L WCT 1913
Cdd:TIGR00675 195 kyylseekknglllllenmrkkegtgeqigsfyn R ES K S S II RT LSA R GYTFVK G GKS vlivph KST V VHPGRIRR L TPR 274
250 260 270
....*....|....*....|....*....|....*..
gi 1377672160 1914 E ME R VF GFP VHYTD vs NM S RLARQRLL G RSWS VPVI R 1950
Cdd:TIGR00675 275 E CA R LQ GFP DDFKF -- PV S DSQLYKQA G NAVV VPVI E 309
DNA_methylase
pfam00145
C-5 cytosine-specific DNA methylase;
1685-1809
1.11e-06
C-5 cytosine-specific DNA methylase;
Pssm-ID: 395093 [Multi-domain]
Cd Length: 324
Bit Score: 52.70
E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1377672160 1685 IRVLS LF D GI ATGL L V L KDL G IQV dr YI A S E VCEDSITVGMVRHQGKI my V GD VRSVTQ K H I QE wgp F D LVI GG S PC N D L 1764
Cdd:pfam00145 1 FKFID LF A GI GGFR L G L EQA G FEC -- VA A N E IDKSAAKTYEANFPKVP -- I GD ITLIDI K D I PD --- I D ILT GG F PC Q D F 73
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1377672160 1765 SI VN p AR KG LYEGT G R LFFE FY R LLHDAR PK egddrpf FW L F ENV 1809
Cdd:pfam00145 74 SI AG - KQ KG FEDTR G T LFFE II R IIKEKK PK ------- AF L L ENV 110
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
Database: CDSEARCH/cdd
Low complexity filter: no
Composition Based Adjustment: yes
E-value threshold: 0.01
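As a rough illustration of how the E-value threshold and the hit intervals above might be used together, the following Python sketch checks each listed hit against the 0.01 cutoff and merges hits whose intervals overlap on the query. The names `HITS`, `passing_hits`, and `group_overlapping` are hypothetical helpers, not part of CD-Search; the tuples are copied by hand from the hit records above, and the `<=` comparison is an assumption about how the cutoff is applied.

```python
# Domain hits copied by hand from the records above:
# (short name, accession, query start, query end, E-value).
HITS = [
    ("Cas9_PI", "pfam16595", 1102, 1358, 1.28e-46),
    ("GFP", "pfam01353", 1421, 1631, 1.78e-43),
    ("Dcm", "COG0270", 1682, 1960, 1.63e-18),
    ("HNH_4", "pfam13395", 821, 871, 3.50e-12),
    ("Cyt_C5_DNA_methylase", "cd00315", 1685, 1958, 1.57e-11),
    ("dcm", "TIGR00675", 1736, 1950, 1.36e-07),
    ("DNA_methylase", "pfam00145", 1685, 1809, 1.11e-06),
]

# Threshold taken from the "Preset Options" line of the report.
E_VALUE_THRESHOLD = 0.01


def passing_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Return hits whose E-value does not exceed the threshold (assumed <=)."""
    return [hit for hit in hits if hit[4] <= threshold]


def group_overlapping(hits):
    """Merge hits whose query intervals overlap into contiguous regions."""
    groups = []
    for _, accession, start, end, _ in sorted(hits, key=lambda h: h[2]):
        if groups and start <= groups[-1]["end"]:
            # Overlaps the current region: extend it and record the model.
            groups[-1]["end"] = max(groups[-1]["end"], end)
            groups[-1]["accessions"].append(accession)
        else:
            groups.append({"start": start, "end": end, "accessions": [accession]})
    return groups


if __name__ == "__main__":
    for region in group_overlapping(passing_hits(HITS)):
        print(region["start"], region["end"], ", ".join(region["accessions"]))
```

Grouping by overlap makes it visible that the four methyltransferase models (COG0270, cd00315, TIGR00675, pfam00145) all describe the same C-terminal region of the query rather than four separate domains.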