NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2707155935|ref|WP_338140998|]
View 

CRISPR-associated endonuclease Cas1 [Candidatus Entotheonella palauensis]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Cas1 COG1518
CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated ...
195-525 5.25e-107

CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated integrase Cas1 is part of the Pathway/BioSystem: CRISPR-Cas system


:

Pssm-ID: 441127  Cd Length: 329  Bit Score: 322.53  E-value: 5.25e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPH-K 273
Cdd:COG1518     2 LYITEPGAYLSRKGNTLVVEKEDEEKKRVPLEDIEQIVLFGEVSLSTALLRFLAENGIPVHFLDYYGRYLGRLLPRESgG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAA---ARRRAERAKSLEQLLGIEGYAAR 350
Cdd:COG1518    82 NVLLRRAQYQAYLDEEKRLALARSIVRGKIRNQRAVLRRYGRRRKEDLEEAIERleeLLKRLEEADSIDELRGIEGNAAR 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNalllflyalltkeMLI-------------TLVGVGFDPYLGFYH 417
Cdd:COG1518   162 IYFSALDGLLPED-------FRFEGRSRRPPRDPVN-------------ALLsfgytllysdvlsAIYAAGLDPYIGFLH 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 418 QPKYGRPALALDLMEEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISY 497
Cdd:COG1518   222 EPRPGRPSLALDLMEEFRPILVDRLVLSLINRGQITPKDFEKELGGVLLTEEGRKKFLEAFEERLQETIKHPFLGRKVSY 301
                         330       340
                  ....*....|....*....|....*...
gi 2707155935 498 RRIFEVQARLLARFLLGEIAEYPAFYTR 525
Cdd:COG1518   302 RRLIRLQARKLAKHLRGELDEYPPFVLR 329
Cas4 COG1468
CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas ...
16-158 1.30e-32

CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas system-associated exonuclease Cas4, RecB family is part of the Pathway/BioSystem: CRISPR-Cas system


:

Pssm-ID: 441077 [Multi-domain]  Cd Length: 184  Bit Score: 123.15  E-value: 1.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  16 EALDSMPELHARSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGTPpdipeRAYEPERVQLCLQGLLLREQ-GYPCTHG 94
Cdd:COG1468    46 ERVYKRLERLRREVPLDSERLGLTGKIDLVEFEDGELVPVEYKKSKP-----KPWEADRMQLCAYALLLEEMlGIPVPKG 120
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2707155935  95 ILYFAASQTRVTVEFTEALQARTLELLAEARHTAAAGESPPPLVESPKCPRCSLVGICLPDEVN 158
Cdd:COG1468   121 YLYYPEERKREEVELTEELREEVEEAIEEIREILESEKPPPPTKSKKKCKKCSYREFCLPRETS 184
 
Name Accession Description Interval E-value
Cas1 COG1518
CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated ...
195-525 5.25e-107

CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated integrase Cas1 is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 441127  Cd Length: 329  Bit Score: 322.53  E-value: 5.25e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPH-K 273
Cdd:COG1518     2 LYITEPGAYLSRKGNTLVVEKEDEEKKRVPLEDIEQIVLFGEVSLSTALLRFLAENGIPVHFLDYYGRYLGRLLPRESgG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAA---ARRRAERAKSLEQLLGIEGYAAR 350
Cdd:COG1518    82 NVLLRRAQYQAYLDEEKRLALARSIVRGKIRNQRAVLRRYGRRRKEDLEEAIERleeLLKRLEEADSIDELRGIEGNAAR 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNalllflyalltkeMLI-------------TLVGVGFDPYLGFYH 417
Cdd:COG1518   162 IYFSALDGLLPED-------FRFEGRSRRPPRDPVN-------------ALLsfgytllysdvlsAIYAAGLDPYIGFLH 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 418 QPKYGRPALALDLMEEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISY 497
Cdd:COG1518   222 EPRPGRPSLALDLMEEFRPILVDRLVLSLINRGQITPKDFEKELGGVLLTEEGRKKFLEAFEERLQETIKHPFLGRKVSY 301
                         330       340
                  ....*....|....*....|....*...
gi 2707155935 498 RRIFEVQARLLARFLLGEIAEYPAFYTR 525
Cdd:COG1518   302 RRLIRLQARKLAKHLRGELDEYPPFVLR 329
Cas1_I-II-III cd09634
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
195-514 9.18e-95

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer integration; Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain


Pssm-ID: 187766  Cd Length: 317  Bit Score: 290.67  E-value: 9.18e-95
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPP-PHK 273
Cdd:cd09634     3 LYITTPGAYLSRKGNRLVVEKEDEKKKRIPLEDIDSIVIFGNVSISTAALRLLAENGIPVHFLDYYGRYLGRLYPPeGGR 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAAARRRAERAK--SLEQLLGIEGYAARD 351
Cdd:cd09634    83 SVLLRRAQYEAYLDPEKRLELAREIVRGKIRNQRRVLKRYARDGKELLLALAELEELLEKLDKakSIEELRGIEGNAAKI 162
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 352 YFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDLM 431
Cdd:cd09634   163 YFEALFQLLPKE-------FRFEGRSRRPPKDPVNALLSYGYSLLYSAVLSAIVAAGLDPYIGFLHEDRPGRPSLALDLM 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 432 EEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHpRFGYAISYRRIFEVQARLLARF 511
Cdd:cd09634   236 EEFRPIIVDRLVLSLINKGQIKKKDFEKELGGVLLTEEGRKKLIEALEERLKETIKH-YLGRKVSYRRLIRLQAYKLAKH 314

                  ...
gi 2707155935 512 LLG 514
Cdd:cd09634   315 LRG 317
Cas_Cas1 pfam01867
CRISPR associated protein Cas1; Clustered regularly interspaced short palindromic repeats ...
195-480 1.29e-93

CRISPR associated protein Cas1; Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. This family of proteins corresponds to Cas1, a CRISPR-associated protein. Cas1 may be involved in linking DNA segments to CRISPR.


Pssm-ID: 460366  Cd Length: 283  Bit Score: 286.66  E-value: 1.29e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPHKH 274
Cdd:pfam01867   1 LYVTTPGAYLSRKGETLVVEKEGEEKKRIPLEDVEQIVIFGNVSISTAALRLLAERGIPVHFLSYYGRYLGRLYPEVSGN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 275 IDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERL----AAARRRAERAKSLEQLLGIEGYAAR 350
Cdd:pfam01867  81 VLLRRAQYRAYDDEEKRLELARSFVAGKLRNQRTVLRRYNRDRKDEALEEAieelEELLKKLERADSIDELRGIEGNAAR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNalllflyallTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDL 430
Cdd:pfam01867 161 AYFSAFDELLPKE-------FGFEGRSRRPPLDPVNallsfgytllYSDVLSALEAVGLDPYIGFLHEDRPGRPSLALDL 233
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 2707155935 431 MEEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYER 480
Cdd:pfam01867 234 MEEFRPVIVDRLVLSLINRGQITPKDFEKRENGVLLNDEGRKKFLKAYEE 283
cas1_CYANO TIGR04093
CRISPR-associated endonuclease Cas1, subtype CYANO; The CRISPR-associated protein Cas1 is ...
195-525 2.79e-81

CRISPR-associated endonuclease Cas1, subtype CYANO; The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes a clade of Cas1 limited to the CYANO subtype of CRISPR/Cas system and most often the type found there.


Pssm-ID: 274976  Cd Length: 323  Bit Score: 256.20  E-value: 2.79e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGE---VLEIRSKGAVVSeARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPP 271
Cdd:TIGR04093   3 LYITQPDAKISKDDGriiVVDSDGEEKVAS-FPLIKVETIVVFGEATLTTPALAHLLERGIVLHYLTRFGKYFGSLVPEL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 272 HKHIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGlpagvLERLAAARRRAERAKSLEQLLGIEGYAARD 351
Cdd:TIGR04093  82 TRNTILRVAQHQAHISPQQRLAIAREIVRGKLRNSRTMLYRRGRK-----QTKIPELEKPVDDKNSLDSLRGLEGEAAAI 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 352 YFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDLM 431
Cdd:TIGR04093 157 YFGCLSDLLPDE-------WSFHGRTRRPPTDPVNALLSLGYALLRTQVLSALRIVGLDPYIGFLHVDRHGRPALALDLM 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 432 EEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQARLLARF 511
Cdd:TIGR04093 230 EEFRPIIVDAVVLRLINRKMLKPKDFQEEPGAVRLKDEAFKLFLGKFEEKMQSEFKHPIFKYKVSYRRAIELQARLLAKA 309
                         330
                  ....*....|....
gi 2707155935 512 LLGEIAEYPAFYTR 525
Cdd:TIGR04093 310 LMGEIKEYPPLIIR 323
Cas4 COG1468
CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas ...
16-158 1.30e-32

CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas system-associated exonuclease Cas4, RecB family is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 441077 [Multi-domain]  Cd Length: 184  Bit Score: 123.15  E-value: 1.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  16 EALDSMPELHARSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGTPpdipeRAYEPERVQLCLQGLLLREQ-GYPCTHG 94
Cdd:COG1468    46 ERVYKRLERLRREVPLDSERLGLTGKIDLVEFEDGELVPVEYKKSKP-----KPWEADRMQLCAYALLLEEMlGIPVPKG 120
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2707155935  95 ILYFAASQTRVTVEFTEALQARTLELLAEARHTAAAGESPPPLVESPKCPRCSLVGICLPDEVN 158
Cdd:COG1468   121 YLYYPEERKREEVELTEELREEVEEAIEEIREILESEKPPPPTKSKKKCKKCSYREFCLPRETS 184
cas4 TIGR00372
CRISPR-associated protein Cas4; This model represents a family of proteins associated with ...
27-154 6.74e-18

CRISPR-associated protein Cas4; This model represents a family of proteins associated with CRISPR repeats in a wide set of prokaryotic genomes. This scope of this model has been broadened since it was first built to describe an archaeal subset only. The function of the protein is undefined. Distantly related proteins, excluded from this model, include ORFs from Mycobacteriophage D29 and Sulfolobus islandicus filamentous virus and a region of the Schizosaccharomyces pombe DNA replication helicase Dna2p.


Pssm-ID: 273040 [Multi-domain]  Cd Length: 178  Bit Score: 81.30  E-value: 6.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  27 RSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGTPPdiPERAYepeRVQLCLQGLLLREQGYPCTHGILYFAASQTRVT 106
Cdd:TIGR00372  56 KEVPLKSKKYGLKGVIDIVLEEDGELVPVEVKSGKPS--PREAH---KYQLLAYAYLLEEMYGEIVRGYILYINAGKKLE 130
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 2707155935 107 VEFTEALQARTLELLAEARHTAAAGESPPPLVESPKCPRCSLVGICLP 154
Cdd:TIGR00372 131 VEISEELRKKAVKLIEKIRELLEGGKPPSPPKSGPKCKFCPYREICLP 178
Cas4_I-A_I-B_I-C_I-D_II-B cd09637
CRISPR/Cas system-associated protein Cas4; CRISPR (Clustered Regularly Interspaced Short ...
27-153 3.41e-16

CRISPR/Cas system-associated protein Cas4; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas4 is RecB-like nuclease with three-cysteine C-terminal cluster


Pssm-ID: 187768 [Multi-domain]  Cd Length: 178  Bit Score: 76.70  E-value: 3.41e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  27 RSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGtPPDIPERAYepeRVQLCLQGLLLREQ-GYPCTHGILYFAASQTRV 105
Cdd:cd09637    56 KEVPLKSKKYGLKGVIDIVLKEDGELVPVEVKSG-RAGSPREAH---KLQLVAYAYLLEEMyGKRVARGYIVYLEGGKRL 131
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 2707155935 106 TVEFTEALQARTLELLAEARhTAAAGESPPPLVESPKCPRCSLVGICL 153
Cdd:cd09637   132 EVEISEELRKKAEKLLEEIR-KLLEGELPPPVKSSPKCKFCPYREICL 178
Cas_Cas4 pfam01930
Domain of unknown function DUF83; This domain has no known function. The domain contains three ...
41-154 2.62e-14

Domain of unknown function DUF83; This domain has no known function. The domain contains three conserved cysteines at its C terminus.


Pssm-ID: 426517 [Multi-domain]  Cd Length: 160  Bit Score: 70.72  E-value: 2.62e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  41 RIDVIEGEHEHVVPVDYKRGTPPDiperayEPERVQLCLQGLLLREQGYPCThGILYFAASQTRVTVEFTEALQARTLEL 120
Cdd:pfam01930  55 KIDFVRRRGGGLVVHEVKKSSKME------EAHRMQLLYYLYYLKKRGIEIK-GVLHYPKERKREEVELTEEDRRELEEA 127
                          90       100       110
                  ....*....|....*....|....*....|....
gi 2707155935 121 LAEARHtAAAGESPPPLVESPKCPRCSLVGICLP 154
Cdd:pfam01930 128 IKEIEE-IISSEKPPPPQKKKICKKCAYYEFCWP 160
 
Name Accession Description Interval E-value
Cas1 COG1518
CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated ...
195-525 5.25e-107

CRISPR-Cas system-associated integrase Cas1 [Defense mechanisms]; CRISPR-Cas system-associated integrase Cas1 is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 441127  Cd Length: 329  Bit Score: 322.53  E-value: 5.25e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPH-K 273
Cdd:COG1518     2 LYITEPGAYLSRKGNTLVVEKEDEEKKRVPLEDIEQIVLFGEVSLSTALLRFLAENGIPVHFLDYYGRYLGRLLPRESgG 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAA---ARRRAERAKSLEQLLGIEGYAAR 350
Cdd:COG1518    82 NVLLRRAQYQAYLDEEKRLALARSIVRGKIRNQRAVLRRYGRRRKEDLEEAIERleeLLKRLEEADSIDELRGIEGNAAR 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNalllflyalltkeMLI-------------TLVGVGFDPYLGFYH 417
Cdd:COG1518   162 IYFSALDGLLPED-------FRFEGRSRRPPRDPVN-------------ALLsfgytllysdvlsAIYAAGLDPYIGFLH 221
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 418 QPKYGRPALALDLMEEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISY 497
Cdd:COG1518   222 EPRPGRPSLALDLMEEFRPILVDRLVLSLINRGQITPKDFEKELGGVLLTEEGRKKFLEAFEERLQETIKHPFLGRKVSY 301
                         330       340
                  ....*....|....*....|....*...
gi 2707155935 498 RRIFEVQARLLARFLLGEIAEYPAFYTR 525
Cdd:COG1518   302 RRLIRLQARKLAKHLRGELDEYPPFVLR 329
Cas1_I-II-III cd09634
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
195-514 9.18e-95

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer integration; Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain


Pssm-ID: 187766  Cd Length: 317  Bit Score: 290.67  E-value: 9.18e-95
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPP-PHK 273
Cdd:cd09634     3 LYITTPGAYLSRKGNRLVVEKEDEKKKRIPLEDIDSIVIFGNVSISTAALRLLAENGIPVHFLDYYGRYLGRLYPPeGGR 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAAARRRAERAK--SLEQLLGIEGYAARD 351
Cdd:cd09634    83 SVLLRRAQYEAYLDPEKRLELAREIVRGKIRNQRRVLKRYARDGKELLLALAELEELLEKLDKakSIEELRGIEGNAAKI 162
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 352 YFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDLM 431
Cdd:cd09634   163 YFEALFQLLPKE-------FRFEGRSRRPPKDPVNALLSYGYSLLYSAVLSAIVAAGLDPYIGFLHEDRPGRPSLALDLM 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 432 EEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHpRFGYAISYRRIFEVQARLLARF 511
Cdd:cd09634   236 EEFRPIIVDRLVLSLINKGQIKKKDFEKELGGVLLTEEGRKKLIEALEERLKETIKH-YLGRKVSYRRLIRLQAYKLAKH 314

                  ...
gi 2707155935 512 LLG 514
Cdd:cd09634   315 LRG 317
Cas_Cas1 pfam01867
CRISPR associated protein Cas1; Clustered regularly interspaced short palindromic repeats ...
195-480 1.29e-93

CRISPR associated protein Cas1; Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. This family of proteins corresponds to Cas1, a CRISPR-associated protein. Cas1 may be involved in linking DNA segments to CRISPR.


Pssm-ID: 460366  Cd Length: 283  Bit Score: 286.66  E-value: 1.29e-93
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPHKH 274
Cdd:pfam01867   1 LYVTTPGAYLSRKGETLVVEKEGEEKKRIPLEDVEQIVIFGNVSISTAALRLLAERGIPVHFLSYYGRYLGRLYPEVSGN 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 275 IDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERL----AAARRRAERAKSLEQLLGIEGYAAR 350
Cdd:pfam01867  81 VLLRRAQYRAYDDEEKRLELARSFVAGKLRNQRTVLRRYNRDRKDEALEEAieelEELLKKLERADSIDELRGIEGNAAR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNalllflyallTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDL 430
Cdd:pfam01867 161 AYFSAFDELLPKE-------FGFEGRSRRPPLDPVNallsfgytllYSDVLSALEAVGLDPYIGFLHEDRPGRPSLALDL 233
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 2707155935 431 MEEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYER 480
Cdd:pfam01867 234 MEEFRPVIVDRLVLSLINRGQITPKDFEKRENGVLLNDEGRKKFLKAYEE 283
cas1_CYANO TIGR04093
CRISPR-associated endonuclease Cas1, subtype CYANO; The CRISPR-associated protein Cas1 is ...
195-525 2.79e-81

CRISPR-associated endonuclease Cas1, subtype CYANO; The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes a clade of Cas1 limited to the CYANO subtype of CRISPR/Cas system and most often the type found there.


Pssm-ID: 274976  Cd Length: 323  Bit Score: 256.20  E-value: 2.79e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGE---VLEIRSKGAVVSeARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPP 271
Cdd:TIGR04093   3 LYITQPDAKISKDDGriiVVDSDGEEKVAS-FPLIKVETIVVFGEATLTTPALAHLLERGIVLHYLTRFGKYFGSLVPEL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 272 HKHIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGlpagvLERLAAARRRAERAKSLEQLLGIEGYAARD 351
Cdd:TIGR04093  82 TRNTILRVAQHQAHISPQQRLAIAREIVRGKLRNSRTMLYRRGRK-----QTKIPELEKPVDDKNSLDSLRGLEGEAAAI 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 352 YFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDLM 431
Cdd:TIGR04093 157 YFGCLSDLLPDE-------WSFHGRTRRPPTDPVNALLSLGYALLRTQVLSALRIVGLDPYIGFLHVDRHGRPALALDLM 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 432 EEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQARLLARF 511
Cdd:TIGR04093 230 EEFRPIIVDAVVLRLINRKMLKPKDFQEEPGAVRLKDEAFKLFLGKFEEKMQSEFKHPIFKYKVSYRRAIELQARLLAKA 309
                         330
                  ....*....|....
gi 2707155935 512 LLGEIAEYPAFYTR 525
Cdd:TIGR04093 310 LMGEIKEYPPLIIR 323
Cas1_I-C cd09721
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
195-522 4.47e-75

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer integration; Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain


Pssm-ID: 187852  Cd Length: 338  Bit Score: 240.60  E-value: 4.47e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPHKH 274
Cdd:cd09721     5 LYVTTQGTYLHKDGETVVVEVEGEKKARVPLHHLGGIVCFGNVGLSPFLMGRCAEDGISLVFLTENGRFLARVEGPVSGN 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 275 IDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAAARRRA--------ERAKSLEQLLGIEG 346
Cdd:cd09721    85 VLLRRAQYRAADDPERSLAIARSIVAGKIRNARQVLLRAARDHGEEDDRAALEAAADRlavalrrlQRADDLDELRGIEG 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 347 YAARDYFAHFACMFKPgedeQAPAFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPAL 426
Cdd:cd09721   165 EAARLYFAVFDHLLRQ----DAPAFRFDGRSRRPPLDPVNALLSFLYTLLTHDCRSALEGVGLDPAVGFLHRDRPGRPSL 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 427 ALDLMEEFRPLIADSVAIGLINNGELRPSDFIARA-GAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQA 505
Cdd:cd09721   241 ALDLMEEFRAVLADRLALSLINRGQLTAKDFEVREgGAVLLTDDARKTVLVAYQERKQEEITHPFLGEKVPLGLLPHVQA 320
                         330
                  ....*....|....*..
gi 2707155935 506 RLLARFLLGEIAEYPAF 522
Cdd:cd09721   321 RLLARHLRGDLDGYPPF 337
cas1_DVULG TIGR03640
CRISPR-associated endonuclease Cas1, subtype I-C/DVULG; The CRISPR-associated protein Cas1 is ...
195-522 6.60e-75

CRISPR-associated endonuclease Cas1, subtype I-C/DVULG; The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes the Cas1 protein particular to the DVULG subtype of CRISPR/Cas system.


Pssm-ID: 188360  Cd Length: 340  Bit Score: 240.22  E-value: 6.60e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGGWLSAITTPPPHKH 274
Cdd:TIGR03640   6 LYVTTQGTYLHKDGETVVVEVEGEKKARVPLHHLGGIVCFGNVGLSPFLMGRCAEDGISLVFLTENGRFLARVEGPVSGN 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 275 IDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAAARRRA--------ERAKSLEQLLGIEG 346
Cdd:TIGR03640  86 VLLRRAQYRAADDPERSLAIARSIVAGKIRNARQVLLRAARDHGEEDDAAALEAAADRlavalrrlQRADDLDSLRGIEG 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 347 YAARDYFAHFACMFKPgedeQAPAFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPAL 426
Cdd:TIGR03640 166 EAARLYFAVFDHLLRQ----DRPAFRFDGRSRRPPLDPVNALLSFLYTLLTHDCRSALEGVGLDPAVGFLHRDRPGRPSL 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 427 ALDLMEEFRPLIADSVAIGLINNGELRPSDFIARA-GAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQA 505
Cdd:TIGR03640 242 ALDLMEEFRAVLADRLALSLINRGQLTAKDFEVREgGAVLLTDDARKTVLVAYQERKQEEILHPFLGEKVPLGLLPHVQA 321
                         330
                  ....*....|....*..
gi 2707155935 506 RLLARFLLGEIAEYPAF 522
Cdd:TIGR03640 322 RLLARHLRGDLDGYPPF 338
cas1 TIGR00287
CRISPR-associated endonuclease Cas1; This model identifies CRISPR-associated protein Cas1, the ...
195-520 9.50e-75

CRISPR-associated endonuclease Cas1; This model identifies CRISPR-associated protein Cas1, the most universal CRISPR system protein. CRISPR is an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, a system for heritable host defense by prokaryotic cells against phage and other foreign DNA. Cas1 is a metal-dependent DNA-specific endonuclease.


Pssm-ID: 272998  Cd Length: 323  Bit Score: 239.17  E-value: 9.50e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGG-WLSAITTPPPHK 273
Cdd:TIGR00287   2 LYVLEYGSYLSRKGNTLVVEKEGKKKWNIPVANVDCIVLFGGVSISSAAIRLLAKRGIDIVFFGGDGnYVGRLSPQESGS 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPA--GVLERLAAARRRAERAKSLEQLLGIEGYAARD 351
Cdd:TIGR00287  82 TVELRLAQVKAYLDEEKRLKLAKEFVSGKIANQAALLKYLTRRREDlrSYLEEYESLMKELASADSIEELMGIEGNAARA 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 352 YFAHFACMFKPgedeqapAFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDLM 431
Cdd:TIGR00287 162 YYAALAQLLPD-------EFGFNGRSKRPPKDPFNALLSYGYSLLYSNVLTAIYIAGLDPYIGFLHTDRSGRPSLVLDLM 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 432 EEFRPLIADSVAIGLINNGELRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQARLLARF 511
Cdd:TIGR00287 235 EEFKPQIVDRLVFSLINRNIITEEDFEKISNGVYLGEEGRKKFLQAFEERLQTTVTHPGLNRRVSYLDIIILQARKLAKA 314

                  ....*....
gi 2707155935 512 LLGEIAEYP 520
Cdd:TIGR00287 315 LRGEERYRP 323
cas1_MYXAN TIGR03983
CRISPR-associated endonuclease Cas1, subtype MYXAN; Members of this protein are the Cas1 ...
182-515 1.07e-58

CRISPR-associated endonuclease Cas1, subtype MYXAN; Members of this protein are the Cas1 endonuclease, or Cas1 domain in Cas4/Cas1 fusion proteins, of the MYXAN subtype of CRISPR/Cas systems. These systems typically feature repeats and spacers each about 36 base pairs in length. Species with this type of CRISPR system include Myxococcus xanthus, Cyanothece sp., Leptospira interrogans, Sorangium cellulosum, Anabaena variabilis ATCC 29413, etc.


Pssm-ID: 274899  Cd Length: 347  Bit Score: 198.06  E-value: 1.07e-58
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 182 PRRLVPARDDRRAVYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGG 261
Cdd:TIGR03983   1 PVRLFPEDDERQVLHVLEPGTRIGRTGEQLKVTRREGADEKIPIQQVSQVVLHGFSQISTQALHFLASEEIGVHWVSGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 262 -WLSAITTPPphKHIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLPAGVLERLAAARRRAERAK---- 336
Cdd:TIGR03983  81 rYIGSFDGRS--GSVQRRIRQFRALTQEDFCLGLARKLVAAKGEGQLRFLLRAKRGDKESRPELESAIAQMRAVLKqveq 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 337 --SLEQLLGIEGYAARDYFAHFACMFKPGEDeqaPAFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLG 414
Cdd:TIGR03983 159 aeSLESLLGIEGNLAALYFGALPALLGKDVD---DSLRFSGRNRRPPKDRFNALLSFGYSLLYKDVMNAILAVGLEPAFG 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 415 FYHQPKYGRPALALDLMEEFRPLIADSVAIGLINNGELR-PSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGY 493
Cdd:TIGR03983 236 FYHQPRTQAPPLALDLMELFRVPLVDMPVVGSINRRQWDiDEDFEITGQQVWLSDSGRRKFIELYERRKAETWKHPVLGY 315
                         330       340
                  ....*....|....*....|..
gi 2707155935 494 AISYRRIFEVQARLLARFLLGE 515
Cdd:TIGR03983 316 SLSYARLIELEVRLLEKEWSGE 337
Cas1_I-II-III cd09636
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
195-457 5.16e-52

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer integration; Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain


Pssm-ID: 187767  Cd Length: 260  Bit Score: 177.87  E-value: 5.16e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 195 VYVQRQGHSVGLKGEVLEIRSKGAVVSEARLLEMSQLSLFGNVQLTSQALRALAAREIPIVHLSYGG-WLSAITTPPPHK 273
Cdd:cd09636     1 LYVLEYGSYLSRKGNTLVVEKEGELKKNIPVANVDCIVIFGGVSISSAAIRELAKRGIDIVFLGGDGnYLGRLYPPEPGS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 274 HIDLRRRQFATAVDQAVYVSLARAFVAGKIRNTRTLLRRNSRGLP---AGVLERLAAARRRAERAKSLEQLLGIEGYAAR 350
Cdd:cd09636    81 SVFTRRAQYKAYLNPAKRLKLAREFVEGKLANQAALLRYLTRRREdlkSELAEYIAELLKELDNANSIEELMGIEGNAAR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 351 DYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNALLLFLYALLTKEMLITLVGVGFDPYLGFYHQPKYGRPALALDL 430
Cdd:cd09636   161 AYYAALAQLLPDE-------FGFNGRSRRPPKDPFNALLSYGYSLLYSNVLTAIYIAGLDPYIGFLHTDRSGRPSLVLDL 233
                         250       260
                  ....*....|....*....|....*..
gi 2707155935 431 MEEFRPLIADSVAIGLINNGELRPSDF 457
Cdd:cd09636   234 MEEFRPQIVDRLVFSLINRGIITPEDF 260
Cas1_I-B cd09722
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
234-522 9.14e-41

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer integration; Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain


Pssm-ID: 187853  Cd Length: 320  Bit Score: 149.28  E-value: 9.14e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 234 FGNVQLTSQALRALAAREIPIvHL--SYGGWLSAITtppPHKHI---DLRRRQFATAVDQAVYVSLARAFVAGKIRN-TR 307
Cdd:cd09722    39 FGEVSLNSKALSFLSKKGIPI-HFfnYYGYYSGSFY---PRESLnsgYLIVKQVEHYLDSEKRLELARSFVEGAAHNmRR 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 308 TLLRRNSRGlpAGVLERLAAARRRAERAKSLEQLLGIEGYAARDYFAHFACMFKPGedeqapaFEFTSRNRRPPRDPVNa 387
Cdd:cd09722   115 VLKYYKDRG--DDLDDYLDEIEEQKESANSINELMGVEGNIRKTYYSAFDEILKDE-------FRFEKRTRRPPKNELN- 184
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935 388 lllflyalltkemliTLVGVG----------------FDPYLGFYHQPKYGRPALALDLMEEFRPLIADSVAIGLINNGE 451
Cdd:cd09722   185 ---------------ALISFGnsllyttvlseiykthLNPTISYLHEPSERRFSLALDIAEIFKPIIVDRLIFRLVNKKI 249
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2707155935 452 LRPSDFIARAGAVALTDNGRRRVLDAYERRLDTLVTHPRFGYAISYRRIFEVQARLLARFLLGEiAEYPAF 522
Cdd:cd09722   250 IKEKHFEKDLNGVLLNEEGRKKFVKEFEEKLKTTIKHRKLKRKVSYRRLIRLELYKLIKHLLGE-EEYKPF 319
Cas4 COG1468
CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas ...
16-158 1.30e-32

CRISPR/Cas system-associated exonuclease Cas4, RecB family [Defense mechanisms]; CRISPR/Cas system-associated exonuclease Cas4, RecB family is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 441077 [Multi-domain]  Cd Length: 184  Bit Score: 123.15  E-value: 1.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  16 EALDSMPELHARSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGTPpdipeRAYEPERVQLCLQGLLLREQ-GYPCTHG 94
Cdd:COG1468    46 ERVYKRLERLRREVPLDSERLGLTGKIDLVEFEDGELVPVEYKKSKP-----KPWEADRMQLCAYALLLEEMlGIPVPKG 120
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2707155935  95 ILYFAASQTRVTVEFTEALQARTLELLAEARHTAAAGESPPPLVESPKCPRCSLVGICLPDEVN 158
Cdd:COG1468   121 YLYYPEERKREEVELTEELREEVEEAIEEIREILESEKPPPPTKSKKKCKKCSYREFCLPRETS 184
cas4 TIGR00372
CRISPR-associated protein Cas4; This model represents a family of proteins associated with ...
27-154 6.74e-18

CRISPR-associated protein Cas4; This model represents a family of proteins associated with CRISPR repeats in a wide set of prokaryotic genomes. This scope of this model has been broadened since it was first built to describe an archaeal subset only. The function of the protein is undefined. Distantly related proteins, excluded from this model, include ORFs from Mycobacteriophage D29 and Sulfolobus islandicus filamentous virus and a region of the Schizosaccharomyces pombe DNA replication helicase Dna2p.


Pssm-ID: 273040 [Multi-domain]  Cd Length: 178  Bit Score: 81.30  E-value: 6.74e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  27 RSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGTPPdiPERAYepeRVQLCLQGLLLREQGYPCTHGILYFAASQTRVT 106
Cdd:TIGR00372  56 KEVPLKSKKYGLKGVIDIVLEEDGELVPVEVKSGKPS--PREAH---KYQLLAYAYLLEEMYGEIVRGYILYINAGKKLE 130
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 2707155935 107 VEFTEALQARTLELLAEARHTAAAGESPPPLVESPKCPRCSLVGICLP 154
Cdd:TIGR00372 131 VEISEELRKKAVKLIEKIRELLEGGKPPSPPKSGPKCKFCPYREICLP 178
Cas4_I-A_I-B_I-C_I-D_II-B cd09637
CRISPR/Cas system-associated protein Cas4; CRISPR (Clustered Regularly Interspaced Short ...
27-153 3.41e-16

CRISPR/Cas system-associated protein Cas4; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas4 is RecB-like nuclease with three-cysteine C-terminal cluster


Pssm-ID: 187768 [Multi-domain]  Cd Length: 178  Bit Score: 76.70  E-value: 3.41e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  27 RSVMAGSDTLGAVARIDVIEGEHEHVVPVDYKRGtPPDIPERAYepeRVQLCLQGLLLREQ-GYPCTHGILYFAASQTRV 105
Cdd:cd09637    56 KEVPLKSKKYGLKGVIDIVLKEDGELVPVEVKSG-RAGSPREAH---KLQLVAYAYLLEEMyGKRVARGYIVYLEGGKRL 131
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 2707155935 106 TVEFTEALQARTLELLAEARhTAAAGESPPPLVESPKCPRCSLVGICL 153
Cdd:cd09637   132 EVEISEELRKKAEKLLEEIR-KLLEGELPPPVKSSPKCKFCPYREICL 178
Cas_Cas4 pfam01930
Domain of unknown function DUF83; This domain has no known function. The domain contains three ...
41-154 2.62e-14

Domain of unknown function DUF83; This domain has no known function. The domain contains three conserved cysteines at its C terminus.


Pssm-ID: 426517 [Multi-domain]  Cd Length: 160  Bit Score: 70.72  E-value: 2.62e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  41 RIDVIEGEHEHVVPVDYKRGTPPDiperayEPERVQLCLQGLLLREQGYPCThGILYFAASQTRVTVEFTEALQARTLEL 120
Cdd:pfam01930  55 KIDFVRRRGGGLVVHEVKKSSKME------EAHRMQLLYYLYYLKKRGIEIK-GVLHYPKERKREEVELTEEDRRELEEA 127
                          90       100       110
                  ....*....|....*....|....*....|....
gi 2707155935 121 LAEARHtAAAGESPPPLVESPKCPRCSLVGICLP 154
Cdd:pfam01930 128 IKEIEE-IISSEKPPPPQKKKICKKCAYYEFCWP 160
Cas1_II cd09720
CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short ...
403-451 1.47e-05

CRISPR/Cas system-associated protein Cas1; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas1 is the most universal CRISPR system protein thought to be involved in spacer intergration. Cas1 is metal-dependent deoxyribonuclease, also binds RNA; Shown to possess a unique fold consisting of a N-terminal beta-strand domain and a C-terminal alpha-helical domain.


Pssm-ID: 187851  Cd Length: 275  Bit Score: 46.86  E-value: 1.47e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 2707155935 403 TLVGVGFDPYLGFYHQPKYGRPALALDLMEEFRPLIaDSVAIGLINNGE 451
Cdd:cd09720   189 ALVKAGLLPRLGIFHKSEYNPFNLADDLMEPFRPLV-DYLVYELLFESR 236
cas1_NMENI TIGR03639
CRISPR-associated endonuclease Cas1, subtype II/NMENI; The CRISPR-associated protein Cas1 is ...
403-451 3.84e-05

CRISPR-associated endonuclease Cas1, subtype II/NMENI; The CRISPR-associated protein Cas1 is virtually universal to CRISPR systems. CRISPR, an acronym for Clustered Regularly Interspaced Short Palindromic Repeats, is a prokaryotic immunity system for foreign DNA, mostly from phage. CRISPR systems belong to different subtypes, distinguished by both nature of the repeats, the makeup of the cohort of associated Cas proteins, and by molecular phylogeny within the more universal Cas proteins such as this one. This model is of type EXCEPTION and provides more specific information than the EQUIVALOG model TIGR00287. It describes the Cas1 variant of the NMENI subtype of CRISPR/Cas system.


Pssm-ID: 274694  Cd Length: 278  Bit Score: 45.71  E-value: 3.84e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 2707155935 403 TLVGVGFDPYLGFYHQPKYGRPALALDLMEEFRPLIaDSVAIGLINNGE 451
Cdd:TIGR03639 190 ALVKSGLLPRLGIFHKSEYNPFNLADDLMEPFRPLV-DYLVYELLIEEF 237
PDDEXK_1 pfam12705
PD-(D/E)XK nuclease superfamily; Members of this family belong to the PD-(D/E)XK nuclease ...
39-152 7.01e-04

PD-(D/E)XK nuclease superfamily; Members of this family belong to the PD-(D/E)XK nuclease superfamily


Pssm-ID: 432731 [Multi-domain]  Cd Length: 250  Bit Score: 41.37  E-value: 7.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2707155935  39 VARIDVIEGEHE-HVVPVDYKRGTPPDIPERAYEPERVQLCLQGLLLREQGYPCTH--GILYFAASQTRVTVEFTEALQA 115
Cdd:pfam12705 128 VGRIDRVDLDGEgYLRIIDYKTGSAPPQSEDLDLYEGLQLLLYLLALAAGEKALGGpaGALYLRLDDPLKKDEEVVEPMV 207
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 2707155935 116 RTLE----LLAEARHTAAAGESPP-PLVESPKCPRCSLVGIC 152
Cdd:pfam12705 208 LTEDefdaLLQELRELAEEILAGEfPARPGKKCRYCPYRSIC 249
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH