Conserved domains on [gi|2102928282|ref|WP_223993174|]

MULTISPECIES: IS21 family transposase [unclassified Arthrobacter]

Protein Classification

transposase (domain architecture ID 11468299)

A transposase binds to the end of a transposon and catalyzes the movement of the transposon to another part of the genome, either by a cut-and-paste mechanism or by a replicative transposition mechanism.

List of domain hits

Name  Accession  Description  Interval  E-value
COG4584  COG4584  Transposase [Mobilome: prophages, transposons]  1-461  1.05e-73

Pssm-ID: 443641 [Multi-domain]  Cd Length: 484  Bit Score: 241.28  E-value: 1.05e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282   1 MKHDGEIMEILAAYDLTGSLRATAELTGCSHHTVARHVA-ARDAGRPIAEPAPRPRVTDAYLPKIEEWVEASKgRIRADK 79
Cdd:COG4584     1 MLTMEQIREIRRLLREGLSIREIARELGISRNTVRKYLRrAEEWPELYPRRRPRPSKLDPYKEYIDEWLEEGP-RVTAKR 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282  80 AHEKLLAL-GYEGSERSTRRAIAQVKAAWrlgHTRVHRPWITEPGMWLQYDFGDG--PRIGGVKTIL--FVAWLAFSRFR 154
Cdd:COG4584    80 IWEELKEEhGYTGSYSTVRRYVRRLRPEY---PKEAFVRLEHPPGEQAQVDWGEAtvPPITGERRKVyvFVAVLGYSRYK 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 155 IVIPLRDRTAPSVFAALDRCFRILGGAPTYVLTDNEKTVTTAHVAGVPVRNQQTLDFARHYGVTVLTCQPADPATKGGVE 234
Cdd:COG4584   157 YVEAYPSQTQEDLLEAHVRAFEFFGGVPREIVYDNLKTAVTKADRGEPVLNERFLAFAAHYGFEPRPCRPRRPKEKGKVE 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 235 ASVKLAKADLVPtdtnLRPGYASFAELEAACEAFMDEVNN-REHRATRRKPAMVLAEEAARLHRIPETAHTVAFGLARIV 313
Cdd:COG4584   237 NAVGYVRRNFLA----PRPRFTSLEELNAALLEWLERVANrRIHGTTGESPAERFAEEEEALLPLPPPPPPAVRYETVRV 312
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 314 PENTPMVSFENAQYSVPSHLLGAKVFVRSHgtgpDEQVIIVHHGptgpVEVARHGRARPGSPAVIEEHFPGTSTRTPGDY 393
Cdd:COG4584   313 VKDDVVVVDGNRYSVPPYSVGRRVVVVVVV----DRVVVIVVGE----VVAAHHRRRHRRGKRRDPPHYLLPLLLRPRAA 384
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2102928282 394 AVKARTAGEAEFLAIGAGAKTWLVEAAAAGTGRMNMKMAEAVTLAKIAGTESVDRALGDAALHGRFAH 461
Cdd:COG4584   385 PPAAALAAALLAEAPEELRLLLLALLRARGRLARLLLLLLLLLLLELAAAAAAAAAALAAALAALLLS 452
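The paired rows above can be compared column by column. The following is a minimal sketch, not part of the CD-Search output, that parses rows of this shape and counts identical residues. It assumes that lowercase letters mark unaligned residues and that `-` marks a gap, so only columns with an uppercase residue in both rows are counted. The names `ROW`, `parse_rows`, and `identity` are hypothetical.

```python
import re

# One alignment row: label, start coordinate, residues, end coordinate.
ROW = re.compile(r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$")


def parse_rows(text):
    """Concatenate the query ("gi ...") and model ("Cdd:...") residue strings."""
    query, model = [], []
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match is None:
            continue  # ruler lines, position labels, and blank lines
        target = query if match.group("label").startswith("gi ") else model
        target.append(match.group("seq"))
    return "".join(query), "".join(model)


def identity(query, model):
    """Return (identical, aligned) over columns where both residues are uppercase.

    Assumption: lowercase letters are unaligned residues and '-' is a gap.
    """
    identical = aligned = 0
    for q, s in zip(query, model):
        if q.isupper() and s.isupper():
            aligned += 1
            identical += q == s
    return identical, aligned


# The first ten columns of the COG4584 alignment above, used as a small sample.
sample = """
gi 2102928282   1 MKHDGEIMEI 10
Cdd:COG4584     1 MLTMEQIREI 10
"""
query, model = parse_rows(sample)
print(identity(query, model))
```

Passing the full alignment text instead of `sample` gives the counts for the whole hit; the percentage is `identical / aligned`.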
 
transpos_IS21  NF033546  IS21 family transposase  10-300  4.68e-53

Pssm-ID: 468077 [Multi-domain]  Cd Length: 296  Bit Score: 181.25  E-value: 4.68e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282  10 ILAAYDLTGSLRATAELTGCSHHTVARHVAARDAGRP--IAEPAPRPRVTDAYLPKIEEWVEAS--KGRIRADKAHEKLL 85
Cdd:NF033546    1 IRLLFRQGLSIREIARELGISRNTVRKYLRRAGLDEPpkYERRPPRPSKLDPFEPYIPDWLEAHlrKPGVTATLLWEELR 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282  86 ALGYEGSERSTRRAIAQVKAAwrLGHTRVHRPWITEPGMWLQYDFGD--GPRIGGVKTIL--FVAWLAFSRFRIVIPLRD 161
Cdd:NF033546   81 AEGYPGSYSTVRRYVRRWRAE--QGPAKVFVRLEHAPGEQAQVDFGEatVVVTGGTGKILhvFVAVLGYSRYTYVEATPS 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 162 RTAPSVFAALDRCFRILGGAPTYVLTDNEKT-VTTAHVAGVPVRNQQTLDFARHYGVTVLTCQPADPATKGGVEASVKLA 240
Cdd:NF033546  159 ESQEDLLDGHQRAFEFFGGVPREIVYDNLKTaVDKRDRYEEPRLNPRFAAFAAHYGFEPRPCRPYRPQEKGKVERAVGYV 238
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2102928282 241 KADLVPTdtnLRPGYASFAELEAACEAFMDEVNN-REHRATRRKPAMVLAEEAARLHRIPE 300
Cdd:NF033546  239 RRWFLRL---RGRRFESLAELNAALAEWLAELANqRPHGTTGGSPAERFEEERPALQPLPA 296
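Each hit reports a "Cd Length" (the length of the domain model), and the `Cdd:` rows show which model positions were aligned. A rough way to read these together is the fraction of the model covered by the alignment. The sketch below takes the first and last model positions from the `Cdd:` rows (1 to 452 for COG4584, 1 to 296 for NF033546) and the Cd Length values from the summary lines. The function name `model_coverage` is hypothetical, and positions skipped by gaps inside the alignment are ignored.

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment (end-to-end span)."""
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("model coordinates must lie within 1..cd_length")
    return (model_end - model_start + 1) / cd_length


# Values read from the two hits shown above.
hits = {
    "COG4584": (1, 452, 484),
    "NF033546": (1, 296, 296),
}
for accession, (start, end, length) in hits.items():
    print(accession, round(model_coverage(start, end, length), 3))
```

Under this reading, the IS21 family model is spanned in full, while the COG4584 alignment stops 32 positions short of the model's C-terminus.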
rve  pfam00665  Integrase core domain  122-227  1.47e-12

Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc-binding domain (pfam02022). This domain is the central catalytic domain. The carboxyl-terminal domain is a non-specific DNA-binding domain (pfam00552). The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.

Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 63.49  E-value: 1.47e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 122 PGMWLQYDFGDGPRI-GGVKTILFVAWLAFSRFRIVIPLRDR-TAPSVFAALDRCFRILGGAPTYVLTDNEKTVTtahva 199
Cdd:pfam00665   1 PNQLWQGDFTYIRIPgGGGKLYLLVIVDDFSREILAWALSSEmDAELVLDALERAIAFRGGVPLIIHSDNGSEYT----- 75
                          90       100
                  ....*....|....*....|....*...
gi 2102928282 200 gvpvrNQQTLDFARHYGVTVLTCQPADP 227
Cdd:pfam00665  76 -----SKAFREFLKDLGIKPSFSRPGNP 98
transpos_IS481  NF033577  IS481 family transposase  8-284  5.78e-09

Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 57.22  E-value: 5.78e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282   8 MEILAAYDLTG-SLRATAELTGCSHHTVARHVAARDAGRPIAE------PAPRPRVTDaylPKIEEWVEA--SKGRIRAD 78
Cdd:NF033577    3 LELVRLVLEDGwSVREAARRFGISRKTVYKWLKRYRAGGEEGLidrsrrPHRSPRRTS---PETEARILAlrRELRLGPR 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282  79 KAHEKLLALGYEGSERSTRRAIAQ---VKAAWRLGHTRVHRPWITE-PGMWLQYDFGDGPRIGGV-KTILFVAWLAFSRF 153
Cdd:NF033577   80 RIAYELERQGPGVSRSTVHRILRRhglSRLRALDRKTGKVKRYERAhPGELWHIDIKKLGRIPDVgRLYLHTAIDDHSRF 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2102928282 154 RIVIPLRDRTAPSVFAALDRCFRILGGAPTYVLTDNektvttahvaGVPVRNQQTlDFARH---YGVTVLTCQPADPATK 230
Cdd:NF033577  160 AYAELYPDETAETAADFLRRAFAEHGIPIRRVLTDN----------GSEFRSRAH-GFELAlaeLGIEHRRTRPYHPQTN 228
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 2102928282 231 GGVEASVKLAKADLVPTDTnlrpgYASFAELEAACEAFMDEVNN-REHRATRRKP 284
Cdd:NF033577  229 GKVERFHRTLKDEFAYARP-----YESLAELQAALDEWLHHYNHhRPHSALGGKT 278
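The four hits overlap on the query sequence, and their intervals show how they relate: the rve catalytic core (122-227) lies inside each of the three transposase models. The sketch below records the intervals and E-values from the hit list and reports, for each hit, which other hits fully contain it. The `Hit` type and the function names are hypothetical and are not part of any NCBI tool.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    accession: str
    start: int  # first query residue covered by the hit
    end: int    # last query residue covered by the hit
    evalue: float


# Intervals and E-values as listed in the domain hit table.
HITS = [
    Hit("COG4584", "COG4584", 1, 461, 1.05e-73),
    Hit("transpos_IS21", "NF033546", 10, 300, 4.68e-53),
    Hit("rve", "pfam00665", 122, 227, 1.47e-12),
    Hit("transpos_IS481", "NF033577", 8, 284, 5.78e-09),
]


def contains(outer, inner):
    """True if the query interval of `outer` fully covers that of `inner`."""
    return outer.start <= inner.start and inner.end <= outer.end


def nested_in(hits):
    """Map each hit name to the names of the other hits that contain it."""
    return {
        h.name: [o.name for o in hits if o is not h and contains(o, h)]
        for h in hits
    }


for name, outers in nested_in(HITS).items():
    print(name, "is contained in", outers or "no other hit")
```

The IS21 and IS481 hits overlap each other but neither contains the other, since IS481 starts earlier (residue 8) and IS21 ends later (residue 300).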
 
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
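The E-value threshold determines which hits are reported. As a small illustration, the sketch below applies the 0.01 cutoff to the E-values listed for this protein. Whether the service treats the cutoff as inclusive is an assumption here, and the `passes` helper is hypothetical.

```python
E_VALUE_THRESHOLD = 0.01  # "E-value threshold" from the preset options


def passes(evalue, threshold=E_VALUE_THRESHOLD):
    """True if a hit would be kept (assumes an inclusive cutoff)."""
    return evalue <= threshold


# E-values reported for this protein, keyed by accession.
reported = {
    "COG4584": 1.05e-73,
    "NF033546": 4.68e-53,
    "pfam00665": 1.47e-12,
    "NF033577": 5.78e-09,
}
kept = [accession for accession, evalue in reported.items() if passes(evalue)]
print(kept)
```

All four reported hits fall well below the cutoff; the weakest, NF033577 at 5.78e-09, is still several orders of magnitude under 0.01.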
