NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE108401 Query DataSets for GSE108401
Status Public on Mar 16, 2018
Title The Complex Architecture of Transgene Insertions
Organism Arabidopsis thaliana
Experiment type Expression profiling by high throughput sequencing
Non-coding RNA profiling by high throughput sequencing
Methylation profiling by high throughput sequencing
Genome binding/occupancy profiling by high throughput sequencing
Summary Over the last 30 years the soil bacterium Agrobacterium tumefaciens has been the workhorse tool for plant genome engineering. Replacement of native tumor-inducing (Ti) plasmid elements with customizable cassettes enabled insertion of a sequence of interest as “Transfer DNA” (T-DNA) into the plant genome of interest. Although these T-DNA transfer mechanisms are well understood, detailed understanding of the structure and epigenomic status of insertion events was limited by current technologies. To fill this gap, we analyzed transgenic Arabidopsis thaliana lines from three widely used collections (SALK, SAIL and WISC) with two single molecule technologies, optical genome mapping and nanopore sequencing. Optical maps for four randomly selected T-DNA lines revealed between one and seven insertions/rearrangements with unexpectedly large sizes ranging from 27 to 236 kilobases. De novo nanopore-based genome assemblies for two heterozygous lines resolved T-DNA structures up to 36 kb and revealed large-scale T-DNA associated translocations and exchange of chromosome arm ends. The multiple internally rearranged nature of T-DNA arrays, consisting of identical T-DNA/backbone concatemers made full assembly even for long nanopore reads impossible. For the current TAIR10 reference genome, nanopore contigs corrected 83% of non-centromeric misassemblies. This unprecedented nucleotide-level definition of T-DNA insertions enabled the mapping of epigenome data. The SALK_059379 T-DNA insertions were enriched for 24nt siRNAs and contained dense cytosine DNA methylation. Transgene silencing via the RNA directed DNA methylation pathway was confirmed by in planta assays. In contrast, SAIL_232 T-DNA sequence was predominantly targeted by 21/22nt siRNAs, and DNA methylation and silencing was limited to the GUS gene, but not the resistance gene. With the emergence of genome editing technologies that rely on Agrobacterium for gene delivery, this study provides new insights into the structural impact of engineering plant genomes and demonstrates the utility of state-of-the-art long-range sequencing technologies to rapidly identify unanticipated genomic changes.
 
Overall design 2 biological samples, each RNAseq, smallRNAseq and bisulfite sequencing
 
Contributor(s) Jupe F, Michael TP, Rivkin AC, Chen H, Ecker JR
Citation(s) 30657772
Submission date Dec 21, 2017
Last update date Nov 29, 2022
Contact name Joseph R Ecker
E-mail(s) ecker@salk.edu
Phone 8584534100
Organization name HHMI-Salk-Institute
Department Genomic Analysis Laboratory
Lab Ecker lab
Street address 10010 North Torrey Pines Road
City La Jolla
State/province CA
ZIP/Postal code 92037
Country USA
 
Platforms (2)
GPL17639 Illumina HiSeq 2500 (Arabidopsis thaliana)
GPL21785 Illumina HiSeq 4000 (Arabidopsis thaliana)
Samples (39)
GSM2897724 SAIL_232 RNAseq
GSM2897725 SAIL_232 smallRNAseq
GSM2897726 SAIL_232 bisulfite seq
Relations
BioProject PRJNA427134
SRA SRP127334

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE108401_RAW.tar 685.2 Mb (http)(custom) TAR (of BW, TSV)
GSE108401_bigwig_file_description.txt.gz 1.0 Kb (ftp)(http) TXT
GSE108401_pBIN_ROK2_pCSA110_sequences.txt.gz 6.0 Kb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap