GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
Series GSE157145 Query DataSets for GSE157145
Status Public on Apr 06, 2021
Title Transcriptional and imprinting complexity in Arabidopsis seeds at single-nucleus resolution
Organism Arabidopsis thaliana
Experiment type Expression profiling by high throughput sequencing
Summary Seeds are the basis of agriculture, yet their full transcriptional complexity has remained unknown. Here, we employ single-nucleus RNA-sequencing to characterize developing Arabidopsis thaliana seeds, with a focus on endosperm. Endosperm, the site of gene imprinting in plants, mediates the relationship between the maternal parent and embryo. We identify new cell types in the chalazal endosperm region, which interfaces with maternal tissue for nutrient unloading. We further demonstrate that the extent of parental bias of maternally expressed imprinted genes varies with cell cycle phase, and that imprinting of paternally expressed imprinted genes is strongest in chalazal endosperm. These data indicate imprinting in endosperm is heterogeneous and suggest that parental conflict, which is proposed to drive the evolution of imprinting, is fiercest at the boundary between filial and maternal tissues.
Overall design To identify cell/nuclei types and investigate imprinting dynamics within endosperm, single nucleus RNA-seq was performed using the Smart-seq2 method on seed nuclei, targeting either the 3C or 6C FANS peak in order to enrich for triploid endosperm (other seed tissues are diploid). A total of 1,664 libraries were sequenced, of which 64 were negative controls (no DNA in library prep or no cell sorted), 51 were made from two nuclei to test single-nuclei sorting accuracy, 112 failed QC checks (< 1,500 genes detected and/or <1,000 detected with 5+ reads), and the remaining 1,437 were high-quality single-nuclei libraries used in final analysis. The final dataset consists primarily of seeds derived from Col, Cvi reciprocal crosses at 4 days after pollination (DAP), although some nuclei were instead obtained from Col, Ler crosses and/or other timepoints.

Please note that the counts.txt files for the P17_10A, P21_10A, and P22_10A samples are identical as they are from negative controls that should not have any reads mapping to A. thaliana.
There are ~50 negative controls in the dataset, and most have at least one or two reads falling in a gene somewhere and so the processed data files (read counts per gene) aren't exactly the same, but the read counts in all three files are actually zero for all genes, so all three files are identical.
Contributor(s) Picard CL, Povilus RA, Williams BP, Gehring M
Citation(s) 34059805
Submission date Aug 30, 2020
Last update date Jul 14, 2021
Contact name Colette L Picard
Organization name University of California - Los Angeles
Department Molecular, Cell and Developmental Biology
Lab Colette L Picard
Street address 610 Charles E Young Dr East, Room 4045
City Los Angeles
State/province California
ZIP/Postal code 90095
Country USA
Platforms (2)
GPL13222 Illumina HiSeq 2000 (Arabidopsis thaliana)
GPL19580 Illumina NextSeq 500 (Arabidopsis thaliana)
Samples (1664)
GSM4755002 P1_2A
GSM4755003 P1_2B
GSM4755004 P1_2C
BioProject PRJNA660263
SRA SRP279357

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE157145_CPM_total_expression.txt.gz 23.0 Mb (ftp)(http) TXT
GSE157145_RAW.tar 156.5 Mb (http)(custom) TAR (of TXT)
GSE157145_norm_maternal_counts.txt.gz 2.2 Mb (ftp)(http) TXT
GSE157145_norm_paternal_counts.txt.gz 1.3 Mb (ftp)(http) TXT
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap