NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM1924865 Query DataSets for GSM1924865
Status Public on Nov 04, 2015
Title Mouse neural stem cell_2
Sample type SRA
 
Source name cerebral cortex
Organism Mus musculus
Characteristics cell line: CD-1
tissue: cerebral cortex
cell type: Mouse neural stem cell
Growth protocol Mouse neural stem cells were cultured and passaged every 2 days as monolayers in MEM/F12 medium containing Glutamax, non-essential amino acids, B27, N2 supplement, 20 ng/ml EGF, and 20 ng/ml FGF. Cells were dissociated using Accutase (Invitrogen) and seeded onto poly-d-lysine-coated plates or dishes.
Extracted molecule polyA RNA
Extraction protocol Total RNA was extracted using the miRNeasy kit (Qiagen) under manufactor’s protocols. The quality was accessed by Bioanalyzer. Samples with high RNA integrity number (RIN) (>8) were used for library construction. 100ng total RNAs were used for each sequencing library.
RNA samples were polyA selected and paired-end sequencing libraries were constructed using TruSeq RNA Sample Prep Kit as described in the TruSeq RNA Sample Preparation V2 Guide (Illumina).
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina HiSeq 2000
 
Data processing We mapped the paired-end reads to the mouse reference genome (UCSC version mm10) using TopHat (version 2.0.11) and Bowtie2 (version 2.2.2). Cufflinks (version 1.3.0) was used to assemble transcripts from genomic read alignments. TopHat was run using default settings, except using the option “-G mm10_genes.gtf” (mm10_genes.gtf is an annotation file within the Illumina iGenome package. Cufflinks was run using default settings without any annotation file to identify novel transcripts. All transcripts were then merged using Cuffmerge and then compared to a reference annotation file (mm10_genes.gtf) using Cuffcompare. Transcripts not matching any reference transcripts were given a class code ‘u’ (to designate them as unknown intergenic transcripts) by Cuffcompare and the considered as ‘novel’ transcripts not present in the reference file. The resulting .gtf was then filtered and only multi-exonic transcripts were retained.
In order to generate a comprehensive lncRNA annotation for downstream analysis, we surveyed available lncRNA annotations in the public domain, including lncRNAdb, GENCODE, Ensembl, and UCSC RefSeq genes. For lncRNAdb, the transcript structures were obtained using the BLAT program when only sequence information was available. We also added several recently published RNA-Seq datasets related to brain or CNS development to our analyses, including RNA-Seq data from subventricular zone (SVZ) tissue and olfactory bulb (OB), among others. To combine lncRNA annotations generated using RNA-Seq, we merged lncRNA transcripts identified from the same loci that were not annotated by GENCODE, or any other canonical lncRNA database, into single transcripts because distinguishing full-length isoforms from partially reconstructed fragments is not always possible without further experimental evidence, especially when sequencing depths vary among different RNA-Seq datasets. Subsequently, we combined lncRNA annotations from all sources into one non-redundant annotation file. Because there are overlaps between various annotation resources, a procedure was implemented to eliminate redundancy. Among these annotation resources, lncRNAdb contains the most detailed information for ~100 lncRNAs collected from the literature. When transcripts from different sources overlapped, we retained the gene information (gene_id, gene_name etc.) for the loci in the following order of priority: 1) lncRNAdb, 2) GENCODE, 3) RefSeq, 4) Ensembl, or 5) annotated by RNA-Seq experiments. We further combined the resulting lncRNA annotation .gtf with iGenome mm10 gtf annotation and removed any redundancy.
To quantify transcript expression, we re-ran Cufflink with option –G and supplied the comprehensive .gtf. The multi-read correction and bias correction algorithms in the Cufflinks package were enabled (option: -u -b). Expression level estimation was reported as Fragments per Kilobase of transcript sequence per Million mapped fragment (FPKM) values together with confidence intervals. Transcripts with an FPKM > 0, FPKM_conf_lo (lower bound of confidence interval) > 0, and status = OK were considered as detected with confidence.
To facilitate downstream analysis (such as fold change, enrichment analysis, and other procedures), a previously calculated threshold of FPKM = 0.1 was adopted. Any FPKM < 0.1 were set to 0.1 for fold enrichment calculations in order to avoid ratio inflation.
Genome_build: mm10
Supplementary_files_format_and_content: Excel file include FPKM values for each Sample
 
Submission date Nov 03, 2015
Last update date May 15, 2019
Contact name Xiaomin Dong
E-mail(s) xiaomin.dong@yahoo.com
Organization name UT health
Department Department of neurosurgery
Lab Dr.Jiaqian Wu
Street address 1825 Pressler Street
City houston
State/province Texas
ZIP/Postal code 77030
Country USA
 
Platform ID GPL13112
Series (2)
GSE74643 Comprehensive Identification of Long Non-coding RNAs in Purified Cell Types from the Brain Reveals Functional LncRNA in OPC Fate Determination (RNA-Seq of mouse neural stem cells)
GSE74648 Comprehensive Identification of Long Non-coding RNAs in Purified Cell Types from the Brain Reveals Functional LncRNA in OPC Fate Determination
Relations
BioSample SAMN04233137
SRA SRX1411155

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap