NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM5987868 Query DataSets for GSM5987868
Status Public on Mar 01, 2023
Title SP_MC4
Sample type SRA
 
Source name PBMC
Organism Homo sapiens
Characteristics strain: Chinese
tissue: blood
age: 20
disease state: severe atherosclerosis
Extracted molecule total RNA
Extraction protocol Sodium heparin-treated blood specimens was collected and PBMC were isolated using ficoll density gradient centrifugation. Total RNA from of PBMCs specimens were extracted by Trizol reagent.
The RNA amount and purity of each sample was quantified using NanoDrop ND-1000 (NanoDrop, Wilmington, DE, USA). The RNA integrity was assessed by Bioanalyzer 2100 (Agilent, CA, USA) with RIN number >7.0, and confirmed by electrophoresis with denaturing agarose gel. Poly (A) RNA is purified from 1μg total RNA using Dynabeads Oligo (dT)25-61005 (Thermo Fisher, CA, USA) using two rounds of purification. Then the poly(A) RNA was fragmented into small pieces using Magnesium RNA Fragmentation Module (NEB, cat.e6150, USA) under 94℃ 5-7min. Then the cleaved RNA fragments were reverse-transcribed to create the cDNA by SuperScript™ II Reverse Transcriptase (Invitrogen, cat. 1896649, USA), which were next used to synthesise U-labeled second-stranded DNAs with E. coli DNA polymerase I (NEB, cat.m0209, USA), RNase H (NEB, cat.m0297, USA) and dUTP Solution (Thermo Fisher, cat.R0133, USA). An A-base is then added to the blunt ends of each strand, preparing them for ligation to the indexed adapters. Each adapter contains a T-base overhang for ligating the adapter to the A-tailed fragmented DNA. Single- or dual-index adapters are ligated to the fragments, and size selection was performed with AMPureXP beads. After the heat-labile UDG enzyme (NEB, cat.m0280, USA) treatment of the U-labeled second-stranded DNAs, the ligated products are amplified with PCR by the following conditions: initial denaturation at 95℃ for 3 min; 8 cycles of denaturation at 98℃ for 15 sec, annealing at 60℃ for 15 sec, and extension at 72℃ for 30 sec; and then final extension at 72℃ for 5 min. The average insert size for the final cDNA library was 300±50 bp. At last, we performed the 2×150bp paired-end sequencing (PE150) on an Illumina Novaseq™ 6000 (LC-Bio Technology CO., Ltd., Hangzhou, China) following the vendor's recommended protocol.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina NovaSeq 6000
 
Data processing Cutadapt software (https://cutadapt.readthedocs.io/en/stable/,version:cutadapt-1.9) was used to remove the reads that contained adaptor contamination,(command line: ~cutadapt -a ADAPT1 -A ADAPT2 -o out1.fastq -p out2.fastq in1.fastq in2.fastq -O 5 -m 100). And After removed the low quality bases and undetermined bases ,we used HISAT2 software (https://daehwankimlab.github.io/hisat2/,version:hisat2-2.0.4) to map reads to the genome (for example:Homo sapiens Ensembl v96),(command line: ~hisat2 -1 R1.fastq.gz -2 R1.fastq.gz -S sample_mapped.sam).
Mapped reads of each sample were assembled using StringTie (http://ccb.jhu.edu/software/stringtie/,version:stringtie-1.3.4d.Linux_x86_64) with default parameters (command line: ~stringtie -p 4 -G genome.gtf -o output.gtf -l sample input.bam).
Transcriptomes from all samples were merged to reconstruct a comprehensive transcriptome using gffcompare software(http://ccb.jhu.edu/software/stringtie/gffcompare.shtml,version:gffcompare-0.9.8.Linux_x86_64).
Final transcriptome was generated, StringTie and ballgown(http://www.bioconductor.org/packages/release/bioc/html/ballgown.html) were used to estimate the expression levels of all transcripts and perform expression level for mRNAs by calculating FPKM (FPKM = [total_exon_fragments / mapped_reads(millions) × exon_length(kB)]),(command line: ~stringtie -e -B -p 4 -G merged.gtf -o samples.gtf samples.bam).
Differentially expressed mRNAs were selected with fold change > 2 or fold change < 0.5 and p value < 0.05 by R package edgeR(https://bioconductor.org/packages/release/bioc/html/edgeR.html) or DESeq2(http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html),and then analysis GO enrichment and KEGG enrichment to the differentially expressed mRNAs
Assembly: we aligned reads of all samples to the < research species > reference genome using HISAT2 (https://daehwankimlab.github.io/hisat2/,version:hisat2-2.0.4) package, which initially remove a portion of the reads based on quality information accompanying each read and then maps the reads to the reference genome. HISAT2 allows multiple alignments per read (up to 20 by default) and a maximum of two mismatch when mapping the reads to the reference. HISAT2 build a database of potential splice junctions and confirms these by comparing the previously unmapped reads against the database of putative junctions. And then by removing sequence-dependent bias and amplification noise using UMI-tools
Supplementary files format and content: The mapped reads of each sample were assembled using StringTie (http://ccb.jhu.edu/software/stringtie/,version:stringtie-1.3.4d) with default parameters. Then, all transcriptomes from all samples were merged to reconstruct a comprehensive transcriptome using gffcompare software(http://ccb.jhu.edu/software/stringtie/gffcompare.shtml,version:gffcompare-0.9.8). After the final transcriptome was generated, StringTie and ballgown (http://www.bioconductor.org/packages/release/bioc/html/ballgown.html) were used to estimate the expression levels of all transcripts and perform expression abundance for mRNAs by calculating FPKM (fragment per kilobase of transcript per million mapped reads) value. FPKM = [total_exon_fragments / mapped_reads(millions) × exon_length(kB)
 
Submission date Mar 30, 2022
Last update date Mar 01, 2023
Contact name Xiaohong Cen
E-mail(s) cherbss@sina.cn
Organization name Southern Medical University
Street address 1838, Guangzhou Avenue North
City 广州
ZIP/Postal code 510515
Country China
 
Platform ID GPL24676
Series (1)
GSE199819 seq-RNA analysis of PBMC in stroke patients with severe atherosclerosis and healthy donors
Relations
BioSample SAMN27112622
SRA SRX14677378

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap