NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4256231 Query DataSets for GSM4256231
Status Public on Feb 26, 2021
Title LW_2W_2_Fat_RNA
Sample type SRA
 
Source name Fat
Organism Sus scrofa
Characteristics breed: Large White
tissue: fat
age: 2 weeks
Extracted molecule total RNA
Extraction protocol Total RNA was extracted from each tissue sample using TRIzol reagent, and RNA was further purified by removing the rRNA. These rRNA depleted RNA samples were used for the construction of sequencing libraries.
The strand-specific RNA-seq libraries using the protocol provided by Ilumina. The libraries were sequenced using the illumina Hiseq X ten PE150 platform.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model HiSeq X Ten
 
Description individual 2
Data processing The adapter was firstly removed from the raw data and the clean data was obtained after removing low quality reads.
The reads were mapped to the susScr11.1 reference genome using TopHat (v2.1.1). The transcript assembly of each sample was separately performed by using BAM file as input file for Cufflinks (v2.2.1). The resulting individual transcript assemblies were further merged to a single transcript assembly using Cuffcompare.
To identify lncRNA of the pig genome, the transcripts from the single transcript assembly were filtered as the following steps: (i) the identified transcripts which must be with class code “x”, “i” or “.” in at least two samples, and do not overlap with any known transcripts or with class code “u” ; (ii) transcripts with length ≥ 200, exon number ≥ 2; (iii) transcripts with FPKM ≥ 1 and reads ≥ 5 in at least 2 replicates; (iv) the identified transcripts without potential coding regions which were filtered using CNCI and CPC2.
For identification of other types of novel transcripts, the transcripts which were assembled in two replicates in the transcript assembly file of each sample were selected firstly. Then the filtered transcript assembly files were merged by Cuffmerge. Next, Cuffcompare was applied again to integrate the merged transcript assembly file and the lncRNA gtf file. This new gtf was used to calculate transcript expression levels (FPKM value) by RSEM (v1.3.0). The last step was to filter the transcripts to identify novel transcripts by the following criteria: (i) transcripts on the same strand of known transcripts and exon overlapped with any known transcripts were omitted; (ii) lncRNAs which were identified before were removed; (iii) transcripts with class code “x”, “i” and “u” and with FPKM ≥ 5 at least in any 2 replicates.
Finally, The gtf file of lncRNA and novel transcripts was integrated with the annotation file from UCSC for calculating gene expression level(TPM value) by RSEM.
Genome_build: susScr11.1
Supplementary_files_format_and_content: Bed file of each Sample include TPM (​Transcripts Per Million) values calculated by RSEM.
 
Submission date Jan 08, 2020
Last update date Feb 27, 2021
Contact name Shuhong Zhao
E-mail(s) shzhao@mail.hzau.edu.cn
Organization name Huazhong Agricultural University
Street address No.1,Shizishan Street, Hongshan District
City Wuhan
ZIP/Postal code 430070
Country China
 
Platform ID GPL22918
Series (1)
GSE143288 A compendium map of Cis-Regulatory Elements in the Pig Genome and epigenetic comparison with the human Genome
Relations
SRA SRX7438594
BioSample SAMN13676577

Supplementary file Size Download File type/resource
GSM4256231_LW_2W_2_Fat_RNA_TPM.bed.gz 399.2 Kb (ftp)(http) BED
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap