NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM2076241 Query DataSets for GSM2076241
Status Public on Sep 10, 2016
Title Keo_Apr
Sample type SRA
 
Source name blood
Organism Tursiops truncatus
Characteristics tissue: blood
Sex: Female
age (years): 12
sampling date: 2013-04-01
Extracted molecule total RNA
Extraction protocol Whole blood RNA was extracted using a PAXgene Blood RNA Kit (Qiagen). Total RNA samples were globin-reduced using an Rnase H assay targeting HBA, HBB, and HBM transcripts.
Libraries were constructed using a NEBNext ultra Directional RNA Library Prep kit for Illumina and indexed with the NEBNextMultiplex Oligos for Illumina
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina HiSeq 2500
 
Description total RNA globin-depleted by RNase H assay prior to library construction and sequencing
For differential expression analysis, only those genes with PPDE >= 1 - FDR are significantly differentially expressed.
Data processing Data processing steps for DQH_blood_CuffLinks_Genome_data.xls:

The Illumina BCL output files were converted to fastq-sanger file format and sequence quality triming was performed using Trimmomatic on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications. The following Trimmomatic parameters were used: ILLUMINACLIP:TruSeq3-SE.fa 2:30:10; LEADING:10; TRAILING:10; SLIDINGWINDOW:4:20; HEADCROP:6; MINLEN:36.
Reads were mapped to the Ensembl Tursiops truncatus genome, turTru1 v76.1, using Tophat2 v 2.3.13 with bowtie2 v 2.2.4 as the alignment engine on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications.
Read counts as FPKM (fragments per kilobase of trnascript per million mapped reads) were generated using Cufflinks v 2.2.0 with the genome as a reference on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications.
Genome_build: Ensembl turTru1 v76.1
Supplementary_files_format_and_content: Processed data is supplied in a single file. The first column "Gene ID" contains the gene ID from the Ensembl genome. The second column "Gene Symbol" contains the gene symbol, if assigned, from the Ensembl turTru1 v76.1 annotation. The remaining colums contain the FPKM values for each sample as generated by Cufflinks. A FPKM > 0 in at least half the samples and and average FPKM ≥ 1 across all samples was required for all further data analysis. Only genes meeting these requirements are included in the processed data table.

Data processing steps for DQH_blood_RSEM_Genome_data.xls:

The Illumina BCL output files were converted to fastq-sanger file format and sequence quality trimming was performed using Trimmomatic on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications. The following Trimmomatic parameters were used: ILLUMINACLIP:TruSeq3-SE.fa 2:30:10; LEADING:10; TRAILING:10; SLIDINGWINDOW:4:20; HEADCROP:6; MINLEN:36.
Reads were mapped to the Ensembl Tursiops truncatus genome, turTru1 v76.1, using RSEM v 1.2.18 with bowtie2 as the alignment engine and read counts were generated as FPKM (fragments per kilobase of transcript per million mapped reads) at the gene level.
Supplementary_files_format_and_content: Processed data is supplied in a single file. The first column "Gene ID" contains the gene ID from the Ensembl genome. The second column "Gene Symbol" contains the gene symbol, if assigned, from the Ensembl turTru1 v76.1 annotation. The remaining columns contain the FPKM values for each sample as generated by RSEM. A FPKM > 0 in at least half the samples and an average FPKM >= 1 across all samples was required for all further data analysis. Only genes meeting these requirements are included in the processed data table.

Data processing steps for DQH_blood_RSEM_Trinity_data.xls:

The Illumina BCL output files were converted to fastq-sanger file format and sequence quality trimming was performed using Trimmomatic on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications. The following Trimmomatic parameters were used: ILLUMINACLIP:TruSeq3-SE.fa 2:30:10; LEADING:10; TRAILING:10; SLIDINGWINDOW:4:20; HEADCROP:6; MINLEN:36.
The read files from one summer and one winter globin-depleted sample from each animal (n=8; Hua: Feb and Sept, Kainalu: Feb and Aug, Keo: Feb and Aug, Pele: Feb and Sept) were concatenated into a single fastq file for assembly using a minimum K-mer coverage of 1, a minimum overlap value of 25 and a minimum contig length of 400 nucleotides on iPlant Collaborative's Discovery Environment using the High-Performance Computing applications. Annotation of the de novo assembly was obtained by BLASTx searches of the human subset of the uniprot_swissprot database. Reads were mapped to the de novo Trinity assembly, using RSEM v 1.2.18 with bowtie2 as the alignment engine and read counts were generated as FPKM (fragments per kilobase of transcript per million mapped reads) at the gene level.
Supplementary_files_format_and_content: Processed data is supplied in a single file. The first column "Gene ID" contains the gene ID from the de novo Trinity assembly. The second column "Accession Number" contains the uniprot_swissprot accession number, if assigned, from BLASTx searches. The FASTA file with the sequences is available on the series record. The remaining columns contain the FPKM values for each sample as generated by RSEM. A FPKM > 0 in at least half the samples and an average FPKM >= 1 across all samples was required for all further data analysis. Only genes meeting these requirements are included in the processed data table.

Data processing steps for Female_v_Male_FC.xlsx:

FPKM values generated by RSEM were used to calculate fold change values, at the gene level, in EBSeq between female (n=16, all samples from Keo and Pele) and male (n=16, all samples from Hua and Kainalu) samples. The first column "Gene ID" contains the gene ID from the de novo Trinity assembly. The second column "Log2 Fold Change" contains the Log2(fold change) value calculated by EBSeq. Positive Log2(fold change) values are more highly expressed in females. The third column "PPDE" is the EBSeq calculated posterior probability that a gene is differentially expressed. With the hard-threshold method used to control FDR, only those genes with PPDE >= 1 - FDR are significantly differentially expressed.

Data processing steps for Sum_v_Win_FC.xlsx:

FPKM values generated by RSEM were used to calculate fold change values, at the gene level, in EBSeq between summer (n=8, Hua: July and August, Kainalu: July and September, Keo: August, Pele: July, August, and September) and winter (n=8, Hua: December and February, Kainalu: December and February, Keo: December and February, Pele: December and February) samples. The first column "Gene ID" contains the gene ID from the de novo Trinity assembly. The second column "Log2 Fold Change" contains the Log2(fold change) value calculated by EBSeq. Positive Log2(fold change) values are more highly expressed in warmer months. The third column "PPDE" is the EBSeq calculated posterior probability that a gene is differentially expressed. With the hard-threshold method used to control FDR, only those genes with PPDE >= 1 - FDR are significantly differentially expressed.

Data processing steps for Sum_v_Win_Male_FC.xlsx:

FPKM values generated by RSEM were used to calculate fold change values, at the gene level, in EBSeq between summer (n=4, Hua: July and August, Kainalu: July and September) and winter (n=4, Hua: December and February, Kainalu: December and February) in samples from male animals. The first column "Gene ID" contains the gene ID from the de novo Trinity assembly. The second column "Log2 Fold Change" contains the Log2(fold change) value calculated by EBSeq. Positive Log2(fold change) values are more highly expressed in warmer months. The third column "PPDE" is the EBSeq calculated posterior probability that a gene is differentially expressed. With the hard-threshold method used to control FDR, only those genes with PPDE >= 1 - FDR are significantly differentially expressed.

Data processing steps for Sum_v_Win_Female_FC.xlsx:
FPKM values generated by RSEM were used to calculate fold change values, at the gene level, in EBSeq between summer (n=4, Keo: August, Pele: July, August, and September) and winter (n=4, Keo: December and February, Pele: December and February) samples. The first column "Gene ID" contains the gene ID from the de novo Trinity assembly. The second column "Log2 Fold Change" contains the Log2(fold change) value calculated by EBSeq. Positive Log2(fold change) values are more highly expressed in warmer months. The third column "PPDE" is the EBSeq calculated posterior probability that a gene is differentially expressed.
 
Submission date Feb 29, 2016
Last update date May 15, 2019
Contact name Jeanine Morey
Organization name National Marine Mammal Foundation
Department Conservation Medicine
Street address 3419 Maybank Hwy, Ste B
City Johns Island
State/province SC
ZIP/Postal code 29455
Country USA
 
Platform ID GPL21538
Series (1)
GSE78770 RNA-seq analysis of seasonal and individual variation in blood transcriptomes of healthy captive bottlenose dolphins
Relations
BioSample SAMN04523311
SRA SRX1605660

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap