|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on May 20, 2010 |
Title |
Breast and bladder cancer pool |
Sample type |
SRA |
|
|
Source name |
Bladder neoplasia and breast neoplasia
|
Organism |
Homo sapiens |
Characteristics |
tissue: bladder tumors, breast tumors
|
Extracted molecule |
total RNA |
Extraction protocol |
Total RNA was extracted twice from each sample of 23 human formalin-fixed paraffin-embedded (FFPE) samples derived from cancerous tissue. RNA was isolated using ten 10-µm-thick tissue sections using the miRdicatorTM extraction protocol developed at Rosetta Genomics. Briefly, the sample was incubated repeatedly in xylene at 57oC to remove excess paraffin, followed by washing in ethanol. Proteins were degraded by incubation in proteinase K solution at 45oC for a few hours. The RNA was extracted with acid phenol:chloroform followed by ethanol precipitation and DNAse digestion. Total RNA quantity and quality were checked by spectrophotometry (Nanodrop ND-1000). Pools of samples of the small RNA fraction within the total RNA were labeled and hybridized on arrays. After ensuring the presence and expression of more than 100 miRNAs per cancerous tissue pool, tissues were pooled together, resulting in a bladder+breast tumor pool and a colon+lung tumor pool. Array expression revealed the presence of 157 miRNAs from bladder cancer FFPEs, 260 miRNAs from breast cancer FFPEs, 135 miRNAs from lung cancer FFPEs, and 239 miRNAs from colon cancer FFPEs. Total RNA (75 µg) of seven different colon cancer FFPEs were pooled together with 75 µg of six different lung cancer FFPEs, while 75 µg total RNA of five different bladder cancer FFPEs were pooled together with 75 µg of five different breast cancer FFPEs.
|
|
|
Library strategy |
RNA-Seq |
Library source |
transcriptomic |
Library selection |
size fractionation |
Instrument model |
454 GS FLX |
|
|
Description |
Pool1 consists of samples: 4117-Bladder 4141-Bladder 4142-Bladder (transitional cell carcinoma) 4118-Bladder (transitional cell carcinoma) 4138-Bladder 4162-Bladder 4163-Bladder 4139-Bladder 4140-Bladder 4164-Bladder 4126-Breast 4150-Breast 4151-Breast 4127-Breast 4128-Breast 4152-Breast 4135-Breast 4159-Breast 4137-Breast 4161-Breast
|
Data processing |
Adaptors were removed using a Perl script allowing internal polyN sequences within the adaptors and 1 mismatch. About 1000 sequences were removed since they were too short after adaptor removal (<10 bp). The sequences were mapped to the human genome (UCSC hg18 build) using BLAST, allowing maximum three bps mismatched to the genome and maximum insertion/deletion (indels) of three bps. For each aligned sequence the highest scoring hit was retrieved. All sequences with position overlap were clustered together using a Perl script. We assigned each genomic cluster of sequences the most abundant sequence in this cluster and demanded that for candidate miRNAs, the most abundant sequence will be mapped precisely to the genome (not allowing any mismatches/indels). The next step was to annotate known sequences. The following datasets were used for this task: RNA genes, sno/miRNA, RefSeq genes, and RepeatMasker tables were downloaded from the UCSC table browser {Karolchik, 2004 }, and known miRNA precursors were downloaded from miRBase in order to mark whether the sequence is part of a noncoding gene, a snoRNA, a protein-coding gene exon, a genomic repeat, or a known miRNA precursor, respectively. The sequences of the novel miRNA candidates were extended by several hundred bp within their chromosomes in order to predict possible miRNA precursors. An extended sequence was intended to predict the folding of a pri-miRNA that contains a hairpin-folded pre-miRNA. The candidate pri-miRNAs were folded using the Vienna package or mfold programs. All hairpin structures that had at least six base pairs, were at least 55 nucleotides long and had a loop not longer than 20 nucleotides were extracted from the minimum free energy fold of the predicted pri-miRNA (excluding overlapping hairpins). Each hairpin was assigned a Palgrade and conservation score as described before {Bentwich, 2005}. Predicted miRNA precursors have either Palgrade>0 meaning it has structural characteristics of known miRNA) or have absolute value of conservation score>0.9 (conserved in mammals). In addition, only sequences with ten or less genomic copies, with a length of 17-25 bp and a GC content in known miRNA range (15-90%) were chosen as miRNA candidates.
|
|
|
Submission date |
Feb 23, 2010 |
Last update date |
Jun 11, 2013 |
Contact name |
Einat Sitbon |
Organization name |
Rosetta Genomics
|
Street address |
10 Plaut St
|
City |
Rehovot |
ZIP/Postal code |
76706 |
Country |
Israel |
|
|
Platform ID |
GPL9186 |
Series (2) |
GSE20417 |
Discovery of microRNAs and other small RNAs in solid human tumors: sequencing |
GSE20418 |
Discovery of microRNAs and other small RNAs in solid human tumors |
|
Relations |
BioSample |
SAMN02196190 |
Supplementary data files not provided |
Processed data are available on Series record |
|
|
|
|
|