NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4771 Query DataSets for GSM4771
Status Public on May 01, 2003
Title 923_311
Sample type genomic
 
Channel 1
Source name Bacillus cereus strain ATCC 14579
Organisms Bacillus anthracis; Bacillus cereus
Extracted molecule genomic DNA
 
Channel 2
Source name Bacillus anthracis Ames
Organisms Bacillus anthracis; Bacillus cereus
Extracted molecule genomic DNA
 
 
Description Bacillus cereus strain ATCC 14579, type strain, sequenced at Integrated Genomics/INRA
Amplicons representing 79 of 217 and 41 of 122 genes from pXO1 and pXO2 respectively, and 3601 of 5753 chromosomal genes as predicted by Glimmer 30 ( and see supplemental methods) were arrayed onto glass microscope slides (Telechem, Inc). Redundant genes were generally represented once or a few times on the array.
Genomic DNA was labeled with Cy3 and Cy5 according to J. DeRisi (http://www.microarrays.org/Pdfs/GenomicDNALabel_B.pdf), except that genomic DNA was not digested or sheared before labeling.
Arrays were scanned with a GenePix 4000B scanner (Axon Inc.).
Hybridization signals were quantitated using TIGR SPOTFINDER (software available at http://www.tigr.org/softlab).
Hybridization experiments were competitive using probes derived from B. anthracis Ames (reference) and a B. cereus group (query) strain. Normalized signal intensities were used to generate relative hybridization ratios (query/reference). Data representing weak signal were removed. The ratios from a maximum of six data points (duplicate spots, hybridizations performed in triplicate) were placed in three bins (<0.1) gene is absent in query strain, (0.1-0.3) present but diverged in query strain, and (>0.3) gene is present in the query strain. A majority rule was applied to the data for binning such that more than 50% of ratios were in agreement as to assignment and that at least two data points were used (exceeded in 99% of the cases). In cases where less than two data points existed, the gene was treated as data missing.
The criteria for the numerical ranges of our bins were established in two ways. First, we determined the presence or absence of sequences homologous to 3601 B. anthracis genes in the sequence of B. cereus ATCC 14579 (Integrated Genomics, Inc; http://www.integratedgenomics.com/) using BLASTN and compared that to the assignments inferred from hybridization ratios. A threshold of 0.1 was found to be suitable for classifying a gene as absent (i.e., agreement between sequence and CGH data in 99% of the cases), while a cutoff value of 0.3 was conservative for gene presence (agreement in 92% of the cases). Second, we used a set of 65 genes conserved in 26 bacterial genomes, NCBI COG database (http://www.ncbi.nlm.nih.gov/COG/). Genes judged as present in query strains using our selected cut-offs correctly binned data in 1225 out of 1235 total calls. There was a tendency for underprediction of plasmid homologs by CGH, when compared to results from the sequence analysis. Two possible explanations for this are variability in plasmid copy number in B. cereus strains relative to B. anthracis and/or that the average divergence of plasmid genes is greater than chromosomal genes.and that at least two data points were used (exceeded in 99% of the cases)
 
Submission date Mar 13, 2003
Last update date May 27, 2005
Contact name David Rasko
E-mail(s) drasko@tigr.org
Phone 301-838-0000
Fax 301-838-0208
Organization name The Institute for Genomic Research
Department Microbial Genomics
Street address 9712 Medical Center Drive
City Rockville
State/province MD
ZIP/Postal code 20850
Country USA
 
Platform ID GPL266
Series (1)
GSE341 Bacillus_anthracis_CGH

Data table header descriptions
ID_REF
Cy3-BC Cy3 labelled Bacillus cereus strain ATCC 14579 genomic DNA
Cy5-BA Cy5 labelled Bacillus anthracis Ames genomic DNA
VALUE log ratio of PRE_VALUE
PRE_VALUE Ratio of Cy5/Cy3 label for each spot

Data table
ID_REF Cy3-BC Cy5-BA VALUE PRE_VALUE
1 0 0
2 52160 1872610 1.5551103 35.90126534
3 2004122 4392314 0.3407693 2.19164003
4 6859778 4248780 -0.2080463 0.619375729
5 962773 643733 -0.1748183 0.66862386
6 4388269 3757706 -0.0673703 0.856307122
7 96640 2935692 1.4825543 30.37760762
8 8811207 4888988 -0.2558163 0.554860191
9 4956368 2718681 -0.2608053 0.54852283
10 9098593 4382309 -0.3172713 0.481646888
11 3403351 3667609 0.0324763 1.077646414
12 0 0
13 5815802 5495449 -0.0246063 0.944916797
14 4568996 28611 -2.2032883 0.006261988
15 6079517 6973167 0.0595613 1.146993585
16 3950740 5339261 0.1308033 1.351458461
17 113270 185263 0.2136743 1.635587534
18 2192416 5613006 0.4082733 2.560192044
19 537330 524954 -0.0101203 0.976967599
20 49605 3307894 1.8240263 66.68468904

Total number of rows: 9216

Table truncated, full table size 351 Kbytes.




Supplementary data files not provided

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap