NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4766 Query DataSets for GSM4766
Status Public on May 01, 2003
Title 874_364
Sample type genomic
 
Channel 1
Source name Bacillus cereus strain 874
Organisms Bacillus anthracis; Bacillus cereus
Extracted molecule genomic DNA
 
Channel 2
Source name Bacillus anthracis Ames
Organisms Bacillus anthracis; Bacillus cereus
Extracted molecule genomic DNA
 
 
Description Bacillus cereus strain 874
Amplicons representing 79 of 217 and 41 of 122 genes from pXO1 and pXO2 respectively, and 3601 of 5753 chromosomal genes as predicted by Glimmer 30 ( and see supplemental methods) were arrayed onto glass microscope slides (Telechem, Inc). Redundant genes were generally represented once or a few times on the array.
Genomic DNA was labeled with Cy3 and Cy5 according to J. DeRisi (http://www.microarrays.org/Pdfs/GenomicDNALabel_B.pdf), except that genomic DNA was not digested or sheared before labeling.
Arrays were scanned with a GenePix 4000B scanner (Axon Inc.).
Hybridization signals were quantitated using TIGR SPOTFINDER (software available at http://www.tigr.org/softlab).
Hybridization experiments were competitive using probes derived from B. anthracis Ames (reference) and a B. cereus group (query) strain. Normalized signal intensities were used to generate relative hybridization ratios (query/reference). Data representing weak signal were removed. The ratios from a maximum of six data points (duplicate spots, hybridizations performed in triplicate) were placed in three bins (<0.1) gene is absent in query strain, (0.1-0.3) present but diverged in query strain, and (>0.3) gene is present in the query strain. A majority rule was applied to the data for binning such that more than 50% of ratios were in agreement as to assignment and that at least two data points were used (exceeded in 99% of the cases). In cases where less than two data points existed, the gene was treated as data missing.
The criteria for the numerical ranges of our bins were established in two ways. First, we determined the presence or absence of sequences homologous to 3601 B. anthracis genes in the sequence of B. cereus ATCC 14579 (Integrated Genomics, Inc; http://www.integratedgenomics.com/) using BLASTN and compared that to the assignments inferred from hybridization ratios. A threshold of 0.1 was found to be suitable for classifying a gene as absent (i.e., agreement between sequence and CGH data in 99% of the cases), while a cutoff value of 0.3 was conservative for gene presence (agreement in 92% of the cases). Second, we used a set of 65 genes conserved in 26 bacterial genomes, NCBI COG database (http://www.ncbi.nlm.nih.gov/COG/). Genes judged as present in query strains using our selected cut-offs correctly binned data in 1225 out of 1235 total calls. There was a tendency for underprediction of plasmid homologs by CGH, when compared to results from the sequence analysis. Two possible explanations for this are variability in plasmid copy number in B. cereus strains relative to B. anthracis and/or that the average divergence of plasmid genes is greater than chromosomal genes.and that at least two data points were used (exceeded in 99% of the cases)
 
Submission date Mar 13, 2003
Last update date May 27, 2005
Contact name David Rasko
E-mail(s) drasko@tigr.org
Phone 301-838-0000
Fax 301-838-0208
Organization name The Institute for Genomic Research
Department Microbial Genomics
Street address 9712 Medical Center Drive
City Rockville
State/province MD
ZIP/Postal code 20850
Country USA
 
Platform ID GPL266
Series (1)
GSE341 Bacillus_anthracis_CGH

Data table header descriptions
ID_REF
Cy3-BC Cy3 labelled Bacillus cereus strain 874 genomic DNA
Cy5-BA Cy5 labelled Bacillus anthracis Ames genomic DNA
VALUE log ratio of PRE_VALUE
PRE_VALUE Ratio of Cy5/Cy3 label for each spot

Data table
ID_REF Cy3-BC Cy5-BA VALUE PRE_VALUE
1 927572 209530 -0.6461013 0.22589082
2 0 0
3 1176143 1634247 0.1428583 1.389496855
4 1488016 819147 -0.2592463 0.550496097
5 1001385 272345 -0.5654823 0.271968324
6 813120 712202 -0.0575513 0.875887938
7 4147 373354 1.9543873 90.02990113
8 34019 612981 1.2557253 18.01878362
9 1553618 453840 -0.5344413 0.29211814
10 2872504 596606 -0.6825733 0.207695446
11 607962 803146 0.1209183 1.321046381
12 0 0
13 999550 605313 -0.2178253 0.605585513
14 614371 64605 -0.9781653 0.105156331
15 964530 1070996 0.0454723 1.110381222
16 803520 879237 0.0391093 1.094231631
17 555954 442602 -0.0990253 0.796112628
18 107962 875809 0.9091383 8.112196884
19 439434 418309 -0.0213963 0.951926797
20 28187 497802 1.2470083 17.66069465

Total number of rows: 9216

Table truncated, full table size 340 Kbytes.




Supplementary data files not provided

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap