<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-215</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>The statistics of identifying differentially expressed genes in Expresso and TM4: a comparison</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Sioson</snm>
               <mi>A</mi>
               <fnm>Allan</fnm>
               <insr iid="I1"/>
               <email>asioson@vt.edu</email>
            </au>
            <au id="A2">
               <snm>Mane</snm>
               <mi>P</mi>
               <fnm>Shrinivasrao</fnm>
               <insr iid="I2"/>
               <email>smane@vt.edu</email>
            </au>
            <au id="A3">
               <snm>Li</snm>
               <fnm>Pinghua</fnm>
               <insr iid="I3"/>
               <email>pinghli@life.uiuc.edu</email>
            </au>
            <au id="A4">
               <snm>Sha</snm>
               <fnm>Wei</fnm>
               <insr iid="I4"/>
               <email>wsha@vt.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Heath</snm>
               <mi>S</mi>
               <fnm>Lenwood</fnm>
               <insr iid="I1"/>
               <email>heath@vt.edu</email>
            </au>
            <au id="A6">
               <snm>Bohnert</snm>
               <mi>J</mi>
               <fnm>Hans</fnm>
               <insr iid="I3"/>
               <email>bohnerth@life.uiuc.edu</email>
            </au>
            <au id="A7">
               <snm>Grene</snm>
               <fnm>Ruth</fnm>
               <insr iid="I2"/>
               <email>grene@vt.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Computer Science, Virginia Tech, Blacksburg, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Plant Pathology, Physiology and Weed Science, Virginia Tech, Blacksburg, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Plant Biology and Department of Crop Sciences, University of Illinois, Urbana, USA</p>
            </ins>
            <ins id="I4">
               <p>Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, USA</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>215</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/215</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16626497</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-215</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>16</day>
               <month>8</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>4</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Sioson et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Analysis of DNA microarray data takes as input spot intensity measurements from scanner software and returns differential expression of genes between two conditions, together with a statistical significance assessment. This process typically consists of two steps: data normalization and identification of differentially expressed genes through statistical analysis. The Expresso microarray experiment management system implements these steps with a two-stage, log-linear ANOVA mixed model technique, tailored to individual experimental designs. The complement of tools in TM4, on the other hand, is based on a number of preset design choices that limit its flexibility. In the TM4 microarray analysis suite, normalization, filter, and analysis methods form an analysis pipeline. TM4 computes integrated intensity values (IIV) from the average intensities and spot pixel counts returned by the scanner software as input to its normalization steps. By contrast, Expresso can use either IIV data or median intensity values (MIV). Here, we compare Expresso and TM4 analysis of two experiments and assess the results against qRT-PCR data.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The Expresso analysis using MIV data consistently identifies more genes as differentially expressed, when compared to Expresso analysis with IIV data. The typical TM4 normalization and filtering pipeline corrects systematic intensity-specific bias on a per microarray basis. Subsequent statistical analysis with Expresso or a TM4 <it>t</it>-test can effectively identify differentially expressed genes. The best agreement with qRT-PCR data is obtained through the use of Expresso analysis and MIV data.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The results of this research are of practical value to biologists who analyze microarray data sets. The TM4 normalization and filtering pipeline corrects microarray-specific systematic bias and complements the normalization stage in Expresso analysis. The results of Expresso using MIV data have the best agreement with qRT-PCR results. In one experiment, MIV is a better choice than IIV as input to data normalization and statistical analysis methods, as it yields a greater number of statistically significant differentially expressed genes; TM4 does not support the choice of MIV input data. Overall, the more flexible and extensive statistical models of Expresso achieve more accurate analytical results, when judged by the yardstick of qRT-PCR data, in the context of an experimental design of modest complexity.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>DNA microarrays are a powerful means of monitoring the expression of thousands of genes simultaneously. A variety of computational and statistical methods have been proposed to extract information from the large quantity of data generated from microarray experiments. Many methods assume, as we do here, the use of cDNA labeled with one of two fluorescent dyes to differentiate two treatments on a single microarray, implying data from two images to be analyzed. These methods include a number of data normalization techniques to reduce the effects of systematic errors and various kinds of statistical tests to identify differentially expressed genes in comparisons among different experimental conditions. There is as yet no single method that can be recommended under all circumstances for either normalization or identification of differential gene expression.</p>
         <p>In recent years, ANOVA methods have gained popularity for identification of differential gene expression. The power of ANOVA methods derives from their flexibility in fitting and comparing different models to a given set of data <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. One such method is the two-stage, log-linear ANOVA mixed models technique of Wolfinger, <it>et al</it>., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Its first stage uses a normalization model designed to remove global effects across all microarrays. Its second stage uses a gene-specific model to estimate gene-treatment interactions as ratios of gene expression under control and treated conditions, along with their statistical significance. Kerr <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> notes that the global normalization model employed in this technique is conducive to combining data across genes for realistic and robust models of error, especially when random effects are included. Pan <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> compares different microarray statistical analysis methods and demonstrates that the log-linear ANOVA mixed model approach performs better than the <it>t</it>-test and regression approaches. The regression approach, although flexible and robust, assumes that the data are drawn from a normal distribution, while the <it>t</it>-test is limited by its very few degrees of freedom. Chu, <it>et al</it>., <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> compare two log-linear ANOVA mixed models for probe-level, oligonucleotide array data and find that both types of models capture key measurable sources of variability of oligonucleotide arrays for real and simulated data. Cui and Churchill <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> review the use of a mixed ANOVA model for analyzing a cDNA microarray experiment and conclude that such models provide a powerful way to obtain information from experiments with multiple factors or sources of variation. Rosa, <it>et al</it>., <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> review issues of analyzing cDNA microarrays with mixed linear models and put such analysis in the larger context of Bayesian analysis procedures and adjustments for multiple testing.</p>
         <p>Data normalization is the first step in analyzing microarray data; numerous data normalization methods have been proposed and investigated. While refinements of existing methods continue to appear (e.g., Futschik and Crompton <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>), naive methods, such as total intensity normalization, are still in use (e.g., Held, <it>et al</it>., <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>). Xie, <it>et al</it>., <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> conducted a comparative study of normalization methods and test statistics for analyzing the results of a DNA-protein binding microarray experiment. Using performance and bias correction criteria, Bolstad, <it>et al</it>., <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> evaluate the cyclic lowess method, the contrast method, the quantile method, and baseline array scaling methods, both linear and non-linear; they demonstrate that normalization methods incorporating data from all microarrays perform better than methods employing a baseline array.</p>
         <p>Several software tools that combine data normalization and statistical analysis are currently available. Dudoit, <it>et al</it>., <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> review these software tools with an emphasis on the TM4 microarray software suite, Bioconductor in R, and the BioArray Software Environment (BASE) system. Saeed, <it>et al</it>., <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> describe the features and capabilities of TM4, while Quackenbush <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> describes the normalization and transformation methods implemented in it. Williams, <it>et al</it>., <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, Zhu, <it>et al</it>., <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, and Khaitovich, <it>et al</it>., <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> have used TM4 in microarray data analysis. Another system is Expresso, an experiment management system that serves as a unifying framework to study data-driven applications such as microarray experiments <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Expresso has adapted the two-stage ANOVA mixed models technique of Wolfinger, <it>et al</it>., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> to the particular needs of individual microarray data sets. Our experience with numerous such data sets has demonstrated that modeling the underlying experiment carefully and completely is essential to obtaining meaningful and defensible results. The use of tools that require experiments to conform to their analysis methods is less than satisfactory.</p>
         <p>In this paper, we compare the Expresso analysis methodology to the approach provided in the TM4 microarray analysis software suite <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Each is invoked to identify differentially expressed genes in two experimental data sets, each of which uses an <it>Arabidopsis thaliana </it>oligonucleotide array. Along the way, we demonstrate differences between the use of integrated intensity values (IIV) and median intensity values (MIV) as inputs. We report interactions between normalization and gene identification methods. We use quantitative reverse-transcriptase PCR (qRT-PCR) results to assess the consistency of genes reported by TM4 and Expresso as having significant differential expression.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>Here, we report a portion of the results obtained in our comparison of Expresso analysis and the TM4 pipeline (see Materials and Methods). Figure <figr fid="F1">1</figr> illustrates the overall flow of the statistical analyses of microarray data that were done in this study. We began with microarray data in GPR format from Experiment 1 and Experiment 2. Median intensity values (MIV) from the GPR files can be analyzed by the Expresso GP and GOT models directly. ExpressConverter provides integrated intensity values (IIV) for further Expresso and TM4 analysis. The MIDAS normalization and filtering pipeline executes these steps in order: total intensity normalization (subscript T), lowess normalization (subscript L), standard deviation regularization (subscript S), and low intensity filter (subscript F). MIDAS allows tapping the output of any step in the pipeline; for example, IIV<sub>TL </sub>signifies an MEV file after total intensity normalization followed by lowess normalization. The identification of genes with significant differential expression was performed on all GPR and MEV files, using the Expresso GP and GOT models and the <it>t</it>-test in MEV.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Overall flow of the statistical analyses of microarray data</p>
            </caption>
            <text>
               <p><b>Overall flow of the statistical analyses of microarray data</b>. Data input is in GPR format and provides the MIV for each spot. TM4 analysis requires ExpressConverter to generate MEV format containing the IIV for each spot. The normalization steps performed by MIDAS are T (total intensity normalization), L (lowess normalization), S (standard deviation regularization), and F (low intensity filter). Differential gene expression is obtained from the Expresso GP model, the Expresso GOT model (Experiment 2 only), and the <it>t</it>-test in MEV.</p>
            </text>
            <graphic file="1471-2105-7-215-1"/>
         </fig>
         <sec>
            <st>
               <p>Normalization and low intensity filtering in TM4</p>
            </st>
            <p>Quackenbush <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> describes the use of ratio-intensity plots (RI-plots) to detect any systematic intensity-dependent dye bias, which can then be corrected using lowess normalization (see Materials and Methods). We evaluated the effect of lowess normalization within the context of the flow in Figure <figr fid="F1">1</figr> by creating RI-plots after each step for the second replicate microarray in Experiment 1, WT plant. Supplementary Figure 1 (see <supplr sid="S1">Additional file: 1</supplr>) contains these RI-plots. For this data set, the IIV<sub>TL </sub>processing is indeed effective in correcting systematic dye bias, suggesting that preprocessing by these two normalization steps in MIDAS may be a good practice in many situations.</p>
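            <p>As a minimal sketch of the quantities plotted in an RI-plot, the following Python fragment assumes the common convention of plotting log2(R/G) against log10(R*G) for each spot, and assumes that total intensity normalization rescales one channel so that the two channel totals agree. Both conventions, the function names, and the intensity values are assumptions made for illustration only; this is not the MIDAS implementation, and the lowess fit itself is not shown.</p>
            <p><![CDATA[
```python
import math


def total_intensity_normalize(red, green):
    """Rescale the green channel so that both channels have the same total."""
    factor = sum(red) / sum(green)
    return list(red), [g * factor for g in green]


def ri_coordinates(red, green):
    """Return one (intensity, ratio) pair per spot: log10(R*G) and log2(R/G)."""
    return [(math.log10(r * g), math.log2(r / g)) for r, g in zip(red, green)]


# Hypothetical spot intensities, for illustration only.
red = [1200.0, 800.0, 450.0]
green = [600.0, 400.0, 225.0]

red_n, green_n = total_intensity_normalize(red, green)
for intensity, ratio in ri_coordinates(red_n, green_n):
    print(round(intensity, 3), round(ratio, 3))
```
]]></p>
            <p>In this toy example the green channel is uniformly half as bright as the red channel, so a single scaling factor removes the bias and every log ratio becomes zero. An intensity-dependent bias would instead appear as a curved trend in the RI-plot, which is the situation lowess normalization is meant to address.</p>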
            <p>The normalization and filtering pipeline affects the number of genes identified as differentially expressed in both the GP and GOT models (see Table <tblr tid="T1">1</tblr>). For example, in Experiment 1, the GP model using IIV input data identifies 567 up-expressed genes in the WT microarrays, while it identifies only 460 WT genes as up-expressed if IIV<sub>TLSF </sub>input data (processed by the complete MIDAS pipeline) is used.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                     <p>Number of differentially expressed genes. Numbers of differentially expressed genes identified by the GP and GOT models after preprocessing by zero or more MIDAS pipeline steps: IIV, IIV<sub>T</sub>, IIV<sub>TL</sub>, IIV<sub>TLS</sub>, or IIV<sub>TLSF</sub>. For Experiment 1, up-expression (+) and down-expression (-) numbers are given for both WT and antiPLD. For Experiment 2, + and - numbers are given for all 4 genotypes separately. The ALL entries correspond to the H1 hypotheses of the GOT model (see Materials and Methods).</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Genotype</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>IIV</p>
                     </c>
                     <c ca="center">
                        <p>IIV<sub>T</sub></p>
                     </c>
                     <c ca="center">
                        <p>IIV<sub>TL</sub></p>
                     </c>
                     <c ca="center">
                        <p>IIV<sub>TLS</sub></p>
                     </c>
                     <c ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5" ca="center">
                        <p>GP Model &#8212; Experiment 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>567</p>
                     </c>
                     <c ca="center">
                        <p>571</p>
                     </c>
                     <c ca="center">
                        <p>487</p>
                     </c>
                     <c ca="center">
                        <p>495</p>
                     </c>
                     <c ca="center">
                        <p>460</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>552</p>
                     </c>
                     <c ca="center">
                        <p>553</p>
                     </c>
                     <c ca="center">
                        <p>499</p>
                     </c>
                     <c ca="center">
                        <p>499</p>
                     </c>
                     <c ca="center">
                        <p>460</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>442</p>
                     </c>
                     <c ca="center">
                        <p>421</p>
                     </c>
                     <c ca="center">
                        <p>440</p>
                     </c>
                     <c ca="center">
                        <p>471</p>
                     </c>
                     <c ca="center">
                        <p>431</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>336</p>
                     </c>
                     <c ca="center">
                        <p>346</p>
                     </c>
                     <c ca="center">
                        <p>381</p>
                     </c>
                     <c ca="center">
                        <p>363</p>
                     </c>
                     <c ca="center">
                        <p>334</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5" ca="center">
                        <p>GOT Model &#8212; Experiment 2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>227</p>
                     </c>
                     <c ca="center">
                        <p>309</p>
                     </c>
                     <c ca="center">
                        <p>242</p>
                     </c>
                     <c ca="center">
                        <p>240</p>
                     </c>
                     <c ca="center">
                        <p>270</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>122</p>
                     </c>
                     <c ca="center">
                        <p>171</p>
                     </c>
                     <c ca="center">
                        <p>178</p>
                     </c>
                     <c ca="center">
                        <p>180</p>
                     </c>
                     <c ca="center">
                        <p>183</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                     <c ca="center">
                        <p>108</p>
                     </c>
                     <c ca="center">
                        <p>82</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="center">
                        <p>93</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                     <c ca="center">
                        <p>67</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>234</p>
                     </c>
                     <c ca="center">
                        <p>346</p>
                     </c>
                     <c ca="center">
                        <p>279</p>
                     </c>
                     <c ca="center">
                        <p>269</p>
                     </c>
                     <c ca="center">
                        <p>150</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>331</p>
                     </c>
                     <c ca="center">
                        <p>347</p>
                     </c>
                     <c ca="center">
                        <p>340</p>
                     </c>
                     <c ca="center">
                        <p>340</p>
                     </c>
                     <c ca="center">
                        <p>224</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>238</p>
                     </c>
                     <c ca="center">
                        <p>398</p>
                     </c>
                     <c ca="center">
                        <p>350</p>
                     </c>
                     <c ca="center">
                        <p>363</p>
                     </c>
                     <c ca="center">
                        <p>361</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>196</p>
                     </c>
                     <c ca="center">
                        <p>245</p>
                     </c>
                     <c ca="center">
                        <p>241</p>
                     </c>
                     <c ca="center">
                        <p>246</p>
                     </c>
                     <c ca="center">
                        <p>240</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>298</p>
                     </c>
                     <c ca="center">
                        <p>428</p>
                     </c>
                     <c ca="center">
                        <p>357</p>
                     </c>
                     <c ca="center">
                        <p>354</p>
                     </c>
                     <c ca="center">
                        <p>290</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>202</p>
                     </c>
                     <c ca="center">
                        <p>261</p>
                     </c>
                     <c ca="center">
                        <p>253</p>
                     </c>
                     <c ca="center">
                        <p>248</p>
                     </c>
                     <c ca="center">
                        <p>195</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Small changes in the number of genes identified as up- or down-expressed after successive MIDAS steps may mask larger changes in the composition of the sets of up- and down-expressed genes. To obtain a more precise view of the effects of the MIDAS steps, we computed retention counts (RC) and retention percentages (RP) between the successive gene sets whose sizes are given in Table <tblr tid="T1">1</tblr>. RC is the number of genes in the set before the MIDAS step that remain in the set after the step. RP is the percentage of remaining genes with respect to the number of genes in the set after the MIDAS step. Table <tblr tid="T2">2</tblr> contains the RC and RP values corresponding to the counts in Table <tblr tid="T1">1</tblr>. For Experiment 1, there is a sharp drop in retention during the lowess normalization that follows the total intensity normalization. There is no drop of corresponding magnitude for Experiment 2. For both experiments, normalization has a marked effect on the sets of genes identified as differentially expressed.</p>
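            <p>The retention measures defined above can be sketched in a few lines of Python. This is an illustrative sketch only, not the code used in this study: the function name and the gene identifiers are hypothetical, and RP follows the definition given here, with the size of the set after the MIDAS step as the denominator.</p>
            <p><![CDATA[
```python
def retention(before, after):
    """Return (RC, RP) for the gene sets before and after one MIDAS step.

    RC: number of genes in `before` that are also in `after`.
    RP: RC as a percentage of the number of genes in `after`.
    """
    before, after = set(before), set(after)
    rc = len(before & after)
    # RP is undefined when the set after the step is empty; report NaN.
    rp = 100.0 * rc / len(after) if after else float("nan")
    return rc, rp


# Hypothetical gene identifiers, for illustration only.
genes_iiv_t = {"geneA", "geneB", "geneC", "geneD"}
genes_iiv_tl = {"geneB", "geneC", "geneE"}

rc, rp = retention(genes_iiv_t, genes_iiv_tl)
print(rc, round(rp, 1))
```
]]></p>
            <p>In this toy example, two of the four genes found before the step are still present afterwards, so RC is 2; because the set after the step has three members, RP is about 66.7 percent. The example also shows how a nearly unchanged set size (four genes versus three) can hide a change in composition.</p>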
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                     <p>Retention counts and percentages. Retention counts (RC) and retention percentages (RP) for the differentially expressed gene sets of Table 1. RC is the number of genes in the set before the MIDAS step that remain in the set after that step. RP is the percentage of remaining genes with respect to the number of genes in the set after the MIDAS step. RC and RP are reported for the intersections IIV &#8745; IIV<sub>T</sub>, IIV<sub>T </sub>&#8745; IIV<sub>TL</sub>, IIV<sub>TL </sub>&#8745; IIV<sub>TLS</sub>, and IIV<sub>TLS </sub>&#8745; IIV<sub>TLSF</sub>, as well as the intersection IIV &#8745; IIV<sub>TLSF</sub>, which corresponds to the effect of MIDAS steps from the start of the pipeline to the end. The ALL entries correspond to the H1 hypotheses of the GOT model (see Materials and Methods).</p>
               </caption>
               <tblbdy cols="12">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV &#8745;</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>T</sub>&#8745;</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TL</sub>&#8745;</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLS</sub>&#8745;</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV &#8745;</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>T</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TL</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLS</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genotype</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>RC</p>
                     </c>
                     <c ca="center">
                        <p>RP</p>
                     </c>
                     <c ca="center">
                        <p>RC</p>
                     </c>
                     <c ca="center">
                        <p>RP</p>
                     </c>
                     <c ca="center">
                        <p>RC</p>
                     </c>
                     <c ca="center">
                        <p>RP</p>
                     </c>
                     <c ca="center">
                        <p>RC</p>
                     </c>
                     <c ca="center">
                        <p>RP</p>
                     </c>
                     <c ca="center">
                        <p>RC</p>
                     </c>
                     <c ca="center">
                        <p>RP</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="12">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="10" ca="center">
                        <p>GP Model &#8212; Experiment 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>545</p>
                     </c>
                     <c ca="center">
                        <p>95.45</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c ca="center">
                        <p>15.20</p>
                     </c>
                     <c ca="center">
                        <p>285</p>
                     </c>
                     <c ca="center">
                        <p>57.58</p>
                     </c>
                     <c ca="center">
                        <p>456</p>
                     </c>
                     <c ca="center">
                        <p>99.13</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                     <c ca="center">
                        <p>12.83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>532</p>
                     </c>
                     <c ca="center">
                        <p>96.20</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                     <c ca="center">
                        <p>11.02</p>
                     </c>
                     <c ca="center">
                        <p>306</p>
                     </c>
                     <c ca="center">
                        <p>61.32</p>
                     </c>
                     <c ca="center">
                        <p>459</p>
                     </c>
                     <c ca="center">
                        <p>99.78</p>
                     </c>
                     <c ca="center">
                        <p>53</p>
                     </c>
                     <c ca="center">
                        <p>11.52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>403</p>
                     </c>
                     <c ca="center">
                        <p>95.72</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>13.18</p>
                     </c>
                     <c ca="center">
                        <p>278</p>
                     </c>
                     <c ca="center">
                        <p>59.02</p>
                     </c>
                     <c ca="center">
                        <p>430</p>
                     </c>
                     <c ca="center">
                        <p>99.77</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>10.67</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>326</p>
                     </c>
                     <c ca="center">
                        <p>94.22</p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="center">
                        <p>11.81</p>
                     </c>
                     <c ca="center">
                        <p>203</p>
                     </c>
                     <c ca="center">
                        <p>55.92</p>
                     </c>
                     <c ca="center">
                        <p>330</p>
                     </c>
                     <c ca="center">
                        <p>98.80</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>9.88</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="12">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="10" ca="center">
                        <p>GOT Model &#8212; Experiment 2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>224</p>
                     </c>
                     <c ca="center">
                        <p>72.49</p>
                     </c>
                     <c ca="center">
                        <p>221</p>
                     </c>
                     <c ca="center">
                        <p>91.32</p>
                     </c>
                     <c ca="center">
                        <p>219</p>
                     </c>
                     <c ca="center">
                        <p>91.25</p>
                     </c>
                     <c ca="center">
                        <p>172</p>
                     </c>
                     <c ca="center">
                        <p>63.70</p>
                     </c>
                     <c ca="center">
                        <p>159</p>
                     </c>
                     <c ca="center">
                        <p>58.89</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>121</p>
                     </c>
                     <c ca="center">
                        <p>70.76</p>
                     </c>
                     <c ca="center">
                        <p>150</p>
                     </c>
                     <c ca="center">
                        <p>84.27</p>
                     </c>
                     <c ca="center">
                        <p>165</p>
                     </c>
                     <c ca="center">
                        <p>91.67</p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="center">
                        <p>65.57</p>
                     </c>
                     <c ca="center">
                        <p>92</p>
                     </c>
                     <c ca="center">
                        <p>50.27</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>81</p>
                     </c>
                     <c ca="center">
                        <p>75.00</p>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c ca="center">
                        <p>79.27</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                     <c ca="center">
                        <p>91.25</p>
                     </c>
                     <c ca="center">
                        <p>49</p>
                     </c>
                     <c ca="center">
                        <p>69.01</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>50.70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                     <c ca="center">
                        <p>64.52</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                     <c ca="center">
                        <p>81.08</p>
                     </c>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="center">
                        <p>88.73</p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                     <c ca="center">
                        <p>61.19</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                     <c ca="center">
                        <p>56.72</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ws</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>223</p>
                     </c>
                     <c ca="center">
                        <p>64.45</p>
                     </c>
                     <c ca="center">
                        <p>230</p>
                     </c>
                     <c ca="center">
                        <p>82.44</p>
                     </c>
                     <c ca="center">
                        <p>248</p>
                     </c>
                     <c ca="center">
                        <p>92.19</p>
                     </c>
                     <c ca="center">
                        <p>108</p>
                     </c>
                     <c ca="center">
                        <p>72.00</p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                     <c ca="center">
                        <p>67.33</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ws</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>293</p>
                     </c>
                     <c ca="center">
                        <p>84.44</p>
                     </c>
                     <c ca="center">
                        <p>267</p>
                     </c>
                     <c ca="center">
                        <p>78.53</p>
                     </c>
                     <c ca="center">
                        <p>314</p>
                     </c>
                     <c ca="center">
                        <p>92.35</p>
                     </c>
                     <c ca="center">
                        <p>180</p>
                     </c>
                     <c ca="center">
                        <p>80.36</p>
                     </c>
                     <c ca="center">
                        <p>149</p>
                     </c>
                     <c ca="center">
                        <p>66.52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>236</p>
                     </c>
                     <c ca="center">
                        <p>59.30</p>
                     </c>
                     <c ca="center">
                        <p>308</p>
                     </c>
                     <c ca="center">
                        <p>88.00</p>
                     </c>
                     <c ca="center">
                        <p>330</p>
                     </c>
                     <c ca="center">
                        <p>90.91</p>
                     </c>
                     <c ca="center">
                        <p>256</p>
                     </c>
                     <c ca="center">
                        <p>70.91</p>
                     </c>
                     <c ca="center">
                        <p>201</p>
                     </c>
                     <c ca="center">
                        <p>55.68</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>183</p>
                     </c>
                     <c ca="center">
                        <p>74.69</p>
                     </c>
                     <c ca="center">
                        <p>210</p>
                     </c>
                     <c ca="center">
                        <p>87.14</p>
                     </c>
                     <c ca="center">
                        <p>225</p>
                     </c>
                     <c ca="center">
                        <p>91.46</p>
                     </c>
                     <c ca="center">
                        <p>172</p>
                     </c>
                     <c ca="center">
                        <p>71.67</p>
                     </c>
                     <c ca="center">
                        <p>144</p>
                     </c>
                     <c ca="center">
                        <p>60.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>282</p>
                     </c>
                     <c ca="center">
                        <p>65.89</p>
                     </c>
                     <c ca="center">
                        <p>307</p>
                     </c>
                     <c ca="center">
                        <p>85.99</p>
                     </c>
                     <c ca="center">
                        <p>333</p>
                     </c>
                     <c ca="center">
                        <p>94.07</p>
                     </c>
                     <c ca="center">
                        <p>200</p>
                     </c>
                     <c ca="center">
                        <p>68.97</p>
                     </c>
                     <c ca="center">
                        <p>177</p>
                     </c>
                     <c ca="center">
                        <p>61.03</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>196</p>
                     </c>
                     <c ca="center">
                        <p>75.10</p>
                     </c>
                     <c ca="center">
                        <p>196</p>
                     </c>
                     <c ca="center">
                        <p>77.47</p>
                     </c>
                     <c ca="center">
                        <p>230</p>
                     </c>
                     <c ca="center">
                        <p>92.74</p>
                     </c>
                     <c ca="center">
                        <p>133</p>
                     </c>
                     <c ca="center">
                        <p>68.21</p>
                     </c>
                     <c ca="center">
                        <p>103</p>
                     </c>
                     <c ca="center">
                        <p>52.82</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>In the Experiment 1 results, the gene sets that Expresso identifies as differentially expressed from IIV and IIV<sub>T </sub>data overlap heavily. For example, 545 up-expressed WT genes are retained when IIV<sub>T </sub>data are used in Expresso instead of IIV data, a retention of 95.45%. Retention falls sharply, however, after lowess normalization: only 74 up-expressed WT genes (15.20%) are retained when IIV<sub>TL </sub>data are used instead of IIV<sub>T </sub>data. Although the retention percentages rise again for IIV<sub>TLS </sub>(from IIV<sub>TL</sub>) and IIV<sub>TLSF </sub>(from IIV<sub>TLS</sub>), the retention percentage from IIV to IIV<sub>TLSF </sub>is low. This can be traced to the low retention percentage from IIV<sub>T </sub>to IIV<sub>TL</sub>. Hence, the normalization method that affects the results in Experiment 1 the most is lowess normalization.</p>
            <p>The results of Expresso on Experiment 2 show that the lowest retention percentages occur after application of total intensity normalization (lowest is 59.30%) and after application of low intensity filtering (lowest is 61.19%). The low retention percentages shown in the IIV &#8745; IIV<sub>TLSF </sub>column imply that the normalization pipeline also substantially affects the results of the Expresso analysis of Experiment 2.</p>
         </sec>
         <sec>
            <st>
               <p>Choice of intensity signal data</p>
            </st>
            <p>The input to statistical analysis of microarray experiments is a set of real numbers that represent the measured intensity signal for each spot in a microarray. Much statistical analysis of microarray data has traditionally used median intensity values (MIV). The alternative used in TM4 is the integrated intensity value (IIV). (See Materials and Methods.) Since IIV is intended to integrate the measured intensity across the biological sample printed at a spot, one might expect IIV to be a more accurate assessment of the biological measurement than MIV data. For example, a spot having 100 pixels and a median intensity of 5,000 has the same IIV as a spot having 50 pixels and a median intensity of 10,000.</p>
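            <p>The example in the preceding paragraph can be checked with a short Python sketch. It assumes, purely for illustration, that IIV is the plain sum of pixel intensities in a spot and that every pixel in a spot has the same intensity. The function names miv and iiv are hypothetical, and the actual TM4 computation (see Materials and Methods) may differ, for instance in how background is handled.</p>

```python
from statistics import median


def miv(pixels):
    """Median intensity value of a spot, given its pixel intensities."""
    return median(pixels)


def iiv(pixels):
    """Integrated intensity value, assumed here to be the sum of pixel intensities."""
    return sum(pixels)


# Idealized spots in which every pixel has the same intensity (an assumption).
spot_a = [5000] * 100    # 100 pixels, median intensity 5,000
spot_b = [10000] * 50    # 50 pixels, median intensity 10,000

print(miv(spot_a), iiv(spot_a))
print(miv(spot_b), iiv(spot_b))
```

            <p>Under these assumptions the two spots differ twofold in MIV but have identical IIV, which shows how the choice of input signal can change the apparent expression level of a spot.</p>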
            <p>This study provides the opportunity to observe the difference that choosing MIV or IIV makes on the sets of genes ultimately identified as differentially expressed. We used the GP model to analyze unnormalized MIV and IIV data from Experiment 1, and we used the GOT model to analyze unnormalized MIV and IIV data from Experiment 2. Table <tblr tid="T3">3</tblr> reports a summary of the results. In Experiment 1, 725 WT genes are assessed as up-expressed and 774 WT genes as down-expressed when MIV data are used in Expresso. These numbers decrease to 567 up-expressed genes and 552 down-expressed genes when IIV data are used instead. A similar trend is observed in the Experiment 2 results. These results suggest that employing IIV input data with Expresso analysis leads to more conservative results than employing MIV input data.</p>
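            <p>The two percentage columns of Table 3 follow directly from the MIV, IIV, and Common counts. The Python sketch below reproduces them for the two WT rows of Experiment 1. The function name overlap_percentages is hypothetical and the sketch is only an arithmetic check, not part of Expresso or TM4.</p>

```python
def overlap_percentages(n_miv, n_iiv, n_common):
    """Return the common count as a percentage of the MIV set and of the IIV set."""
    pct_of_miv = round(100.0 * n_common / n_miv, 2)
    pct_of_iiv = round(100.0 * n_common / n_iiv, 2)
    return pct_of_miv, pct_of_iiv


# Counts taken from the WT rows of Table 3 (GP model, Experiment 1).
print(overlap_percentages(725, 567, 501))   # WT, up-expressed
print(overlap_percentages(774, 552, 515))   # WT, down-expressed
```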
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Comparison of MIV and IIV. We compare MIV and IIV input data with respect to the sets of genes ultimately identified as differentially expressed. The genotype and set labelings are the same as those in Table 1. The GP model was used to analyze the unnormalized MIV and IIV data from Experiment 1. The GOT model was used to analyze the unnormalized MIV and IIV data from Experiment 2. Up-expressed and down-expressed counts are reported for both experiments and all genotypes. The count of genes in the intersections is found in column Common. For convenience, percentages of the intersection with respect to the MIV and IIV sets are tabulated in the last two columns. The ALL entries correspond to the H1 hypotheses of the GOT model (see Materials and Methods).</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Genotype</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MIV</p>
                     </c>
                     <c ca="center">
                        <p>IIV</p>
                     </c>
                     <c ca="center">
                        <p>Common</p>
                     </c>
                     <c ca="center">
                        <p>% of common in MIV</p>
                     </c>
                     <c ca="center">
                        <p>% of common in IIV</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5" ca="center">
                        <p>GP Model &#8212; Experiment 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>725</p>
                     </c>
                     <c ca="center">
                        <p>567</p>
                     </c>
                     <c ca="center">
                        <p>501</p>
                     </c>
                     <c ca="center">
                        <p>69.10%</p>
                     </c>
                     <c ca="center">
                        <p>88.36%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WT</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>774</p>
                     </c>
                     <c ca="center">
                        <p>552</p>
                     </c>
                     <c ca="center">
                        <p>515</p>
                     </c>
                     <c ca="center">
                        <p>66.54%</p>
                     </c>
                     <c ca="center">
                        <p>93.30%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>518</p>
                     </c>
                     <c ca="center">
                        <p>422</p>
                     </c>
                     <c ca="center">
                        <p>357</p>
                     </c>
                     <c ca="center">
                        <p>68.92%</p>
                     </c>
                     <c ca="center">
                        <p>84.60%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>antiPLD</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>505</p>
                     </c>
                     <c ca="center">
                        <p>336</p>
                     </c>
                     <c ca="center">
                        <p>313</p>
                     </c>
                     <c ca="center">
                        <p>61.98%</p>
                     </c>
                     <c ca="center">
                        <p>93.15%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5" ca="center">
                        <p>GOT Model &#8212; Experiment 2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>311</p>
                     </c>
                     <c ca="center">
                        <p>227</p>
                     </c>
                     <c ca="center">
                        <p>198</p>
                     </c>
                     <c ca="center">
                        <p>63.67%</p>
                     </c>
                     <c ca="center">
                        <p>87.22%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>190</p>
                     </c>
                     <c ca="center">
                        <p>122</p>
                     </c>
                     <c ca="center">
                        <p>110</p>
                     </c>
                     <c ca="center">
                        <p>57.89%</p>
                     </c>
                     <c ca="center">
                        <p>90.16%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>185</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                     <c ca="center">
                        <p>89</p>
                     </c>
                     <c ca="center">
                        <p>48.11%</p>
                     </c>
                     <c ca="center">
                        <p>83.96%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                     <c ca="center">
                        <p>60.82%</p>
                     </c>
                     <c ca="center">
                        <p>93.65%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>326</p>
                     </c>
                     <c ca="center">
                        <p>234</p>
                     </c>
                     <c ca="center">
                        <p>181</p>
                     </c>
                     <c ca="center">
                        <p>55.52%</p>
                     </c>
                     <c ca="center">
                        <p>77.35%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>482</p>
                     </c>
                     <c ca="center">
                        <p>331</p>
                     </c>
                     <c ca="center">
                        <p>273</p>
                     </c>
                     <c ca="center">
                        <p>56.64%</p>
                     </c>
                     <c ca="center">
                        <p>82.48%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>363</p>
                     </c>
                     <c ca="center">
                        <p>238</p>
                     </c>
                     <c ca="center">
                        <p>219</p>
                     </c>
                     <c ca="center">
                        <p>60.33%</p>
                     </c>
                     <c ca="center">
                        <p>92.02%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>251</p>
                     </c>
                     <c ca="center">
                        <p>196</p>
                     </c>
                     <c ca="center">
                        <p>166</p>
                     </c>
                     <c ca="center">
                        <p>66.14%</p>
                     </c>
                     <c ca="center">
                        <p>84.69%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>448</p>
                     </c>
                     <c ca="center">
                        <p>298</p>
                     </c>
                     <c ca="center">
                        <p>249</p>
                     </c>
                     <c ca="center">
                        <p>55.58%</p>
                     </c>
                     <c ca="center">
                        <p>83.56%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ALL</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>294</p>
                     </c>
                     <c ca="center">
                        <p>202</p>
                     </c>
                     <c ca="center">
                        <p>159</p>
                     </c>
                     <c ca="center">
                        <p>54.08%</p>
                     </c>
                     <c ca="center">
                        <p>78.71%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
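            <p>As an illustrative check, and not part of the Expresso or TM4 software, the two percentage columns in the rows above can be reproduced from the three counts in each row. The sketch assumes that the first percentage is the third count divided by the first count and the second percentage is the third count divided by the second count; the rows shown are consistent with that reading. The function and variable names are hypothetical.</p>

```python
def overlap_percentages(count_a, count_b, shared):
    """Return the shared count as a percentage of each of the two list sizes.

    Assumption: count_a and count_b are the first two counts in a table row
    and shared is the third count (genes common to both lists).
    """
    return (round(100.0 * shared / count_a, 2),
            round(100.0 * shared / count_b, 2))


# Rows copied from the table: (genotype, direction, count_a, count_b, shared)
rows = [
    ("antiPLD", "+", 518, 422, 357),
    ("Col-0", "+", 311, 227, 198),
    ("ALL", "-", 294, 202, 159),
]

for genotype, direction, count_a, count_b, shared in rows:
    pct_a, pct_b = overlap_percentages(count_a, count_b, shared)
    print(f"{genotype} {direction}: {pct_a:.2f}% {pct_b:.2f}%")
```

            <p>Under this assumption the computed values agree with the tabulated percentages for every row shown.</p>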
         </sec>
         <sec>
            <st>
               <p>Comparison of statistical methods</p>
            </st>
            <p>We compared the performance of the GP model, the GOT model, and the <it>t</it>-test of MEV in identifying differentially expressed genes in Experiment 2. We used the IIV<sub>TLSF</sub> data of Experiment 2 as input to these methods. We also contrast these results with those obtained by analyzing MIV data with the GP model. Table <tblr tid="T4">4</tblr> reports the gene counts for these analyses.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Comparison of Expresso analysis and MEV <it>t</it>-test. We compare the number of genes identified as differentially expressed by Expresso analysis and the MEV <it>t</it>-test. Counts of up-expressed (+) and down-expressed (-) genes are reported for all four genotypes of Experiment 2. The first three analyses (the GP model, the GOT model, and the MEV <it>t</it>-test) take the IIV<sub>TLSF</sub> data as input. As a point of comparison, the last analysis uses the GP model on MIV input.</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>GP Model</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>GOT Model</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>MEV <it>t</it>-test</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>GP Model</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>MIV</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genotype</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>1837</p>
                     </c>
                     <c ca="center">
                        <p>1563</p>
                     </c>
                     <c ca="center">
                        <p>270</p>
                     </c>
                     <c ca="center">
                        <p>183</p>
                     </c>
                     <c ca="center">
                        <p>2125</p>
                     </c>
                     <c ca="center">
                        <p>2007</p>
                     </c>
                     <c ca="center">
                        <p>1761</p>
                     </c>
                     <c ca="center">
                        <p>1212</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>817</p>
                     </c>
                     <c ca="center">
                        <p>1248</p>
                     </c>
                     <c ca="center">
                        <p>71</p>
                     </c>
                     <c ca="center">
                        <p>67</p>
                     </c>
                     <c ca="center">
                        <p>478</p>
                     </c>
                     <c ca="center">
                        <p>823</p>
                     </c>
                     <c ca="center">
                        <p>669</p>
                     </c>
                     <c ca="center">
                        <p>1403</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>2606</p>
                     </c>
                     <c ca="center">
                        <p>2002</p>
                     </c>
                     <c ca="center">
                        <p>150</p>
                     </c>
                     <c ca="center">
                        <p>224</p>
                     </c>
                     <c ca="center">
                        <p>535</p>
                     </c>
                     <c ca="center">
                        <p>388</p>
                     </c>
                     <c ca="center">
                        <p>2422</p>
                     </c>
                     <c ca="center">
                        <p>1184</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>2016</p>
                     </c>
                     <c ca="center">
                        <p>1314</p>
                     </c>
                     <c ca="center">
                        <p>361</p>
                     </c>
                     <c ca="center">
                        <p>240</p>
                     </c>
                     <c ca="center">
                        <p>1144</p>
                     </c>
                     <c ca="center">
                        <p>827</p>
                     </c>
                     <c ca="center">
                        <p>2395</p>
                     </c>
                     <c ca="center">
                        <p>1457</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Totals</p>
                     </c>
                     <c ca="center">
                        <p>7276</p>
                     </c>
                     <c ca="center">
                        <p>6127</p>
                     </c>
                     <c ca="center">
                        <p>852</p>
                     </c>
                     <c ca="center">
                        <p>714</p>
                     </c>
                     <c ca="center">
                        <p>4282</p>
                     </c>
                     <c ca="center">
                        <p>4045</p>
                     </c>
                     <c ca="center">
                        <p>7247</p>
                     </c>
                     <c ca="center">
                        <p>5256</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The plot in Figure <figr fid="F2">2a</figr> demonstrates that the estimates of log<sub>2</sub>(fold change) are the same in GP and GOT. As Figure <figr fid="F2">2b</figr> shows, the <it>p</it>-values from GOT are smaller than the <it>p</it>-values calculated by GP. Overall, use of the MEV <it>t</it>-test resulted in fewer genes assessed as significantly differentially expressed than did the GP model. The results obtained when MIV data were used as input to GP are closest to the GP results obtained using IIV<sub>TLSF</sub>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Comparison of GP and GOT models</p>
               </caption>
               <text>
                  <p><b>Comparison of GP and GOT models</b>. Comparison of estimated log<sub>2</sub>(fold change) and the corresponding <it>p</it>-value estimates for the GP and GOT model results for the WS ecotype in Table 4. (a) Scatter plot of the estimated log<sub>2</sub>(fold change) values from the GP and GOT models; these values are essentially identical. (b) Scatter plot of -log<sub>10</sub>(<it>p</it>-value), again for the GP and GOT models. The dotted lines correspond to -log<sub>10</sub>(0.05), as our significance cutoff is 0.05. For the GP model, the points to the right of the dotted line are significant. For the GOT model, the points above the dotted line are significant.</p>
               </text>
               <graphic file="1471-2105-7-215-2"/>
            </fig>
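            <p>The dotted lines in Figure 2b can be related to the 0.05 cutoff with a short calculation. The following is a minimal sketch, not code from Expresso or TM4. The function names and the two example <it>p</it>-values are hypothetical, and treating a <it>p</it>-value exactly equal to the cutoff as not significant is an assumption.</p>

```python
import math

ALPHA = 0.05  # significance cutoff stated in the Figure 2 legend


def neg_log10(p_value):
    """Transform a p-value to the -log10 scale used on the plot axes."""
    return -math.log10(p_value)


# Position of the dotted lines on both axes, approximately 1.301.
THRESHOLD = neg_log10(ALPHA)


def is_significant(p_value, alpha=ALPHA):
    """True when the point lies strictly beyond the dotted line.

    Assumption: a p-value exactly equal to alpha is not counted as significant.
    """
    return neg_log10(p_value) > neg_log10(alpha)


# Hypothetical p-values for a single gene under the two models.
example = {"GP": 0.01, "GOT": 0.20}
for model, p_value in example.items():
    print(model, round(neg_log10(p_value), 3), is_significant(p_value))
```

            <p>A point is significant for a given model when its coordinate on that model's axis exceeds the threshold, which is why significant points lie to the right of one dotted line for the GP model and above the other for the GOT model.</p>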
            <p>To compare the effectiveness of Expresso and TM4 in identifying differential gene expression, we compared the identified direction of differential expression of a select set of genes per genotype in Experiment 2 with results obtained by qRT-PCR (Table <tblr tid="T5">5</tblr>). The lowest overall agreement (71.9%) is between the qRT-PCR results and the MEV <it>t</it>-test results using IIV<sub>TLSF</sub>. The log(fold change) estimates of the GP model using IIV<sub>TLSF</sub> have 77.1% agreement with the qRT-PCR results, which is slightly higher than the percentage for the MEV <it>t</it>-test. The results of the GP model using MIV data demonstrated the greatest agreement, 90.1%, with the qRT-PCR results.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Comparison of qRT-PCR with Expresso and TM4. We compare the qRT-PCR results with identified up-expressed and down-expressed genes in Experiment 2 using Expresso and TM4. The number <it>n</it> of genes with qRT-PCR results for each of the four ecotypes is given in the second column. Numbers of agreement or non-agreement are shown in the S, D, and F columns. The S (same) column tabulates the number of genes for which the sign of the log(fold change) for statistically significant differential expression matches the direction of change in the corresponding qRT-PCR result. The D (differing) column tabulates the number of genes for which the sign of the log(fold change) for statistically significant differential expression is opposite to the direction of change in the corresponding qRT-PCR result. The F (filtered) column tabulates the number of genes for which there is a qRT-PCR result, but for which either the gene was filtered by MIDAS low intensity filtering or the analysis method did not assess the change in expression as statistically significant. The MEV <it>t</it>-test (third grouping) results are for the typical TM4 process, which involves IIV input data followed by the four MIDAS steps, which we denote IIV<sub>TLSF</sub>. The GP model (second grouping) reports the same three columns and uses the same IIV<sub>TLSF</sub> input data. The GP model (first grouping) results use MIV input data and have no filtered genes.</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>GP Model</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>GP Model</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>MEV <it>t</it>-test</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>qRT-PCR</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>MIV</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>IIV<sub>TLSF</sub></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genotype</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>n</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                     <c ca="center">
                        <p>D</p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                     <c ca="center">
                        <p>D</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                     <c ca="center">
                        <p>S</p>
                     </c>
                     <c ca="center">
                        <p>D</p>
                     </c>
                     <c ca="center">
                        <p>F</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>42</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>52</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Th</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>192</p>
                     </c>
                     <c ca="center">
                        <p>173</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>148</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>138</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Percentage</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>90.1%</p>
                     </c>
                     <c ca="center">
                        <p>9.9%</p>
                     </c>
                     <c ca="center">
                        <p>77.1%</p>
                     </c>
                     <c ca="center">
                        <p>15.6%</p>
                     </c>
                     <c ca="center">
                        <p>7.3%</p>
                     </c>
                     <c ca="center">
                        <p>71.9%</p>
                     </c>
                     <c ca="center">
                        <p>14.6%</p>
                     </c>
                     <c ca="center">
                        <p>13.5%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
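            <p>The percentage row of Table 5 follows directly from the totals row: each count is divided by the 192 genes with qRT-PCR results. The sketch below, which is illustrative and not part of either software package, reproduces that arithmetic. The dictionary layout and function name are hypothetical, and the F count of 0 for the MIV grouping reflects the statement in the caption that this grouping has no filtered genes.</p>

```python
N_TOTAL = 192  # total number of qRT-PCR results across the four genotypes

# Totals row of Table 5, keyed by analysis method and input data.
totals = {
    "GP model, MIV": {"S": 173, "D": 19, "F": 0},  # F = 0: no filtered genes
    "GP model, IIV_TLSF": {"S": 148, "D": 30, "F": 14},
    "MEV t-test, IIV_TLSF": {"S": 138, "D": 28, "F": 26},
}


def percentages(counts, n):
    """Express each count as a percentage of n, rounded to one decimal place."""
    return {key: round(100.0 * value / n, 1) for key, value in counts.items()}


for method, counts in totals.items():
    # Every qRT-PCR result falls into exactly one of S, D, or F.
    assert sum(counts.values()) == N_TOTAL
    print(method, percentages(counts, N_TOTAL))
```

            <p>The S percentages computed this way are 90.1, 77.1, and 71.9, matching the agreement figures quoted in the text.</p>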
            <p>Figures <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, and <figr fid="F5">5</figr> present the assessed log<sub>2</sub>(fold change) values for 50 selected Col-0 genes in Experiment 2, along with their qRT-PCR values. These are the 50 Col-0 genes, among the 55 with qRT-PCR values, for which we have expression values for all methods. For each gene, a histogram of the log<sub>2</sub>(fold change) estimates of qRT-PCR, the GP model using MIV, the GP model using IIV<sub>TLSF</sub>, and the MEV <it>t</it>-test is given. The 50 histograms are spread over three figures to enhance readability and are in increasing order by qRT-PCR estimated change. In general, the log<sub>2</sub>(fold change) estimates of the GP model and of the MEV <it>t</it>-test, both using IIV<sub>TLSF</sub> input data, are approximately the same, while differing slightly from the estimates of the GP model using MIV input data. As might be expected, disagreements between qRT-PCR and microarray results are more prevalent for small estimated log<sub>2</sub>(fold change) values. The histograms for genes AT4G09020 (Figure <figr fid="F3">3</figr>), AT1G35580 (Figure <figr fid="F4">4</figr>), and AT3G29360 (Figure <figr fid="F4">4</figr>) show that the direction of the qRT-PCR log<sub>2</sub>(fold change) estimate matches the direction of the estimate of the GP model using MIV input data, while differing from the direction of the estimates of the GP model and the MEV <it>t</it>-test using IIV<sub>TLSF</sub> input data.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Comparison of qRT-PCR results, Part I</p>
               </caption>
               <text>
                  <p><b>Comparison of qRT-PCR results, Part I</b>. Comparison of qRT-PCR results to Expresso and MEV <it>t</it>-test results for the first 16 of 50 selected genes of the Col-0 genotype of Experiment 2.</p>
               </text>
               <graphic file="1471-2105-7-215-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Comparison of qRT-PCR results, Part II</p>
               </caption>
               <text>
                  <p><b>Comparison of qRT-PCR results, Part II</b>. Comparison of qRT-PCR results to Expresso and MEV <it>t</it>-test results for the middle 17 of 50 selected genes of the Col-0 genotype of Experiment 2.</p>
               </text>
               <graphic file="1471-2105-7-215-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Comparison of qRT-PCR results, Part III</p>
               </caption>
               <text>
                  <p><b>Comparison of qRT-PCR results, Part III</b>. Comparison of qRT-PCR results to Expresso and MEV <it>t</it>-test results for the last 17 of 50 selected genes of the Col-0 genotype of Experiment 2.</p>
               </text>
               <graphic file="1471-2105-7-215-5"/>
            </fig>
            <p>Table <tblr tid="T6">6</tblr> summarizes genotype-specific correlation results, which demonstrate that the GP model using MIV input data has a higher correlation with qRT-PCR than either the GP model or the MEV <it>t</it>-test using IIV<sub>TLSF </sub>input data. The highest correlation of 0.85 is for the Col-0 results of qRT-PCR versus the GP model using MIV input. The corresponding correlations for Cvi-0, WS, and Th are 0.73, 0.66, and 0.84, respectively, each of which is the best among the analysis methods for that genotype.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Correlation of qRT-PCR with Expresso and TM4. We calculate the correlation of qRT-PCR results with results from the Expresso GP model and the MEV <it>t</it>-test. For each comparison, the analytical result for each gene for which there is a qRT-PCR result for a particular genotype is assembled into a result vector. We used SAS to compute a Pearson correlation of each result vector with the corresponding vector of qRT-PCR results. The computed correlations are reported in the table.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4" ca="center">
                        <p>Genotype-Specific Correlation</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Comparison</p>
                     </c>
                     <c ca="center">
                        <p>Col-0</p>
                     </c>
                     <c ca="center">
                        <p>Cvi-0</p>
                     </c>
                     <c ca="center">
                        <p>WS</p>
                     </c>
                     <c ca="center">
                        <p>Th</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>qRT-PCR versus GP model with MIV input</p>
                     </c>
                     <c ca="center">
                        <p>0.85</p>
                     </c>
                     <c ca="center">
                        <p>0.73</p>
                     </c>
                     <c ca="center">
                        <p>0.66</p>
                     </c>
                     <c ca="center">
                        <p>0.84</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>qRT-PCR versus GP model with IIV<sub>TLSF </sub>input</p>
                     </c>
                     <c ca="center">
                        <p>0.83</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.61</p>
                     </c>
                     <c ca="center">
                        <p>0.78</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>qRT-PCR versus MEV <it>t</it>-test with IIV<sub>TLSF </sub>input</p>
                     </c>
                     <c ca="center">
                        <p>0.83</p>
                     </c>
                     <c ca="center">
                        <p>0.65</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.77</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
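            <p>The correlation procedure described in the caption of Table 6 can be sketched as follows. This is only an illustration in Python, not the SAS procedure used for the reported values; the function name <it>pearson</it> and the two short vectors are hypothetical and are not data from this study.</p>

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length result vectors."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("vectors must be non-empty and of equal length")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dx = [v - mean_x for v in x]
    dy = [v - mean_y for v in y]
    covariance = sum(a * b for a, b in zip(dx, dy))
    norm = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    if norm == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return covariance / norm

# Hypothetical log2(fold change) values for five genes (illustration only).
qrt_pcr = [-1.2, -0.3, 0.4, 1.1, 2.0]
microarray = [-0.9, -0.1, 0.2, 0.8, 1.5]
r = pearson(qrt_pcr, microarray)
```

            <p>In this sketch, each vector holds one entry per gene that has a qRT-PCR result for the genotype in question, in the same gene order in both vectors.</p>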
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Our integration and comparison of Expresso analysis and the capabilities of TM4 has highlighted successes in microarray analysis, some similarities, and some differences. The success of microarray analysis is demonstrated by considerable agreement between qRT-PCR results and the results of all the examined microarray analysis methods. The greatest agreement was found when median intensity value (MIV) inputs were analyzed with the Expresso GP analysis model. We also found that the use of integrated intensity value (IIV) inputs for Expresso analysis consistently resulted in fewer genes identified as differentially expressed when compared to results from MIV inputs. This suggests that the use of IIV inputs is more conservative than the use of MIV inputs, while MIV inputs may give greater agreement with qRT-PCR results than IIV inputs.</p>
         <p>Our results demonstrate that the MIDAS normalization and filtering pipeline corrects systematic intensity-dependent dye bias on a per microarray basis. The normalization stage in Expresso analysis removes global effects across all microarrays and complements the per microarray normalization methods of MIDAS. The generally better agreement of Expresso analysis with qRT-PCR results when compared to the MEV <it>t</it>-test suggests that it would be desirable for MEV to have an ANOVA test that has the greater flexibility of the Expresso gene model.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Median and integrated intensity values</p>
            </st>
            <p>This research considers two ways of measuring spot intensity, one or both of which are reported by typical microarray image processing software. The median intensity value (MIV) of a spot is the median value of all the pixels identified as part of the spot. The integrated intensity value (IIV) of a spot is the total value of all the pixels identified as part of the spot. In this research, both are background-corrected values. If the IIV data is unavailable, but the radius of the bounding circle of the spot and its average intensity value are available, then the IIV data can be estimated as the product of the average intensity and the number of pixels in the circle. This is the estimate used by ExpressConverter (below) when it converts GPR format data to MEV format data.</p>
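            <p>The IIV estimate described above can be sketched as follows. The function names are hypothetical, and approximating the pixel count of the bounding circle by its area is an assumption of this sketch; the exact pixel-counting rule used by ExpressConverter is not stated here.</p>

```python
import math

def circle_pixel_count(radius_px):
    """Approximate number of pixels in a circle by its area (an assumption)."""
    return round(math.pi * radius_px * radius_px)

def estimate_iiv(average_intensity, n_pixels):
    """Estimate the integrated intensity as average intensity times pixel count."""
    return average_intensity * n_pixels

# Hypothetical spot: bounding circle of radius 5 pixels, average intensity 100.
iiv = estimate_iiv(100.0, circle_pixel_count(5))
```

            <p>When the individual pixel values are available, the product of their mean and their count equals their sum, which is the IIV as defined above.</p>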
         </sec>
         <sec>
            <st>
               <p>Microarray data sets</p>
            </st>
            <p>We used data sets from two experiments that used the Arabidopsis Oligonucleotide Microarrays <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, which include 25,712 elements, each a gene-specific 70-mer (Qiagen/Operon, Valencia, CA) for a known or putative open reading frame in <it>Arabidopsis thaliana</it>. There are 48 blocks per microarray, 25 rows by 24 columns (600 spots) per block, and 28,800 spots per microarray, including spots for the 25,712 gene-specific 70-mers and 302 control elements. The remaining 2,786 spots are blank.</p>
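            <p>The spot counts quoted above are mutually consistent, as the following short check shows. The variable names are ours and serve only as an illustration.</p>

```python
# Microarray layout figures quoted in the text.
blocks_per_array = 48
rows_per_block = 25
columns_per_block = 24
gene_spots = 25712
control_spots = 302

spots_per_block = rows_per_block * columns_per_block
spots_per_array = blocks_per_array * spots_per_block
blank_spots = spots_per_array - gene_spots - control_spots
```
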
            <sec>
               <st>
                  <p>Experiment 1</p>
               </st>
               <p>Experiment 1 compares the responses of <it>Arabidopsis thaliana </it>wild type, ecotype Columbia (henceforth, WT), and of an antisense plant for phospholipase D &#945; (antiPLD) in Columbia background to drought stress <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Plants were harvested at a single time point, and two biological replicate hybridizations were done for each of WT and antiPLD. ScanArray Express (PerkinElmer Life and Analytical Sciences, Inc., Boston, MA USA) was used to quantitate the four microarrays. By default, ScanArray Express performs a global lowess normalization of median intensities per microarray. ScanArray Express output its results to four files in GenePix (GPR) format, which constitute the Experiment 1 data set.</p>
            </sec>
            <sec>
               <st>
                  <p>Experiment 2</p>
               </st>
               <p>Li, et al., <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> compare the responses to elevated CO<sub>2 </sub>of a wild <it>Arabidopsis thaliana </it>relative (<it>Thellungiella halophila</it>, ecotype Shandong; Th) and of three <it>Arabidopsis thaliana </it>ecotypes: Wassilewskija (WS), Columbia (Col-0), and Cape Verde Islands (Cvi-0). Three biological replicate hybridizations were done for each genotype. GenePix (Axon Instruments, Union City, CA USA) was used to quantitate the twelve microarrays. GenePix also performs by default a global lowess normalization of median intensities per microarray. The output of GenePix is twelve GPR files, which constitute the Experiment 2 data set.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Real-time quantitative RT-PCR</p>
            </st>
            <p>For verification of microarray results in Experiment 2, Li, et al., <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> performed real-time quantitative reverse-transcriptase PCR (qRT-PCR) for selected genes &#8212; 55 in Col-0; 52 in Cvi-0; 59 in WS; 26 in Th. Supplementary Table 1 - see <supplr sid="S2">Additional file: 2</supplr> contains the annotation of the selected genes. In brief, primer pairs were selected to represent unique sequences in the <it>Arabidopsis thaliana </it>genome and in the <it>Thellungiella </it>sequences deposited in NCBI. <it>Thellungiella </it>actin (CX129618) cDNA primers and <it>Arabidopsis thaliana </it>Ubiquitin-10 cDNA primers were used as internal controls in the qRT-PCR analyses. RT-PCR products were detected using the fluorescent dye SYBR-green (Applied Biosystems, Foster City, CA USA) and the ABI PRISM/Taqman 7900 Sequence Detection System (Applied Biosystems, Foster City, CA USA). Dissociation curves were generated for each reaction to ensure specific amplification. Three repeats were done for each gene. The averaged threshold cycle numbers were used to estimate original mRNA levels.</p>
            <p>The microarray results of Experiment 2 suggested that exposure to elevated CO<sub>2 </sub>resulted in changes in expression of many genes associated with carbon metabolism, and those associated with photosynthetic carbon metabolism in particular. This included genes encoding proteins that transport photosynthate out of the chloroplast, where carbon fixation takes place, for export to other parts of the cell, and also genes encoding transport proteins that export carbon skeletons out of the cell to other tissues where growth is taking place. Because of this finding, it was important to validate the results obtained for gene expression associated with carbon metabolism with qRT-PCR. Hence, a number of the genes in Supplementary Table 1 - see <supplr sid="S2">Additional file: 2</supplr> are related to carbon metabolism.</p>
         </sec>
         <sec>
            <st>
               <p>Expresso analysis</p>
            </st>
            <p>Expresso analysis employs a general and flexible method to identify differentially expressed genes that is adapted from the two-stage analysis method of Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In general, Expresso analysis consists of two log-linear ANOVA mixed models, called the normalization model and the gene model. The first estimates and removes the experiment-wise systematic errors, while the second estimates and removes the gene-specific errors. The residual that remains is the log-ratio estimate for each gene. In particular, the Tukey-Kramer multiple comparison of treatment effects on each gene is performed to estimate its expression level and the significance of (confidence in) that expression level. Expresso analysis is implemented for and executed on SAS (SAS/STAT version 8.2, SAS Institute Inc., Cary, NC USA).</p>
            <p>The original model of Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> includes the treatment and the array as the main effects. In previous Expresso analysis, we have extended that model to experiment-appropriate models that include additional fixed and random effects. Here, the design of the two-dye oligonucleotide microarray used in Experiment 1 and Experiment 2 includes various controls strategically positioned in different blocks of the microarray. This makes it possible to estimate the random block effect in each microarray. Furthermore, the dye effect is included in the normalization model to estimate and remove the global dye bias.</p>
            <p>For this research, we developed two Expresso models, one whose gene model assesses the gene-(plant sample) effect (the GP model) and the other whose gene model assesses the gene-genotype-treatment effect (the GOT model). The GP model is much like previous Expresso models and is applicable to both Experiment 1 and Experiment 2. However, the GOT model is specific to analyzing Experiment 2. In both experiments, we used the GP model to estimate the differences in response of individual genotypes to treatment (drought stress versus control in Experiment 1, elevated CO<sub>2 </sub>versus control in Experiment 2). The GOT model, in contrast, was used to estimate the effects of treatment (elevated CO<sub>2</sub>) apart from the effect of individual genotypes.</p>
            <sec>
               <st>
                  <p>Expresso GP model</p>
               </st>
               <p>The normalization model is</p>
               <p><it>y</it><sub><it>spdab </it></sub>= <it>&#956; </it>+ <it>P</it><sub><it>p </it></sub>+ <it>D</it><sub><it>d </it></sub>+ <it>A</it><sub><it>a </it></sub>+ (<it>P </it>&#215; <it>A</it>)<sub><it>pa </it></sub>+ <it>B</it><sub><it>ba </it></sub>+ <it>r</it><sub><it>spdab</it></sub>.</p>
               <p>Each <it>y</it><sub><it>spdab </it></sub>value is the log<sub>2</sub>-transformed intensity of spot <it>s </it>within the dye <it>d </it>image in block <it>b </it>of array <it>a</it>. (A spot may represent a gene, a control, or a blank.) The global mean of the <it>y</it><sub><it>spdab </it></sub>values, over all microarrays, is <it>&#956;</it>. The fixed effects in the model are the plant sample effect <it>P</it><sub><it>p</it></sub>, where <it>p </it>indexes the various distinct plant samples from which mRNA was obtained, and the dye effect <it>D</it><sub><it>d</it></sub>, where <it>d </it>has two values for the two dyes. The random effects in the model are the array effect <it>A</it><sub><it>a</it></sub>, where <it>a </it>indexes the microarrays, the interaction effect (<it>P </it>&#215; <it>A</it>)<sub><it>pa </it></sub>of plant sample <it>p </it>with microarray <it>a</it>, and the block effect <it>B</it><sub><it>ba</it></sub>, where <it>b </it>identifies the block within microarray <it>a</it>. The model residual is <it>r</it><sub><it>spdab</it></sub>. This differs from the normalization model in Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, in that it incorporates dye and block effects. It is a refinement of the Expresso normalization model in Watkinson, et al., <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, in that it has no printing pin effect, which is specific to the analysis in the earlier paper, and includes the block effect.</p>
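               <p>To illustrate the role of the first stage, the following sketch removes a global mean, a dye effect, and an array effect from log<sub>2</sub> intensities and returns the residuals. It is a deliberately reduced illustration, not the Expresso implementation: it treats all effects as fixed, omits the plant sample, interaction, and block terms, and estimates each effect as a plain mean, which matches the least-squares fit only for a balanced layout. The function name and the example intensities are hypothetical.</p>

```python
import math
from collections import defaultdict

def normalization_residuals(records):
    """Return residuals of log2 intensities after removing additive effects.

    records: list of (dye, array, intensity) tuples, one per spot image.
    Simplification: only the global mean, dye effect, and array effect are
    removed, each estimated as a plain mean (valid for a balanced layout).
    """
    logs = [math.log2(intensity) for _, _, intensity in records]
    grand_mean = sum(logs) / len(logs)
    by_dye = defaultdict(list)
    by_array = defaultdict(list)
    for (dye, array, _), y in zip(records, logs):
        by_dye[dye].append(y)
        by_array[array].append(y)
    dye_effect = {d: sum(v) / len(v) - grand_mean for d, v in by_dye.items()}
    array_effect = {a: sum(v) / len(v) - grand_mean for a, v in by_array.items()}
    return [y - grand_mean - dye_effect[d] - array_effect[a]
            for (d, a, _), y in zip(records, logs)]

# Hypothetical intensities: two dyes on two arrays, two spots per image.
example = [
    ("Cy3", "array1", 1200.0), ("Cy3", "array1", 800.0),
    ("Cy5", "array1", 900.0), ("Cy5", "array1", 700.0),
    ("Cy3", "array2", 1500.0), ("Cy3", "array2", 1000.0),
    ("Cy5", "array2", 1100.0), ("Cy5", "array2", 950.0),
]
residuals = normalization_residuals(example)
```

               <p>The residuals produced by such a first stage are the input to the gene model of the second stage.</p>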
               <p>The second stage of the analysis uses the residual values <it>r</it><sub><it>spdab </it></sub>computed in the first stage to estimate the interaction between an individual gene <it>g </it>and each plant sample <it>p </it>at a significance level &#8804; &#945; = 0.05. Index <it>g </it>is added to the residual values <it>r</it><sub><it>spdab</it></sub>, yielding <it>r</it><sub><it>gspdab</it></sub>. The value of <it>g </it>is determined using the mapping of <it>s </it>index values to <it>g </it>index values. The gene model is</p>
               <p><it>r</it><sub><it>gspdab </it></sub>= <it>G</it><sub><it>g </it></sub>+ (<it>G </it>&#215; <it>P</it>)<sub><it>gp </it></sub>+ (<it>G </it>&#215; <it>D</it>)<sub><it>gd </it></sub>+ (<it>G </it>&#215; <it>A</it>)<sub><it>ga </it></sub>+ &#955;<sub><it>gspdab</it></sub>.</p>
               <p>Here, <it>s </it>is a spot that represents gene <it>g </it>(not a blank or control) within the dye <it>d </it>image in block <it>b </it>of array <it>a</it>. The value <it>G</it><sub><it>g </it></sub>is the mean of residual values for all spots that represent gene <it>g </it>in all images. The interactions (<it>G </it>&#215; <it>P</it>)<sub><it>gp </it></sub>of gene <it>g </it>with plant sample <it>p </it>and (<it>G </it>&#215; <it>D</it>)<sub><it>gd </it></sub>of gene <it>g </it>with dye <it>d </it>are the fixed effects. The interaction (<it>G </it>&#215; <it>A</it>)<sub><it>ga </it></sub>of gene <it>g </it>with microarray <it>a </it>is a random effect. The &#955;<sub><it>gspdab </it></sub>values are stochastic errors. This differs from the gene model in Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, in that it incorporates interactions between gene and dye and between gene and array. It refines the Expresso gene model in Watkinson, et al., <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to include the interaction between gene and array.</p>
               <p>The estimate of the expression level of each gene in each treatment comparison is done by computing the pair-wise least square mean differences of gene-treatment effects. The Tukey-Kramer multiple comparison of gene-(plant sample) effects on each gene is made to estimate the <it>p </it>values associated with each calculated expression level. If there are &#961; plant samples, then there are <m:math name="1471-2105-7-215-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mrow><m:mo>(</m:mo><m:mrow><m:mtable><m:mtr><m:mtd><m:mi>&#961;</m:mi></m:mtd></m:mtr><m:mtr><m:mtd><m:mn>2</m:mn></m:mtd></m:mtr></m:mtable></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqadaqaauaabeqaceaaaeaaiiGacqWFbpGCaeaacqaIYaGmaaaacaGLOaGaayzkaaaaaa@30FB@</m:annotation></m:semantics></m:math> possible pairwise comparisons. If we index the plant samples from 1 to &#961;, then the null hypothesis for gene <it>g </it>and comparison <it>i</it>, <it>j</it>, where 1 &#8804; <it>i </it>&lt;<it>j </it>&#8804; &#961;, is</p>
               <p><it>H</it><sub>0</sub>: (<it>G </it>&#215; <it>P</it>)<sub><it>gi </it></sub>= (<it>G </it>&#215; <it>P</it>)<sub><it>gj</it></sub>.</p>
               <p>The difference (<it>G </it>&#215; <it>P</it>)<sub><it>gi </it></sub>- (<it>G </it>&#215; <it>P</it>)<sub><it>gj </it></sub>is the estimate of the log<sub>2</sub>(fold change) of gene <it>g </it>in the experimental comparison <it>P</it><sub><it>i </it></sub>versus <it>P</it><sub><it>j</it></sub>. The analysis also yields a <it>p</it>-value for the statistical confidence in each difference.</p>
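               <p>The relation between this difference and the fold change can be sketched as follows. As a simplifying assumption, the sketch replaces the least square means of the mixed model with plain per-sample means of the residuals for one gene, and it ignores the dye and array terms and the Tukey-Kramer adjustment. The function names and residual values are hypothetical.</p>

```python
from collections import defaultdict

def log2_fold_change(gene_residuals, sample_i, sample_j):
    """Mean residual of sample_i minus mean residual of sample_j for one gene.

    gene_residuals: list of (plant_sample, residual) pairs for a single gene.
    """
    groups = defaultdict(list)
    for sample, value in gene_residuals:
        groups[sample].append(value)
    for sample in (sample_i, sample_j):
        if not groups[sample]:
            raise ValueError("no residuals for plant sample " + sample)
    mean_i = sum(groups[sample_i]) / len(groups[sample_i])
    mean_j = sum(groups[sample_j]) / len(groups[sample_j])
    return mean_i - mean_j

def fold_change(log2_fc):
    """Convert a log2(fold change) estimate back to a ratio."""
    return 2.0 ** log2_fc

# Hypothetical residuals for one gene in two plant samples.
gene_residuals = [
    ("WT-stressed", 0.9), ("WT-stressed", 1.1),
    ("WT-control", -0.1), ("WT-control", 0.1),
]
estimate = log2_fold_change(gene_residuals, "WT-stressed", "WT-control")
ratio = fold_change(estimate)
```

               <p>A positive estimate indicates higher expression in the first plant sample of the comparison, and a negative estimate indicates lower expression.</p>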
               <p>The above GP model was used to analyze both Experiment 1 and Experiment 2. In both experiments, there are 48 blocks per array. In Experiment 1, there are four arrays and four plant samples, namely, WT-control, WT-stressed, antiPLD-control, and antiPLD-stressed. In Experiment 2, there are 12 arrays and eight plant samples, namely, Col-0-test, Col-0-control, Cvi-0-test, Cvi-0-control, WS-test, WS-control, Th-test, and Th-control.</p>
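               <p>For the plant samples just listed, the possible pairwise comparisons can be enumerated as in the following sketch. The list and variable names are ours; the plant sample labels are those given above.</p>

```python
from itertools import combinations
from math import comb

experiment_1 = ["WT-control", "WT-stressed", "antiPLD-control", "antiPLD-stressed"]
experiment_2 = [
    genotype + "-" + treatment
    for genotype in ("Col-0", "Cvi-0", "WS", "Th")
    for treatment in ("test", "control")
]

# Every unordered pair (i, j) of distinct plant samples is one comparison.
pairs_1 = list(combinations(experiment_1, 2))
pairs_2 = list(combinations(experiment_2, 2))

# The counts agree with the binomial coefficient "rho choose 2".
count_1 = comb(len(experiment_1), 2)
count_2 = comb(len(experiment_2), 2)
```

               <p>Only some of these comparisons are of biological interest, for example the test versus control comparison within a single genotype.</p>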
            </sec>
            <sec>
               <st>
                  <p>Expresso GOT model</p>
               </st>
               <p>We wanted to estimate the gene-treatment effects separately from gene-genotype-treatment interaction effects using just one model. To do this, we designed the gene-genotype-treatment (GOT) model, an alternative set of log-linear ANOVA mixed models, for the elevated CO<sub>2 </sub>experiment, in which we unfold the genotype information from the plant sample factor in the GP model. This resulted in a normalization model that includes the genotype effect (<it>O</it><sub><it>o</it></sub>) with 4 levels (Col-0, Cvi-0, WS, and Th) and the basic treatment effect (<it>T</it><sub><it>t</it></sub>) with 2 levels (test and control). The random array effect (<it>A</it><sub><it>a</it></sub>), however, needs to be removed from the model since it is confounded with the genotype effect.</p>
               <p>The normalization model is</p>
               <p><it>y</it><sub><it>sotdab </it></sub>= <it>&#956; </it>+ <it>O</it><sub><it>o </it></sub>+ <it>T</it><sub><it>t </it></sub>+ <it>D</it><sub><it>d </it></sub>+ (<it>O </it>&#215; <it>T</it>)<sub><it>ot </it></sub>+ <it>B</it><sub><it>ba </it></sub>+ <it>r</it><sub><it>sotdab</it></sub>.</p>
               <p>Each <it>y</it><sub><it>sotdab </it></sub>value is the log<sub>2</sub>-transformed intensity of spot <it>s </it>for genotype <it>o </it>and treatment <it>t </it>within the dye <it>d </it>image in block <it>b </it>of array <it>a</it>. The global mean <it>&#956; </it>is as in the GP model. The fixed effects in the model are the genotype effect <it>O</it><sub><it>o</it></sub>, where <it>o </it>indexes the genotype (organism), the treatment effect <it>T</it><sub><it>t</it></sub>, where <it>t </it>is the treatment, and the dye effect <it>D</it><sub><it>d</it></sub>, where <it>d </it>has two values for the two dyes. The random effects in the model are the interaction effect (<it>O </it>&#215; <it>T</it>)<sub><it>ot </it></sub>of genotype <it>o </it>with treatment <it>t</it>, and the block effect <it>B</it><sub><it>ba</it></sub>, where <it>b </it>identifies the block within microarray <it>a</it>. The model residual is <it>r</it><sub><it>sotdab</it></sub>. This differs from the normalization model in Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, in that it incorporates genotype (organism), dye, and block effects. It is a refinement of the Expresso normalization model in Watkinson, et al., <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, in that it has no printing pin effect, and includes the genotype and block effects.</p>
               <p>The second stage of the analysis uses the residual values <it>r</it><sub><it>sotdab </it></sub>computed in the first stage to estimate the interaction among an individual gene <it>g</it>, each genotype <it>o</it>, and each treatment <it>t</it>, at a significance level &#8804; &#945; = 0.05. Index <it>g </it>is added to the residual values <it>r</it><sub><it>sotdab</it></sub>, yielding <it>r</it><sub><it>gsotdab</it></sub>. The value of <it>g </it>is determined using the mapping of <it>s </it>index values to <it>g </it>index values. The gene model is</p>
               <p><it>r</it><sub><it>gsotdab </it></sub>= <it>G</it><sub><it>g </it></sub>+ (<it>G </it>&#215; <it>O</it>)<sub><it>go </it></sub>+ (<it>G </it>&#215; <it>T</it>)<sub><it>gt </it></sub>+ (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>got </it></sub>+ (<it>G </it>&#215; <it>D</it>)<sub><it>gd </it></sub>+ &#955;<sub><it>gsotdab</it></sub></p>
               <p>Here, <it>G</it><sub><it>g </it></sub>is as for the GP model. The interactions (<it>G </it>&#215; <it>O</it>)<sub><it>go </it></sub>of gene <it>g </it>with genotype <it>o</it>, (<it>G </it>&#215; <it>T</it>)<sub><it>gt </it></sub>of gene <it>g </it>with treatment <it>t</it>, (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>got </it></sub>of gene <it>g </it>with genotype <it>o </it>and treatment <it>t</it>, and (<it>G </it>&#215; <it>D</it>)<sub><it>gd </it></sub>of gene <it>g </it>with dye <it>d </it>are fixed effects. The &#955;<sub><it>gsotdab </it></sub>values are stochastic errors. This differs from the gene model in Wolfinger, et al., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, in that it incorporates interactions between gene and genotype, between gene and genotype with treatment, and between gene and dye. It refines the Expresso gene model in Watkinson, et al., <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to include interactions between gene and genotype and between gene and genotype with treatment.</p>
               <p>We do pairwise comparisons as in the GP model, but there are two plausible classes of null hypotheses to test within the GOT gene model. If <it>&#964; </it>is the number of treatments, then the class 1 null hypothesis for gene <it>g </it>and comparison <it>i</it>,<it>j</it>, where 1 &#8804; <it>i </it>&lt;<it>j </it>&#8804; <it>&#964;</it>, is</p>
               <p><it>H</it><sub>1</sub>: (<it>G </it>&#215; <it>T</it>)<sub><it>gi </it></sub>= (<it>G </it>&#215; <it>T</it>)<sub><it>gj</it></sub>.</p>
               <p>The difference (<it>G </it>&#215; <it>T</it>)<sub><it>gi </it></sub>- (<it>G </it>&#215; <it>T</it>)<sub><it>gj </it></sub>is the estimate of the log<sub>2</sub>(fold change) of gene <it>g </it>in the <it>T</it><sub><it>i </it></sub>versus <it>T</it><sub><it>j </it></sub>comparison. This particular comparison looks for gene-treatment effects that are independent of genotype.</p>
               <p>We can still estimate the expression level of each gene with respect to a specific genotype level by computing the pair-wise least square mean differences of gene-genotype-treatment interaction effects. The class 2 null hypothesis for gene <it>g</it>, genotype <it>o</it>, and comparison <it>i</it>, <it>j</it>, where 1 &#8804; <it>i </it>&lt;<it>j </it>&#8804; <it>&#964;</it>, is</p>
               <p><it>H</it><sub>2</sub>: (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>goi </it></sub>= (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>goj</it></sub>.</p>
               <p>The difference (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>goi </it></sub>- (<it>G </it>&#215; <it>O </it>&#215; <it>T</it>)<sub><it>goj </it></sub>is the estimate of the log<sub>2 </sub>(fold change) of gene <it>g </it>of a specific genotype <it>o </it>for the <it>T</it><sub><it>i </it></sub>versus <it>T</it><sub><it>j </it></sub>comparison. The analysis also yields a <it>p</it>-value for the statistical confidence in each difference.</p>
               <p>The GOT model applies to Experiment 2 in a straightforward way. There are 12 arrays, two treatments, and four genotypes, namely, Col-0, Cvi-0, WS, and Th.</p>
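               <p>The distinction between the two classes of comparison can be illustrated with a small sketch. As a simplifying assumption, it works from plain cell means of the residuals of one gene for each genotype and treatment, rather than from the least square means of the mixed model, and it takes the genotype-independent difference to be the average of the genotype-specific differences, which holds for least square means in a balanced design. The function names and values are hypothetical.</p>

```python
from collections import defaultdict

def cell_means(gene_residuals):
    """Mean residual per (genotype, treatment) cell for one gene."""
    cells = defaultdict(list)
    for genotype, treatment, value in gene_residuals:
        cells[(genotype, treatment)].append(value)
    return {cell: sum(v) / len(v) for cell, v in cells.items()}

def class2_difference(means, genotype, t_i, t_j):
    """Genotype-specific log2(fold change) for the t_i versus t_j comparison."""
    return means[(genotype, t_i)] - means[(genotype, t_j)]

def class1_difference(means, t_i, t_j):
    """Treatment log2(fold change) averaged over all genotypes."""
    genotypes = sorted({genotype for genotype, _ in means})
    diffs = [class2_difference(means, o, t_i, t_j) for o in genotypes]
    return sum(diffs) / len(diffs)

# Hypothetical residuals for one gene in two genotypes.
gene_residuals = [
    ("Col-0", "test", 1.0), ("Col-0", "test", 1.2),
    ("Col-0", "control", 0.1), ("Col-0", "control", -0.1),
    ("WS", "test", 0.4), ("WS", "control", 0.0),
]
means = cell_means(gene_residuals)
col0_specific = class2_difference(means, "Col-0", "test", "control")
overall = class1_difference(means, "test", "control")
```

               <p>In this illustration the gene responds more strongly in one genotype than in the other, so the genotype-specific estimates differ from the overall treatment estimate.</p>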
            </sec>
         </sec>
         <sec>
            <st>
               <p>TM4 microarray software suite</p>
            </st>
            <p>The TM4 microarray software suite consists of several components freely available from the TM4 web site <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. In this research, we employed these components: ExpressConverter, Microarray Data Analysis Software (MIDAS), and Microarray Experiment Viewer (MEV). ExpressConverter converts microarray data from various data formats, such as the GenePix Results (GPR) format, to the MEV format, which is used by MIDAS and MEV. The MEV format includes only integrated intensity values (IIV), which are the kind of intensity values expected by all TM4 components.</p>
         </sec>
         <sec>
            <st>
               <p>MIDAS data normalization methods and filters</p>
            </st>
            <p>Low intensity and saturated spots are marked by quantitation programs. These spots are filtered out from the data before doing any further normalization or statistical analysis. Data normalization methods proceed from the assumption that only a relatively small proportion of the genes change significantly in expression level between the two hybridized mRNA samples. This assumption is reasonable for these data sets since the hybridizations and subsequent analysis address nearly all <it>Arabidopsis thaliana </it>genes. The MIDAS component of TM4 provides a number of data normalization methods and filters and supports applying them in a pipelined fashion <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>.</p>
            <sec>
               <st>
                  <p>Total intensity normalization</p>
               </st>
               <p>While our assumption implies that the average measured intensities of the two channels of a cDNA or oligonucleotide microarray should be almost the same, these averages are often significantly different, due to differences in the inherent fluorescence of the two dyes. The total intensity normalization step in TM4 is a straightforward means to eliminate this global dye bias. For each spot <it>i</it>, where 1 &#8804; <it>i </it>&#8804; <it>n</it>, let <it>R</it><sub><it>i </it></sub>and <it>G</it><sub><it>i </it></sub>be the measured intensities of the spot in the two channels. The normalized intensity data for spot <it>i </it>is <m:math name="1471-2105-7-215-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>G</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGhbWrgaqbamaaBaaaleaacqWGPbqAaeqaaaaa@2F56@</m:annotation></m:semantics></m:math> = &#954;<it>G</it><sub><it>i </it></sub>and <m:math name="1471-2105-7-215-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msub><m:msup><m:mi>R</m:mi><m:mo>&#8242;</m:mo></m:msup><m:mi>i</m:mi></m:msub></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGsbGugaqbamaaBaaaleaacqWGPbqAaeqaaaaa@2F6C@</m:annotation></m:semantics></m:math> = <it>R</it><sub><it>i</it></sub>, where &#954; is the normalization factor <m:math name="1471-2105-7-215-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mi>&#954;</m:mi><m:mo>=</m:mo><m:mo stretchy="false">(</m:mo><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:msubsup><m:mrow><m:msub><m:mi>R</m:mi><m:mi>i</m:mi></m:msub></m:mrow></m:mstyle><m:mo stretchy="false">)</m:mo><m:mo>/</m:mo><m:mo stretchy="false">(</m:mo><m:mstyle displaystyle="true"><m:msubsup><m:mo>&#8721;</m:mo><m:mrow><m:mi>i</m:mi><m:mo>=</m:mo><m:mn>1</m:mn></m:mrow><m:mi>n</m:mi></m:msubsup><m:mrow><m:msub><m:mi>G</m:mi><m:mi>i</m:mi></m:msub><m:mo stretchy="false">)</m:mo></m:mrow></m:mstyle></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWF6oWAcqGH9aqpcqGGOaakdaaeWaqaaiabdkfasnaaBaaaleaacqWGPbqAaeqaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6gaUbqdcqGHris5aOGaeiykaKIaei4la8IaeiikaGYaaabmaeaacqWGhbWrdaWgaaWcbaGaemyAaKgabeaakiabcMcaPaWcbaGaemyAaKMaeyypa0JaeGymaedabaGaemOBa4ganiabggHiLdaaaa@4680@</m:annotation></m:semantics></m:math>. Quackenbush <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> discusses this normalization in the context of several possible variations for addressing differing channel intensities.</p>
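               <p>The scaling above can be illustrated with a minimal Python sketch. It is not TM4 code; the function name and the four example intensities are hypothetical, and the sketch assumes one positive intensity per channel for each spot.</p>

```python
def total_intensity_normalize(red, green):
    """Rescale the green channel so both channels have the same total intensity."""
    if len(red) != len(green):
        raise ValueError("red and green must hold one value per spot")
    total_green = sum(green)
    if total_green == 0:
        raise ValueError("green channel total is zero; kappa is undefined")
    kappa = sum(red) / total_green           # kappa = sum(R_i) / sum(G_i)
    green_norm = [kappa * g for g in green]  # G'_i = kappa * G_i
    red_norm = list(red)                     # R'_i = R_i (unchanged)
    return red_norm, green_norm, kappa


# Hypothetical intensities for four spots (illustration only)
red = [1200.0, 800.0, 450.0, 3050.0]
green = [1000.0, 500.0, 300.0, 950.0]
red_norm, green_norm, kappa = total_intensity_normalize(red, green)
```

               <p>After this step the two channel totals agree by construction, which is the sense in which the global dye bias is removed.</p>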
            </sec>
            <sec>
               <st>
                  <p>Lowess normalization</p>
               </st>
               <p>Beyond the global dye bias, there is dye bias that depends on the measured spot intensities <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. TM4 constructs a scatter plot, called an RI-plot, of the points (<it>x</it><sub><it>i</it></sub>, <it>y</it><sub><it>i</it></sub>), where 1 &#8804; <it>i </it>&#8804; <it>n</it>, given by <it>x</it><sub><it>i </it></sub>= log<sub>10</sub>(<it>R</it><sub><it>i</it></sub><it>G</it><sub><it>i</it></sub>) and <it>y</it><sub><it>i </it></sub>= log<sub>2</sub>(<it>R</it><sub><it>i</it></sub>/<it>G</it><sub><it>i</it></sub>). Under our assumption, the RI-plot should be very nearly symmetric with respect to the line <it>y </it>= 0. In lowess normalization, TM4 applies the lowess method of Cleveland <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> to fit a locally weighted regression curve to the RI-plot; TM4 then adjusts spot intensities to eliminate any systematic intensity-dependent bias. Additional details on correcting intensity-dependent bias are found in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
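               <p>The idea can be sketched in Python as follows. This is a simplified stand-in for lowess, not the TM4 implementation: it fits a tricube-weighted local line around each point and omits the robustness iterations of Cleveland's method. The function names and the smoothing fraction of 0.5 are assumptions made for illustration, and all intensities are assumed to be positive.</p>

```python
import math


def local_linear_fit(xs, ys, frac=0.5):
    """Tricube-weighted local linear regression evaluated at every x.

    Simplified lowess: no robustness iterations are performed.
    """
    n = len(xs)
    k = min(n, max(2, math.ceil(frac * n)))  # neighbours used per local fit
    fitted = []
    for x0 in xs:
        dists = sorted(abs(x - x0) for x in xs)
        h = dists[k - 1] or 1e-12  # bandwidth: distance to the k-th nearest point
        # Tricube weights; points at or beyond the bandwidth get weight 0
        ws = [(1 - min(abs(x - x0) / h, 1.0) ** 3) ** 3 for x in xs]
        sw = sum(ws)
        mx = sum(w * x for w, x in zip(ws, xs)) / sw
        my = sum(w * y for w, y in zip(ws, ys)) / sw
        sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
        sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
        slope = sxy / sxx if sxx > 0 else 0.0
        fitted.append(my + slope * (x0 - mx))
    return fitted


def lowess_normalize(red, green, frac=0.5):
    """Return log2 ratios with the fitted intensity-dependent trend removed."""
    xs = [math.log10(r * g) for r, g in zip(red, green)]  # x_i = log10(R_i * G_i)
    ys = [math.log2(r / g) for r, g in zip(red, green)]   # y_i = log2(R_i / G_i)
    trend = local_linear_fit(xs, ys, frac)
    return [y - t for y, t in zip(ys, trend)]
```

               <p>Subtracting the fitted trend from each log ratio centres the RI-plot on the line <it>y </it>= 0 at every intensity, which is the symmetry the normalization aims for.</p>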
            </sec>
            <sec>
               <st>
                  <p>Standard deviation regularization</p>
               </st>
               <p>After total intensity and lowess normalizations eliminate dye bias on a global (per microarray) scale, TM4 employs standard deviation regularization to ensure that the per-block variances of log(<it>R</it><sub><it>i</it></sub>/<it>G</it><sub><it>i</it></sub>) values are the same <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B28">28</abbr></abbrgrp>. Quackenbush <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> provides the formulas for this normalization step.</p>
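               <p>A minimal Python sketch of the idea follows. It rescales the log ratios of each block so that all blocks share one standard deviation, taken here to be the geometric mean of the per-block standard deviations. That target, the function name, and the block labels are assumptions of the sketch; the exact formulas are those given in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and may differ in detail.</p>

```python
import math
import statistics


def sd_regularize(log_ratios, blocks):
    """Rescale log ratios so that every block has the same standard deviation.

    Each block needs at least two values and a non-zero spread.
    The common target (geometric mean of block SDs) is an assumption.
    """
    by_block = {}
    for value, block in zip(log_ratios, blocks):
        by_block.setdefault(block, []).append(value)
    sds = {b: statistics.stdev(v) for b, v in by_block.items()}
    target = math.exp(sum(math.log(s) for s in sds.values()) / len(sds))
    return [v * target / sds[b] for v, b in zip(log_ratios, blocks)]


# Hypothetical log ratios from two blocks (labels 0 and 1)
log_ratios = [-1.0, 0.0, 1.0, -2.0, 0.0, 2.0]
blocks = [0, 0, 0, 1, 1, 1]
scaled = sd_regularize(log_ratios, blocks)
```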
            </sec>
            <sec>
               <st>
                  <p>Low intensity filtering</p>
               </st>
               <p>Since the relative error in the log(<it>R</it><sub><it>i</it></sub>/<it>G</it><sub><it>i</it></sub>) values increases if <it>R</it><sub><it>i </it></sub>or <it>G</it><sub><it>i </it></sub>is close to background levels, spots with low intensities are filtered out. Quackenbush <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> provides additional details, which essentially require that both the <it>R</it><sub><it>i </it></sub>and <it>G</it><sub><it>i </it></sub>intensities be more than two standard deviations above the respective background levels.</p>
            </sec>
            <sec>
               <st>
                  <p>The MIDAS pipeline</p>
               </st>
               <p>We applied a MIDAS pipeline consisting of total intensity normalization, lowess normalization, standard deviation regularization, and low intensity filtering to both microarray data sets. MIDAS default parameters were used throughout; by default, the low intensity filter removes spots with <it>R</it><sub><it>i</it></sub><it>G</it><sub><it>i </it></sub>&lt; 10,000.</p>
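               <p>The default cut-off can be expressed as a short Python sketch. It is illustrative only and is not MIDAS code; the function name and the example spots are hypothetical, and the sketch assumes that a spot whose channel product equals the cut-off is retained.</p>

```python
def low_intensity_filter(spots, cutoff=10000.0):
    """Keep (R_i, G_i) pairs whose channel product reaches the cut-off."""
    # Spots with R_i * G_i below the cut-off are dropped
    return [(r, g) for r, g in spots if r * g >= cutoff]


# Hypothetical (R_i, G_i) pairs
spots = [(50.0, 100.0), (100.0, 100.0), (500.0, 20.0), (90.0, 110.0)]
kept = low_intensity_filter(spots)
```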
            </sec>
         </sec>
         <sec>
            <st>
               <p>TM4 MEV analysis</p>
            </st>
            <p>The Multi Experiment Viewer (MEV) component of TM4 provides a number of statistical analyses and clustering algorithms to identify differentially expressed genes. We report results from the one-class <it>t</it>-test analysis applied to the output of the MIDAS pipeline. This test assumes that the paired differences between the treated and control groups are normally distributed. Since the two intensities measured from the same spot are correlated, they form a natural pair, and the two-group comparison can be carried out as a one-class <it>t</it>-test.</p>
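            <p>For a single gene, the one-class <it>t</it> statistic can be sketched in Python as follows. This is not MEV code: the function name is hypothetical, the inputs are assumed to be the replicate log ratios of one gene, and the null hypothesis is assumed to be a mean log ratio of zero. Converting the statistic to a p-value requires the <it>t</it> distribution, which is left to a statistics library.</p>

```python
import math
import statistics


def one_class_t(log_ratios, mu0=0.0):
    """Return (t statistic, degrees of freedom) for H0: mean log ratio == mu0."""
    n = len(log_ratios)
    if n in (0, 1):
        raise ValueError("at least two replicate log ratios are required")
    sd = statistics.stdev(log_ratios)  # sample standard deviation
    if sd == 0:
        raise ValueError("zero variance; the t statistic is undefined")
    t = (statistics.mean(log_ratios) - mu0) / (sd / math.sqrt(n))
    return t, n - 1


# Hypothetical replicate log2 ratios for one gene
t_stat, df = one_class_t([1.0, 2.0, 3.0])
```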
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AAS performed the data preprocessing, Expresso analysis of all data sets, developed the database of statistical results, and drafted the manuscript. SPM performed the drought stress experiment (experiment 1), while PL performed the elevated CO<sub>2 </sub>experiment (experiment 2). PL performed the qRT-PCR of 192 genes from experiment 2. WS and PL performed TM4 analysis of experiments 1 and 2 respectively. LSH supervised the data analysis. LSH, HJB, and RG conceived of the study and coordinated the work. All authors read and approved the manuscript.</p>
         <suppl id="S1">
            <title>
               <p>Additional File 1</p>
            </title>
            <text>
               <p><b>Intensity-dependent dye bias in RI-plots</b>. Supplementary Figure <figr fid="F1">1</figr> is a PDF file that contains six RI-plots that illustrate specific intensity-dependent dye bias as the microarray data is processed through the normalization steps.</p>
            </text>
            <file name="1471-2105-7-215-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional File 2</p>
            </title>
            <text>
               <p><b>Genes subjected to qRT-PCR</b>. Supplementary Table <tblr tid="T1">1</tblr> is a PDF file that contains the annotation of genes subjected to qRT-PCR and the functional categories represented. For verification of microarray results in Experiment 2, Li <it>et al</it>. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> performed real-time quantitative reverse-transcriptase PCR (qRT-PCR) for selected genes: 55 in Col-0, 52 in Cvi-0, 59 in WS, and 26 in Th. The AT numbers of those genes, their annotation, and their categorization into biological functions are in the table.</p>
            </text>
            <file name="1471-2105-7-215-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The work has been supported by the National Science Foundation (DBI-0223905 and BIO/IBN 0219322) and Virginia Tech and UIUC institutional funds.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Using ANOVA to Analyze Microarray Data</p>
            </title>
            <aug>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>BioTechniques</source>
            <pubdate>2004</pubdate>
            <volume>37</volume>
            <issue>2</issue>
            <fpage>173</fpage>
            <lpage>177</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfinger</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wolfinger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hamadeh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bushel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Afshari</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Paules</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>625</fpage>
            <lpage>637</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/106652701753307520</pubid>
                  <pubid idtype="pmpid" link="fulltext">11747616</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Linear Models for Microarray Data Analysis: Hidden Similarities and Differences</p>
            </title>
            <aug>
               <au>
                  <snm>Kerr</snm>
                  <fnm>MK</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <issue>6</issue>
            <fpage>891</fpage>
            <lpage>901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/106652703322756131</pubid>
                  <pubid idtype="pmpid" link="fulltext">14980016</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A Comparative Review of Statistical Methods for Discovering Differentially Expressed Genes in Replicated Microarray Experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>4</issue>
            <fpage>546</fpage>
            <lpage>554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.4.546</pubid>
                  <pubid idtype="pmpid" link="fulltext">12016052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Comparison of Li-Wong and Loglinear Mixed Models for the Statistical Analysis of Oligonucleotide Arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Chu</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Weir</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wolfinger</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>4</issue>
            <fpage>500</fpage>
            <lpage>506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg435</pubid>
                  <pubid idtype="pmpid" link="fulltext">14990445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Statistical Tests for Differential Expression in cDNA Microarray Experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Churchill</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>210</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">154570</pubid>
                  <pubid idtype="pmpid" link="fulltext">12702200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Reassessing Design and Analysis of Two-colour Microarray Experiments using Mixed Effects Models</p>
            </title>
            <aug>
               <au>
                  <snm>Rosa</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Steibel</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Tempelman</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>Comparative and Functional Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>123</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cfg.464</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Model Selection and Efficiency Testing for Normalization of cDNA Microarray Data</p>
            </title>
            <aug>
               <au>
                  <snm>Futschik</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Crompton</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>R60</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">507885</pubid>
                  <pubid idtype="pmpid" link="fulltext">15287982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Microarrays in Ecological Research: A Case Study of a cDNA Microarray for Plant-Herbivore Interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Held</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gase</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>IT</fnm>
               </au>
            </aug>
            <source>BMC Ecology</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <issue>13</issue>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jeong</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Khodursky</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carlin</snm>
                  <fnm>BP</fnm>
               </au>
            </aug>
            <source>Comparative and Functional Genomics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>432</fpage>
            <lpage>444</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cfg.416</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias</p>
            </title>
            <aug>
               <au>
                  <snm>Bolstad</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Irizarry</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Astrand</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>2</issue>
            <fpage>185</fpage>
            <lpage>193</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/19.2.185</pubid>
                  <pubid idtype="pmpid" link="fulltext">12538238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Open Source Software for the Analysis of Microarray Data</p>
            </title>
            <aug>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gentleman</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BioTechniques</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>s45</fpage>
            <lpage>s51</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>TM4: A Free, Open-Source System for Microarray Data Management and Analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Saeed</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sharov</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Bhagabati</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Braisted</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Klapa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Currier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Thiagarajan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sturn</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Snuffin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rezantsev</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Popov</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ryltsov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kostukovich</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Borisovsky</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Vinsavich</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Trush</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BioTechniques</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>374</fpage>
            <lpage>378</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Microarray Data Normalization and Transformation</p>
            </title>
            <aug>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature Genetics Supplement</source>
            <pubdate>2002</pubdate>
            <volume>32</volume>
            <fpage>496</fpage>
            <lpage>501</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/ng1032</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Prognostic Classification of Relapsing Favorable Histology Wilms Tumor using cDNA Microarray Expression Profiling and Support Vector Machines</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Greer</snm>
                  <fnm>BT</fnm>
               </au>
               <au>
                  <snm>Whiteford</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Natrajan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kelsey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pritchard-Jones</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genes, Chromosomes and Cancer</source>
            <pubdate>2004</pubdate>
            <volume>41</volume>
            <fpage>65</fpage>
            <lpage>79</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/gcc.20060</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Analysis of the Major Patterns of B Cell Gene Expression Changes in Response to Short-Term Stimulation with 33 Single Ligands</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hart</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>YA</fnm>
               </au>
               <au>
                  <snm>Mock</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ke</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Alexander</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Grossoehme</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hsueh</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Scheuermann</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Fruman</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Seaman</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Subramaniam</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sternweis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>The Journal of Immunology</source>
            <pubdate>2004</pubdate>
            <volume>173</volume>
            <fpage>7141</fpage>
            <lpage>7149</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15585835</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A Neutral Model of Transcriptome Evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Khaitovich</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lachmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hellmann</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Enard</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Muetzel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wirkner</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Ansorge</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>P&#228;&#228;bo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <issue>5</issue>
            <fpage>682</fpage>
            <lpage>689</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1371/journal.pbio.0020132</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Photosynthetic Acclimation is Reflected in Specific Patterns of Gene Expression in Drought-Stressed Loblolly Pine</p>
            </title>
            <aug>
               <au>
                  <snm>Watkinson</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Sioson</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Vasquez-Robinet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Shukla</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heath</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Ramakrishnan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chevone</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>van Zyl</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Egertsdotter</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Sederoff</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Grene</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Plant Physiology</source>
            <pubdate>2003</pubdate>
            <volume>133</volume>
            <issue>4</issue>
            <fpage>1702</fpage>
            <lpage>1716</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">300725</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681533</pubid>
                  <pubid idtype="doi">10.1104/pp.103.026914</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Expresso and Chips: Creating a Next Generation Microarray Experiment Management System</p>
            </title>
            <aug>
               <au>
                  <snm>Sioson</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Watkinson</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Vasquez-Robinet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shukla</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ramakrishnan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Heath</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Grene</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chevone</snm>
                  <fnm>BI</fnm>
               </au>
               <au>
                  <snm>Kafadar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>LT</fnm>
               </au>
            </aug>
            <source>Proceedings of the Next Generation Software Systems Workshop, 17th International Parallel and Distributed Processing Symposium (IPDPS '03)</source>
            <publisher>Nice, France</publisher>
            <pubdate>2003</pubdate>
            <fpage>209b</fpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Studying the Functional Genomics of Stress Responses in Loblolly Pine using the Expresso Microarray Management System</p>
            </title>
            <aug>
               <au>
                  <snm>Heath</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Ramakrishnan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sederoff</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Whetten</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Chevone</snm>
                  <fnm>BI</fnm>
               </au>
               <au>
                  <snm>Struble</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Jouenne</snm>
                  <fnm>VY</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>van Zyl</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Grene</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comparative and Functional Genomics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>226</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/cfg.169</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Arabidopsis Oligonucleotide Microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Galbraith</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <url>http://www.ag.arizona.edu/microarray/</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Phospholipase D alpha is Involved in Drought Stress Signaling in <it>Arabidopsis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Mane</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Vasquez-Robinet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sioson</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Heath</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Grene</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Poster presented at the International Conference on Plant Lipid-Mediated Signaling, Raleigh, NC</source>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Response Diversity of <it>Arabidopsis thaliana</it> Ecotypes and <it>Thellungiella halophila</it> in elevated CO<sub>2</sub> in the field</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sioson</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Mane</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Ulanov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Grothaus</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Heath</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Murali</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Bohnert</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Grene</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Manuscript submitted</source>
            <pubdate>2005</pubdate>
         </bibl>
         <bibl id="B24">
            <title>
               <p>TM4</p>
            </title>
            <url>http://www.tm4.org/</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Normalization for cDNA Microarray Data: A Robust Composite Method Addressing Single and Multiple Slide Systematic Variation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Dudoit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ngai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Speed</snm>
                  <fnm>TP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>4</issue>
            <fpage>e15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">100354</pubid>
                  <pubid idtype="pmpid" link="fulltext">11842121</pubid>
                  <pubid idtype="doi">10.1093/nar/30.4.e15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Within the Fold: Assessing Differential Expression Measures and Reproducibility in Microarray Assays</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hasseman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sharov</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Saeed</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yeatman</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>11</issue>
            <fpage>1</fpage>
            <lpage>12</lpage>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Robust Locally Weighted Regression and Smoothing Scatterplots</p>
            </title>
            <aug>
               <au>
                  <snm>Cleveland</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>J Amer Stat Assoc</source>
            <pubdate>1979</pubdate>
            <volume>74</volume>
            <fpage>829</fpage>
            <lpage>836</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2286407</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Variance Stabilization Applied to Microarray Data Calibration and to the Quantification of Differential Expression</p>
            </title>
            <aug>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>von Heydebreck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sultmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Poustka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>Suppl 1</issue>
            <fpage>S96</fpage>
            <lpage>S104</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12169536</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
