<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2002-3-3-reviews1005</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Minireview</dochead>
      <bibl>
         <title>
            <p>Comparative genomics of <it>Arabidopsis</it> and maize: prospects and limitations</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Brendel</snm>
               <fnm>Volker</fnm>
               <insr iid="I1"/>
               <email>vbrendel@iastate.edu</email>
            </au>
            <au id="A2">
               <snm>Kurtz</snm>
               <fnm>Stefan</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Walbot</snm>
               <fnm>Virginia</fnm>
               <insr iid="I3"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Zoology and Genetics and Department of Statistics, Iowa State University, Ames, IA 50010, USA</p>
            </ins>
            <ins id="I2">
               <p>Technische Fakult&#228;t, Universit&#228;t Bielefeld, D-33501 Bielefeld, Germany</p>
            </ins>
            <ins id="I3">
               <p>Department of Biological Sciences, Stanford University, Stanford, CA 94305-5020, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2002</pubdate>
         <volume>3</volume>
         <issue>3</issue>
         <fpage>reviews1005.1</fpage>
         <lpage>reviews1005.6</lpage>
         <url>http://genomebiology.com/2002/3/3/reviews/1005</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2002-3-3-reviews1005</pubid>
               <pubid idtype="pmpid">11897028</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>14</day>
               <month>2</month>
               <year>2002</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2002</year>
         <collab>BioMed Central Ltd</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>The completed <it>Arabidopsis</it> genome seems to be of limited value as a model for maize genomics. In addition to the expansion of repetitive sequences in maize and the lack of genomic micro-colinearity, maize-specific or highly-diverged proteins contribute to a predicted maize proteome of about 50,000 proteins, twice the size of that of <it>Arabidopsis.</it></p>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010019">Plant biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010015">Model organisms</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>Maize (<it>Zea mays</it> L., corn) was domesticated in the highlands of Central Mexico approximately 10,000 years ago [<abbr bid="B1">1</abbr>]. Corn agriculture spread rapidly into diverse climate zones, ranging from 45&#176; N to 45&#176; S, and supported vast Native American civilizations. Today, maize is one of the world's most important crops: for direct human consumption, as a key component of animal feed, and as the source of chemical feed stocks. Grass species (including maize) cover 20% of the terrestrial surface of the earth, and the grains from maize, rice, wheat, and minor grass crops provide the majority of calories in the human diet [<abbr bid="B2">2</abbr>].</p>
         <p>Since the beginning of the twentieth century, maize has been a model species for genetic analysis, reflecting its unusual biological features. Maize plants produce separate male and female inflorescences, which greatly facilitates experimentally controlled pollination by eliminating the need for emasculation (Figure <figr fid="F1">1</figr>). Large numbers of progeny (300-600 kernels per ear) and the ease of crossing allow a single maize geneticist to generate more than 100,000 outcross progeny per day. Individual plants produce up to 10<sup>7</sup> pollen grains, allowing fine-structural genetic mapping for phenotypes that can be scored at the pollen stage. Using this abundant material and extraordinary natural diversity, early geneticists mapped many genes, uncovered subtle genetic phenomena such as paramutation and imprinting, and made practical contributions to agriculture through the discovery of hybrid vigor and cytoplasmic male sterility.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Maize inflorescences</p>
            </caption>
            <text>
               <p>Maize inflorescences. The separation of <b>(a)</b> female inflorescence (ear) and <b>(b)</b> male inflorescence (tassel) is one of the key features of the maize plant responsible for its pivotal role in plant genetics, greatly simplifying controlled pollination (photos courtesy of Tom Peterson, Iowa State University).</p>
            </text>
            <graphic file="gb-2002-3-3-reviews1005-1"/>
         </fig>
         <p>The beautiful detail evident in meiotic maize chromosomes stimulated a generation of gifted cytogeneticists to identify the physical basis for recombination, to construct linkage maps tied to chromosomes, and to analyze the consequences of chromosome breakage. Of particular importance to current functional genomics was Barbara McClintock's discovery of transposable elements by analyzing the regulation of somatic variegation and germinal mutation in maize. Once maize transposons were molecularly cloned, they provided the means to clone any tagged gene: maize provided the first discovery of many plant-specific gene products and facilitated the cloning of related genes from other flowering plants. The availability of detailed genetic knowledge, a large community of researchers, and ease of gene cloning and genetic analysis make maize the monocotyledenous species of choice for many studies.</p>
         <p>The maize genome is organized into 10 chromosomes (2N = 20), and is about 2.4 &#215; 10<sup>9</sup> base-pairs in total. Sorghum, which is estimated to have diverged from a common ancestor with maize about 15-20 million years ago (MYA), has the same chromosome number, but its genome is about one third of the size. Rice diverged from a common ancestor with maize and sorghum about 50-60 MYA and has 12 chromosomes (2N = 24), comprising a much smaller genome of about 430 million base-pairs. Comparative genomics of these grasses suggests considerable colinearity between their genomes [<abbr bid="B3">3</abbr>]. The size differences of the genomes are presumed to result from the ancestral allotetraploidization (approximate duplication from diploid to tetraploid when two species hybridize) of the maize genome [<abbr bid="B4">4</abbr>] and differences in the expansion and dispersion of repetitive DNA (long terminal repeat retrotransposons, miniature inverted repeat transposons, and other repetitive sequences) [<abbr bid="B5">5</abbr>].</p>
         <p>In December 2000, <it>Arabidopsis thaliana</it> became the first plant species for which the genome was almost entirely sequenced (currently, 117 of an estimated 125 million base-pairs are available, with only centromeric and ribosomal DNA repeat regions as yet unsequenced [<abbr bid="B6">6</abbr>]; reviewed in [<abbr bid="B7">7</abbr>]). Because of its small genome size, ease of transformation, and tolerance of life in a growth chamber, this seemingly lowly weed has emerged as the model flowering plant, ahead of commercially important crops. The choice will be well justified if the evolutionarily recent advent of flowering plants means that most genes found in <it>Arabidopsis</it> prove to be common to all flowering plants. Among the crops, members of the <it>Brassica</it> genus (including <it>B. oleracea</it> and <it>B. rapa,</it> the so-called 'cole-crops', oilseeds, and mustard) are most closely related to <it>Arabidopsis</it> (divergence less than 20 MYA). Gene order seems to be largely conserved, and thus the <it>Arabidopsis</it> genome should prove a powerful tool for studying <it>Brassica</it> genomics [<abbr bid="B8">8</abbr>,<abbr bid="B9">9</abbr>]. Significant colinearity has also been observed between <it>Arabidopsis</it> and soybean [<abbr bid="B10">10</abbr>] (divergence time 100 MYA), and <it>Arabidopsis</it> and tomato [<abbr bid="B11">11</abbr>,<abbr bid="B12">12</abbr>] (divergence time more than 100 MYA). This article assesses the prospects for comparative maize-<it>Arabidopsis</it> genome analysis in view of the greater divergence time (more than 150 MYA) between grasses (which are monocots) and flowering plants (dicots).</p>
      </sec>
      <sec>
         <st>
            <p>Lack of synteny between maize and <it>Arabidopsis</it></p>
         </st>
         <p>The extent of conservation of gene order between the grasses and <it>Arabidopsis</it> can be estimated from three well-studied groups of maize loci: the <it>a1-sh2</it> region [<abbr bid="B13">13</abbr>,<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>], the <it>adh1</it> region [<abbr bid="B16">16</abbr>,<abbr bid="B17">17</abbr>], and the <it>bz</it> locus and its associated genes [<abbr bid="B18">18</abbr>]. The <it>a1-sh2</it> region in maize, sorghum, and rice contains the <it>sh2</it> gene upstream of <it>a1,</it> transcribed in the same direction. The <it>a1</it> gene encodes an NADPH dihydroflavonol reductase required for anthocyanin biosynthesis and <it>sh2</it> encodes an endosperm-expressed ADP glucose pyrophosphorylase important in starch biosynthesis. The two genes are separated by about 140 kilobases (kb) in maize but only about 19 kb in sorghum and rice. Moreover, <it>a1</it> is duplicated in sorghum. Sequences that are highly similar to <it>sh2</it> can be found on <it>Arabidopsis</it> chromosomes 1, 2, 4, and 5. Potential homologs of <it>a1</it> map to <it>Arabidopsis</it> chromosomes 2 and 5, but they are far apart from the potential <it>sh2</it> genes. Recently, two additional genes have been identified in the <it>a1-sh2</it> interval: <it>x1</it> and <it>yz1,</it> which are of unknown function and conserved among maize, rice, and sorghum [<abbr bid="B14">14</abbr>,<abbr bid="B19">19</abbr>].</p>
         <p>Genic regions are generally conserved between the <it>adh1</it> regions of maize and sorghum, although <it>adh1</it> is the only gene with assigned function (alcohol dehydrogenase), and maize is missing three out of ten other potential genes within this region [<abbr bid="B16">16</abbr>]. Whereas the maize region is replete with retrotransposons, gathered into sequence blocks of 14-70 kb and inserted between the potential genes, the sorghum sequence does not contain any retrotransposons. Colinearity with <it>Arabidopsis</it> appears limited to a block of two genes conserved between sorghum and <it>Arabidopsis</it> [<abbr bid="B16">16</abbr>]. Interestingly, the colinearity of this locus pair is interrupted even between maize and rice [<abbr bid="B17">17</abbr>].</p>
         <p>The recently sequenced <it>bz</it> locus of maize and its chromosomal region displays a gene-dense genomic organization very different from <it>adh1,</it> with ten putative genes within a 32 kb stretch that is free of retrotransposons [<abbr bid="B18">18</abbr>]. Although this gene density is similar to that in <it>Arabidopsis,</it> and most of the genes have potential homologs in <it>Arabidopsis</it> according to the genome sequence, no colinearity is evident. Thus, on the basis of our current picture of plant genome organization, micro-colinearity between different genomes may be even more limited than has previously been stated [<abbr bid="B20">20</abbr>].</p>
      </sec>
      <sec>
         <st>
            <p>Proteome comparisons</p>
         </st>
         <p>Although gene order does not appear to be conserved across the monocot-dicot divide, the repertoires of gene products (that is, the typical monocot and dicot proteomes) may be conserved. This hypothesis cannot be fully tested until the complete <it>Arabidopsis</it> genome is matched to a complete monocot genome, but the current collection of maize proteins and genome sequence fragments may provide a clue. We downloaded the entire set of 4,195 maize protein sequence records from GenBank and reduced this collection to a representative, non-redundant set of maize proteins in several steps: firstly, removal of sequences less than 60 amino acids; secondly, removal of organelle-encoded proteins; and thirdly, selection of a single sequence to represent clusters of highly similar entries (including identities resulting from duplications in GenBank; this was done using the novel fast string matching program 'vmatch' [<abbr bid="B21">21</abbr>]; V.B. and S.K., unpublished). The resulting set of 1,143 sequences was compared with a set of 25,617 putative <it>Arabidopsis</it> proteins [<abbr bid="B22">22</abbr>] using BLASTP [<abbr bid="B23">23</abbr>] at moderate stringency (BLAST -e option set to 1e-5). Most of the 117 entries without significant hits were identified as polypeptides encoded by transposable elements. The remaining sequences were matched directly against the <it>Arabidopsis</it> genome using the GeneSeqer spliced alignment program [<abbr bid="B24">24</abbr>] to check for possible gene products not included in the <it>Arabidopsis</it> predicted protein set (only one unannotated <it>Arabidopsis</it> homolog of a maize protein was identified in this way). About 50 candidate maize-specific proteins remained, including several zeins, some predicted products of unknown function, and several other proteins (the latter group are listed in Table <tblr tid="T1">1</tblr>). On the basis of these results, we can give an upper estimate of 90% of maize proteins that have close homologs in <it>Arabidopsis.</it> The distinct maize genes appear to be tissue-specific (endosperm) or involved in maize pathogen-defense responses.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Maize proteins with no obvious homologs in <it>Arabidopsis</it></p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>Protein</p>
                  </c>
                  <c ca="left">
                     <p>GenBank accession number</p>
                  </c>
                  <c ca="left">
                     <p>Function</p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>BETL(2-4)</p>
                  </c>
                  <c ca="left">
                     <p>CAB4466(2-4)</p>
                  </c>
                  <c ca="left">
                     <p>Anti-microbial, endosperm</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Ribosome-inactivating proteins</p>
                  </c>
                  <c ca="left">
                     <p>S11859, CAC16167, P10593, T03942</p>
                  </c>
                  <c ca="left">
                     <p>Anti-microbial, anti-fungal</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Female gametophyte-specific protein ES3</p>
                  </c>
                  <c ca="left">
                     <p>AAK08134</p>
                  </c>
                  <c ca="left">
                     <p>Defensin</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Basal layer anti-fungal peptides</p>
                  </c>
                  <c ca="left">
                     <p>CAC21604, CAC21605, CAC21607</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Trypsin inhibitor</p>
                  </c>
                  <c ca="left">
                     <p>TIZM, TIZM1, S36236</p>
                  </c>
                  <c ca="left">
                     <p>Anti-insect</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RAB-17</p>
                  </c>
                  <c ca="left">
                     <p>S08633</p>
                  </c>
                  <c ca="left">
                     <p>Vesicle traffic</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>FDR3</p>
                  </c>
                  <c ca="left">
                     <p>AAK53546</p>
                  </c>
                  <c ca="left">
                     <p>Iron stress</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ZmGR2(b,c)</p>
                  </c>
                  <c ca="left">
                     <p>BAA7480(6,7)</p>
                  </c>
                  <c ca="left">
                     <p>Gibberellin-responsive</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Aluminum-induced proteins</p>
                  </c>
                  <c ca="left">
                     <p>AAB86493, T01322</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ABA- and ripening-inducible-like protein</p>
                  </c>
                  <c ca="left">
                     <p>T02081</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Bundle-sheath cell specific protein 1</p>
                  </c>
                  <c ca="left">
                     <p>BAB20906</p>
                  </c>
                  <c ca="left">
                     <p>C4 photosynthesis</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Peroxidase K</p>
                  </c>
                  <c ca="left">
                     <p>AAC79955</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Phytase</p>
                  </c>
                  <c ca="left">
                     <p>T04130</p>
                  </c>
                  <c ca="left">
                     <p>Degradation of phytic acid, the main phosphor storage in maize seeds</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ESR1c1</p>
                  </c>
                  <c ca="left">
                     <p>CAA67122</p>
                  </c>
                  <c ca="left">
                     <p>Endosperm-specific</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Teosinte-branched protein 1</p>
                  </c>
                  <c ca="left">
                     <p>AAK30124</p>
                  </c>
                  <c ca="left">
                     <p>Associated with maize domestication (specific alleles)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Globulin 1O</p>
                  </c>
                  <c ca="left">
                     <p>C53234</p>
                  </c>
                  <c ca="left">
                     <p>Storage</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Ae(1,3)</p>
                  </c>
                  <c ca="left">
                     <p>CAB5655(2,3)</p>
                  </c>
                  <c ca="left">
                     <p>Amylase extender; modification of kernel starch composition</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Arabinogalactan protein</p>
                  </c>
                  <c ca="left">
                     <p>AAF43497</p>
                  </c>
                  <c ca="left">
                     <p>Cell-wall component</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Probable membrane protein DAD1</p>
                  </c>
                  <c ca="left">
                     <p>T01578</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>The maize proteins were compared against the <it>Arabidopsis</it> protein set using BLASTP (see text for details). Maize query sequences are listed that did not match any <it>Arabidopsis</it> sequences at the 1e-5 level. Not listed: zeins, some sequences of highly biased composition and putative maize-specific proteins. Brackets () are used to show related entries; for example, BETL2 has GenBank accession number CAB44662, and BETL3 CAB44663.</p>
            </tblfn>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Maize EST analysis</p>
         </st>
         <p>One pivotal strategy for identifying gene products involves sequencing of large sets of expressed sequence tags (ESTs). Many plant genome projects have adopted this approach, and there are currently more than 100,000 EST database entries in the public domain for each of soybean, tomato, <it>Medicago truncatula,</it> maize, <it>Arabidopsis,</it> and rice [<abbr bid="B25">25</abbr>]. To further assess the overlap between the maize and <it>Arabidopsis</it> proteomes, we derived a set of 27,294 maize ESTs with non-redundant open reading frames (ORFs) of at least 120 codons (again using vmatch). The translated ORFs (derived from all six reading frames) were compared to the set of putative <it>Arabidopsis</it> proteins using BLASTP at different stringency levels. As shown in Figure <figr fid="F2">2</figr>, 62-68% of the maize ESTs relate to ORF products that match <it>Arabidopsis</it> proteins, and the total fraction of the <it>Arabidopsis</it> protein set matched by the maize ESTs is 60-73%. Similar numbers were obtained for consensus sequences built from maize EST clusters [<abbr bid="B26">26</abbr>]. Thus, a significant proportion of maize ESTs might encode highly diverged or maize-specific proteins. Some ORF products might not correspond to functional proteins, and incorrect gene prediction models and the as yet partial <it>Arabidopsis</it> protein set may also contribute to incomplete matching. For comparison, the same procedure applied to the <it>Arabidopsis</it> EST set compared to the <it>Arabidopsis</it> protein set gave a matching fraction of 88% or more of 28,161 qualifying ESTs, showing that chance ORFs may account for up to 12% of the unmatched ESTs in <it>Arabidopsis,</it> and presumably also in maize. We can therefore refine the estimate of maize proteins with close homologs in <it>Arabidopsis</it> to 60-90% of the maize proteome. Because ESTs are difficult to derive from genes expressed at low level there may in fact be more unmatched maize proteins to be found.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Comparison of maize proteins predicted from EST sequences with <it>Arabidopsis</it> proteins</p>
            </caption>
            <text>
               <p>Comparison of maize proteins predicted from EST sequences with <it>Arabidopsis</it> proteins. A non-redundant set of protein sequences consisting of at least 120 amino acids each, derived from 27,294 distinct maize ESTs, was compared with 25,617 putative <it>Arabidopsis</it> proteins at different BLASTP stringency levels. The percentages in each pie chart give the fractions of the two sequence sets involved in these matches, at each stringency level.</p>
            </text>
            <graphic file="gb-2002-3-3-reviews1005-2"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>A glimpse of the maize genome</p>
         </st>
         <p>Several approaches are currently being used to provide further sequence data from the maize genome. These sequences are entered into the Genome Survey Sequence (GSS) division of GenBank because the sequencing is for the most part exploratory, at a low redundancy level. Table <tblr tid="T2">2</tblr> summarizes a rough analysis of 11,625 maize GSS entries available as of 1 November 2001. The sequences were obtained by different selection strategies, including genomic sequences flanking Mutator transposon insertions [<abbr bid="B26">26</abbr>], random inserts [<abbr bid="B27">27</abbr>], sequences selected for not being methylated [<abbr bid="B28">28</abbr>], bacterial artificial chromosome (BAC) ends [<abbr bid="B29">29</abbr>], sequences that were genetically mapped [<abbr bid="B30">30</abbr>], and sequences selected for long ORFs using the ORF Rescue vector [<abbr bid="B31">31</abbr>]. Table <tblr tid="T2">2</tblr> gives the result of a BLASTP search (option -e 1e-5) of all ORFs of at least 120 codons derived from the GSSs, compared to the non-redundant maize protein set. It can be seen that the random sequencing approaches (random inserts and BAC ends) produce a large fraction of sequences matching transposable elements, whereas the Mutator transposon insertion, methylation filter, and 'ORF rescue' approaches clearly bias against the recovery of such sequences. More than 80% of the GSS entries with ORFs derived from the former two approaches do not show significant similarities to known maize proteins, and, surprisingly, more than 70% do not match any <it>Arabidopsis</it> proteins (Table <tblr tid="T3">3</tblr>). An intriguing explanation would be that these ORFs correspond to novel or highly diverged maize proteins. It is also possible that some of the ORFs do not correspond to native translation products.</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>Analysis of maize genome survey sequences: a comparison with maize proteins and ESTs</p>
            </caption>
            <tblbdy cols="10">
               <r>
                  <c ca="left">
                     <p>Approach</p>
                  </c>
                  <c ca="center">
                     <p>Number of entries</p>
                  </c>
                  <c ca="center">
                     <p>Unique sequences</p>
                  </c>
                  <c ca="center">
                     <p>wORF</p>
                  </c>
                  <c cspan="4" ca="center">
                     <p>Comparison with maize proteins</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Reference</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="4">
                     <hr/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>%NS</p>
                  </c>
                  <c ca="center">
                     <p>%TE</p>
                  </c>
                  <c ca="center">
                     <p>%HP</p>
                  </c>
                  <c ca="center">
                     <p>%KP</p>
                  </c>
                  <c ca="center">
                     <p>NS %EST</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c cspan="10">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mutator insertions</p>
                  </c>
                  <c ca="center">
                     <p>4412</p>
                  </c>
                  <c ca="center">
                     <p>970</p>
                  </c>
                  <c ca="center">
                     <p>375</p>
                  </c>
                  <c ca="center">
                     <p>93</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>26</p>
                  </c>
                  <c ca="left">
                     <p>[26]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Random inserts</p>
                  </c>
                  <c ca="center">
                     <p>3480</p>
                  </c>
                  <c ca="center">
                     <p>2529</p>
                  </c>
                  <c ca="center">
                     <p>1015</p>
                  </c>
                  <c ca="center">
                     <p>61</p>
                  </c>
                  <c ca="center">
                     <p>38</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>44</p>
                  </c>
                  <c ca="left">
                     <p>[27]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Methylation filter</p>
                  </c>
                  <c ca="center">
                     <p>1692</p>
                  </c>
                  <c ca="center">
                     <p>1083</p>
                  </c>
                  <c ca="center">
                     <p>258</p>
                  </c>
                  <c ca="center">
                     <p>84</p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
                  <c ca="center">
                     <p>2</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>37</p>
                  </c>
                  <c ca="left">
                     <p>[28]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>BAC ends</p>
                  </c>
                  <c ca="center">
                     <p>945</p>
                  </c>
                  <c ca="center">
                     <p>881</p>
                  </c>
                  <c ca="center">
                     <p>454</p>
                  </c>
                  <c ca="center">
                     <p>48</p>
                  </c>
                  <c ca="center">
                     <p>51</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>28</p>
                  </c>
                  <c ca="left">
                     <p>[29]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MPP</p>
                  </c>
                  <c ca="center">
                     <p>669</p>
                  </c>
                  <c ca="center">
                     <p>338</p>
                  </c>
                  <c ca="center">
                     <p>150</p>
                  </c>
                  <c ca="center">
                     <p>80</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>47</p>
                  </c>
                  <c ca="left">
                     <p>[30]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ORFs</p>
                  </c>
                  <c ca="center">
                     <p>399</p>
                  </c>
                  <c ca="center">
                     <p>86</p>
                  </c>
                  <c ca="center">
                     <p>79</p>
                  </c>
                  <c ca="center">
                     <p>76</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
                  <c ca="center">
                     <p>22</p>
                  </c>
                  <c ca="left">
                     <p>[31]</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Other</p>
                  </c>
                  <c ca="center">
                     <p>28</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>33</p>
                  </c>
                  <c ca="center">
                     <p>67</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>All sequences were retrieved from the GenBank GSS database (with the number of database entries given in the second column). Sequences shorter than 360 bp and redundant sequences were removed with the vmatch program [21] (V.B. and S.K., unpublished), resulting in the reduced sequence set sizes given in the column 'Unique sequences'. Of these, only sequences with non-redundant open reading frames of at least 120 codons (with the number of qualifying entries given in the wORF column) were compared to a maize protein set using BLASTP [23]. Entries were classified on the basis of BLASTP results and GenBank keywords as novel (NS), transposable element (TE), hypothetical protein (HP), or known protein (KP). The corresponding columns give the fraction of sequences in each class (percent). The column 'NS %EST' gives the percentage of sequences with novel ORFs matching maize ESTs. MPP, Missouri Mapping Project.</p>
            </tblfn>
         </tbl>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Analysis of maize genome survey sequences: a comparison with <it>Arabidopsis</it> proteins</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c ca="left">
                     <p>Approach</p>
                  </c>
                  <c ca="center">
                     <p>%NS</p>
                  </c>
                  <c ca="center">
                     <p>%TE</p>
                  </c>
                  <c ca="center">
                     <p>%HP</p>
                  </c>
                  <c ca="center">
                     <p>%KP</p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mutator insertions</p>
                  </c>
                  <c ca="center">
                     <p>69</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>24</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Random inserts</p>
                  </c>
                  <c ca="center">
                     <p>63</p>
                  </c>
                  <c ca="center">
                     <p>33</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Methylation filter</p>
                  </c>
                  <c ca="center">
                     <p>81</p>
                  </c>
                  <c ca="center">
                     <p>7</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>BAC ends</p>
                  </c>
                  <c ca="center">
                     <p>47</p>
                  </c>
                  <c ca="center">
                     <p>48</p>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MPP</p>
                  </c>
                  <c ca="center">
                     <p>59</p>
                  </c>
                  <c ca="center">
                     <p>3</p>
                  </c>
                  <c ca="center">
                     <p>34</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ORFs</p>
                  </c>
                  <c ca="center">
                     <p>46</p>
                  </c>
                  <c ca="center">
                     <p>1</p>
                  </c>
                  <c ca="center">
                     <p>49</p>
                  </c>
                  <c ca="center">
                     <p>4</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Other</p>
                  </c>
                  <c ca="center">
                     <p>33</p>
                  </c>
                  <c ca="center">
                     <p>67</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>GSS sequences with unique ORFs were compared with <it>Arabidopsis</it> proteins using BLASTP. Data sets and columns are as in Table <tblr tid="T2">2</tblr>.</p>
            </tblfn>
         </tbl>
         <p>To assess these possibilities, we compared the sequences of novel ORFs with the maize EST set (application of GeneSeqer [<abbr bid="B22">22</abbr>]). The result, that 26-44% of the four large GSS collections match (a still limited collection of) maize ESTs (see Table <tblr tid="T2">2</tblr>), suggests that many of the ORFs do indeed correspond to expressed genes. The remaining fraction may include less abundantly expressed genes. We can estimate the gene fraction accessible by EST sequencing from the EST coverage of GSS-derived ORFs: if the roughly 10,000 novel ORFs in the maize EST set constitute only 40% of the genes, we can anticipate some 25,000 novel maize proteins that are not found in <it>Arabidopsis.</it> It is likely that many of these proteins are derived from gene duplications. The lack of sequence conservation across the monocot-dicot divide suggests that there has been extensive functional divergence after duplication.</p>
      </sec>
      <sec>
         <st>
            <p>The need for a maize genome sequencing project</p>
         </st>
         <p>On the basis of available data, we think that the resource provided by the <it>Arabidopsis</it> genome cannot adequately substitute for more extensive maize genome sequencing. Genome organization is very different between the two plants, and the proteomes may also have significant differences, particularly with respect to agronomically important maize genes involved in plant-pathogen interactions, reproduction, and the development and function of specific tissues. The many exceptions to micro-colinearity even among the grasses suggest that the completion of the rice genome [<abbr bid="B32">32</abbr>] will still not answer many of the questions particular to maize genomics. Beyond questions concerning agronomically important traits, plant biologists also look to maize as a model for the evolution of plant genomes that are not as small and streamlined as those of <it>Arabidopsis</it> and rice [<abbr bid="B33">33</abbr>]. Correspondingly, a maize genome sequencing project will focus on sequencing gene-rich genome fractions first [<abbr bid="B34">34</abbr>], and other crop genome projects are likely to follow. Plant biologists should look forward to very exciting times when whole-genome comparisons become possible, leading to a clearer understanding of the development of plants from their genetic blueprints.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>V.B. and V.W. were supported in part by NSF Plant Genome Research Program grant DBI-9872657. S.K. was partially supported by grant KU 1257/1 from the Deutsche Forschungsgemeinschaft. The authors are grateful to Phil Becraft, Alan Myers, Tom Peterson, Pat Schnable, and Robert Thornburg for critical comments on the manuscript.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The limits of selection during maize domestication.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>R-L</fnm>
               </au>
               <au>
                  <snm>Stec</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lukens</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Doebley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>398</volume>
            <fpage>236</fpage>
            <lpage>239</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/18435</pubid>
                  <pubid idtype="pmpid" link="fulltext">10094045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Evolutionary history of the grasses.</p>
            </title>
            <aug>
               <au>
                  <snm>Kellogg</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>125</volume>
            <fpage>1198</fpage>
            <lpage>1205</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.125.3.1198</pubid>
                  <pubid idtype="pmpid" link="fulltext">11244101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Grasses as a single genetic system. Reassessment 2001.</p>
            </title>
            <aug>
               <au>
                  <snm>Freeling</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>125</volume>
            <fpage>1191</fpage>
            <lpage>1197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.125.3.1191</pubid>
                  <pubid idtype="pmpid" link="fulltext">11244100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>DNA sequence evidence for the segmental allotetraploid origin of maize.</p>
            </title>
            <aug>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Doebley</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>6809</fpage>
            <lpage>6814</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21240</pubid>
                  <pubid idtype="pmpid" link="fulltext">11038553</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.13.6809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Of genes and genomes and the origin of maize.</p>
            </title>
            <aug>
               <au>
                  <snm>White</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doebley</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>327</fpage>
            <lpage>332</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(98)01524-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">9724966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <cnm>The Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A green chapter in the book of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Walbot</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>794</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048685</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Comparative physical mapping of segments of the genome of <it>Brassica oleracea</it> var. <it>alboglabra</it> that are homoeologous to sequenced regions of chromosomes 4 and 5 of <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>O'Neill</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Bancroft</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2000</pubdate>
            <volume>23</volume>
            <fpage>233</fpage>
            <lpage>243</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.2000.00781.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10929117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p><it>Brassica</it> genomics: a complement to, and early beneficiary of, the <it>Arabidopsis</it> sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Paterson</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Amasino</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Osborn</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Quiros</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>reviews1011.1</fpage>
            <lpage>1011.4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">138917</pubid>
                  <pubid idtype="pmpid" link="fulltext">11276431 </pubid>
                  <pubid idtype="doi">10.1186/gb-2001-2-3-reviews1011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genome organization in dicots: genome duplication in <it>Arabidopsis</it> and synteny between soybean and <it>Arabidopsis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Grant</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cregan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>4168</fpage>
            <lpage>4173</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18185</pubid>
                  <pubid idtype="pmpid" link="fulltext">10759555</pubid>
                  <pubid idtype="doi">10.1073/pnas.070430597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Exploitation of <it>Arabidopsis</it>-tomato synteny to construct a high-resolution map of the ovate-containing region in tomato chromosome 2.</p>
            </title>
            <aug>
               <au>
                  <snm>Ku</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Doganlar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanksley</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Genome</source>
            <pubdate>2001</pubdate>
            <volume>44</volume>
            <fpage>470</fpage>
            <lpage>475</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1139/gen-44-3-470</pubid>
                  <pubid idtype="pmpid" link="fulltext">11444707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p><it>Arabidopsis</it> genome sequence as a tool for functional genomics in tomato.</p>
            </title>
            <aug>
               <au>
                  <snm>Mysore</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Tuori</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>reviews1003.1</fpage>
            <lpage>1003.4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">150440</pubid>
                  <pubid idtype="pmpid" link="fulltext">11178283  </pubid>
                  <pubid idtype="doi">10.1186/gb-2001-2-1-reviews1003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The relationship between the genetic and physical distances of the cloned <it>a1-sh2</it> interval of the <it>Zea mays</it> L. genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Civardi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Xia</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schnable</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Nikolau</snm>
                  <fnm>BJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <fpage>8268</fpage>
            <lpage>8272</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">44587</pubid>
                  <pubid idtype="pmpid" link="fulltext">8058793</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Microcolinearity in sh2-homologous regions of the maize, rice, and sorghum genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>De Oliveira</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Woo</snm>
                  <fnm>S-S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <fpage>3431</fpage>
            <lpage>3435</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">20387</pubid>
                  <pubid idtype="pmpid" link="fulltext">9096411</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.7.3431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Grass genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Francki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>1975</fpage>
            <lpage>1978</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33825</pubid>
                  <pubid idtype="pmpid" link="fulltext">9482817</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.5.1975</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Colinearity and its exceptions in orthologous <it>adh</it> regions of maize and sorghum.</p>
            </title>
            <aug>
               <au>
                  <snm>Tikhonov</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>SanMiguel</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gorenstein</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Avramova</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>7409</fpage>
            <lpage>7414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">22099</pubid>
                  <pubid idtype="pmpid" link="fulltext">10377428</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.13.7409</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The complete sequence of 340 kb of DNA around the rice <it>Adh1-Adh2</it> region reveals interrupted colinearity with maize chromosome 4.</p>
            </title>
            <aug>
               <au>
                  <snm>Tarchini</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Biddle</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wineland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rafalski</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>381</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">139838</pubid>
                  <pubid idtype="pmpid" link="fulltext">10715324</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.3.381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The highly recombinogenic <it>bz</it> locus lies in an unusually gene-rich region of the maize genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Fu</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>XH</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>ZW</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>BZ</fnm>
               </au>
               <au>
                  <snm>Dooner</snm>
                  <fnm>HK</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>8903</fpage>
            <lpage>8908</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">37533</pubid>
                  <pubid idtype="pmpid" link="fulltext">11438686</pubid>
                  <pubid idtype="doi">10.1073/pnas.141221898</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Meiotic recombination across the 140 kb multigenic maize <it>a1-sh2</it> interval.</p>
            </title>
            <aug>
               <au>
                  <snm>Yao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yandeau</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nikolau</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Schnable</snm>
                  <fnm>PS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <inpress/>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Comparative sequence analysis of plant nuclear genomes: microcolinearity and its many exceptions.</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>1021</fpage>
            <lpage>1029</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149046</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899971</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.7.1021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>REPuter: the manifold applications of repeat analysis on a genomic scale.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Choudhuri</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Ohlebusch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Schleiermacher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stoye</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>4633</fpage>
            <lpage>4642</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">92531</pubid>
                  <pubid idtype="pmpid" link="fulltext">11713313</pubid>
                  <pubid idtype="doi">10.1093/nar/29.22.4633</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>TIGR FTP site: <it>Arabidopsis</it> putative proteins</p>
            </title>
            <url>ftp://ftp.tigr.org/pub/data/a_thaliana/ath1/SEQUENCES/ATH1.pep</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Gene structure prediction by spliced alignment of genomic DNA with protein sequences: increased accuracy by differential splice site scoring.</p>
            </title>
            <aug>
               <au>
                  <snm>Usuka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Brendel</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>297</volume>
            <fpage>1075</fpage>
            <lpage>1085</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.3641</pubid>
                  <pubid idtype="pmpid" link="fulltext">10764574</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>dbEST</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/dbEST</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>ZmDB: Maize genome database</p>
            </title>
            <url>http://www.zmdb.iastate.edu/</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Abundance, distribution and transcriptional activity of repetitive elements in the maize genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1660</fpage>
            <lpage>1676</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.188201</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591643</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Rabinowicz</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Schutz</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dedhia</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yordan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Parnell</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Martienssen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>23</volume>
            <fpage>305</fpage>
            <lpage>308</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/15479</pubid>
                  <pubid idtype="pmpid" link="fulltext">10545948</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Maize ZMMBBb STC project</p>
            </title>
            <url>http://www.genome.clemson.edu/projects/stc/maize/ZMMBBb/index.html</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Maize mapping project</p>
            </title>
            <url>http://www.cafnr.missouri.edu/mmp/</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>ISU maize genome project</p>
            </title>
            <url>http://maize.math.iastate.edu/isumaize/homepage.html</url>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A draft sequence of the rice (<it>Oryza sativa</it> ssp. Indica) genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dal</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Chinese Sci Bull</source>
            <pubdate>2001</pubdate>
            <volume>46</volume>
            <fpage>1937</fpage>
            <lpage>1942</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Maize as a model for the evolution of plant nuclear genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gaut</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Le Thierry d'Ennequin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Peek</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Sawkins</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>7008</fpage>
            <lpage>7015</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">34377</pubid>
                  <pubid idtype="pmpid" link="fulltext">10860964</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.13.7008</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>National Science Foundation-sponsored workshop report. Maize genome sequencing project.</p>
            </title>
            <aug>
               <au>
                  <snm>Bennetzen</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>VL</fnm>
               </au>
               <au>
                  <snm>Schnable</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>127</volume>
            <fpage>1572</fpage>
            <lpage>1578</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.127.4.1572</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743101 </pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

