<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2006-7-9-r87</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Patterns of expansion and expression divergence in the plant polygalacturonase gene family</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Kim</snm>
               <fnm>Joonyup</fnm>
               <insr iid="I1"/>
               <email>joonyupkim@wisc.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Shiu</snm>
               <fnm>Shin-Han</fnm>
               <insr iid="I2"/>
               <email>shius@msu.edu</email>
            </au>
            <au id="A3">
               <snm>Thoma</snm>
               <fnm>Sharon</fnm>
               <insr iid="I3"/>
               <email>slthoma@wisc.edu</email>
            </au>
            <au id="A4">
               <snm>Li</snm>
               <fnm>Wen-Hsiung</fnm>
               <insr iid="I4"/>
               <email>whli@chicago.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Patterson</snm>
               <mi>E</mi>
               <fnm>Sara</fnm>
               <insr iid="I1"/>
               <email>spatters@wisc.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Horticulture, Cellular and Molecular Biology Program, University of Wisconsin-Madison, Madison, WI 53706, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Zoology, University of Wisconsin-Madison, Madison, WI 53706, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>9</issue>
         <fpage>R87</fpage>
         <url>http://genomebiology.com/2006/7/9/R87</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17010199</pubid>
               <pubid idtype="doi">10.1186/gb-2006-7-9-r87</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>19</day>
               <month>5</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>26</day>
               <month>7</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>29</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>29</day>
               <month>09</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Kim et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Plant Polygalacturonase evolution</p>
      </shorttitle>
      <shortabs>
         <p>Analysis of Arabidopsis and rice polygalacturonases suggests that polygalacturonases duplicates underwent rapid expression divergence and that the mechanisms of duplication affect the divergence rate.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Polygalacturonases (PGs) belong to a large gene family in plants and are believed to be responsible for various cell separation processes. PG activities have been shown to be associated with a wide range of plant developmental programs such as seed germination, organ abscission, pod and anther dehiscence, pollen grain maturation, fruit softening and decay, xylem cell formation, and pollen tube growth, thus illustrating divergent roles for members of this gene family. A close look at phylogenetic relationships among <it>Arabidopsis </it>and rice PGs accompanied by analysis of expression data provides an opportunity to address key questions on the evolution and functions of duplicate genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We found that both tandem and whole-genome duplications contribute significantly to the expansion of this gene family but are associated with substantial gene losses. In addition, there are at least 21 PGs in the common ancestor of <it>Arabidopsis </it>and rice. We have also determined the relationships between <it>Arabidopsis </it>and rice PGs and their expression patterns in <it>Arabidopsis </it>to provide insights into the functional divergence between members of this gene family. By evaluating expression in five <it>Arabidopsis </it>tissues and during five stages of abscission, we found overlapping but distinct expression patterns for most of the different PGs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Expression data suggest specialized roles or subfunctionalization for each PG gene member. PGs derived from whole genome duplication tend to have more similar expression patterns than those derived from tandem duplications. Our findings suggest that PG duplicates underwent rapid expression divergence and that the mechanisms of duplication affect the divergence rate.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The functions and regulation of cell wall hydrolytic enzymes have intrigued plant scientists for decades. These enzymes cleave the bonds between the polymers that make up the cell wall, and include polygalacturonases (PGs), beta-1, 4-endoglucanases, pectate lyases, pectin methylesterases, and xyloglucan endo-transglycosylases <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. As a consequence of their action, cell wall extensibility and cell-cell adhesion can be altered leading to cell wall loosening that results in cell elongation, sloughing of cells at the root tip, fruit softening, and fruit decay <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Cell separation processes also contribute to important agricultural traits such as pollen dehiscence and abscission of organs including leaves, floral parts, and fruits <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. In addition, these enzymes are hypothesized to be involved in general housekeeping functions in plants <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Among these hydrolytic enzymes, the PGs belong to one of the largest hydrolase families <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. PG activities have been shown to be associated with a wide range of plant developmental programs such as seed germination, organ abscission, pod and anther dehiscence, pollen grain maturation, xylem cell formation, and pollen tube growth <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. Over-expression of a PG in apple (<it>Malus domestica</it>) has resulted in alterations in leaf morphology and premature leaf shedding <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Interestingly, the functions of PGs are not restricted to the control of cell growth and development as they are also reported to be associated with wound responses <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and host-parasite interactions <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. These findings illustrate the divergent and important roles of PGs in plants.</p>
         <p>PGs have been identified in various plants including <it>Arabidopsis</it>, pea and tomato <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B17">17</abbr></abbrgrp>. In both tomato and <it>Arabidopsis </it>it has been determined that many PGs are located within tandem clusters <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B18">18</abbr></abbrgrp>. In addition to tandem duplication, the <it>Arabidopsis </it>genome contains large blocks of related regions derived from whole genome duplication events <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. In this study, we conducted a comparative analysis of PGs from <it>Arabidopsis </it>and rice to address several key questions on the evolution and function of this gene family. We compared the PGs from <it>Arabidopsis </it>and rice to determine the pattern of expansion and the extent of PG losses prior and subsequent to the divergence between these two species. To uncover the mechanisms that contributed to the expansion of this gene family, we examined the distribution of PGs on <it>Arabidopsis </it>chromosomes in conjunction with the large-scale duplicated blocks. Torki <it>et al</it>. <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> have suggested that a group of related PGs tend to be expressed in the flowers and flower buds, while PGs expressed in vegetative tissues belong to other groups. The implication is that the diverse functions of PGs may be a consequence of differential expression. This expression divergence and/or subfunctionalization most likely contribute to the retention of PG duplicates <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. To evaluate the degree of spatial expression divergence between PGs, we conducted RT-PCR analysis on all 66 <it>Arabidopsis </it>PG genes in five non-overlapping tissue types. To supplement the RT-PCR expression data, we also examined expression tags generated from other large-scale sequencing projects. Finally, we analyzed expression at five stages of floral organ abscission to assess the degree of temporal expression divergence among members of this gene family.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Expansion of the PG family in <it>Arabidopsis </it>and rice</p>
            </st>
            <p>To investigate the relationships among PGs and the extent of lineage-specific expansion in rice and <it>Arabidopsis</it>, we identified PGs from the GenBank polypeptide records and the genomes of <it>Arabidopsis </it>and rice (<it>Oryza sativa </it>subsp. <it>indica</it>). All PGs identified contain GH28 domains that are approximately 340 amino acids long and encompass approximately 75% of the average PG coding sequence (for lists of genes used in this analysis, see Figure <figr fid="F1">1</figr> and Additional data files 1,2 and 8). According to the phylogenetic relationships of bacterial, fungal, metazoan, and plant PGs (Additional data file 3), we found that the 66 <it>Arabidopsis </it>and 59 rice PGs fall into three distinct groups (Figure <figr fid="F1">1</figr>, groups A, B, and C). Sixteen of the rice PGs contain more than one glycosyl hydrolase 28 (GH28) domain and were regarded as mis-annotated tandem repeats. It should be noted that the rice PGs were derived from the shotgun sequencing of the <it>O. indica </it>genome that was estimated to be 95% complete <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We identified the nodes that lead to <it>Arabidopsis</it>-specific and rice-specific clades and predict that these represent the divergence point between these two species. We have designated the clades defined by such nodes as AO (<it>Arabidopsis</it>-<it>Oryza</it>) orthologous groups. For example, in the A3 clade there exists one <it>Arabidopsis </it>subclade and one rice subclade, and we predict that only one ancestral A3 sequence was present before the divergence between <it>Arabidopsis </it>and rice. However, gene losses could have occurred and therefore some PGs may be present in the <it>Arabidopsis</it>-rice common ancestor but later lost in either <it>Arabidopsis </it>or rice (Figure <figr fid="F1">1</figr>, arrowheads). Therefore, <it>Arabidopsis </it>(A, indicating loss(es) in rice) and rice (O, indicating loss(es) in <it>Arabidopsis</it>) clades were also identified based on their sister group relationships to the AO clades. Since the clades that we defined are most likely orthologous groups (Figure <figr fid="F1">1</figr>, red circles), the number of clades reflects that there were at least 21 ancestral PGs before the <it>Arabidopsis</it>-rice split. Further expansion of this gene family occurred after the split as suggested by the duplication events in the lineage-specific branches that reside within each clade. It should be noted that some clades such as the A1 clade were not defined based on the AO clade-based criteria because the nodes within had relatively low bootstrap supports (&lt;50%). If we assumed these less well-supported nodes are correct, there are 27 ancestral PGs.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The phylogeny of <it>Arabidopsis </it>and rice PGs</p>
               </caption>
               <text>
                  <p>The phylogeny of <it>Arabidopsis </it>and rice PGs. The amino acid sequences for the glycosyl hydrolase 28 family motif were aligned. The phylogeny was generated using neighbor-joining algorithm with 1,000 bootstrap replicates. Sequences are color-coded according to the key. The plant PGs are classified into three major groups and multiple clades. The clades were defined by identifying nodes representing speciation events (circles, see Results section for criteria). For these nodes, red circles indicate that the bootstrap support for the subtending branches is higher than 50% and indicate the criteria for least number of common ancestral PGs between rice and <it>Arabidopsis</it>. The nodes are labeled with white circles if the bootstrap support is less than 50%. Arrowheads indicate clades that contain only sequences for one of the two plants.</p>
               </text>
               <graphic file="gb-2006-7-9-r87-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Duplication mechanisms accounting for the PG family expansion</p>
            </st>
            <p>Examination of the distribution of the <it>Arabidopsis </it>PGs on all five chromosomes indicates a non-random distribution of many PGs (Figure <figr fid="F2">2</figr>). More than one third of the <it>Arabidopsis </it>PGs (24 of 66) have at least one related sequence within ten predicted genes, and these 24 genes fall into nine clusters that range from two to four genes per cluster (Figure <figr fid="F2">2</figr>, column cluster). In most cases, these physically associated PGs are from the same clades; however, there are five exceptions including genes in clusters 1d, 2b and 3a (Figure <figr fid="F2">2</figr>). In these cases, some members within the cluster are not closest relatives. Besides these 24 tandem repeated sequences, all remaining PGs are at least 100 genes apart. This bimodal distribution of PG physical distances and relationships between closely linked genes suggests that the 24 closely linked PGs are derived from tandem duplications.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Mechanisms of <it>Arabidopsis </it>PG family expansion</p>
               </caption>
               <text>
                  <p>Mechanisms of <it>Arabidopsis </it>PG family expansion. The locations of <it>Arabidopsis </it>PGs are indicated on the <it>Arabidopsis </it>chromosomes. The tandem clusters are also indicated. They are color-coded based on the following scheme: PGs found in both duplicated regions of a block pair (green); PGs found in only one duplicated region of a block pair (red); and no PG is located in these blocks (gray). PGs covered by AGI blocks are either red or green, while PGs covered by BHW but not AGI blocks are with white text and black-boxed background. If PGs are found in both duplicated regions of a block, the gene names are linked. In addition, these gene names are italicized if they belong to the same clade. PGs that are not found in either AGI or BHW blocks are shown in black text. Tandem duplications are indicated by cluster designation. BHW block names were modified from the original designations of Blanc <it>et al</it>. [20]. BHW block names with a prime indicate that they overlap with AGI blocks of the same names. The reference for the block names can be found in <supplr sid="S2">Additional data file 2</supplr>.</p>
               </text>
               <graphic file="gb-2006-7-9-r87-2"/>
            </fig>
            <p>In addition to tandem duplications, it has been shown that the <it>Arabidopsis </it>genome is the product of several rounds of polyploidization or whole-genome duplications <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. To determine the contribution of these large-scale duplications, we mapped <it>Arabidopsis </it>PGs to the duplicated blocks established in two independent studies. The first dataset from the Arabidopsis Genome Initiative <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> contains 31 blocks (AGI blocks), and forty <it>Arabidopsis </it>PGs fall in 16 of the AGI blocks (Figure <figr fid="F2">2</figr>, indicated in red and green). Blocks from the second dataset from Blanc <it>et al</it>. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> are designated as BHW (after Blanc, Hokamp, Wolfe) blocks, and 19 PGs were found in 10 BHW blocks (Figure <figr fid="F2">2</figr>, shaded). The AGI and BHW blocks were identified using different approaches and their combined use increases the coverage of duplicated regions. As a result, nearly 90% (59 out of 66) of <it>Arabidopsis </it>PGs are covered in the 26 AGI and BHW blocks.</p>
            <p>Within these 26 duplicated blocks, 29 PGs are found in both duplicated regions of ten block pairs. To investigate the origin of PGs in these ten block pairs, we conducted similarity searches between regions of each pair to determine if PGs mapped to the corresponding duplicated regions, and if their neighboring genes were arranged collinearly (Figure <figr fid="F3">3</figr>; see also (Additional data file 4) for all comparisons). Sixteen PGs in five of these block pairs are clearly located in such collinear regions, indicating that they were derived from large-scale duplication of their associated blocks. For example, AGI block 23a contains nine PGs in six corresponding duplicated regions that show extensive collinearity (Figure <figr fid="F3">3</figr>). In Figure <figr fid="F3">3b</figr>, At2g41850 and At3g57510 are flanked by paralogous genes that are arranged collinearly, indicating that they were products of a block duplication. This is also true for a tandem cluster of four PGs and a PG singleton shown in Figure <figr fid="F3">3d</figr>. Interestingly, At3g57790 corresponds to At2g43210, a potential pseudogene lacking the signal peptide and the bulk of the PG catalytic domain (Figure <figr fid="F3">3c</figr>). We also observed that there are 23 duplicated block pairs with asymmetrical distribution (Additional data file 4). Among them, 16 block pairs have PGs on only one of the blocks (Figure <figr fid="F2">2</figr> and (Additional data file 4)): ten for AGI and six for BHW blocks. For the remaining seven block pairs, the PGs are found on both blocks but are not arranged in a collinear fashion. Taken together, these findings clearly indicate that many members of the PG family are derived from large-scale duplication events. However, quite a few of them were not retained.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Collinearity of PGs in AGI block 23a</p>
               </caption>
               <text>
                  <p>Collinearity of PGs in AGI block 23a. After locating areas with similarities in the block 23a (see also <supplr sid="S4">Additional data file 4</supplr>), six distinct PG-containing regions were defined. <b>(a) </b>At2g40310 does not have PG in the collinear region. <b>(b) </b>At2g41850 and At3g57510 are located in collinear regions. <b>(c) </b>The 3' end of At3g57790 is highly similar to At2g42310*, a truncated PG that is likely a pseudogene. <b>(d) </b>A tandem of four PGs (At2g43860, At2g43870, At2g43880, At2g43890) is located in the collinear region with At3g59850. <b>(e) </b>At3g61490 does not have any PG in the corresponding collinear region. <b>(f) </b>At3g62210 does not have any PG in the collinear region. For each region pair, the solid black bars are the chromosomes (top: chromosome 2, bottom: chromosome 3) flanked by the starting and ending positions in Mb. The annotated genes are drawn to scale in a rectangular box on the chromosome and in each box the thicker black line indicates the 3' position of the gene. The names are only shown for PGs and the starting and ending genes in each block pair. The areas that are at least 30 amino acids long with at least 50% identity are linked by colored lines based on their identity levels (see key).</p>
               </text>
               <graphic file="gb-2006-7-9-r87-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>PG expression in <it>Arabidopsis </it>tissues</p>
            </st>
            <p>The size of the plant PG family and the patterns of PG duplication in <it>Arabidopsis </it>indicate that the PG family expanded in both <it>Arabidopsis </it>and rice after their divergence. The continuous expansion of this gene family raises an intriguing question on the mechanisms of duplicate retention and their functions in plants. Since retention may be due to functional divergence between duplicate copies, it is possible that PG functional divergence can be, in part, attributed to expression divergence. To evaluate the degree of expression divergence between PG duplicates, we analyzed the expression of all 66 <it>Arabidopsis </it>PGs in five tissue types (flowers, siliques, inflorescence stems, rosette and cauline leaves, and roots) with RT-PCR (Figure <figr fid="F4">4</figr> and Additional data file 5). PCR reactions were repeated at least three times for each gene in each tissue type, and all primers were tested using genomic DNA as a positive control (see Figure <figr fid="F5">5</figr>). In addition, PCR products of 40 of the 43 PGs were sequenced to verify their identity. We found that 23 PGs did not have detectable RT-PCR products in any of the five tissue types tested. We further tested the expression of these 23 PGs in a T87 suspension culture cell line that had been previously shown to have >60% genes expressed <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Only one PG (At2g43860) was detected. To rule out the possibility of faulty primer designs, a second primer set was designed for each of these 23 PGs, but none led to detectable products.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The phylogeny and expression patterns of <it>Arabidopsis </it>PGs</p>
               </caption>
               <text>
                  <p>The phylogeny and expression patterns of <it>Arabidopsis </it>PGs. The phylogeny was generated using all <it>Arabidopsis </it>PGs with <it>Erwinia peh1 </it>as the outgroup. The clade classification, cluster and block designation are also shown. The levels of transcripts are classified into five categories as shown in the key. The tissue source abbreviations are as follows: Fl, flower; Si, silique; St, stem; Lf, rosette and cauline leaf; Rt, root; gDNA, genomic DNA. For each gene, three colored rectangles represent the level of RT-PCR products from three independent biological replications for each tissue type. On the right, the solid black circles indicate the presence of the four different expression tags. RT-PCR data are from this study and a solid circle represents repeatable expression from one or more of the six tissue types analyzed including expression in At2g43860 from suspension cultures. Open circles represent expression that was only detected in one of the RT-PCR reactions yet verified by sequencing. cDNAs, ESTs and MPSS tags were obtained from SIGnAL, GenBank, and the <it>Arabidopsis </it>MPSS project websites, respectively. Branches that were shortened are intersected with a solidus (/).</p>
               </text>
               <graphic file="gb-2006-7-9-r87-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>RT-PCR of PGs in five tissue types</p>
               </caption>
               <text>
                  <p>RT-PCR of PGs in five tissue types. The competitive RT-PCR, using both cDNA and gDNA templates, is demonstrated. The expression pattern of PGs in the clade A14 is variable except At1g23470, which has no detectable expression in all five tissue types. RT-PCR product sizes are indicated to the right of the figure. Tissue source abbreviations are as in Figure 4.</p>
               </text>
               <graphic file="gb-2006-7-9-r87-5"/>
            </fig>
            <p>To complement the RT-PCR approach, we also examined the expression tags that were publicly available including full-length cDNAs, expressed sequence tags (ESTs), and massive parallel signature sequencing (MPSS) tags (Additional data file 6). The presence of RT-PCR products or other expression tags is shown in Figure <figr fid="F4">4</figr> (far right-hand panel). Among these four different expression measures, the RT-PCR approach detects the highest number of PGs. In the 43 PGs with RT-PCR products, other expression tags support only 30 of them. In addition, only three PGs have cDNA, ESTs, and/or MPSS but not RT-PCR products. These findings indicate that RT-PCR is the most sensitive approach with a relatively low false-negative rate. For further analyses, we consider a PG expressed if two out of three of the RT-PCR reactions had detectable products (42) or if its expression is supported by the presence of either cDNA or EST (three). Based on these criteria, 45 PGs had detectable expression (Figure <figr fid="F4">4</figr>). Approximately 50% of these expressed PGs are found in all five tissues and 20% have relatively higher level of expression in more than one tissue. In addition, more than 50% of expressed PGs have high level of expression in floral tissues, 40% in root tissue, 16% in stem and 12% in silique. Only nine PGs (approximately 20%) are found in only one tissue type (Figure <figr fid="F4">4</figr>). These findings indicate that most PGs have rather wide expression patterns and the expression level seems to be generally higher in floral tissues. The complexity of expression patterns represented in Figure <figr fid="F4">4</figr> emphasizes the need for additional interpretation, and is the basis for the statistical analyses described below for the expression data.</p>
         </sec>
         <sec>
            <st>
               <p>Effects of duplication mechanisms on gene expression</p>
            </st>
            <p>While it was anticipated that more closely related genes would tend to have similar expression patterns, we did not find significant correlation between the synonymous substitution rate (<it>Ks</it>) and the expression profile (Figure <figr fid="F6">6</figr>). In addition, to evaluate the relationships between <it>Ks </it>and expression correlation using all PG pairs, we also reached the same conclusion after partitioning the data as within clade (<it>r </it>= -0.119, <it>p </it>= 0.39), between clade (<it>r </it>= 0.002, <it>p </it>= 0.58), or reciprocal best matches (<it>r </it>= -0.4389, <it>p </it>= 0.12). This finding indicates that expression patterns have diverged quickly after PG duplications. In particular, significantly fewer PGs in tandem clusters were expressed when compared with those not in clusters (Table <tblr tid="T1">1</tblr>; Fisher's exact test; <it>p </it>= 0.0326). In several cases, the tandem duplicated regions have one relatively highly expressed gene while the rest have either low expression levels or no RT-PCR products. For example, in the 1b tandem cluster of clade A14, At1g23460 is highly expressed while At1g23470 does not have any detectable expression. Curiously, we found that related PGs found in duplicated blocks tend to have similar expression patterns at the tissue level. For example, in block 11d clade A14, At1g23460 and At1g70500 have nearly identical expression profiles (Figure <figr fid="F4">4</figr>). We selected 18 PG pairs that were derived from tandem or large-scale block duplication to compare their expression divergence. Among nine pairs in large-scale duplicated blocks, the expression pattern is significantly different in only one pair (Table <tblr tid="T2">2</tblr>). Among the nine pairs derived from tandem duplications, the <it>t</it>-test could only be conducted for four pairs because several of the tandem duplicates had no detectable expression. In addition to two pairs with significant differences (<it>p </it>&lt; 0.05), three pairs with only one of the tandem duplicates expressed are also classified as pairs showing expression divergence. Therefore, excluding two pairs with no expression for both duplicates, five out of seven tandem pairs have divergent expression. Significantly fewer PG pairs derived from tandem duplications have similar expression patterns compared with those derived from large-scale duplications (Fisher's exact test; <it>p </it>&lt; 0.01). Therefore, tandemly duplicated PGs have higher levels of expression divergence compared with PGs derived from large-scale duplications. These findings suggest that duplication mechanisms contribute to divergence of expression patterns differently.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Expression of PGs shared among tissues and the correlation between expression patterns and the <it>Ks</it></p>
               </caption>
               <text>
                  <p>Expression of PGs shared among tissues and the correlation between expression patterns and the <it>Ks</it>. <b>(a) </b>Overlapping expression of PGs - the majority of expressed PGs are found in all five tissues tested. <b>(b) </b>Pairwise comparisons of tissues with PGs - the numbers in black boxes represent the number of PGs expressed in indicated tissues. The numbers in the upper-right half are the number of PGs expressed in both tissues specified in the top row and in the leftmost column. The numbers in the lower-left half are the percent overlap between two tissues. <b>(c) </b>The relationships between the <it>Ks </it>and transformed correlations in expression patterns - the <it>Ks </it>values were determined for all PG pairs. The correlations between expression patterns were calculated for all PG pairs and transformed as described in the Materials and methods. The formulae for the best fit and the correlation coefficient determined by linear regression are shown on the top right corner.</p>
               </text>
               <graphic file="gb-2006-7-9-r87-6"/>
            </fig>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Distribution and expression of <it>Arabidopsis </it>PG genes in duplicated regions</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Out of duplicated regions*</p>
                     </c>
                     <c cspan="4" ca="left">
                        <p>Within duplicated regions*</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="left">
                        <p>With match<sup>&#8224;</sup></p>
                     </c>
                     <c cspan="2" ca="left">
                        <p>Without match<sup>&#8224;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Expression<sup>&#8225;</sup></p>
                     </c>
                     <c ca="left">
                        <p>Number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Expression<sup>&#8225;</sup></p>
                     </c>
                     <c ca="left">
                        <p>Number of genes</p>
                     </c>
                     <c ca="left">
                        <p>Expression<sup>&#8225;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Singular</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tandem</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Duplicated regions are the regions that are covered by the AGI and BHW blocks. <sup>&#8224;</sup>The presence (with match) or absence (without match) of PGs in collinear regions of each duplicated block pair as shown in Figure 4 and <supplr sid="S4">Additional data file 4</supplr>. <sup>&#8225;</sup>Expression detected in at least two out of three RT-PCR reactions or supported by the presence of cDNA or EST tags.</p>
               </tblfn>
            </tbl>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Expression (RT-PCR) of <it>Arabidopsis </it>PG genes in different clades</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Set*</p>
                     </c>
                     <c ca="left">
                        <p>Gene1</p>
                     </c>
                     <c ca="left">
                        <p>Gene2</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ks</it>
                           <sup>&#8224;</sup>
                        </p>
                     </c>
                     <c ca="left">
                        <p>t<sup>&#8225;</sup></p>
                     </c>
                     <c ca="left">
                        <p><it>p </it>&lt; 0.05<sup>&#8225;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B1</p>
                     </c>
                     <c ca="left">
                        <p>At1g02460</p>
                     </c>
                     <c ca="left">
                        <p>At4g01890</p>
                     </c>
                     <c ca="left">
                        <p>1.0564</p>
                     </c>
                     <c ca="left">
                        <p>3.09</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B2</p>
                     </c>
                     <c ca="left">
                        <p>At1g10640</p>
                     </c>
                     <c ca="left">
                        <p>At1g60590</p>
                     </c>
                     <c ca="left">
                        <p>1.252</p>
                     </c>
                     <c ca="left">
                        <p>-0.32</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B3</p>
                     </c>
                     <c ca="left">
                        <p>At1g23460</p>
                     </c>
                     <c ca="left">
                        <p>At1g70500</p>
                     </c>
                     <c ca="left">
                        <p>0.8011</p>
                     </c>
                     <c ca="left">
                        <p>-0.73</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B3</p>
                     </c>
                     <c ca="left">
                        <p>At1g23470</p>
                     </c>
                     <c ca="left">
                        <p>At1g70500</p>
                     </c>
                     <c ca="left">
                        <p>1.877</p>
                     </c>
                     <c ca="left">
                        <p>-14.70</p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B4</p>
                     </c>
                     <c ca="left">
                        <p>At2g41850</p>
                     </c>
                     <c ca="left">
                        <p>At3g57510</p>
                     </c>
                     <c ca="left">
                        <p>0.6805</p>
                     </c>
                     <c ca="left">
                        <p>-1.43</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B5</p>
                     </c>
                     <c ca="left">
                        <p>At2g43860</p>
                     </c>
                     <c ca="left">
                        <p>At3g59850</p>
                     </c>
                     <c ca="left">
                        <p>2.1371</p>
                     </c>
                     <c ca="left">
                        <p>-3.00</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B5</p>
                     </c>
                     <c ca="left">
                        <p>At2g43870</p>
                     </c>
                     <c ca="left">
                        <p>At3g59850</p>
                     </c>
                     <c ca="left">
                        <p>0.9534</p>
                     </c>
                     <c ca="left">
                        <p>2.13</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B5</p>
                     </c>
                     <c ca="left">
                        <p>At2g43880</p>
                     </c>
                     <c ca="left">
                        <p>At3g59850</p>
                     </c>
                     <c ca="left">
                        <p>1.8279</p>
                     </c>
                     <c ca="left">
                        <p>1.00</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B5</p>
                     </c>
                     <c ca="left">
                        <p>At2g43890</p>
                     </c>
                     <c ca="left">
                        <p>At3g59850</p>
                     </c>
                     <c ca="left">
                        <p>1.8308</p>
                     </c>
                     <c ca="left">
                        <p>-1.41</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T1</p>
                     </c>
                     <c ca="left">
                        <p>At1g05650</p>
                     </c>
                     <c ca="left">
                        <p>At1g05660</p>
                     </c>
                     <c ca="left">
                        <p>0.2385</p>
                     </c>
                     <c ca="left">
                        <p>ND<sup>&#167;</sup></p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T2</p>
                     </c>
                     <c ca="left">
                        <p>At1g23460</p>
                     </c>
                     <c ca="left">
                        <p>At1g23470</p>
                     </c>
                     <c ca="left">
                        <p>0.878</p>
                     </c>
                     <c ca="left">
                        <p>6.53</p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T3</p>
                     </c>
                     <c ca="left">
                        <p>At2g43860</p>
                     </c>
                     <c ca="left">
                        <p>At2g43870</p>
                     </c>
                     <c ca="left">
                        <p>1.4013</p>
                     </c>
                     <c ca="left">
                        <p>-6.53</p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T4</p>
                     </c>
                     <c ca="left">
                        <p>At2g43880</p>
                     </c>
                     <c ca="left">
                        <p>At2g43890</p>
                     </c>
                     <c ca="left">
                        <p>4.2072</p>
                     </c>
                     <c ca="left">
                        <p>2.83</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T5</p>
                     </c>
                     <c ca="left">
                        <p>At3g07820</p>
                     </c>
                     <c ca="left">
                        <p>At3g07830</p>
                     </c>
                     <c ca="left">
                        <p>0.5342</p>
                     </c>
                     <c ca="left">
                        <p>ND<sup>&#167;</sup></p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T5</p>
                     </c>
                     <c ca="left">
                        <p>At3g07820</p>
                     </c>
                     <c ca="left">
                        <p>At3g07840</p>
                     </c>
                     <c ca="left">
                        <p>0.4923</p>
                     </c>
                     <c ca="left">
                        <p>ND<sup>&#167;</sup></p>
                     </c>
                     <c ca="left">
                        <p>y</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T5</p>
                     </c>
                     <c ca="left">
                        <p>At3g07830</p>
                     </c>
                     <c ca="left">
                        <p>At3g07840</p>
                     </c>
                     <c ca="left">
                        <p>0.457</p>
                     </c>
                     <c ca="left">
                        <p>ND</p>
                     </c>
                     <c ca="left">
                        <p>ND</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T6</p>
                     </c>
                     <c ca="left">
                        <p>At4g32370</p>
                     </c>
                     <c ca="left">
                        <p>At4g32380</p>
                     </c>
                     <c ca="left">
                        <p>2.6336</p>
                     </c>
                     <c ca="left">
                        <p>0.73</p>
                     </c>
                     <c ca="left">
                        <p>n</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>T7</p>
                     </c>
                     <c ca="left">
                        <p>At5g44830</p>
                     </c>
                     <c ca="left">
                        <p>At5g44840</p>
                     </c>
                     <c ca="left">
                        <p>0.1626</p>
                     </c>
                     <c ca="left">
                        <p>ND</p>
                     </c>
                     <c ca="left">
                        <p>ND</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Each set contains genes that were duplicated through either local-scale block duplication (B) or tandem duplication (T). In duplicated blocks where a PG is collinear with a cluster, the one-to-many relationships are shown. For tandem clusters, all pairwise combinations are shown. <sup>&#8224;</sup><it>Ks</it>, synonymous substitution rate. <sup>&#8225;</sup>Differences in expression patterns significant (y) or not (n) for <it>t</it>-test with <it>df </it>= 2, <it>p </it>&lt; 0.05 [52]. ND, not determined since both genes do not have detectable RT-PCR product or <sup>&#167;</sup>expression was documented for only one gene in the pair.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Developmentally regulated expression divergence among PGs expressed in abscission zone</p>
            </st>
            <p>So far, our expression analyses were performed in five widely different tissues. To further expand our understanding of PG expression, we took a close look at 43 of the expressed PGs in the abscission zones of flowers and developing siliques at five developmental stages during floral organ abscission (Figure <figr fid="F7">7a</figr>). During the abscission process there are discrete stages when cell wall loosening and cell wall dissolution occurs, thus providing an excellent biological system to look at more subtle changes in the regulation of cell separation. And indeed, this analysis allowed us to discern differences in expression between PGs that had been initially regarded as similar due to limitations in resolution (Figure <figr fid="F7">7</figr>). For example, at the tissue level, At1g23460 and At1g70500, from block 11d clade A14 were regarded as having nearly identical expression profiles. However, when we examined five stages of abscission, these genes have distinct profiles (Figure <figr fid="F7">7c</figr> and <figr fid="F7">7e</figr>, Additional data file 7).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>RT-PCR on floral organ abscission zones representing five unique stages of development</p>
               </caption>
               <text>
                  <p>RT-PCR on floral organ abscission zones representing five unique stages of development. Expression of 43 PGs is examined in the abscission zones at five different stages of floral organ abscission as determined by position on the inflorescence, where position one represents anthesis and larger numbers are progressively older flowers <b>(a)</b>. Five developmental stages were examined with the RT-PCR; <it>i </it>(position1/2) and <it>ii </it>(position 4/5), pre-abscission, <it>iii </it>(position 7/8), during abscission, <it>iv </it>(position 0/11), and <it>v </it>(position 13/14) post-abscission. Expression during the abscission process is classified into nine different unique patterns shown in (b) to (j); the gene names are provided in <supplr sid="S7">Additional data file 7</supplr>. PGs specifically up-regulated during the abscission process are shown with RT-PCR products (k).</p>
               </text>
               <graphic file="gb-2006-7-9-r87-7"/>
            </fig>
            <p>We determined that there are nine unique patterns of expression for the PGs during the five stages of abscission that are shown in Figure <figr fid="F7">7</figr> and Additional data file 7. Eight PGs display high levels of expression at anthesis, low levels during the events of cell separation, and high levels post abscission as depicted in Figure <figr fid="F7">7b</figr>. These genes are all from independent clades except two sets: At1g19170 and At3g42950 (B8), and At2g23900 and At3g48950 (B6). In Figure <figr fid="F7">7c</figr>, 7 PGs show initial high expression at anthesis that decreases steadily during abscission, while in Figure <figr fid="F7">7d</figr>, PG expression (At1g02460, At1g56710, and At3g61490) initially decreases right before abscission and then increases after the loss of floral organs or during what is described as post abscission repair. In Figure <figr fid="F7">7e</figr>, two PGs (At1g23460 and At1g10640) have very low or undetectable expression during anthesis that goes up continually during abscission. Other patterns include ten PGs with constitutive expression (Figure <figr fid="F7">7f</figr>), and six PGs with no expression (Figure <figr fid="F7">7g</figr>). Last, we observed three patterns of expression that correlated with unique changes during the process of abscission (Figure <figr fid="F7">7h,i,j</figr>). In Figure <figr fid="F7">7h</figr>, high levels of gene expression correlate with cell wall loosening or the earliest steps of abscission, while in Figure <figr fid="F7">7i</figr> highest levels of gene expression correlate with cell separation or loss of floral organs. In Figure <figr fid="F7">7j</figr>, it is only at around positions 10 and 11 that we observe detectable gene expression, and this correlates with predicted stages of cell repair <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
            <p>Taken together, expression divergence between PGs that show no difference at the tissue level were revealed when we examined PG expression at different developmental stages of abscission, thus indicating duplication mechanisms contribute to divergence of expression differently. Our findings also provide candidate PGs important for different abscission stages. More importantly, the expression divergence between duplicate genes in general appears to be under-estimated in expression studies due to the limitations in resolution.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <sec>
            <st>
               <p>PG family expansion history</p>
            </st>
            <p>PGs fall into several taxon-specific clades where eubacterial, fungal, and plant PGs organize into different clusters <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. We have hypothesized that there were approximately 21 PGs present in the immediate common ancestor of <it>Arabidopsis </it>and rice, and when additional monocots and dicots are sequenced, we will be able to have a more accurate estimate of the ancestral family size. Since <it>Arabidopsis </it>and rice diverged more than 150 million years ago (MYA), gene conversion events that occurred soon after divergence of these two lineages will be much rarer than those that occurred in a lineage-specific fashion.</p>
            <p>By examining the physical locations of <it>Arabidopsis </it>PGs and their relationships to the proposed large-scale duplication patterns, we found that tandem duplications and large-scale duplications were two of the major factors responsible for the expansion of the PG family in <it>Arabidopsis</it>. This is similar to other gene families such as the NBS-LRR <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and the RLK/Pelle gene family <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Among duplicates in the same tandem cluster, nearly all belong to the same PG clades or are close relatives of each other. The only exception is At1g80140 and At1g80170 in cluster 1d, suggesting that they are tandem duplicates that formed before the <it>Arabidopsis</it>-rice split. Most of the PGs (59) are located within 26 duplicated block pairs (Table <tblr tid="T1">1</tblr>). However, the comparison of gene contents between duplicated blocks in each pair indicates that 22 PGs are distributed asymmetrically in ten of these duplicated block pairs, thus suggesting gene losses. The rest of the duplicated block pairs contain PGs in both duplicated regions. Since only 13 of these PGs are collinear, our findings suggest that large-scale duplications did contribute to some expansion of the PG family but gene losses occurred frequently. Members of each PG pair (either one-to-one or one-to-many) located in collinear regions are from the same clade. Since a clade is defined as the PG ancestral unit right before the divergence between <it>Arabidopsis </it>and rice, the blocks harboring these PGs would be duplicated after the split between these two plants. Blanc <it>et al</it>. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> assigned duplicated gene pairs to blocks and used synonymous substitution rates to establish the block age. We found that 17 PGs were in 'recent' blocks that duplicated after the split between the <it>Arabidopsis </it>and rice lineages (Additional data file 4). This correlation is consistent with our interpretation based on a phylogenetic approach.</p>
            <p>In the cases where PGs were present in only one of the collinear regions, it is likely that the absence of PGs was due to gene losses, and almost 80% of the PGs generated by large-scale duplications could have been lost in <it>Arabidopsis</it>. These findings are consistent with the high duplicate loss rate in the <it>Arabidopsis </it>genome <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr></abbrgrp>. In addition, the collinear regions flanking PGs are generally larger than the corresponding regions without PGs (considering the numbers of genes or physical distances between the two genes flanking the PGs that were collinear), thus suggesting that the deletion of chromosome regions contributes to PG loss. Another explanation for the asymmetrical distribution of PGs in blocks is that they were inserted <it>de novo </it>through an alternative mechanism such as retro-transposition; however, this is unlikely, as all of the plant PGs have multiple introns.</p>
         </sec>
         <sec>
            <st>
               <p>Divergence of expression pattern after duplications</p>
            </st>
            <p>Although a large number of PG duplicates were lost, there is a net gain in the PG family size after the split between <it>Arabidopsis </it>and rice, and thus, the immediate question is how were these duplicates retained? The fate of duplicated genes varies and depends on the selection constraints <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Since one third of the <it>Arabidopsis </it>PGs do not have any evidence of expression, these genes could be pseudogenes. However, some of them have diverged substantially from their closest relatives with large synonymous substitution rates and have most likely persisted beyond the time frame of pseudogenization in <it>Arabidopsis </it>proposed to be a million years <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Meanwhile, PGs without evidence of expression may be present in tissues not sampled or induced under untested conditions. A closer look at other developmental events involving cell wall degradation, cell separation or cell wall loosening may provide additional insights.</p>
            <p>There is mounting evidence that retention of duplicated genes may be due to acquisition of novel functions, partitioning of original functions, or both. The contribution of differential expression in retaining duplicated genes has been hypothesized more than 25 years ago <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. More recently, Force <it>et al</it>. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> proposed the DDC (Duplication/Degeneration/Complementation) model predicting that genes sharing overlapping but distinct expression patterns will be retained due to the partitioning of ancestral expression profiles. In our study, we found that two thirds of the <it>Arabidopsis </it>PGs are expressed and almost three quarters of these expressed PGs are detected in at least three tissues. If the AtGenExpress microarray data for <it>Arabidopsis </it>is considered <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, five additional PGs are likely expressed using a stringent intensity cutoff (data not shown). Among the PGs that are expressed rather ubiquitously, related PGs in general have overlapping but distinct expression profiles, consistent with the prediction of the DDC model, although it is possible that some expression differences are due to gain of expression rather than loss. In any case, divergent expression among closely related PGs is evident in the different developmental stages of abscission. It has also been reported more recently that duplicated genes tend to have more similar expression patterns when the <it>Ks </it>is relatively small <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. However, in the PG family, the more recent duplicates do not necessarily have more similar expression patterns. The expression correlation breaks down even more when we examine the expression profiles of PGs in different developmental stages of the abscission process. This lack of correlation may be attributed to relatively long divergence time (large <it>Ks </it>value) between PG duplicates and the lack of statistical power, because a much smaller number of genes are examined compared with an analysis of the whole genome. In addition, we suggest that the mechanism of gene duplication appears to contribute differently to expression divergence. The number of expressed PGs is significantly lower if they are located in tandem repeats. On the other hand, PGs with similar tissue expression patterns tend to be localized to corresponding large-scale duplicated blocks. One possible mechanism for this difference in expression pattern conservation may be the fact that tandem duplication may or may not allow the duplication of whole promoter regions and coding sequences. On the other hand, large-scale duplication involves the duplication of multiple genes together with their promoter and/or enhancer elements. Thus, tandem duplications will result in faster expression divergence than large-scale duplications, and that large-scale duplications ultimately lead to "fine tuning" of gene expression. Another potential explanation for the differences in expression may be due to differences in gene silencing. Homology-dependent gene silencing is a common phenomenon in plants <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Since the average sequence divergence between tandem repeats is smaller than that of large-scale duplications (data not shown), one might also argue that tandemly duplicated genes tend to be silenced at a higher frequency.</p>
            <p>Functional studies have established that plant PGs are involved in diverse roles including plant growth and development, wounding responses, and plant-microbe interactions <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Although the PG family members have substantial overlap in tissue-level expression even between distantly related members, when we analyzed distinct developmental stages of abscission we were able to discern unique patterns of expression. These findings suggest that although even if there may be functional overlap between PGs, substantial expression divergence contributed to their retention and probably their functions. Given the number of PGs and the complexity of plant tissues and cell types, it is likely that PGs expressed in the same tissues have subtle differences in their temporal or spatial profiles. This is consistent with the PG expression patterns in different developmental stages of abscission. Alternatively, these seemingly co-expressed PGs may have also diverged at the biochemical levels, such as their catalytic properties. In this study, we used genome sequence information combined with gene expression to provide a framework to unravel the complexity of gene family function. By careful analysis we have been able to take a family of 66 genes and identify four members (Figure <figr fid="F7">7i</figr>) that have unique changes just as cell wall loosening and cell wall dissolution is predicted to occur; thus presenting a small subset of genes for further studies on abscission. Additional analyses in the temporal and spatial patterns of expression in other tissues, their biochemical properties, and in the biological functions of these genes will lead to novel insights regarding functional divergence and conservation in this gene family.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Sequence selection, alignment, and phylogenetic analysis</p>
            </st>
            <p>Representative PGs were the sequences in the seed alignment of glycosyl hydrolase family 28 (GH28) from Pfam database <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The representative set was used as query sequences to conduct BLAST searches <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> against polypeptide sequences of <it>A. thaliana </it>for candidate PGs from Munich Information Center for Protein Sequences (MIPS) <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. All sequences with E values less than one were regarded as candidate PGs and further analyzed with the Pfam HMM models from GenBank polypeptide sequences; The PGs of <it>O. sativa </it>subsp. <it>indica </it>were identified from predicted coding sequences obtained from Dr. W. Karlowski in MIPS <it>Oryza sativa </it>Database (MosDB) <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> with a similar procedure outlined above. The rice PG sequences appeared highly redundant, and thus almost 30% of the entries that were more than 99% identical at the nucleotide level were eliminated from further analysis. For a list of PGs, including redundant entries, see Additional data files 1 and 8. The protein sequences of PGs identified were aligned against the Pfam GH28 seed alignments using the profile alignment function of ClustalW <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The GH28 domain sequence alignments of rice and <it>Arabidopsis </it>PGs analyzed can be found in Additional data file 8. The phylogeny of all PGs identified was generated with MEGA2 <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> using the neighbor-joining algorithm <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> with 1,000 bootstrap replicates. Poisson correction for multiple substitutions was used. Sequence gaps were treated as missing characters. Both the <it>Arabidopsis</it>-rice and <it>Arabidopsis</it>-only trees were rooted with <it>Erwinia peh1</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Mapping chromosome location and duplicated blocks</p>
            </st>
            <p>Two large-scale duplication datasets were used. The first is based on the analysis of the Arabidopsis Genome Initiative <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> that was provided by Heiko Schoof and MIPS/Institute of Bioinformatics, Germany. The correspondence between block names given in this study and those in the original analysis, and the starting and ending gene names for these blocks are given in Additional data file 2. The second is based on Blanc <it>et al</it>. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and is available from <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The collinearity of blocks that contain PGs in corresponding duplicated regions was determined using tBLASTn. For these blocks, the nucleotide sequences of one of the duplicated regions were used as query to search against a translated database built from the nucleotide sequence of the other region. To increase the number of High Scoring Pairs recovered, the query sequences were split into 5 kb windows. The matching areas (at least 50 amino acids long and 60% identical) of blocks that contain PGs in the corresponding duplicated regions are shown in Additional data file 4. After identifying the collinear regions surrounding PGs, we took at least 100 kb regions surrounding PGs and their corresponding duplication regions, regardless of the presence of PGs, and repeated the BLAST analysis splitting query sequences into 1 kb windows. Matching areas were defined as similar regions at least 30 amino acids long.</p>
         </sec>
         <sec>
            <st>
               <p>Plant materials and growth</p>
            </st>
            <p><it>Arabidopsis </it>ecotype Columbia (COL) was used for this study and plants grown as described by Patterson and Bleecker <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. T87 suspension-cultured cell lines were derived from COL ecotype <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp> and provided by Sebastian Bednarek (University of Wisconsin, Madison, WI, USA). The abscission zones of developing flowers and siliques were collected by removing the primary inflorescence from the plant, and then trimming each individual sample within 0.75 mm +/- 0.25 of the floral abscission zone on both sides. Trimmed samples were immediately frozen in liquid nitrogen and stored at -80&#176;C until further analysis.</p>
         </sec>
         <sec>
            <st>
               <p>Nucleic acid isolation and quantification</p>
            </st>
            <p>Plant tissue was frozen in liquid nitrogen, ground and added to TES-Lysis (50 mM Tris pH 8, 5 mM EDTA, 50 mM NaCl, 1% (w/v) SDS, 1% w/v sarkosyl) followed by extraction with a phenol:chloroform:isoamyl alcohol mix (25:24:1). Samples were centrifuged for 5 minutes at (12,000 <it>g</it>) and the resulting aqueous phase was extracted twice with chloroform:isoamyl alcohol (24:1). Nucleic acids were precipitated at 4&#176;C with isopropanol and 10 M NH<sub>4</sub>OAc (one-third volume) and resuspended in TE. One-half volume of 6.0 M LiCl was added to the sample, incubated at 4&#176;C for 4 hours, and then centrifuged 15 minutes at 12,000 <it>g</it>. DNA (supernatant fraction) was precipitated by adding 10 M NH<sub>4</sub>OAc (1/3 volume) and ethanol, and RNA (pellet) was washed with ethanol and resuspended in DEPC-treated H20 (1 &#956;g/&#956;l). DNA and RNA yields were quantified using a Smart Spec 3000 Biorad (Hercules, CA, USA). Nucleic acid quality was assessed by gel electrophoresis.</p>
         </sec>
         <sec>
            <st>
               <p>RT-PCR analysis</p>
            </st>
            <p>A quantity of 1 &#956;g of each RNA sample was used to prepare cDNA by modifying standard procedures <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. First strand synthesis was carried out using 500 &#956;g/ml of an 18 mer oligo dT primer (IDT, Coralville, IA, USA). Resulting cDNAs were diluted 1:2 and 1 &#956;l was added as template for a standard 20 &#956;l PCR reaction. For each gene, primers were designed that flanked an intron in the genomic DNA similar to that described by Wang <it>et al</it>. <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Since the mRNA and genomic copy of a gene share identical primer sites, they had comparable amplification efficiencies in the PCR reaction and were distinguishable by size. Reactions were incubated at 95&#176;C for 5 minutes, and cycled 28 or 36 times as follows: 94&#176;C for 3 minutes, annealing temperature for 30 seconds, 72&#176;C for 2 minutes. After the last cycle, reactions were incubated at 72&#176;C for 7 minutes. Annealing temperatures and cycle numbers were optimized and are shown in Additional data file 5. A quantity of 10 &#956;l of each PCR reaction was analyzed by gel electrophoresis, and the relative levels of PCR products were recorded.</p>
         </sec>
         <sec>
            <st>
               <p>DNA sequencing</p>
            </st>
            <p>PCR products were excised from the gel, cleaned using a Qiagen Gel extraction kit (Qiagen, Valencia, CA, USA) and sequenced directly as described below. Cycle sequencing reactions with a thermostable DNA polymerase and fluorescently labeled dideoxy terminators (BIG DYE Applied Biosystems, Foster, CA, USA) were carried out on each purified product or subcloned fragment. At2g43870 was subcloned into the PCR 4-TOPO vector (Invitrogen, Carlsbad, CA, USA) before sequencing. All reactions were outsourced to the UW-Madison Biotechnology Center and run on an ABI automated DNA sequencer.</p>
         </sec>
         <sec>
            <st>
               <p>Expression tags of PGs and analysis</p>
            </st>
            <p>The cDNA sequences released by the SIGnAL database <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> were retrieved from GenBank. The predicted protein sequences of PGs were used to search against the cDNA sequences. The cDNAs for PGs are listed in part I of Additional data file6. The <it>Arabidopsis </it>ESTs were retrieved from GenBank (part II of Additional data file 6), and a BLAST search was conducted using the predicted coding sequences of PGs. All matches with more than 80% identity were inspected. After eliminating gaps longer than three from the alignments, cognate ESTs were defined as those that were top matches to the gene in question with at least 97% identity. The accessions, source tissue information for the matching ESTs, can be found in part II of Additional data file 6. The MPSS tags matching the PG genes were retrieved using a batch query script from the <it>Arabidopsis </it>MPSS database <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Only tags matching exons in the crick strand with levels significantly different from 0 were regarded as evidence of expression.</p>
            <p>The PG expression levels as determined by RT-PCR were converted into 5 categories: high (4), medium (3), low (2), trace (1), and none (0). For each gene, the median (M) of the converted expression levels was used for all subsequent analyses. For each gene pair, the synonymous and non-synonymous substitution rates were determined using the yn00 phylogenetic analysis by maximum likelihood program PAML <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The Pearson correlation coefficient (<it>r</it>) was determined for each gene pair and transformed into ln [(1+R)/(1-R)] for linear repression analyses <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. For determining the differences in expression patterns between tandemly duplicated and block-duplicated genes, we conducted <it>t</it>-tests for 18 PG pairs. For each tissue, the expression levels were considered if both or either one of the genes in a pair were expressed.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> lists PGs identified from Genbank protein records. Additional data file <supplr sid="S2">2</supplr> is the BHW and AGI assignment of PGs to duplicated blocks in <it>Arabidopsis</it>. Additional data file <supplr sid="S3">3</supplr> shows a phylogeny generated with all the PGs from fungi, bacteria, metazoa, and plants. Additional data file <supplr sid="S4">4</supplr> shows a figure with the matching areas for duplicated blocks containing PGs in both regions. Additional data file <supplr sid="S5">5</supplr> lists the primers used for the RT-PCR analysis. Additional data file <supplr sid="S6">6</supplr> is summary of expression tags including a list of the PG cDNAs from <it>Arabidopsis </it>and a list of the PG cognate ESTs from <it>Arabidopsis</it>. Additional data file <supplr sid="S7">7</supplr> lists PGs that are expressed in the floral organ abscission zones of <it>Arabidopsis </it>with their patterns of expression. Additional data file <supplr sid="S8">8</supplr> shows GH28 domain sequence alignments of rice and <it>Arabidopsis </it>PGs analyzed.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>List of PGs identified from GenBank protein records except <it>Arabidopsis </it>proteins</p>
            </caption>
            <text>
               <p>Table containing PGs (except <it>Arabidopsis </it>proteins) sorted according to their taxa.</p>
            </text>
            <file name="gb-2006-7-9-r87-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>BHW and AGI assignment of PGs to duplicated blocks in <it>Arabidopsis</it></p>
            </caption>
            <text>
               <p>Table of assignment of PGs to duplicated blocks in <it>Arabidopsis</it>.</p>
            </text>
            <file name="gb-2006-7-9-r87-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Phylogeny of fungal, bacterial, metazoan, and plant PGs</p>
            </caption>
            <text>
               <p>Figure showing a phylogeny generated with all the PGs from fungi, bacteria, metazoa, and plants.</p>
            </text>
            <file name="gb-2006-7-9-r87-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Matching areas for duplicated blocks containing PGs in both regions</p>
            </caption>
            <text>
               <p>Figure with the matching areas for duplicated blocks containing PGs in both regions.</p>
            </text>
            <file name="gb-2006-7-9-r87-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Primers used for the RT-PCR analysis</p>
            </caption>
            <text>
               <p>Primer pairs and conditions used to amplify PG genes.</p>
            </text>
            <file name="gb-2006-7-9-r87-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>Summary of expression tags including a list of the PG cDNAs from <it>Arabidopsis </it>and a list of the PG cognate ESTs from <it>Arabidopsis</it></p>
            </caption>
            <text>
               <p>Table showing summary of expression tags and list of cDNAs for <it>Arabidopsis </it>PGs.</p>
            </text>
            <file name="gb-2006-7-9-r87-S6.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>Patterns of gene expression during abscission</p>
            </caption>
            <text>
               <p>A list of PGs that are expressed in the floral organ abscission zones of <it>Arabidopsis </it>with their patterns of expression.</p>
            </text>
            <file name="gb-2006-7-9-r87-S7.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional data file 8</p>
            </title>
            <caption>
               <p><it>Arabidopsis </it>and rice sequences used for alignment</p>
            </caption>
            <text>
               <p>GH28 domain sequence alignments of rice and <it>Arabidopsis </it>PGs analyzed.</p>
            </text>
            <file name="gb-2006-7-9-r87-S8.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Wojciech M Karlowski and MIPS for providing predicted indica gene sequences, Ronan O'Malley for helpful discussions on the statistical tests and technical advice for the expression work, Runsun Pan for providing software for evolutionary rate calculation, and Yun-Huei Tzeng for helpful discussions on the statistical tests. This work was supported by USDA (00-35301-9085) and NSF (DBI-0217552) to S.E.P., NIH National Research Service Award (5F32GM066554-01) to S-H.S., and NIH grants to W-H.L.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The cell wall.</p>
            </title>
            <aug>
               <au>
                  <snm>Carpita</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>McCann</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Biochemistry and Molecular Biology of Plants</source>
            <publisher>Rockville: American Society Plant Physiologists</publisher>
            <editor>Buchanan BB, Gruissem W, Jones R</editor>
            <pubdate>2000</pubdate>
            <fpage>52</fpage>
            <lpage>109</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Cooperative disassembly of the cellulose-xyloglucan network of plant cell walls: parallels between cell expansion and fruit ripening.</p>
            </title>
            <aug>
               <au>
                  <snm>Rose</snm>
                  <fnm>JKC</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>1999</pubdate>
            <volume>4</volume>
            <fpage>176</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1360-1385(99)01405-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10322557</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Expansive growth of plant cell walls.</p>
            </title>
            <aug>
               <au>
                  <snm>Cosgrove</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Plant Physiol Biochem</source>
            <pubdate>2000</pubdate>
            <volume>38</volume>
            <fpage>109</fpage>
            <lpage>124</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0981-9428(00)00164-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">11543185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Abscission, dehiscence, and other cell separation processes.</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Carranza</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Biol</source>
            <pubdate>2002</pubdate>
            <volume>53</volume>
            <fpage>131</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.arplant.53.092701.180236</pubid>
                  <pubid idtype="pmpid">12221970</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Polygalacturonases: many genes in search of a function.</p>
            </title>
            <aug>
               <au>
                  <snm>Hadfield</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1998</pubdate>
            <volume>117</volume>
            <fpage>337</fpage>
            <lpage>343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1539180</pubid>
                  <pubid idtype="pmpid" link="fulltext">9625687</pubid>
                  <pubid idtype="doi">10.1104/pp.117.2.337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Cell separation processes in plants: models, mechanisms, and manipulation.</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Whitelaw</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Carranza</snm>
                  <fnm>ZH</fnm>
               </au>
               <au>
                  <snm>McManus</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>Ann Bot</source>
            <pubdate>2000</pubdate>
            <volume>86</volume>
            <fpage>223</fpage>
            <lpage>235</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1006/anbo.2000.1203</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Cutting loose. Abscission and dehiscence in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>Patterson</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>126</volume>
            <fpage>494</fpage>
            <lpage>500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.126.2.494</pubid>
                  <pubid idtype="pmpid" link="fulltext">11402180</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Multiple endo-1,4-beta-D-glucanase (cellulase) genes in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>del Campillo</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Curr Top Dev Biol</source>
            <pubdate>1999</pubdate>
            <volume>46</volume>
            <fpage>39</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10417876</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Characterization of a ubiquitous expressed gene family encoding polygalacturonase in Arabidopsis thaliana.</p>
            </title>
            <aug>
               <au>
                  <snm>Torki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mandaron</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mache</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Falconet</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2000</pubdate>
            <volume>242</volume>
            <fpage>427</fpage>
            <lpage>436</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(99)00497-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10721737</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Pectin degrading glycoside hydrolases of family 28: sequence-structural features, specificities and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Markovic</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Janecek</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>2001</pubdate>
            <volume>14</volume>
            <fpage>615</fpage>
            <lpage>631</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/14.9.615</pubid>
                  <pubid idtype="pmpid" link="fulltext">11707607</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Expression of a polygalacturonase associated with tomato seed germination.</p>
            </title>
            <aug>
               <au>
                  <snm>Sitrit</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hadfield</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Bradford</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Downie</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1999</pubdate>
            <volume>121</volume>
            <fpage>419</fpage>
            <lpage>428</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">59404</pubid>
                  <pubid idtype="pmpid" link="fulltext">10517833</pubid>
                  <pubid idtype="doi">10.1104/pp.121.2.419</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Analysis of a dehiscence zone endo-polygalacturonase in oilseed rape (Brassica napus) and Arabidopsis thaliana: evidence for roles in cell separation in dehiscence and abscission zones, and in stylar tissues during pollen tube growth</p>
            </title>
            <aug>
               <au>
                  <snm>Sander</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Child</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ulvskov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Albrechtsen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Borkhardt</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>46</volume>
            <fpage>469</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1010619002833</pubid>
                  <pubid idtype="pmpid" link="fulltext">11485203</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Visualization by comprehensive microarray analysis of gene expression programs during transdifferentiation of mesophyll cells into xylem cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Demura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tashiro</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Horiguchi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kishimoto</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kubo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsuoka</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Minami</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nagata-Hiwatashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Okamura</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>15794</fpage>
            <lpage>15799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137795</pubid>
                  <pubid idtype="pmpid" link="fulltext">12438691</pubid>
                  <pubid idtype="doi">10.1073/pnas.232590499</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Overexpression of polygalacturonase in transgenic apple trees leads to a range of novel phenotypes involving changes in cell adhesion.</p>
            </title>
            <aug>
               <au>
                  <snm>Atkinson</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Schroder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hallett</snm>
                  <fnm>IC</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>MacRae</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2002</pubdate>
            <volume>129</volume>
            <fpage>122</fpage>
            <lpage>133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">155877</pubid>
                  <pubid idtype="pmpid" link="fulltext">12011344</pubid>
                  <pubid idtype="doi">10.1104/pp.010986</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Polygalacturonase beta-subunit antisense gene expression in tomato plants leads to a progressive enhanced wound response and necrosis in leaves and abscission of developing flowers.</p>
            </title>
            <aug>
               <au>
                  <snm>Orozco-Cardenas</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Ryan</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>133</volume>
            <fpage>693</fpage>
            <lpage>701</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">219044</pubid>
                  <pubid idtype="pmpid" link="fulltext">12972668</pubid>
                  <pubid idtype="doi">10.1104/pp.103.023226</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Influence of cell wall degrading enzymes on colonization of N2 fixing bacterium, Azorhizobium caulinodans in rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Buvana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kannaiyan</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Indian J Exp Biol</source>
            <pubdate>2002</pubdate>
            <volume>40</volume>
            <fpage>369</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12635715</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Analysis of the genome sequence of the flowering plant Arabidopsis thaliana.</p>
            </title>
            <aug>
               <au>
                  <cnm>Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genomic organization of six tomato polygalacturonases and 5' upstream sequence identity with tap1 and win2 genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Hong</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Tucker</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1998</pubdate>
            <volume>258</volume>
            <fpage>479</fpage>
            <lpage>487</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380050758</pubid>
                  <pubid idtype="pmpid">9669329</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The origins of genomic duplications in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>Vision</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Tanksley</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2114</fpage>
            <lpage>2117</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5499.2114</pubid>
                  <pubid idtype="pmpid" link="fulltext">11118139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Blanc</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hokamp</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>137</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">420368</pubid>
                  <pubid idtype="pmpid" link="fulltext">12566392</pubid>
                  <pubid idtype="doi">10.1101/gr.751803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Evolutionary change of duplicate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Isozymes Curr Top Biol Med Res</source>
            <pubdate>1982</pubdate>
            <volume>6</volume>
            <fpage>55</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubid idtype="pmpid">6187709</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Splitting pairs: the diverging fates of duplicated genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Prince</snm>
                  <fnm>VE</fnm>
               </au>
               <au>
                  <snm>Pickett</snm>
                  <fnm>FB</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>827</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg928</pubid>
                  <pubid idtype="pmpid" link="fulltext">12415313</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>A draft sequence of the rice genome (<it>Oryza sativa </it>L. ssp. <it>indica</it>).</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>79</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068037</pubid>
                  <pubid idtype="pmpid" link="fulltext">11935017</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tilling arrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Samanta</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sethi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Hegeman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rancour</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bednarek</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>4453</fpage>
            <lpage>4458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">555476</pubid>
                  <pubid idtype="pmpid" link="fulltext">15755812</pubid>
                  <pubid idtype="doi">10.1073/pnas.0408203102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Ethylene-dependent and -independent processes associated with floral organ abscission in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>Patterson</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Bleecker</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>134</volume>
            <fpage>194</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.103.028027</pubid>
                  <pubid idtype="pmpid" link="fulltext">14701913</pubid>
                  <pubid idtype="pmcid">316299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Kozik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Griego</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kuang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Michelmore</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <fpage>809</fpage>
            <lpage>834</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152331</pubid>
                  <pubid idtype="pmpid" link="fulltext">12671079</pubid>
                  <pubid idtype="doi">10.1105/tpc.009308</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Expansion of the receptor-like kinase/pelle gene family and receptor-like proteins in Arabidopsis.</p>
            </title>
            <aug>
               <au>
                  <snm>Shiu</snm>
                  <fnm>S-H</fnm>
               </au>
               <au>
                  <snm>Bleecker</snm>
                  <fnm>AB</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2003</pubdate>
            <volume>132</volume>
            <fpage>530</fpage>
            <lpage>543</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166995</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805585</pubid>
                  <pubid idtype="doi">10.1104/pp.103.021964</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The hidden duplication past of Arabidopsis thaliana.</p>
            </title>
            <aug>
               <au>
                  <snm>Simillion</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Vandepoele</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Van Montagu</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Zabeau</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>13627</fpage>
            <lpage>13632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">129725</pubid>
                  <pubid idtype="pmpid" link="fulltext">12374856</pubid>
                  <pubid idtype="doi">10.1073/pnas.212522399</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Blanc</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>1679</fpage>
            <lpage>1691</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514153</pubid>
                  <pubid idtype="pmpid" link="fulltext">15208398</pubid>
                  <pubid idtype="doi">10.1105/tpc.021410</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The evolutionary fate and consequences of duplicate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Conery</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>1151</fpage>
            <lpage>1155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5494.1151</pubid>
                  <pubid idtype="pmpid" link="fulltext">11073452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Multilocus enzymes, gene regulation, and genetic sufficiency.</p>
            </title>
            <aug>
               <au>
                  <snm>Zuckerkandl</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1978</pubdate>
            <volume>12</volume>
            <fpage>57</fpage>
            <lpage>89</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01732545</pubid>
                  <pubid idtype="pmpid">731711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Evolution of the differential regulation of duplicate genes after polyploidization.</p>
            </title>
            <aug>
               <au>
                  <snm>Ferris</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Whitt</snm>
                  <fnm>GS</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1979</pubdate>
            <volume>12</volume>
            <fpage>267</fpage>
            <lpage>317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01732026</pubid>
                  <pubid idtype="pmpid">448746</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Preservation of duplicate genes by complementary, degenerative mutations.</p>
            </title>
            <aug>
               <au>
                  <snm>Force</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pickett</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Amores</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Postlethwait</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>151</volume>
            <fpage>1531</fpage>
            <lpage>1545</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460548</pubid>
                  <pubid idtype="pmpid" link="fulltext">10101175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A gene expression map of Arabidopsis thaliana development.</p>
            </title>
            <aug>
               <au>
                  <snm>Schmid</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Davison</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Henz</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Pape</snm>
                  <fnm>UJ</fnm>
               </au>
               <au>
                  <snm>Demar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scholkopf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weigel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lohmann</snm>
                  <fnm>JU</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>501</fpage>
            <lpage>506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1543</pubid>
                  <pubid idtype="pmpid" link="fulltext">15806101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Rapid divergence in expression between duplicate genes inferred from microarray data.</p>
            </title>
            <aug>
               <au>
                  <snm>Gu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nicolae</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>609</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(02)02837-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">12446139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Divergence in the spatial pattern of gene expression between human duplicate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Makova</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>1638</fpage>
            <lpage>1645</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403737</pubid>
                  <pubid idtype="pmpid" link="fulltext">12840042</pubid>
                  <pubid idtype="doi">10.1101/gr.1133803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Homology-dependent gene silencing in plants.</p>
            </title>
            <aug>
               <au>
                  <snm>Meyer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Physiol Plant Mol Biol</source>
            <pubdate>1996</pubdate>
            <volume>47</volume>
            <fpage>23</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.arplant.47.1.23</pubid>
                  <pubid idtype="pmpid" link="fulltext">15012281</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Pfam entry Glyco_hydro_28</p>
            </title>
            <url>http://www.sanger.ac.uk/cgi-bin/Pfam/getacc?PF00295</url>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>MIPS - Plant genome bioinformatics group</p>
            </title>
            <url>http://mips.gsf.de/proj/thal/</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The MIPS <it>Oryza sativa </it>Database (MOsDB)</p>
            </title>
            <url>http://mips.gsf.de/proj/plant/jsf/rice/index.jsp</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid">7984417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>MEGA2: molecular evolutionary genetics analysis software.</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jakobsen</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>1244</fpage>
            <lpage>1245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.12.1244</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The neighbor-joining method: a new method for reconstructing phylogenetic trees.</p>
            </title>
            <aug>
               <au>
                  <snm>Saitou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1987</pubdate>
            <volume>4</volume>
            <fpage>406</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3447015</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Large-scale duplication database</p>
            </title>
            <url>http://wolfe.gen.tcd.ie/athal/all_results</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p>A protocol for transient gene expression in Arabidopsis thaliana protoplasts isolated from cell suspension cultures.</p>
            </title>
            <aug>
               <au>
                  <snm>Axelos</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Curie</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mazzolini</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bardet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lescure</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Plant Physiol Biochem</source>
            <pubdate>1992</pubdate>
            <volume>30</volume>
            <fpage>123</fpage>
            <lpage>128</lpage>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The Arabidopsis cell plate-associated dynamin-like protein, ADL1Ap, is required for multiple stages of plant growth and development.</p>
            </title>
            <aug>
               <au>
                  <snm>Kang</snm>
                  <fnm>BH</fnm>
               </au>
               <au>
                  <snm>Busse</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Dickey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rancour</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Bednarek</snm>
                  <fnm>SY</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2001</pubdate>
            <volume>126</volume>
            <fpage>47</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102281</pubid>
                  <pubid idtype="pmpid" link="fulltext">11351070</pubid>
                  <pubid idtype="doi">10.1104/pp.126.1.47</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Quantitation of mRNA by polymerase chain reaction.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Mark</snm>
                  <fnm>DF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1989</pubdate>
            <volume>86</volume>
            <fpage>9717</fpage>
            <lpage>9721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">298572</pubid>
                  <pubid idtype="pmpid" link="fulltext">2481313</pubid>
                  <pubid idtype="doi">10.1073/pnas.86.24.9717</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>SIGnAL Salk Institute Genomic Analysis Laboratory</p>
            </title>
            <url>http://signal.salk.edu/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Arabidopsis MPSS plus - about our database</p>
            </title>
            <url>http://mpss.udel.edu/at/</url>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Codon-substitution models for heterogeneous selection pressure at amino acid sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>155</volume>
            <fpage>431</fpage>
            <lpage>449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461088</pubid>
                  <pubid idtype="pmpid" link="fulltext">10790415</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Snedecor</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Cochran</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>Statistical Methods</source>
            <publisher>Ames, Iowa: Iowa State University Press</publisher>
            <pubdate>1980</pubdate>
         </bibl>
      </refgrp>
   </bm>
</art>

