<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-5-r48</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Co-evolutionary networks of genes and cellular processes across fungal species</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Tuller</snm>
               <fnm>Tamir</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>tamirtul@post.tau.ac.il</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Kupiec</snm>
               <fnm>Martin</fnm>
               <insr iid="I2"/>
               <email>martin@post.tau.ac.il</email>
            </au>
            <au id="A3">
               <snm>Ruppin</snm>
               <fnm>Eytan</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>ruppin@post.tau.ac.il</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Computer Sciences, Tel Aviv University, Ramat Aviv 69978, Israel</p>
            </ins>
            <ins id="I2">
               <p>Department of Molecular Microbiology and Biotechnology, Tel Aviv University, Ramat Aviv 69978, Israel</p>
            </ins>
            <ins id="I3">
               <p>School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>5</issue>
         <fpage>R48</fpage>
         <url>http://genomebiology.com/2009/10/5/R48</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19416514</pubid>
               <pubid idtype="doi">10.1186/gb-2009-10-5-r48</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>2</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>24</day>
               <month>2</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>5</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>5</day>
               <month>5</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Tuller et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Co-evolution and co-functionality of fungal genes</p>
      </shorttitle>
      <shortabs>
         <p>Two new measures of evolution are used to study co-evolutionary networks of fungal genes and cellular processes; links between co-evolution and co-functionality are revealed.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The introduction of measures such as evolutionary rate and propensity for gene loss have significantly advanced our knowledge of the evolutionary history and selection forces acting upon individual genes and cellular processes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present two new measures, the 'relative evolutionary rate pattern' (rERP), which records the relative evolutionary rates of conserved genes across the different branches of a species' phylogenetic tree, and the 'copy number pattern' (CNP), which quantifies the rate of gene loss of less conserved genes. Together, these measures yield a high-resolution study of the co-evolution of genes in 9 fungal species, spanning 3,540 sets of orthologs. We find that the evolutionary tempo of conserved genes varies in different evolutionary periods. The co-evolution of genes' Gene Ontology categories exhibits a significant correlation with their functional distance in the Gene Ontology hierarchy, but not with their location on chromosomes, showing that cellular functions are a more important driving force in gene co-evolution than their chromosomal proximity. Two fundamental patterns of co-evolution of conserved genes, cooperative and reciprocal, are identified; only genes co-evolving cooperatively functionally back each other up. The co-evolution of conserved and less conserved genes exhibits both commonalities and differences; DNA metabolism is positively correlated with nuclear traffic, transcription processes and vacuolar biology in both analyses.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Overall, this study charts the first global network view of gene co-evolution in fungi. The future application of the approach presented here to other phylogenetic trees holds much promise in characterizing the forces that shape cellular co-evolution.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The molecular clock hypothesis states that throughout evolutionary history mutations occur at an approximately uniform rate <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. In many cases this hypothesis provides a good approximation of the actual mutation rate <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp> while in other cases it has proven unrealistic <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr></abbrgrp>. The evolutionary rate (ER) of a gene, the ratio between the number of its non-synonymous to synonymous mutations, <it>dN</it>/<it>dS</it>, is a basic measure of evolution at the molecular level. This measure is affected by many systemic factors, including gene dispensability, expression level, number of protein interactions, and recombination rate <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Since the factors that influence evolutionary rate are numerous and change in a dynamic fashion, it is likely that the evolutionary rate of an individual gene may vary between different evolutionary periods. Previous studies have investigated co-evolutionary relationships between genes on a small scale, mainly with the aim of inferring functional linkage <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. These studies were mostly based on the genes' phyletic patterns (the occurrence pattern of a gene in a set of current organisms). Recently, Lopez-Bigas <it>et al. </it><abbrgrp><abbr bid="B18">18</abbr></abbrgrp> performed a comprehensive analysis of the evolution of different functional categories in humans. They showed that certain functional categories exhibit dynamic patterns of sequence divergence across their evolutionary history. Other studies have examined the correlations between genes' evolutionary rates to predict physical protein-protein interactions <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. A recent publication by Juan <it>et al. </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp> focused on <it>Escherichia coli </it>and generated a co-evolutionary network containing the raw tree similarities for all pairs of proteins in order to improve the prediction accuracy of protein-protein interactions. Here our goal and methodology are different; we concentrate on a set of nine fungal species spanning approximately 1,000 million years <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We develop tools to investigate co-evolution in both conserved and less-conserved genes. For the first group, whose members have an identical phylogenetic tree, we employ high-resolution ER measures to investigate gene co-evolution. In the case of less conserved genes, we generalize the concept of propensity for gene loss <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> to encompass the whole phylogenetic tree in order to better understand the driving forces behind co-evolution.</p>
         <p>The first part of this paper describes the analysis of conserved genes. We define a new measure of co-evolution for such genes and study their evolutionary rates along different parts of the evolutionary tree. Next, we reconstruct a co-evolutionary network of genes and a co-evolutionary network of cellular processes according to this measure. In such a network two genes/processes are connected if their co-evolution is correlated. We identify two patterns of co-evolution, correlated (cooperative) and anti-correlated (reciprocal). We show that co-evolution is significantly correlated with co-functionality but not with chromosomal co-organization of genes. We conclude this part by identifying clusters of functions in the co-evolutionary network. Subsequently, in the second part of the paper, we study the evolution of less-conserved genes. We describe a new measure of evolution for such genes and reconstruct a co-evolutionary network of cellular processes according to this measure. We study the resulting clusters in this network and compare it to the co-evolutionary network of the conserved genes.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>The co-evolution of conserved genes</p>
            </st>
            <sec>
               <st>
                  <p>Computing the relative evolutionary rate pattern</p>
               </st>
               <p>First, we focus on the large set of conserved genes (that is, genes that are conserved in all fungal species analyzed), identifying sequence co-evolutionary relationships that are manifested in the absence of major gene gain and loss events. As these co-evolutionary relationships cannot be deciphered by an analysis based on phyletic patterns, and a single evolutionary rate measure is too crude for capturing them, we set out to measure the relative evolutionary rate of each gene at every branch of the evolutionary tree. The resulting new 'relative evolutionary rate pattern' (rERP) measure characterizes a gene's pattern of evolution as a vector of all its relative evolutionary rates in the different branches of a species' phylogenetic tree. A workflow describing the determination of genes' ERPs is presented in Figure <figr fid="F1">1</figr> (for a detailed description of the workflow described in this figure and comparison to other measures of co-evolution see Materials and methods). We analyzed genes from nine fungal species, whose phylogenetic relationship (based on the 18S rDNA <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and on the comparison of 531 informative proteins <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>) is presented in Figure <figr fid="F2">2</figr>. We first created a set of orthologous genes (lacking paralogs) that are conserved in all species, resulting in a dataset of 1,372 sets of orthologs spanning a total of 12,348 genes. Each such set of orthologous genes (SOG) was then aligned, and its ancestral sequences at the internal nodes of the phylogenetic tree were inferred using maximum likelihood. The resulting sets of orthologs and ancestral sequences were then used to estimate the evolutionary rate, <it>dN</it>/<it>dS </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, along each of the tree branches. To consider the selection forces acting on synonymous (<it>S</it>) sites we used an approach similar to that of <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and adjusted the evolutionary rates accordingly. These adjusted evolutionary rates are denoted <it>dN</it>/<it>dS'</it>, and compose an ERP vector that specifies a <it>dN</it>/<it>dS' </it>value for each branch of the evolutionary tree, for each SOG. We next carried out an analysis of the resulting ERP matrix, whose rows are the SOGs, its columns are the tree branches, and its entries denote evolutionary rate values (<it>dN</it>/<it>dS'</it>).</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>The different steps in computing rERP (for additional details see the Materials and methods section)</p>
                  </caption>
                  <text>
                     <p>The different steps in computing rERP (for additional details see the Materials and methods section). AA, amino acids; tAI, tRNA adaptation index.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-1"/>
               </fig>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Phylogeny of the 9 fungal species based on the 18S rRNA <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and 531 concatenated proteins <abbrgrp><abbr bid="B27">27</abbr></abbrgrp></p>
                  </caption>
                  <text>
                     <p>Phylogeny of the 9 fungal species based on the 18S rRNA <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and 531 concatenated proteins <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Each of the leaves and the internal nodes is labeled with numbers between 1 and 15. A branch in the phylogenetic tree is designated by the two nodes it connects.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-2"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>The evolutionary rate along different branches of the evolutionary tree</p>
               </st>
               <p>Our first task was to characterize the global selection regimes acting upon the genes studied. We conservatively limit this investigation to the short branches of the tree (excluding branches (7,15), (15,16), (8,16), (9,16); Figures <figr fid="F2">2</figr> and <figr fid="F3">3</figr>) to avoid potential saturation problems that may bias the ER computation (Materials and methods). Most of the genes exhibit purifying selection (<it>dN</it>/<it>dS' </it>&lt; 0.9) in the majority of the phylogenetic branches, as one would expect <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. A much smaller group of genes under positive (<it>dN</it>/<it>dS' </it>> 1.1) and neutral (0.9 &lt;<it>dN</it>/<it>dS' </it>&lt; 1.1) selection are concentrated in three branches (Figure <figr fid="F3">3</figr>), with the majority located on the branch leading from internal node 12 to internal node 11, probably following the whole genome duplication event known to have occurred at this bifurcation <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. This major duplication event probably served as a driving force underlying this surge of positive selection, by relaxing the functional constraints acting on each of the gene copies <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. This branch also represents a switch from anaerobic (<it>Saccharomyces cerevisiae</it>, <it>Saccharomyces bayanus </it>and <it>Candida glabrata</it>) to aerobic (<it>Aspergillus nidulans</it>, <it>Candida albicans</it>, <it>Debaryomyces hansenii</it>, <it>Kluyveromyces lactis</it>, <it>Yarrowia lipolytica</it>) metabolism <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, which has likely required a large burst of positive evolution in many genes. Additional data file 1 includes a table that depicts the SOGs with positive evolution along this branch (using their <it>S. cerevisiae </it>representative), which is indeed enriched with many metabolic genes. The other two branches under positive selection are the branch between nodes 13 and 14, leading to a subgroup (<it>D. hansenii </it>and <it>C. albicans</it>) that evolved a modified version of the genetic code <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, and the branch between nodes 13 and 15 that leads to <it>Y. lipolytica </it>(which is a sole member in one of the three taxonomical clusters of the Saccharomycotina <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>).</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Number of genes (y-axis) with <it>dN</it>/<it>dS' </it>> 1.1 (positive selection), 1.1 > <it>dN</it>/<it>dS' </it>> 0.9 (neutral selection), and 0.9 > <it>dN</it>/<it>dS' </it>(purifying selection) in each branch (x-axis; see Figure 3)</p>
                  </caption>
                  <text>
                     <p>Number of genes (y-axis) with <it>dN</it>/<it>dS' </it>> 1.1 (positive selection), 1.1 > <it>dN</it>/<it>dS' </it>> 0.9 (neutral selection), and 0.9 > <it>dN</it>/<it>dS' </it>(purifying selection) in each branch (x-axis; see Figure 3).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-3"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Co-evolution of cellular processes</p>
               </st>
               <p>The major goal of this work is to study the co-evolution of gene pairs and of cellular processes. To this end we utilized the ERP matrix to compute the rERP of each conserved SOG. The rERP is a vector containing the relative, ranked <it>dN</it>/<it>dS' </it>(rER) of each SOG in every branch of the evolutionary tree, thus comparing the evolutionary rate of each individual SOG to that of all other SOGs. The ranking procedure is employed to attenuate the effects of noisy estimations of ER values, especially in long branches of the phylogenetic tree (see Note 1 in Additional data file 2). Defining the rERP of a Gene Ontology (GO) process to be the mean rERP of all the genes it contains, we asked which GO processes have the rERP with the highest mean and the highest variance across the different branches of the evolutionary tree (Figure <figr fid="F4">4</figr>). Notably, processes related to energy production, such as the tricarboxylic acid cycle (involved in cellular respiration), and ATP synthesis-coupled proton transport (which includes genes encoding the mitochondrial ATPase) have the highest mean rERP and also exhibit the highest variance of their rERP. This reflects the primary role that energy production has played in fungal evolution, and the effects that changes from anaerobic to aerobic metabolism have had on the development of fungal species. Additional high rERP energy-related GO terms include aerobic respiration and heme biosynthesis. Interestingly, biological functions related to information flow within the cell exhibit high mean rERP values (tRNA export from nucleus, DNA recombination) or high rERP variance (transcription initiation from polymerase II promoter, RNA processing, transcription termination from RNA polymerase II promoter). The trend, however, is not identical for all processes: protein import to the nucleus, for example, has a high rERP value but very little variance. Full lists of conserved genes and GO groups sorted according to their mean rERP and rERP variance appear in Additional data file 3.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>GO categories (biological processes) with extreme mean and variance of their rERPs (for a unbiased comparison we included only GO groups with 5 to 20 genes)</p>
                  </caption>
                  <text>
                     <p>GO categories (biological processes) with extreme mean and variance of their rERPs (for a unbiased comparison we included only GO groups with 5 to 20 genes).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-4"/>
               </fig>
               <p>We carried out a hierarchical clustering of GO-slim functions according to their rERP values, which is depicted in Figure <figr fid="F5">5</figr>. Many GO-slim groups exhibit correlated rERP values. For example, processes related to metabolic activity (such as cellular respiration, carbohydrate metabolism, and generation of precursor metabolites and energy) exhibit high rERP values across the tree, whereas others (cell cycle and meiosis) exhibit markedly lower values. Interestingly, processes related to polarized growth and budding exhibit the lowest overall rERPs. Importantly, the figure shows that rERP values can provide additional information to that contained in the global relative evolutionary rates (that is, those measured by aggregating the whole tree). For example, the two GO-slim process groups plasma membrane and microtubule organization center (Figure <figr fid="F5">5</figr>, middle) have relatively similar (low) relative global evolutionary rates but markedly different rERPs (as they appear in the two extreme parts of the hierarchical clustering). While the standard ER measure checks if the average ER of genes is similar (that is, |ER<sub>1 </sub>- ER<sub>2</sub>|), rERP compares the fluctuations in the ER of genes. Thus, two SOGs may appear similar by one measure and very different when applying the other. Figure <figr fid="F6">6</figr> shows two examples in which the two measures provide opposite results. Notably, the correlation between these two measures is significant but rather low (r = -0.055, <it>P </it>&lt; 10<sup>-16</sup>). Overall, GO groups with functionally related gene sets (that is, those that map closer on the GO ontology network) tend to have similar rERP values (the correlation between distance in the GO graph and average correlation of rERP is -0.96, <it>P</it>-value &lt; 4.5 &#215; 10<sup>-4</sup>; see more details in Figure <figr fid="F7">7</figr>, Additional data file 4, and Materials and methods; this comparison is made using the <it>S. cerevisiae </it>GO ontology and mapping all the SOGs to this ontology).</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Hierarchical clustering of GO groups (for biological process (top), cellular component (middle), and molecular function (bottom)) according to their rERPs</p>
                  </caption>
                  <text>
                     <p>Hierarchical clustering of GO groups (for biological process (top), cellular component (middle), and molecular function (bottom)) according to their rERPs.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-5"/>
               </fig>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Two hypothetical examples that demonstrate the difference between measuring co-evolution using rERP and applying the average ER along the entire evolutionary tree</p>
                  </caption>
                  <text>
                     <p>Two hypothetical examples that demonstrate the difference between measuring co-evolution using rERP and applying the average ER along the entire evolutionary tree. <b>(a) </b>An example in which ER is high but rERP is low: two SOGs (in red) have similar average ER (|E1 - E2| is small) but the correlation between their ERP vectors is low. Note that the level of co-evolution is low in both cases, but the pattern along the phylogenetic tree is very different. <b>(b) </b>A hypothetical evolutionary tree. <b>(c) </b>An example in which ER is low but rERP is high: two SOGs (in blue) have similar ERPs but their mean ERs are different. In this case a similar pattern can be seen despite very different levels of ER.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-6"/>
               </fig>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>Average correlation between the evolutionary patterns of pairs of GO groups (y-axis) as a function of their distance (the shortest connecting pathway) in the GO network (x-axis)</p>
                  </caption>
                  <text>
                     <p>Average correlation between the evolutionary patterns of pairs of GO groups (y-axis) as a function of their distance (the shortest connecting pathway) in the GO network (x-axis). The distribution of correlations in three out of six consecutive pairs of distance bins is significantly different (<it>t</it>-test, <it>P </it>&lt; 0.05). The correlation between distance (x-axis) and average correlation (y-axis) is -0.96 (<it>P </it>&lt; 4.5 &#215; 10<sup>-4</sup>; a similar result was observed when we used the ontology of <it>S. pombe </it>(Additional data file 4)). The increase distance 9-10 though deviating from the overall trend is not significant (<it>P </it>= 0.23).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-7"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Two fundamental types of co-evolution</p>
               </st>
               <p>Having a representative rERP vector for each SOG/process enables us to examine the correlations between them and to learn about their co-evolutionary history. A positive rERP correlation arises when two SOGs/processes exhibit a similar pattern of change in the different branches of the evolutionary tree and have evolved in a coordinated, cooperative C-type fashion. A simple example of such a co-evolution is the mitochondrial genome maintenance and mitochondrial electron transport categories. A marked negative rERP correlation denotes reciprocal, R-type co-evolution where periods of rapid evolution of one SOG/process are coupled with slow evolution in the other; this may arise when the rapid evolution of one process creates a new niche or biochemical activity that, in turn, enables, or selects for, the rapid evolution of the other process. An illustrative R-type example involves the category of methionine biosynthesis, which has a negative rERP correlation with phosphatidylcholine (PC) biosynthesis. PC is synthesized by three successive transfers of methyl groups from S-adenosyl-methionine to phosphatidyl-ethanolamine <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. Thus, the evolution of the PC biosynthetic pathway may be conditioned on the evolution of the methionine biosynthesis pathway, and thus follow it with some time lag (Figure <figr fid="F8">8</figr>). Interestingly, genes that co-evolve in a C-type manner do provide functional backups to each other, having a statistically significant enrichment in genetic interactions (hypergeometric <it>P</it>-value &lt; 0.0039), while genes co-evolving in an R-type manner do not (where the enrichment is studied using the <it>S. cerevisiae </it>genes in each of the pertaining SOGs). We also found that the fraction of sequence-similar SOGs is significantly larger among pairs of C-type co-evolving genes than among pairs of R-type co-evolving genes (Note 2 in Additional data file 2).</p>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>An illustrative example involves the category of methionine biosynthesis, which has a negative rERP correlation with phosphatidylcholine (PC) biosynthesis, an important and abundant structural component of the membranes of eukaryotic cells</p>
                  </caption>
                  <text>
                     <p>An illustrative example involves the category of methionine biosynthesis, which has a negative rERP correlation with phosphatidylcholine (PC) biosynthesis, an important and abundant structural component of the membranes of eukaryotic cells. PC is synthesized by three successive transfers of methyl groups from S-adenosyl-methionine to phosphatidyl-ethanolamine <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>; thus, the evolution of PC biosynthetic pathways may be conditioned by the evolution of methionine biosynthesis pathways, and follow it by some time lag. This phenomenon is demonstrated in the subtree below internal node 11 <b>(a)</b>. The rERPs of these two GO functions are shown in <b>(b)</b>.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-8"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Co-evolutionary network of SOGs and its properties</p>
               </st>
               <p>To track down the co-evolution of SOGs, we generated a co-evolution network where two SOGs (termed, for convenience, according to the <it>S. cerevisiae </it>genes they contain) are connected by an edge only if there is a significant (either positive or negative) Spearman rank correlation (with <it>P </it>&lt; 0.05) between their rERPs. The node degrees in the co-evolution network follow a power-law distribution (Figure <figr fid="F9">9</figr>) and the network has small world properties (the average distance between two nodes is 5.03). Many biological networks (for example, see <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>) exhibit similar properties. The degree in the co-evolutionary network is significantly correlated with the degree in the <it>S. cerevisiae </it>protein interaction network (r = 0.0726, <it>P </it>= 0.0125) but is not significantly correlated with the degree in the <it>S. cerevisiae </it>genetic interaction network, or with the degree in its gene expression network.</p>
               <fig id="F9">
                  <title>
                     <p>Figure 9</p>
                  </title>
                  <caption>
                     <p>The degree distribution in the co-evolution network is not far from a power-law (the plot of the log(number of genes) as a function of the log(degree) appears in the right-upper corner</p>
                  </caption>
                  <text>
                     <p>The degree distribution in the co-evolution network is not far from a power-law (the plot of the log(number of genes) as a function of the log(degree) appears in the right-upper corner. The correlation between these two measures is -0.77, <it>P </it>= 7.4 &#215; 10<sup>-11</sup>.</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-9"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Co-evolution is correlated with similar functionality</p>
               </st>
               <p>A co-evolution network of cellular functional categories was built for each of the three GO ontologies (biological process, cellular component, molecular function), using two significance cutoff values (Spearman <it>P</it>-value &lt; 0.01 and Spearman <it>P</it>-value &lt; 0.001) to determine significant correlations between GO categories. A list of highly correlated pairs of GO terms is provided in Additional data file 5. The correlation between the distance of GO groups in the 0.001 cutoff co-evolution network (that is, their evolutionary distance) and their distance in the corresponding GO ontology network (that is, their functional distance) is highly significant: 0.38 for cellular component, 0.16 for biological process and 0.43 for molecular function (all three with <it>P</it>-values &lt;10<sup>-16</sup>; a similar trend is observed using the 0.01 cutoff network). A similarly marked correlation between evolutionary and functional relationships of GO groups is also found when considering positive and negative co-evolution networks separately (Note 3 in Additional data file 2).</p>
               <p>Similar results were observed when we considered classification according to Enzyme Commission (EC) number <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, which is a numerical classification scheme for enzymes based on the chemical reactions they catalyze. By this classification, the code of each enzyme consists of the letters 'EC' followed by four numbers separated by periods. Those numbers represent progressively finer classifications of the enzyme. Thus, it induces a functional distance. Our analysis shows that pairs of orthologs with smaller functional distance (genes whose first two roughest classification levels are identical) exhibit higher levels of correlation between their rERP than other pairs of orthologs (mean rERP correlation of 0.31 versus 0.27, <it>P </it>= 1.23 &#215; 10<sup>-7</sup>).</p>
            </sec>
            <sec>
               <st>
                  <p>Co-evolutionary score and other properties of cellular functions and SOGs</p>
               </st>
               <p>We did not find a parallel significant correlation between the genomic co-localization of GO groups and their co-evolutionary score (see Materials and methods for a description of how we computed the co-localization score of pairs of GO groups). The co-evolution of genes and their chromosomal location are not correlated even when considering each chromosome separately. Thus, we conclude that cellular functionality is a more important force driving gene co-evolution than their genomic organization.</p>
               <p>The rERP measure correlates well with other systemic qualities such as genetic and physical interactions. The average Spearman correlation between rERP levels of interacting proteins in the <it>S. cerevisiae </it>protein interaction network is 0.063, which is 155 times higher than the average correlation (4.05 &#215; 10<sup>-4</sup>) for non-interacting proteins (<it>P </it>&lt; 10<sup>-16</sup>). Proteins that are part of a complex show a correlation of 0.05 between their rERPs, 100 times higher than the average correlation for proteins that are not a part of the same complex (<it>P </it>&lt; 10<sup>-16</sup>). The Spearman correlation between rERP levels of genetically interacting proteins is 0.02, which is 32 times higher than the average correlation (6.08 &#215; 10<sup>-4</sup>) for non-interacting proteins (<it>P </it>= 2.71 &#215; 10<sup>-6</sup>). Protein rERPs are also correlated with the co-expression of their genes (Spearman correlation 0.063, <it>P </it>&lt; 10<sup>-16</sup>). The significant correlation between co-evolution and physical/functional interactions suggests that physical interactions between the products of conserved genes play a part in their co-evolution. Namely, to maintain the functionality of an interaction, a change in one protein is likely to facilitate the evolution of the proteins interacting with it, as has already been shown <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Yet, as the magnitude of this correlation is rather low, it is likely that other co-evolutionary forces play a part in determining co-evolution, such as the sharing of common and varying growth environments during evolutionary history.</p>
            </sec>
            <sec>
               <st>
                  <p>Clustering of co-evolutionary networks</p>
               </st>
               <p>We employed the PRISM algorithm <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> to partition each of the three GO co-evolution networks (biological process, cellular component, molecular function) into clusters of nodes, such that nodes from one cluster have similar sign connections (denoting positive or negative rERP correlations) with nodes from other clusters. We focus here on biological processes at a significance cutoff value of <it>P </it>&lt; 0.01 (Figure <figr fid="F10">10</figr>). PRISM clusters the process terms into coherent groups in a statistically significant manner (<it>P </it>&lt; 0.001; see Materials and methods), where most of the groups are enriched for particular types of processes: Cluster A7 contains many processes related to DNA metabolism, chromatin formation and RNA processing. This cluster shows strong negative correlations with clusters A6 (amino acid biosynthesis, tricarboxylic acid cycle, glucose oxidization and energy production) and cluster A8 (protein processing and modification). It has also strong positive correlations with cluster A4 (nuclear traffic and DNA repair) and with cluster A5. We note that among the RNA-related processes in cluster A7, some (such as mRNA export from nucleus and poly-A dependent mRNA degradation) show R-type correlations with functions such as protein degradation via the multivesicular pathway. This relationship points to a mode of evolution in which the two catabolic processes (protein and RNA) require coordination, so that changes in one are dependent on preceding changes in the other. Similarly, cluster A6 shows strong coordinated co-evolution with cluster A3 (amino acid and purine biosynthesis, glucose oxidization, energy production and ribosome biology). Both clusters include GO functions related to the production of energy and, thus, coordinated evolution is expected. An overview of the results shows that genes that affect regulatory or information-related processes (DNA metabolism, chromatin formation and RNA processing (cluster A7)) are 'master players'. These master genes/processes exert reciprocal selection forces on many other metabolic process (clusters A8, A3 and A6) and participate in the co-evolution of other processes such as nuclear traffic (cluster A4).</p>
               <fig id="F10">
                  <title>
                     <p>Figure 10</p>
                  </title>
                  <caption>
                     <p>Clustering of biological process GO terms according to their rERP correlations using the PRISM algorithm (with the less stringent significance criterion of <it>P </it>&lt; 0.01)</p>
                  </caption>
                  <text>
                     <p>Clustering of biological process GO terms according to their rERP correlations using the PRISM algorithm (with the less stringent significance criterion of <it>P </it>&lt; 0.01).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-10"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Co-evolution of less conserved genes</p>
            </st>
            <sec>
               <st>
                  <p>The copy number pattern measure</p>
               </st>
               <p>The results presented above were focused on the analysis of a conserved set of genes whose orthologs appear in all nine fungal species studied, comprising 1,372 SOGs and spanning a total of 12,348 genes. The fungal dataset additionally includes 2,168 orthologous sets spanning more than 74,851 genes that exhibit at least one change in their copy number along the phylogenetic tree (and hence have undergone gene loss and/or gene duplication events). The 'propensity for gene loss' (PGL) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> was shown to correlate with gene essentiality, the number of protein-protein interactions and the expression levels of genes. PGL has been used in methods for predicting functional gene linkage <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>, extending upon previous methods that used the occurrence pattern of a gene in different organisms for the same aim <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Recently, a probabilistic approach related to the PGL was developed <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. A related measure, which is also based on a gene's phyletic pattern (the occurrence pattern of a gene in different current organisms), is phylogenetic profiling (PP) <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B43">43</abbr></abbrgrp>. This measure has been employed in previous small scale studies to identify sets of genes with a shared evolutionary history <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B43">43</abbr></abbrgrp>. We describe a new measure of co-evolution that is a generalization/unification of both PGL and PP, termed the copy number pattern (CNP). Like PP, it characterizes each gene by examining its phyletic pattern (but additionally takes into account the number of paralogous copies of each gene in the genome). Like PGL, it exploits the information embedded in a species' phylogenetic tree to more accurately characterize the evolutionary history of each gene (in comparison, PP carries out a similar computation based on just the phyletic pattern). We used the new CNP measure to analyze orthologous sets that exhibit at least one change in copy number along the analyzed phylogentic tree. This set of genes is, by definition, not completely conserved, and complements the conserved set of genes analyzed by the rERP measure.</p>
               <p>Figure <figr fid="F11">11</figr> provides a stepwise overview of CNP computation. Steps A to F are essentially similar to those used to generate the rERP (see Materials and methods): we first generate a set of 2,168 orthologous sets that exhibit at least one change in copy number along the analyzed phylogentic tree. We then translate the resulting set of orthologs to copy numbers in each organism (the number of paralogs in an organism), and reconstruct the ancestral gene copy number using CAFE' <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> (step F in Figure <figr fid="F11">11</figr>). Finally, using the copy number in each internal node, we compute the change (the difference) in copy number for orthologous sets along each edge (step G in Figure <figr fid="F11">11</figr>). The orthologous sets used for the rERP analysis and the orthologous sets used for the CNP analysis include two different, exclusive groups of conserved versus less-conserved genes, respectively. Analysis of these sets reveals that the first group is enriched with metabolic processes while the second is enriched with functional processes such as reproduction and cell differentiation (Additional data file 6).</p>
               <fig id="F11">
                  <title>
                     <p>Figure 11</p>
                  </title>
                  <caption>
                     <p>The different steps in computing CNP (a detailed description is provided in the 'Co-evolution of less conserved genes' section)</p>
                  </caption>
                  <text>
                     <p>The different steps in computing CNP (a detailed description is provided in the 'Co-evolution of less conserved genes' section).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-11"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Co-evolution of less conserved genes with the copy number pattern measure</p>
               </st>
               <p>Since changes in the copy number of genes are infrequent events, the Spearman correlations between pairs of CNP vectors are usually very high (the average Spearman correlation is 0.63). To overcome this, we generated CNP vectors of GO processes (according to the biological processes ontology) where the CNP of a GO category is the mean CNP of all the genes it contains. These GO process vectors exhibit a wider range of CNP values. Next, we constructed a GO process co-evolution network. In this network two biological processes are connected by an edge only if they manifest an extreme co-evolution pattern - that is, if they have a Spearman rank correlation that is higher (green colored edges, denoting coordinated relationships) or lower (red, denoting reciprocal relationships) than the correlation values of X% of the total GO pairs. We examined the networks formed under two edge-selection regimes, a more stringent one where X% = 99.9% and a less stringent one where X% = 98%. The correlation between the distance of GO groups in the network with X% = 99.9% and the distance of GO groups in the different GO ontology networks is highly significant (r = 0.4209, <it>P </it>&lt; 10<sup>-16</sup>) for green, cooperative edges, and negatively correlated (r = -0.12, <it>P </it>&lt; 0.04) for red, reciprocal edges. This suggests that the two types of edges are informative: the green edges represent functional relationships while the red ones represent pairs of GOs with distant functions.</p>
            </sec>
            <sec>
               <st>
                  <p>Clustering of the copy number pattern evolutionary network</p>
               </st>
               <p>To learn more about the structure of the CNP co-evolution network, we used the PRISM algorithm <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> (as in the case of the rERP analysis) to partition each of the GO terms in the less stringent network (X% = 98%; thus obtaining a larger amount of edges and a more robust clustering) into clusters of nodes, such that nodes from one cluster have similar color edge connections with nodes from other clusters. PRISM is able to separate the process terms into coherent groups according to their mutual correlations, in a statistically significant manner (<it>P </it>&lt; 0.001; see Materials and methods). Figure <figr fid="F12">12</figr> displays the results of the PRISM analysis, clustering of the GO biological processes into seven large groups. The seven interconnected groups are enriched with specific processes, and present clear interactions of the red and green edges. As clearly seen in Figure <figr fid="F12">12</figr>, some clusters show mainly reciprocal co-evolution with most of the others (for example, clusters B1 (fatty acid metabolism) and B6 (sugar metabolism). In contrast, clusters B2, B3 and B7 (nuclear traffic, transcription and DNA metabolism) show coordinated (C-type) co-evolution. Cluster B4 (protein modification, chromatin silencing) shows C-type co-evolution with transcription (cluster B3) and nuclear import (cluster B2) as well as DNA metabolism (cluster B7) but R-type relations with cluster B1, which includes fatty acid metabolism and protein glycosylation.</p>
               <fig id="F12">
                  <title>
                     <p>Figure 12</p>
                  </title>
                  <caption>
                     <p>Clustering of biological process GO terms according to their CNP using the PRISM algorithm (<it>P </it>&lt; 0.04)</p>
                  </caption>
                  <text>
                     <p>Clustering of biological process GO terms according to their CNP using the PRISM algorithm (<it>P </it>&lt; 0.04).</p>
                  </text>
                  <graphic file="gb-2009-10-5-r48-12"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Comparison of the co-evolution of conserved versus less-conserved genes</p>
               </st>
               <p>A comparison between the results obtained by the rERP and CNP methodologies at a global level should be done with some caution, for three main reasons. First, these two measures are applicable for the analysis of completely disjoint, complementary sets of orthologs. Second, the two methodologies measure different types of co-evolution. The rERP measures evolution via amino acid substitutions while the CNP measures co-evolution via changes in gene copy number, which are mainly driven by gene gain and loss events. Thus, third, these co-evolutionary relationships are possibly the result of the action of different evolutionary forces. However, it may be noted that some biological processes present the same type of evolutionary relationship with both methods. For example, DNA metabolism is always positively correlated with nuclear traffic, transcription and vacuolar biology (ER-Golgi traffic). Yet, some of the clusters exhibit different relationships when analyzed by the two measures. For example, a cluster containing mainly genes labeled ribosome biology and vacuolar biology exhibits reciprocal evolution with DNA metabolism by rERP (clusters A3 to A7) but coordinated evolution by CNP (clusters B5 to B7). Thus, within a certain biological process, the evolutionary pressures exerted on highly conserved genes may differ from those that apply to less conserved ones, and may thus provide different opportunities for co-evolution.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Our analysis charts the first global network view of the co-evolution of conserved and less conserved genes in nine fungal species. We find that cellular functions play a more important driving force in gene co-evolution than the genes' chromosomal location. Two fundamental patterns of co-evolution, cooperative and reciprocal, are defined, and, remarkably, we find that only genes co-evolving cooperatively functionally back each other up. At the single gene level, the observation that genes have evolved at accelerated rates in a localized manner on only three branches of the fungal tree is in line with previous findings suggesting that a large fraction of DNA mutations can be attributed to punctuated evolution <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. The fungal tree analyzed here is a natural starting point. The future application of the approach presented here to other phylogenetic trees, including the mammalian one, holds much promise in characterizing the forces that shape cellular co-evolution.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Data sources</p>
            </st>
            <p>The GO functional classification used in this work is the most comprehensive, qualitative, and widely used annotation database <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <p>The GO and GO slim annotations and protein composition data for complexes were downloaded from the <it>Saccharomyces </it>Genome Database <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. We checked and report results both for the GO slim classification (Figure <figr fid="F5">5</figr>), the roughest level of classification, and the general GO ontology (Figures <figr fid="F4">4</figr>, <figr fid="F7">7</figr>, <figr fid="F8">8</figr>, <figr fid="F10">10</figr>, and <figr fid="F12">12</figr>), where we filtered GO groups that were too small (with less than five SOGs in our dataset). That is, the main bulk of the analysis was performed across the whole GO ontology, without focusing on any arbitrary level. Note 4 in Additional data file 2 and Additional data file 7 includes statistics and error rates of the annotations used in this work.</p>
            <p>The genetic interaction network data were downloaded from the BioGrid database <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>; similar results were obtained when the genetic interaction network of Tong <it>et al. </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp> was used (data not shown). Recent work showed that only 5% of the genetic interactions are conserved in <it>S. cerevisiae </it>and <it>Caenorhabditis elegans </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Note that the evolutionary distances between these species (1,542 million years, according to <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>) are much larger than those between the organisms in our dataset (20 to 837 million years). Further, <it>C. elegans </it>is multi-cellular while all the analyzed fungi are unicellular. Thus, it is not clear how the conclusions of Tischler <it>et al. </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp> are related to our dataset.</p>
            <p>More importantly, in this work we study the relationship between the genetic interaction network and its co-evolutionary network for one organism (<it>S. cerevisiae</it>) for which we know the genetic interactions network. As there is no reason to believe that this organism is 'special' in any way, we believe that these findings are representative of the expected findings for other organisms if or when their genetic interaction networks become known.</p>
            <p>The protein interaction network of the budding yeast <it>S. cerevisiae </it>was downloaded from the BioGrid database <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
            <p>Gene expression data were taken from the Stanford MicroArray Database <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. The GO ontology network of yeast was downloaded from the Open Biomedical Ontologies Foundry ontologies <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. EC numbers of the analyzed genes were downloaded from the Kyoto Encyclopedia of Genes and Genomes (KEGG) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Computing relative evolutionary rate patterns for orthologous gene sets</p>
            </st>
            <p>The selection of species used here is not arbitrary; obviously, different selections are perhaps equally plausible but several considerations led us to the current selection, which we outline below. For this study, we used fungi whose genomes have been completely assembled (at the time this study was performed: July 2007) according to the National Center for Biotechnology Information (NCBI) and for which we could infer the tRNA gene repertoire reliably and, thus, compute the tRNA adaptation index (tAI). These include <it>S. cerevisiae</it>, <it>C. glabrata</it>, <it>K. lactis</it>, <it>D. hansenii</it>, <it>Y. lipolytica </it>and <it>Schizosaccharomyces pombe</it>. This selection was then augmented by three additional species: <it>C. albicans</it>, an important fungal pathogen for which a high-quality gene collection (including tRNA genes) has recently become available <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>; <it>S. bayanus</it>, a <it>Saccharomyces sensu stricto </it>species that diverged from <it>S. cerevisiae </it>approximately 20 million years ago and for which an overwhelming majority of the open reading frames are available <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>; and <it>A. nidulans</it>, a filamentous fungus with a high-quality sequence. Furthermore, these species were analyzed recently by Man and Pilpel <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, serving as an appropriate reference set for studying evolutionary events in fungi.</p>
            <p>Finally, due to the large evolutionary distance between <it>S. pombe </it>and the hemiascomycotic species (350 to 1,000 million years ago <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>), this set of species present a nice distribution of evolutionary time. We believe that small changes in the set of fungi species would likely yield quite similar results (see details in Note 5 in Additional data file 2).</p>
            <p>The final dataset included genomes of nine fungal species: <it>A. nidulans</it>, <it>C. albicans</it>, <it>C. glabrata</it>, <it>D. hansenii</it>, <it>K. lactis</it>, <it>S. bayanus</it>, <it>S. cerevisiae</it>, <it>S. pombe</it>, <it>Y. lipolytica</it>.</p>
            <p>Computation of the rERPs is a multi-step process (Figure <figr fid="F1">1</figr> provides an overview), described in detail as follows. The phylogenetic tree used to analyze the data (Figure <figr fid="F2">2</figr>) was formed according to the analysis of 18S rRNA data in <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, the analysis of 531 concatenated proteins <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, and the analysis of additional gene sets listed in <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> (step A in Figure <figr fid="F1">1</figr>). The orthologous sets for the nine fungi were downloaded from <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> (step B in Figure <figr fid="F1">1</figr>). This dataset was generated by the MultiParanoid program <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. We considered only sets that include orthologs in all nine species. Sets of homologs that did not include exactly one representative in each organism were removed from our dataset to filter out paralogs and avoid potential errors in evolutionary rate estimation due to duplication events (step C in Figure <figr fid="F1">1</figr>). Horizontal gene transfer events (see, for example, <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>) are rare in fungi <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and thus were not considered in our analysis. The final dataset included 1,372 orthologous sets. Stop codons were removed and each gene was translated to a sequence of amino acids. Each orthologous set was then aligned by CLUSTALW 1.83 <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> with default parameters. By using amino acids as templates for the nucleotide sequences and by ignoring gaps we generated gap-free multiple alignments of the nine orthologous proteins in each orthologous set and their corresponding coding sequences (step D in Figure <figr fid="F1">1</figr>).</p>
            <p>Given the alignments of each set of orthologs and given the phylogenetic tree, we used the codeml program in PAML for the joint reconstruction of ancestral codons <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> in each of the internal nodes of the phylogenetic tree (step E in Figure <figr fid="F1">1</figr>). This reconstruction induced the sequence of ancestral proteins and their corresponding ancestral DNA coding sequences. We hence obtained sets of 16 sequences; 9 from the previous step (corresponding to the 9 leaves of the phylogenetic tree; Figure <figr fid="F2">2</figr>) plus 7 reconstructed sequences of the internal nodes of the phylogenetic tree (ancestral nodes 10-16 in Figure <figr fid="F2">2</figr>). We denote such a set of 16 sequences a 'complete ortholgous set'. For each complete ortholgous set, we computed the <it>dN </it>and <it>dS </it>in each branch of the evolutionary tree using the y00 program in PAML <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B59">59</abbr></abbrgrp> (step F in Figure <figr fid="F1">1</figr>). The outputs of this stage are two vectors of 15 positive real numbers for each complete ortholgous set (1,372 pairs of vectors in our case). These vectors denote the <it>dN </it>and <it>dS </it>values at the 15 different branches of the evolutionary tree.</p>
            <p>To adjust for selection on synonymous sites <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, we used a procedure similar to that described in <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, utilizing the tAI <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> instead of the codon adaptation index <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B62">62</abbr></abbrgrp>. Following Hirsh <it>et al. </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, we assume the following model of evolution on synonymous sites:</p>
            <p>
               <display-formula>
                  <graphic file="gb-2009-10-5-r48-i1.gif"/>
               </display-formula>
            </p>
            <p>where <it>r</it><sub><it>0 </it></sub>is the neutral evolutionary rate, <it>k </it>is a constant, and <it>t </it>is time. Our goal is to estimate <it>dS' = r</it><sub><it>0 </it></sub>&#215; <it>t</it>, which is done using regression. This requires the computation of the tAIs of each of the gene sequences (the leaves of the phylogenetic tree), and the estimation of the tAIs of the sequences at the internal nodes of the phylogenetic tree. To this end (step G in Figure <figr fid="F1">1</figr>) we used the tRNA copy number of each species as reported in <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, and the ancestral tRNA copy numbers were reconstructed following <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> (step I in Figure <figr fid="F1">1</figr>) using the CAF&#201; program.</p>
            <p>The edge lengths (step H in Figure <figr fid="F1">1</figr>) for CAF&#201; were computed by the following steps. Step one: we inferred edge lengths under the molecular clock assumption for the tree topology of Figure <figr fid="F2">2</figr> and the concatenation of all the sets of ortholog proteins (561,072 sites) using the codeml program in PAML <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. Step two: we normalized the log of branch lengths to obtain branch lengths that are integers between 0 and 1,000 that reflect putative time units (this is the requirement of the method of <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Step three: we used an expectation-maximization (EM) algorithm to find the optimal value of &#955; (0.001756) for the model (see <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>).</p>
            <p>It is important to note that by optimizing &#955; we actually optimize the likelihood of the model, and the result is invariant for the choice of the normalization factor of the branch lengths. The relatively similar tRNA copy number distribution of the nine species <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> also induces a quite similar tRNA copy number distribution at the ancestral nodes of the phylogenetic tree. To compute the tAI of each complete orthologous set (step J in Figure <figr fid="F1">1</figr>), we used the Matlab, R, and Pearl scripts from <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> (see <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> for the exact description of how to compute the tAI).</p>
            <p>Thus, by using tAI and <it>dS </it>we were able to adjust the <it>dS </it>values for selection on synonymous sites, resulting in a new value, <it>dS</it>'. This was done for each ortholgous set in each of the tree branches (step K in Figure <figr fid="F1">1</figr>). These <it>dS</it>' values were used for computing the corresponding values of adjusted evolutionary rates, <it>dN</it>/<it>dS</it>'. As mentioned, the idea underlying this step <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> is to assume a linear relationship between <it>dS </it>and tAI, and its computation proceeds as follows.</p>
            <p>Let <it>i </it>(0 &lt;<it>i </it>&lt; 1,373) denote an index of a complete ortholgous set, and let <it>j </it>(0 &lt;<it>j </it>&lt; 16) denote a branch in the phylogenetic tree. We perform the following steps. Step one: for each <it>i </it>and <it>j </it>compute the average tAI for the sequences at the two ends of the branch <it>j</it>; let <it>tAIi</it>, <it>j </it>denote this average. Step two: for each branch <it>j </it>use all the <it>tAI</it><sub><it>i</it>, <it>j </it></sub>and all the <it>dS</it><sub>i, j </sub>(where 0 &lt;<it>i </it>&lt; 1,373) and, by regression, estimate <it>a</it><sub><it>j </it></sub>and <it>b</it><sub><it>j </it></sub>that minimize the least squares error of the model:</p>
            <p>
               <display-formula>
                  <graphic file="gb-2009-10-5-r48-i2.gif"/>
               </display-formula>
            </p>
            <p>Step three: for each <it>i </it>and <it>j </it>and an estimation of the substitution rate on synonymous sites <it>dS</it><sub><it>i</it>, <it>j</it></sub>, the adjusted selection on the synonymous sites is:</p>
            <p>
               <display-formula>
                  <graphic file="gb-2009-10-5-r48-i3.gif"/>
               </display-formula>
            </p>
            <p>The final output of this procedure is a total of 1,373 vectors, each with 15 <it>dN</it>/<it>dS</it>' values denoting the ERP values of each complete orthologous set.</p>
            <p>Usually, for very high levels of substitution rate (long branches in the evolutionary tree), the error in the estimated <it>dS </it>values increases <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. This well known phenomenon is named saturation. Thus, we perform an additional normalization of the <it>dN </it>values by computing the ranked evolutionary rate, rER. The ranked evolutionary rate, rER (step L in Figure <figr fid="F1">1</figr>), is computed separately for each branch of the evolutionary tree. For a given branch, the rank of the <it>dN</it>/<it>dS' </it>of a complete ortholgous set among the <it>dN</it>/<it>dS' </it>values of all the complete ortholgous sets is the number of sets that have lower <it>dN</it>/<it>dS' </it>values in this branch (a number between 1 and the total number of complete ortholgous sets, 1,373). The rERP of a complete ortholgous set is the vector of its ranked evolutionary rate along the 15 branches of the evolutionary tree. Note 6 in Additional data file 2 and Additional data file 8 include a comparison of the <it>dN</it>/<it>dS' </it>values to previous evolutionary rate results in a previous study by Wall <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
            <p>The rERP developed and used in this work is different from the measure used in <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> in many important ways. We ranked the ER and adjusted the computed <it>dN</it>/<it>dS </it>for selection on synonymous sites by using the tAI measure (an approach that has not been used before). Additionally, we used the non-parametric Spearman correlation instead of the Pearson correlation. A comparison of the results obtained using our measure with those obtained using Fraser <it>et al</it>.'s ER measure for studying co-evolution shows that the ratio between the average correlation of physically interacting genes versus non interacting genes is very low when using Fraser <it>et al</it>.'s measure (0.06/0.022 = 2.72), showing a very low discriminative power. In comparison, the ratio obtained using our measure is markedly higher (0.063/4.05 &#215; 10<sup>-4 </sup>= 155), in correspondence with the expectation that interacting proteins would tend to co-evolve much more than non-interacting ones (for example, see <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>).</p>
            <p>Measures that were based only on <it>dN </it>instead of <it>dN</it>/<it>dS </it>performed worse than the rERP mentioned above. Using <it>dN </it>without ranking is problematic as longer branches have, in general, higher <it>dN </it>and, thus, the correlations obtained were very high and quite similar when comparing genes that physically interact and those that do not (r = 0.77 versus r = 0.73).</p>
            <p>When we used ranked <it>dN </it>instead of ranked <it>dN</it>/<it>dS</it>, we achieved better results, which were almost as good as the results we got using rERP, in terms of separating pairs of physically interacting from non-interacting proteins via their co-evolution. For example, the ratio between the correlation of protein-interacting to non-interacting proteins was 20 using ranked <it>dN</it>, weaker than the ratio of 155 observed using rERP values.</p>
         </sec>
         <sec>
            <st>
               <p>Constructing and analyzing the co-evolution networks of GO terms</p>
            </st>
            <p>The co-evolution network of GO terms was constructed as follows. First, consider only GO groups that include at least three genes. Second, compute the rERP of each GO group. Third, compute the Spearman correlation between the rERP of all pairs of GO groups. Fourth, connect a pair of GO groups, G<sub>i </sub>and G<sub>j</sub>, if the following two conditions are satisfied: condition one, they have significant correlation (<it>P </it>&lt; 0.01 in the case of the network in Figure <figr fid="F10">10</figr>), where significance is computed empirically versus a corresponding random shuffled network; condition two, the two sets do not strongly overlap in their gene content, having a Jaccard coefficient &lt; 0.5 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>.</p>
            <p>The distance between GO terms on the GO network was computed by replacing each directed edge in the original graph with an undirected one, and computing the length of the minimal path between the two GO groups.</p>
            <p>The co-evolution network was clustered and visualized using the Matlab implementation of the PRISM algorithm <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. The PRISM algorithm was instrumental to our analysis as it partitions the graph according to the two types of edges (positive (cooperative) or negative (reciprocal) rERP correlations) to get clusters of nodes, such that nodes from one cluster have edges of a similar type with nodes from other clusters. This is of particular interest when studying co-evolution, since it identifies 'monochromatic' relationships between groups of genes/GO functions - that is, groups of genes that relate to each other in either a completely cooperative or a completely reciprocal manner. To the best of our knowledge, no other method/algorithm is available to achieve this goal. Other clustering algorithms do not preserve the 'monochromatic' property and, hence, are not suitable for addressing the question at hand.</p>
            <p>The significance of the monochromaticity of the resulting clustering was computed by comparing the number of conflicts (the number of edges between nodes that are in different clusters and have a color different from that of the majority of edges between the two clusters) in the original clustering to its distribution in 1,000 randomly shuffled networks with similar topological properties.</p>
         </sec>
         <sec>
            <st>
               <p>Analyzing the genomic co-localization of GO terms</p>
            </st>
            <p>We define the distance between two GO groups as the median of all shortest distances between each gene in one GO group to each gene in the other GO group. We did not consider genes that are common to the two GO groups. For estimating to what extent two GO groups tend to be located close to each other in the genome, we computed a <it>P</it>-value based on comparing their median distance to that of a background model obtained by randomly locating all the genes of both groups in the genome, and recomposing their median distance for each such assignment (repeating this process 100 times to obtain a distribution of background model medians). The <it>P</it>-value is the fraction of times that a random shift yields a lower distance between the two GO groups.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>CNP: copy number pattern; EC: Enzyme Commission; ER: evolutionary rate; ERP: evolutionary rate pattern; GO: Gene Ontology; PC: phosphatidylcholine; PGL: propensity for gene loss; PP: phylogenetic profiling; rERP: relative evolutionary rate pattern; SOG: set of orthologous genes; tAI: tRNA adaptation index.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>TT carried out all of the analysis. All authors participated in the design of the study. All authors were involved in drafting and writing the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: a table that includes the orthologous sets that exhibit positive evolution for each of the tree branches (Additional data file <supplr sid="S1">1</supplr>); supplementary notes 1 to 6 (Additional data file <supplr sid="S2">2</supplr>); a table with GO processes with less than 20 genes (biological process ontology) sorted by their mean rERP and variance of rERP (Additional data file <supplr sid="S3">3</supplr>); a figure that includes the mean correlation between the evolutionary patterns of pairs of GO groups (y-axis) as a function of their distance (the shortest connecting pathway) in the GO network (x-axis) when using the ontology of <it>S. pombe </it>(Additional data file <supplr sid="S4">4</supplr>); a table with pairs of GO groups exhibiting a significant correlation between their rERPs (Additional data file <supplr sid="S5">5</supplr>); a table with GO enrichments (biological process) for the conserved and non-conserved genes (Additional data file <supplr sid="S6">6</supplr>); a figure that depicts the distribution of the number of annotations per gene for the conserved and non-conserved genes (Additional data file <supplr sid="S7">7</supplr>); a figure that depicts the ER values computed in our study versus the ER values computed in Wall <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> (Additional data file <supplr sid="S8">8</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <caption>
               <p>Orthologous sets that exhibit positive evolution for each of the tree branches</p>
            </caption>
            <text>
               <p>Orthologous sets that exhibit positive evolution for each of the tree branches.</p>
            </text>
            <file name="gb-2009-10-5-r48-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <caption>
               <p>Supplementary notes 1 to 6</p>
            </caption>
            <text>
               <p>Supplementary notes 1 to 6.</p>
            </text>
            <file name="gb-2009-10-5-r48-S2.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <caption>
               <p>GO processes with less than 20 genes (biological process ontology) sorted by their mean rERP and variance of rERP</p>
            </caption>
            <text>
               <p>GO processes with less than 20 genes (biological process ontology) sorted by their mean rERP and variance of rERP.</p>
            </text>
            <file name="gb-2009-10-5-r48-S3.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional file 4</p>
            </title>
            <caption>
               <p>Mean correlation between the evolutionary patterns of pairs of GO groups (y-axis) as a function of their distance (the shortest connecting pathway) in the GO network (x-axis) when using the ontology of <it>S. pombe</it></p>
            </caption>
            <text>
               <p>Mean correlation between the evolutionary patterns of pairs of GO groups (y-axis) as a function of their distance (the shortest connecting pathway) in the GO network (x-axis) when using the ontology of <it>S. pombe</it>.</p>
            </text>
            <file name="gb-2009-10-5-r48-S4.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <caption>
               <p>Pairs of GO groups exhibiting a significant correlation between their rERPs</p>
            </caption>
            <text>
               <p>Pairs of GO groups exhibiting a significant correlation between their rERPs.</p>
            </text>
            <file name="gb-2009-10-5-r48-S5.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional file 6</p>
            </title>
            <caption>
               <p>GO enrichments (biological process) for the conserved and non-conserved genes</p>
            </caption>
            <text>
               <p>GO enrichments (biological process) for the conserved and non-conserved genes.</p>
            </text>
            <file name="gb-2009-10-5-r48-S6.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional file 7</p>
            </title>
            <caption>
               <p>Distribution of the number of annotations per gene for the conserved and non-conserved genes</p>
            </caption>
            <text>
               <p>Distribution of the number of annotations per gene for the conserved and non-conserved genes.</p>
            </text>
            <file name="gb-2009-10-5-r48-S7.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S8">
            <title>
               <p>Additional file 8</p>
            </title>
            <caption>
               <p>ER values computed in our study versus the ER values computed in Wall <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp></p>
            </caption>
            <text>
               <p>ER values computed in our study versus the ER values computed in Wall <it>et al. </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
            </text>
            <file name="gb-2009-10-5-r48-S8.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>TT is supported by the Edmond J Safra Bioinformatics program at Tel Aviv University and the Yeshay Horowitz Association through the Center for Complexity Science. MK is supported by grants from the Israel Science Foundation, the Israel Cancer Research Fund, the Israel Cancer Association and the US-Israel Bi-National Fund (BSF). ER's research is supported by grants from the Israel Science Foundation, including the Converging Technologies grant with MK, and the Tauber Fund.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular disease, evolution, and genetic heterogeneity.</p>
            </title>
            <aug>
               <au>
                  <snm>Zuckerkandl</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pauling</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Horizons in Biochemistry</source>
            <publisher>New York: Academic Press</publisher>
            <editor>Kasha M, Pullman B</editor>
            <pubdate>1962</pubdate>
            <fpage>189</fpage>
            <lpage>225</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Molecular clocks: four decades of evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>654</fpage>
            <lpage>662</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16136655</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Local molecular clocks in three nuclear genes: divergence times for rodents and other mammals and incompatibility among fossil calibrations.</p>
            </title>
            <aug>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Huchon</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2003</pubdate>
            <volume>57</volume>
            <fpage>S201</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15008417</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Large punctuational contribution of speciation to evolutionary divergence at the molecular level.</p>
            </title>
            <aug>
               <au>
                  <snm>Pagel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Venditti</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Meade</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>314</volume>
            <fpage>119</fpage>
            <lpage>121</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17023657</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Evolutionary rate in the protein interaction network.</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Scharfe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>750</fpage>
            <lpage>752</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11976460</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Protein dispensability and rate of evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>411</volume>
            <fpage>1046</fpage>
            <lpage>1049</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11429604</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Why highly expressed proteins evolve slowly.</p>
            </title>
            <aug>
               <au>
                  <snm>Drummond</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bloom</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Adami</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wilke</snm>
                  <fnm>CO</fnm>
               </au>
               <au>
                  <snm>Arnold</snm>
                  <fnm>FH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>14338</fpage>
            <lpage>14343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242296</pubid>
                  <pubid idtype="pmpid" link="fulltext">16176987</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Functional genomic analysis of the rate of protein evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Kumm</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Giaever</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Feldman</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>5483</fpage>
            <lpage>5488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">555735</pubid>
                  <pubid idtype="pmpid" link="fulltext">15800036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>An integrated view of protein evolution</p>
            </title>
            <aug>
               <au>
                  <snm>P&#225;l</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>337</fpage>
            <lpage>348</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16619049</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The coordinated evolution of yeast proteins is constrained by functional modularity.</p>
            </title>
            <aug>
               <au>
                  <snm>Chena</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dokholyana</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>416</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16797778</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Co-evolutionary rates of functionally related yeast genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Marino-Ramirez</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bodenreider</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kantz</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>IK</fnm>
               </au>
            </aug>
            <source>Evol Bioinform Online</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>295</fpage>
            <lpage>300</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2267758</pubid>
                  <pubid idtype="pmpid" link="fulltext">18345352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Identification of functional links between genes using phylogenetic profiles.</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1524</fpage>
            <lpage>1530</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12912833</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Quantifying modularity in the evolution of biomolecular systems.</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>391</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353226</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993205</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Prolinks: a database of protein functional linkages derived from coevolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Bowers</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Pellegrini</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Fierro</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yeates</snm>
                  <fnm>TO</fnm>
               </au>
               <au>
                  <snm>Eisenberg</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R35</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">416471</pubid>
                  <pubid idtype="pmpid" link="fulltext">15128449</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Discovering functional linkages and uncharacterized cellular pathways using phylogenetic profile comparisons: a comprehensive assessment</p>
            </title>
            <aug>
               <au>
                  <snm>Jothi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Przytycka</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>173</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1904249</pubid>
                  <pubid idtype="pmpid" link="fulltext">17521444</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Refined phylogenetic profiles method for predicting protein-protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Sun</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>3409</fpage>
            <lpage>3415</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15947018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2229</fpage>
            <lpage>2235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403683</pubid>
                  <pubid idtype="pmpid" link="fulltext">14525925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Functional protein divergence in the evolution of <it>Homo sapiens</it></p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>De</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Teichmann</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>R33</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2374701</pubid>
                  <pubid idtype="pmpid" link="fulltext">18279504</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Correlated mutations containinformation about protein-protein interaction.</p>
            </title>
            <aug>
               <au>
                  <snm>Pazos</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Helmer-Citterich</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ausiello</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>271</volume>
            <fpage>511</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9281423</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Co-evolutionary analysis reveals insights into protein-protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Goh</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>FE</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2002</pubdate>
            <volume>324</volume>
            <fpage>177</fpage>
            <lpage>192</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12421567</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p><it>In silico </it>two-hybrid system for the selection of physically interacting protein pairs.</p>
            </title>
            <aug>
               <au>
                  <snm>Pazos</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2002</pubdate>
            <volume>47</volume>
            <fpage>219</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11933068</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Exploiting the co-evolution of interacting proteins to discover interaction specificity.</p>
            </title>
            <aug>
               <au>
                  <snm>Ramani</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2003</pubdate>
            <volume>327</volume>
            <fpage>273</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12614624</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Coevolution of gene expression among interacting proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>9033</fpage>
            <lpage>9038</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">439012</pubid>
                  <pubid idtype="pmpid" link="fulltext">15175431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>High-confidence prediction of global interactomes based on genome-wide coevolutionary networks</p>
            </title>
            <aug>
               <au>
                  <snm>Juan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Pazos</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Valencia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <fpage>934</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2242690</pubid>
                  <pubid idtype="pmpid" link="fulltext">18199838</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Systematics and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Berbee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>The Mycota</source>
            <publisher>Berlin: Springer</publisher>
            <editor>McLaughlin D, McLaughlin E, Lemke P</editor>
            <pubdate>2001</pubdate>
            <volume>VIIB</volume>
            <fpage>229</fpage>
            <lpage>245</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Phylogeny and systematics of the fungi with special reference to the Ascomycota and Basidiomycota.</p>
            </title>
            <aug>
               <au>
                  <snm>Prillinger</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lopandic</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Schweigkofler</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Deak</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aarts</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sterflinger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kraus</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Maraz</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Chem Immunol</source>
            <pubdate>2002</pubdate>
            <volume>81</volume>
            <fpage>207</fpage>
            <lpage>295</lpage>
            <xrefbib>
               <pubid idtype="pmpid">12102002</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogenomics reveal a robust fungal tree of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Kuramae</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boekhout</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>FEMS Yeast Res</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>1213</fpage>
            <lpage>1220</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17156018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>32</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10666704</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Adjusting for selection on synonymous sites in estimates of evolutionary distance.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirsh</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>174</fpage>
            <lpage>177</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15371530</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Getting positive about selection.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>331</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193638</pubid>
                  <pubid idtype="pmpid" link="fulltext">12914654</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Molecular evidence for an ancient duplication of the entire yeast genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Shields</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>708</fpage>
            <lpage>713</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9192896</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <aug>
               <au>
                  <snm>Ohno</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Evolution by Gene Duplication</source>
            <publisher>Berlin: Springer</publisher>
            <pubdate>1970</pubdate>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Differential translation efficiency of orthologous genes is involved in phenotypic divergence of yeast species.</p>
            </title>
            <aug>
               <au>
                  <snm>Man</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pilpel</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>415</fpage>
            <lpage>421</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17277776</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Genome evolution in yeasts.</p>
            </title>
            <aug>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sherman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Durrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Casaregola</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lafontaine</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>De Montigny</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Neuve'glise</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Talla</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Goffard</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Frangeul</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Aigle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Babour</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Barnay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blanchin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beckerich</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Beyne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bleykasten</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boisrame'</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Confanioleri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>De Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Despons</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fabre</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Fairhead</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ferry-Dumazet</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>35</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15229592</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Yeast genome evolution-the origin of the species.</p>
            </title>
            <aug>
               <au>
                  <snm>Scannell</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Yeast</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>929</fpage>
            <lpage>942</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17621376</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Methyl transferring enzyme system of microsomes in the biosynthesis of lecithin (phosphatidylcholine).</p>
            </title>
            <aug>
               <au>
                  <snm>Bremer</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>1961</pubdate>
            <volume>46</volume>
            <fpage>205</fpage>
            <lpage>216</lpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Phosphatidylcholine synthesis in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Chin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>J Lipid Res</source>
            <pubdate>1988</pubdate>
            <volume>29</volume>
            <fpage>9</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3282018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Global mapping of the yeast genetic interaction network.</p>
            </title>
            <aug>
               <au>
                  <snm>Tong</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Lesage</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bader</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xin</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Berriz</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Brost</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chua</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Friesen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Haynes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Humphries</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hussein</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Krogan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Levinson</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Me'nard</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Munyana</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Parsons</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Ryan</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Tonikian</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <fpage>808</fpage>
            <lpage>813</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14764870</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The yeast coexpression network has a small-world, scale-free architecture and can be explained by a simple model.</p>
            </title>
            <aug>
               <au>
                  <snm>van Noort</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>280</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1299002</pubid>
                  <pubid idtype="pmpid" link="fulltext">14968131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Enzyme EC Numbers</p>
            </title>
            <url>http://www.genome.ad.jp/htbin/get_htext?ECtable</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Modular epistasis in yeast metabolism.</p>
            </title>
            <aug>
               <au>
                  <snm>Segre</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Deluna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Kishony</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>77</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15592468</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Gene loss rate: a probabilistic measure for the conservation of eukaryotic genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Borenstein</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shlomi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ruppin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>e7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1802574</pubid>
                  <pubid idtype="pmpid" link="fulltext">17158152</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Gene annotation and network inference by phylogenetic profiling.</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>DeLisi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>80</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1388238</pubid>
                  <pubid idtype="pmpid" link="fulltext">16503966</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>CAFE: a computational tool for the study of gene family evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>De Bie</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cristianini</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Demuth</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>1269</fpage>
            <lpage>1271</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16543274</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Gene Ontology</p>
            </title>
            <url>http://www.geneontology.org/index.shtml</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p><it>Saccharomyces </it>Genome Database</p>
            </title>
            <url>http://www.yeastgenome.org/</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p>BioGRID Database</p>
            </title>
            <url>http://www.thebiogrid.org/</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Evolutionary plasticity of genetic interaction networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Tischler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <fpage>390</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18362882</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The Stanford Microarray Database.</p>
            </title>
            <aug>
               <au>
                  <snm>Marinelli</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Montgomery</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Prapong</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Nitzberg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zachariah</snm>
                  <fnm>ZK</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Natkunam</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>West</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Rijn</snm>
                  <mnm>van de</mnm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>152</fpage>
            <lpage>155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29818</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125075</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The Open Biomedical Ontologies</p>
            </title>
            <url>http://obofoundry.org/</url>
         </bibl>
         <bibl id="B51">
            <title>
               <p>KEGG LIGAND Database</p>
            </title>
            <url>ftp://ftp.genome.jp/pub/kegg/ligand/</url>
         </bibl>
         <bibl id="B52">
            <title>
               <p>A human-curated annotation of the Candida albicans genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Braun</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>van Het Hoog</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>d'Enfert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Martchenko</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dungan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Inglis</snm>
                  <fnm>DO</fnm>
               </au>
               <au>
                  <snm>Uhl</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hogues</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lorenz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Levitin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oberholzer</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Bachewich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Harcus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Marcil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dignard</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Iouk</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zito</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Frangeul</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tekaia</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Munro</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Bates</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gow</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Hoyer</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>K&#246;hler</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Morschh&#228;user</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Newport</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <fpage>36</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1183520</pubid>
                  <pubid idtype="pmpid" link="fulltext">16103911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Proof and evolutionary analysis of ancient genome duplication in the yeast <it>Saccharomyces cerevisiae Nature</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <fpage>617</fpage>
            <lpage>624</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15004568</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Phylogenetic relationships among yeasts of the '<it>Saccharomyces </it>complex' determined from multigene sequence analyses.</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtzman</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Robnett</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>FEMS Yeast Res</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>417</fpage>
            <lpage>432</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12748053</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Automatic clustering of orthologs and in paralogs shared by multiple proteomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Alexeyenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tamas</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>e9</fpage>
            <lpage>e15</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16873526</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Do orthologous gene phylogenies really support tree-thinking?</p>
            </title>
            <aug>
               <au>
                  <snm>Bapeste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Susko</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Leigh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>MacLeod</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2005</pubdate>
            <volume>5</volume>
            <fpage>33</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1156881</pubid>
                  <pubid idtype="pmpid" link="fulltext">15913459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Multiple sequence alignment with the Clustal series of programs.</p>
            </title>
            <aug>
               <au>
                  <snm>Ahenna</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Sugawara</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Koike</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3497</fpage>
            <lpage>3500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168907</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824352</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>A fast algorithm for joint reconstruction of ancestral amino acid sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Pupko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Pe'er</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Shamir</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Graur</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>890</fpage>
            <lpage>896</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10833195</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>PAML: a program package for phylogenetic analysis by maximum likelihood.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>555</fpage>
            <xrefbib>
               <pubid idtype="pmpid">9367129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Gene expression and molecular evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>660</fpage>
            <lpage>666</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11682310</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Solving the riddle of codon usage preferences: a test for translational selection.</p>
            </title>
            <aug>
               <au>
                  <snm>dos Reis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Savva</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wernisch</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>5036</fpage>
            <lpage>5044</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">521650</pubid>
                  <pubid idtype="pmpid" link="fulltext">15448185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>The codon adaptation index: a measure of directional synonymous codon usage bias and its potential applications.</p>
            </title>
            <aug>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>WH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1987</pubdate>
            <volume>15</volume>
            <fpage>1281</fpage>
            <lpage>1295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">340524</pubid>
                  <pubid idtype="pmpid" link="fulltext">3547335</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Codon substitution in evolution and the "saturation" of synonymous changes.</p>
            </title>
            <aug>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1983</pubdate>
            <volume>105</volume>
            <fpage>1011</fpage>
            <lpage>1027</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1202222</pubid>
                  <pubid idtype="pmpid" link="fulltext">6642197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>The distribution of flora in the alpine zone.</p>
            </title>
            <aug>
               <au>
                  <snm>Jaccard</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>New Phytologist</source>
            <pubdate>1912</pubdate>
            <volume>11</volume>
            <fpage>37</fpage>
            <lpage>50</lpage>
         </bibl>
         <bibl id="B65">
            <title>
               <p>The COG database: an updated version includes eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Estimating the annotation error rate of curated GO database sequence annotations.</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>170</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1892569</pubid>
                  <pubid idtype="pmpid" link="fulltext">17519041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Gene Ontology annotation quality analysis in model eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Buza</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>McCarthy</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bridges</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Burgess</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>e12</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2241866</pubid>
                  <pubid idtype="pmpid" link="fulltext">18187504</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

