<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-2-r18</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Reduced efficacy of selection in regions of the <it>Drosophila </it>genome that lack crossing over</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Haddrill</snm>
               <mi>R</mi>
               <fnm>Penelope</fnm>
               <insr iid="I1"/>
               <email>p.haddrill@ed.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Halligan</snm>
               <mi>L</mi>
               <fnm>Daniel</fnm>
               <insr iid="I1"/>
               <email>Daniel.Halligan@ed.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Tomaras</snm>
               <fnm>Dimitris</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>dimtom@hotmail.com</email>
            </au>
            <au id="A4">
               <snm>Charlesworth</snm>
               <fnm>Brian</fnm>
               <insr iid="I1"/>
               <email>Brian.Charlesworth@ed.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JT, UK</p>
            </ins>
            <ins id="I2">
               <p>15 Smirnis St, 15669, Papagou, Athens, Greece</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>2</issue>
         <fpage>R18</fpage>
         <url>http://genomebiology.com/2007/8/2/R18</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17284312</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-2-r18</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>10</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>18</day>
               <month>12</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>6</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>06</day>
               <month>02</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Haddrill et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Less effective selection in the absence of crossing over</p>
      </shorttitle>
      <shortabs>
         <p>Observations from a genome-wide comparison of <it>Drosophila melanogaster </it>and <it>Drosophila yakuba </it>are consistent with a severe reduction in the efficacy of selection in the absence of crossing over, resulting in the accumulation of deleterious mutations in these regions.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The recombinational environment is predicted to influence patterns of protein sequence evolution through the effects of Hill-Robertson interference among linked sites subject to selection. In freely recombining regions of the genome, selection should more effectively incorporate new beneficial mutations, and eliminate deleterious ones, than in regions with low rates of genetic recombination.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We examined the effects of recombinational environment on patterns of evolution using a genome-wide comparison of <it>Drosophila melanogaster </it>and <it>D. yakuba</it>. In regions of the genome with no crossing over, we find elevated divergence at nonsynonymous sites and in long introns, a virtual absence of codon usage bias, and an increase in gene length. However, we find little evidence for differences in patterns of evolution between regions with high, intermediate, and low crossover frequencies. In addition, genes on the fourth chromosome exhibit more extreme deviations from regions with crossing over than do other, no crossover genes outside the fourth chromosome.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>All of the patterns observed are consistent with a severe reduction in the efficacy of selection in the absence of crossing over, resulting in the accumulation of deleterious mutations in these regions. Our results also suggest that even a very low frequency of crossing over may be enough to maintain the efficacy of selection.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Patterns of molecular evolution can be profoundly different between loci that differ in their recombinational environment. This is due to Hill-Robertson interference <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, whereby any locus linked to another that is under directional selection experiences a reduction in effective population size (N<sub>e</sub>). Because the efficacy of selection on a mutation is a function of the product of N<sub>e </sub>and the selection coefficient on a mutation (s), this linkage affects the probability of fixation of a new mutation <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>; favourable mutations are less likely to reach fixation, whereas the opposite is true for deleterious mutations. In other words, selection at one locus has the effect of increasing the effects of genetic drift at another, linked locus. Recombination reduces the effect of this interference, increasing N<sub>e </sub>and hence the efficacy of selection. We would therefore expect higher levels of adaptation, and lower rates of fixation of deleterious mutations, in genomic regions with high levels of genetic recombination, as compared with regions with little or no recombination <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Various studies have found evidence for such effects, primarily in nonrecombining genomes or chromosomes. For example, endosymbiotic bacteria experience small population sizes and minimal rates of recombination, resulting in accumulation of mildly deleterious mutations and possibly also reduced rates of adaptation <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The neo-sex chromosomes of <it>Drosophila miranda </it>have also provided compelling evidence for the effects of Hill-Robertson interference, showing elevated rates of fixation of deleterious mutations, and a reduced rate of adaptive evolution on the nonrecombining neo-Y chromosome, compared with the neo-X chromosome <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. In addition to studies examining rates of evolution in nonrecombining chromosomes and genomes, investigation of different recombinational environments within the same genome or chromosome have proved fruitful. For example, in two studies of different recombination regions in <it>Drosophila</it>, Betancourt and Presgraves <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Presgraves <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> concluded that recombination affects the efficiency of selection on amino acid sequences, with reduced rates of adaptive evolution in regions of low recombination; these also experience higher frequencies of mildly deleterious segregating mutations <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>However, these two studies used samples that represent only a small fraction of genes found in the <it>Drosophila </it>genome, which may also be biased toward genes that are known to be rapidly evolving, so that the results may not be entirely representative of the genome as a whole. In addition, it is currently unclear what proportion of amino acid differences between species is the result of positive selection. Some studies have found evidence that much protein evolution in <it>Drosophila </it>is the result of positive selection <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, but the generality of this result is still uncertain. The relative proportions of mutations that are advantageous as opposed to deleterious will be important in determining the influence of recombinational environment on patterns of evolution. The genomes of a number of <it>Drosophila </it>species have recently been sequenced, or are in the process of being sequenced, so that genome-wide comparisons are now possible. We use a dataset of more than 7,500 genes from <it>D. melanogaster </it>and the closely related species <it>D. yakuba </it>to examine the effects of recombinational environment on rates and patterns of evolution in coding and noncoding sequences, and on measures of adaptation at the molecular level.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>The final dataset consisted of 7,612 genes, divided into recombination regions as follows: high crossover frequency (<it>n </it>= 3,859), intermediate crossover frequency (<it>n </it>= 2,555), low crossover frequency (<it>n </it>= 1,111), and no crossing over (<it>n </it>= 87). We also divided the no crossover category into fourth chromosome (<it>n </it>= 67) and non-fourth chromosome genes (<it>n </it>= 20), in order to examine whether there are any differences between no crossover genes on chromosomes with crossing over, and genes on a chromosome that is entirely crossover free. Sample sizes for the intron analyses were as follows: 10,407 in genes with high crossover frequency (6,474 short [&#8804;80 base pairs (bp)] and 3,933 long [>80 bp]); 6,965 in genes with intermediate crossover frequency (4,445 short and 2,520 long); 2,898 in genes with low crossover frequency (1,800 short and 1,098 long); 218 in genes with no crossover (120 short and 97 long); 181 in fourth chromosome genes (96 short and 85 long); and 37 in non-fourth chromosome, no crossover genes (24 short and 13 long). We refer to crossing over rather than recombination, because there is evidence that gene conversion occurs in regions of the <it>D. melanogaster </it>genome with very low or zero frequencies of crossing over <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>We found a highly significant effect of recombinational environment on levels of the codon-based PAML measures of sequence divergence (see Materials and methods, below) d<sub>N</sub>, d<sub>S</sub>, and d<sub>N</sub>/d<sub>S </sub>(Kruskal-Wallis test: d<sub>N</sub>, H = 36.84, degrees of freedom [df] = 3, <it>P </it>&lt; 10<sup>-4</sup>; d<sub>S</sub>, H = 40.03, df = 3, <it>P </it>&lt; 10<sup>-4</sup>; and d<sub>N</sub>/d<sub>S</sub>, H = 38.16, df = 3, <it>P </it>&lt; 10<sup>-4</sup>; Figure <figr fid="F1">1</figr>). The no crossover region exhibits elevated levels of d<sub>N </sub>and d<sub>N</sub>/d<sub>S</sub>, with median values being approximately double those found in other recombination regions, and, surprisingly, a somewhat reduced value for d<sub>S</sub>. To further investigate this, we used Gestimator (see Materials and methods, below) to calculate values of the nucleotide site based measures of divergence K<sub>A </sub>and K<sub>S</sub>. Although these results exhibit qualitatively the same patterns as the d<sub>N </sub>and d<sub>S </sub>values, recombinational environment exhibited a significant effect only on K<sub>A </sub>values and not on K<sub>S </sub>values (K<sub>A</sub>, H = 38.21, df = 3, <it>P </it>&lt; 10<sup>-4</sup>; K<sub>S</sub>, H = 1.15, df = 3, <it>P </it>= 0.76; Figure <figr fid="F1">1</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Notched box-plots of d<sub>N</sub>, d<sub>S</sub>, d<sub>N</sub>/d<sub>S</sub>, K<sub>A</sub>, K<sub>S </sub>and K<sub>A</sub>/K<sub>S </sub>for each recombination region</p>
            </caption>
            <text>
               <p>Notched box-plots of d<sub>N</sub>, d<sub>S</sub>, d<sub>N</sub>/d<sub>S</sub>, K<sub>A</sub>, K<sub>S </sub>and K<sub>A</sub>/K<sub>S </sub>for each recombination region. Shown are notched box-plots of d<sub>N</sub>, d<sub>S</sub>, d<sub>N</sub>/d<sub>S</sub>, K<sub>A</sub>, K<sub>S </sub>and K<sub>A</sub>/K<sub>S </sub>for regions of high (H), intermediate (I), and low (L) frequency of crossing over and regions of no crossing over, divided into non-fourth chromosome genes (N<sub>O</sub>), fourth chromosome genes (N<sub>4</sub>), and all no crossing over region genes (N<sub>A</sub>). The box extends from the lower to the upper quartile, with a line in the middle at the median. The dotted bars represent the 5th and 95th percentiles. The notches represent an estimate of the uncertainty about the medians for box-to-box comparison; when the notches for two samples do not overlap, the medians of the two groups differ at the 5% significance level.</p>
            </text>
            <graphic file="gb-2007-8-2-r18-1"/>
         </fig>
         <p>Although pairwise tests indicate that there are some significant differences in divergence measures between high, intermediate, and low crossover regions (data not shown), the magnitude of these differences is extremely small compared with the difference between the no crossover region and the rest of the genome (Figure <figr fid="F1">1</figr>). The no crossover region is therefore the only region to show clear evidence of a distinctly different rate of nonsynonymous evolution, and there is an indication that it may also have a reduced rate of synonymous evolution.</p>
         <p>However, when we examined differences between the two groups of genes within no crossover regions, namely the fourth chromosome and non-fourth chromosome genes, we found some surprising differences. Compared with the fourth chromosome genes, the non-fourth chromosome genes exhibit levels of nonsynonymous and synonymous evolution that are closer to those of the high, intermediate, and low crossover regions (Figure <figr fid="F1">1</figr>), and they are not significantly different from these regions (Wilcoxon rank sum test on d<sub>N</sub>, d<sub>S</sub>, d<sub>N</sub>/d<sub>S</sub>, K<sub>A</sub>, and K<sub>S</sub>; non-fourth versus high, intermediate and low combined: <it>P </it>> 0.34 in all cases) or the fourth chromosome genes (non-fourth versus fourth: <it>P </it>> 0.05 in all cases).</p>
         <p>We also examined three measures of codon usage bias: effective number of codons (ENC), the frequency of optimal codons (Fop), and the GC content of the third position of codons (GC3; see Materials and methods, below, and Table <tblr tid="T1">1</tblr>). As expected from previous work <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>, the no crossover region shows almost no evidence of codon usage bias, with elevated ENC and reduced Fop compared with the other recombination regions. Interestingly, the non-fourth chromosome genes within the no crossover category appear to exhibit levels of codon usage bias intermediate between the crossing over regions and the fourth chromosome genes. Betancourt and Presgraves <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> found that Fop was strongly negatively correlated with d<sub>N </sub>in their dataset but weakly positively correlated with d<sub>S</sub>. We find a significantly negative correlation between Fop and d<sub>N </sub>in all recombination regions (Spearman rank correlation [R<sub>s</sub>] with 95% confidence interval [CI; obtained by bootstrapping across genes]: high crossover R<sub>s </sub>= -0.436, 95% CI = -0.463 to -0.409; intermediate crossover R<sub>s </sub>= -0.476, 95% CI = -0.508 to -0.444; low crossover R<sub>s </sub>= -0.438, 95% CI = -0.487 to -0.383; no crossover R<sub>s </sub>= -0.228, 95% CI = -0.435 to -0.042), although the relationship is much weaker in the no crossover region. When the no crossover region is divided into fourth and non-fourth chromosome genes, the correlations are both still negative although not significantly so (fourth chromosome R<sub>s </sub>= -0.078, 95% CI = -0.346 to 0.166; non-fourth chromosome R<sub>s </sub>= -0.135, 95% CI = -0.616 to 0.350). However, the relationship with d<sub>S </sub>is less clear; the correlations are not significantly different from zero in high, intermediate, and no crossover regions (high R<sub>s </sub>= -0.001, 95% CI = -0.031 to 0.034; intermediate R<sub>s </sub>= 0.018, 95% CI = -0.022 to 0.059; no R<sub>s </sub>= -0.076, 95% CI = -0.317 to 0.169), but significantly positive in low crossover regions (low R<sub>s </sub>= 0.222, 95% CI = 0.165 to 0.284). The fourth chromosome genes show a significantly negative correlation between Fop and d<sub>S </sub>(R<sub>s </sub>= -0.283, 95% CI = -0.502 to -0.022)), whereas for non-fourth chromosome, no crossover genes the relationship is nonsignificantly positive (R<sub>s </sub>= 0.480, 95% CI = -0.013 to 0.776).</p>
         <tbl id="T1" hint_layout="double">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Measures of codon usage bias, GC content, and gene length in the different recombination regions</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>ENC</p>
                  </c>
                  <c ca="left">
                     <p>Fop</p>
                  </c>
                  <c ca="left">
                     <p>GC total</p>
                  </c>
                  <c ca="left">
                     <p>GC3</p>
                  </c>
                  <c ca="left">
                     <p>Length</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>High</p>
                  </c>
                  <c ca="left">
                     <p>47.41 (47.23-47.62)</p>
                  </c>
                  <c ca="left">
                     <p>0.545 (0.542-0.547)</p>
                  </c>
                  <c ca="left">
                     <p>0.548 (0.546-0.549)</p>
                  </c>
                  <c ca="left">
                     <p>0.669 (0.666-0.672)</p>
                  </c>
                  <c ca="left">
                     <p>1517 (1477-1562)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Intermediate</p>
                  </c>
                  <c ca="left">
                     <p>48.57 (48.33-48.83)</p>
                  </c>
                  <c ca="left">
                     <p>0.532 (0.528-0.535)</p>
                  </c>
                  <c ca="left">
                     <p>0.541 (0.538-0.543)</p>
                  </c>
                  <c ca="left">
                     <p>0.655 (0.651-0.659)</p>
                  </c>
                  <c ca="left">
                     <p>1476 (1424-1520)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Low</p>
                  </c>
                  <c ca="left">
                     <p>47.86 (47.44-48.25)</p>
                  </c>
                  <c ca="left">
                     <p>0.548 (0.543-0.554)</p>
                  </c>
                  <c ca="left">
                     <p>0.549 (0.546-0.552)</p>
                  </c>
                  <c ca="left">
                     <p>0.672 (0.664-0.679)</p>
                  </c>
                  <c ca="left">
                     <p>1489 (1422-1556)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>No (N<sub>O</sub>)</p>
                  </c>
                  <c ca="left">
                     <p>52.24 (49.69-54.42)</p>
                  </c>
                  <c ca="left">
                     <p>0.424 (0.378-0.467)</p>
                  </c>
                  <c ca="left">
                     <p>0.511 (0.466-0.547)</p>
                  </c>
                  <c ca="left">
                     <p>0.572 (0.495-0.638)</p>
                  </c>
                  <c ca="left">
                     <p>1238 (915-1590)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>No (N<sub>4</sub>)</p>
                  </c>
                  <c ca="left">
                     <p>54.14 (53.43-54.82)</p>
                  </c>
                  <c ca="left">
                     <p>0.263 (0.251-0.276)</p>
                  </c>
                  <c ca="left">
                     <p>0.422 (0.411-0.432)</p>
                  </c>
                  <c ca="left">
                     <p>0.368 (0.354-0.383)</p>
                  </c>
                  <c ca="left">
                     <p>2692 (2053-3532)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>No (N<sub>A</sub>)</p>
                  </c>
                  <c ca="left">
                     <p>53.70 (52.89-54.50)</p>
                  </c>
                  <c ca="left">
                     <p>0.300 (0.280-0.321)</p>
                  </c>
                  <c ca="left">
                     <p>0.432 (0.421-0.446)</p>
                  </c>
                  <c ca="left">
                     <p>0.393 (0.374-0.414)</p>
                  </c>
                  <c ca="left">
                     <p>2358 (1860-3016)</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Values reported for ENC, Fop, GC total, and GC3 are means for all genes from <it>D. melanogaster </it>and <it>D. yakuba </it>combined (95% confidence interval). Values for gene length are the mean number of base pairs in <it>D. melanogaster </it>for all constitutively spliced exons concatenated, for each gene (95% confidence interval). The no crossing over region is divided as follows: N<sub>O</sub>, non-fourth chromosome genes; N<sub>4</sub>, fourth chromosome genes; and N<sub>A</sub>, all no crossing over region genes. ENC, effective number of codons; Fop, frequency of optimal codons; GC3, GC content of the third position of codons.</p>
            </tblfn>
         </tbl>
         <p>Because there has been some suggestion that comparisons of estimates of d<sub>S </sub>from PAML can be misleading when there are large differences in codon usage bias among genes <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, we also examined the relationship between Fop and the nucleotide site-based estimators K<sub>A </sub>and K<sub>S</sub>. Consistent with Bierne and Eyre-Walker <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, the results for K<sub>A </sub>agree very closely with those for d<sub>N </sub>(high R<sub>s </sub>= -0.416, 95% CI = -0.442 to -0.386; intermediate R<sub>s </sub>= -0.456, 95% CI = -0.487 to -0.424; low R<sub>s </sub>= -0.404, 95% CI = -0.456 to -0.350; no R<sub>s </sub>= -0.240, 95% CI = -0.425 to -0.010; fourth R<sub>s </sub>= -0.089, 95% CI = -0.332 to 0.142; non-fourth, no crossover R<sub>s </sub>= -0.123, 95% CI = -0.592 to 0.384). The correlation between Fop and K<sub>S</sub>, however, is quite different from that between Fop and d<sub>S</sub>, being strongly negative in all recombination regions except the no crossover region, where the relationship is not significantly different from zero (high R<sub>s </sub>= -0.377, 95% CI = -0.405 to -0.348; intermediate R<sub>s </sub>= -0.392, 95% CI = -0.425 to -0.359; low R<sub>s </sub>= -0.289, 95% CI = -0.338 to -0.227; no R<sub>s </sub>= 0.194, 95% CI = -0.059 to 0.422; fourth R<sub>s </sub>= 0.096, 95% CI = -0.159 to 0.359; non-fourth, no crossover R<sub>s </sub>= 0.358, 95% -0.118 to 0.743). This is consistent with findings reported by Marais and coworkers <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
         <p>The no crossover region also has a much lower GC content at third position sites (GC3) compared with regions with crossing over (Table <tblr tid="T1">1</tblr>), as expected from the fact that preferred codons in <it>D. melanogaster </it>and its relatives mostly end in G or C <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Again, mean GC3 for the non-fourth chromosome genes in the no crossover category is intermediate between the crossing over regions and the genes on the fourth chromosome. If selection for codon usage bias is virtually absent in the no crossover region, then synonymous sites are likely to be evolving close to neutrally. We might therefore expect the GC content at third position sites in the no crossover regions to be closer to equilibrium than in other recombination regions (see Marais and Piganeau <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>).</p>
         <p>To examine this, we compared the GC3 values with the GC content in noncoding regions. GC content was calculated for intronic sites in all recombination regions, and the introns were divided into short and long size classes, because these are known to differ dramatically in their rates of evolution <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Figure <figr fid="F2">2</figr> and Additional data file 1 show the results of this analysis. Interestingly, the mean GC3 value for the no crossover region (0.39) is similar to the GC content of short introns in other recombination regions (high 0.35, intermediate 0.35, low 0.39). Again, the GC contents of introns in the non-fourth chromosome genes lie between those of the fourth chromosome genes and the rest of the genome. Because short introns represent a class of sites that are likely to be relatively free from selective constraints <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, this suggests that the base composition of third position sites in the no crossover region are indeed closer to neutral equilibrium than those in other recombination regions, as would be expected if the efficacy of selection for codon usage bias were severely limited in this region.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>GC content of the third position of codons, short introns, and long introns for each recombination region</p>
            </caption>
            <text>
               <p>GC content of the third position of codons, short introns, and long introns for each recombination region. GC content at the third position of codons (GC3), short introns (&#8804;80 base pairs [bp]) and long introns (>80 bp) for regions of high, intermediate and low frequency of crossing over and regions of no crossing over. Values reported are means per site for all introns from <it>D. melanogaster </it>and <it>D. yakuba </it>combined; error bars indicate 95% confidence interval (CI) obtained by bootstrapping by gene/intron.</p>
            </text>
            <graphic file="gb-2007-8-2-r18-2"/>
         </fig>
         <p>We also calculated divergence values for short and long introns (omitting sequences that include splice sites) in the different recombination regions, and these show some interesting patterns (Figure <figr fid="F3">3</figr> and Additional data file 1). Long introns have much lower divergence than short introns, confirming the pattern previously reported between <it>D. melanogaster </it>and <it>D. simulans </it>introns <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. This pattern is seen in high, intermediate, and low crossing over regions, but not in the no crossover region, where long and short introns exhibit almost identical levels of divergence. This is true when we examine only the fourth chromosome genes; although the non-fourth chromosome genes exhibit lower levels of divergence in long introns than the fourth chromosome genes, there is still a marked increase in intron divergence when comparing regions with crossing over with non-fourth chromosome, no crossover genes.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Divergence in short and long introns in each recombination region</p>
            </caption>
            <text>
               <p>Divergence in short and long introns in each recombination region. Divergence between <it>D. melanogaster </it>and <it>D. yakuba </it>for short introns (&#8804;80 base pairs [bp]) and long introns (>80 bp) for regions of high, intermediate, and low frequency of crossing over, and regions of no crossing over. Values reported are means per site, corrected for multiple hits [50]. Error bars indicate 95% confidence interval (CI) obtained by bootstrapping by intron.</p>
            </text>
            <graphic file="gb-2007-8-2-r18-3"/>
         </fig>
         <p>Previous work also identified a negative relationship between intron length and divergence, and the same pattern is seen here for high, intermediate, and low crossover regions, but not for no crossover regions (Spearman rank correlation [R<sub>s</sub>] with 95% CI [obtained by bootstrapping across introns]: high R<sub>s </sub>= -0.465, 95% CI = -0.481 to -0.450; intermediate R<sub>s </sub>= -0.383, 95% CI = -0.404 to -0.361; low R<sub>s </sub>= -0.322, 95% CI = -0.355 to -0.287; no R<sub>s </sub>= 0.050, 95% CI = -0.095 to 0.188; fourth R<sub>s </sub>= 0.111, 95% CI = -0.066 to 0.262; non-fourth, no crossover R<sub>s </sub>= -0.209, 95% CI = -0.498 to 0.096). The correlation is not significantly different from zero in no crossover regions, and this is true even after using RepeatMasker to mask any microsatellite and/or interspersed repeats (no R<sub>s </sub>= 0.018, 95% CI = -0.132 to 0.158; fourth R<sub>s </sub>= 0.085, 95% CI = -0.074 to 0.251; non-fourth R<sub>s </sub>= -0.245, 95% CI = -0.529 to 0.080; proportion RepeatMasked, including splice sites: no = 0.254; fourth = 0.277; non-fourth, no crossover = 0.152). We further examined this issue by estimating the linear regressions of intron divergence on log intron length for each recombinational environment, because these provide a quantitative estimate of the strength of the relationship; bootstrapping was again used to assess significance. The regression coefficients get closer to zero moving from high to no crossover regions, and are significantly negative in high, intermediate, and low crossover regions, but not in any of the no crossover regions (regression coefficients: high = -0.0503, 95% CI = -0.0518 to -0.0488; intermediate = -0.0423, 95% CI = -0.0448 to -0.0412; low = -0.0356, 95% CI = -0.0384 to -0.0327; no = -0.0008, 95% CI -0.0089 to 0.0068; fourth = 0.0035, 95% CI = -0.0042 to 0.0123; non-fourth, no crossover = -0.0186, 95% CI = -0.0366 to 0.0001). The fact that the regression coefficients are significantly different between high, intermediate, low, and no crossover regions suggests that the efficacy of selection decreases as recombination rate decreases.</p>
         <p>Finally, we examined gene length in all recombination regions, because there is evidence suggesting that gene length tends to increase when selective constraints are relaxed <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Consistent with this, there was a significant effect of recombination region on gene length (Kruskal-Wallis test: &#967;<sup>2 </sup>= 16.71, df = 3, <it>P </it>&lt; 10<sup>-3</sup>; Table <tblr tid="T1">1</tblr>), with genes on the fourth chromosome being longer than those in high, intermediate, and low crossover regions as well as non-fourth chromosome genes in no crossover regions.</p>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>One major conclusion from our analysis is that there is a higher rate of nonsynonymous site evolution in the regions of the <it>Drosophila </it>genome that apparently lack crossing over, as compared with regions with low to high rates of crossing over (Figure <figr fid="F1">1</figr>). We also found little evidence of differences in d<sub>N </sub>or d<sub>N</sub>/d<sub>S </sub>between low, intermediate, and high crossover regions. This contrasts with the results of Betancourt and Presgraves <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Presgraves <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, who found higher nonsynonymous divergence between <it>D. melanogaster </it>and <it>D. simulans </it>in regions of high recombination when compared with the rest of the genome. The reason for this difference is not entirely clear, but it may reflect the fact that the previous studies were based on relatively few genes. These might have included some genes with unusually high rates of amino acid sequence evolution in the high recombination regions. Consistent with this possibility, Betancourt and Presgraves <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Presgraves <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> found a much higher mean ratio of nonsynonymous to synonymous divergence in high recombination regions than in Figure <figr fid="F1">1</figr>. Marais and coworkers <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> also failed to detect any evidence for a positive correlation between the rate of crossing over and nonsynonymous divergence in a comparison of <it>D. melanogaster </it>and a set of cDNA sequences from <it>D. yakuba</it>; they used similar methods to those of Betancourt and Presgraves <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Presgraves <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> to estimate recombination rates, and so the difference in conclusion is unlikely to reflect differences in methods between studies. Rather, as pointed out by Marais and coworkers <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, it is more likely to reflect a bias toward fast-evolving genes in these datasets.</p>
         <p>Overall, our results fail to identify faster amino acid sequence evolution in regions of high recombination, but rather they suggest the opposite pattern. They are consistent with less effective selection against weakly deleterious, nonsynonymous mutations when crossing over is effectively absent, as is suggested by studies of the <it>D. miranda </it>neo-sex chromosome system <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, and as is expected from increased Hill-Robertson effects when crossing over is rare or absent <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. It is, of course, conceivable that the no crossover regions experience a faster rate of adaptive evolution of amino acid mutations, but there is no theoretical basis for expecting this. We also found a significant increase in divergence for long introns (Figure <figr fid="F3">3</figr>) in the no crossover region compared with the rest of the genome; recent studies <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> show that longer introns are subject to greater selective constraints than short ones, and so this observation is also consistent with a weakening of selective constraints when recombination rates are very low. Definitive proof of the inference of a relaxation of purifying selection in no crossover regions would require comparisons of within-species polymorphism with between-species divergence, as was done for the <it>D. miranda </it>neo-sex chromosome system <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, but suitable data are not yet available.</p>
         <p>Another interesting aspect of the results is that the PAML analysis suggests a lower d<sub>S </sub>in the no crossover region of the genome, compared with other regions, although this is not seen in the analysis of K<sub>S </sub>(Figure <figr fid="F1">1</figr>), and we also found a negative relation between Fop and K<sub>S</sub>, but not d<sub>S</sub>, in all but the no crossover regions. This difference between the behavior of the two estimators of synonymous divergence is similar to that found by Bierne and Eyre-Walker <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. If, as they suggest, estimates of K<sub>S </sub>more accurately reflect divergence at synonymous sites when there are differences in codon usage, our results suggest that selection for codon usage bias acts to reduce divergence at these sites in high, intermediate, and low recombination regions but that this effect is reduced in the absence of crossing over. The K<sub>S </sub>values are thus likely to provide a more reliable indicator of levels of divergence at synonymous sites. For noncoding sites, only K<sub>S </sub>can be used; the results show that divergence in short introns decreases from high to no crossover regions (Figure <figr fid="F3">3</figr>), which is opposite to what is seen for long introns.</p>
         <p>There is a slight increase in GC content between high, intermediate, and low crossover regions for short introns (Figure <figr fid="F2">2</figr>), so that the corresponding decrease in short intron divergence might reflect differences in GC to AT mutational biases among these regions, which can cause a negative relationship between divergence and GC content <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. However, there is a large drop in GC content for short introns in no crossover regions, coupled with a reduction in divergence, which cannot be explained by the mutational bias hypothesis. One possibility is a weakly mutagenic effect of recombination processes, as has been suggested for humans <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Ometto and coworkers <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> report a similar pattern for long introns and other noncoding sequences, for divergence between <it>D. melanogaster </it>and <it>D. simulans</it>. The lack of a similar effect on K<sub>S </sub>for synonymous sites may reflect the fact that the efficacy of selection on codon usage appears to be drastically reduced in the no crossover regions; this would allow a higher rate of synonymous substitutions <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, counter-acting any effect of reduced recombination on mutation rates.</p>
         <p>There has been some controversy over the negative correlation between GC content/codon usage and rate of crossing over in the <it>D melanogaster </it>genome, which has been reported in previous studies. Marais and coworkers <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp> argued that this correlation mainly reflects the effect of differences in mutational bias and/or the rate of biased gene conversion (BGC) in favor of GC versus AT, which should affect putatively neutral noncoding sequences, whereas Kliman and Hey <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and Hey and Kliman <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> argued for an effect of reduced recombination on the efficacy of selection. These analyses used longer introns to estimate the effects of mutational bias and BGC, on the grounds that these are less likely to be affected by selective constraints on splice sites and hence evolve neutrally. As we have seen, this assumption is probably incorrect. Our results for short introns outside the no crossover regions show, if anything, the opposite pattern to that expected based on the BGC hypothesis, because for short introns there is a slight decrease in divergence and increase in GC content between high and low crossover regions. Long introns exhibit almost no differences in GC content moving from high to low crossover regions, but they show a slight increase in divergence. Their GC content drops substantially in the no crossover regions. This behavior of the GC content of long introns is similar to that reported by Kliman and Hey <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Long introns, but not short ones, exhibit a large increase in divergence in the no crossover regions (Figure <figr fid="F3">3</figr>), which is consistent with a relaxation of selective constraints. GC content at third coding positions is still higher than for introns, even in the no crossover regions (Figure <figr fid="F2">2</figr>), suggesting that there is still some selection in favor of preferred codons in these regions.</p>
         <p>Overall, these patterns suggest that selective constraints on weakly deleterious amino acid mutations, mutations to nonpreferred codons, and weakly deleterious mutations in long introns are reduced in genomic regions where crossing over is virtually absent, but they are little affected by rates of crossing over in other regions. One caveat concerning our conclusions is that the recombinational landscape may well differ between <it>D. melanogaster </it>and <it>D. yakuba</it>, for which there is some evidence <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. As described in Materials and methods (below), we have attempted to eliminate genes that differ between the species with respect to their location in telomeric and centromeric regions, where crossing over is absent or greatly reduced <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. However, we cannot exclude smaller differences between species in recombination patterns. Such differences may be why there is little or no effect of crossing over rate in regions of low to high recombination, despite the fact that these are known to show clear patterns with respect to neutral diversity in <it>D. melanogaster </it><abbrgrp><abbr bid="B13">13</abbr><abbr bid="B28">28</abbr></abbrgrp>. <it>D. melanogaster </it>might have only relatively recently evolved low recombination over more extensive regions than in its common ancestor with <it>D. yakuba</it>. Because codon usage, GC content, and divergence must change over longer time scales than neutral diversity within species, this could account for the discrepancy between the pattern for diversity and the other statistics.</p>
         <p>The other possibility is that there is a strongly nonlinear effect of recombination on Hill-Robertson effects. This does not seem likely for either selective sweep or background selection processes <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, but it does apply to Muller's ratchet <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and Hill-Robertson interference among groups of weakly selected sites <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. However, it is unclear whether these effects would be strong enough to explain the patterns that we observe.</p>
         <p>There is a general tendency for the effects that we detect to involve primarily differences between chromosome four and the rest of the genome, such that effects on genes with no crossing over that are located on other chromosomes appear to be weak or absent (Table <tblr tid="T1">1</tblr> and Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>, <figr fid="F3">3</figr>). This may well reflect the fact that the fourth chromosome is a block of more than 80 genes that fail to crossover with each other, whereas there are much smaller numbers of genes in the no crossover regions of the other chromosomes. There is thus much less opportunity for enhanced Hill-Robertson effects on the latter. It is noteworthy that this difference between the fourth chromosome and other no crossover regions is most marked for the rate of nonsynonymous substitution, for which selective constraints are likely to be stronger and hence more resistant to a moderate reduction in effective population size <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Alternatively, this pattern may reflect the fact that, although chromosome four has a stable history of no crossing over, it is unclear whether this is true of the no crossover regions on other chromosomes (see above).</p>
         <p>There is one other apparent anomaly in the results, which at first sight is difficult to explain. This is the extreme reduction in GC content of short introns in the no crossover regions (Figure <figr fid="F2">2</figr>), to a mean level that is lower than that of long introns, despite the evidence for a reduced effectiveness of selection in these regions. This may reflect a drastic reduction in the intensity or efficacy of BGC in favor of GC in this region, because this would be the only deterministic force affecting neutral sequences, whereas long introns and synonymous sites are subject to both selection and BGC. This is seemingly inconsistent with the similarity in divergence for long and short introns in the no crossover regions. However, with very weak selection there can be interactions between mutational bias and the product of effective population size and selection coefficient, causing substitution rates to be almost flat or even increase with N<sub>e</sub>s in regions where N<sub>e</sub>s is very small <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Thus, it is theoretically possible for long introns to be under effectively stronger selection than short introns, but to show the same or even higher levels of divergence in no crossover regions.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have examined the effect of recombinational environment, in terms of the frequency of crossing over, on the rates of nonsynonymous and synonymous evolution, codon usage bias, and evolution of noncoding DNA. Although we find only very small differences between regions of high, intermediate, and low crossing over frequency, the absence of crossing over appears to have a profound effect on patterns of molecular evolution. The no crossover regions exhibit elevated levels of nonsynonymous evolution, a virtual absence of codon usage bias, and similar levels of divergence for short and long intron size classes. These patterns are all consistent with a dramatic reduction in the efficacy of selection in the absence of crossing over, as a result of greatly enhanced effects of Hill-Robertson interference.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Fourth chromosome data</p>
            </st>
            <p>FlyBase <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was used to download a list of all <it>D. melanogaster </it>genes with cytological map locations in bands 101 and 102. The genome annotation for each of these genes was examined, and any genes without expressed sequence tag or cDNA hits, or without any genome annotation, were eliminated. For the remaining genes, decorated fasta files containing coding regions were downloaded from FlyMine <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Where genes are not alternatively spliced, the entire coding region was used in the analysis. For genes that are alternatively spliced, only constitutively spliced exons were used. Exons were also eliminated if they overlapped with coding sequence on the opposite strand. Homologous sequences from <it>D. yakuba </it>were found using BLAST searches on the DroSpeGe website <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and individual exons aligned by eye using Sequencher (Gene Codes, Ann Arbor, MI, USA). Exons were then concatenated and a fasta file containing the entire coding region for both species was exported for each locus. These fasta files were then aligned and analyzed as described below for the non-fourth chromosome data.</p>
         </sec>
         <sec>
            <st>
               <p>Non-fourth chromosome data</p>
            </st>
            <p>In order to generate alignments of constitutively spliced exons from coding sequences between <it>D. melanogaster </it>and <it>D. yakuba</it>, we used a modified version of the methods described by Halligan and Keightley <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. This involved obtaining a list of all currently annotated <it>D. melanogaster </it>genes from NCBI's Entrez Gene (using release 4.1 of the <it>D. melanogaster </it>genome), giving a total of 14,183 annotations. From this list, RNA genes and poorly annotated genes were excluded by examining the Flybase synopsis report for each gene, and excluding genes that were based on BLASTX data or gene prediction data only. Genbank format files were then downloaded for the remaining genes (including all annotated spliceforms), to give a dataset of 11,267 Genbank files. We extracted all annotated exons for a randomly chosen spliceform from each gene and used a reciprocal best-hits BLAST approach to identify and extract orthologous exons from the November 2005 freeze of the <it>D. yakuba </it>genome sequence (Genome Sequencing Center, Washington University School of Medicine, St Louis, MI, USA. Short exons (&lt;40 bp) were joined, where possible, to an adjacent section of noncoding DNA (either intronic or intergenic) prior to BLASTing, to increase the chance of a reciprocal best-hit. We used the locations of the orthologous exons in the draft <it>D. yakuba </it>genome sequence to retrieve the orthologous intron sequences. Introns were only retrieved if two adjacent <it>D. melanogaster </it>exons were identified on the same strand and same contig in the <it>D. yakuba </it>genome.</p>
            <p>Coding sequences (formed by concatenating the retrieved exons from the chosen spliceform) from both species were aligned using the amino acid alignment obtained from CLUSTALW <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Genes were removed from the data set if the coding sequence was invalid in either species. A coding sequence was considered to be valid if it started with a start codon, ended with a stop codon, was a multiple of 3 bp in length, and contained no internal stop codons. We removed exons that were not constitutively spliced (not present in every annotated spliceform) from the coding sequences and ensured that the remaining exons were in frame and a multiple of 3 bp in length. Introns were initially aligned using MAVID <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> and were subsequently realigned at a finer scale using MCALIGN2 <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> by splitting the MAVID alignments into sections of approximately 500 bp at regions of high homology (>8 bp runs of ungapped matches). Introns were removed from the dataset if the sequence in either species did not start and end with a 2 bp consensus sequence ('AT', 'GT', or 'GC' at the 5' end and 'AG' at the 3' end). All alignments (both intronic and coding) with fewer than 10 valid bases (A, T, G, or C) or fewer than 20 valid/invalid bases (A, T, G, C, or N) in either species were discarded. Any clearly nonhomologous sections were masked from all alignments (defined as regions where divergence was above 0.25 within a 40 to 60 bp sliding window).</p>
         </sec>
         <sec>
            <st>
               <p>Estimating measures of divergence and codon usage bias</p>
            </st>
            <p>Maximum-likelihood estimates of d<sub>N </sub>and d<sub>S </sub>for each gene were obtained using Codeml in the PAML package <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, using runmode = -2. Because estimates of d<sub>N</sub>/d<sub>S </sub>are likely to be unreliable for very short genes, alignments less than 150 bp (50 codons) in length were removed. We also used Gestimator <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, which implements the method of Comeron <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, to calculate values of K<sub>A </sub>and K<sub>S</sub>. Estimates of ENC and Fop were calculated using codonw <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. GC content was estimated both for the entire coding sequence and for the third positions of codons only (GC3). We also estimated GC content and divergence (corrected for multiple hits <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>) in introns, following removal of 8 bp/30 bp at the beginning/end of the introns to exclude any sites that may be subject to selective constraints <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Recombination regions</p>
            </st>
            <p>The entire dataset was sorted according to cytologic map location, and was divided into groups with high, intermediate, and low frequencies of crossing over, and a group with no crossing over, based on the regions described by Charlesworth <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> (Additional data file 2). In addition to this, data from a number of cytologic bands were eliminated from the analysis, as described by Marais and coworkers <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. This removes genes in telomeric and centromeric polytene bands that have shifted in position between <it>D. melanogaster </it>and <it>D. yakuba</it>, and hence will have experienced a major change in recombinational environment.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data files are available with the online version of this article. Additional data file <supplr sid="S1">1</supplr> contains information on mean values of GC content and divergence for short and long intron classes in the different recombination regions. Additional data file <supplr sid="S2">2</supplr> contains information on the division of data into recombination classes based on cytologic location.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Information on mean values of GC content and divergence for short and long intron classes in the different recombination regions</p>
            </caption>
            <text>
               <p>Information on mean values of GC content and divergence for short and long intron classes in the different recombination regions</p>
            </text>
            <file name="gb-2007-8-2-r18-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Information on the division of data into recombination classes based on cytologic location</p>
            </caption>
            <text>
               <p>Information on the division of data into recombination classes based on cytologic location</p>
            </text>
            <file name="gb-2007-8-2-r18-S2.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We gratefully acknowledge that the <it>D. yakuba </it>data used in this study were produced by the Genome Sequencing Center at Washington University School of Medicine in St. Louis. We thank Andrea Betancourt, Casey Bergman, Kelly Dyer, Bill Hill, John Welch, and two anonymous reviewers for useful comments and discussions. This work was supported by a Wellcome Trust VIP Award to PRH; DLH is supported by the Wellcome Trust and BC by the Royal Society.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The effect of linkage on the limits of artificial selection.</p>
            </title>
            <aug>
               <au>
                  <snm>Hill</snm>
                  <fnm>WG</fnm>
               </au>
               <au>
                  <snm>Robertson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>1966</pubdate>
            <volume>8</volume>
            <fpage>269</fpage>
            <lpage>294</lpage>
            <xrefbib>
               <pubid idtype="pmpid">5980116</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The Neutral Theory of Molecular Evolution</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genetic linkage and molecular evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Gordo</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>R684</fpage>
            <lpage>R686</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(01)00408-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">11553339</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genome evolution: recombination speeds up adaptive evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>R68</fpage>
            <lpage>R70</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(02)01432-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12546809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Accelerated evolution and Muller's ratchet in endosymbiotic bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>2873</fpage>
            <lpage>2878</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">39726</pubid>
                  <pubid idtype="pmpid" link="fulltext">8610134</pubid>
                  <pubid idtype="doi">10.1073/pnas.93.7.2873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Evidence for genetic drift in endosymbionts (<it>Buchnera</it>): analyses of protein-coding genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>83</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10331254</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The roles of positive and negative selection in the molecular evolution of insect endosymbionts.</p>
            </title>
            <aug>
               <au>
                  <snm>Fry</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2005</pubdate>
            <volume>355</volume>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2005.05.021</pubid>
                  <pubid idtype="pmpid" link="fulltext">16039807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Reduced adaptation of a non-recombining neo-Y chromosome.</p>
            </title>
            <aug>
               <au>
                  <snm>Bachtrog</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>416</volume>
            <fpage>323</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/416323a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11907578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Adaptation shapes patterns of evolution on sexual and asexual chromosomes in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bachtrog</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>34</volume>
            <fpage>215</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1164</pubid>
                  <pubid idtype="pmpid" link="fulltext">12754509</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Sex chromosome evolution: molecular aspects of Y-chromosome degeneration in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bachtrog</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1393</fpage>
            <lpage>1401</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1240082</pubid>
                  <pubid idtype="pmpid" link="fulltext">16169921</pubid>
                  <pubid idtype="doi">10.1101/gr.3543605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Evolution of amino acid sequences and codon usage on the <it>Drosophila miranda </it>neo-sex chromosomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Bartolom&#233;</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <fpage>2033</fpage>
            <lpage>2044</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1534/genetics.106.064113</pubid>
                  <pubid idtype="pmpid" link="fulltext">17028318</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Linkage limits the power of natural selection in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Betancourt</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Presgraves</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>13616</fpage>
            <lpage>13620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">129723</pubid>
                  <pubid idtype="pmpid" link="fulltext">12370444</pubid>
                  <pubid idtype="doi">10.1073/pnas.212277199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Recombination enhances protein adaptation in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Presgraves</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1651</fpage>
            <lpage>1656</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2005.07.065</pubid>
                  <pubid idtype="pmpid" link="fulltext">16169487</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The genomic rate of adaptive amino acid substitution in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Bierne</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>1350</fpage>
            <lpage>1360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh134</pubid>
                  <pubid idtype="pmpid" link="fulltext">15044594</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Adaptive evolution of non-coding DNA in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>1149</fpage>
            <lpage>1152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04107</pubid>
                  <pubid idtype="pmpid" link="fulltext">16237443</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Estimating the genomewide rate of adaptive protein evolution in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Welch</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>173</volume>
            <fpage>821</fpage>
            <lpage>837</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1526489</pubid>
                  <pubid idtype="pmpid" link="fulltext">16582427</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.056911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Linkage disequilibria and the site frequency spectra in the <it>su(s) </it>and <it>su(w<sup>a</sup>) </it>regions of the <it>Drosophila melanogaster </it>X chromosome.</p>
            </title>
            <aug>
               <au>
                  <snm>Langley</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Lazzaro</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Heikkinen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Braverman</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>156</volume>
            <fpage>1837</fpage>
            <lpage>1852</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461393</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102378</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Patterns of genetic variation at a chromosome 4 locus of <it>Drosophila melanogaster </it>and <it>D. simulans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kreitman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2002</pubdate>
            <volume>160</volume>
            <fpage>493</fpage>
            <lpage>507</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461959</pubid>
                  <pubid idtype="pmpid" link="fulltext">11861556</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Hill-Robertson interference is a minor determinant of variations in codon bias across <it>Drosophila melanogaster </it>and <it>Caenorhabditis elegans </it>genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Piganeau</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1399</fpage>
            <lpage>1406</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12200468</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Hill-Robertson interference in <it>Drosophila melanogaster</it>: reply to Marais, Mouchiroud and Duret.</p>
            </title>
            <aug>
               <au>
                  <snm>Kliman</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>2003</pubdate>
            <volume>81</volume>
            <fpage>89</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0016672302006067</pubid>
                  <pubid idtype="pmpid">12872910</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The problem of counting sites in the estimation of the synonymous and nonsynonymous substitution rates: implications for the correlation between the synonymous substitution rate and codon usage bias.</p>
            </title>
            <aug>
               <au>
                  <snm>Bierne</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>165</volume>
            <fpage>1587</fpage>
            <lpage>1597</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462865</pubid>
                  <pubid idtype="pmpid" link="fulltext">14668405</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Correlated evolution of synonymous and nonsynonymous sites in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Domazet-Losos</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tautz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2004</pubdate>
            <volume>59</volume>
            <fpage>771</fpage>
            <lpage>779</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-004-2671-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">15599509</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Synonymous codon usage in <it>Drosophila melanogaster</it>: natural selection and translational accuracy.</p>
            </title>
            <aug>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1994</pubdate>
            <volume>136</volume>
            <fpage>927</fpage>
            <lpage>935</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205897</pubid>
                  <pubid idtype="pmpid">8005445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Patterns of intron sequence evolution in <it>Drosophila </it>are dependent upon length and GC content.</p>
            </title>
            <aug>
               <au>
                  <snm>Haddrill</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Halligan</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Andolfatto</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1273634</pubid>
                  <pubid idtype="pmpid" link="fulltext">16086849</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-8-r67</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Ubiquitous selective constraints in the <it>Drosophila </it>genome revealed by genome-wide interspecies comparison.</p>
            </title>
            <aug>
               <au>
                  <snm>Halligan</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Keightley</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>875</fpage>
            <lpage>884</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1484454</pubid>
                  <pubid idtype="pmpid" link="fulltext">16751341</pubid>
                  <pubid idtype="doi">10.1101/gr.5022906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Molecular evolution between <it>Drosophila melanogaster </it>and <it>D. simulans</it>: reduced codon bias, faster rates of amino acid substitution, and larger proteins in <it>D. melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Akashi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>144</volume>
            <fpage>1297</fpage>
            <lpage>1307</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1207620</pubid>
                  <pubid idtype="pmpid">8913769</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>A neutral explanation for the correlation of diversity with recombination rates in humans.</p>
            </title>
            <aug>
               <au>
                  <snm>Hellmann</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ebersberger</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ptak</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Paabo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Przeworski</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2003</pubdate>
            <volume>72</volume>
            <fpage>1527</fpage>
            <lpage>1535</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1180312</pubid>
                  <pubid idtype="pmpid" link="fulltext">12740762</pubid>
                  <pubid idtype="doi">10.1086/375657</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Insertion/deletion and nucleotide polymorphism data reveal constraints in <it>Drosophila melanogaster </it>introns and intergenic regions.</p>
            </title>
            <aug>
               <au>
                  <snm>Ometto</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stephan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>De Lorenzo</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2005</pubdate>
            <volume>169</volume>
            <fpage>1521</fpage>
            <lpage>1527</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449560</pubid>
                  <pubid idtype="pmpid" link="fulltext">15654088</pubid>
                  <pubid idtype="doi">10.1534/genetics.104.037689</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mouchiroud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5688</fpage>
            <lpage>5692</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33274</pubid>
                  <pubid idtype="pmpid" link="fulltext">11320215</pubid>
                  <pubid idtype="doi">10.1073/pnas.091427698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Neutral effect of recombination on base composition in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Marais</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mouchiroud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Duret</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>2003</pubdate>
            <volume>81</volume>
            <fpage>79</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0016672302006079</pubid>
                  <pubid idtype="pmpid">12872909</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Interactions between natural selection, recombination and gene density in the genes of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hey</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kliman</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2002</pubdate>
            <volume>160</volume>
            <fpage>595</fpage>
            <lpage>608</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461979</pubid>
                  <pubid idtype="pmpid" link="fulltext">11861564</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Differences in crossover frequency distribution among three sibling species of <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>True</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Mercer</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Laurie</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>142</volume>
            <fpage>507</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1206984</pubid>
                  <pubid idtype="pmpid">8852849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Effect of strong directional selection on weakly selected mutations at linked sites: implication for synonymous codon usage.</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>286</fpage>
            <lpage>294</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh020</pubid>
                  <pubid idtype="pmpid" link="fulltext">14660698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A population genetic model for the evolution of synonymous codon usage: patterns and predictions.</p>
            </title>
            <aug>
               <au>
                  <snm>McVean</snm>
                  <fnm>GAT</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>1999</pubdate>
            <volume>74</volume>
            <fpage>145</fpage>
            <lpage>158</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1017/S0016672399003912</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Mutation accumulation in finite outbreeding and inbreeding populations.</p>
            </title>
            <aug>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Morgan</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>1993</pubdate>
            <volume>61</volume>
            <fpage>39</fpage>
            <lpage>56</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The effects of Hill-Robertson interference between weakly selected mutations on patterns of molecular evolution and variation.</p>
            </title>
            <aug>
               <au>
                  <snm>McVean</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>155</volume>
            <fpage>929</fpage>
            <lpage>944</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461092</pubid>
                  <pubid idtype="pmpid" link="fulltext">10835411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The effect of constraint on the rate of evolution in neutral models with biased mutation.</p>
            </title>
            <aug>
               <au>
                  <snm>Eyre-Walker</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1992</pubdate>
            <volume>131</volume>
            <fpage>233</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1204958</pubid>
                  <pubid idtype="pmpid">1592239</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Local recombination and mutation effects on molecular evolution in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Takano-Shimizu</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>153</volume>
            <fpage>1285</fpage>
            <lpage>1296</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1460815</pubid>
                  <pubid idtype="pmpid" link="fulltext">10545459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Ogurtsov</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>J Theor Biol</source>
            <pubdate>2006</pubdate>
            <volume>240</volume>
            <fpage>616</fpage>
            <lpage>626</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jtbi.2005.10.020</pubid>
                  <pubid idtype="pmpid" link="fulltext">16343547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>FlyBase: A database of the <it>Drosophila </it>genome</p>
            </title>
            <url>http://www.flybase.org</url>
            <note>Release 4</note>
         </bibl>
         <bibl id="B41">
            <title>
               <p>FlyMine: An integrated database for <it>Drosophila </it>and <it>Anopheles </it>genomics</p>
            </title>
            <url>http://www.flymine.org</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>DroSpeGe: Drosophila Species Genomes BLAST</p>
            </title>
            <url>http://insects.eugenes.org/species/blast</url>
         </bibl>
         <bibl id="B43">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>MAVID: constrained ancestral alignment of multiple sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Patcher</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>693</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383315</pubid>
                  <pubid idtype="pmpid" link="fulltext">15060012</pubid>
                  <pubid idtype="doi">10.1101/gr.1960404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>MCALIGN2: faster, accurate global pairwise alignment of non-coding DNA sequences based on explicit models of indel evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Keightley</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>292</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1534069</pubid>
                  <pubid idtype="pmpid" link="fulltext">16762073</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>PAML: a program package for phylogenetic analysis by maximum likelihood.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>555</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367129</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Gestimator</p>
            </title>
            <url>http://molpopgen.org/software/analysis/manpages/gestimator.1.html</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>A method for estimating the numbers of synonymous and nonsynonymous substitutions per site.</p>
            </title>
            <aug>
               <au>
                  <snm>Comeron</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1995</pubdate>
            <volume>41</volume>
            <fpage>1152</fpage>
            <lpage>1159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00173196</pubid>
                  <pubid idtype="pmpid">8587111</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>CodonW: Correspondence analysis of codon usage</p>
            </title>
            <url>http://codonw.sourceforge.net/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide substitutions.</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1980</pubdate>
            <volume>16</volume>
            <fpage>111</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01731581</pubid>
                  <pubid idtype="pmpid">7463489</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Background selection and patterns of genetic diversity in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genet Res</source>
            <pubdate>1996</pubdate>
            <volume>68</volume>
            <fpage>131</fpage>
            <lpage>149</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8940902</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

