<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-10-r106</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Draft genome sequence of the <it>Daphnia </it>pathogen <it>Octosporea bayeri</it>: insights into the gene content of a large microsporidian genome and a model for host-parasite interactions</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Corradi</snm>
               <fnm>Nicolas</fnm>
               <insr iid="I1"/>
               <email>ncorradi@interchange.ubc.ca</email>
            </au>
            <au id="A2">
               <snm>Haag</snm>
               <mi>L</mi>
               <fnm>Karen</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>Karen.Haag@ufrgs.br</email>
            </au>
            <au id="A3">
               <snm>Pombert</snm>
               <fnm>Jean-Fran&#231;ois</fnm>
               <insr iid="I1"/>
               <email>jpombert@interchange.ubc.ca</email>
            </au>
            <au id="A4" ce="yes">
               <snm>Ebert</snm>
               <fnm>Dieter</fnm>
               <insr iid="I2"/>
               <email>dieter.ebert@unibas.ch</email>
            </au>
            <au ca="yes" id="A5" ce="yes">
               <snm>Keeling</snm>
               <mi>J</mi>
               <fnm>Patrick</fnm>
               <insr iid="I1"/>
               <email>pkeeling@interchange.ubc.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Canadian Institute for Advanced Research, The Biodiversity Research Centre, University of British Columbia, University Boulevard, Vancouver, BC, V6T 1Z4, Canada</p>
            </ins>
            <ins id="I2">
               <p>Universit&#228;t Basel, Zoologisches Institut, Evolutionsbiologie, Vesalgasse, CH-4051 Basel, Switzerland</p>
            </ins>
            <ins id="I3">
               <p>Department of Genetics, UFRGS, Porto Alegre, RS 91501-970, Brazil</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>10</issue>
         <fpage>R106</fpage>
         <url>http://genomebiology.com/2009/10/10/R106</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2009-10-10-r106</pubid>
               <pubid idtype="pmpid">19807911</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>9</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>2</day>
               <month>9</month>
               <year>2009</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>6</day>
               <month>10</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>6</day>
               <month>10</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Corradi et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>The Octosporea bayeri  genome sequence</p>
      </shorttitle>
      <shortabs>
         <p> The draft genome sequence of Octosporea bayeri,  a microsporidian pathogen of Daphnia, provides insights into the content and evolution of a large microsporidian genome</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The highly compacted 2.9-Mb genome of <it>Encephalitozoon cuniculi </it>placed the microsporidia in the spotlight, encoding a mere 2,000 proteins and a highly reduced suite of biochemical pathways. This extreme level of reduction is not universal across the microsporidia, with genomes known to vary up to sixfold in size, suggesting that some genomes may harbor a gene content that is not as reduced as that of <it>Enc. cuniculi</it>. In this study, we present an in-depth survey of the large genome of <it>Octosporea bayeri</it>, a pathogen of <it>Daphnia magna</it>, with an estimated genome size of 24 Mb, in order to shed light on the organization and content of a large microsporidian genome.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using Illumina sequencing, 898 Mb of <it>O. bayeri </it>genome sequence was generated, resulting in 13.3 Mb of unique sequence. We annotated a total of 2,174 genes, of which 893 encodes proteins with assigned function. The gene density of the <it>O. bayeri </it>genome is very low on average, but also highly uneven, so gene-dense regions also occur. The data presented here suggest that the <it>O. bayeri </it>proteome is well represented in this analysis and is more complex that that of <it>Enc. cuniculi</it>. Functional annotation of <it>O. bayeri </it>proteins suggests that this species might be less biochemically dependent on its host for its metabolism than its more reduced relatives.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The combination of the data presented here, together with the imminent annotated genome of <it>Daphnia magna</it>, will provide a wealth of genetic and genomic tools to study host-parasite interactions in an interesting model for pathogenesis.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Microsporidia are extremely successful, highly adapted obligate intracellular parasites known to infect a wide range of animals, such as arthropods, fish, and mammals, including humans <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. These parasites are characterized by the presence of a highly specialized host invasion apparatus called the polar tube (or polar filament), which is used to penetrate and infect new host cells. Microsporidian cells significantly differ from other eukaryotes, as they lack conventional mitochondria and Golgi apparatus and harbor 70S instead of 80S ribosomes <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. These features were once taken to suggest that microsporidia represent a very ancient eukaryotic lineage <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, but recent advances in cell biology, genome sequencing, and phylogenetic reconstruction have all shown that all these apparently primitive features instead reflect an extreme state of reduction, perhaps a result of their obligate intracellular parasitic lifestyle. Instead, it is now widely acknowledged that microsporidia are, in fact, related to fungi, and have relict mitochondria (called mitosomes) <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, degenerated eukaryote-like ribosomal RNA subunits <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, and reduced genes and genomes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>The extremely reduced nature of microsporidian genomes has attracted attention since they were first noted at the end of the 1990s <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, culminating in 2001 with the completion of the first microsporidian genome from the mammalian parasite <it>Encephalitozoon cuniculi </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The <it>Enc. cuniculi </it>genome is extremely small, at only 2.9 Mb, and the 2,000 genes it encodes provided the first compelling evidence for a strong correlation between obligate intracellular parasitism and the loss of metabolically important genes in eukaryotes. Metabolic capabilities are indeed significantly reduced in <it>Enc. cuniculi</it>, and genes required for <it>de novo </it>biosynthesis of purine and pyrimidine nucleotides or those involved in the tricarboxylic acid cycle, fatty acid beta-oxidation, respiratory electron-transport chain and the F<sub>0</sub>F<sub>1</sub>-ATPase complex are completely absent from its genome. The reduction of several metabolic pathways in <it>Enc. cuniculi </it>implied that these parasites might be extremely dependent on their host for obtaining most of their metabolites and energy. For example, it has been indeed recently demonstrated that this parasite and its mitosomes both import ATP from its host via specific transporters <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>.</p>
         <p>In addition to a significant reduction in its metabolic capabilities, the genome of <it>Enc. cuniculi </it>is also very compact. Its genes are reduced in size and separated by remarkably short intergenic regions. This extreme compaction has impacted the process of transcription so that in the microsporidia <it>Enc. cuniculi </it>and <it>Antonospora locustae </it>a significant part of their mRNA transcripts has been found to overlap between adjacent genes <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Genome reduction has also apparently affected the rate of gene rearrangement, as conservation of gene order is strikingly high among microsporidia compared to what has been reported for other eukaryotes <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>.</p>
         <p>Since the completion of the <it>Enc. cuniculi </it>genome, new genomic data from other microsporidian parasites have been limited to two in-depth genome surveys from <it>Enterocytozoon bieneusi </it>and <it>Nosema ceranae </it><abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, a smaller survey from <it>A. locustae </it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and some very small surveys from various other species <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The deeper-sampled genomes of <it>Ent. bieneusi </it>and <it>A. locustae </it>show many similarities with that of <it>Enc. cuniculi </it>- all three genomes are compact and contain roughly the same number of genes and pathways - but this is perhaps not surprising because all three genomes are also relatively small (ranging from 2.9 to 6 Mb) and might not, therefore, represent all microsporidian genomes adequately.</p>
         <p>So how do larger microsporidian genomes compare with smaller ones? Does their large size reflect the presence of more genes and pathways or do they harbor the same genes but separated by much larger intergenic regions? These questions have been partly addressed with genome surveys from <it>Spraguea lophii </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, <it>Vittaforma cornea </it><abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, <it>Edhazardia aedis</it>, and <it>Brachiola algerae </it><abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, but because of their very low sequence coverage no conclusion can be drawn about their overall gene content and evolution. In the present study, we provide a 37&#215; sequence coverage of the large genome of the microsporidian <it>Octosporea bayeri</it>. <it>O. bayeri </it>is a parasite of the freshwater planktonic crusteacean <it>Daphnia magna </it><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Other <it>Daphnia </it>species have never been found to be infected. The parasite is both horizontally and vertically transmitted <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Vertical transmission occurs with 100% efficiency to the asexual (parthenogenetic) eggs of the host and with somewhat reduced efficiency to the sexual eggs. Horizontal transmission occurs after the host cadaver decomposes and environmental spores are released. Infection follows ingestion of spores by the filter feeding host. The parasite reduces host survival and fecundity. Its geographic distribution is limited to rock pool <it>D. magna </it>populations along the baltic Sea in Finland and Sweden <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and a single report from the Czech Republic.</p>
         <p>From our sequence survey, over 13 Mb of unique <it>O. bayeri </it>sequence data have been assembled and 2,174 ORFs have been identified, providing an excellent framework to characterize the overall gene content and structure of a large microsporidian genome, to compare it with its more reduced relatives and to increase the availability of genetic markers from this latter species. Consistent with small surveys from microsporidia with large genomes, the gene density of the <it>O. bayeri </it>genome is generally low but also highly variable. Most of the genes known in the <it>Enc. cuniculi </it>genome are also found in <it>O. bayeri</it>, but a number of other genes are also found that are apparently absent in other microsporidia. The functional distribution of the proteins significantly differed between <it>O. bayeri </it>and its more reduced relatives, suggesting the metabolic capacity and host dependency within the group is also variable. The wealth of genomic data from this parasite coupled with the annotation of the <it>Daphnia </it>genome should further increase the interest for this model of host-parasite interactions <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Gene content of the <it>O. bayeri </it>genome</p>
            </st>
            <p>Approximately 898 Mb of DNA sequence was obtained from shotgun and paired-end 35-bp reads with the Illumina Genome Analyzer&#8482;, resulting in an estimated 34.2 to 37.2&#215; coverage of the <it>O. bayeri </it>genome, which has been estimated to 24 Mb based on total number of bases sequenced divided by the average coverage. This calculation does not take into account the fact that some assembled contigs might represent several identical regions in the reference genome, and that unassembled reads might represent DNA sequences from other sources (that is, contaminants). Reads were assembled into 41,804 contigs representing a total of 13.3 Mb of sequence data (26% G+C), with only 20 contigs displaying evidence of contamination. The length of contigs averaged 320 bp (100 bp to a maximum of 8 kb). The small size of most contigs resulted in the incompleteness of most ORFs identified in this study and, on average, incomplete ORFs were found to encode 60% of the amino acids of their respective eukaryotic homologs. This explains why the complete (or almost complete) <it>O. bayeri </it>proteome has been identified within an assembly that is almost half the size of the estimated genome.</p>
            <p>A total of four rRNA genes, 37 tRNAs and 2,174 predicted protein-coding ORFs were identified (Table <tblr tid="T1">1</tblr>). Of the <it>O. bayeri </it>ORFs, 1,405 were found to have homologs in the <it>Enc. cuniculi </it>genome, representing about 70% of its annotated genes <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> (Additional data file 1). Over 93% of <it>Enc. cuniculi </it>proteins with assigned functions and 53% of its hypothetical proteins had clear homologs in the <it>O. bayeri </it>genome <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B33">33</abbr></abbrgrp>. Over 25% of <it>Enc. cuniculi </it>homologs identified are full length, while others were slightly truncated in the carboxy-terminal or amino-terminal regions, or both. Another 80 ORFs were identified that were found to have homologs in other organisms, but not <it>Enc. cuniculi</it>, 72 of which could be assigned to a functional category (Additional data file 2), the majority of which have highest similarities with fungal homologs, suggesting that they are ancestral within the lineage and not recently introduced into the <it>O. bayeri </it>genome. The remaining 689 <it>O. bayeri </it>putative ORFs (of at least 200 amino acids) returned no significant hits in BLAST homology searches against the National Center for Biotechnology Information (NCBI) non-redundant database. However, 25 of these showed significant similarities with hypothetical proteins from the <it>A. locustae </it>database, indicating that <it>O. bayeri </it>and <it>A. locustae </it>share a number of hypothetical proteins that are absent in <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. It is also important to note that a large proportion of microsporidian hypothetical proteins have been found to be smaller than 200 amino acids <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, so the actual number of ORFs could be over 25% higher than what we report here, perhaps in the range of, or higher than, what has been recently reported for <it>N. ceranae </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>
                     <b>General characteristics of <it>O. bayeri </it>and other microsporidian genomes</b>
                  </p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>General characteristics</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>O. bayeri</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>Enc. cuniculi</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>Ent. Bieneusi</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of chromosomes</p>
                     </c>
                     <c ca="center">
                        <p>NA</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genome size (Mb)</p>
                     </c>
                     <c ca="center">
                        <p>&#8804;24.2*</p>
                     </c>
                     <c ca="center">
                        <p>2.9</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Assembled Mb</p>
                     </c>
                     <c ca="center">
                        <p>13.3</p>
                     </c>
                     <c ca="center">
                        <p>2.5</p>
                     </c>
                     <c ca="center">
                        <p>3.86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Genome coverage (%)</p>
                     </c>
                     <c ca="center">
                        <p>55<sup>&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>86</p>
                     </c>
                     <c ca="center">
                        <p>64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G+C content (%)</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>47</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Gene density</p>
                     </c>
                     <c ca="center">
                        <p>1 per 4,593 bases<sup>&#8225;</sup></p>
                     </c>
                     <c ca="center">
                        <p>1 per 1,025 bases</p>
                     </c>
                     <c ca="center">
                        <p>1 per 1,148 bases</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mean intergenic region (bp)</p>
                     </c>
                     <c ca="center">
                        <p>429<sup>&#167;</sup></p>
                     </c>
                     <c ca="center">
                        <p>129</p>
                     </c>
                     <c ca="center">
                        <p>127</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Presence of overlapping genes</p>
                     </c>
                     <c ca="center">
                        <p>No</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                     <c ca="center">
                        <p>Yes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of SSU-LSU rRNA genes</p>
                     </c>
                     <c ca="center">
                        <p>2<sup>&#182;</sup></p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>Unkown<sup>&#165;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of 5S rRNA genes</p>
                     </c>
                     <c ca="center">
                        <p>2<sup>&#182;</sup></p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>Unkown<sup>&#165;</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of tRNAs</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of tRNA synthetases</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of tRNA introns (size in bp)</p>
                     </c>
                     <c ca="center">
                        <p>&#8805;1 (50)</p>
                     </c>
                     <c ca="center">
                        <p>2 (16, 42)</p>
                     </c>
                     <c ca="center">
                        <p>2 (13, 30)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of splicesomal introns (size in bp)</p>
                     </c>
                     <c ca="center">
                        <p>&#8805;6 (24-33)</p>
                     </c>
                     <c ca="center">
                        <p>13 (23-52)</p>
                     </c>
                     <c ca="center">
                        <p>19 (36-306)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of predicted ORFs</p>
                     </c>
                     <c ca="center">
                        <p>2,174<sup>#</sup></p>
                     </c>
                     <c ca="center">
                        <p>1,997</p>
                     </c>
                     <c ca="center">
                        <p>3,804**</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of ORFs assigned to functional categories</p>
                     </c>
                     <c ca="center">
                        <p>894 (41%)</p>
                     </c>
                     <c ca="center">
                        <p>884 (44%)</p>
                     </c>
                     <c ca="center">
                        <p>669 (39%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mean size of CDS (bp)</p>
                     </c>
                     <c ca="center">
                        <p>1,056<sup>&#8224;&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>1,017<sup>&#8224;&#8224;</sup></p>
                     </c>
                     <c ca="center">
                        <p>1,002<sup>&#8224;&#8224;</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*The genome size has been estimated using total number of bases sequenced divided by the average coverage. <sup>&#8224;</sup>Based on the 24.2-Mb estimated genome size. <sup>&#8225;</sup>Based on the 200 largest contigs. <sup>&#167;</sup>Based on contigs (n = 23) in which two or more ORFs of at least 100 amino acids have been identified. <sup>&#182;</sup>Only two contigs harboring an SSU-5.8S-LSU gene array have been identified in the <it>O. bayeri </it>genome survey. <sup>&#165;</sup>Based on <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. <sup>#</sup>Includes ORFs with assigned functions, homologs of <it>Enc. cuniculi </it>hypothetical proteins, and hypothetical proteins of at least 200 amino acids identified in the <it>O. bayeri </it>genome. **The <it>Ent. bieneusi </it>genome has been subjected to several segmental duplications and the number of ORFs identified in that study includes a very large number of duplicates <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This number should, therefore, not be taken into account to determine the haploid coding capacity of this species. <sup>&#8224;&#8224;</sup>Based on 95 and 63 complete <it>Enc. cuniculi </it>and <it>Ent. bieneusi </it>orthologs, respectively. CDS, coding sequence.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Functional categories represented in <it>O. bayeri</it></p>
            </st>
            <p>All identified <it>O. bayeri </it>ORFs were assigned to the 11 functional categories listed in <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B33">33</abbr></abbrgrp> (Figure <figr fid="F1">1</figr>; Additional data file 3). Such comparison is currently unavailable for <it>N. ceranae </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. <it>O. bayeri </it>ORFs are well distributed among the functional categories, yet display differences when compared to <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. Specifically, five categories (metabolism, energy production, cell growth and DNA synthesis, transcription and protein destination) are more represented in <it>O. bayeri </it>than in <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>, whereas four other categories (transport facilitation, intracellular transport, cellular organization - biogenesis, and cell rescue) are reduced in number in <it>O. bayeri</it>. Within each functional category, several pathways stood out as being particularly different among the three species. For instance, genes involved in lipid and fatty acid metabolism and glycosylation were better represented in <it>O. bayeri </it>(37 and 12 proteins, respectively) than either <it>Enc. cuniculi </it>(29 and 7 proteins) or <it>Ent. bieneusi </it>(8 and 5 proteins), while proteins involved in the translocation of various substrates across membranes are underrepresented in <it>O. bayeri </it>(Figure <figr fid="F2">2</figr>). Finally, in contrast to what has been reported for other species with smaller genomes <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>, no evidence for gene or segmental genome duplication events has been identified in the present survey.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Distribution of <it>O. bayeri </it>(blue), <it>Enc. cuniculi </it>(yellow) and <it>Ent. bieneusi </it>(red) proteins among functional categories</p>
               </caption>
               <text>
                  <p><b>Distribution of <it>O. bayeri </it>(blue), <it>Enc. cuniculi </it>(yellow) and <it>Ent. bieneusi </it>(red) proteins among functional categories</b>. The ordinate represents the number of ORFs assigned to the corresponding category. Each of the <it>O. bayeri </it>proteins was assigned to only one of eleven functional categories listed in <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B33">33</abbr></abbrgrp>. The corresponding gene list is presented in the online version of this manuscript (Additional data file 3). *Based on a 4&#215; sequence coverage <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2009-10-10-r106-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Examples of sub-functional categories showing sharp differences in distribution between <it>O. bayeri </it>(blue), <it>Enc. cuniculi </it>(yellow) and <it>Ent. bieneusi </it>(red) proteins</p>
               </caption>
               <text>
                  <p><b>Examples of sub-functional categories showing sharp differences in distribution between <it>O. bayeri </it>(blue), <it>Enc. cuniculi </it>(yellow) and <it>Ent. bieneusi </it>(red) proteins</b>. <b>(a) </b>Functional sub-categories more highly represented in <it>O. bayeri </it>than in <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. <b>(b) </b>Functional sub-categories less represented in <it>O. bayeri </it>than in <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. *Based on a 4&#215; sequence coverage <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> (that is, almost 10 times lower than the present genome draft), suggesting a number of these transporters may yet be identified in the <it>Ent. bieneusi </it>genome survey.</p>
               </text>
               <graphic file="gb-2009-10-10-r106-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogeny of <it>O. bayeri </it>and evolution of the ATP transporters in the microsporidia</p>
            </st>
            <p><it>O. bayeri </it>was put into a phylogenetic context by comparing the amino acid sequences from its newly identified alpha- and beta-tubulins with those of other microsporidia (Figure <figr fid="F3">3a</figr>). Our tree is consistent with the most recently reported using the same amino acid sequences <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Specifically, <it>Nosema </it>and <it>Encephalitozoon </it>are sisters to one another, as are <it>Antonospora </it>and <it>Brachiola</it>. The remaining species all branch more deeply, and <it>O. bayeri </it>is in this tree basal to all other microsporidian species from which large genome sequence data are presently available. Only a single ATP transporter protein was identified in <it>O. bayeri</it>, and phylogenetic analyses of all presently known microsporidian members of this family show the <it>O. bayeri </it>protein clustering with strong support at the base of a clade including <it>Antonospora </it>and <it>Brachiola </it>homologues, all of which are sister to the <it>Encephalitozoon</it>/<it>Enterocytozoon</it>/<it>Nosema </it>clade (Figure <figr fid="F3">3b</figr>). This is not consistent with the rRNA tree, and might represent a mis-rooting of either tree, or ancient paralogy of the ATP transporters.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic relationships of microsporidia and their ATP transporters</p>
               </caption>
               <text>
                  <p><b>Phylogenetic relationships of microsporidia and their ATP transporters</b>. <b>(a) </b>Phylogenetic reconstruction of the microsporidian phylogeny based on available &#945;- and &#946;-tubulin amino acid sequences and gains of ATP and ABC transporters. Known genome sizes and number of transporters are shown. <it>Ent. bieneusi </it>tubulins cluster as a sister group to the clade including <it>Encephalitozoon </it>and <it>Nosema </it>species; this position is represented by a black square. <b>(b) </b>Evolution of the ATP transporter family based on available amino acid sequences from a range of microsporidian parasites. 1, Putative ancestral duplication of ATP transporters within the microsporidia following lateral gene transfer from prokaryotes. 2, A putative secondary gene duplication occurred in the more diverged genera, <it>Nosema</it>, <it>Enterocytozoon </it>and <it>Encephalitozoon</it>. 3, Supported lineage including all three diverged genera. 4, Species-specific duplication of an ATP transporter. *Data from NC, JFP <it>et al</it>., unpublished.</p>
               </text>
               <graphic file="gb-2009-10-10-r106-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p><it>O. bayeri </it>introns</p>
            </st>
            <p>Only 13 introns have been annotated in the <it>Enc. cuniculi </it>genome at present, and we identified a total of 6 introns in the present survey, all of which are homologous to introns reported in <it>Enc. cuniculi </it>ribosomal protein genes (L19, L27a, L37a, L37, L39, S26) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. All the <it>O. bayeri </it>introns identified here are located within or close to the start codon, which is consistent with the introns in <it>Enc. cuniculi </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, <it>Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and cryptomonad nucleomorphs <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The retention of the majority of these introns leads to frame-shifts and termination codons, while their removal leads to a complete ORF that is highly conserved with homologs from other eukaryotes. The intron sequences are available with the online version of this paper (Additional data file 4).</p>
         </sec>
         <sec>
            <st>
               <p><it>O. bayeri</it>-specific large amino acid insertions</p>
            </st>
            <p>A number of large insertions ranging from 15 to 57 amino acids were identified in 14 conserved proteins in <it>O. bayeri </it>(O-sialoglycoprotein endopeptidase, 3-hydroxy-3-methylglutaryl CoA reductase, 3-ketoacyl CoA thiolase, &#945; &#945;-trehalase precursor, choline phosphate cytidyltranferase, transcription factor of the E2F/DP family, tubulin &#947;-chain, kinesin-like protein, pyruvate dehydrogenase E1 component subunit &#945;, replication factor C, T complex protein 1 &#949; subunit, threonyl tRNA synthetase, and translation elongation factor 2). These insertions are all in-frame and in most cases are surrounded by highly conserved amino acid motifs, although they are not generally located within functionally important domains (Additional data file 5). RT-PCR confirmed that none of these inserts are removed from mRNA and so do not represent spliceosomal introns (data not shown). Similar insertions have been previously reported in the parasites <it>Plasmodium berghei </it>and <it>Toxoplasma gondii </it><abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Length of <it>O. bayeri </it>proteins</p>
            </st>
            <p>The majority of <it>O. bayeri </it>proteins were found to be larger than homologs from <it>Enc. cuniculi </it>(69%) and <it>Ent. bieneusi </it>(65%) (Figure <figr fid="F4">4</figr>). However, the opposite trend was identified when <it>O. bayeri </it>genes were compared with other fungal lineages, in which case the majority of <it>O. bayeri </it>proteins (75% on average) were found to be smaller than homologs from the other fungal lineages, even when the fungal species compared had a smaller genome than <it>O. bayeri</it>. The difference in the number of amino acids was found to be significantly larger between <it>O. bayeri </it>and other fungal lineages (14% smaller on average) than between <it>O. bayeri </it>and other microsporidia (3% larger on average) (Figure <figr fid="F4">4</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Differences in gene length among microsporidia and their fungal relatives</p>
               </caption>
               <text>
                  <p><b>Differences in gene length among microsporidia and their fungal relatives</b>. <b>(a) </b>Comparison of the length (in amino acids) of <it>O. bayeri </it>proteins to orthologs from <it>Enc. cuniculi</it>, <it>Ent. bieneusi</it>, <it>S. cerevisiae</it>, <it>U. maydis</it>, <it>B. dendrobatidis </it>and <it>R. oryzae</it>. In general, <it>O. bayeri </it>proteins are longer than microsporidian orthologues, but shorter than fungal orthologues. Vertical arrows indicate the average reduction or increase in protein size compared to <it>O. bayeri</it>. <b>(b) </b>Specific examples of length variation between orthologs from <it>O. bayeri</it>, <it>Enc. cuniculi</it>, <it>Ent. bieneusi </it>and <it>S. cerevisiae</it>.</p>
               </text>
               <graphic file="gb-2009-10-10-r106-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Gene density and synteny</p>
            </st>
            <p>Gene density and synteny in <it>O. bayeri </it>were examined by annotating all ORFs of at least 100 amino acids on the 200 largest contigs (average length of 2,795 bp). In more than half of these contigs, no putative ORF could be identified. One contig was found to harbor three putative ORFs, whereas 72 and 22 contigs harbored one or two recognizable ORFs, respectively. No correlation between the length of the contigs and the number of ORFs could be identified (Figure <figr fid="F5">5a</figr>). Based on these contigs, gene density was calculated to be 1 gene every 4,593 bases. However, when two or more ORFs were identified on the same contig the average intergenic region was calculated to be only 429 bp, suggesting the gene density is highly variable across the genome. Conservation in gene order could be identified in only two cases, representing 8% of all the gene pairs identified (Figure <figr fid="F5">5b</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Variation in gene density across the <it>O. bayeri </it>genome</p>
               </caption>
               <text>
                  <p><b>Variation in gene density across the <it>O. bayeri </it>genome</b>. <b>(a) </b>Identification and distribution of ORFs (of at least 100 amino acids) among the largest <it>O. bayeri </it>contigs. Only the 100 largest contigs are shown here for convenience. Yellow dots represent contigs in which no ORF could be annotated. Blue and red arrows and dots represent contigs harboring two or one ORF, respectively. <b>(b) </b>Two cases of gene order conservation between <it>O. bayeri </it>and <it>Enc. cuniculi</it>.</p>
               </text>
               <graphic file="gb-2009-10-10-r106-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Repeated elements</p>
            </st>
            <p>The large amount of small, non-coding DNA sequences identified in this study could reflect the presence of highly repeated sequences in the <it>O. bayeri </it>genome. This possibility was investigated by measuring the sequence coverage of each contig and identifying a possible correlation with their length. As suspected, the contigs with highest coverage are also the smallest. Specifically, all contigs with a coverage over 200&#215; are smaller than 300 bp, suggesting these are highly repetitive (Additional data file 6).</p>
            <p>The presence of repeated elements was also investigated among all contigs. A total of 74 <it>O. bayeri </it>contigs harbor DNA segments homologous to known fungal repeated elements (Additional data file 7). The Mariner, Gypsy and Copia classes of repeated elements are the most frequently observed in <it>O. bayeri</it>. The <it>O. bayeri </it>contigs also display DNA strings that are repeated in tandem, with strings repeated at least twice identified in 1,345 contigs (data not shown). However, these tandem repeats are usually short and rarely exceed ten consecutive repeated strings. Putative stem-loop structures with AT-rich palindromic stems have been identified in a number of contigs, although the primary sequences of these potential structures, aside from their biased nucleotide composition, do not appear to be repeated <it>per se</it>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Architecture of a large microsporidian genome</p>
            </st>
            <p>The currently available microsporidian genomes best represent the lower limits in the spectrum of genome sizes, not only for Eukaryotes as a whole, but also microsporidia. The single exception to this is <it>N. ceranea</it>, whose genome is more intermediate in size, but our knowledge of microsporidian genomes is still strongly biased, which might hinder the elucidation of the evolution of this poorly understood group. Our present survey of the <it>O. bayeri </it>genome is the first deep survey of a larger microsporidian genome, and estimates from sequence coverage suggest it may even be the largest known microsporidian genome (at 24 MB). What accounts for this variation in genome size and which features of microsporidian genomes have to be reconsidered after adding a genome from the other end of the genome size spectrum? There are several answers to these questions.</p>
            <p>Genomes might be larger due to the presence of more genes, which could be due to whole or partial genome duplications, repetitive sequences, expansion of gene families, or the retention of a greater diversity of genes in general. They might also have about the same complement of genes but have larger intergenic regions, more or larger introns, more transposable elements, and so on. Previous small-scale surveys of microsporidia with larger genomes have demonstrated a higher proportion of non-coding DNA, but reveal nothing about the overall organization of the genome because the fragments sampled were small and only a tiny fraction of the genome was characterized in any one case <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B38">38</abbr></abbrgrp>. The data presented here provide additional evidence that large microsporidian genomes have a very low gene density, in this case up to a fivefold decrease compared to species with smaller genomes, but also provide information on the organization and structure of a large genome in this group. <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. First, gene density is not homogeneous across the genome, but is instead a sum of long stretches (&#8805;5.5 kb) of non-coding sequences, as well as regions where genes are separated by only 45 bp, which is even shorter than most intergenic regions found in <it>Enc. cuniculi</it>, <it>Ent. bieneusi </it>and <it>A. locustae</it>. Second, it now seems obvious that gene density alone accounts for most of the variation in genome size between different microsporidian species, although we did find numerous genes in <it>O. bayeri </it>that are absent in <it>Enc. cuniculi </it>(see below).</p>
            <p>Smaller microsporidian genomes have also been noted as sharing a high conservation of gene order across distantly related species, which has been attributed to compaction <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B34">34</abbr></abbrgrp>. Despite the overall low gene density, we found 8% of all annotated gene pairs (equating to 2 out of 24 gene pairs) were conserved in order between <it>O. bayeri </it>and <it>Enc. cuniculi</it>. This is not very different to what is found in other microsporidia <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, and close to the expectation for closely related fungi <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. It is interesting that both cases described here involve pairs of genes that are unusually close to one another (423 and 15 bp apart). This may reflect the role of compaction in conservation of gene order, but it might also be a sampling bias since closely spaced genes are more likely to be found on the same contig in our survey, which is based on contigs, rather than a complete genome.</p>
            <p>The large size of the <it>O. bayeri </it>genome does not reflect extensive and segmental gene duplication. However, numerous non-coding and small genomic repetitions could have played a role in its expansion. The origin of these repetitive regions is difficult to assess without a better genome assembly. Because these do not encode known functional proteins, nor harbor potential ORFs, however, it is possible that these represent telomeric and sub-telomeric regions of the <it>O. bayeri </it>genome. If this is the case, genome size variation in microsporidia could also be a consequence of variation in the size of telomeres. This prediction is supported by the recent acquisition in our laboratory of genome data from other, much smaller genomes, showing that the vast majority of unassembled Illumina&#8482; reads belong indeed to telomeric regions (NC, JFP and PJK, unpublished).</p>
         </sec>
         <sec>
            <st>
               <p>Length of microsporidian genes and size of the protein network</p>
            </st>
            <p>Microsporidian proteins are known to be shorter in general than orthologs in other organisms, a characteristic that has been attributed to the reduction in gene content and, by extension, protein networks in these cells <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B48">48</abbr></abbrgrp>. In keeping with this, the majority of the <it>O. bayeri </it>proteins are shorter than orthologs from <it>S. cerevisiae </it>(approximately 5,570 genes, size 12 Mb), <it>Ustilago maydis </it>(approximately 6,500 genes, 20 Mb genome), <it>Batrachytridium dendrobatidis </it>(approximately 8,700 genes, 24 Mb genome) and <it>Rhizopus oryzae </it>(approximately 17,459 genes, 35 Mb genome). Interestingly, however, <it>O. bayeri </it>proteins are also larger than orthologues found in <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. Consequently, the <it>O. bayeri </it>genome provides additional evidence that microsporidian proteins are shorter than their homologs from other fungal phyla, but also that their size correlates better with the coding capacity rather than the size of the genome in which they are found.</p>
         </sec>
         <sec>
            <st>
               <p>Evidence for the progressive loss of ancestral genes throughout the evolution of the microsporidian lineage</p>
            </st>
            <p>Prior to this study, the vast majority of genes with predicted functions found in diverse microsporidia were also found in <it>Enc. cuniculi </it><abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B38">38</abbr></abbrgrp>. Three exceptions were found in <it>A. locustae </it><abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp> and a single one was found in <it>Ent. bieneusi </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This suggested that all members of this group share a common core set of genes that have been retained after massive gene losses occurred in their ancestor, resulting in only a small degree of variability in gene content. This prediction was based, however, on a very low coverage for two large microsporidian genomes <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The <it>O. bayeri </it>genome and its evolutionary position within the group suggest that perhaps early microsporidians possessed many more genes with predicted functions than previously thought. It now seems likely that there was a large reduction in the ancestral proteome following the origin of microsporidia, but this was also followed by lineage-specific reductions and expansions in some branches of the microsporidian tree. The total number of ORFs identified in <it>O. bayeri </it>also suggests an overall coding capacity that is at least 10% larger than that of <it>Enc. cuniculi</it>. This is a conservative estimate based on the annotation of <it>O. bayeri </it>hypothetical proteins of at least 200 amino acids. Since it is known that <it>Enc. cuniculi </it>proteins shorter than 200 amino acids make up over a quarter of its total coding capacity <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, the overall coding capacity of <it>O. bayeri </it>is almost certainly greater still. It has been suggested that both <it>N. ceranae </it>and <it>Enc. bieneusi </it>genomes contain genes that are absent in <it>Enc. cuniculi</it>; however, the novel sequences in these genomes are apparently all hypothetical ORFs or transposable-like elements, and not genes with predicted functions. In these cases we cannot rule out that these are rapidly evolving genes with unrecognized homologues in other microsporidian genomes, or in some cases are not functional genes at all. In contrast, the genome of <it>O. bayeri </it>contains at least 80 genes with predicted functions and recognizable homologues in other organisms, but which are absent in <it>Enc. cuniculi</it>. This confirms that the proteome complexity of the ancestral microsporidian was greater than that seen in <it>Enc. cuniculi </it>(and other current taxa for which genome level surveys have been conducted to date), and suggests that further genome sequencing, especially of putatively deep-branching taxa, should reveal still more genes previously unseen in microsporidia. It is also formally possible that many genes were acquired relatively recently in the lineage leading to <it>O. bayeri </it>by lateral gene transfer, which has indeed been observed in other microsporidia <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. However, this does not seem likely for all these genes given the rarity of transferred genes in other microsporidia, and especially given that the most highly conserved cases are all notably similar to homologs in fungi, suggesting they are more likely ancestral to the microsporidia. This implies that much more proteome diversity awaits discovery as more microsporidian genomes are characterized.</p>
         </sec>
         <sec>
            <st>
               <p>Functional importance of <it>O. bayeri </it>proteins absent in other microsporidia</p>
            </st>
            <p>Perhaps the most intriguing finding of the present study is the identification of 80 <it>O. bayeri </it>proteins sharing homology with eukaryotes but not with <it>Enc. cuniculi</it>. Not surprisingly, these include eight transposable elements, some of which showed a high similarity to those reported from <it>Nosema bombycis </it><abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Transposable elements are absent in the most reduced microsporidian genomes <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B33">33</abbr></abbrgrp>, but are commonly reported in the ones that are larger and less compact <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B52">52</abbr></abbrgrp>, so in this case our study simply corroborates previous findings.</p>
            <p>The remainder of these eukaryotic proteins stood out for being involved in important functional processes. In total, 14 are involved in transcriptional processes, including RNA polymerases or proteins involved in the transcription of tRNAs, while 19 are part of different metabolic pathways such as the metabolism of fatty acids and lipids and nucleotide metabolism. A whole set of proteins involved in the modification of proteins and three cation transporters are also present in <it>O. bayeri </it>but absent in <it>Enc. cuniculi</it>. The identification of these eukaryotic proteins is important as it shows that the <it>O. bayeri </it>proteome is more complex than that of <it>Enc. cuniculi </it>or <it>Ent. bieneusi</it>. Moreover, most of these proteins have highest similarities with homolgs from fungal lineages, suggesting they arose through common descent rather than by their recent incorporation into the genome by lateral gene transfer.</p>
         </sec>
         <sec>
            <st>
               <p>Do <it>O. bayeri </it>protein categories reflect a lesser host dependency?</p>
            </st>
            <p>Aside from the set of <it>O. bayeri </it>proteins that are absent in <it>Enc. cuniculi</it>, the overall number of proteins with assigned functions is generally similar in the two genomes. This does not imply that both species encode the same set of identifiable eukaryotic homologs and, indeed, we observed several differences in the functional distribution of their proteins. For instance, genes involved in lipid and fatty acid metabolism are at least 25% more common in <it>O. bayeri </it>than in <it>Enc. cuniculi </it>or <it>Ent. bieneusi</it>. Similarly, <it>O. bayeri </it>harbors two additional genes involved in energy production compared to <it>Enc. cuniculi</it>, a trehalose synthase and an alternative oxidase.<it>O. bayeri </it>also harbors almost twice the number of proteins involved in glycosylation compared to <it>Enc. cuniculi</it>, suggesting a greater capacity to modify proteins, and perhaps the presence of a less simplified endoplasmic reticulum and Golgi apparatus compared to other microsporidia <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>.</p>
            <p>The presence of a larger number of genes for metabolic and energy generating proteins in <it>O. bayeri </it>does not by itself necessarily mean that this species is less dependent on its host for energy than are other microsporidia; however, we also observed a marked underrepresentation of proteins involved in stealing metabolites from the host. At the extreme, only one-quarter of the ATP transporters present in other microsporidia and around half of the <it>Enc. cuniculi </it>homologs of amino acid and sugar transporters were found in <it>O. bayeri</it>. <it>Octosporea </it>also appears to harbor a reduced set of ABC transporters compared to both <it>Enc. cuniculi </it>and <it>Ent. bieneusi</it>. Taken together, this implies that <it>O. bayeri </it>has a broader metabolic repertoire than other microsporidia while at the same time a reduced capability to derive metabolic products and energy from its host, both of which suggest it is less host-dependent than other microsporidia with smaller genomes.</p>
            <p>The phylogenetic placement of <it>O. bayeri </it>is also consistent with the idea that host dependency evolved hand in hand with reduction in genome size and hyper-adaptation for intracellular parasitism. Indeed, <it>O. bayeri </it>clusters at a basal position in the microsporidian phylogeny, in the proximity of other species characterized by large genomes, and the only ATP transporter identified from this species was also found to be a basal representative of the gene family. If both phylogenies depict the correct evolutionary relationships within the microsporidia, then the ancestral genome of microsporidia was almost certainly large, complex, and encoded few transporters. Certainly, genome surveys of other basal representatives of the group such as <it>Glugea plecoglossi </it>or <it>Trachipleistophora hominis </it>would provide decisive evidence in support or against the evolution of reduced microsporidian genomes from larger and complex relatives. This certainly warrants further need for investigating the genomics of these highly adapted and successful parasites.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Not all microsporida are characterized by small and highly reduced genomes. Here we demonstrate that the proteome complexity can vary greatly across the different species of the group, and that a larger genome size could be a good predictor of increased genomic complexity and reduced host dependency in microsporidia.</p>
         <p>Since a microsporidian genome has now been surveyed with 454&#8482; (<it>N. ceranae </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>) and Illumina&#8482; sequencing technology (this study), it might be interesting to compare the results. The 454&#8482; <it>de novo </it>genome assembly of <it>N. ceranae </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp> resulted in lower overall sequence coverage, but an assembly of larger contigs, on average, due to the longer sequence reads. However, the Illumina&#8482; methodology used to survey <it>O. bayeri </it>required substantially less high molecular weight DNA - in our case only 100 ng of sheared DNA. The downside of very short reads (35 bp) was mostly offset by the deep sequence coverage, allowing a detailed analysis of the coding capacity of the <it>O. bayeri </it>genome, but not of its structure (for example, conservation of gene order). Moreover, the small quantity of DNA required opens the door to genomic analyses from a broad range of uncultivatable organisms from which only a handful of contaminant-free DNA can be extracted.</p>
         <p>Finally, an important goal of the present study was to gather a large amount of genome sequence information from <it>O. bayeri </it>so that it may complement the soon-to-be annotated genome of its exclusive host, the crustacean <it>D. magna</it>. These two species represent an excellent and well-recognized model to study host-parasite interactions <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. The complementary nature of both genomic datasets will therefore form a great study system and provide a unique opportunity to further expand this specific field of evolutionary ecology into the post-genomic era</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>DNA and RNA extraction and DNA sequencing</p>
            </st>
            <p>Total RNA and genomic DNA from <it>O. bayeri </it>(isolate OER 3-3 from the Island Oeren in the Tv&#228;rminne archipelago, southwestern Finland) were obtained from purified spores isolated from a laboratory culture of infected <it>D. magna </it>hosts (University of Basel, Switzerland). A total of 100 ng of genomic DNA was sequenced with single and paired-end 35-bp reads on the Illumina&#8482; Genome Analyzer from Solexa (San Diego, CA, USA) by FASTERIS SA (Geneva, Switzerland). Reads were assembled using EDENA version 2.1.1, Velvet version 0.6.03 and ELAND version GAPipeline-1.0rc4 programs. This whole genome shotgun project has been deposited at GenBank under project accession [GenBank:<ext-link ext-link-id="ACSZ00000000" ext-link-type="gen">ACSZ00000000</ext-link>]. The version described in this paper is the first version [Genbank:<ext-link ext-link-id="ACSZ01000000" ext-link-type="gen">ACSZ01000000</ext-link>].</p>
         </sec>
         <sec>
            <st>
               <p>Identification of <it>O. bayeri </it>homologs present in the <it>Enc. cuniculi </it>genome</p>
            </st>
            <p>The <it>O. bayeri </it>homologs that are present in the <it>Enc. cuniculi </it>genome were identified by BLAST homology searches <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> against the complete <it>Enc. cuniculi </it>genome using the NCBI BLASTALL suite. First, TBLASTX searches were performed under a cutoff <it>E</it>-value (<it>E </it>&#8804; 1E-10) against our local <it>Enc. cuniculi </it>database, then the <it>Enc. cuniculi </it>genes that were not found in <it>O. bayeri </it>were searched for using TBLASTX against the <it>O. bayeri </it>contigs. The <it>O. bayeri </it>tRNAs and tRNA introns identified using tRNAscan-SE and default parameters <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> were searched for in the <it>Enc. cuniculi </it>genome manually.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of <it>O. bayeri </it>eukaryotic homologs that are absent in <it>Enc. cuniculi</it></p>
            </st>
            <p>The contigs sharing no similarities in TBLASTX searches (<it>E </it>> 1E-3) with the <it>Enc. cuniculi </it>genome have been annotated for potential ORFs using the program GETORF from the EMBOSS package <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. Eukaryotic homologs were identified by BLASTP searches (<it>E </it>&#8804; 1E-10) against a local copy of the NCBI non-redundant database using the NCBI BLASTALL suite. Following the BLASTP procedure, TBLASTX searches on contigs harboring ORFs that retrieved significant BLASTP hits were performed for further validation. The resulting ORFs were assigned to functional categories using the Kyoto Encyclopedia of Genes and Genomes (KEGG) <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, Pfam <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and UniProt <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> databases (Additional data file 2).</p>
         </sec>
         <sec>
            <st>
               <p>Identification of putative <it>O. bayeri</it>-specific hypothetical proteins</p>
            </st>
            <p>ORFs of at least 200 amino acids that did not retrieve significant homology in BLAST searches against the <it>Enc. cuniculi </it>genome or the NCBI non-redundant database were queried against the genome survey of <it>A. locustae </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp> using TBLASTP searches (<it>E </it>&#8804; 1E-10) to identify potential hypothetical proteins of microsporidian origin. Potential functions for these ORFs were also searched for using the KEGG <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, Pfam <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and UniProt <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> databases. ORFs of at least 200 amino acids that showed no homology in any of these searches were considered <it>O. bayeri</it>-specific putative proteins.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic reconstruction</p>
            </st>
            <p>A total of 13 &#945;- and &#946;-tubulin amino acid sequences have been identified from a range of microsporidian species and used to reconstruct their phylogenetic relationships, as they represent the most conserved and widely sampled proteins within the group and have been successfully used in the past for similar purposes <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. <it>Ent. bieneusi </it>tubulins have been discarded from the present phylogeny because of their extreme amino acid divergence, resulting in its unsupported positioning within the tree and in an overall reduction in the statistical support for all other phylogenetic clades. Two zygomycetes have been used as outgroups as this phylum has been proposed to represent the most recent fungal common ancestor of microsporidia <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B22">22</abbr></abbrgrp>. The &#945;- and &#946;-tubulin amino acid sequences were aligned using Muscle v3.7 <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> and the most conserved regions selected using Gblocks 0.91b <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. The microsporidia phylogeny was reconstructed using concatenated &#945;- and &#946;-tubulin amino acid sequences and MrBayes v 3.1.2 <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> with six General Time Reversible (GTR) types of substitutions, Dayoff acid substitution model and invariable plus gamma rate variations across sites. The Markov chain Monte Carlo search was run for 10,000 generations, sampling the Markov chain every 10 generations, and 250 were discarded as 'burn-in'. The relationships among microsporidia ATP transporters were studied in parallel using amino acid sequences retrieved from public databases and the parameters explained above.</p>
         </sec>
         <sec>
            <st>
               <p>Introns, gene density, and gene length</p>
            </st>
            <p>The <it>O. bayeri </it>ORFs with assigned functions were screened for potential frameshit mutations caused by the potential presence of introns, with introns previously reported in <it>Enc. cuniculi </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp> searched for manually. Gene density in the <it>O. bayeri </it>genome was determined by annotating ORFs of at least 100 amino acids along the 200 largest contigs used in this study. A number of complete <it>O. bayeri </it>proteins have been compared against orthologs from <it>Enc. cuniculi</it>, <it>Ent. bieneusi</it>, <it>S. cerevisiae</it>, <it>Neurospora crassa</it>, <it>U. maydis</it>, <it>B. dendrobatidis </it>and <it>R. oryzae </it>to identify the presence of significant differences in gene length. <it>O. bayeri</it>-specific inserts in otherwise highly conserved proteins were screened for by visual inspection of BLAST search results, compared with orthologs using MEGA 4 <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>, and their presence in mRNAs confirmed by RT-PCR. Locations of the <it>O. bayeri</it>-specific inserts on the corresponding protein three-dimensional structures were determined using SwissPDB-viewer and QuickPDB from the RSCB Protein Data Bank for available structures.</p>
         </sec>
         <sec>
            <st>
               <p>Repeated elements</p>
            </st>
            <p>DNA regions in the <it>O. bayeri </it>contigs showing homology with fungal repeated elements were identified with CENSOR <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> from the Genetic Information Research Institute webserver. Repeated elements arrayed in tandem in the <it>O. bayeri </it>contigs were determined with Tandem Repeat Finder 4.03 <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> using a match/mismatch/indel ratio of 2/7/7 and a minimum score of 50. Putative stem-loop structures in the <it>O. bayeri </it>contigs were screened for with PALINDROME from the EMBOSS package using a minimum stem length of 10 and a maximum loop length of 4.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GTR: General Time Reversible; KEGG: Kyoto Encyclopedia of Genes and Genomes; NCBI: National Center for Biotechnology Information; ORF: open reading frame.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>NC conceived the study, performed molecular and bioinformatics analyses, contributed major scientific ideas and drafted the manuscript. KLH cultured <it>O. bayeri </it>strains and provided DNA and RNA samples required for sequencing. JFP performed bioinformatics analyses and drafted the manuscript. DE provided the raw sequence data on which all presented analyses have been performed and drafted the manuscript. PJK contributed to scientific ideas presented here and in conceiving the study, and drafted the manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper: a table listing <it>Enc. cuniculi </it>predicted genes and the putative counterparts we identified in <it>O. bayeri </it>(Additional data file <supplr sid="S1">1</supplr>); a table listing the 80 <it>O. bayeri </it>proteins with assigned functions and motifs that are absent in <it>Enc. cuniculi </it>(Additional data file <supplr sid="S2">2</supplr>); a table listing <it>O. bayeri </it>ORFs and their assignment to functional categories (according to <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>) (Additional data file <supplr sid="S3">3</supplr>); the sequences of the six introns identified in <it>O. bayeri </it>(Additional data file <supplr sid="S4">4</supplr>); a figure showing three examples of large gene inserts we identified in otherwise conserved eukaryotic proteins (Additional data file <supplr sid="S5">5</supplr>); a graphical representation of the number of contigs used in this study and their respective sequence coverage (Additional data file <supplr sid="S6">6</supplr>); list of a number of repetitive elements we identified in the <it>O. bayeri </it>genome (Additional data file <supplr sid="S7">7</supplr>).</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p><it>Enc. cuniculi </it>predicted genes and the putative counterparts identified in <it>O. bayeri</it></p>
            </caption>
            <text>
               <p><it>Enc. cuniculi </it>predicted genes and the putative counterparts identified in <it>O. bayeri</it>.</p>
            </text>
            <file name="gb-2009-10-10-r106-S1.XLS">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>The 80 <it>O. bayeri </it>proteins with assigned functions and motifs that are absent in <it>Enc. cuniculi</it></p>
            </caption>
            <text>
               <p>The 80 <it>O. bayeri </it>proteins with assigned functions and motifs that are absent in <it>Enc. cuniculi</it>.</p>
            </text>
            <file name="gb-2009-10-10-r106-S2.DOC">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p><it>O. bayeri </it>ORFs and their assignment to functional categories (according to <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>)</p>
            </caption>
            <text>
               <p><it>O. bayeri </it>ORFs and their assignment to functional categories (according to <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>).</p>
            </text>
            <file name="gb-2009-10-10-r106-S3.XLS">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Sequences of the six introns identified in <it>O. bayeri</it></p>
            </caption>
            <text>
               <p>Sequences of the six introns identified in <it>O. bayeri</it>.</p>
            </text>
            <file name="gb-2009-10-10-r106-S4.DOC">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Three examples of large gene inserts identified in otherwise conserved eukaryotic proteins</p>
            </caption>
            <text>
               <p>Three examples of large gene inserts we identified in otherwise conserved eukaryotic proteins.</p>
            </text>
            <file name="gb-2009-10-10-r106-S5.EPS">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>The number of contigs used in this study and their respective sequence coverage</p>
            </caption>
            <text>
               <p>The number of contigs used in this study and their respective sequence coverage.</p>
            </text>
            <file name="gb-2009-10-10-r106-S6.PDF">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>Repetitive elements we identified in the <it>O. bayeri </it>genome</p>
            </caption>
            <text>
               <p>Repetitive elements we identified in the <it>O. bayeri </it>genome.</p>
            </text>
            <file name="gb-2009-10-10-r106-S7.XLS">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by Canadian Institute of Health Research (CIHR) operating MOP (MOP-42517) to PJK and the Swiss National Foundation to DE. PJK is a Fellow of the Canadian Institute for Advanced Research (CIFAR) and a Senior Scholar of the Michael Smith Foundation for Health Research (MSFHR). NC is a Scholar of the Canadian Institute for Advanced Research (CIFAR) and a senior postdoctoral fellow of the Swiss National Science Foundation (PA00P3_124166). JFP is the recipient of the Fonds Qu&#233;b&#233;cois de la Recherche sur la Nature et les Technologies (FQRNT)/G&#233;nome Qu&#233;bec Louis-Berlinguet Postdoctoral Fellowship. KLH's work in Basel was supported by a Brazilian fellowship from CNPq, process #201401/2007-0. We thank Hilary Morrison and acknowledge the Josephine Bay Paul Center for Comparative Molecular Biology and Evolution for the use of data included in the <it>Antonospora locustae </it>Genome Project funded by NSF award number 0135272. We would like to thank Erick James and two anonymous reviewers for their important comments on previous versions of the manuscript, Renny Lee for his help in identifying <it>O. bayeri </it>introns, Sylvia Doan for her help in annotating <it>O. bayeri </it>full length ORFs and Laurent Farinelli (FASTERIS SA, Switzerland) for sequencing.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Microsporidia in insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Becnel</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Andreadis</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>The Microsporidia and Microsporidiosis</source>
            <publisher>Washington, DC: American Society of Microbiology Press</publisher>
            <editor>Witter M, Weiss LM</editor>
            <pubdate>1999</pubdate>
            <fpage>447</fpage>
            <lpage>501</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Identification of microsporidia.</p>
            </title>
            <aug>
               <au>
                  <snm>Larsson</snm>
                  <fnm>JIR</fnm>
               </au>
            </aug>
            <source>Acta Protozoologica</source>
            <pubdate>1999</pubdate>
            <volume>38</volume>
            <fpage>161</fpage>
            <lpage>197</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Structure of the microsporidia.</p>
            </title>
            <aug>
               <au>
                  <snm>V&#225;vra</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Larsson</snm>
                  <fnm>JIR</fnm>
               </au>
            </aug>
            <source>The Microsporidia and Microsporidiosis</source>
            <publisher>Washington, DC: ASM Press</publisher>
            <editor>Wittner M, Weiss LM</editor>
            <pubdate>1999</pubdate>
            <fpage>7</fpage>
            <lpage>84</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Some properties of ribosomes from the sporoplasm of <it>Nosema bombycis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ishihara</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>YJ</fnm>
               </au>
            </aug>
            <source>Invert Pathol</source>
            <pubdate>1968</pubdate>
            <volume>11</volume>
            <fpage>377</fpage>
            <lpage>385</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2011(68)90186-9</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Presence of ribosomal RNAs with prokaryotic properties in Microsporidia, eukaryotic organisms.</p>
            </title>
            <aug>
               <au>
                  <snm>Curgy</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>V&#225;vra</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vivar&#232;s</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>Biol Cell</source>
            <pubdate>1980</pubdate>
            <volume>38</volume>
            <fpage>49</fpage>
            <lpage>51</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications.</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1995</pubdate>
            <volume>92</volume>
            <fpage>2441</fpage>
            <lpage>2445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.92.7.2441</pubid>
                  <pubid idtype="pmcid">42233</pubid>
                  <pubid idtype="pmpid">7708661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Eukaryote kingdoms: seven or nine?</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biosystems</source>
            <pubdate>1981</pubdate>
            <volume>14</volume>
            <fpage>461</fpage>
            <lpage>481</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0303-2647(81)90050-2</pubid>
                  <pubid idtype="pmpid">7337818</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Eukaryotes with no mitochondria.</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1987</pubdate>
            <volume>326</volume>
            <fpage>332</fpage>
            <lpage>333</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/326332a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">3561476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Complete nucleotide sequences of the genes encoding translation elongation factors 1 alpha and 2 from a microsporidian parasite, <it>Glugea plecoglossi</it>: implications for the deepest branching of eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kamaishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hashimoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Masuda</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Biochem</source>
            <pubdate>1996</pubdate>
            <volume>120</volume>
            <fpage>1095</fpage>
            <lpage>1103</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9010756</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Protein phylogeny of translation elongation factor EF-1 alpha suggests microsporidians are extremely ancient eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kamaishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hashimoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Murata</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shimizu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1996</pubdate>
            <volume>42</volume>
            <fpage>257</fpage>
            <lpage>263</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02198852</pubid>
                  <pubid idtype="pmpid" link="fulltext">8919877</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Ribosomal RNA sequence suggests microsporidia are extremely ancient eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Vossbrinck</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Maddox</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Debrunner-Vossbrinck</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1987</pubdate>
            <volume>326</volume>
            <fpage>411</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/326411a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">3550472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>A mitochondrial remnant in the microsporidian <it>Trachipleistophora hominis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Lucocq</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <fpage>865</fpage>
            <lpage>869</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature00949</pubid>
                  <pubid idtype="pmpid" link="fulltext">12192407</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The rDNA unit of <it>Encephalitozoon cuniculi </it>(Microsporidia): complete 23S sequence and copy number.</p>
            </title>
            <aug>
               <au>
                  <snm>Biderre</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Peyretaillade</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Duffieux</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Peyret</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Metenier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Vivares</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <fpage>76S</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.1997.tb05790.x</pubid>
                  <pubid idtype="pmpid">9508456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Gene descent, duplication, and horizontal transfer in the evolution of glutamyl- and glutaminyl-tRNA synthetases.</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>49</volume>
            <fpage>485</fpage>
            <lpage>495</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006571</pubid>
                  <pubid idtype="pmpid" link="fulltext">10486006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Only six kingdoms of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Biol Sci</source>
            <pubdate>2004</pubdate>
            <volume>271</volume>
            <fpage>1251</fpage>
            <lpage>1262</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rspb.2004.2705</pubid>
                  <pubid idtype="pmcid">1691724</pubid>
                  <pubid idtype="pmpid" link="fulltext">15306349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Microsporidia: a journey through radical taxonomical revisions.</p>
            </title>
            <aug>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Fungal Biol Rev</source>
            <pubdate>2009</pubdate>
            <note>doi: 10.1016/j.fbr.2009.05.001.</note>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Phylogenetic analysis of beta-tubulin sequences from amitochondrial protozoa.</p>
            </title>
            <aug>
               <au>
                  <snm>Edlind</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Visvesvara</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Vodkin</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>McLaughlin</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Katiyar</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>1996</pubdate>
            <volume>5</volume>
            <fpage>359</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/mpev.1996.0031</pubid>
                  <pubid idtype="pmpid" link="fulltext">8728394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Phylogenetic analysis of the TATA box binding protein (TBP) gene from <it>Nosema locustae</it>: evidence for a microsporidia-fungi relationship and spliceosomal intron loss.</p>
            </title>
            <aug>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Logsdon</snm>
                  <fnm>JM</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>1415</fpage>
            <lpage>1419</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10563021</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Microsporidia are related to Fungi: evidence from the largest subunit of RNA polymerase II and other proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Logsdon</snm>
                  <fnm>JM</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Healy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dorey</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>580</fpage>
            <lpage>585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.96.2.580</pubid>
                  <pubid idtype="pmcid">15179</pubid>
                  <pubid idtype="pmpid" link="fulltext">9892676</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Congruent evidence from alpha-tubulin and beta-tubulin gene phylogenies for a zygomycete origin of microsporidia.</p>
            </title>
            <aug>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Fungal Genet Biol</source>
            <pubdate>2003</pubdate>
            <volume>38</volume>
            <fpage>298</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1087-1845(02)00537-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12684019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Evidence from beta-tubulin phylogeny that microsporidia evolved from within the fungi.</p>
            </title>
            <aug>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Luker</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>23</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10666703</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Microsporidia evolved from ancestral sexual fungi.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Byrnes</snm>
                  <fnm>EJ</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Torres-Martinez</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dietrich</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Heitman</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>1675</fpage>
            <lpage>1679</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2008.09.030</pubid>
                  <pubid idtype="pmpid" link="fulltext">18976912</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Phylogenetic analysis of the complete genome sequence of <it>Encephalitozoon cuniculi </it>supports the fungal origin of microsporidia and reveals a high frequency of fast-evolving genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Thomarat</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Vivares</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2004</pubdate>
            <volume>59</volume>
            <fpage>780</fpage>
            <lpage>791</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-004-2673-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15599510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Microsporidia: accumulating molecular evidence that a group of amitochondriate and suspectedly primitive eukaryotes are just curious fungi.</p>
            </title>
            <aug>
               <au>
                  <snm>Peer</snm>
                  <mnm>Van de</mnm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ben Ali</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2000</pubdate>
            <volume>246</volume>
            <fpage>1</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(00)00063-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10767522</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Genome sequence and gene compaction of the eukaryote parasite <it>Encephalitozoon cuniculi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Katinka</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Duprat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cornillot</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Metenier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomarat</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Prensier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Peyretaillade</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brottier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Delbac</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>El Alahoui</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Peyret</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Saurin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vivar&#233;s</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>414</volume>
            <fpage>450</fpage>
            <lpage>453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35106579</pubid>
                  <pubid idtype="pmpid" link="fulltext">11719806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A novel route for ATP acquisition by the remnant mitochondria of <it>Encephalitozoon cuniculi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Tsaousis</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Kunji</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Lucocq</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>453</volume>
            <fpage>553</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06903</pubid>
                  <pubid idtype="pmpid" link="fulltext">18449191</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>An ADP/ATP-specific mitochondrial carrier protein in the microsporidian <it>Antonospora locustae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Haferkamp</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>375</volume>
            <fpage>1249</fpage>
            <lpage>1257</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2007.11.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">18078956</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>mRNA processing in <it>Antonospora locustae </it>spores.</p>
            </title>
            <aug>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Burri</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2008</pubdate>
            <volume>280</volume>
            <fpage>565</fpage>
            <lpage>574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-008-0387-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">18818951</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Comparative profiling of overlapping transcription in the compacted genomes of microsporidia <it>Antonospora locustae </it>and <it>Encephalitozoon cuniculi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gangaeva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2008</pubdate>
            <volume>91</volume>
            <fpage>388</fpage>
            <lpage>393</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2007.12.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">18280697</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A high frequency of overlapping gene expression in compacted eukaryotic genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Slamovits</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Patron</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>10936</fpage>
            <lpage>10941</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0501321102</pubid>
                  <pubid idtype="pmcid">1182411</pubid>
                  <pubid idtype="pmpid" link="fulltext">16037215</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Patterns of genome evolution among the microsporidian parasites <it>Encephalitozoon cuniculi</it>, <it>Antonospora locustae </it>and <it>Enterocytozoon bieneusi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Akiyoshi</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Tzipori</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e1277</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0001277</pubid>
                  <pubid idtype="pmcid">2099475</pubid>
                  <pubid idtype="pmpid" link="fulltext">18060071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Genome compaction and stability in microsporidian intracellular parasites.</p>
            </title>
            <aug>
               <au>
                  <snm>Slamovits</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Law</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>891</fpage>
            <lpage>896</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2004.04.041</pubid>
                  <pubid idtype="pmpid" link="fulltext">15186746</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Genomic survey of the non-cultivatable opportunistic human pathogen, <it>Enterocytozoon bieneusi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Akiyoshi</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Lei</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Corradi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mayanja</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tumwine</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Tzipori</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2009</pubdate>
            <volume>5</volume>
            <fpage>e1000261</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.ppat.1000261</pubid>
                  <pubid idtype="pmcid">2607024</pubid>
                  <pubid idtype="pmpid" link="fulltext">19132089</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Genomic analyses of the microsporidian <it>Nosema ceranae</it>, an emergent pathogen of honey bees.</p>
            </title>
            <aug>
               <au>
                  <snm>Cornman</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Schatz</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Street</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Desany</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Egholm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hutchison</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pettis</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Lipkin</snm>
                  <fnm>WI</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2009</pubdate>
            <volume>5</volume>
            <fpage>e1000466</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.ppat.1000466</pubid>
                  <pubid idtype="pmcid">2685015</pubid>
                  <pubid idtype="pmpid" link="fulltext">19503607</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Genes coding for reverse transcriptase, DNA-directed RNA polymerase, and chitin synthase from the microsporidian <it>Spraguea lophii</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Hinkle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Sogin</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Biol Bull</source>
            <pubdate>1997</pubdate>
            <volume>193</volume>
            <fpage>250</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9390396</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Sequence survey of the genome of the opportunistic microsporidian pathogen, <it>Vittaforma corneae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Mittleider</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>VH</fnm>
               </au>
               <au>
                  <snm>Michael</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Didier</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Brindley</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>49</volume>
            <fpage>393</fpage>
            <lpage>401</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2002.tb00218.x</pubid>
                  <pubid idtype="pmpid">12425527</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>ESTs from the microsporidian <it>Edhazardia aedis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gill</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Becnel</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>296</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-9-296</pubid>
                  <pubid idtype="pmcid">2474625</pubid>
                  <pubid idtype="pmpid" link="fulltext">18570666</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Genome sequence surveys of <it>Brachiola algerae </it>and <it>Edhazardia aedis </it>reveal microsporidia with low gene densities.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Becnel</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>200</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-9-200</pubid>
                  <pubid idtype="pmcid">2387174</pubid>
                  <pubid idtype="pmpid" link="fulltext">18445287</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <aug>
               <au>
                  <snm>Ebert</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Ecology, Epidemiology, and Evolution of Parasitism in Daphnia [Internet]</source>
            <publisher>Bethesda (MD): National Library of Medicine (US), National Center for Biotechnology Information</publisher>
            <url>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Books</url>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Different mechanisms of transmission of the microsporidium <it>Octosporea bayeri</it>: a cocktail of solutions for the problem of parasite permanence.</p>
            </title>
            <aug>
               <au>
                  <snm>Vizoso</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Lass</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ebert</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Parasitology</source>
            <pubdate>2005</pubdate>
            <volume>130</volume>
            <fpage>501</fpage>
            <lpage>509</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0031182004006699</pubid>
                  <pubid idtype="pmpid">15991493</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Host-parasite coevolution: insights from the <it>Daphnia</it>-parasite model system.</p>
            </title>
            <aug>
               <au>
                  <snm>Ebert</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2008</pubdate>
            <volume>11</volume>
            <fpage>290</fpage>
            <lpage>301</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.mib.2008.05.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">18556238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Alpha- and beta-tubulin phylogenies support a close relationship between the microsporidia <it>Brachiola algerae </it>and <it>Antonospora locustae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Adamson</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2008</pubdate>
            <volume>55</volume>
            <fpage>388</fpage>
            <lpage>392</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">19017059</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Genome-wide bioinformatic and molecular analysis of introns in <it>Saccharomyces cerevisiae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Spingola</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grate</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ares</snm>
                  <fnm>M</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>1999</pubdate>
            <volume>5</volume>
            <fpage>221</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S1355838299981682</pubid>
                  <pubid idtype="pmcid">1369754</pubid>
                  <pubid idtype="pmpid" link="fulltext">10024174</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The highly reduced genome of an enslaved algal nucleus.</p>
            </title>
            <aug>
               <au>
                  <snm>Douglas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zauner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fraunholz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Beaton</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Reith</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maier</snm>
                  <fnm>UG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>410</volume>
            <fpage>1091</fpage>
            <lpage>1096</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35074092</pubid>
                  <pubid idtype="pmpid" link="fulltext">11323671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>A unique insertion in <it>Plasmodium berghei </it>glucose-6-phosphate dehydrogenase-6-phosphogluconolactonase: evolutionary and functional studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Clarke</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Sodeinde</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>2003</pubdate>
            <volume>127</volume>
            <fpage>1</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0166-6851(02)00298-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">12615331</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p><it>Toxoplasma gondii </it>expresses two distinct lactate dehydrogenase homologous genes during its life cycle in intermediate hosts.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Parmley</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1997</pubdate>
            <volume>184</volume>
            <fpage>1</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(96)00566-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">9016946</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Prevalence of small inversions in yeast gene order evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Seoighe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Federspiel</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bivolarovic</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Surzycki</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tamse</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Komp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Huizar</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Scherer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tait</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>14433</fpage>
            <lpage>14437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.240462997</pubid>
                  <pubid idtype="pmcid">18936</pubid>
                  <pubid idtype="pmpid" link="fulltext">11087826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Protein-length distributions for the three domains of life.</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>107</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01922-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">10689349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Bacterial catalase in the microsporidian <it>Nosema locustae</it>: implications for microsporidian metabolism and genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Fast</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Law</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2003</pubdate>
            <volume>2</volume>
            <fpage>1069</fpage>
            <lpage>1075</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/EC.2.5.1069-1075.2003</pubid>
                  <pubid idtype="pmcid">219363</pubid>
                  <pubid idtype="pmpid" link="fulltext">14555490</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Class II photolyase in a microsporidian intracellular parasite.</p>
            </title>
            <aug>
               <au>
                  <snm>Slamovits</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>341</volume>
            <fpage>713</fpage>
            <lpage>721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.06.032</pubid>
                  <pubid idtype="pmpid" link="fulltext">15288781</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Microsporidian mitochondrial proteins: expression in <it>Antonospora locustae </it>spores and identification of genes coding for two further proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>52</volume>
            <fpage>271</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15927004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>The varying microsporidian genome: existence of long-terminal repeat retrotransposon in domesticated silkworm parasite <it>Nosema bombycis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Xiang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Int J Parasitol</source>
            <pubdate>2006</pubdate>
            <volume>36</volume>
            <fpage>1049</fpage>
            <lpage>1056</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ijpara.2006.04.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">16797019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Basic local alignment search tool.</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.</p>
            </title>
            <aug>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>955</fpage>
            <lpage>964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/25.5.955</pubid>
                  <pubid idtype="pmcid">146525</pubid>
                  <pubid idtype="pmpid" link="fulltext">9023104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>EMBOSS: the European Molecular Biology Open Software Suite.</p>
            </title>
            <aug>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bleasby</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>276</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02024-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">10827456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>KEGG: Kyoto Encyclopedia of Genes and Genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Ogata</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fujibuchi</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>29</fpage>
            <lpage>34</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/27.1.29</pubid>
                  <pubid idtype="pmcid">148090</pubid>
                  <pubid idtype="pmpid" link="fulltext">9847135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>The Pfam protein families database.</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Tate</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Coggill</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Sammut</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Hotz</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Ceric</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Forslund</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D281</fpage>
            <lpage>288</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm960</pubid>
                  <pubid idtype="pmpid" link="fulltext">18039703</pubid>
                  <pubid idtype="pmcid">2238907</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>UniProt: the Universal Protein knowledgebase.</p>
            </title>
            <aug>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Boeckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gasteiger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Magrane</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Redaschi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yeh</snm>
                  <fnm>LSL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>115</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh131</pubid>
                  <pubid idtype="pmcid">373264</pubid>
                  <pubid idtype="pmpid" link="fulltext">14704348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p><it>Antonospora locustae </it>Genome Database</p>
            </title>
            <url>http://gmod.mbl.edu/perl/site/antonospora01</url>
         </bibl>
         <bibl id="B60">
            <title>
               <p>MUSCLE: multiple sequence alignment with high accuracy and high throughput.</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>1792</fpage>
            <lpage>1797</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh340</pubid>
                  <pubid idtype="pmcid">390337</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034147</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments.</p>
            </title>
            <aug>
               <au>
                  <snm>Talavera</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Castresana</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>56</volume>
            <fpage>564</fpage>
            <lpage>577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150701472164</pubid>
                  <pubid idtype="pmpid" link="fulltext">17654362</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>MrBayes 3: Bayesian phylogenetic inference under mixed models.</p>
            </title>
            <aug>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1572</fpage>
            <lpage>1574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12912839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.</p>
            </title>
            <aug>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dudley</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1596</fpage>
            <lpage>1599</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm092</pubid>
                  <pubid idtype="pmpid" link="fulltext">17488738</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor.</p>
            </title>
            <aug>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Hankus</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>474</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-7-474</pubid>
                  <pubid idtype="pmcid">1634758</pubid>
                  <pubid idtype="pmpid" link="fulltext">17064419</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Tandem repeats finder: a program to analyze DNA sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Benson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>573</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/27.2.573</pubid>
                  <pubid idtype="pmcid">148217</pubid>
                  <pubid idtype="pmpid" link="fulltext">9862982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

