<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2009-10-1-r6</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Characterization and comparative profiling of the small RNA transcriptomes in two phases of locust</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Wei</snm>
               <fnm>Yuanyuan</fnm>
               <insr iid="I1"/>
               <email>weiyy_1982@163.com</email>
            </au>
            <au id="A2">
               <snm>Chen</snm>
               <fnm>Shuang</fnm>
               <insr iid="I1"/>
               <email>chenshuangzbs@163.com</email>
            </au>
            <au id="A3">
               <snm>Yang</snm>
               <fnm>Pengcheng</fnm>
               <insr iid="I1"/>
               <email>pengchy@yahoo.com.cn</email>
            </au>
            <au id="A4">
               <snm>Ma</snm>
               <fnm>Zongyuan</fnm>
               <insr iid="I1"/>
               <email>mazongyuan@163.com</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Kang</snm>
               <fnm>Le</fnm>
               <insr iid="I1"/>
               <email>lkang@ioz.ac.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Datun Road, Chaoyang District, Beijing 100101, PR China</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2009</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>R6</fpage>
         <url>http://genomebiology.com/2009/10/1/R6</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">19146710</pubid>
               <pubid idtype="doi">10.1186/gb-2009-10-1-r6</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>11</day>
               <month>12</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>16</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>16</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Wei et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Locust small RNAs</p>
      </shorttitle>
      <shortabs>
         <p>High-throughput sequencing of the small RNA transcriptome of locust reveals differences in post-transcriptional regulation between solitary and swarming phases and provides insights into the evolution of insect small RNAs.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>All the reports on insect small RNAs come from holometabolous insects whose genome sequence data are available. Therefore, study of hemimetabolous insect small RNAs could provide more insights into evolution and function of small RNAs in insects. The locust is an important, economically harmful hemimetabolous insect. Its phase changes, as a phenotypic plasticity, result from differential gene expression potentially regulated at both the post-transcriptional level, mediated by small RNAs, and the transcriptional level.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here, using high-throughput sequencing, we characterize the small RNA transcriptome in the locust. We identified 50 conserved microRNA families by similarity searching against miRBase, and a maximum of 185 potential locust-specific microRNA family candidates were identified using our newly developed method independent of locust genome sequence. We also demonstrate conservation of microRNA*, and evolutionary analysis of locust microRNAs indicates that the generation of miRNAs in locusts is concentrated along three phylogenetic tree branches: bilaterians, coelomates, and insects. Our study identified thousands of endogenous small interfering RNAs, some of which were of transposon origin, and also detected many Piwi-interacting RNA-like small RNAs. Comparison of small RNA expression patterns of the two phases showed that longer small RNAs were expressed more abundantly in the solitary phase and that each category of small RNAs exhibited different expression profiles between the two phases.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The abundance of small RNAs in the locust might indicate a long evolutionary history of post-transcriptional gene expression regulation, and differential expression of small RNAs between the two phases might further disclose the molecular mechanism of phase changes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010005">Development</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010013">Methods</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Regulation of gene expression can occur at both transcriptional and post-transcriptional levels. In recent years, the discovery of numerous small RNAs has increased interest in post-transcriptional gene expression regulation during development and other biological processes. Small RNAs include several kinds of short non-coding RNAs, such as microRNA (miRNA), small interfering RNA (siRNA), and Piwi-associated RNA (piRNA), which all regulate gene expression at the post-transcriptional level. Typically, miRNAs are approximately 22 nucleotide small-RNA sequences <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> that play key roles in many diverse biological processes, including development, viral defense, metabolism, and apoptosis <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The 'seed' region, located at miRNA nucleotides 2-8 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, is the most important sequence for interaction with mRNA targets. There are two other important non-coding RNAs: endogenous siRNA (endo-siRNA) and piRNA. Endo-siRNA is derived from double-stranded RNA to guide RNA interference. Much of the research on endo-siRNAs has been done in plants <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, but recently endo-siRNAs derived from transposons and mRNAs in flies have also been identified <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. These findings indicate that endo-siRNAs may play a broader role in all organisms. A new class of small RNAs, piRNA, was discovered two years ago. piRNAs, 23-30 nucleotides in length, interact with PIWI proteins and repress the expression of selfish genetic elements, such as transposons, in the germ line <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Insects comprise the largest group of metazoans, and previous studies have shown that small RNAs are involved in a significant number of biological processes in them <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Many small RNAs have been identified in insects whose whole genome sequences are available, including the fruit fly, bee, mosquito, and silkworm. These insects are all holometabolous, meaning that they go through the complete four stages of metamorphism. Another important group of insects are hemimetabolous insects, which undergo an incomplete metamorphism, bypassing the pupa stage. In this group of insects, no research on small RNAs has been carried out. Studies on small RNAs in very different groups of insects are important for understanding the evolution of post-transcriptional gene expression regulation, and gaining specific information from the hemimetabolous group represents a unique opportunity to examine species with an analogous, but modified, developmental process. Combined with the holometabolous group, the study of small RNAs in the hemimetabolous group, including several ancient orders of insects, could aid in understanding the whole picture of evolution and function of small RNAs in insects.</p>
         <p>The migratory locust (<it>Locusta migratoria</it>) is a typical hemimetabolous insect within the family Acrididae and is a worldwide, highly prevalent agricultural pest causing hundreds of millions of dollars worth of damage every year. The locust has also been used in research as a model organism for the study of developmental, physiological, immune, and neural pathways, as well as others <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Additionally, as compared to the fruit fly, the locust is a far more primitive insect, making it an excellent model for studying evolution.</p>
         <p>A great deal of work has been carried out specifically on the ability of the locust to change phases from solitary to gregarious (in the latter phase, locusts form swarms that cause devastation of crops). Phase transition, as a phenotypic plasticity in response to population density changes, is one of the most interesting behavioral phenomena of the locust, and is linked with changes in morphology, behavior, reproduction, endocrine balance, and disease resistance, all of which include many changes at the molecular level that are potentially involved in both transcriptional <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and post-transcriptional regulation of gene expression. Given that small RNAs are known to be a key component in post-transcriptional gene expression regulation in a variety of organisms, information on the presence and activities of small RNAs in the locust would be particularly useful. The locust, however, currently lacks any substantial genome sequence data. Thus, the available expressed sequence tags (ESTs) <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> provide the only basis for small RNA annotation. It is possible to identify the precursors of miRNAs and endogenous siRNAs via alignment to ESTs <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. The identification and comparison of small RNAs in the gregarious and solitary phases can aid in understanding the mechanisms underlying their different biological processes, especially phase transition. Furthermore, differences in small RNAs between the two phases might provide clues about how to control locust plagues throughout the world by designing artificial siRNAs, thus saving a huge number of crops every year.</p>
         <p>For this study, because there is no whole genomic information available, we utilized the new high-throughput sequencing method (Illumina Genome Analyzer), instead of computational approaches, to characterize locust small RNAs, and developed a new method to predict locust-specific miRNAs. We further compared the small RNA characteristics and expression patterns between the gregarious and solitary phases.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>High-throughput sequencing of small RNAs</p>
            </st>
            <p>To survey small RNAs in the locust, we used Illumina sequencing technology on libraries of small RNAs from the gregarious and solitary phases [GEO:GSE12640]. We obtained 1,566,242 reads from the gregarious library and 1,949,248 reads from the solitary library after discarding the empty adapters. Generally, length distribution of small RNAs in two phase libraries is different (Figure <figr fid="F1">1a</figr>; also see the section 'Different expression profiles of small RNAs in the two phases' below). After discarding low-quality sequences, sequences shorter than 18 nucleotides, and single-read sequences, 895,554 and 1,377,859 reads, for the gregarious and solitary phases, respectively, remained for analysis. After comparing the small RNA sequences with the locust EST database <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> as well as the <it>Drosophila melanogaster </it>rRNA, tRNA and snoRNA database <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, sequences that came from these types of RNAs (Figure <figr fid="F1">1b</figr>) were removed. The remaining sequences were clustered based on sequence similarity because related sequences probably came from the same precursor as cleavage by RNase III enzymes was imprecise. We determined that the sequence with the dominant number of reads in a cluster was likely to be the real sequence due to its relatively high expression level, and these sequence clusters were further analyzed.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Length distribution and composition of the small RNA libraries in gregarious and solitary locusts</p>
               </caption>
               <text>
                  <p>Length distribution and composition of the small RNA libraries in gregarious and solitary locusts. Nt, nucleotides.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Conserved microRNAs</p>
            </st>
            <p>We identified 55 miRNA sequences, belonging to 50 families (Table S1 in Additional data file 3), in the migratory locust by BLAST against the miRBase v11.0 <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Most of the 50 miRNA families share the same 'seed' regions (the 5' region important for target recognition) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> in the locust and other insects. However, locust miR-10 and miR-79 (lmi-miR-10 and lmi-miR-79) have very different 5' ends, thus changing their 'seed' region, compared with miR-10 and miR-79 of the other four insect species studied. For locust miR-79, the mature sequence has an additional adenosine at the 5' end (Figure S1 in Additional data file 3), similar to that of the <it>Caenorhabditis elegans </it>miR-79 (cel-miR-79). Although in most cases the key 'seed' site of the miRNA is nucleotides 2-8 <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B19">19</abbr></abbrgrp>, the 8-mer seed site of <it>D. melanogaster </it>miR-79 (dme-miR-79) has been validated as being at nucleotides 1-8 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, which is the same as locust miR-79 nucleotides 2-9. This indicates that the additional adenosine at the 5' end of lmi-miR-79 possibly does not lead to different targets in the locust and fly.</p>
            <p>For lmi-miR-10, much like lmi-miR-79, the mature sequence in the locust has an additional nucleotide at the 5' end, in this case a uridine (Figure S1 in Additional data file 3), which is the same as the miR-10 of non-insect organisms. Previous studies have demonstrated that miR-10 in both species that do and do not have an extra U have similar targets <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Although lmi-miR-79 and lmi-miR-10 of the locust have an extra nucleotide at the 5' end compared to those of the fruit fly, they still have the same 'seed' sequences, which may potentially regulate similar targets.</p>
         </sec>
         <sec>
            <st>
               <p>Conservation of miRNA*</p>
            </st>
            <p>Although mature miRNA and miRNA* (the miRNA:miRNA* duplex) are complementary, their base-pairing is imperfect in the presence of compensatory substitutions (for example, C-G to U-G), and the miRNA* is generally less stable than the mature miRNA <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Analysis of miRNA and miRNA* species in the miRNA database <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> indicated that miRNA* is less conserved than miRNA (data not shown). However, we found the homologs of several <it>D. melanogaster </it>miRNA* (miR-iab-4, miR-8, miR-9a, miR-10, miR-210, miR-276, miR-281, and miR-307; Table S2 in Additional data file 3) in the locust library, indicating conservation of these miRNAs* between the locust and the fruit fly.</p>
            <p>To test whether the locust miRNA* and their corresponding mature miRNA sequences came from the same precursors, we used a PCR-based method to confirm the relationship between the miRNA and its miRNA*. If the miRNA and its miRNA* came from the same precursor, we should be able to amplify 55-70 bp fragments from the genomic DNA. As expected, we amplified 55-70 bp products from all the miRNAs with the exception of mir-iab-4 (Figure <figr fid="F2">2a</figr>), and the sequences of the PCR products confirmed the matches between miRNAs* and mature miRNAs (Table S2 in Additional data file 3). We could not amplify the expected products of mir-iab-4, although we repeatedly performed the PCR experiments; the two sequences probably do not comprise the canonical miRNA precursor in the locust.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Conservation of miRNA* in the locust</p>
               </caption>
               <text>
                  <p>Conservation of miRNA* in the locust. <b>(a) </b>Electrophoretic analysis of PCR products amplified by the primer pairs designed on the basis of predicted miRNA* as based on a similarity to fruit fly miRNA* and their corresponding mature miRNAs. For each miRNA, the left lane is the negative control and the right lane is the positive result. <b>(b) </b>Two examples of precursor sequences of seven conserved miRNAs that have a conserved star sequence. The alignment of mir-276 and mir-307 in different insects shows high conservation of their miRNA*. The green nucleotides represent miRNA star sequence and the red represent mature miRNA sequence. The asterisks indicate the conserved sites among these species. <b>(c) </b>Hairpin structures of the mir-276 and mir-307 precursors of the locust. aga, <it>A</it>. <it>gambiae</it>; ame, <it>A</it>. <it>mellifera</it>; bmo, <it>B</it>. <it>mori</it>; dme, <it>D</it>. <it>melanogaster</it>; lmi, <it>L</it>. <it>migratoria</it>.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-2"/>
            </fig>
            <p>We used the sequences of the amplified products of the conserved miRNA precursors to predict their secondary structure using mfold <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, and all seven sequences could be properly folded into the typical hairpin structure (Figure <figr fid="F2">2c</figr>), again indicating that the miRNA pairs came from the same precursor and could properly fold into the pre-miRNA-like hairpin for further processing. Taken together, these data indicate that, in addition to conservation of mature miRNAs, some of the locust miRNA* are also highly conserved in different lineages (Figure <figr fid="F2">2b</figr>). That the miRNA* are conserved across several lineages indicates a possible role of miRNA* in regulating gene expression, which was previously reported in flies <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
            <p>Since the locust and fruit fly separated about 350 million years ago <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, it is striking that the 22-nucleotide miRNA* has little sequence divergence between the two species. Moreover, in the case of lmi-mir-10, a greater number of reads (two-fold more abundant) was generated by the star form. For lmi-mir-8 and lmi-mir-276, thousands of their star reads were presented in the library (Figure S2 in Additional data file 3). These findings also implicated a functional role of miRNA* in regulating gene expression.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of locust-specific miRNA families</p>
            </st>
            <p>In an attempt to discover locust-specific miRNA families, we integrated the data from the locust small RNA libraries we created with those of the locust EST database <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. This, however, did not provide any significant findings (see Materials and methods), likely because of the low coverage of the locust EST database. Given that no methods were available to identify locust lineage-specific miRNA families in the absence of locust genomic information <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>, we developed a new method that is based on high-throughput sequencing but does not require the presence of whole genome sequence data (see Materials and methods).</p>
            <p>We obtained 185 miRNA duplex-like pairs (Figure <figr fid="F3">3a</figr>; Table S3 in Additional data file 3 shows the sequences with the dominant reads, potential miRNA candidates, in the pairs). If these pairs were true miRNA duplexes, 55-70 bp fragments should be amplified from the locust genomic DNA using primers designed according to the duplexes. To test the validity of our method to identify species-specific miRNAs, we amplified corresponding fragments from locust genomic DNA for 24 of our predicted candidate duplexes. Using this method we obtained amplified fragments of expected length from 13 out of the 24 candidates (Figure <figr fid="F3">3b</figr> and Table <tblr tid="T1">1</tblr>), indicating that about half of the predicted candidates may be canonical miRNA duplexes of which the strand with more reads should be mature miRNA and the other strand should be miRNA*.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Validated locust-specific miRNAs</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>miRNA family</p>
                     </c>
                     <c ca="left">
                        <p>Mature miRNA sequence (5'-3')</p>
                     </c>
                     <c ca="center">
                        <p>Length*</p>
                     </c>
                     <c ca="left">
                        <p>miRNA star sequence (5'-3')</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-01</p>
                     </c>
                     <c ca="left">
                        <p>UCAGGAAAUCAAUCGUGUAAGU</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>UUACACAGCUGGUUUCCUGGGA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-02</p>
                     </c>
                     <c ca="left">
                        <p>UGAAGCUCCUCAUAUCUGACCU</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>GUGAGAUGUGAUGAGCUUCACU</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-03</p>
                     </c>
                     <c ca="left">
                        <p>UAAGCUCGUCUUUCUGAGCAGU</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>UCUUCGGAGGCGUGGGUAUCCC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-04</p>
                     </c>
                     <c ca="left">
                        <p>UAAUCUCAUGUGGUAACUGUGA</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>CAGAUUGCCAUGUGGGGUUUCA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-05</p>
                     </c>
                     <c ca="left">
                        <p>AGCAUGAUCAGUGGCAUGAAUU</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>UUCGUGUGACUGCUCAUGCAAC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-06</p>
                     </c>
                     <c ca="left">
                        <p>AUGGUGUCAGGAAUAUGAGUCG</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>ACACAUAUUCCUGAUACUGACA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-07</p>
                     </c>
                     <c ca="left">
                        <p>GAAGAGAUAGAGGAGUCAACUGC</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>ACUGACUUCUCCAUCUCUUUGC</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-08</p>
                     </c>
                     <c ca="left">
                        <p>CUGAAGUCACACGAGAGCGCCGU</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>CGCUCUCGUGUGACGUCAGGCA</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-09</p>
                     </c>
                     <c ca="left">
                        <p>UUAUUCUGUCCGUGCCUCGAAA</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>UUUGGCAGGUGGGCAGAAUAUGU</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-10</p>
                     </c>
                     <c ca="left">
                        <p>GUAGGCCGGCGGAAACUACUUG</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>AGGGGUUUCUUUCGGCCUCCAG</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-11</p>
                     </c>
                     <c ca="left">
                        <p>AUGAGCAAUGUUAUUCAAAUGG</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>AUUUGAAUAUCAUUGCACAUUG</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-12</p>
                     </c>
                     <c ca="left">
                        <p>UGAUGCUGCAGGAGUUGUUGUGU</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>AUGGUAACCCUUGAGGAGUCUUG</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>lmi-novel-13</p>
                     </c>
                     <c ca="left">
                        <p>ACUGACUGCCCUAUUUCUUUGC</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>GAAGAGAUAGGACAGUCAAUCU</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*Length of mature miRNAs in nucleotides.</p>
               </tblfn>
            </tbl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Principles of locust-specific miRNA prediction and examples of the secondary structure of locust-specific miRNA precursors</p>
               </caption>
               <text>
                  <p>Principles of locust-specific miRNA prediction and examples of the secondary structure of locust-specific miRNA precursors. <b>(a)</b> The features of miRNA and other small RNAs. Left side: the red and green lines represent the mature miRNA and miRNA*, respectively, which can be found in the same small RNA library sequenced by high-throughput sequencing in most cases. The black circles show the 1-2 nucleotide 3' overhang of miRNA:miRNA* duplex. Right side: inconsistency at the 5' ends of other small RNAs and the degradation fragments. <b>(b) </b>Electrophoretic analysis of PCR products of lmi-novel-04 and lmi-novel-07, showing the expected length of 55-70 nucleotides. <b>(c) </b>The secondary structures of lmi-novel-04 and lmi-novel-07. The red sequence represents mature miRNA and the green represents miRNA*. The black circles indicate 1-2 nucleotide 3' overhangs.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-3"/>
            </fig>
            <p>We sequenced 8 of the 13 amplified products and, using mfold <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, were able to confirm the ability of the 8 products to accurately fold in the typical hairpin structure of miRNA precursors (Figure <figr fid="F3">3c</figr>). For the 185 novel miRNA family candidates we predicted, we could not identify homologs in the <it>Drosophila </it>genome, indicating that they are probably species-specific families.</p>
         </sec>
         <sec>
            <st>
               <p>miRNA expression patterns</p>
            </st>
            <p>High-throughput sequencing is not only a good tool for identifying small RNAs, it can also provide information about their expression levels. Compared with other small RNAs, miRNAs make up a larger proportion of the locust small RNA libraries (Figure <figr fid="F1">1b</figr>), indicating that miRNAs are the main kind of small RNAs involved in gene expression regulation in the locust. However, our libraries are made up of a mixture of different tissue samples at different developmental stages, so it is possible that the proportion of miRNAs to other small RNAs could vary in different tissues or developmental stages.</p>
            <p>Some of the miRNAs we identified had more than one thousand reads, while others had fewer than ten (Figure S2 in Additional data file 3). Reads of the most abundant miRNAs are about 10,000-fold higher than those of the scarce miRNAs. Such extreme variation can provide some basic insight into the function of these miRNAs. The most abundant miRNA is mir-1, which had approximately 163,143 reads in the gregarious library and 135,794 in the solitary library. As a muscle-specific miRNA <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, mir-1 is the most abundant given its broad range of expression in different developmental stages and the high proportion of muscle tissues in the locust. As with mir-1, the miRNAs that have more reads should be expressed during most developmental stages, while those having fewer reads, such as mir-210 and lmi-novel-01 (Figure S2 in Additional data file 3), should be expressed in a much narrower range. It is likely that the expression of those exiguous miRNAs is developmentally related.</p>
            <p>As miRNA abundance is linked to the extent of conservation <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B20">20</abbr></abbrgrp>, conserved miRNAs in the locust comprise more than 80% of the total miRNA reads we examined. The locust-specific miRNAs were expressed at a significantly lower level than those in conserved families (Wilcoxon rank-sum test, <it>p </it>&lt; 1.0 &#215; 10<sup>-6</sup>).</p>
         </sec>
         <sec>
            <st>
               <p>Target prediction of miRNAs</p>
            </st>
            <p>In animals, although miRNAs have been shown to repress the expression of their targets by binding to sequences in the 3' untranslated region (UTR) in most cases <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>, both computational and experimental evidence show the existence of miRNA-binding sites in protein coding regions <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. To identify potential targets of locust miRNAs, we searched unigene sequences from locust ESTs using miRanda 3.1 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> because there is no 3' UTR database available (see Materials and methods). We found 8,212 unigenes targeted by 157 miRNAs (50 conserved miRNAs plus 7 conserved miRNA* plus the most abundant 100 locust-specific miRNA candidates predicted). All miRNAs have more than one predicted target, and some of the miRNAs even have more than 200 (Figure <figr fid="F4">4a</figr>). Similarly, some unigenes have more than one miRNA target site (Figure <figr fid="F4">4a</figr>). On average, every miRNA targets 147.5 unigenes and, conversely, every unigene is targeted by 2.8 miRNAs. We think that the higher the score given by miRanda, the more reliable the predicted results. The highest score for predicted targets was for LM00689, which is a potential target of lmi-miR-1 (Figure <figr fid="F4">4b</figr>). LM00689 is similar to the <it>ciboulot </it>gene of fruit fly, which encodes an actin binding protein and plays a major role in axonal growth during <it>Drosophila </it>brain metamorphosis <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Target prediction of locust miRNAs</p>
               </caption>
               <text>
                  <p>Target prediction of locust miRNAs. <b>(a) </b>Left side: distribution of target number of locust miRNAs. Right side: distribution of target site number of the unigenes. <b>(b) </b>Presumable pairing between lmi-miR-1 and LM00689 with highest score predicted by miRanda. <b>(c) </b>Conservation of mir-133 target site in the <it>pale </it>gene of locust (lmi) and 12 <it>Drosophila </it>species, and presumable pairing between miR-133 and the <it>pale </it>gene. The red boxes indicate conserved target sites of miR-133 in 3' UTR sequences of <it>pale</it>.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-4"/>
            </fig>
            <p>We also found that some unigenes that had significant differences at the expression level between the gregarious and solitary phases were targeted by miRNAs. Although these genes may be regulated at the transcriptional level, it is possible that miRNAs play roles in regulating their expression. For example, microarray results in our lab show that the locust homolog of the <it>Drosophila </it>gene <it>pale </it>has significant differences in its expression levels between the two phases (Z Ma <it>et al</it>., unpublished). We found that the 3' UTR sequence of locust <it>pale </it>contains a target site of lmi-miR-133 (we got the 3' UTR sequences of <it>pale </it>in locust by 3' rapid amplification of cDNA ends (RACE); see Materials and methods; Figure <figr fid="F4">4c</figr>). We also found that in addition to the locust, 12 <it>Drosophila </it>species also have conserved target sites of miR-133 in the 3' UTR sequences of the <it>pale </it>gene <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr><abbr bid="B32">32</abbr></abbrgrp> (Figure <figr fid="F4">4c</figr>), indicating the strong possibility of miR-133 regulating the expression of <it>pale </it>at the post-transcriptional level. Therefore, miR-133 may contribute to the different expression of <it>pale </it>between the gregarious and solitary phases (see Discussion).</p>
         </sec>
         <sec>
            <st>
               <p>The phylogenetic evolution of miRNAs</p>
            </st>
            <p>We sorted the 50 conserved families identified in the locust into 4 groups based on their phylogenetic distribution (Figure <figr fid="F5">5a</figr>). Four families (let-7, mir-1, mir-34, and mir-124) are present in insects, vertebrates, and nematodes; 17 families are present in insects and vertebrates, but not nematodes; 6 families are restricted to invertebrates (insects and nematodes); and the remaining 23 families are insect-specific.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Phylogenetic evolution of locust conserved miRNA families</p>
               </caption>
               <text>
                  <p>Phylogenetic evolution of locust conserved miRNA families. <b>(a)</b> Phylogenetic distribution of 50 conserved miRNA families of the locust. A plus (+) symbol indicates this miRNA family is found in the species named on the left, and a minus (-) symbol means it is absent in that species. A red plus symbol means this miRNA family can not be found in any database, but was found by our search in the corresponding species genome. <b>(b) </b>An example of clade-specific conserved miRNAs based on sequence substitutions. The red nucleotides indicate the positions that are the same among vertebrates but different from insects, which are shown in green. Vertebrates and insects can be easily separated according to sequence differences in their miR-190, showing the different sequence features of conserved miRNAs in different clades. The asterisks indicate the conserved sites among these species. <b>(c) </b>Two conserved miRNA families whose sequences are unique in the locust (lmi). The red nucleotide shows the locust-specific position that is different from any other species. The asterisks indicate the conserved sites among these species.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-5"/>
            </fig>
            <p>Categorization of conserved miRNAs indicates that the innovation of miRNAs in the locust is concentrated along three branches of the phylogenetic tree leading to bilaterians, coelomates, and insects. Different conserved miRNAs in the locust have different ages. Some of them are from ancient families (for example, mir-1) and some appear to be much younger (for example, insect-specific miRNA families). Such age differences indicate that there is an ongoing process of miRNA evolution and it is possible that the insect lineage gave birth to the insect-specific miRNAs. Previous work in <it>Drosophila </it>has also indicated that the birth and death of miRNA families is a common phenomenon in insect evolution <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
            <p>Although the 50 miRNA families in the locust are highly conserved throughout widely divergent animal taxa, there are lineage-specific sequence substitutions in most of these families that are present in both vertebrates and insects. Based on their characteristic sequences in different lineages, we divided these families into five categories (Table <tblr tid="T2">2</tblr>); in doing this we disregarded the deletion of nucleotides at the end of the miRNAs due to the inability to always accurately predict the termini of mature miRNAs. If a miRNA family had more than one of its members in certain species, we chose the member most similar to those in other species for use in categorizing because it may be an ancient member of the family. Families in category I have identical sequences in all observed species. Category II includes those families with small differences between invertebrates and vertebrates. Category III is made up of miRNA families that have identical sequences in all but one of the observed species. Category IV contains miRNAs with multiple variances in different lineages. Category V contains only one miRNA family (mir-1), which is identical in worms and vertebrates but not in insects.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Categories of conserved miRNA families common in vertebrates and insects according to their sequences</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Category</p>
                     </c>
                     <c ca="left">
                        <p>miRNA families</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>I</p>
                     </c>
                     <c ca="left">
                        <p>mir-7, mir-9, mir-124, mir-133, mir-219</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>II</p>
                     </c>
                     <c ca="left">
                        <p>mir-92, mir-190</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>III</p>
                     </c>
                     <c ca="left">
                        <p>let-7, mir-10, mir-33, mir-100, mir-184</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IV</p>
                     </c>
                     <c ca="left">
                        <p>mir-8, mir-29, mir-31, mir-34, mir-125, mir-193, mir-210, mir-375</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>V</p>
                     </c>
                     <c ca="left">
                        <p>mir-1,</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Despite the short sequences of mature miRNAs, the major clades are well separated due to substitutions in categories II to IV (Figure <figr fid="F5">5b</figr>), indicating that these miRNAs may have clade-specific functions. Scanning miRNA families in these categories, we identified two families, mir-8 and mir-375, by which the locust can be separated from other species (Figure <figr fid="F5">5c</figr>). Substitutions in mature miRNAs may lead to changes of targets, so it is likely that locust mir-8 and mir-375 have different modes of gene regulation in the locust.</p>
         </sec>
         <sec>
            <st>
               <p>Endogenous siRNAs</p>
            </st>
            <p>We found that 26,519 reads matched the sense strand of ESTs and 11,596 reads matched the antisense strand <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> in the gregarious and solitary phase libraries. We classified the small RNAs matching the antisense strand as candidate endo-siRNAs (see Materials and methods; Additional data file 1).</p>
            <p>The proportion of endo-siRNAs in the small RNA libraries of locust is much lower than that of miRNAs (Figure <figr fid="F1">1b</figr>). However, because of incomplete mRNA sequence information in the locust EST database, the actual number of endo-siRNAs is likely to be higher. To gain greater understanding of the features of locust endo-siRNAs, we carried out additional analysis of these RNAs. Endo-siRNA length showed a major peak at 22 nucleotides, the same as miRNAs (Figure <figr fid="F6">6a</figr>); however, these small RNAs did not have a tendency to begin with uracil, a common feature of miRNA (data not shown). This provided additional evidence that these 22-nucleotide small RNAs were endo-siRNAs rather than miRNAs. In addition to the major peak at 22 nucleotides, there was also a minor peak at 27-28 nucleotides in endo-siRNAs. For small RNAs coming from sense strands of ESTs, in addition to a main peak at 22 nucleotides, there were also peaks at 27 nucleotides and 28 nucleotides (Figure <figr fid="F6">6b</figr>). An example of ESTs, aligned with small RNA reads that match the sense and antisense strands, is shown (Figure <figr fid="F6">6c</figr>). Also shown in the endo-siRNA sequences is the high variation at the 5' end of endo-siRNA clusters and their differences from the miRNA 5' ends (Figures <figr fid="F3">3a</figr> and <figr fid="F6">6c</figr>), again indicating their having a different role in gene silencing from miRNAs.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Small RNAs that match to EST sequences perfectly</p>
               </caption>
               <text>
                  <p>Small RNAs that match to EST sequences perfectly. <b>(a) </b>The length distribution of the reads matching antisense strands of ESTs. <b>(b) </b>The length distribution of the reads matching sense strands of ESTs. <b>(c) </b>Portions of one locust EST aligned with small RNA reads that matched the sense (green) and antisense (red) strands.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Small RNAs derived from transposons</p>
            </st>
            <p>About 20% (8,353 reads) of the small RNAs with a perfect match with ESTs were derived from transposons. Previous research has shown that transposons can generate two kinds of small RNAs: endo-siRNA and piRNA <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which are 22-23 and 23-29 nucleotides long, respectively. Therefore, the shorter sequences derived from transposons may be endo-siRNAs and the longer may be piRNAs (Additional data file 2).</p>
            <p>There are a variety of transposons that could generate small RNAs regardless of whether they are siRNAs or piRNAs (Figure S3 in Additional data file 3), which may indicate the presence of a broad range of small RNAs for silencing these selfish genetic elements. Analysis of the transposons we observed indicated that long interspersed elements (LINEs) were the dominant class producing small RNAs (approximately 60% of the transposon-derived small RNAs). CR1 and RTE-BovB are the dominant subtypes generating small RNAs (approximately 34% of the transposon-derived small RNAs). As more transposon sequence information in the locust becomes available, we expect there will be additional transposon-derived small RNAs identified, which will give greater understanding of the impact of these elements on genome evolution of the locust and related species.</p>
         </sec>
         <sec>
            <st>
               <p>Classification of the rest of the small RNAs</p>
            </st>
            <p>The rest of the sequences in the locust small RNA libraries remained unannotated. Most of the unannotated sequences in the gregarious library were 22 and 23 nucleotides long and commonly began with uracil (Figure S4a in Additional data file 3). We expected that these were miRNAs missed in our search process, and thus suspected that these 22- and 23-mer RNAs included additional locust-specific miRNAs. In order to identify the potential miRNA candidates in the remaining 22- and 23-nucleotide sequences, we analyzed their 5' ends to determine whether they were similar in features to the miRNA 5' terminus (Figure <figr fid="F3">3a</figr>; see Materials and methods). Our data showed that 10,161 reads (1,025 clusters, 1,275 unique sequences) had a standard miRNA-like 5' end and, therefore, probably were miRNAs (Table S4 in Additional data file 3).</p>
            <p>There were also longer small RNAs (26-29 nucleotides) that generally began with uracil, especially in the solitary library (Figure S4b in Additional data file 3). Their features indicated that these 26-29-mer small RNAs might be of the piRNA class of small RNAs. Therefore, we analyzed the sequences of these small RNAs to look for the presence of an adenine at position 10, a common feature of piRNAs <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> (Figure S4c in Additional data file 3). Interestingly, although the 26-27-mer small RNAs did commonly start with uracil, there was no obvious preference for an adenine at position 10 (Figure S4c in Additional data file 3). Thus, while their other features do indicate their being some form of functional small RNA, these 26- and 27-U RNAs are some other kind of small RNA rather than piRNAs. However, the 28- and 29-U RNAs have a preference for an adenine at position 10 (Figure S4c in Additional data file 3); thus, they may be piRNA-like small RNAs.</p>
         </sec>
         <sec>
            <st>
               <p>Different expression profiles of small RNAs in the two phases</p>
            </st>
            <p>Small RNAs in the gregarious library were enriched for lengths of 22-23 nucleotides, a typical length for animal miRNAs, and those in the solitary library were enriched for lengths of 26-29 nucleotides and 22-23 nucleotides (Figure <figr fid="F1">1a</figr>). For small RNAs shorter than 22 nucleotides, the gregarious locust has a higher expression level than the solitary locust, while for those longer than 22 nucleotides, the opposite is the case. In addition to the different length distributions of the small RNAs, the proportions of each type of small RNA in the libraries between the two phases were different (Figure <figr fid="F1">1b</figr>). The proportion of miRNAs in the gregarious phase is nearly two times as much as that in the solitary phase; however, endo-siRNAs and piRNA-like small RNAs make up a larger proportion in the solitary phase compared with those in the gregarious phase. There are more unannotated small RNAs in the solitary phase, indicating their potential functions, although we could not annotate them. In summary, the small RNA transcriptomes of the two phases show big differences in their length distribution and composition.</p>
            <p>We converted the reads of each kind of small RNA into reads per million (rpm) in order to make a comparison between the gregarious and solitary small RNA libraries. Almost each kind of small RNA, including miRNAs, endo-siRNAs, and piRNA-like small RNAs, had some differences in expression level between the two phases. Seventeen conserved and 84 predicted locust-specific miRNAs differed by &#8805; 1.5-fold between the two phases. Four examples of conserved miRNAs (<it>mir-276</it>, <it>mir-125</it>, <it>mir-1</it>, and <it>let-7</it>) in the gregarious phase were shown to be expressed 1.94-fold, 1.87-fold, 1.5-fold, and 1.5-fold as much as in the solitary phase, respectively (Figure <figr fid="F7">7a</figr>). Also, some locust-specific miRNAs had a 1.5-24-fold difference in abundance between the two phases (Figure <figr fid="F7">7b</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Different expression levels of small RNAs in the gregarious and solitary phases</p>
               </caption>
               <text>
                  <p>Different expression levels of small RNAs in the gregarious and solitary phases. <b>(a) </b>Four examples of conserved miRNAs that have distinct expression levels between the gregarious and solitary phases. <b>(b) </b>Four examples of locust-specific miRNAs that have distinct expression levels between the gregarious and solitary phases. <b>(c) </b>Different length distribution of the unannotated sequences in the two libraries. Rpm, reads per million.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-7"/>
            </fig>
            <p>Compared with those in the gregarious library, there are more abundant endo-siRNAs and piRNA-like small RNAs in the solitary library (Figure <figr fid="F6">6a</figr>; Tables S5 and S6 in Additional data file 3). For the endo-siRNAs shared in both libraries, only 3 in the gregarious phase were expressed at least 1.5-times as much as they were in the solitary phase. However, 26 endo-siRNAs in the solitary phase were expressed 1.67-54-fold as much as they were in the gregarious phase. Moreover, there are 86 solitary phase-specific siRNAs (&#8805; 5 reads), while there are only 6 gregarious phase-specific ones. We also observed that endo-siRNAs came from 2,307 unigenes of the ESTs, 319 of which generated siRNAs in both phases. However, 325 unigenes generated siRNAs only in the gregarious phase, and 1,663 only in the solitary phase. For piRNA-like small RNAs, the situation is similar to that of endo-siRNAs; 8 piRNA-like small RNAs differed more than 1.5-fold in abundance between the gregarious and solitary libraries, and only one of them was more abundant in the gregarious locust. There were no gregarious phase-specific piRNA-like small RNAs (&#8805; 5 reads), compared with 36 solitary phase-specific ones.</p>
            <p>There were huge differences between the two phases in the expression levels of these unannotated sequences. Similar to those of all small RNAs, the expression levels of most of the longer (26-29 nucleotide) small RNAs in the solitary phase were much higher than in the gregarious phase (Figure <figr fid="F7">7c</figr>; Figure S4 in Additional data file 3), indicating a potential role of these 26-29 nucleotide small RNAs in the phase changes of locust.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Evolution of conserved miRNAs</p>
            </st>
            <p>Nearly one-third of miRNAs from <it>D. melanogaster </it>in miRBase 11.0 <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> are conserved in the locust, indicating the bulk of miRNAs in the locust are composed of conserved and lineage-specific RNAs. Although some miRNAs are conserved in a wide range of species, our study shows that there are some species-specific nucleotide substitutions in the flanking regions of the 'seed' sequences in most of the conserved families. For example, the miR-190 sequences in the vertebrates we examined are the same (Figure <figr fid="F5">5b</figr>), but a different miR-190 sequence is found, and shared, in the insect species analyzed. In other words, the same miRNA family of closely related species can be clustered separately from that of other closely related species.</p>
            <p>Focusing on these conserved substitutions, separate from the 'seed' sequence <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, it is apparent that some highly conserved miRNA families can also be regarded as 'species-specific' (Figure <figr fid="F5">5</figr>). The 'seed' sequence is important for mRNA target recognition, but it alone is not sufficient for miRNA-target interaction. Given such conserved substitutions, it is possible that these are present in parts of the mature miRNAs that are also involved in target recognition. Such findings provide a clue that miRNA target recognition may be a complex process.</p>
            <p>We propose that these conserved regions of sequence substitutions could be classified into different functional groups, which we call as A, B...and so on. The 'seed' plus region A, for example, may lead the miRNA to interact with target A, whereas 'seed' plus region B would lead to an alternative target repression, and so on (Figure <figr fid="F8">8</figr>). According to this model, we propose that nucleotide substitutions in conserved miRNAs would lead to the generation of new functions for corresponding miRNAs without changing their original functions. From an evolutionary view, using different modes of the same miRNAs to regulate the expression of multiple genes would be advantageous as it is both more economical and safer than producing completely new miRNAs, which would most likely cause more harm than good at a high cost upon their emergence <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In much the same way, protein evolution often occurs through changes in specific domains that become conserved during evolution as new functions arise with these domain changes. This hypothesis also explains why such sequence substitutions in the miRNA would be maintained in even distantly related species.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>The same miRNA family plays roles in different modes in different species</p>
               </caption>
               <text>
                  <p>The same miRNA family plays roles in different modes in different species. Locust, fruit fly and human have the same targets if only the 'seed' sequence is used for target recognition. If the 'seed' and region A work together, mir-8 can regulate different targets in insects and human. If the 'seed' and region B work together, mir-8 in these three species can regulate species-specific targets. hsa-miR-200b belongs to the family of mir-8.</p>
               </text>
               <graphic file="gb-2009-10-1-r6-8"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Conservation of miRNA*</p>
            </st>
            <p>Previous studies have often ignored the function of miRNA* because these sequences are usually regarded as important primarily for maintaining the miRNA precursor secondary structure <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B24">24</abbr></abbrgrp>. However, our research showed high sequence similarity for miRNA* between the locust and fruit fly (Figure <figr fid="F2">2</figr>) even though these two species diverged 350 million years ago <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. This indicates that miRNA* may also play a functional role in some biological processes. A recent study <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> also indicated this may be the case, as they found miRNA* conservation among 12 sequenced Drosophilids. These 12 Drosophilids, however, were evolutionarily closely related. The findings in our study between the long separated locust and fruit fly provide even stronger support for a biologically functional role of miRNA* beyond maintenance of precursor secondary structure. Overall, with regards to miRNA* conservation, our findings indicate that organisms regulate mRNA expression in an economic way, using both miRNAs and miRNA*s.</p>
         </sec>
         <sec>
            <st>
               <p>Reliability of the method to identify locust-specific miRNAs</p>
            </st>
            <p>We identified non-conserved miRNA families in the locust due to the application of a new method we developed based on the biogenesis features of miRNAs <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B20">20</abbr><abbr bid="B38">38</abbr></abbrgrp>, which is independent of available genome sequence data. To estimate the validity of the predicted miRNA candidates, we used a PCR-based method to determine their locations in the locust genome on the basis of the secondary structure features of their precursors. We were able to validate 13 of our chosen 24 predicted miRNAs by our PCR-based method and determined that their sequences in conjunction with their flanking sequences in the genome could be folded into perfect miRNA-like hairpin structures. These results provide strong support for the reliability of our method to predict species-specific miRNAs. Although 11 of the chosen predicted miRNAs did not provide positive results, we felt that this was likely due to the low quality of primers we used, as we could only refer to short sequences during primer design. Given these limitations, we estimated that our false positive prediction rate would actually be lower than 40%. Overall, the principles we adopted in this study were stringent, especially in our allowance for mismatches (only four) between the predicted miRNA duplex-like pairs. In reality, some canonical miRNAs have more than four mismatches with their star sequences <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. So if less stringent mismatch criteria were used, it is likely that more miRNAs could be found (data not shown). This, however, would also be accompanied with a higher false positive rate due to allowed base-paring of random, rather than true, miRNA sequences. Thus, in this study, to minimize the false positive rate, we chose to use the more stringent mismatch limits when we searched for miRNA duplex-like pairs in the library.</p>
            <p>In addition to the PCR-based method of validation, we also assessed the reliability of our miRNA prediction method by using <it>Drosophila </it>miRNA data (see Methods in Additional data file 3). The findings here provide additional support for the feasibility and reliability of our method. We believe that more than 100 of our predicted locust-specific miRNAs would be canonical.</p>
            <p>Although the principle of our method, based on features of miRNA biogenesis, coincides with that of miRDeep <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, our method is more effective for finding miRNA duplex-like pairs when there is little available genome sequence information. Therefore, our method could be used to identify miRNAs in a wider variety of organisms, particularly those without whole genome sequence data. Additionally, we have also provided a simple experimental method to validate the reliability of the results predicted by our computational method. Combining our computational approaches with experimental methods, novel and non-conserved miRNAs can be identified from any species regardless of the absence of their genome sequence data. Being able to identify more novel miRNAs in a greater number of species even in the absence of genome sequence data will be invaluable in improving our understanding of the evolution and function of miRNAs.</p>
         </sec>
         <sec>
            <st>
               <p>Target prediction of miRNAs</p>
            </st>
            <p>It is difficult to predict miRNA targets in animals because the detailed mechanism of interaction between miRNA and its target transcripts is not clear, although several bioinformatic tools have been developed, such as miRanda. None of the available computational methods can predict miRNA targets accurately and they all give results with higher false positive rates <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B32">32</abbr><abbr bid="B35">35</abbr></abbrgrp>. Moreover, most miRNAs target 3' UTR sequences of mRNAs in animals, so it is more difficult to predict targets of locust miRNAs without a complete 3' UTR database. Alternatively, we chose to use a locust EST database to predict the targets of its miRNAs. Although it is possible to find some canonical targets of miRNAs, using the EST database will lead to higher false positive rates compared with 3' UTR database when predicting targets using bioinformatic tools. Combining the two factors above, although we predicted several targets of locust miRNAs using miRanda, it is very likely that there are some false positives.</p>
            <p>We found that some mRNAs expressed differently between the two phases of the locust were potential targets of miRNAs. Because gene expression can be regulated at both the transcriptional and post-transcriptional levels, we believe it is possible that post-transcriptional gene expression regulation contributes to differential expression of genes between the two phases of the locust, such as the locust <it>pale </it>gene, a potential target of lmi-miR-133 (Figure <figr fid="F4">4c</figr>). However, validation of the relationship between miRNAs and mRNA transcripts expressed differentially between the two phases needs more experimental evidence.</p>
         </sec>
         <sec>
            <st>
               <p>The scope of small RNAs in the locust</p>
            </st>
            <p>High-throughput sequencing of small RNAs showed that there were a large number of small RNAs in the locust transcriptome (Figure <figr fid="F1">1</figr>). Our results indicate that there are fewer miRNAs in insects than in mammals; this is likely because there was an expansion in the number of miRNAs at the advent of vertebrates and mammals <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. We did find evidence for the existence of several different kinds of small RNAs in the locust, although the proportion of endo-siRNAs and piRNA-like small RNAs identified in the locust was small. We expect that more endo-siRNAs and piRNAs will be identified with an increase in available locust genome and transcriptome data.</p>
            <p>A global survey of small RNAs in the locust would contribute additional information to understanding the function and evolution of small RNAs in insects. The analysis of the characteristics and expression of small RNAs in the locust enhances the knowledge of gene expression regulation on non-model and hemimetabolous insects at the post-transcriptional level and enables comparison of long-term evolutionary history between the homo- and hemimetabolous insects.</p>
            <p>The locust has already been used in a variety of ways as a good model for understanding the mechanisms of the immune system and neural pathways <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, in both of which small RNA gene regulation systems might be involved <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B11">11</abbr></abbrgrp>. Moreover, many of the small RNAs of the locust, a typical hemimetabolous insect, likely have important functions in complex developmental processes.</p>
         </sec>
         <sec>
            <st>
               <p>Small RNAs involved in phase changes</p>
            </st>
            <p>Significant differences in small RNA expression levels between the gregarious and the solitary phases indicate their potential functions in phase transition of the locust. The two phases of the locust share the same genome but exhibit different gene expression profiles and phenotypes, suggesting different regulation of gene expression <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Comparison between the coding genes of the two phases at the expression level has been done and many interesting genes have been found to be involved in phase changes of the locust <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B41">41</abbr></abbrgrp>. Also, our study shows that the expression patterns of non-coding RNAs differ between the gregarious and the solitary phases, suggesting that phenotypic differences between the two phases are epigenetic changes but not derived from genomic differences.</p>
            <p>We found 17 conserved miRNAs to have different expression levels between the gregarious and solitary phases. These miRNAs may be involved in gene expression regulation at the post-transcriptional level during phase transition. Especially, we are most interested in 5 of the 17 miRNAs that are expressed differentially in the two phases. <it>mir-276 </it>has the biggest expression difference between the two phases (Figure <figr fid="F7">7a</figr>), although there has been no reports about its function. Such a difference might imply its functional role in the phase transition of the locust. <it>let-7 </it>and <it>mir-125 </it>regulate metamorphic processes in <it>C. elegans </it>and <it>Drosophila </it><abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Phase changes in locusts can only happen before they have become adults; solitary locusts can only swarm during the larval stages and no once they have reached the adult stage. Since the two miRNAs (<it>let-7 </it>and <it>mir-125</it>) and the phenomena of phase changes are both linked with metamorphic processes, we think that the two miRNAs and the phenotype of phase changes may be related. <it>mir-1 </it>and <it>mir-315 </it>also have different expression levels (data on <it>mir-315 </it>expression levels is not shown). <it>mir-1 </it>is a muscle-specific miRNA <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and <it>mir-315 </it>is a potent activator of Wingless signaling in <it>Drosophila </it><abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Because it is related to the thorax muscle and the wing, we think that the difference in flying ability between gregarious and solitary locusts may be regulated by <it>mir-1 </it>and <it>mir-315</it>.</p>
            <p>Based on our analysis of the small RNA expression levels in gregarious and solitary locusts, we believe that some small RNAs that regulate the expression of protein coding genes in the two phases must be involved in the process of phase changes. It is possible that we could provide insight into the phase changes and find new approaches to control the locust plagues throughout the world by small RNAs.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>High-throughput sequencing provides a good chance for us to study small RNAs in the locust, which is an important worldwide pest. This study led to the discovery of a large number of small RNAs in the locust, including miRNAs, endo-siRNAs and piRNA-like small RNAs. Importantly, we have identified 185 potential locust-specific miRNA candidates using the method we developed, although there is no locust genome sequence available. Our method makes it possible to discover more miRNA families in a broader range of species whose genome sequences have not been sequenced. We further show the evolutionary path of miRNAs in the locust, indicating the potential evolutionary mechanism of miRNAs. The function of small RNAs in phase changes of the locust is disclosed in our study. We found significant differences in the expression of small RNAs between the two phases of the locust and target prediction shows that some genes expressed differentially in the two phases are targets of miRNAs, which gives us clues to further discover the mechanisms of phase change in locusts.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Preparation of total RNA</p>
            </st>
            <p>Total RNA was extracted using TRIzol reagent (Invitrogen, Carlsbad, CA, USA) from mixed-stage (including embryos, every instar larvae and adults) <it>Locusta migratoria </it>that we fed in our lab. We collected 0-1, 2-3, 4-5, 6-7, 8-9, 10-11, 12-13 and 14-15 day-old embryos cultured at 30&#176;C in clean sand with relative humidity. For the larvae, we collected the whole body except the midgut and pooled them to ensure every instar was present in the sample. We chose to collect adults at eclosion, sexual maturation, post-spawning, and elderly stages separately and then pooled them together. Total RNA was extracted according to the manufacturer's protocol. We examined the quality of RNA using an Agilent 2100 Bioanalyzer.</p>
         </sec>
         <sec>
            <st>
               <p>Small RNA library construction and high-throughput sequencing</p>
            </st>
            <p>RNA fragments 14-30 bases long were isolated from total RNA by Novex 15% TBE-Urea gel (Invitrogen). Then, a 5' adaptor (Illumina, San Diego, CA, USA) was ligated to purified small RNAs followed by purification of ligation products on Novex 15% TBE-Urea gel. The 5' ligation products were then ligated to a 3' adaptor (Illumina) and products with 5' and 3' adaptors were purified from Novex 10% TBE-Urea gel (Invitrogen). Subsequently, these ligation products were reverse transcribed followed by PCR amplification. The amplification products were excised from 6% TBE-Urea gel (Invitrogen). The purified DNA fragments were used for clustering and sequencing by Illumina Genome Analyzer at the Beijing Genomics Institute, Shenzhen.</p>
         </sec>
         <sec>
            <st>
               <p>Discovery of conserved locust miRNA families</p>
            </st>
            <p>We discarded bad reads that were the result of incorrect sequencing or were the reads of adaptor contamination that were not ligated to any other sequences. We clustered the remaining reads based on sequence similarity and the dominant reads were analyzed as follows: the reads were analyzed by BLAST against EST database <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and FlyBase <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> to discard rRNA, tRNA and snRNA. Subsequently, the remaining sequences were analyzed by BLAST search against miRBase v11.0 <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Sequences in our libraries with identical or related (four or fewer nucleotide substitutions) sequences from <it>D. melanogaster </it>or other insects (mosquito, silkworm, and honeybee) were identified as conserved miRNAs.</p>
         </sec>
         <sec>
            <st>
               <p>Discovery of non-conserved locust miRNA families</p>
            </st>
            <p>We first looked at the high-throughput sequencing data of small RNAs in other species, including <it>C. elegans</it>, <it>D. melanogaster</it>, and <it>Arabidopsis </it><abbrgrp><abbr bid="B16">16</abbr><abbr bid="B20">20</abbr><abbr bid="B38">38</abbr></abbrgrp>, and found that star sequences of most miRNAs were also present in the small RNA libraries, and that the miRNA-miRNA* duplexes exhibited 1 or 2 nucleotide 3' overhangs, a characteristic of RNase III enzyme cleavage (Figure <figr fid="F3">3a</figr>). The 5' end sequences of miRNA clusters showed obvious consistency compared with other small RNAs and degradation fragments (Figure <figr fid="F3">3a</figr>). Thus, if a sequence in the locust small RNA libraries is a canonical miRNA, its star sequence should be identified based on imperfect base-pairing and a 1-2 nucleotide 3' overhang when paired with its complementary mature miRNA.</p>
            <p>Based on the biogenesis features of miRNA (Figure <figr fid="F3">3a</figr>), we developed a perl script to search for possible candidate miRNA-miRNA* duplexes, which satisfied the following criteria: they were selected primarily by base-pairing, allowing for G:U pairing, which is common in the miRNA precursors; they could contain up to four mismatches; they could have a maximum size of 4 nucleotides for a bulge in the candidate miRNA sequence; they had to have a 1-2 nucleotide 3' overhang <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>; the dominant strand had to have five or more reads in the library because miRNAs with a low expression level were likely to have no star form in the library; the length of the dominant strand had to be between 18 and 24 nuceotides long; the 5' ends in more than 80% of the reads of those sequences in the cluster of the dominant sequence of the pairs had to be consistent with each other. After these criteria were met, we then used mfold to evaluate the ability of the identified pairs to form a hairpin structure <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, where their free energy of folding (&#916;G) was an important standard for use in determining the stability of RNA secondary structure.</p>
            <p>In order to satisfy the requirement of input sequences analyzed by mfold, we joined the two sequences in each candidate pair using a standard hairpin-forming linker sequence (GCGGGGACGC). Those pairs that met the following conditions were analyzed further: the pairs had a free energy less than or equal to -21 kcal/mol (in cases where there was more than one partner for a sequence, the pair with the lowest free energy was selected as the true one); the pairs had no bulge bigger than 6 nucleotides and multiple loops. The way of determining the best parameters and of testing this method is described in the Methods in Additional data file 3.</p>
         </sec>
         <sec>
            <st>
               <p>Amplification of the miRNA precursors from locust genomic DNA</p>
            </st>
            <p>We extracted genomic DNA from the fifth instar locust using a Gentra Puregene Tissue Kit (Qiagen, Valencia, CA, USA) according to the manufacture's protocol. We designed primers for 8 conserved miRNAs and 24 candidate miRNA-miRNA* pairs we predicted based on a dependence of the sequences of the mature miRNA and miRNA* species using Primer Premier 5.0. (Premier Biosoft International, Palo Alto, CA, USA) Because mature miRNA may come from either arm of the precursor, we designed two pairs of primers for each duplex. Corresponding fragments were amplified by PCR and the length of amplification products was examined on 2.5% agarose gels. Fragments between 55 and 70 nucleotides in length were subcloned into pMD18-T vector (Takara, Dalian, Liaoning, China) for sequencing analyses.</p>
         </sec>
         <sec>
            <st>
               <p>Discovery of endo-siRNAs and piRNA-like small RNAs using ESTs</p>
            </st>
            <p>The 23-29 nucleotide long RNAs matching ESTs annotated as transposons were considered as piRNA-like small RNAs. Those small RNAs that perfectly matched EST antisense strands were considered as candidate endo-siRNAs if they were not from annotated transposons. Moreover, we also searched the ESTs for miRNA precursors. Although there were some sequences that perfectly matched EST sense strands, no typical hairpin structure of these ESTs could be identified using mfold. Rather than folding the entire EST sequence, regions of 70 nucleotides, 100 nucleotides and 150 nucleotides on either side of the small RNA sequences were folded.</p>
         </sec>
         <sec>
            <st>
               <p>Prediction of miRNA targets</p>
            </st>
            <p>Unigene sequences from the EST database of the locust <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp> were chosen to predict the miRNA targets without distinguishing the 3' UTR from the protein coding region. miRanda v3.1 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> was selected as the prediction tool. A miRanda score greater than 150 was used to select unigene targets.</p>
         </sec>
         <sec>
            <st>
               <p>3' RACE of the locust pale gene</p>
            </st>
            <p>The 3' UTR sequence of the locust <it>pale </it>gene was obtained by 3' rapid amplification of cDNA ends (RACE) using a SMART RACE cDNA Amplification Kit (Clontech, Takara, Dalian, Liaoning, China) with the primer GCGACCTGGACAACTGCAACCACCTCAT according to the manufacturer's protocol.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>endo-siRNA: endogenous siRNA; EST: expressed sequence tag; lmi-miR number: locust miR-number; miRNA: microRNA; piRNA: piwi-associated RNA; RACE: rapid amplification of cDNA ends; siRNA: small interfering RNA; UTR: untranslated region.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>YW and LK designed the study. YW performed all the bioinformatic analysis except target prediction of miRNAs. SC provided help to analyze piRNA-like small RNAs. PY predicted targets of miRNAs and YW analyzed the prediction results. SC and YW isolated total RNA and carried out PCR experiments. ZM carried out 3' RACE experiments. YW and LK prepared the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> lists sequences of locust endo-siRNAs. Additional data file <supplr sid="S2">2</supplr> lists sequences of locust piRNA-like small RNAs. Additional data file <supplr sid="S3">3</supplr> contains four supplemental figures (Figures S1-S4), six supplemental tables (Tables S1-S6), and supplemental methods. Figure S1 shows alignment of miR-79 and miR-10 of different species. Figure S2 shows expression patterns of locust miRNAs. Figure S3 shows transposon types from which small RNAs are derived in the locust. Figure S4 shows lengths and initial nucleotide distributions of the unannotated small RNA sequences. Table S1 lists the sequences of conserved miRNAs and miRNA*s in the locust. Table S2 lists precursor sequences of the seven conserved miRNAs that have a conserved star sequence. Table S3 lists sequences of predicted locust-specific miRNAs. Table S4 lists the ten most abundant miRNA-like 5'-end small RNAs in the remaining reads after annotation of miRNAs, siRNAs and piRNA-like small RNAs. Table S5 lists endo-siRNAs with different expression levels between the two phases. Table S6 lists piRNA-like small RNAs with different expression levels between the two phases. The Methods show the way to determine the best parameters of our miRNA prediction method and assess the reliability of our method using <it>Drosophila </it>miRNA data.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Locust endo-siRNAs</p>
            </caption>
            <text>
               <p>Locust endo-siRNAs.</p>
            </text>
            <file name="gb-2009-10-1-r6-S1.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Locust piRNA-like small RNAs</p>
            </caption>
            <text>
               <p>Locust piRNA-like small RNAs.</p>
            </text>
            <file name="gb-2009-10-1-r6-S2.txt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Figures S1-S4 and Tables S1-S6 and supplemental methods</p>
            </caption>
            <text>
               <p>Figure S1 shows alignment of miR-79 and miR-10 of different species. Figure S2 shows expression patterns of locust miRNAs. Figure S3 shows transposon types from which small RNAs are derived in the locust. Figure S4 shows lengths and initial nucleotide distributions of the unannotated small RNA sequences. Table S1 lists the sequences of conserved miRNAs and miRNA*s in the locust. Table S2 lists precursor sequences of the seven conserved miRNAs that have a conserved star sequence. Table S3 lists sequences of predicted locust-specific miRNAs. Table S4 lists the ten most abundant miRNA-like 5'-end small RNAs in the remaining reads after annotation of miRNAs, siRNAs and piRNA-like small RNAs. Table S5 lists endo-siRNAs with different expression levels between the two phases. Table S6 lists piRNA-like small RNAs with different expression levels between the two phases. The Methods show the way to determine the best parameters of our miRNA prediction method and assess the reliability of our method using <it>Drosophila </it>miRNA data.</p>
            </text>
            <file name="gb-2009-10-1-r6-S3.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>BGI (Shenzhen, China) carried out the Illumina sequencing of small RNA libraries of the locust. We thank Drs L Goodman and M Li for polishing the language and valuable comments on the manuscript. The research is supported by the National Basic Research Program of China (No. 2006CB102000), and the National Natural Science Foundation of China (No. 30830022).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>MicroRNAs: genomics, biogenesis, mechanism, and function.</p>
            </title>
            <aug>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>116</volume>
            <fpage>281</fpage>
            <lpage>297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(04)00045-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">14744438</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The <it>Drosophila </it>microRNA Mir-14 suppresses cell death and is required for normal fat metabolism.</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vernooy</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hay</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>790</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(03)00250-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12725740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The functions of animal microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Ambros</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>350</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02871</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372042</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>How microRNAs control cell division, differentiation and death.</p>
            </title>
            <aug>
               <au>
                  <snm>Miska</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>563</fpage>
            <lpage>568</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gde.2005.08.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16099643</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Antiviral immunity directed by small RNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Ding</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Voinnet</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>130</volume>
            <fpage>413</fpage>
            <lpage>426</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2007.07.039</pubid>
                  <pubid idtype="pmpid" link="fulltext">17693253</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Principles of microRNA-target recognition.</p>
            </title>
            <aug>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e85</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1043860</pubid>
                  <pubid idtype="pmpid" link="fulltext">15723116</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Identification and characterization of endogenous small interfering RNAs from rice.</p>
            </title>
            <aug>
               <au>
                  <snm>Sunkar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Girke</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>4443</fpage>
            <lpage>4454</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182700</pubid>
                  <pubid idtype="pmpid" link="fulltext">16077027</pubid>
                  <pubid idtype="doi">10.1093/nar/gki758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Endogenous siRNAs derived from transposons and mRNAs in <it>Drosophila </it>somatic cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Ghildiyal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seitz</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Horwich</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kittler</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Zapp</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zamore</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2008</pubdate>
            <volume>320</volume>
            <fpage>1077</fpage>
            <lpage>1081</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1157396</pubid>
                  <pubid idtype="pmpid" link="fulltext">18403677</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>piRNAs - the ancient hunters of genome invaders.</p>
            </title>
            <aug>
               <au>
                  <snm>Hartig</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Tomari</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>F&#246;rstemann</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2007</pubdate>
            <volume>21</volume>
            <fpage>1707</fpage>
            <lpage>1713</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1567007</pubid>
                  <pubid idtype="pmpid" link="fulltext">17639076</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The Piwi-piRNA pathway provides an adaptive defense in the transposon arms race.</p>
            </title>
            <aug>
               <au>
                  <snm>Aravin</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <fpage>761</fpage>
            <lpage>764</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1146484</pubid>
                  <pubid idtype="pmpid" link="fulltext">17975059</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Insect microRNAs: Structure, function and evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Behura</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Insect Biochem Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>37</volume>
            <fpage>3</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ibmb.2006.10.006</pubid>
                  <pubid idtype="pmpid" link="fulltext">17175441</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Locusts.</p>
            </title>
            <aug>
               <au>
                  <snm>Simpson</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Sword</snm>
                  <fnm>GA</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>R364</fpage>
            <lpage>R366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2008.02.029</pubid>
                  <pubid idtype="pmpid" link="fulltext">18460311</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The analysis of large-scale gene expression correlated to the phase changes of the migratory locust.</p>
            </title>
            <aug>
               <au>
                  <snm>Kang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zheng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>17611</fpage>
            <lpage>17615</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">535406</pubid>
                  <pubid idtype="pmpid" link="fulltext">15591108</pubid>
                  <pubid idtype="doi">10.1073/pnas.0407753101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>LocustDB: a relational database for the transcriptome and biology of the migratory locust (<it>Locusta migratoria</it>).</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>11</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1388198</pubid>
                  <pubid idtype="pmpid" link="fulltext">16426458</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-11</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>EST analyses predict the existence of a population of chimeric microRNA precursor-mRNA transcripts expressed in normal human and mouse tissues.</p>
            </title>
            <aug>
               <au>
                  <snm>Smalheiser</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>403</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193628</pubid>
                  <pubid idtype="pmpid" link="fulltext">12844357</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-7-403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Large-scale sequencing reveals 21U-RNAs and additional microRNAs and endogenous siRNAs in <it>C. elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ruby</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Jan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Player</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Axtell</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ge</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>127</volume>
            <fpage>1193</fpage>
            <lpage>1207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.10.040</pubid>
                  <pubid idtype="pmpid" link="fulltext">17174894</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>FlyBase: integration and improvements to query tools.</p>
            </title>
            <aug>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Strelets</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>FlyBase</snm>
                  <fnm>Consortium</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <issue>36 Database</issue>
            <fpage>D588</fpage>
            <lpage>D593</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2238994</pubid>
                  <pubid idtype="pmpid" link="fulltext">18160408</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>miRBase: tools for microRNA genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saini</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <issue>36 Database</issue>
            <fpage>D154</fpage>
            <lpage>D158</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2238936</pubid>
                  <pubid idtype="pmpid" link="fulltext">17991681</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Prediction of mammalian microRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2003</pubdate>
            <volume>115</volume>
            <fpage>787</fpage>
            <lpage>798</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(03)01018-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">14697198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Evolution, biogenesis, expression, and target predictions of a substantially expanded set of <it>Drosophila </it>microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Ruby</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>WK</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1850</fpage>
            <lpage>1864</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099593</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989254</pubid>
                  <pubid idtype="doi">10.1101/gr.6597907</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Distinct roles for Argonaute proteins in small RNA-directed RNA cleavage pathways.</p>
            </title>
            <aug>
               <au>
                  <snm>Okamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishizuka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Siomi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Siomi</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>1655</fpage>
            <lpage>1666</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">478188</pubid>
                  <pubid idtype="pmpid" link="fulltext">15231716</pubid>
                  <pubid idtype="doi">10.1101/gad.1210204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure.</p>
            </title>
            <aug>
               <au>
                  <snm>Mathews</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Sabina</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>288</volume>
            <fpage>911</fpage>
            <lpage>940</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.2700</pubid>
                  <pubid idtype="pmpid" link="fulltext">10329189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Mfold web server for nucleic acid folding and hybridization prediction.</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3406</fpage>
            <lpage>3415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169194</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824337</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The regulatory activity of microRNA* species has substantial influence on microRNA and 3' UTR evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Okamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Tyler</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Duan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chou</snm>
                  <fnm>YT</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2008</pubdate>
            <volume>15</volume>
            <fpage>354</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">18376413</pubid>
                  <pubid idtype="doi">10.1038/nsmb.1409</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The insects.</p>
            </title>
            <aug>
               <au>
                  <snm>Grimaldi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Evolution of the Insects</source>
            <publisher>New York: Cambridge University Press</publisher>
            <pubdate>2005</pubdate>
            <fpage>119</fpage>
            <lpage>147</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Systematic discovery and characterization of fly microRNAs using 12 <it>Drosophila </it>genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kheradpour</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Parts</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hodges</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>1865</fpage>
            <lpage>1879</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2099594</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989255</pubid>
                  <pubid idtype="doi">10.1101/gr.6593807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Identification of novel <it>Drosophila melanogaster </it>microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Sandmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e1265</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2082411</pubid>
                  <pubid idtype="pmpid" link="fulltext">18043761</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0001265</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>RT-PCR-based analysis of microRNA (miR-1 and -124) expression in mouse CNS.</p>
            </title>
            <aug>
               <au>
                  <snm>Mishima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mizuguchi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kawahigashi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takizawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takizawa</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Brain Res</source>
            <pubdate>2007</pubdate>
            <volume>1131</volume>
            <fpage>37</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.brainres.2006.11.035</pubid>
                  <pubid idtype="pmpid" link="fulltext">17182009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>MicroRNAs: small RNAs with a big role in gene regulation.</p>
            </title>
            <aug>
               <au>
                  <snm>He</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>522</fpage>
            <lpage>531</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1379</pubid>
                  <pubid idtype="pmpid" link="fulltext">15211354</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>MicroRNAs direct rapid deadenylation of mRNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Belasco</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>4034</fpage>
            <lpage>4039</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1449641</pubid>
                  <pubid idtype="pmpid" link="fulltext">16495412</pubid>
                  <pubid idtype="doi">10.1073/pnas.0510928103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Isolation of microRNA targets by miRNP immunopurification.</p>
            </title>
            <aug>
               <au>
                  <snm>Easow</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Teleman</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2007</pubdate>
            <volume>13</volume>
            <fpage>1198</fpage>
            <lpage>1204</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1924889</pubid>
                  <pubid idtype="pmpid" link="fulltext">17592038</pubid>
                  <pubid idtype="doi">10.1261/rna.563707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>120</volume>
            <fpage>15</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2004.12.035</pubid>
                  <pubid idtype="pmpid" link="fulltext">15652477</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Physiological identification of human transcripts translationally regulated by a specific microRNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Nakamoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>O'Donnell</snm>
                  <fnm>WT</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>ST</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>3813</fpage>
            <lpage>3821</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/ddi397</pubid>
                  <pubid idtype="pmpid" link="fulltext">16239240</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Discovery of functional elements in 12 <it>Drosophila </it>genomes using evolutionary signatures.</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Kheradpour</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Parts</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Deoras</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Ruby</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <cnm>Harvard FlyBase curators, Berkeley Drosophila Genome Project</cnm>
               </au>
               <au>
                  <snm>Hodges</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Caspi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Paten</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Maeder</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Polansky</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Robson</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Helden</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hassan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Eastman</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weir</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <fpage>219</fpage>
            <lpage>232</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2474711</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994088</pubid>
                  <pubid idtype="doi">10.1038/nature06340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Human microRNA targets.</p>
            </title>
            <aug>
               <au>
                  <snm>John</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Aravin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>e363</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15502875</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030264</pubid>
                  <pubid idtype="pmcid">521178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Ciboulot regulates actin assembly during <it>Drosophila </it>brain metamorphosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Boquet</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Boujemaa</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Carlier</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Pr&#233;at</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2000</pubdate>
            <volume>102</volume>
            <fpage>797</fpage>
            <lpage>808</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)00068-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11030623</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The birth and death of microRNA genes in <it>Drosophila</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Lu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carthew</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CI</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <fpage>351</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng.73</pubid>
                  <pubid idtype="pmpid" link="fulltext">18278047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>High-throughput sequencing of <it>Arabidopsis </it>microRNAs: evidence for frequent birth and death of MIRNA genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Fahlgren</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Howell</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Kasschau</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Cumbie</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Givan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Law</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Dangl</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Carrington</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e219</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1790633</pubid>
                  <pubid idtype="pmpid" link="fulltext">17299599</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000219</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Discovering microRNAs from deep sequencing data using miRDeep.</p>
            </title>
            <aug>
               <au>
                  <snm>Friedl&#228;nder</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Adamidi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Maaskola</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Einspanier</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Knespel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rajewsky</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2008</pubdate>
            <volume>26</volume>
            <fpage>407</fpage>
            <lpage>415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1394</pubid>
                  <pubid idtype="pmpid" link="fulltext">18392026</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The expansion of the metazoan microRNA repertoire.</p>
            </title>
            <aug>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lindemeyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Missal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fried</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tanzer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Flamm</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <cnm>Students of Bioinformatics Computer Labs 2004 and 2005</cnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1388199</pubid>
                  <pubid idtype="pmpid" link="fulltext">16480513</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-25</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>cDNA cloning of heat shock proteins and their expression in the two phases of the migratory locust.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>XH</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <fpage>207</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2583.2006.00715.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17298555</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The 21-nucleotide let-7 RNA regulates developmental timing in <it>Caenorhabditis elegans</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Reinhart</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Slack</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Basson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pasquinelli</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Bettinger</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Rougvie</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Horvitz</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Ruvkun</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>901</fpage>
            <lpage>906</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35002607</pubid>
                  <pubid idtype="pmpid" link="fulltext">10706289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Temporal regulation of metamorphic processes in <it>Drosophila </it>by the let-7 and miR-125 heterochronic microRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Caygill</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2008</pubdate>
            <volume>18</volume>
            <fpage>943</fpage>
            <lpage>950</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2008.06.020</pubid>
                  <pubid idtype="pmpid" link="fulltext">18571409</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Functional screening identifies miR-315 as a potent activator of Wingless signaling.</p>
            </title>
            <aug>
               <au>
                  <snm>Silver</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Hagen</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Okamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Perrimon</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>EC</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>18151</fpage>
            <lpage>18156</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2084312</pubid>
                  <pubid idtype="pmpid" link="fulltext">17989227</pubid>
                  <pubid idtype="doi">10.1073/pnas.0706673104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Human let-7 stem-loop precursors harbor features of RNase III cleavage products.</p>
            </title>
            <aug>
               <au>
                  <snm>Basyuk</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Suavet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Doglio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bordonn&#233;</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bertrand</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>6593</fpage>
            <lpage>6597</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">275551</pubid>
                  <pubid idtype="pmpid" link="fulltext">14602919</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg855</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The nuclear RNase III Drosha initiates microRNA processing.</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ahn</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Provost</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>R&#229;dmark</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>425</volume>
            <fpage>415</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01957</pubid>
                  <pubid idtype="pmpid" link="fulltext">14508493</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

