<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2007-8-1-r1</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Detection of weakly conserved ancestral mammalian regulatory sequences by primate comparisons</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Wang</snm>
               <fnm>Qian-fei</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>qwang2@medicine.bsd.uchicago.edu</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Prabhakar</snm>
               <fnm>Shyam</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>SPrabhakar@lbl.gov</email>
            </au>
            <au id="A3">
               <snm>Chanan</snm>
               <fnm>Sumita</fnm>
               <insr iid="I1"/>
               <email>SChanan@lbl.gov</email>
            </au>
            <au id="A4">
               <snm>Cheng</snm>
               <fnm>Jan-Fang</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>JFCheng@lbl.gov</email>
            </au>
            <au id="A5">
               <snm>Rubin</snm>
               <mi>M</mi>
               <fnm>Edward</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>EMRubin@lbl.gov</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Boffelli</snm>
               <fnm>Dario</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>DBoffelli@lbl.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA</p>
            </ins>
            <ins id="I2">
               <p>US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA</p>
            </ins>
            <ins id="I3">
               <p>Current address: Section of Hematology/Oncology, Department of Medicine, University of Chicago, 5814 S. Ellis Avenue, Chicago, IL 60637, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>R1</fpage>
         <url>http://genomebiology.com/2007/8/1/R1</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17201929</pubid>
               <pubid idtype="doi">10.1186/gb-2007-8-1-r1</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>2</day>
               <month>9</month>
               <year>2006</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>15</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>3</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>3</day>
               <month>01</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Wang et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Ancestral mammalian regulatory sequences</p>
      </shorttitle>
      <shortabs>
         <p>Intra-primate genome comparisons identify functional modules that are undetected by comparisons with non-mammalian genomes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Genomic comparisons between human and distant, non-primate mammals are commonly used to identify <it>cis</it>-regulatory elements based on constrained sequence evolution. However, these methods fail to detect functional elements that are too weakly conserved among mammals to distinguish them from non-functional DNA.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>To evaluate a strategy for large scale genome annotation that is complementary to the commonly used distal species comparisons, we explored the potential of deep intra-primate sequence comparisons. We sequenced the orthologs of 558 kb of human genomic sequence, covering multiple loci involved in cholesterol homeostasis, in 6 non-human primates. Our analysis identified six non-coding DNA elements displaying significant conservation among primates but undetectable in more distant comparisons. <it>In vitro </it>and <it>in vivo </it>tests revealed that at least three of these six elements have regulatory function. Notably, the mouse orthologs of these three functional human sequences had regulatory activity despite their lack of significant sequence conservation, indicating that they are ancestral mammalian <it>cis</it>-regulatory elements. These regulatory elements could be detected even in a smaller set of three primate species including human, rhesus and marmoset.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We have demonstrated that intra-primate sequence comparisons can be used to identify functional modules in large genomic regions, including <it>cis</it>-regulatory elements that are not detectable through comparison with non-mammalian genomes. With the available human and rhesus genomes and that of marmoset, which is being actively sequenced, this strategy can be extended to the whole genome in the near future.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010016">Molecular biology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Identifying <it>cis</it>-regulatory elements in the human genome, such as promoters and enhancers that regulate gene expression in normal and diseased cells and tissues, is a major challenge of the post-genomic era. Inter-species sequence comparisons have emerged as a major technique for identifying human regulatory elements, particularly those to the sequenced mouse, chicken and fish genomes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. However, a significant fraction of empirically defined human regulatory modules are too weakly conserved in other mammalian genomes, such as the mouse, to distinguish them from non-functional DNA <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and are completely undetectable in non-mammalian genomes <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Identification of such significantly divergent functional sequences will require complementary methods in order to complete the functional annotation of the human genome.</p>
         <p>Deep intra-primate sequence comparison, referred to as 'phylogenetic shadowing', is a novel alternative to the commonly used distant species comparisons <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. However, primate shadowing has so far only been applied to the identification of novel <it>cis-</it>regulatory elements in short, targeted genomic fragments (&#8804; 2.0 kb) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, due to the lack of sequence data from multiple primates. Thus, it remains to be determined if this approach is useful in identifying otherwise undetectable regulatory regions in unbiased scans of large genomic loci. Perhaps for this reason, primate shadowing has been almost entirely overlooked as a predictor of regulatory elements.</p>
         <p>Here we evaluate the possibility of using deep primate sequence comparisons in large genomic regions (approximately 100 kb) to systematically uncover <it>cis</it>-regulatory elements that are undetectable through mammalian or more distant comparisons. We focused on genes involved in cholesterol metabolism, since this is a physiological process marked by numerous differences between human and distant mammals. In particular, differential regulation of LXR&#945; and its target genes is thought to contribute to inter-species variation in the plasma cholesterol response to dietary cholesterol intake <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. We evaluated the sensitivity and true positive rate of primate shadowing using as a test set known functional sequences in eight loci, for which we sequenced a phylogenetically representative panel of primate species. Using a combination of close and distant species comparisons, we then identified six human sequences characterized by primate-specific conservation in these eight gene loci, and tested them for enhancer function <it>in vitro </it>and <it>in vivo</it>. Finally, we determined if a subset of primate sequences comprising genomes currently available or being sequenced would suffice to identify divergent mammalian regulatory sequences.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Primate comparison identifies known functional sequences in large genomic intervals</p>
            </st>
            <p>To test the power of primate shadowing to identify functional elements in large genomic intervals, we sequenced the primate orthologs of eight human loci containing <it>LXR&#945; </it>and eight of its target genes: <it>SREBF1</it>, <it>CYP7A1</it>, <it>LDLR</it>, <it>ABCG5</it>, <it>ABCG8</it>, <it>APOE </it>cluster, <it>APOCIII </it>cluster, and <it>HMGCR</it>. The sequenced species comprised six anthropoid primates (baboon, colobus, dusky titi, marmoset, owl monkey and squirrel monkey) and one prosimian (lemur). The targeted genomic segments included all exons, introns and flanking intergenic regions of the above mentioned genes, encompassing 558 Kb of human genomic DNA.</p>
            <p>We identified sequences evolutionarily conserved among seven anthropoid primates (the six targeted anthropoids plus human) or among all eight primates (anthropoid plus lemur) using Gumby, an algorithm that detects sequence blocks evolving significantly more slowly than the local neutral rate <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Most of the conserved regions overlapped exons (see, for example, Figure <figr fid="F1">1</figr>). The true-positive rate, defined as the fraction of conserved regions overlapping exons or known regulatory regions, was 80% in the 7-primate comparison (Figure <figr fid="F2">2a</figr>) and 81% using the 8-primate set (data not shown). The human-dog comparison, which approximately matches the combined branch length of the primate comparison, had a similar true-positive rate of 84% (Figure <figr fid="F2">2c</figr>). The more distant human-mouse comparison displayed a marginally higher true-positive rate of 90% (Figure <figr fid="F2">2b</figr>). This is consistent with the theoretical prediction that statistical power increases with the total branch length of the species set <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. It should be noted, though, that regulatory sequence annotation of the eight loci we analyzed is probably highly incomplete. Therefore, these true-positive rates are lower bounds; some or all of the 'false positives' could eventually be reassigned as true positives upon expansion of the set of sequences annotated as <it>cis</it>-regulatory. Thus, due to incomplete annotation of functional elements, it is not clear if the difference between the primate and human-mouse true positive rates reflects a significant difference in reliability between the two sets of predictions. On average, 64% of the exons in the 8 loci overlapped conserved regions in both the 7-primate and 8-primate comparisons. Similar sensitivity was obtained in human-mouse (65%) and human-dog (71%) comparisons. Thus, primate sequence comparison was approximately equivalent to pairwise human-mouse or human dog analysis in identifying exons.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Conservation profiles of a representative region, the <it>SREBF1 </it>locus, using close (primate) and distant (human-mouse) species comparisons</p>
               </caption>
               <text>
                  <p>Conservation profiles of a representative region, the <it>SREBF1 </it>locus, using close (primate) and distant (human-mouse) species comparisons. <b>(a) </b>Seven-primate (human, baboon, colobus, marmoset, dusky titi, owl monkey, and squirrel monkey), <b>(b) </b>human-mouse, and <b>(c) </b>three-primate (human, rhesus, and marmoset) conservation profiles in the <it>SREBF1 </it>locus with flanking genes partially shown. Sequence conservation was calculated using Gumby and visualized using RankVISTA with the human sequence as reference. Vertical bars above the horizontal axis depict evolutionarily conserved sequences, with height indicating the conservation score (-log(conservation <it>p </it>value); see Materials and methods). Coding exons (dark blue) and untranslated regions (UTRs; magenta) are marked below the horizontal axis. Vertical bars that overlap coding exons or UTRs are colored light blue, while non-overlapping bars are colored red. The arrowhead denotes SREBF1_PS, a non-coding element conserved in primates (<it>p </it>value &#8804; 0.005) but not in the mouse (<it>p </it>value > 0.1).</p>
               </text>
               <graphic file="gb-2007-8-1-r1-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Primate comparisons identify known functional elements and conserved non-coding sequences in genomic intervals encompassing <it>LXR&#945;</it>, <it>SREBF1</it>, <it>CYP7A1</it>, <it>LDLR</it>, <it>ABCG5</it>, <it>ABCG8</it>, <it>APOE </it>cluster, <it>APOCIII </it>cluster, and <it>HMGCR</it></p>
               </caption>
               <text>
                  <p>Primate comparisons identify known functional elements and conserved non-coding sequences in genomic intervals encompassing <it>LXR&#945;</it>, <it>SREBF1</it>, <it>CYP7A1</it>, <it>LDLR</it>, <it>ABCG5</it>, <it>ABCG8</it>, <it>APOE </it>cluster, <it>APOCIII </it>cluster, and <it>HMGCR</it>. The number of evolutionarily conserved sequences (<it>p </it>value &#8804; 0.1) overlapping exons, previously known regulatory elements and unannotated regions (new predictions) in the eight loci are shown for the following species sets: <b>(a) </b>seven anthropoid primates, <b>(b) </b>human-mouse, <b>(c) </b>human-dog and <b>(d) </b>human-rhesus-marmoset. Percentages were calculated by dividing the number of conserved sequences of each type by the total number of conserved elements (&#215; 100).</p>
               </text>
               <graphic file="gb-2007-8-1-r1-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic shadowing using seven anthropoid primates identifies non-coding sequences with primate-specific conservation</p>
            </st>
            <p>To identify <it>cis</it>-regulatory sequences not detectable in comparisons between human and distant mammals, we searched the 8 gene loci for non-coding sequences highly conserved among primates (<it>p </it>value &#8804; 0.005) but not detectable in human-mouse or human-dog comparisons (<it>p </it>value > 0.1). Gumby analysis of human and six other anthropoid primates identified six anthropoid-primate-conserved non-coding regions (Additional data file 4). These sequences were either undetectable (<it>p </it>value > 0.1) or less significantly conserved (<it>p </it>value > 0.005) when the prosimian lemur was included in the primate set (data not shown).</p>
            <p>To independently confirm the (anthropoid) primate-specific nature of sequence conservation in these six regions, we compared their nucleotide substitution rate to that of non-exonic sequences in the same locus. Evolutionarily conserved sequences are defined by a constraint factor (ratio of the substitution rate of test sequences to the non-exonic average) smaller than 1. We found that the six primate-conserved sequences had constraint factors well below 1 among anthropoid primates, as expected, and much closer to 1 in the human-mouse and human-dog comparisons (Figure <figr fid="F3">3</figr>). Finally, none of the 6 sequences overlapped significantly conserved segments identified by the phastCons program <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> in a 17-species alignment of the human genome to (mostly) non-primate mammals and more distant vertebrates <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, which further confirms the primate-specificity of their evolutionary conservation.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Evolutionary conservation of six primate-conserved sequences in anthropoid primates, but not between human and mouse or dog</p>
               </caption>
               <text>
                  <p>Evolutionary conservation of six primate-conserved sequences in anthropoid primates, but not between human and mouse or dog. <b>(a) </b>Sequence alignment of a representative primate-conserved sequence LDLR_PS4. Similar alignments for LDLR_PS2 and SREBF1_PS are provided as Additional data files <supplr sid="S1">1</supplr> and <supplr sid="S2">2</supplr>, respectively. <b>(b) </b>The constraint factor of a sequence element is defined as the nucleotide substitution rate (total branch length of the phylogenetic tree) within the element relative to the background non-coding rate in the aligned sequences. Constraint factors in the anthropoid primate comparisons (green bars) are consistently well below one (dashed line). Human-mouse (dotted) and human-dog (white) constraint factor ranges of the six sequences are broader, mostly exceeding one at the upper limit. Error bars indicate 95% confidence intervals.</p>
               </text>
               <graphic file="gb-2007-8-1-r1-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Non-coding sequences with primate-specific conservation include three regulatory elements</p>
            </st>
            <p>To explore the potential regulatory function of these primate-conserved elements, we examined their ability to drive reporter gene expression in both a transient transfection assay in human HepG2 cells and an <it>in vivo </it>mouse liver gene transfer assay. Since it is possible that the computational prediction only captures part of the entire regulatory module, each human element plus 200 to 400 base-pairs (bp) of flanking sequence on either side was cloned upstream of the human promoter of the gene closest to each element and fused to a luciferase reporter gene (see Materials and methods). Therefore, the included flanking sequences may also contribute to the observed regulatory activity. Two elements showed enhancer activity, increasing the expression of a luciferase reporter gene 1.6- to 5-fold in both the human liver cell line HepG2 and <it>in vivo </it>in mouse liver, while a third element appeared to be a silencer, suppressing luciferase expression by 50% (Figure <figr fid="F4">4</figr>, Table <tblr tid="T1">1</tblr> and data not shown). While a third element, LDLR_PS2, showed modest enhancer activity in HepG2 cells (approximately 1.6-fold increase over promoter alone), its activity in 293T cells was much stronger (approximately 5-fold increase over promoter alone), presumably due to availability in these cells of appropriate transcription factors, such as SREBFs, that are capable of activating <it>LDLR </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In an independent assay of transcription potential, both enhancer elements were shown to be DNase I hypersensitive sites in HepG2 cells (Figure <figr fid="F4">4</figr>, Table <tblr tid="T1">1</tblr> and data not shown), suggesting that the corresponding DNA elements are involved in transcriptional regulation of the endogenous genes. To confirm that the primate orthologs of these identified human regulatory elements are also functional, we cloned the aligned <it>LDLR </it>PS4 sequences from baboon, dusky titi, marmoset and lemur into the luciferase reporter vector and tested their ability to drive reporter gene expression in HepG2 cells. All orthologous non-human primate sequences showed enhancer activity (Additional data file 3).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Functional assays indicate enhancer activity for a representative primate-conserved element, human <it>LDLR </it>PS4</p>
               </caption>
               <text>
                  <p>Functional assays indicate enhancer activity for a representative primate-conserved element, human <it>LDLR </it>PS4. Luciferase assay analysis of <b>(a) </b>transient transfections into human HepG2 cells and <b>(b) </b>plasmid DNA transfer into mouse liver. The luciferase reporter constructs tested are either the <it>LDLR </it>promoter alone (promoter), or the promoter in combination with the human <it>LDLR </it>PS4 (+ PS4). Fold increase over the empty vector is shown. Error bars in (a) indicate standard deviation. Each triangle in (b) represents luciferase activity in an individual mouse. Red bars denote the median activity of each construct. Luciferase activity is reported in arbitrary units. <b>(c) </b>DNase I hypersensitive site mapping around the <it>LDLR </it>PS4 region in human liver cell line HepG2. Vertical arrows indicates the lane with internal size marker that was generated by enzyme digestion of the <it>LDLR </it>PS4 sequence. The hypersensitive site is indicated by a horizontal arrow. Co-migration of the internal size marker with the hypersensitive site localizes the hypersensitive site to the <it>LDLR </it>PS4 sequence.</p>
               </text>
               <graphic file="gb-2007-8-1-r1-4"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Functional characterization of non-coding elements significantly conserved only in primates</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p><it>In vitro </it>(HepG2)</p>
                     </c>
                     <c ca="left">
                        <p><it>In vivo </it>(mice)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Primate specific element</p>
                     </c>
                     <c ca="left">
                        <p>DNase I HS</p>
                     </c>
                     <c ca="left">
                        <p>Reporter transfection</p>
                     </c>
                     <c ca="left">
                        <p>Gene transfer</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LDLR_PS1</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LDLR_PS2</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Enhancer* (~5.1-fold)</p>
                     </c>
                     <c ca="left">
                        <p>Enhancer (~5.5-fold)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LDLR_PS3</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>LDLR_PS4</p>
                     </c>
                     <c ca="left">
                        <p>Yes</p>
                     </c>
                     <c ca="left">
                        <p>Enhancer (~3.7-fold)</p>
                     </c>
                     <c ca="left">
                        <p>Enhancer (~4.2-fold)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SREBF1_PS</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>Silencer (~2.4-fold)</p>
                     </c>
                     <c ca="left">
                        <p>Silencer (~1.8-fold)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CYP7A1_PS</p>
                     </c>
                     <c ca="left">
                        <p>No</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                     <c ca="left">
                        <p>No activity</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*In 293T cells. Human elements with primate-specific conservation were tested for their ability to drive reporter gene expression <it>in vitro </it>in HepG2 cells and <it>in vivo </it>in mouse liver. The genomic regions containing primate-conserved elements were also examined for the presence of DNase I hypersensitive sites (DNase I HS) in HepG2 cells. Enhancer or silencer strength is shown as fold increase or decrease relative to the promoter alone in luciferase assays.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Regulatory sequences with primate-specific conservation have functional orthologous mammalian counterparts</p>
            </st>
            <p>Since these functional human regulatory elements exhibited primate-specific sequence conservation, we explored whether their functional role is unique to primates. Although human-mouse comparison failed to identify these sequences as constrained (Gumby <it>p </it>value > 0.1; Figures <figr fid="F1">1</figr> and <figr fid="F3">3b</figr>), we were able to identify the aligned counterparts to the three primate-conserved functional sequences in mouse using the global alignment program MLAGAN <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. To explore the regulatory function, if any, of these mouse orthologs, we cloned the aligned sequences into the luciferase reporter vector described above and compared their activity to that of the human sequence. Despite the lack of statistically significant conservation between the rodent and human sequence, all three mouse orthologs exhibited regulatory activity in the same direction to that observed for the human elements (data not shown). Thus, the silencer and the two enhancers identified through primate-specific sequence conservation are ancestral mammalian regulatory elements, rather than newly evolved functional regions specific to primates.</p>
         </sec>
         <sec>
            <st>
               <p>A smaller set of three anthropoid primates is sufficient to detect the newly identified regulatory elements</p>
            </st>
            <p>The primate set we used to identify known functional elements and the three divergent mammalian regulatory sequences comprises human, baboon, colobus, marmoset, squirrel monkey and owl monkey. However, it is unlikely that all the corresponding genome sequences will be available in the near future. Of the set of most informative primate genomes for comparative analysis <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the human and rhesus genome sequences are already publicly available, and the marmoset genome is currently being sequenced. We tested whether comparisons among these three species were sufficient to detect functional sequences in the eight lipid gene loci. Human-rhesus-marmoset comparison identified 55% of the 160 exons (versus 7 primates: 64%) with a true-positive rate of 72% (79% for 7 primates) (Figure <figr fid="F2">2a,d</figr>), suggesting that a significant fraction of exons can be detected using a limited number of primates <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We subsequently assessed the ability of human-rhesus-marmoset comparison to detect the three newly identified regulatory sequences. As was observed in the comparison of 7 anthropoid primates, both <it>LDLR </it>enhancers were highly conserved (<it>p </it>value &lt;0.005) in the 3-way primate analysis and ranked among the three most conserved non-coding sequences in the 75 kb genomic region (data not shown). The smaller set of 3 primates was also sufficient to detect the silencer in the <it>SREBF1 </it>locus (non-coding rank 1), though not as strongly (<it>p </it>value = 0.044; Figure <figr fid="F1">1c</figr>). The lower statistical significance of the <it>SREBF1 </it>silencer in the three-primate analysis relative to the seven-primate analysis is due to the lower combined branch length of the former. These results suggest that availability of the marmoset genome sequence will facilitate genome-wide analysis of primate-specific conservation, and uncover regulatory sequences that are undetectable in distant, non-primate comparisons.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Our analysis of over 500 kb of sequence from each of 7 primate species revealed 6 non-coding elements significantly conserved exclusively in primates, of which 3 were found to have gene regulatory activity in a variety of <it>in vitro </it>and <it>in vivo </it>assays. These three regulatory sequences are so weakly conserved in distant, non-primate mammals that none of the three independent methods we tested were able to detect them in mammalian comparisons. However, primate-specific conservation does not imply primate-specific function. Since the mouse orthologs are also functional, the identified sequences appear to be ancestral mammalian regulatory elements, as opposed to newly evolved functional sequences specific to primates. Nonetheless, it is likely that primate sequence comparison could also identify the subset of functional sequences that arose after primates split from distant mammals.</p>
         <p>Our results do not of course suggest that primate comparison is optimal for detecting all classes of regulatory sequence. If a human regulatory sequence is constrained in all mammals, for example, then multi-mammal species comparison is clearly preferable to primate comparison, since the mammalian species tree has greater combined branch length and, consequently, greater statistical power. However, it is well known that many human regulatory elements show no evidence of constraint in mammalian comparisons <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. We have demonstrated for the first time that primate comparisons can robustly identify at least some members of this class of 'mammal-diverged' human regulatory sequences, even in large (approximately 100 kb) genomic regions.</p>
         <p>It is worth noting that the three detected regulatory elements displayed only marginal sequence conservation in the prosimian lemur. This result suggests that primate comparisons should be limited to anthropoids (old world monkeys and new world monkeys) to sensitively detect divergent mammalian <it>cis</it>-regulatory elements. It is not clear at this point how many additional functional non-coding elements could be detected in the human genome on the basis of primate shadowing, relative to the number of elements already identifiable using the available mammalian genome sequences. However, it is encouraging that, at a conservation <it>p </it>value threshold of 0.005, primate shadowing expanded the set of predicted non-coding functional elements by 55% (6 elements with primate-specific sequence conservation versus 11 predicted by human-mouse and human-dog) in the 8 loci examined in this study. Further large-scale studies are required to precisely quantify the value added by multiple-primate analysis.</p>
         <p>As a consequence of high sequence identity between humans and great apes, our closest relatives, chimpanzee and gorilla, add very little to the power of primate sequence comparisons. The phylogenetically most informative set of primate species includes old world monkeys (for example, rhesus macaque) and new world monkeys (for example, marmoset), in addition to human <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The three functional sequences revealed by seven-primate comparison were also detectable in the three-way human-rhesus-marmoset analysis, albeit less robustly, due to the shorter combined branch length of the three-way comparisons. Since the human and rhesus genome sequences are already publicly available, and the marmoset genome is currently being sequenced, our results support the feasibility of genome-wide discovery of primate-conserved regulatory elements.</p>
         <p>Sequence divergence of the identified regulatory elements between human and distant mammals may reflect functional changes in these sequences. <it>Cis</it>-regulatory elements with primate-specific sequence conservation are, therefore, potential substrates for determining the molecular basis of primate-specific aspects of gene expression. Previously, we described gain of sterol responsiveness in the anthropoid primate LDLR_PS2 enhancer <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> (Table <tblr tid="T1">1</tblr>). It is possible that primate-specific sequence conservation of the other two newly identified regulatory elements also reflects qualitative or quantitative expression differences between primates and non-primate mammals, which might be revealed by further in-depth functional characterization and sequence analysis. On the other hand, it is also possible that the lack of significant sequence conservation of some regulatory elements in distant mammals merely reflects the accumulation of compensatory mutations over tens of millions of years, which would retain functional similarity in the absence of significant sequence similarity <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Finally, it is possible that short sequence motifs such as transcription factor binding sites within the newly discovered regulatory elements are constrained in all mammals, while the entire elements are significantly conserved only in primates. In one example, we were able to find a conserved functional mammalian AP-4 site in the LDLR_PS2 enhancer <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> (data not shown).</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Our results demonstrate that deep intra-primate sequence comparison can be used to identify functional modules such as exons, enhancers and silencers in large genomic regions. Most importantly, analysis of primate-specific conservation allowed detection of three divergent ancestral <it>cis</it>-regulatory elements, which were not detectable by more distant mammalian comparisons. With the availability of multiple primate genomes, it should be possible to improve the functional annotation of the human genome by uncovering numerous such <it>cis</it>-regulatory sequences, some of which potentially contribute to gene expression differences between primates and distant mammals.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Sources of sequence and annotation data</p>
            </st>
            <p>Primate bacterial artificial chromosome (BAC) clones containing targeted loci were purchased from Children's Hospital Oakland Research Institute in Oakland, California <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Draft sequences of baboon, colobus, dusky titi, marmoset, owl monkey, squirrel monkey, and lemur BACs were determined by sequencing ends of 3 kb subclones to 8- to 10-fold coverage using BigDye terminators (Applied Biosystems, Foster City, CA, USA) and assembling reads into contigs with the Phred-Phrap-Consed suite as described previously <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. All BAC sequences were submitted to GenBank (see Additional data file 6 for accession numbers). Human, mouse, dog and rhesus sequences were downloaded from the UCSC Genome Bioinformatics worldwide web site <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Based on the human March 2006 assembly hg18, the coordinates for the analyzed human loci are: <it>SREBF1</it>, chr17:17653939-17690020; <it>CYP7A1</it>, chr8: 59525742-59605413; <it>LDLR</it>, chr19:11054146-11127904; <it>ABCG5/ABCG8</it>, chr2:43835998-43966668; <it>APOE </it>cluster, chr19:50080832-50149764; <it>APOCIII </it>cluster, chr11:116137283-116217351; <it>HMGCR</it>, chr5:74646522-74714122; <it>LXR&#945;</it>, chr11:47231580-47253367. Exon annotations of these regions were obtained from the UCSC Genome Bioinformatics website <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The promoter sequence of a gene is defined as the 1 kb region upstream of the transcription start site. Four enhancers in the <it>APOE </it>locus were previously described <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. These four <it>APOE </it>enhancers, together with 15 promoters in the 8 genomic loci, comprise the set of known regulatory regions.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of sequence conservation</p>
            </st>
            <p>All sequence alignments were carried out using MLAGAN <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Aligned sequences were scanned for statistically significant (<it>p </it>value &#8804; 0.1) evolutionarily conserved regions using Gumby <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. We defined primate-specific conserved elements as those human sequences that were highly conserved (Gumby <it>p </it>value &#8804; 0.005) among anthropoid primates, but not conserved in more distant mammalian comparisons. Mammalian sequence conservation was defined as: <it>p </it>value &#8804; 0.1 in human-mouse or human-dog comparison; or 70% human-mouse sequence identity over at least 100 bp <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>; or significant conservation in an alignment of 17 vertebrate genomes <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
            <p>Evolutionarily conserved regions identified by Gumby were visualized using RankVISTA <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Conservation scores in the RankVISTA plots were calculated as the negative logarithm of the Gumby <it>p </it>value.</p>
            <p>The constraint factor of a conserved non-coding sequence (CNS) was defined as the nucleotide substitution rate (summed over all branches of the phylogenetic tree) within the element divided by the local background substitution rate at intronic and intergenic positions in the locus (neutral rate). We estimated substitution rates along each lineage by maximum likelihood using fastDNAml <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Plasmid constructs</p>
            </st>
            <p>The human promoter was cloned in the proper orientation upstream of the luciferase cDNA in the pGL3Basic construct (Promega, Madison, WI, USA). The primate-specific elements from human or mouse were PCR cloned into polylinker sites upstream of the promoter of the closest gene (see Additional data file 5 for primer sequences). Each human element includes the primate-conserved sequence plus approximately 200 to 400 bp of flanking sequence. Thus, approximately 1,000 bp of total sequence are tested in each reporter construct.</p>
         </sec>
         <sec>
            <st>
               <p>Transient-transfection reporter assay</p>
            </st>
            <p>Cells were grown at 37&#176;C and 5% CO<sub>2 </sub>in the minimum essential medium (ATCC, Manassas, VA, USA) (HepG2), or Dulbecco's modified Eagle's medium (ATCC) (293T cells), supplemented with 10% fetal bovine serum (Hyclone, Logan, UT, USA), L-glutamine and penicillin-streptomycin. The cells were grown in 12-well plates (6 &#215; 10<sup>4 </sup>cells/well for HepG2, 4 &#215; 10<sup>4 </sup>cells/well for 293T) and transfected using Fugene (Roche Molecular Biochemicals, Indianapolis, IN, USA) following the manufacturer's protocol. Briefly, 500 ng (for HepG2) or 100 ng (for 293T) of each assayed plasmid and 50 ng (for HepG2) or 10 ng (for 293T) pCMV&#946; (BD Biosciences, San Jose, CA, USA) were mixed with 1.5 &#956;l Fugene and added to each well. Following 42 to 48 hours of incubation, cells were harvested and lysed. Activity of luciferase and &#946;-galactosidase was measured using the Luciferase Assay System (Promega) and galacto-Light Plus (Applied Biosystems), respectively. Luciferase activity for each sample was normalized to the &#946;-galactosidase assay control. Transfections were carried out in duplicates. All experiments are representative of at least three independent transfections.</p>
         </sec>
         <sec>
            <st>
               <p>Tail vein plasmid DNA transfer assays</p>
            </st>
            <p>Tail vein injection was performed as described by Herweijer and Wolff <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> following the TransIT<sup>&#174; </sup><it>In Vivo </it>Gene Delivery System Protocol (Mirus Corporation, Madison, WI, USA). Six to nine FVB male mice (Charles River Laboratory, Wilmington, MA, USA) at age 7 to 8 weeks were used for each reporter gene construct; 10 &#956;g of each reporter construct, along with 2 &#956;g of pCMV&#946; (BD Biosciences) to correct for delivery efficiency, were injected into each mouse. The entire content of the syringe was delivered in 3 to 5 seconds. Animals were sacrificed 24 hours later, livers extracted, measured to correct for size, homogenized, and centrifuged for 15 minutes at 4&#176;C, 14,000 rpm. Activity of luciferase and &#946;-galactosidase was measured as described above. All <it>p </it>values are from the two-sample Wilcoxon rank-sum (Mann-Whitney) test using STATA (STATA Corporation, College Station, TX, USA). All experimental results are representative of two independent plasmid DNA transfer assays.</p>
         </sec>
         <sec>
            <st>
               <p>DNase I-hypersensitive site mapping</p>
            </st>
            <p>DNase I-hypersensitive site mapping was performed as described previously <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Briefly, cell pellets were resuspended in DNase I digestion buffer containing 0.5% IGEPAL (tert-Octylphenoxy poly(oxyethylene)ethanol) at 2 &#215; 10<sup>7 </sup>cells/ml buffer. Aliquots (100 &#956;l) of the resuspended cells were mixed with equal volumes of DNase I buffer containing varying concentrations of DNase I.The DNase I digestion reaction was incubated at 23&#176;C for 5 minutes before being stopped with the addition of 8 &#956;l of 0.5 M EDTA and 2 &#956;l of 100 mg/ml RNase A.Following 5 minutes of RNase A treatment, genomic DNA was isolated using the QiaQuickPCR Kit (Qiagen, Valencia, CA, USA). Then, 10 &#956;g of each of the DNA samples was digested with an appropriate restriction enzyme and resolved in a 1.2% agarose gel by electrophoresis. The DNA from the gel was then transferred onto a nylon membrane. Southern blotting was carried out with a radiolabeled DNA probe generated by PCR amplification (see Additional data file 5 for primer sequences), and corresponds to regions of approximately 1,000 bp from Gumby predicted elements. Following hybridization and washing, the blot was exposed to Biomax film (Kodak Co., Rochester, NY, USA) with intensifying screens at -80&#176;C for 48 hours.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper.</p>
         <p>Additional data file <supplr sid="S1">1</supplr> is a figure showing sequence alignments of primate-conserved sequence LDLR_PS2. Additional data file <supplr sid="S2">2</supplr> is a figure showing sequence alignments of primate-conserved sequence SREBF1_PS. Additional data file <supplr sid="S3">3</supplr> is a figure documenting luciferase functional assays, which indicate enhancer activity for orthologous primate <it>LDLR </it>PS4 from baboon, dusky titi, marmoset and lemur. Additional data file <supplr sid="S4">4</supplr> is a table listing the coordinates of evolutionarily conserved elements. Additional data file <supplr sid="S5">5</supplr> is a table providing sequences of PCR primers used for cloning regulatory elements into reporter gene constructs or generating Southern blotting probes for detecting DNase I hypersensitive sites. Additional data file <supplr sid="S6">6</supplr> is a table listing GenBank accession numbers for all primate BACs sequenced. Additional data file <supplr sid="S7">7</supplr> contains the figure legends for Additional data files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Sequence alignments of primate-conserved sequence LDLR_PS2</p>
            </caption>
            <text>
               <p>Sequence alignments of primate-conserved sequence LDLR_PS2.</p>
            </text>
            <file name="gb-2007-8-1-r1-S1.eps">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Sequence alignments of primate-conserved sequence SREBF1_PS</p>
            </caption>
            <text>
               <p>Sequence alignments of primate-conserved sequence SREBF1_PS.</p>
            </text>
            <file name="gb-2007-8-1-r1-S2.eps">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Luciferase functional assays, indicating enhancer activity for orthologous primate <it>LDLR </it>PS4 from baboon, dusky titi, marmoset and lemur</p>
            </caption>
            <text>
               <p>Luciferase functional assays, indicating enhancer activity for orthologous primate <it>LDLR </it>PS4 from baboon, dusky titi, marmoset and lemur.</p>
            </text>
            <file name="gb-2007-8-1-r1-S3.eps">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional data file 4</p>
            </title>
            <caption>
               <p>Coordinates of evolutionarily conserved elements</p>
            </caption>
            <text>
               <p>Coordinates of evolutionarily conserved elements.</p>
            </text>
            <file name="gb-2007-8-1-r1-S4.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional data file 5</p>
            </title>
            <caption>
               <p>Sequences of PCR primers used for cloning regulatory elements into reporter gene constructs or generating Southern blotting probes for detecting DNase I hypersensitive sites</p>
            </caption>
            <text>
               <p>Sequences of PCR primers used for cloning regulatory elements into reporter gene constructs or generating Southern blotting probes for detecting. DNase I hypersensitive sites</p>
            </text>
            <file name="gb-2007-8-1-r1-S5.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional data file 6</p>
            </title>
            <caption>
               <p>GenBank accession numbers for all primate BACs sequenced</p>
            </caption>
            <text>
               <p>GenBank accession numbers for all primate BACs sequenced.</p>
            </text>
            <file name="gb-2007-8-1-r1-S6.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S7">
            <title>
               <p>Additional data file 7</p>
            </title>
            <caption>
               <p>Figure legends for Additional data files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr></p>
            </caption>
            <text>
               <p>Figure legends for Additional data files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>.</p>
            </text>
            <file name="gb-2007-8-1-r1-S7.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank A Moses for discussions and insights on parts of this work, J Noonan, L Pennacchio, A Visel, N Ahituv, and other Rubin laboratory members for suggestions and criticisms on the manuscript, and B Kullgren and S Phouanenavong provided technical assistance for the tail vein plasmid DNA transfer assay. Research was conducted at the EO Lawrence Berkeley National Laboratory and at the Joint Genome Institute. This work was supported by the Director, Office of Science, of the US Department of Energy under contract no. DE-AC02-05CH11231 and NIH-NHLBI grant numbers THL007279F and U1HL66681B.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Comparative genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Makova</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Nekrutenko</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Annu Rev Genomics Hum Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>15</fpage>
            <lpage>56</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15485342</pubid>
                  <pubid idtype="doi">10.1146/annurev.genom.5.061903.180057</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Initial sequencing and comparative analysis of the mouse genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Agarwala</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ainscough</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alexandersson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <fpage>520</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12466850</pubid>
                  <pubid idtype="doi">10.1038/nature01262</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Close sequence comparisons are sufficient to identify human cis-regulatory elements.</p>
            </title>
            <aug>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Shoukry</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Afzal</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>855</fpage>
            <lpage>863</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16769978</pubid>
                  <pubid idtype="doi">10.1101/gr.4717506</pubid>
                  <pubid idtype="pmcid">1484452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Conservation of RET regulatory function from human to zebrafish without sequence similarity.</p>
            </title>
            <aug>
               <au>
                  <snm>Fisher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grice</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Vinton</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Bessling</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>McCallion</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>312</volume>
            <fpage>276</fpage>
            <lpage>279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16556802</pubid>
                  <pubid idtype="doi">10.1126/science.1124070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Comparative genomics at the vertebrate extremes.</p>
            </title>
            <aug>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>456</fpage>
            <lpage>465</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15153998</pubid>
                  <pubid idtype="doi">10.1038/nrg1350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phylogenetic shadowing of primate sequences to find functional regions of the human genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>McAuliffe</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>299</volume>
            <fpage>1391</fpage>
            <lpage>1394</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12610304</pubid>
                  <pubid idtype="doi">10.1126/science.1081331</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Identifying candidate causal variants responsible for altered activity of the ABCB1 multidrug resistance gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Soranzo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Cavalleri</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Weale</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Depondt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marguerie</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sisodiya</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1333</fpage>
            <lpage>1344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15197162</pubid>
                  <pubid idtype="doi">10.1101/gr.1965304</pubid>
                  <pubid idtype="pmcid">442149</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Bile acid regulation of gene expression: roles of nuclear hormone receptors.</p>
            </title>
            <aug>
               <au>
                  <snm>Chiang</snm>
                  <fnm>JY</fnm>
               </au>
            </aug>
            <source>Endocr Rev</source>
            <pubdate>2002</pubdate>
            <volume>23</volume>
            <fpage>443</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12202460</pubid>
                  <pubid idtype="doi">10.1210/er.2000-0035</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Mapping cis-regulatory domains in the human genome using multi-species conservation of synteny.</p>
            </title>
            <aug>
               <au>
                  <snm>Ahituv</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Poulin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Couronne</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>3057</fpage>
            <lpage>3063</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16155111</pubid>
                  <pubid idtype="doi">10.1093/hmg/ddi338</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Ventress</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Anguita</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>De Gobbi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Higgs</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>9830</fpage>
            <lpage>9835</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15998734</pubid>
                  <pubid idtype="doi">10.1073/pnas.0503401102</pubid>
                  <pubid idtype="pmcid">1174996</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A model of the statistical power of comparative genome sequence analysis.</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e10</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15660152</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030010</pubid>
                  <pubid idtype="pmcid">539325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1034</fpage>
            <lpage>1050</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16024819</pubid>
                  <pubid idtype="doi">10.1101/gr.3715005</pubid>
                  <pubid idtype="pmcid">1182216</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>UCSC Genome Browser</p>
            </title>
            <url>http://genome.ucsc.edu</url>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Unsaturated fatty acids down-regulate SREBP isoforms 1a and 1c by two mechanisms in HEK-293 Cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Hannah</snm>
                  <fnm>VC</fnm>
               </au>
               <au>
                  <snm>Ou</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Luong</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goldstein</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2001</pubdate>
            <volume>276</volume>
            <fpage>4365</fpage>
            <lpage>4372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11085986</pubid>
                  <pubid idtype="doi">10.1074/jbc.M007273200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Brudno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Do</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Cooper</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Davydov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Sidow</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>721</fpage>
            <lpage>731</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12654723</pubid>
                  <pubid idtype="doi">10.1101/gr.926603</pubid>
                  <pubid idtype="pmcid">430158</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>eShadow: a tool for comparing closely related sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1191</fpage>
            <lpage>1198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15173121</pubid>
                  <pubid idtype="doi">10.1101/gr.1773104</pubid>
                  <pubid idtype="pmcid">419798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Primate-specific evolution of an LDLR enhancer.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>QF</fnm>
               </au>
               <au>
                  <snm>Prabhakar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Moses</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chanan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Boffelli</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>R68</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16884525</pubid>
                  <pubid idtype="doi">10.1186/gb-2006-7-8-r68</pubid>
                  <pubid idtype="pmcid">1779597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Making very similar embryos with divergent genomes: conservation of regulatory mechanisms of Otx between the ascidians <it>Halocynthia roretzi </it>and <it>Ciona intestinalis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Oda-Ishii</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bertrand</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Matsuo</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lemaire</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Saiga</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Development</source>
            <pubdate>2005</pubdate>
            <volume>132</volume>
            <fpage>1663</fpage>
            <lpage>1674</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15743880</pubid>
                  <pubid idtype="doi">10.1242/dev.01707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Evidence for stabilizing selection in a eukaryotic enhancer element.</p>
            </title>
            <aug>
               <au>
                  <snm>Ludwig</snm>
                  <fnm>MZ</fnm>
               </au>
               <au>
                  <snm>Bergman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Kreitman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>564</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">10676967</pubid>
                  <pubid idtype="doi">10.1038/35000615</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>BACPAC Resources</p>
            </title>
            <url>http://bacpac.chori.org</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Haplotypes in the APOA1-C3-A4-A5 gene cluster affect plasma lipids in both humans and baboons.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>QF</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>O'Connell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Krauss</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Rainwater</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>VandeBerg</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Pennacchio</snm>
                  <fnm>LA</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <fpage>1049</fpage>
            <lpage>1056</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15044382</pubid>
                  <pubid idtype="doi">10.1093/hmg/ddh121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Two hepatic enhancers, HCR.1 and HCR.2, coordinate the liver expression of the entire human apolipoprotein E/C-I/C-IV/C-II gene cluster.</p>
            </title>
            <aug>
               <au>
                  <snm>Allan</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1997</pubdate>
            <volume>272</volume>
            <fpage>29113</fpage>
            <lpage>29119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">9360987</pubid>
                  <pubid idtype="doi">10.1074/jbc.272.46.29113</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Regulated expression of the apolipoprotein E/C-I/C-IV/C-II gene cluster in murine and human macrophages. A critical role for nuclear liver X receptors alpha and beta.</p>
            </title>
            <aug>
               <au>
                  <snm>Mak</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Laffitte</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Desrumaux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Joseph</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Curtiss</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Mangelsdorf</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Tontonoz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>31900</fpage>
            <lpage>31908</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12032151</pubid>
                  <pubid idtype="doi">10.1074/jbc.M202993200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>rVista for comparative sequence-based discovery of functional transcription factor binding sites.</p>
            </title>
            <aug>
               <au>
                  <snm>Loots</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Ovcharenko</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>832</fpage>
            <lpage>839</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11997350</pubid>
                  <pubid idtype="doi">10.1101/gr.225502. Article published online before print in April 2002</pubid>
                  <pubid idtype="pmcid">186580</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>VISTA Browser</p>
            </title>
            <url>http://pipeline.lbl.gov/</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>fastDNAmL: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood.</p>
            </title>
            <aug>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Matsuda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hagstrom</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1994</pubdate>
            <volume>10</volume>
            <fpage>41</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8193955</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Progress and prospects: naked DNA gene transfer and therapy.</p>
            </title>
            <aug>
               <au>
                  <snm>Herweijer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wolff</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Gene Ther</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <fpage>453</fpage>
            <lpage>458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12621449</pubid>
                  <pubid idtype="doi">10.1038/sj.gt.3301983</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Activation of vascular endothelial growth factor A transcription in tumorigenic glioblastoma cell lines by an enhancer with cell type-specific DNase I accessibility.</p>
            </title>
            <aug>
               <au>
                  <snm>Liang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>XY</fnm>
               </au>
               <au>
                  <snm>Rebar</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wolffe</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Case</snm>
                  <fnm>CC</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>20087</fpage>
            <lpage>20094</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">11912213</pubid>
                  <pubid idtype="doi">10.1074/jbc.M201766200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

