Validation of the conservation scores obtained when applying FastCompare to H. sapiens and M. musculus. (a) Distributions of conservation scores for actual and randomized data, showing that high conservation scores are unlikely to be obtained by chance. Conservation scores for some known regulatory elements are also indicated. Both distributions were constructed using bin sizes of 5, and the top portion of the figure is not shown for the purpose of presentation. (b-d) Proportion of 7-mers supported by different types of independent biological data (using windows of size 100, see Materials and methods) as a function of the conservation score rank, obtained when applying FastCompare to H. sapiens and M. musculus. (b-d) strongly indicate that the frequency of support increases with conservation score as calculated by FastCompare.
Elemento and Tavazoie Genome Biology 2005 6:R18 doi:10.1186/gb-2005-6-2-r18