COMIT: identification of noncoding motifs under selection in coding sequences
Department of Biology, Boston College, 140 Commonwealth Avenue, Chestnut Hill, MA 02467, USA
Genome Biology 2009, 10:R133 doi:10.1186/gb-2009-10-11-r133Published: 20 November 2009
Coding nucleotide sequences contain myriad functions independent of their encoded protein sequences. We present the COMIT algorithm to detect functional noncoding motifs in coding regions using sequence conservation, explicitly separating nucleotide from amino acid effects. COMIT concurs with diverse experimental datasets, including splicing enhancers, silencers, replication motifs, and microRNA targets, and predicts many novel functional motifs. Intriguingly, COMIT scores are well-correlated to scores uncalibrated for amino acids, suggesting that nucleotide motifs often override peptide-level constraints.