Size distribution of the 12,082 EST-derived unique transcripts from A. pisum. Contigs and singletons with (filled bars) or without (open bars) a significant hit have been selected with a cutoff value 10-5 after a BLASTX on Uniprot. Size classes (in base pairs) were binned (for sequences less than 200 bp and more than 1,500 bp) to contain a minimum of 20 sequences for both 'hits' and 'no-hits' contigs. The curves (hits, filled diamonds; no-hits, open diamonds) show the percentage of contigs for which a coding sequence was predicted by FrameD. Contigs with no predicted coding sequences are presumably entirely UTR.
Sabater-Muñoz et al. Genome Biology 2006 7:R21 doi:10.1186/gb-2006-7-3-r21