A resolved misassembly from Release 2 sequence contains new trypsin genes. This illustration and Figures 3,4,5,6,7,8 are derived from the output of the graphical annotation tool Apollo , but these illustrations are not intended to be a direct representation of the data used to annotate the regions. Only evidence (shown in the black panels) directly used to annotate the gene models (shown in the cyan panels) are depicted in these illustrations. The plus strand is shown above the center scale, the minus strand below the center scale. Thin lines represent introns and thick boxes represent exons. Vertical green lines in the exons represent start codons and vertical red lines represent stop codons. An 8.5-kb region of genomic sequence on chromosome arm 2R was missing in Release 2 because of an apparent misassembly that incorrectly joined two tandemly repeated trypsin genes with a concomitant deletion of the intervening sequence (region shown in gray in the center scale). The missing sequence constituted an inverted repeat of 4 kb bordered by a simple repetitive sequence (S.C., unpublished results). Resolution of this error in Release 3 has led to the annotation of three new trypsin genes (blue rectangles): CG30025 (similar to βTry), CG30028 (similar to γδTry), and CG30031 (similar to γδTry). Gene-prediction data (dark purple for Genie and lavender for GENSCAN), cDNA data (dark green), and BLASTX protein similarity (red for Drosophila proteins, orange for other species' proteins) support the new trypsin genes.
Misra et al. Genome Biology 2002 3:research0083.1-0083.22 doi:10.1186/gb-2002-3-12-research0083