Robust detection of biologically relevant features across a variety of DNA-seq data types by ZINBA. (a-i) For CTCF ChIP-seq (a-c), RNA Pol II ChIP-seq (d-f) and FAIRE-seq (g-i) data, the top N ranked peaks from MACS (red dashed line), F-Seq (green dashed line) and ZINBA unrefined regions (light blue dashed line), and ZINBA refined regions (blue solid line) were compared based on the proportion overlapping a biologically relevant set of features (a, d, g), average distance to the biologically relevant set of features (b, e, h) and average length of peaks (c, f, i). The biologically relevant set of features included the CTCF motif (a), transcription start sites (TSSs) for RNA Pol II (d) and DNase hypersensitive sites (DHSs) for FAIRE (g).
Rashid et al. Genome Biology 2011 12:R67 doi:10.1186/gb-2011-12-7-r67