Astrogenomics: big data, old problems, old solutions?
1 Center for Epigenomics, Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
2 Division of Computational Genetics, Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
3 Center for Advanced Computing Research, California Institute of Technology, Pasadena, CA 91125, USA
4 Department of Astronomy, California Institute of Technology, Pasadena, CA 91125, USA
Genome Biology 2013, 14:129 doi:10.1186/gb-2013-14-8-129Published: 13 August 2013
First paragraph (this article has no abstract)
The ominous warnings of a data deluge in the life sciences from high-throughput DNA sequencing data are being supplanted by a second deluge, of clichés bemoaning our collective scientific fate unless we address the genomic data tsunami. It is imperative that we explore the many facets of the genome, not just sequence but also transcriptional and epigenetic variability, integrating these observations in order to attain a genuine understanding of how genes function, towards a goal of genomics-based personalized medicine. Determining any individual's genomic properties requires comparison to many others, sifting out the specific from the trends, requiring access to the many in order to yield information relevant to the few. This is the central big data challenge in genomics that still requires some sort of resolution. Is there a practical, feasible way of directly connecting the scientific community to this data universe?