Myriads of protein families, and still counting
1 Addresses: Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK
2 Centro Nacional de Biotecnología CSIC, Campus de Cantoblanco 28049 Madrid, Spain
Genome Biology 2003, 4:401 doi:10.1186/gb-2003-4-2-401Published: 28 January 2003
From the historical record of genome sequencing, we show that the rate of discovery of new families has remained constant over time, indicating that our knowledge of sequence space is far from complete.