Bringing order to protein disorder through comparative genomics and genetic interactions
- Equal contributors
1 Department of Computer Science and Engineering, University of Minnesota, 200 Union Street SE, Minneapolis, MN 55455, USA
2 The Donnelly Centre, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
3 Banting and Best Department of Medical Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
4 Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
5 Department of Computer Science, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
Genome Biology 2011, 12:R14 doi:10.1186/gb-2011-12-2-r14
Published: 16 February 2011
Intrinsically disordered regions are widespread, especially in proteomes of higher eukaryotes. Recently, protein disorder has been associated with a wide variety of cellular processes and has been implicated in several human diseases. Despite its apparent functional importance, the sheer range of different roles played by protein disorder often makes its exact contribution difficult to interpret.
We attempt to better understand the different roles of disorder using a novel analysis that leverages both comparative genomics and genetic interactions. Strikingly, we find that disorder can be partitioned into three biologically distinct phenomena: regions where disorder is conserved but the amino acid sequence evolves quickly (flexible disorder); regions where both disorder and the amino acid sequence are highly conserved (constrained disorder); and, lastly, non-conserved disorder. Flexible disorder bears many of the characteristics commonly attributed to disorder and is associated with signaling pathways and multi-functionality. Conversely, constrained disorder has markedly different functional attributes and is involved in RNA binding and protein chaperone functions. Finally, non-conserved disorder lacks clear functional hallmarks based on our analysis.
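The three-way partition described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the use of a single alignment column with the reference species listed first, and both 0.5 cutoffs are assumptions made purely for illustration. The sketch assumes the reference residue is itself predicted to be disordered, and asks how often that disorder and that amino acid are retained in the aligned orthologs.

```python
from typing import Sequence, Tuple


def conservation_scores(residues: Sequence[str],
                        disordered: Sequence[bool]) -> Tuple[float, float]:
    """Score one alignment column; index 0 is the reference species.

    Returns (disorder_conservation, sequence_conservation), each the
    fraction of non-reference orthologs that are predicted disordered
    or that carry the same amino acid as the reference.
    """
    if len(residues) != len(disordered):
        raise ValueError("residues and disordered must have equal length")
    if len(residues) < 2:
        raise ValueError("need at least one ortholog besides the reference")
    n_orthologs = len(residues) - 1
    disorder_cons = sum(1 for d in disordered[1:] if d) / n_orthologs
    sequence_cons = sum(1 for r in residues[1:] if r == residues[0]) / n_orthologs
    return disorder_cons, sequence_cons


def classify_disorder(disorder_cons: float,
                      sequence_cons: float,
                      disorder_cutoff: float = 0.5,   # hypothetical threshold
                      sequence_cutoff: float = 0.5    # hypothetical threshold
                      ) -> str:
    """Assign a disordered reference residue to one of the three classes."""
    if disorder_cons < disorder_cutoff:
        return "non-conserved"   # disorder itself is not retained
    if sequence_cons >= sequence_cutoff:
        return "constrained"     # disorder and sequence both retained
    return "flexible"            # disorder retained, sequence diverges


if __name__ == "__main__":
    # Toy column: disorder kept in every ortholog, amino acid never kept.
    scores = conservation_scores("SAGT", [True, True, True, True])
    print(classify_disorder(*scores))
```

In this toy column, disorder is retained in every ortholog while the amino acid differs in all of them, so the residue falls into the flexible class. Real analyses would need a principled choice of cutoffs and of the disorder predictor, neither of which is specified here.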
Our new perspective on protein disorder clarifies a variety of previous results by putting them into a systematic framework. Moreover, the clear and distinct functional association of flexible and constrained disorder will allow for new approaches and more specific algorithms for disorder detection in a functional context. Finally, in flexible disordered regions, we demonstrate clear evolutionary selection of protein disorder with little selection on primary structure, which has important implications for sequence-based studies of protein structure and evolution.