NAR CLADE.pdf (6.45 MB)
Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape
journal contribution
posted on 2013-01-02, 11:43 authored by Christopher G. Knight, Mark PlattMark Platt, William Rowe, David C. Wedge, Farid Khan, Philip J.R. Day, Andy McShea, Joshua Knowles, Douglas B. KellMapping the landscape of possible macromolecular
polymer sequences to their fitness in performing
biological functions is a challenge across the biosciences.
A paradigm is the case of aptamers,
nucleic acids that can be selected to bind particular
target molecules. We have characterized the
sequence-fitness landscape for aptamers binding
allophycocyanin (APC) protein via a novel Closed
Loop Aptameric Directed Evolution (CLADE)
approach. In contrast to the conventional SELEX
methodology, selection and mutation of aptamer
sequences was carried out in silico, with explicit fitness
assays for 44 131 aptamers of known sequence
using DNA microarrays in vitro. We capture the
landscape using a predictive machine learning
model linking sequence features and function and
validate this model using 5500 entirely separate
test sequences, which give a very high observed
versus predicted correlation of 0.87. This approach
reveals a complex sequence-fitness mapping, and
hypotheses for the physical basis of aptameric binding;
it also enables rapid design of novel aptamers
with desired binding properties. We demonstrate an
extension to the approach by incorporating prior knowledge into CLADE resulting in some of the tightest binding sequences.
History
School
- Science
Department
- Chemistry
Citation
KNIGHT, C.G., PLATT, M., ROWE, W. ... et al, 2009. Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape. Nucleic Acids Research, 37 (1), e6.Publisher
Oxford University Press (© The Authors)Version
- VoR (Version of Record)
Publication date
2009Notes
Copyright 2009 Author(s). This article is distributed under a Creative Commons Attribution 3.0 Unported License.ISSN
0305-1048eISSN
1362-4962Publisher version
Language
- en