Loughborough University
Browse
NAR CLADE.pdf (6.45 MB)

Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape

Download (6.45 MB)
journal contribution
posted on 2013-01-02, 11:43 authored by Christopher G. Knight, Mark PlattMark Platt, William Rowe, David C. Wedge, Farid Khan, Philip J.R. Day, Andy McShea, Joshua Knowles, Douglas B. Kell
Mapping the landscape of possible macromolecular polymer sequences to their fitness in performing biological functions is a challenge across the biosciences. A paradigm is the case of aptamers, nucleic acids that can be selected to bind particular target molecules. We have characterized the sequence-fitness landscape for aptamers binding allophycocyanin (APC) protein via a novel Closed Loop Aptameric Directed Evolution (CLADE) approach. In contrast to the conventional SELEX methodology, selection and mutation of aptamer sequences was carried out in silico, with explicit fitness assays for 44 131 aptamers of known sequence using DNA microarrays in vitro. We capture the landscape using a predictive machine learning model linking sequence features and function and validate this model using 5500 entirely separate test sequences, which give a very high observed versus predicted correlation of 0.87. This approach reveals a complex sequence-fitness mapping, and hypotheses for the physical basis of aptameric binding; it also enables rapid design of novel aptamers with desired binding properties. We demonstrate an extension to the approach by incorporating prior knowledge into CLADE resulting in some of the tightest binding sequences.

History

School

  • Science

Department

  • Chemistry

Citation

KNIGHT, C.G., PLATT, M., ROWE, W. ... et al, 2009. Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape. Nucleic Acids Research, 37 (1), e6.

Publisher

Oxford University Press (© The Authors)

Version

  • VoR (Version of Record)

Publication date

2009

Notes

Copyright 2009 Author(s). This article is distributed under a Creative Commons Attribution 3.0 Unported License.

ISSN

0305-1048

eISSN

1362-4962

Language

  • en

Usage metrics

    Loughborough Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC