bioRxiv | 2021

Machine learning enables discovery of DNA-carbon nanotube sensors for serotonin

 
 
 
 
 
 
 
 

Abstract


DNA-wrapped single walled carbon nanotube (SWNT) conjugates have remarkable optical properties leading to their use in biosensing and imaging applications. A critical limitation in the development of DNA-SWNT sensors is the current inability to predict unique DNA sequences that confer a strong analyte-specific optical response to these sensors. Here, near-infrared (nIR) fluorescence response datasets for ~100 DNA-SWNT conjugates, narrowed down by a selective evolution protocol starting from a pool of ~1010 unique DNA-SWNT candidates, are used to train machine learning (ML) models to predict new unique DNA sequences with strong optical response to neurotransmitter serotonin. First, classifier models based on convolutional neural networks (CNN) are trained on sequence features to classify DNA ligands as either high response or low response to serotonin. Second, support vector machine (SVM) regression models are trained to predict relative optical response values for DNA sequences. Finally, we demonstrate with validation experiments that integrating the predictions of ensembles of the highest quality CNN classifiers and SVM regression models leads to the best predictions of both high and low response sequences. With our ML approaches, we discovered five new DNA-SWNT sensors with higher fluorescence intensity response to serotonin than obtained previously. Overall, the explored ML approaches introduce an important new tool to predict useful DNA sequences, which can be used for discovery of new DNA-based sensors and nanobiotechnologies.

Volume None
Pages None
DOI 10.1101/2021.08.20.457145
Language English
Journal bioRxiv

Full Text