Abstract
DNA code design aims to generate a set of DNA sequences (codewords) with minimum likelihood of undesired hybridizations among sequences and their reverse-complement (RC) pairs (cross-hybridization). Inspired by the distinct hybridization affinities (or stabilities) of perfect double helix constructed by individual single-stranded DNA (ssDNA) and its RC pair, we propose a novel similarity significance (SS) model to measure the similarity between DNA sequences. Particularly, instead of directly measuring the similarity of two sequences by any metric/approach, the proposed SS works in a way to evaluate how more likely will the undesirable hybridizations occur over the desirable hybridizations in the presence of the two measured sequences and their RC pairs. With this SS model, we construct thermodynamically stable DNA codes subject to several combinatorial constraints using a sorting-based algorithm. The proposed scheme results in DNA codes with larger code sizes and wider free energy gaps (hence better cross-hybridization performance) compared to the existing methods.
Original language | English |
---|---|
Title of host publication | 2020 IEEE International Symposium on Information Theory, ISIT 2020 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 786-791 |
Number of pages | 6 |
ISBN (Electronic) | 9781728164328 |
DOIs | |
Publication status | Published - Jun 2020 |
Externally published | Yes |
Event | 2020 IEEE International Symposium on Information Theory, ISIT 2020 - Los Angeles, United States Duration: Jul 21 2020 → Jul 26 2020 |
Publication series
Name | IEEE International Symposium on Information Theory - Proceedings |
---|---|
Volume | 2020-June |
ISSN (Print) | 2157-8095 |
Conference
Conference | 2020 IEEE International Symposium on Information Theory, ISIT 2020 |
---|---|
Country/Territory | United States |
City | Los Angeles |
Period | 7/21/20 → 7/26/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
ASJC Scopus Subject Areas
- Theoretical Computer Science
- Information Systems
- Modelling and Simulation
- Applied Mathematics