Study on DNA Storage Encoding Based IAOA Under Innovation Constraints

aut.relation.endpage3590
aut.relation.issue4
aut.relation.journalCurrent Issues in Molecular Biology
aut.relation.pages18
aut.relation.startpage3573
aut.relation.volume45
dc.contributor.authorYan, Wei Qi
dc.contributor.authorDu, Haigui
dc.contributor.authorZhou, Shihua
dc.contributor.authorWang, Sijie
dc.date.accessioned2023-05-15T23:29:29Z
dc.date.available2023-05-15T23:29:29Z
dc.date.issued2022-04-18
dc.description.abstractWith the informationization of social processes, the amount of related data has greatly increased, making traditional storage media unable to meet the current requirements for data storage. Due to its advantages of a high storage capacity and persistence, deoxyribonucleic acid (DNA) has been considered the most prospective storage media to solve the data storage problem. Synthesis is an important process for DNA storage, and low-quality DNA coding can increase errors during sequencing, which can affect the storage efficiency. To reduce errors caused by the poor stability of DNA sequences during storage, this paper proposes a method that uses the double-matching and error-pairing constraints to improve the quality of the DNA coding set. First, the double-matching and error-pairing constraints are defined to solve problems of sequences with self-complementary reactions in the solution that are prone to mismatch at the 3′ end. In addition, two strategies are introduced in the arithmetic optimization algorithm, including a random perturbation of the elementary function and a double adaptive weighting strategy. An improved arithmetic optimization algorithm (IAOA) is proposed to construct DNA coding sets. The experimental results of the IAOA on 13 benchmark functions show a significant improvement in its exploration and development capabilities over the existing algorithms. Moreover, the IAOA is used in the DNA encoding design under both traditional and new constraints. The DNA coding sets are tested to estimate their quality regarding the number of hairpins and melting temperature. The DNA storage coding sets constructed in this study are improved by 77.7% at the lower boundary compared to existing algorithms. The DNA sequences in the storage sets show a reduction of 9.7–84.1% in the melting temperature variance, and the hairpin structure ratio is reduced by 2.1–80%. The results indicate that the stability of the DNA coding sets is improved under the two proposed constraints compared to traditional constraints.
dc.identifier.citationCurrent Issues in Molecular Biology, ISSN: 1467-3037 (Print), MDPI AG, 45(4), 3573-3590. doi: 10.3390/cimb45040233
dc.identifier.doi10.3390/cimb45040233
dc.identifier.issn1467-3037
dc.identifier.urihttps://hdl.handle.net/10292/16142
dc.publisherMDPI AG
dc.relation.urihttps://www.mdpi.com/1467-3045/45/4/233
dc.rights.accessrightsOpenAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject1108 Medical Microbiology
dc.subjectMicrobiology
dc.subject3107 Microbiology
dc.titleStudy on DNA Storage Encoding Based IAOA Under Innovation Constraints
dc.typeJournal Article
pubs.elements-id502079
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
cimb-45-00233-v2.pdf
Size:
1.43 MB
Format:
Adobe Portable Document Format
Description:
Journal article