TY - JOUR
T1 - A Computer Simulation of SARS-CoV-2 Mutation Spectra for Empirical Data Characterization and Analysis
AU - Xiao, Ming
AU - Ma, Fubo
AU - Yu, Jun
AU - Xie, Jianghang
AU - Zhang, Qiaozhen
AU - Liu, Peng
AU - Yu, Fei
AU - Jiang, Yuming
AU - Zhang, Le
N1 - Funding Information: This work was supported by the National Science and Technology Major Project (2021YFF1201200), Sichuan Science and Technology Program (2022YFS0048) and China Postdoctoral Science Foundation (2020M673221, China). Publisher Copyright: © 2022 by the authors.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - It is very important to compute the mutation spectra, and simulate the intra-host mutation processes by sequencing data, which is not only for the understanding of SARS-CoV-2 genetic mechanism, but also for epidemic prediction, vaccine, and drug design. However, the current intra-host mutation analysis algorithms are not only inaccurate, but also the simulation methods are unable to quickly and precisely predict new SARS-CoV-2 variants generated from the accumulation of mutations. Therefore, this study proposes a novel accurate strand-specific SARS-CoV-2 intra-host mutation spectra computation method, develops an efficient and fast SARS-CoV-2 intra-host mutation simulation method based on mutation spectra, and establishes an online analysis and visualization platform. Our main results include: (1) There is a significant variability in the SARS-CoV-2 intra-host mutation spectra across different lineages, with the major mutations from G- > A, G- > C, G- > U on the positive-sense strand and C- > U, C- > G, C- > A on the negative-sense strand; (2) our mutation simulation reveals the simulation sequence starts to deviate from the base content percentage of Alpha-CoV/Delta-CoV after approximately 620 mutation steps; (3) 2019-NCSS provides an easy-to-use and visualized online platform for SARS-Cov-2 online analysis and mutation simulation.
AB - It is very important to compute the mutation spectra, and simulate the intra-host mutation processes by sequencing data, which is not only for the understanding of SARS-CoV-2 genetic mechanism, but also for epidemic prediction, vaccine, and drug design. However, the current intra-host mutation analysis algorithms are not only inaccurate, but also the simulation methods are unable to quickly and precisely predict new SARS-CoV-2 variants generated from the accumulation of mutations. Therefore, this study proposes a novel accurate strand-specific SARS-CoV-2 intra-host mutation spectra computation method, develops an efficient and fast SARS-CoV-2 intra-host mutation simulation method based on mutation spectra, and establishes an online analysis and visualization platform. Our main results include: (1) There is a significant variability in the SARS-CoV-2 intra-host mutation spectra across different lineages, with the major mutations from G- > A, G- > C, G- > U on the positive-sense strand and C- > U, C- > G, C- > A on the negative-sense strand; (2) our mutation simulation reveals the simulation sequence starts to deviate from the base content percentage of Alpha-CoV/Delta-CoV after approximately 620 mutation steps; (3) 2019-NCSS provides an easy-to-use and visualized online platform for SARS-Cov-2 online analysis and mutation simulation.
KW - SARS-CoV-2
KW - bioinformatics
KW - computational biology
KW - mutation simulation
KW - mutation spectra
UR - http://www.scopus.com/inward/record.url?scp=85146660020&partnerID=8YFLogxK
U2 - https://doi.org/10.3390/biom13010063
DO - https://doi.org/10.3390/biom13010063
M3 - Article
C2 - 36671448
SN - 2218-273X
VL - 13
JO - BIOMOLECULES
JF - BIOMOLECULES
IS - 1
M1 - 63
ER -