RNA sequencing and swarm intelligence–enhanced classification algorithm development for blood-based disease diagnostics using spliced blood platelet RNA

Myron G. Best, Sjors G. J. G. in ’t Veld, Nik Sol, Thomas Wurdinger, Sjors G J G In 't Veld

Research output: Contribution to journalArticleAcademicpeer-review

76 Citations (Scopus)

Abstract

Blood-based diagnostics tests, using individual or panels of biomarkers, may revolutionize disease diagnostics and enable minimally invasive therapy monitoring. However, selection of the most relevant biomarkers from liquid biosources remains an immense challenge. We recently presented the thromboSeq pipeline, which enables RNA sequencing and cancer classification via self-learning and swarm intelligence–enhanced bioinformatics algorithms using blood platelet RNA. Here, we provide the wet-lab protocol for the generation of platelet RNA-sequencing libraries and the dry-lab protocol for the development of swarm intelligence–enhanced machine-learning-based classification algorithms. The wet-lab protocol includes platelet RNA isolation, mRNA amplification, and preparation for next-generation sequencing. The dry-lab protocol describes the automated FASTQ file pre-processing to quantified gene counts, quality controls, data normalization and correction, and swarm intelligence–enhanced support vector machine (SVM) algorithm development. This protocol enables platelet RNA profiling from 500 pg of platelet RNA and allows automated and optimized biomarker panel selection. The wet-lab protocol can be performed in 5 d before sequencing, and the algorithm development can be completed in 2 d, depending on computational resources. The protocol requires basic molecular biology skills and a basic understanding of Linux and R. In all, with this protocol, we aim to enable the scientific community to test platelet RNA for diagnostic algorithm development.
Original languageEnglish
Pages (from-to)1206-1234
Number of pages29
JournalNature protocols
Volume14
Issue number4
DOIs
Publication statusPublished - 1 Apr 2019

Keywords

  • Biomarkers/blood
  • Blood Platelets/chemistry
  • Computational Biology/methods
  • DNA, Complementary/analysis
  • High-Throughput Nucleotide Sequencing/methods
  • Humans
  • RNA Splicing
  • RNA, Messenger/analysis
  • Sequence Analysis, RNA/methods
  • Support Vector Machine/statistics & numerical data

Cite this