Predicting Success of a Digital Self-Help Intervention for Alcohol and Substance Use With Machine Learning

Lucas A. Ramos; Matthijs Blankers; Guido van Wingen; Tamara de Bruijn; Steffen C. Pauws; Anneke E. Goudriaan

doi:https://doi.org/10.3389/fpsyg.2021.734633

Predicting Success of a Digital Self-Help Intervention for Alcohol and Substance Use With Machine Learning

Lucas A. Ramos, Matthijs Blankers, Guido van Wingen, Tamara de Bruijn, Steffen C. Pauws, Anneke E. Goudriaan

Research output: Contribution to journal › Article › Academic › peer-review

11 Citations (Scopus)

Abstract

Background: Digital self-help interventions for reducing the use of alcohol tobacco and other drugs (ATOD) have generally shown positive but small effects in controlling substance use and improving the quality of life of participants. Nonetheless, low adherence rates remain a major drawback of these digital interventions, with mixed results in (prolonged) participation and outcome. To prevent non-adherence, we developed models to predict success in the early stages of an ATOD digital self-help intervention and explore the predictors associated with participant’s goal achievement. Methods: We included previous and current participants from a widely used, evidence-based ATOD intervention from the Netherlands (Jellinek Digital Self-help). Participants were considered successful if they completed all intervention modules and reached their substance use goals (i.e., stop/reduce). Early dropout was defined as finishing only the first module. During model development, participants were split per substance (alcohol, tobacco, cannabis) and features were computed based on the log data of the first 3 days of intervention participation. Machine learning models were trained, validated and tested using a nested k-fold cross-validation strategy. Results: From the 32,398 participants enrolled in the study, 80% of participants did not complete the first module of the intervention and were excluded from further analysis. From the remaining participants, the percentage of success for each substance was 30% for alcohol, 22% for cannabis and 24% for tobacco. The area under the Receiver Operating Characteristic curve was the highest for the Random Forest model trained on data from the alcohol and tobacco programs (0.71 95%CI 0.69–0.73) and (0.71 95%CI 0.67–0.76), respectively, followed by cannabis (0.67 95%CI 0.59–0.75). Quitting substance use instead of moderation as an intervention goal, initial daily consumption, no substance use on the weekends as a target goal and intervention engagement were strong predictors of success. Discussion: Using log data from the first 3 days of intervention use, machine learning models showed positive results in identifying successful participants. Our results suggest the models were especially able to identify participants at risk of early dropout. Multiple variables were found to have high predictive value, which can be used to further improve the intervention.

Original language	English
Article number	734633
Journal	Frontiers in psychology
Volume	12
DOIs	https://doi.org/10.3389/fpsyg.2021.734633
Publication status	Published - 3 Sept 2021

Keywords

ATOD
CBT
Substance Use Disorder
addiction
eHealth
log data analysis
machine learning

Access to Document

https://doi.org/10.3389/fpsyg.2021.734633

Cite this

@article{01c009280a1e4c09a00d359b1d6172dd,

title = "Predicting Success of a Digital Self-Help Intervention for Alcohol and Substance Use With Machine Learning",

abstract = "Background: Digital self-help interventions for reducing the use of alcohol tobacco and other drugs (ATOD) have generally shown positive but small effects in controlling substance use and improving the quality of life of participants. Nonetheless, low adherence rates remain a major drawback of these digital interventions, with mixed results in (prolonged) participation and outcome. To prevent non-adherence, we developed models to predict success in the early stages of an ATOD digital self-help intervention and explore the predictors associated with participant{\textquoteright}s goal achievement. Methods: We included previous and current participants from a widely used, evidence-based ATOD intervention from the Netherlands (Jellinek Digital Self-help). Participants were considered successful if they completed all intervention modules and reached their substance use goals (i.e., stop/reduce). Early dropout was defined as finishing only the first module. During model development, participants were split per substance (alcohol, tobacco, cannabis) and features were computed based on the log data of the first 3 days of intervention participation. Machine learning models were trained, validated and tested using a nested k-fold cross-validation strategy. Results: From the 32,398 participants enrolled in the study, 80% of participants did not complete the first module of the intervention and were excluded from further analysis. From the remaining participants, the percentage of success for each substance was 30% for alcohol, 22% for cannabis and 24% for tobacco. The area under the Receiver Operating Characteristic curve was the highest for the Random Forest model trained on data from the alcohol and tobacco programs (0.71 95%CI 0.69–0.73) and (0.71 95%CI 0.67–0.76), respectively, followed by cannabis (0.67 95%CI 0.59–0.75). Quitting substance use instead of moderation as an intervention goal, initial daily consumption, no substance use on the weekends as a target goal and intervention engagement were strong predictors of success. Discussion: Using log data from the first 3 days of intervention use, machine learning models showed positive results in identifying successful participants. Our results suggest the models were especially able to identify participants at risk of early dropout. Multiple variables were found to have high predictive value, which can be used to further improve the intervention.",

keywords = "ATOD, CBT, Substance Use Disorder, addiction, eHealth, log data analysis, machine learning",

author = "Ramos, {Lucas A.} and Matthijs Blankers and {van Wingen}, Guido and {de Bruijn}, Tamara and Pauws, {Steffen C.} and Goudriaan, {Anneke E.}",

note = "Funding Information: This project under the name Digging for goals using data science: Improving E-health alcohol and substance use interventions using machine learning and longitudinal clustering Digging for goals using data science was funded by ZonMw under (Grant No. 555003024). Publisher Copyright: {\textcopyright} Copyright {\textcopyright} 2021 Ramos, Blankers, van Wingen, de Bruijn, Pauws and Goudriaan. Copyright: Copyright 2021 Elsevier B.V., All rights reserved.",

year = "2021",

month = sep,

day = "3",

doi = "https://doi.org/10.3389/fpsyg.2021.734633",

language = "English",

volume = "12",

journal = "Frontiers in psychology",

issn = "1664-1078",

publisher = "Frontiers Media S.A.",

}

TY - JOUR

T1 - Predicting Success of a Digital Self-Help Intervention for Alcohol and Substance Use With Machine Learning

AU - Ramos, Lucas A.

AU - Blankers, Matthijs

AU - van Wingen, Guido

AU - de Bruijn, Tamara

AU - Pauws, Steffen C.

AU - Goudriaan, Anneke E.

N1 - Funding Information: This project under the name Digging for goals using data science: Improving E-health alcohol and substance use interventions using machine learning and longitudinal clustering Digging for goals using data science was funded by ZonMw under (Grant No. 555003024). Publisher Copyright: © Copyright © 2021 Ramos, Blankers, van Wingen, de Bruijn, Pauws and Goudriaan. Copyright: Copyright 2021 Elsevier B.V., All rights reserved.

PY - 2021/9/3

Y1 - 2021/9/3

N2 - Background: Digital self-help interventions for reducing the use of alcohol tobacco and other drugs (ATOD) have generally shown positive but small effects in controlling substance use and improving the quality of life of participants. Nonetheless, low adherence rates remain a major drawback of these digital interventions, with mixed results in (prolonged) participation and outcome. To prevent non-adherence, we developed models to predict success in the early stages of an ATOD digital self-help intervention and explore the predictors associated with participant’s goal achievement. Methods: We included previous and current participants from a widely used, evidence-based ATOD intervention from the Netherlands (Jellinek Digital Self-help). Participants were considered successful if they completed all intervention modules and reached their substance use goals (i.e., stop/reduce). Early dropout was defined as finishing only the first module. During model development, participants were split per substance (alcohol, tobacco, cannabis) and features were computed based on the log data of the first 3 days of intervention participation. Machine learning models were trained, validated and tested using a nested k-fold cross-validation strategy. Results: From the 32,398 participants enrolled in the study, 80% of participants did not complete the first module of the intervention and were excluded from further analysis. From the remaining participants, the percentage of success for each substance was 30% for alcohol, 22% for cannabis and 24% for tobacco. The area under the Receiver Operating Characteristic curve was the highest for the Random Forest model trained on data from the alcohol and tobacco programs (0.71 95%CI 0.69–0.73) and (0.71 95%CI 0.67–0.76), respectively, followed by cannabis (0.67 95%CI 0.59–0.75). Quitting substance use instead of moderation as an intervention goal, initial daily consumption, no substance use on the weekends as a target goal and intervention engagement were strong predictors of success. Discussion: Using log data from the first 3 days of intervention use, machine learning models showed positive results in identifying successful participants. Our results suggest the models were especially able to identify participants at risk of early dropout. Multiple variables were found to have high predictive value, which can be used to further improve the intervention.

AB - Background: Digital self-help interventions for reducing the use of alcohol tobacco and other drugs (ATOD) have generally shown positive but small effects in controlling substance use and improving the quality of life of participants. Nonetheless, low adherence rates remain a major drawback of these digital interventions, with mixed results in (prolonged) participation and outcome. To prevent non-adherence, we developed models to predict success in the early stages of an ATOD digital self-help intervention and explore the predictors associated with participant’s goal achievement. Methods: We included previous and current participants from a widely used, evidence-based ATOD intervention from the Netherlands (Jellinek Digital Self-help). Participants were considered successful if they completed all intervention modules and reached their substance use goals (i.e., stop/reduce). Early dropout was defined as finishing only the first module. During model development, participants were split per substance (alcohol, tobacco, cannabis) and features were computed based on the log data of the first 3 days of intervention participation. Machine learning models were trained, validated and tested using a nested k-fold cross-validation strategy. Results: From the 32,398 participants enrolled in the study, 80% of participants did not complete the first module of the intervention and were excluded from further analysis. From the remaining participants, the percentage of success for each substance was 30% for alcohol, 22% for cannabis and 24% for tobacco. The area under the Receiver Operating Characteristic curve was the highest for the Random Forest model trained on data from the alcohol and tobacco programs (0.71 95%CI 0.69–0.73) and (0.71 95%CI 0.67–0.76), respectively, followed by cannabis (0.67 95%CI 0.59–0.75). Quitting substance use instead of moderation as an intervention goal, initial daily consumption, no substance use on the weekends as a target goal and intervention engagement were strong predictors of success. Discussion: Using log data from the first 3 days of intervention use, machine learning models showed positive results in identifying successful participants. Our results suggest the models were especially able to identify participants at risk of early dropout. Multiple variables were found to have high predictive value, which can be used to further improve the intervention.

KW - ATOD

KW - CBT

KW - Substance Use Disorder

KW - addiction

KW - eHealth

KW - log data analysis

KW - machine learning

UR - http://www.scopus.com/inward/record.url?scp=85115227694&partnerID=8YFLogxK

U2 - https://doi.org/10.3389/fpsyg.2021.734633

DO - https://doi.org/10.3389/fpsyg.2021.734633

M3 - Article

C2 - 34552539

SN - 1664-1078

VL - 12

JO - Frontiers in psychology

JF - Frontiers in psychology

M1 - 734633

ER -

Predicting Success of a Digital Self-Help Intervention for Alcohol and Substance Use With Machine Learning

Abstract

Keywords

Access to Document

Other files and links

Cite this