Domain intelligible models

Sultan Imangaliyev; Andrei Prodan; Max Nieuwdorp; Albert K. Groen; Natal A. W. van Riel; Evgeni Levin

doi:https://doi.org/10.1016/j.ymeth.2018.06.011

Domain intelligible models

Sultan Imangaliyev, Andrei Prodan, Max Nieuwdorp, Albert K. Groen, Natal A. W. van Riel, Evgeni Levin

Research output: Contribution to journal › Article › Academic › peer-review

3 Citations (Scopus)

Abstract

Mining biological information from rich “-omics” datasets is facilitated by organizing features into groups that are related to a biological phenomenon or clinical outcome. For example, microorganisms can be grouped based on a phylogenetic tree that depicts their similarities regarding genetic or physical characteristics. Here, we describe algorithms that incorporate auxiliary information in terms of groups of predictors and the relationships between them into the metagenome learning task to build intelligible models. In particular, our cost function guides the feature selection process using auxiliary information by requiring related groups of predictors to provide similar contributions to the final response. We apply the developed algorithms to a recently published dataset analyzing the effects of fecal microbiota transplantation (FMT) in order to identify factors that are associated with improved peripheral insulin sensitivity, leading to accurate predictions of the response to the FMT.

Original language	English
Pages (from-to)	69-73
Journal	Methods (San Diego, Calif.)
Volume	149
DOIs	https://doi.org/10.1016/j.ymeth.2018.06.011
Publication status	Published - 2018

Access to Document

https://doi.org/10.1016/j.ymeth.2018.06.011

Cite this

@article{5fbcd3e8ec4347e79d9fb4fad3a33d62,

title = "Domain intelligible models",

abstract = "Mining biological information from rich “-omics” datasets is facilitated by organizing features into groups that are related to a biological phenomenon or clinical outcome. For example, microorganisms can be grouped based on a phylogenetic tree that depicts their similarities regarding genetic or physical characteristics. Here, we describe algorithms that incorporate auxiliary information in terms of groups of predictors and the relationships between them into the metagenome learning task to build intelligible models. In particular, our cost function guides the feature selection process using auxiliary information by requiring related groups of predictors to provide similar contributions to the final response. We apply the developed algorithms to a recently published dataset analyzing the effects of fecal microbiota transplantation (FMT) in order to identify factors that are associated with improved peripheral insulin sensitivity, leading to accurate predictions of the response to the FMT.",

author = "Sultan Imangaliyev and Andrei Prodan and Max Nieuwdorp and Groen, {Albert K.} and {van Riel}, {Natal A. W.} and Evgeni Levin",

year = "2018",

doi = "https://doi.org/10.1016/j.ymeth.2018.06.011",

language = "English",

volume = "149",

pages = "69--73",

journal = "Methods (San Diego, Calif.)",

issn = "1046-2023",

publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - Domain intelligible models

AU - Imangaliyev, Sultan

AU - Prodan, Andrei

AU - Nieuwdorp, Max

AU - Groen, Albert K.

AU - van Riel, Natal A. W.

AU - Levin, Evgeni

PY - 2018

Y1 - 2018

N2 - Mining biological information from rich “-omics” datasets is facilitated by organizing features into groups that are related to a biological phenomenon or clinical outcome. For example, microorganisms can be grouped based on a phylogenetic tree that depicts their similarities regarding genetic or physical characteristics. Here, we describe algorithms that incorporate auxiliary information in terms of groups of predictors and the relationships between them into the metagenome learning task to build intelligible models. In particular, our cost function guides the feature selection process using auxiliary information by requiring related groups of predictors to provide similar contributions to the final response. We apply the developed algorithms to a recently published dataset analyzing the effects of fecal microbiota transplantation (FMT) in order to identify factors that are associated with improved peripheral insulin sensitivity, leading to accurate predictions of the response to the FMT.

AB - Mining biological information from rich “-omics” datasets is facilitated by organizing features into groups that are related to a biological phenomenon or clinical outcome. For example, microorganisms can be grouped based on a phylogenetic tree that depicts their similarities regarding genetic or physical characteristics. Here, we describe algorithms that incorporate auxiliary information in terms of groups of predictors and the relationships between them into the metagenome learning task to build intelligible models. In particular, our cost function guides the feature selection process using auxiliary information by requiring related groups of predictors to provide similar contributions to the final response. We apply the developed algorithms to a recently published dataset analyzing the effects of fecal microbiota transplantation (FMT) in order to identify factors that are associated with improved peripheral insulin sensitivity, leading to accurate predictions of the response to the FMT.

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85052729287&origin=inward

UR - https://www.ncbi.nlm.nih.gov/pubmed/29981382

U2 - https://doi.org/10.1016/j.ymeth.2018.06.011

DO - https://doi.org/10.1016/j.ymeth.2018.06.011

M3 - Article

C2 - 29981382

SN - 1046-2023

VL - 149

SP - 69

EP - 73

JO - Methods (San Diego, Calif.)

JF - Methods (San Diego, Calif.)

ER -

Domain intelligible models

Abstract

Access to Document

Other files and links

Cite this