Maggie: Leveraging genetic variation to identify DNA sequence motifs mediating transcription factor binding and function

Zeyang Shen, Marten A. Hoeksema, Zhengyu Ouyang, Christopher Benner, Christopher K. Glass

Research output: Contribution to journalArticleAcademicpeer-review

7 Citations (Scopus)

Abstract

Motivation: Genetic variation in regulatory elements can alter transcription factor (TF) binding by mutating a TF binding motif, which in turn may affect the activity of the regulatory elements. However, it is unclear which motifs are prone to impact transcriptional regulation if mutated. Current motif analysis tools either prioritize TFs based on motif enrichment without linking to a function or are limited in their applications due to the assumption of linearity between motifs and their functional effects. Results: We present MAGGIE (Motif Alteration Genome-wide to Globally Investigate Elements), a novel method for identifying motifs mediating TF binding and function. By leveraging measurements from diverse genotypes, MAGGIE uses a statistical approach to link mutations of a motif to changes of an epigenomic feature without assuming a linear relationship. We benchmark MAGGIE across various applications using both simulated and biological datasets and demonstrate its improvement in sensitivity and specificity compared with the state-of-the-art motif analysis approaches. We use MAGGIE to gain novel insights into the divergent functions of distinct NF-jB factors in pro-inflammatory macrophages, revealing the association of p65-p50 co-binding with transcriptional activation and the association of p50 binding lacking p65 with transcriptional repression.

Original languageEnglish
Pages (from-to)I84-I92
JournalBioinformatics
Volume36
DOIs
Publication statusPublished - 2021

Cite this