Machine learning polymer models of three-dimensional chromatin organization in human lymphoblastoid cells

Ziad Al Bkhetan, Michal Kadlof, Agnieszka Kraft, Dariusz Plewczynski

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

We present machine learning models of human genome three-dimensional structure that combine one dimensional (linear) sequence specificity, epigenomic information, and transcription factor binding profiles, with the polymer-based biophysical simulations in order to explain the extensive long-range chromatin looping observed in ChIA-PET experiments for lymphoblastoid cells. Random Forest, Gradient Boosting Machine (GBM), and Deep Learning models were constructed and evaluated, when predicting high-resolution interactions within Topologically Associating Domains (TADs). The predicted interactions are consistent with the experimental long-read ChIA-PET interactions mediated by CTCF and RNAPOL2 for GM12878 cell line. The contribution of sequence information and chromatin state defined by epigenomic features to the prediction task is analyzed and reported, when using them separately and combined. Furthermore, we design three-dimensional models of chromatin contact domains (CCDs) using real (ChIA-PET) and predicted looping interactions. Initial results show a similarity between both types of 3D computational models (constructed from experimental or predicted interactions). This observation confirms the association between genome sequence, epigenomic and transcription factor profiles, and three-dimensional interactions.

Original languageEnglish
Pages (from-to)83-90
Number of pages8
JournalMethods
Volume166
DOIs
Publication statusPublished or Issued - 15 Aug 2019
Externally publishedYes

Keywords

  • 3D genome structure
  • Biophysical modeling
  • Deep learning
  • Epigenomics
  • Machine learning
  • Transcription factors

ASJC Scopus subject areas

  • Molecular Biology
  • General Biochemistry,Genetics and Molecular Biology

Cite this