Related resources
Search for item elsewhere
University researcher(s)
Academic department(s)
Predicting context specific enhancer-promoter interactions from ChIP-Seq time course data
[Thesis]. Manchester, UK: The University of Manchester; 2017.
Access to files
- FULL-TEXT.PDF (pdf)
Abstract
We develop machine learning approaches to predict context specific enhancer-promoter interactions using evidence from changes in genomic protein occupancy over time. Occupancy of estrogen receptor alpha (ER-alpha), RNA polymerase (Pol II) and histone marks H2AZ and H3K4me3 were measured over time using ChIP-Seq experiments in MCF7 cells stimulated with estrogen. Two Bayesian classifiers were developed, unsupervised and supervised. The supervised approach uses the correlation of temporal binding patterns at enhancers and promoters and genomic proximity as features and predicts interactions. The method was trained using experimentally determined interactions from the same system and achieves much higher precision than predictions based on the genomic proximity of nearest ER-alpha binding. We use the method to identify a confident set of ER-alpha target genes and their regulatory enhancers genome-wide. Validation with publicly available GRO-Seq data shows our predicted targets are much more likely to show early nascent transcription than predictions based on genomic ER-alpha binding proximity alone. Accuracy of the predictions from the supervised model was compared against the second more complex unsupervised generative approach which uses proximity-based prior and temporal binding patterns at enhancers and promoters to infer protein-mediated regulatory complexes involving individual genes and their networks of multiple distant regulatory enhancers.
Additional content not available electronically
Our analysis is implemented in Python and all figures and output results can be reproduced by running appropriately named scripts. The package along with scripts is located in GitHub at: https://github.com/ManchesterBioinference/EP_Bayes. The repository also contains a readme file with the list of scripts and their descriptions.
Keyword(s)
Bayesian classifier; ChIP-Seq; Enhancer-promoter interaction; Estrogen receptor; Machine learning