Genome-wide predictors of NF-κB recruitment and transcriptional activity

Cieślik, Marcin; Bekiranov, Stefan

doi:10.1186/s13040-015-0071-3

Genome-wide predictors of NF-κB recruitment and transcriptional activity

Research
Open access
Published: 26 November 2015

Volume 8, article number 37, (2015)
Cite this article

Download PDF

You have full access to this open access article

BioData Mining Aims and scope Submit manuscript

Genome-wide predictors of NF-κB recruitment and transcriptional activity

Download PDF

Marcin Cieślik^1,2 &
Stefan Bekiranov¹

3293 Accesses
8 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Inducible transcription factors (TFs) mediate transcriptional responses to environmental cues. In response to multiple inflammatory signals active NF-κB dimers enter the nucleus and trigger cell-type-, and stimulus-specific transcriptional programs. Although much is known about NF-κB inducing pathways and about locus-specific mechanisms of transcriptional control, it is poorly understood how the pre-existing chromatin landscape determines NF-κB target selection and activation. Specifically, it is not known which epigenetic marks and pre-bound TFs serve genome-wide as positive (negative) cues for active NF-κB.

Results

We applied multivariate and combinatorial data mining techniques on a comprehensive dataset of DNA methylation, DNase I hypersensitivity, eight epigenetic marks, and 34 TFs to arrive at genome-wide patterns that predict NF-κB binding. Strikingly, we observed NF-κB recruitment to accessible and nucleosome-bound sites. Within nucleosomal DNA NF-κB binding was primed by H3K4me1 and H2A.Z, but also hyper-methylated DNA outside of promoters and CpG-islands. Many of these predictors showed combinatorial cooperativity and statistically significant interactions. Recruitment to pre-accessible sites was more frequent and influenced by chromatin-associated TFs. We observed that specific TF-combinations are greatly enriched for (or depleted of) NF-κB binding events.

Conclusions

We provide evidence of NF-κB binding within genomic regions that lack classical marks of activity. These pioneer binding events are relatively often associated with transcriptional regulation. Further, our predictive models indicate that specific combinations of epigenetic marks and transcription factors predetermine the NF-κB cistrome, supporting the feasibility of using statistical approaches to identify “histone codes”.

View this article's peer review reports

Computational modeling of chromatin accessibility identified important epigenomic regulators

Article Open access 08 January 2022

MKL1 defines the H3K4Me3 landscape for NF-κB dependent inflammatory response

Article Open access 15 March 2017

Determinants of transcription factor regulatory range

Article Open access 18 May 2020

Background

Complex mechanisms have evolved to control gene expression in response to stimuli [1]. Signaling pathways bridge environmental signals with changes in gene expression patterns. The terminal transcription factors (TFs) of these pathways bind to specific regulatory regions and modulate gene expression through interactions with co-activators and co-repressors [2]. TFs need to locate their target sites within more than 3 billion base pairs of the human genome. The specificity of motif binding, accessibility of recognition sites, and expression of cell-type restricted cooperative binding partners, is believed to determine the fidelity of the transcriptional response (e.g. [3–5]). The canonical role of some TFs is to mediate responses to environmental cues (e.g. [6–8]), which converges on a relatively small number of TFs including CREB1 and the AP-1 and NF-κB families [9, 10]. The goal of this study was to computationally dissect the biological signals that pre-determine recruitment and transcriptional activation by NF-κB using logistic regression, non-negative matrix factorization, combinatorial enrichment analysis and statistical enrichment analysis.

The NF-κB-mediated response depends on the translocation of NF-κB proteins to the nucleus [11]. Selectivity of the transcriptional program is in part due to the specificity of different NF-κB homo- and hetero-dimers to distinct subsets of targets and cognate sequences [12–14], but also depends on the availability of tissue-restricted DNase I hypersensitive (DHS) sites [15]. These differences are among the primary determinants of cell-type identity, but can change in response to the environment and in pathologies [16–18].

Mechanisms of chromatin-mediated TF recruitment are well understood for the ligand-induced nuclear glucocorticoid receptor (GR) [19]. GR target-genes are often induced (or repressed) rapidly in response to hormone treatment [20]. It has been shown that GR binding at target promoters is almost exclusively associated with chromatin that is hypersensitive prior to hormone treatment [21]. These regions of accessible chromatin have been shown to be maintained by AP-1, which was required for GR-mediated response [22, 23].

In stark contrast to GR, “pioneer” transcription factors, such as PU.1 or FOXA1, are typically lineage-restricted and transcriptionally regulated [24]. They have the ability to bind nucleosomal DNA and to reduce nucleosome occupancy [25–27]. The functional consequence of this is the transcriptional activation of enhancers and promoters [28]. FOXA1 has been shown to be recruited to locally hypo-methylated sites within broader hyper-methylated regions [29]. These unmethylated CpG windows have been reported to complement bivalent domains in defining tissue-specific enhancers [30]. Importantly, it was recently shown that the pluripotency TFs OCT4, SOX2, and KLF4, have chromatin-binding characteristics very similar to classic pioneers [31, 32]. These studies highlighted that pioneer TFs have a similar role in chromatin remodeling, during development, cellular reprogramming, and transcriptional activation [33–37].

Transcriptional activation by NF-κB and its recruitment to chromatin appears to involve multiple mechanisms and results in a complex chromatin binding pattern and multiphasic transcriptional kinetics [38, 39]. According to the current model, differences in NF-κB binding are largely determined by the chromatin state [40], but specific targeting signals are unknown. Some target sites are bound immediately, whereas binding to others is delayed and ostensibly depends on prior chromatin remodeling [41–43]. It has been proposed that immediate binding sites are constitutive i.e. associated with genes induced in response to a broad range of stimuli. On the other hand, delayed binding is thought to be regulated through the cell type specific immediate response and induction of pathways that result in the remodeling of chromatin [9, 40, 44].

TNF-induced p65 (RELA or NFKB3, an NF-κB subunit) binding has been mapped genome-wide in a number of cells lines (THP-1, HeLa, A549, dendritic cells [15, 45, 46]). These data provided evidence that p65 is recruited to tens of thousands of cell-type specific loci [35, 47, 48]. Strikingly, a substantial number (~17 %) of p65 binding events occurred outside of pre-accessible chromatin [15, 49–52].

Despite these advances, much work remains to be done in order to better understand how inducible TFs including p65 are recruited to their eventual binding sites [40]. Here, we leveraged statistical machine learning on large integrated functional genomic data sets generated by ENCODE and other groups in A549 cells (a non-small cell lung cancer cell line and model for the study of inflammatory signaling in cancer) to assess the extent to which the majority of observed binding events are opportunistic or guided by specific signals [53–56]. Specifically, we analyzed binding sites for 34 TFs and two control tracks, digital genomic foot-printing (DGF) [57], DNA methylation (RRBS) [58], ChIP-seq [59] for 8 epigenetic marks, and RNA-seq [60] data. To arrive at predictive rules that determine which sites are likely to recruit p65, we statistically evaluated the individual and cooperative impact of NF-κB motifs, DNA methylation, DNase I hypersensitivity, histone modifications, and pre-bound TFs on p65 binding. However, genome-wide patterns of epigenetic features are noisy, complex, and highly correlated. Different approaches are required for chromatin marks and TFs due to their continuous (quantitative signals) and discrete (sites along the genome) nature, respectively. We found that TF binding was sufficiently well captured through binary variables (i.e., site present or absent), whereas epigenetic marks require quantitative features. To discover rules that govern the p65 cistrome (i.e., factors and states of the chromatin that recruit p65) diverse computational and statistical techniques had to be applied and developed including logistic regression based approaches—presence or absence of p65 binding as a function of quantitative features including histone modification levels. However, the identification of factors that cooperate to recruit p65 using regression-based approaches is particularly challenging due to the co-linearity of many of the input variables, leading to potentially meaningless regression coefficients. Consequently, we developed and applied a matrix factorization approach and applied it to histone modification data, which identifies “codes” of co-occurring marks which tend to be less correlated and yield more meaningful, interpretable regression coefficients. Moreover, while we analyzed the ability of TFs to recruit p65 one at a time (univariate statistical enrichment analysis), TFs are known to co-recruit other factors and likely work in a cooperative, combinatorial fashion in further recruitment of factors like p65 upon TNF stimulation. Consequently, we developed and applied combinatorial enrichment/association methods that uncovered combinations of TFs that recruit p65. Predictive clusters of TFs and synergistic patterns of histone marks, were indeed captured through “greedy” association rules and matrix factorization, respectively, indicating the existence of a strong combinatorial epigenetic code. The codes were often context dependent, which suggests that datasets with higher temporal and spatial resolution will enable their further refinement. Our computational approach reveals that p65 binding occurs within accessible and inaccessible chromatin regions, hence suggesting that p65 may function as a pioneer TF. Within accessible regions p65 recruitment is significantly predicted by a number of chromatin features and their combinatorial interactions, revealing an intricate yet specific set of p65-targeting epigenetic codes.

Methods

Peak calling

TF peaks were called using PeakRanger [61] version 1.16, which has been shown to yield above average spatial accuracy compared to other peak callers, using the A549 ChIP-seq control track provided by ENCODE. Due to the large number of processed data sets we used the tool consistently with all default parameters. DNase I hypersensitive (DHS) [62] sites from DGF sequencing were also called using PeakRanger and the suitability of this method was visually inspected.

Motif analysis

De novo motif discovery was done within p65 peaks (see Peak Calling) that did not overlap another TF peak or DHS site using MEME-chip with all default settings [63]. The recovered NF-κB motif, which closely matches the canonical sequence, was used to scan all genomic sites using tomtom [64] (see Regulatory sites definition).

Site conservation

We probed evolutionary conservation of regulatory sites via PhastCons [65] conservation scores for placental mammals, which limited our focus to conservation across mammalian vertebrates and were obtained from the UCSC genome browser track: phastCons46way.placental.

Known NF-κB targets

Refseq ids of known targets were retrieved from: http://www.bu.edu/nf-kb/gene-resources/target-genes/. Accessed 23 Nov 2015.

CpG islands

To classify genomic sites into CpG-islands (CGI) and non-CGI, we used two complimentary CGI definitions. The first set of CGIs was obtained from http://rafalab.jhsph.edu/CGI/-model-based-cpg-islands-hg19.txt[66]. The second set was from the UCSC genome browser cpgIslandExt.gz [67]. We defined a site as CGI or non-CGI if both CGI definitions were in agreement.

DNA methylation

Experimental methylation levels at CpG sites were provided by ENCODE. The data was generated using the reduced representation bisulfite sequencing technique (RRBS) (HAIB Methyl RRBS Track Protocol). Two replicates were available probing methylation at over 1.5 M sites. These sites are biased towards MspI cleavage sites, and high GC content, but cover the majority of promoters and CGIs. We kept only sites that were present in both replicates and had ten reads coverage. We classified them depending on methylation range (0 % < 25 % < 75 % < 100 %) as “low”, “medium”, “high” [68].

Regulatory sites definition

For statistical analyses and significance tests we exclude parts of the genome that appear to have no regulatory activity in either TNF-stimulated or unstimualted A549 cells. This allows us to evaluate statistical enrichments more stringently since we are omitting a large portion of the genome that had low a priori likelihood of being bound by p65. We base our definition of “regulatory sites” conservatively on TF-binding and DNase I hypersensitivity in unstimulated cells. We merge overlapping peaks of TFs into sites. Consequently, each site contains one or more TF peaks which overlap or are directly adjacent (no gap). Specifically, we use the cluster tree implementation from bx.python (https://bitbucket.org/james_taylor/bx-python/) to find the set of all “regulatory sites”.

Regulatory sites classification

We defined several sub-types (classes) of regulatory sites. First, we identified “inaccessible sites” as occupied by p65 in TNF-stimulated cells but not other TFs or DNase I hypersensitive sites in unstimulated cells. All remaining, regulatory sites were termed “accessible” as they were either occupied by a TF in unstimulated cells or DNase I hypersensitive. First, regulatory sites were classified as TSS-proximal or TSS-distal. If a site overlapped any TSS known to GENCODE v14 it was defined as TSS-proximal. Next we sub-classified proximal or distal sites based on number of TFs bound (occupancy): stage 1 TSS-proximal bound 1–2 TFs; stage 1 TSS-distal - bound by exactly one TF; stage 2 TSS-distal - bound by 2 to 4 TFs. The following marks: CTCF, DNase I hypersensitivity, and RAD21 (cohesin) did not contribute to the occupancy count.

Enrichment profiles

Positional enrichment profiles for epigenetic marks and PhastCons scores were generated using a modified version of the ACT.py using the tool’s default setting at 20 bp resolution [69].

ChIP-seq data scaling

Within regulatory sites reads counts have been obtained using: genomic_overlaps count -i, or bedtools intersect -c from the aligned ChIP-seq datasets. These raw read-counts were normalized by region length. And scaled to the 0 to 1 range using a sigmoid function, such that the scaling is approximately linear up to the 95th percentile [70].

Logistic regression

All models predicted binary p65 occupancy from levels of scaled epigenetic marks, epigenetic combinatorial patterns (see Combinatorial NMF “codes”), and binary p65 motif occurrence. The individual models differed in the number of included independent variables as described in the text. All models were trained according to the same scheme using scikits-learn [71]. 20 % of the observations were withheld as a test set. On the 80 % training set we used 10-fold cross-validation to select the best prior regularization parameters (L₁ or L₂ and C) using the Matthews correlation coefficient (MCC) as a balanced performance measure [72], and the prediction cut-off p_c = 0.5. The best cross-validated model was further tuned by scanning for a p_c that maximized the model’s MCC on the training set. The tuned model was benchmarked against the test set; we report both area under curve (AUC) and MCC.

Epigenetic mark correlations

Correlations between scaled epigenetic marks were calculated as Spearman’s rank correlation coefficient.

Epigenetic mark interactions

We have chosen the geometric mean (GM) to represent the joint magnitude of two marks. It is the square root transformation of the product of two magnitudes and scales the data to resemble a normal distribution.

Association between continuous levels and binary response

To assess the strength of association between epigenetic marks (or mark interactions) with p65 binding we calculated enrichment p-values via the random set (RS) statistical test [73]. The RS test statistic has a large magnitude, and corresponding small p-value, only if sites occupied by p65 have higher levels of marks, codes, expression, etc. than sites selected at random.

Combinatorial NMF “codes” of epigenetic marks

We used the NMF algorithm [74, 75] to decompose “chromatin patterns” at sub-classes of regulatory sites (see Regulatory sites classification) into small combinatorial “codes” of epigenetic marks [76]. Intuitively, codes capture correlations between histone modifications that are either global or local (valid for a subset of sites). Within each code values of individual histone modifications are tied, which is a proxy for their average relative abundance. Each regulatory site is associated with a weight for each code, the complete “chromatin pattern” (see ChIP-seq data scaling) is reconstructed by summing over codes multiplied by their weights. We validate that regression on code weights has desirable properties over regression on all scaled levels of all individual marks (see Additional file 1: Methods).

Combinatorial chromatin clusters of transcription factors

We used two approaches to discover chromatin-based combinatorial clusters of transcription factors; “TF-clusters” that are associated with p65 recruitment. This is a combinatorial optimization problem as the number of possible combinations of 30+ TFs is large. The first approach (TGC) directly relied on the significance of the statistical enrichment, Fisher’s exact test (FET), and greedily “expanded” TF-clusters if they improved the p-value (association strength) by 0.2 (log2-odds), but were also relatively frequently observed genome-wide (at least 40 instances). The second approach, called “top-k non-redundant association rule” (TNR, [77]) was less biased in that it found the most “important” non-redundant associations between combinations of TFs, from which we selected those that predicted p65 binding (see Additional file 1: Methods).

Linking genes, TSS-proximal and TSS-distal sites

To assign TSS-distal to TSS-proximal regulatory sites we followed a probabilistic approach that involves both distance restraints (100 kb) and the similarity of TF binding patterns (based on [37]). To assign regulatory sites to protein-coding genes we use a simple overlap criterion for TSS-proximal sites, and a 100 kb distance limit for enhancers (see Additional file 1: Methods).

Differential expression

We used the DESeq [78] algorithm to identify differentially expressed (up- or down-regulated) GENCODE genes. We obtained raw exon-based expression estimates using the htseq-count defaults (see Additional file 1: Methods).

Statistical analysis of proportions

Ninety five percent confidence intervals for the proportion of upregulated genes and statistical significance of the difference between two proportions have been estimated using the methods suggested by [79] as implemented in the R-package binom.

Results

p65 is recruited to “latent” and “primed” enhancers

According to the current model, p65 (RELA) is rapidly recruited to mostly constitutively accessible sites [15, 40]. To assess this on a genome-wide scale we determined the overlap between DNase I hypersensitive (DHS) sites and post-induction p65 peaks (Fig. 1a–b). DHS regions are reliable indicators of DNA binding by the majority of TFs [57, 80–82]. We find a total of 70,146 p65 peaks at a 1 % FDR, which reduce to 65,253 after collapsing into larger regulatory sites (Methods). We have chosen to include all peaks which cannot be explained by experimental noise [53, 83]. Notably, of the approximately 400 known, well-annotated NF-κB target genes (see Methods) we find 150 contain a peak in their promoter-proximal region (see Additional file 2: Data). We find a surprisingly large number (21,103) of p65 binding events outside of pre-existing DHS sites, mostly outside (20,591) of promoter-proximal regions (Fig. 1a; Fig. 1a and b are similar to Venn diagrams and show the total overlaps between p65, promoters, DHS and NF-κB motifs). In total, 32 % off all p65 peaks are outside DHS sites, while 33 % of all (132,236) DHS sites show enrichment of p65. To rule-out the possibility that the majority of p65 binding is non-specific and opportunistic due to saturation [53, 83, 84], we assessed the overlap of experimental peaks with occurrences of the NF-κB motif (Fig. 1b). Overall, of the 65,253 p65-positive regions 36,464 (56 %) have an NF-κB motif (Methods). Strikingly, this proportion was significantly higher for p65 peaks outside DHS sites 73 %, compared to 48 % in nucleosome-free regions. Next, we attempted to identify other proteins that are co-recruited with p65 to the nucleosome-bound sites by applying de novo motif discovery. However, the only detectable motif was the canonical NF-κB sequence (Additional file 1: Figure S1A). To test whether p65 binding outside of nucleosome-depleted regions is likely “functional” we probed for evolutionary conservation (Methods). We found evidence for purifying selection at p65 summits within and outside DHS sites (Fig. 1c). We observed significantly stronger conservation at summits within constitutive DHS sites. Which is expected since lineage-specific regulatory sequences are not well conserved across mammals [85].

Recruitment to hyper-methylated distal regulatory sites (statistical enrichment analysis)

Next, since TFs differ markedly in their ability to bind methylated DNA, we asked whether CpG-methylation influences p65 recruitment propensities. We assessed the overlap between p65 peaks and CpG methylation status at promoter and putative enhancer regions divided into CpG-islands (CGIs) and non-CpG-islands (non-CGIs) (Fig. 1d). In normal cells most CGIs are constitutively hypo-methylated. Conversely, non-CGIs, which include gene body and enhancer CpGs, have dynamic methylation levels [86]. Cancer cells in general are characterized by global losses of non-CGI methylation and focal increases in CGI methylation [87]. Importantly, LPS primary response genes tend to have CpG-rich constitutively accessible promoters [44]. We found that p65 recruitment is sensitive to methylation status both at promoter-proximal and distal regulatory regions and depends on the CGI context. As expected, hypo-methylated CGI sites showed an increased propensity for p65 binding, whereas methylated CGIs were depleted of p65. The effect was more pronounced at TSS-proximal sites. Hypo-methylated non-CGI promoters were likewise enriched for p65, however we also observed a modest but significant (2.5-fold, (Fisher’s Exact Test) FET p-value 8.38e-73) enrichment (over that expected by chance) for p65 binding at hyper-methylated, non-CpG, TSS-distal regions. This is unexpected given that, genome-wide, p65 preferentially binds to promoters (11-fold enrichment), CpG-islands, and hypo-methylated sites (16-fold). Why would DNA methylation at a class of non-CpG, promoter-distal regions increase p65 binding propensity? In stark contrast to CGIs, at CpG-poor sites hyper-methylation is correlated with DNase I hypersensitivity (2-fold, p 7.28e-64). However, even after controlling for DHS status hyper-methylated regions are still enriched for p65 peaks in DHS-positive (1.65-fold, p 5.91e-13) and DHS-negative (3-fold, p 2.25e-42) subsets of non-CpG, promoter-distal sites.

Rapid recruitment to inaccessible heterochromatic regions

We hypothesized that p65 recruitment to both DHS-positive and DHS-negative regions is guided, to some extent, by epigenetic marks [26, 88]. We used DHS sites and TF peaks to identify all genomic “regulatory sites” in TNF-stimulated and unstimulated A549 cells (Methods) and classified them as either “accessible” or “inaccessible”. The accessible sites were either, strictly DNase I hypersensitive, or enriched for any of the 34 TFs for which data was available (Methods). Surprisingly, we found that 16,065 p65 summits were within inaccessible regions. One concern is that these sites are possibly non-functional. We again found evidence for sequence conservation at p65 summits within inaccessible sites (Fig. 1c). We also find a large fraction of the 16,065 sites contain one or more NF-κB motifs (Fig. 1b). As we discuss below, we find p65-dependent transcriptional induction displays a strong monotonic downward trend as a function of the number of TFs bound at promoters (see Fig. 5). Promoters with no TFs bound showed the strongest induction consistent with this trend. Importantly, many of these promoters do not contain a DHS, further arguing that these inaccessible sites are functional. Finally, we compared chromatin enrichment profiles between the two classes of sites and found that accessible sites were enriched for epigenetic marks typically associated with permissive or active chromatin, while inaccessible sites showed no significant enrichment (Fig. 2a).

Differences in histone modification levels at distinct classes of p65 binding sites (comparison of distributions)

To investigate this in more detail we compared histone modification levels between p65-bound and unbound sites. For this and all subsequent analyses we focus on three classes of accessible regulatory sites. (1) TSS-proximal sites with at most 2 TFs pre-bound; (2) DHS-negative TSS-distal sites with at most 1 TF pre-bound; and (3) DHS-positive TSS-distal sites with 2 to 4 TFs pre-bound. We will refer to these classes as stage 1 promoters, stage 1 enhancers, and stage 2 enhancers, respectively. The rationale behind this classification is to separate the earliest detectable high-confidence regulatory regions (single TF), from enhancers that recruited co-factors (up to 4 TFs) and from fully assembled enhancers.

We found that sites that recruit p65 have higher levels of several histone modifications, H3K4me1 in particular (stage 1 Fig. 2b, stage 2 Additional file 1: Figure S2). However, the differences were pronounced only at stage 2 enhancers, which are almost exclusively accessible sites (Additional file 1: Figure S2). Within inaccessible regions p65 site selection was weakly correlated with H3K4me1 and H3K4me2, but almost fully independent of H3K27ac (Fig. 2b). At accessible sites enrichment profiles for several marks have a bimodal shape around the peak summit, which is the most likely location of the NF-κB motif (Additional file 1: Figure S1B) [47, 89]. On the other hand, H2A.Z, a histone variant associated with regulatory regions [90], has a single peak over the p65 summit. This suggests that the H2A.Z-containing nucleosome is centered on the NF-κB element.

Pioneering recruitment of p65 to latent and epigenetically silent enhancers

Together, these results suggest that within 30 min of induction (i.e., 30 min after initial TNF treatment; see [46] for experimental details), p65 is being recruited to both, DNase I hypersensitive and nucleosome-bound sites. Although the recruitment to DHS sites is qualitatively stronger (Additional file 1: Figure S3) and occurs more frequently (7.1-fold, p 0), p65 binding to nucleosome-bound DNA appears to be more specific as judged by the occurrence of the NF-κB motif. Constitutively accessible sites typically harbor clusters of multiple TF recognition elements [53, 81], which might suggest that the higher conservation of accessible sites is due to purifying selection on motifs of other TFs. Most strikingly, outside of promoters and CpG islands, p65 binds preferentially to methylated sites; which brings to mind the methyl-DNA binding capabilities of FOXA1, a “pioneer” transcription factor [29, 91, 92]. In summary, we found that p65 binding within inaccessible regions occurred both at epigenetically silent “latent” (i.e., H3K4me1 low) and epigenetically “primed” enhancers (i.e., H3K4me1 high, and/or DNase I hyper-sensitive) [48, 93].

Epigenetic marks are quantitative predictors of p65 recruitment

When transcription factor binding and histone modification levels are mapped in the same state (cell line, condition) they appear highly correlated. However, whether epigenetic marks are truly useful predictors of TF-binding is disputed (e.g. [94–96]). In particular it is not well established, whether histone modifications confer binding specificity, and if TFs prefer sites with a specific chromatin pattern [88]. We decided to test this hypothesis by using histone modification levels prior to stimulation to predict which sites will recruit p65. Since there is no consensus regarding how to define enhancers from epigenetic marks (e.g. [89]) we favored a conservative approach and identified cis-regulatory regions based on the presence of at least one TF and/or DNase susceptibility. Only a small fraction of all sites (~5 %, 10,625) will recruit p65 on stimulation.

Univariate predictive modeling of p65 recruitment (logistic regression analysis)

We tested whether p65 binding can be predicted from histone modification levels, either on their own, or together with occurrences of the κB recognition element (Table 1). We modeled the probability of p65 recruitment using a logistic function using histone modifications and motif occurrences as predictors (see Methods). We found that motif-based prediction was universally improved by the inclusion of any histone modification, with a substantial average gain of Δ0.055 area under ROC (AUC). The improvement was highest at promoters, on average by Δ0.072 AUC with six of the eight histone modifications reaching Δ0.10 AUC. It is remarkable that at TSS-proximal sites H2A.Z and H3K4me2 are better predictors of p65 recruitment than the occurrence of the p65 motif – 0.72 vs 0.69 AUC, respectively. The binding of p65 at stage 1 enhancers sites is most strongly determined by the presence of the NF-κB motif (0.79 AUC). This score is most significantly improved by H3K4me1 to 0.86 AUC, which along with H3K4me2 and the p65 motif yield the best-performing models overall. At enhancers with several cofactors, neither motif-occurrence, nor histone modifications were individually sufficient to model p65 binding (max 0.63 AUC) and even the best-performing combined model barely exceeded 0.70 AUC. The predictive performance of histone acetylations was low but similar at stage 1 and stage 2 enhancers. Conversely, lysine methylations, which significantly determined p65 binding at stage 1 enhancers, were less relevant at stage 2 enhancers (Table 1).

Table 1 Performance of epigenetic marks in the prediction of p65 binding

Full size table

We have shown that even after controlling for DNase I hypersensitivity, TF occupancy, and motif occurrence, specific epigenetic marks determine p65 recruitment. It was previously shown that H3K4me1 is involved in the priming of enhancers for activation in multiple lineages [97]. H3K4me1 depositions are relatively stable [89], and their presence is thought to be determined by lineage-restricted “pioneer” transcription factors, such as PU.1 [29, 35, 47]. It has been postulated that this defines lineage-restricted enhancers and creates binding sites for additional inducible TFs [37, 98]. The finding that p65 recruitment to stage 1 enhancers was correlated with H3K4me1 levels is consistent with the current interpretation of H3K4me1 as a mark of regulatory regions primed for activation [97, 99–101]. Histone acetylations are associated with active enhancers [100, 102], but are also highly correlated with DNase I hypersensitivity and other permissive histone marks [103]. In line with previous findings which determined an important role of H2A.Z in the regulation of transcription [104–106], our results suggest that H2A.Z-marked promoters are poised for increased TF binding (Additional file 1: Figure S4).

Context-dependent roles of epigenetic marks in p65 recruitment

It has been argued that the co-occurrence of multiple epigenetic marks is necessary for specific chromatin-triggered processes [107, 108] and that combinations of multiple marks are required to reliably identify tissue-specific enhancers and direct TFs to specific chromatin regions [88]. We reasoned that p65 site selection could be further refined by cooperative action between multiple marks. To discover these synergistic effects we built a predictive model of synergistic p65 recruitment. This model was augmented by more standard correlative descriptions of genome-wide associations (Additional file 1: Figure S5).

Synergism and correlation between predictive chromatin features

Through correlative analyses (Additional file 1: Text) we found that canonical active marks (H3K9ac, H3K27ac, H3K4me3) were involved in synergistic interactions with H3K4me1 at promoters and H2A.Z at enhancers (Additional file 1: Figure S5). At promoters, high levels of H3K4me1 and active marks were mutually exclusive (Additional file 1: Figure S5A), and H3K4me1 was unable to recruit p65 on its own. This suggests that mono-methylation is only an intermediate in the further activation of promoters [109]. Conversely, at enhancers H3K4me1 has an independent role in p65 recruitment and appears to be required for H3K9ac enrichment (Additional file 1: Figure S5B). Although, p65 recruitment was weakly associated with H2A.Z [110] on its own (Table 1) we observed strong synergism between H2A.Z and other active marks at nucleosome-bound, but not nucleosome-free, enhancers (Additional file 1: Figure S5B).

Multivariate predictive modeling using non-negative matrix factorization (NMF)

To construct the predictive model we used non-negative matrix factorization (NMF) [111] to decompose “chromatin patterns” at TSS-proximal and TSS-distal regulatory regions into fundamental additive parts we refer to as “codes”. The method is explained in detail in the (Additional file 1: Methods) [76]. Briefly, we construct an “epigenetic matrix” V, which is factored via NMF into matrices W and H. V has rows which are regulatory sites and columns which are epigenetic marks. Elements of V are normalized levels of a specific mark at a certain site. When multiplied, matrices W and H result in an approximation of V. Matrix H contains a small number of rows, each row represents a combination of marks that frequently co-occur in subsets of genomic regions (codes). Matrix W contains weights on how to reconstruct each row of V using the codes in H.

We applied the algorithm separately on each of the three classes of regulatory sites and discovered four, largely equivalent, combinatorial patterns (Fig. 2c). The equivalent codes from different regions showed quantitative differences in mark values, which reflect differences in the relative levels and importance of marks within a pattern [76]. The first code subsumes the permissive epigenetic marks (H3K9ac, H3K27ac, H3K4me2, H3K4me3) (Additional file 1: Figure S6A). The remaining three codes capture context-dependent patterns, which is a particular strength of NMF [111, 112]. Globally H2A.Z and H3K4me1 are only weakly correlated (Additional file 1: Figure S6A), but they were hypothesized to co-occur at a subset of primed enhancers [101] and this exact pattern is captured through code 3. Another subclass of enhancers is silenced by the presence of the repressive H3K27me3 mark [99, 100]. Code 2 has positive values for H3K27me3 and H2A.Z, which captures their dependence at both promoters and enhancers. The last pattern is related to H3K79me2. At TSS-proximal regions this mark is associated with transcription elongation by RNAPII [113] and is relatively independent of other histone modifications (Additional file 1: Figure S6A). Conversely, within TSS-distal regulatory regions H3K79me2 is tied to the enhancer mark H3K4me1, which suggests that the code is largely specific to enhancers within introns of actively transcribed genes.

Signals of p65-targeting “histone codes” (logistic regression analysis using NMF “code” weights as predictors)

The unique “chromatin pattern” at each locus is reconstructed from prototypical codes and locus-specific weights. The codes are simply tied levels of specific mark combinations, the weights reflect the importance of a code at a locus. Together, this allows us to quantitatively link combinatorial epigenetic patterns to p65 recruitment using logistic regression (Methods). We modeled the probability of p65 binding as a function of the code weights and motif occurrence. The recruitment of p65 is most easily predicted at promoters (AUC = 0.82, MCC = 0.42) followed by stage 1 (AUC = 0.85, MCC = 0.32), and stage 2 (AUC = 0.71, MCC = 0.31) enhancers. We found that some codes were robust predictors of p65 binding (Fig. 2d, Additional file 1: Methods). Motif presence has the largest standardized coefficient (log-odds ratio, LOR). However, as few as 7.4 % of stage 1 TSS-distal sites contain a p65 motif, which means that the large 0.86 LOR (2.4 fold) applies only to a rare subset of the population of sites [114]. We found that code 3 (Fig. 2c), was an important predictor at stage 1 enhancers (Fig. 2d). Code 4 showed opposite roles in the recruitment of p65 to promoter and enhancer sites. This counter-intuitive observation is largely explained by differences in the code itself. At stage 1 distal sites the code ties levels of H3K79me2 and H3K4me1, whereas at proximal sites it is driven by H3K79me2 alone. Still, it appears that promoters that have successfully recruited RNAPII and engaged in elongation – as judged by higher levels of H3K79me2 – are less likely to recruit the transcriptional activator p65. The “repressive” code 2 had only a weakly negative effect on p65 binding. This suggests that the repressive role of H3K27me3 can be mostly attributed to the inverse correlation with H3K27ac and other “active” marks (Additional file 1: Figure S6A), but not H3K27me3 itself (code 1).

Together, these results reveal context-dependent roles of histone modifications in p65 recruitment Further, we found that H2A.Z in the context of H3K4me1 (code 3) is the strongest epigenetic predictor of p65 recruitment to TSS-distal sites, whereas in the context of H3K27me3 (code 2) it is slightly impeding. H3K27me3 deposition has been shown to be dependent on H2A.Z [115], whereas the mechanism that lead to H3K4me1 and H2A.Z co-occurrence are poorly understood. Co-operativity of H2A.Z with other “active” marks (code 1) (Additional file 1: Figure S5B), agrees with the observation that codes 1 and 3 are relatively uncorrelated (Additional file 1: Figure S6B). The co-occurrence of H2A.Z and H3K4me1 at a relatively large subset of enhancers (code 2) further supports the emerging interpretation of the H2A.Z/H3K4me1 enhancer pattern as priming enhancers for activation [101].

p65 site selection is guided by the presence of specific TFs and overall occupancy

While the predictive performance of the code-based models is considerably better than relying just on one mark and motif presence, the predictions are far from perfect (Table 1). The models’ sensitivity (i.e., percentage of correctly predicted p65 binding events) with respect to p65-bound sites is low (20 %) at stage 1 enhancers, and modest at stage 2 enhancers (34 %). This is still remarkable given that less than 2 % of stage 1 sites are bound by p65. It has been widely accepted that tissue specific gene expression is achieved through combinatorial interactions among transcription factors [116, 117]. We reasoned that the presence or absence of chromatin-associated proteins, transcription factors, and their combinations could provide an independent set of cues for p65 site selection.

Our first goal was to understand the general trend by which chromatin-bound proteins influence p65 recruitment. Recent findings suggest that following stimulus the induced transcription factors frequently localize at “pre-coded” sites with a high-occupancy of other TFs (HOT regions) [37, 118]. We observed that the probability of p65 binding increases monotonically with the number of proteins present at a site (Fig. 3a). For example, sites which are bound by 12 proteins have a 75 % chance of recruiting p65 after stimulation. The presence of specific TFs can increase or decrease binding frequency. We found, among others, that sites bound by GATA3 and BCL3 are overall less likely to recruit p65, while sites bound by CREB1 have an increased propensity for p65 (Fig. 3a). Importantly, BCL3 is a nuclear IκB protein with strikingly opposite and poorly understood roles in the regulation of NF-κB binding and activity [119, 120].

Pre-bound TFs recruit p65 to specific regulatory regions (association analysis)

To quantify the strength of association we calculated the odds of p65-recruitment contingent on the presence of chromatin-associated proteins at TSS-distal regulatory regions (Fig. 3b). To control for occupancy we analyzed single-factor stage 1 and medium-occupancy (2–4 pre-bound TFs) stage 2 sites individually and evaluated statistical-significance using the Fisher’s exact test (see Methods). We found that BCL3, RAD21 (cohesin), and MAX, were most significantly associated with p65 recruitment at both classes of sites. Together, these observations show that pre-bound transcription factors influence the propensity of a site to recruit p65 after induction. More importantly, they suggest that sites co-occupied by several transcription factors can have markedly different propensities for p65 binding than the individual proteins, which provides indirect support for combinatorial regulation of transcription by specific TF recruitment [4, 88].

Cooperative p65 recruitment by clusters of TFs

Use of a “cooperativity matrix” for the identification of recruitment synergism

We decided to test this hypothesis computationally, with regard to p65 binding, by determining whether combinatorial interactions of proteins at the chromatin-level lead to cooperative recruitment of p65. We focused on stage 2 enhancers, as these sample pairwise interactions exhaustively, while controlling for DNase I accessibility and occupancy number (see Methods). Importantly, less than 10 % of stage 2 TSS-distal sites recruit p65 (see “all” TF curve with occupancy ranging from 2 to 4 in Fig. 3a), which suggests a relatively large degree of binding specificity. We calculated a “cooperativity matrix” (Methods), which shows whether sites co-occupied by pairs of TFs have increased (positive cooperativity) or decreased (negative cooperativity) propensity for p65 binding (Fig. 4a). A similar matrix was calculated for low-occupancy promoters (i.e. stage 1 TSS-proximal sites). These results are similar, and support most of the cooperative interactions observed at stage 2 enhancers (not shown).

Strong synergism between BCL3 and MAX in p65 recruitment

We found that the majority of interactions shows positive cooperativity; some of which are surprisingly strong. For example, sites occupied by both BCL3 and MAX show 37-fold higher propensity (log-odds 5.22) for p65 over sites bound by two random proteins (Methods). The synergism of this pair improves over the already strong enrichment of BCL3 on average (log-odds 1.77) (Fig. 3b). Several TFs participate in multiple synergistic interactions, most notably SP1, MAX, JUND, and YY1. Importantly, all of these TFs show individually a strong propensity for p65 (Fig. 3b). Several TFs appeared to form “cooperativity cliques”. For example, all pairwise interactions between MAX, RAD21, and JUND show positive cooperativity in p65 binding (Fig. 4a). Together, this suggests that p65 is recruited by clusters of chromatin-associated proteins with strong affinity for p65.

Association rules for the identification of cooperative TF clusters (optimization of combinations of TFs that recruit p65)

Our next goal was to identify combinatorial clusters of TFs that are predictive of p65 site selection [121, 122]. Intuitively, such “TF-clusters” should show the highest propensity for p65 when they are fully assembled on the chromatin. We used two heuristics that attempted to find orthogonal solutions to this combinatorial problem. The first method (TGC) was “greedy”, but optimized the statistical association (enrichment p-value) directly. The second technique (TNR [77]) attempted to identify all associations (strictly association rules [123]) with strong statistical support, which we next filtered for those that predict p65 binding (see Methods and Additional file 1: Methods). We ran both algorithms on TF occupancy data at stage 2 enhancers and report on the most significant TGC clusters as these were a high-confidence subset of the TNR clusters.

Sites occupied by CREB1, RAD21, JUND, and FOXA2 are preferentially bound by p65

We found a total of 21 TF-clusters with a high propensity for p65, and one which was significantly depleted of p65 binding (Table 2). Stage 2 enhancers occupied by any of these TF-clusters were on average 4-fold enriched for p65 and predicted p65 recruitment with genome-wide confidence (i.e. probability of p65 recruitment) over 60 % . Combinatorial clusters with the strongest enrichment had confidence over 70 % (Fig. 4b-d, Table 2). The TF-cluster with the highest propensity for p65 binding involved CREB1, RAD21, JUND, and FOXA2 (Fig. 4b). The only cluster with negative cooperativity in p65 recruitment comprised GATA3, TCF12, and USF1 (Fig. 4c left). Strikingly, TCF12 was also found in a positively cooperative TF-cluster with CREB1 and RAD21 (Fig. 4d right). Consistently, TCF12 was the only context-dependent TF involved in both negatively and positively cooperative pairwise interactions (Fig. 4a). On the contrary MAX and BCL3, which have high individual propensity did not appear to form cooperative clusters among the included TFs.

Table 2 Combinatorial clusters of chromatin-bound TFs most significantly associated with p65 recruitment

Full size table

Cooperative clusters are consistent with the pioneer-constitutive-inducible sequence of enhancer assembly

According to the current model of sequential enhanceosome assembly [35, 37, 124], lineage-determining TFs, such as PU.1 (SPI1) initiate remodeling at tissue-specific enhancers and act as pioneer TFs, which have the ability to “open” closed chromatin, particularly at H2A.Z-containing nucleosomes [110]. These accessible sites are then bound by ubiquitous TFs, which facilitate selective access for stress-inducible TFs. Our results are consistent with this model. First, we observed that the “pioneer” TF FOXA2 was positively associated with p65 recruitment only at sites, which were further co-occupied by additional TFs. FOXA2’s ability to markedly increase p65 binding frequency appeared to require at least two other proteins (Fig. 4d). Conversely, ubiquitously expressed TFs – SP1, MAX, YY1, JUND – had intrinsic individual propensities for p65, which were boosted through their frequent pairwise and higher-order interactions (Fig. 3b, Fig. 4, Table 2). For example, the TF-cluster with the largest p65 binding odds contained the transcription factors FOXA2 and JUND, together with RAD21 and CREB1 (Fig. 4b). In agreement with the enhanceosome model FOXA2 is the pioneer, while JUND unlike other AP-1 family members is constitutively expressed [125]. AP-1 (JUND/FOSL1) and NF-κB (p65/p50) have been shown to be required for the induction of the IL6 gene [126]. Consistently, we found that IL6 is among the most up-regulated genes (23-fold). We found that the majority of TF-clusters, including the top FOXA2-JUND pair, further included CREB1, RAD21, or both (Table 2). It has been recently reported that both, RAD21 (cohesin) and CREB1, have a constitutive role in the stabilization of large protein complexes at tissue-specific loci [127, 128]. Together, these observations suggest that the layered TF-network (pioneer, ubiquitous, induced) [37] involves CREB1 and RAD21 as scaffolding components. Strikingly, SP1, YY1, and JUND, have been shown to physically interact with each other or NF-κB [129–131], and thus might confer specificity for p65.

We observe a similar TF-network at the negatively cooperative TF-cluster (GATA3, TCF12, USF1). Analogously, GATA3 is a pioneer TF [132], USF1 is an ubiquitously expressed basic helix-loop-helix (bHLH) transcription factor [133], while TCF12 (HEB) is a bHLH required specifically for T-cell development [134]. Further, all three proteins show negative cooperativity with ETS1, a pioneer factor from the same family as PU.1. Consistently, several pairwise interactions within a small subset of proteins, most notably ETS1 and GATA3, showed moderate negative cooperativity in p65 recruitment (Fig. 4a). Interestingly, ETS1 is also also typically restricted to the T-cell lineage, which when compared to PU.1, shows a remarkable difference in sequence specificity, and was shown to interact with NFKB1 (p50) [40, 135, 136].

p65-mediated induction of pre-assembled promoters

We attempted to identify TSS-proximal and TSS-distal sites that are most frequently associated with p65-upregulated genes [137]. We find that the probability of a p65-bound site to be associated with an up-regulated gene depends, not surprisingly, on whether the site is a promoter or enhancer (Fig. 5). Reassuringly, we observed, that among all profiled TFs, sites bound by p65 are up-regulated most frequently (proportion difference p-value: 0.001 [79]). This confirms the role of p65 as a transcriptional activator. In addition, of the 415 known NF-κB target genes (see Methods), 102 are differentially expressed with 89 upregulated. Moreover, 51 of these known NF-κB target genes are differentially expressed and contain a p65 peak in their promoter (Additional file 2: Data). Strikingly, we found that the ability of p65 to induce gene expression is much higher at low-occupancy p65-recruiting promoters (low-occupancy vs high-occupancy p-value: < 1e-4), whereas no strong influence of occupancy on p65-induction was seen at TSS-distal sites (Fig. 5). This re-affirms that p65 binding to inaccessible sites (Fig. 1), is “functional” and has on average the same impact on transcriptional activation as p65 recruitment to pre-accessible sites (approx. 6 %). It also suggests that a relatively large subset (approx. 16 %) of low-occupancy promoters are particularly susceptible to transcriptional activation. As a corollary genes with high-occupancy promoters are not easily inducible (< 5 %). We observed that low-occupancy promoters bound by TAF1 had the highest chance of being up-regulated (Additional file 1: Discussion, Additional file 1: Figures S7-9), which raises the possibility that specific TFs loaded at TSS-proximal or TSS-distal sites would bias genes towards p65-mediated activation.

Discussion

After treatment of A549 with TNF and subsequent NF-κB activation we observed 65,253 p65 (RELA) binding sites genome-wide (Fig. 1). We tried to determine genome-wide cues that guide p65 to select accessible sites or alternatively bind at ostensibly inaccessible regions. Paradoxically, p65 binding was so ubiquitous that the majority of genes had a p65 peak in their proximity (not shown), which is in stark contrast to the relatively small number of 716 genes that were differentially expressed.

Our goal was to understand the regulatory logic of the NF-κB transcriptional network. Through large-scale data-mining and the development of sensu stricto predictive models we identified a number of testable hypotheses regarding which genomic features influence p65 binding site selection, and which promote p65-mediated transcriptional activation. We have found evidence of rapid p65 binding outside accessible chromatin; importantly this type of remodeling activity is typically attributed to pioneer transcription factors. Further, we identified combinatorial patterns of epigenetic marks and transcription factors that predict p65 recruitment. This type of association, although not strictly causal, shows to what extent the induced p65 cistrome is predetermined by the extant chromatin landscape.

NF-κB is recruited to chromatin like a pioneer TF

NF-κB dimers are believed to fall into the category of “signal sensors” inducible transcription factors (TF) that opportunistically exploit a pre-existing enhancer landscape [15, 40]. Our analyses suggest that p65 is recruited to all types of enhancers including “latent”, “primed”, and “active” ones [48, 93, 97, 102]. Our results suggests that within 30 min of TNF stimulation p65 is already, albeit at a relatively low level (Additional file 1: Figure S3), recruited to 16,032 putative latent enhancers in A549 cells, which represent 24.5 % of all p65 binding. This is similar to the time frame required for the “pioneer” transcription factor PU.1 to bind latent enhancers in macrophages (Fig. 4 in [93]). Consistently, Jin et al. have shown that in HeLa cells 17 % of p65 binding occurs outside accessible chromatin (Fig. 2d in [15]). These numbers should be contrasted with, for example, GR and FOXP3, which have, 5 and 2 % sites outside DHS regions, respectively [138, 139]. On the other hand, it has been previously shown that cytokine induced TFs have the ability to activate lineage-specific enhancers. A recent report suggests that gains in H3K4me1 and p300 at a subset of enhancers in Th2 cells are mediated by STAT6 [48].

Strikingly, we observed that p65 binding outside of CpG islands (CGI) occurs both at hyper- and hypo-methylated CpG sites (Fig. 1d). It has been recently shown that a similar subset of non-CGI regions switch from DNA methylation to H3K27me3 during early embryonic differentiation [92]. This process appeared to be associated with FOXA1 and its quite unique ability to bind methylated DNA and initiate chromatin remodeling [29, 91]. What is more, FOXA1 and FOXA2 have been shown to bind nucleosome-wrapped DNA [27, 110], and similarly, NF-κB dimers have a demonstrated ability to recognize nucleosome-occluded κB elements [140]. Pioneer TFs have the unique capability to trigger cellular reprogramming while chromatin remodeling was shown to be an early bottleneck of this process [31–33] (reviewed in [24, 34, 141]). Likewise, induction of NF-κB activity has been shown to accelerate epigenetic reprogramming, the epithelial-mesenchymal transition, and dedifferentiation [49, 51, 142]. Interestingly NF-κB activity is also required for B-cell and T-cell development and maturation [8, 143]. Since there is no evidence that NF-κB has the ability to initiate chromatin remodeling (reviewed in [40]) it is likely that the the above analogies reflect synergism between NF-κB and some unknown constitutive or induced pioneer transcription TF(s); possibly from the Stat or Irf families, which have overlapping targets and similar roles in development and cancer (e.g. [144, 145]).

NF-κB induces expression from enhancers and low-occupancy promoters

Further, our results highlight that p65 recruitment to latent enhancers has similar influence on the induction of putative late response (not simply binding) to primed and active regulatory regions (Fig. 5), which suggest that during the course of TNF treatment some of the latent sites ultimately develop into high-occupancy enhancers, recruit additional co-activators, and enhance transcription. Strikingly, we have found that promoters of the most upregulated putative late response genes had low-occupancy promoters with a RNAPII positioned immediately downstream of the TSS (Additional file 1: Figure S8), and conversely that at high-occupancy promoters p65 binding had less impact on transcription. Genes with a fully “loaded” preinitiation complex (PIC) including TFIID (TAF1) and other co-activators were modestly upregulated [146]. Why the majority of p65-bound promoters are inefficiently or transiently induced is, however, still an open question.

Epigenetic codes predict p65 recruitment

We observed that within accessible sites epigenetic marks and their combinatorial patterns are quantitative predictors of p65 recruitment. Building on initial results from previous reports [94, 147, 148] we found that marks influence the selection of binding sites even after controlling for DNase I hypersensitivity (DHS). Importantly, in the previous studies TF peaks and DHS sites were mapped in the same cellular state and provide limited independent information as they are expected to correlate almost perfectly [81]. We also found that overall H3K4me1 and H2A.Z, a histone variant associated with high nucleosome-turnover [149], are most significantly associated with p65 binding at enhancers and promoters (Fig. 2c–d). Importantly, H3K4me1 deposition has been shown to be triggered by lineage-specifying TFs [47], and not to change in response to stimulus [35, 89] (but see [48, 93]), and interpreted to mark enhancers “primed” for activation [97]. Our results suggest that H3K4me1 strongly influences p65 binding probability (Table 1) and provide evidence for synergism and context-dependent roles of select combinations of histone modifications in the recruitment of p65 judging by the uninterpretable results of simple additive models (Additional file 1: Figure S10). We find, for example, that H2A.Z is involved with other histone modifications in the cooperative recruitment of p65 (Fig. 2, Additional file 1: Figure S3) and that the co-occurrence of H3K4me1 and H2A.Z is the strongest predictor of p65 binding to TSS-distal enhancers and hence a de facto signature of “primed” enhancers [101]. Strikingly, we found that H2A.Z can have the opposite effect on p65 binding when it is in another context with the repressive H3K27me3 methylation (Fig. 2c). These findings are in agreement with a recent study which showed that recruitment of PRC2, the complex that deposits H3K27me3, is dependent on H2A.Z [115]. Interestingly, H2A.Z removal can be triggered by the pioneer TF FOXA2 [110].

Recruitment of p65 by pre-bound transcription factors

Current models of stimulus-dependent transcriptional activation describe the sequential recruitment of increasingly specialized TFs which leads to gene expression [35]. In general terms, competence for induction by the most specialized TF (including NF-κB) was proposed to depend on the combination of previously recruited proteins [37]. Implications from this model prompted us to explore whether pre-bound TFs influence p65 binding, and whether sites co-occupied by clusters of several TFs showed cooperativity in p65 recruitment. We found that sites occupied by BCL3 showed the largest propensity for p65. BCL3 is a nuclear IκB protein, but its role in the regulation of NF-κB binding and activity is context-dependent and poorly understood [150–152]. Importantly, it has been shown that BCL3 and IRF3 might play complementary roles as co-activators, at a subset of NF-κB targets, in TNF- and LPS-induced cells, respectively [5], which could mean that BCL3 can initiate chromatin remodeling.

Enhanceosome assembly, p65 recruitment and clusters of cooperative TFs

Strikingly, we found that the role of multiple TFs is context-dependent. For example, CREB1 is negatively associated with p65 binding when present at a site where no other TF is bound, but shows positive cooperativity with several other TFs at co-occupied sites (Fig. 4a). One of the roles of CREB1 is to recruit CBP, which is both a histone acetyltransferase and protein scaffold [153]. Interestingly, several of the TFs engaged in cooperative p65 recruitment are known to physically interact with CBP, but not with each other. This observation, together with recent reports on the role of CREB1 and RAD21 (cohesin) in the stabilization of large protein complexes [127, 128], suggest that p65 binding at tissue-specific loci is stabilized by RAD21 or CREB1.

Finally, we have found that sites occupied by specific clusters of TFs have very large positive or negative propensities for p65 recruitment. In line with other studies we interpret this as traces of cooperative binding [117, 154]. However, direct interaction between the pre-bound proteins and/or p65 might not be strictly necessary as cooperative binding has been observed in vivo within larger and more complex regulatory sites [80, 154] and can arise from competition with nucleosomes [155, 156]. Importantly, the hierarchy of both TF-clusters is consistent with the current models of sequential enhanceosome assembly and combinatorial regulation of gene expression [116]. Sites occupied by (CREB1, RAD21, JUND, FOXA2), were most enriched for p65 binding, whereas those occupied by (GATA3, TCF12, USF1), were most depleted. Interestingly, AP-1 (JUN/FOS) is also involved in the maintenance of glucocorticoid receptor (GR) accessible sites [22]. GR has an important anti-inflammatory role and acts via the inhibition of NF-κB-mediated gene induction [157, 158]. This suggests that NF-κB and GR have overlapping sets of target sites. A striking example is the transcriptional regulation of the IL6 promoter. Induction of IL6 has been shown to depend on p65 and JUND recruitment, whereas silencing required GR binding, which was mediated by FOXA2 [159].

References

Smale ST. Transcriptional regulation in the innate immune system. Curr Opin Immunol. 2012;24:51–7.
Article CAS PubMed PubMed Central Google Scholar
Sarnico I, Branca C, Lanzillotta A, Porrini V, Benarese M, Spano PF, et al. NF‐κB and epigenetic mechanisms as integrative regulators of brain resilience to anoxic stress. Brain Res. 2012;1476:203–10.
Article CAS PubMed Google Scholar
Biddie SC. Chromatin architecture and the regulation of nuclear receptor inducible transcription. J Neuroendocrinol. 2011;23:94–106.
Article CAS PubMed Google Scholar
Zinzen RP, Girardot C, Gagneur J, Braun M, Furlong EEM. Combinatorial binding predicts spatio-temporal cis-regulatory activity. Nature. 2009;462:65–70.
Article CAS PubMed Google Scholar
Leung TH, Hoffmann A, Baltimore D. One Nucleotide in a κB Site Can Determine Cofactor Specificity for NF-κB Dimers. Cell. 2004;118:453–64.
Article CAS PubMed Google Scholar
Reddy TE, Pauli F, Sprouse RO, Neff NF, Newberry KM, Garabedian MJ, et al. Genomic determination of the glucocorticoid response reveals unexpected mechanisms of gene regulation. Genome Res. 2009;19:2163–71.
Article CAS PubMed PubMed Central Google Scholar
Goenka S, Kaplan MH. Transcriptional regulation by STAT6. Immunol Res. 2011;50:87–96.
Article CAS PubMed PubMed Central Google Scholar
Vallabhapurapu S, Karin M. Regulation and Function of NF-κB Transcription Factors in the Immune System. Annu Rev Immunol. 2009;27:693–733.
Article CAS PubMed Google Scholar
Smale ST. Selective transcription in response to an inflammatory stimulus. Cell. 2010;140:833–44.
Article CAS PubMed PubMed Central Google Scholar
Oeckinghaus A, Hayden MS, Ghosh S. Crosstalk in NF-κB signaling pathways. Nat Immunol. 2011;12:695–708.
Article CAS PubMed Google Scholar
Perkins ND. The diverse and complex roles of NF-κB subunits in cancer. Nat Rev Cancer. 2012;12:121–32.
CAS PubMed Google Scholar
Gerondakis S, Grumont R, Gugasyan R, Wong L, Isomura I, Ho W, et al. Unravelling the complexities of the NF-κB signalling pathway using mouse knockout and transgenic models. Oncogene. 2006;25:6781–99.
Article CAS PubMed Google Scholar
Wong D, Teixeira A, Oikonomopoulos S, Humburg P, Lone IN, Saliba D, et al. Extensive characterization of NF-κB binding uncovers non-canonical motifs and advances the interpretation of genetic functional traits. Genome Biol. 2011;12:R70.
Article CAS PubMed PubMed Central Google Scholar
Smale ST. Dimer-specific regulatory mechanisms within the NF-κB family of transcription factors. Immunol Rev. 2012;246:193–204.
Article PubMed CAS Google Scholar
Jin F, Li Y, Ren B, Natarajan R. PU.1 and C/EBPα synergistically program distinct response to NF-κB activation through establishing monocyte specific enhancers. Proc Natl Acad Sci. 2011;108:5290–5.
Article CAS PubMed PubMed Central Google Scholar
Bulger M, Groudine M. Functional and mechanistic diversity of distal transcription enhancers. Cell. 2011;144:327–39.
Article CAS PubMed PubMed Central Google Scholar
Zhu J, Adli M, Zou JY, Verstappen G, Coyne M, Zhang X, et al. Genome-wide chromatin state transitions associated with developmental and environmental cues. Cell. 2013;152:642–54.
Article CAS PubMed PubMed Central Google Scholar
Shen H, Laird PW. Interplay between the cancer genome and epigenome. Cell. 2013;153:38–55.
Article CAS PubMed PubMed Central Google Scholar
Revollo JR, Cidlowski JA. Mechanisms generating diversity in glucocorticoid receptor signaling. Ann N Y Acad Sci. 2009;1179:167–78.
Article CAS PubMed Google Scholar
John S, Johnson TA, Sung M-H, Biddie SC, Trump S, Koch-Paiz CA, et al. Kinetic complexity of the global response to glucocorticoid receptor action. Endocrinology. 2009;150:1766–74.
Article CAS PubMed PubMed Central Google Scholar
John S, Sabo PJ, Johnson TA, Sung M-H, Biddie SC, Lightman SL, et al. Interaction of the glucocorticoid receptor with the chromatin landscape. Mol Cell. 2008;29:611–24.
Article CAS PubMed Google Scholar
Biddie SC, John S, Sabo PJ, Thurman RE, Johnson TA, Schiltz RL, et al. Transcription factor AP1 potentiates chromatin accessibility and glucocorticoid receptor binding. Mol Cell. 2011;43:145–55.
Article CAS PubMed PubMed Central Google Scholar
Langlais D, Couture C, Balsalobre A, Drouin J. The Stat3/GR interaction code: predictive value of direct/indirect DNA recruitment for transcription outcome. Mol Cell. 2012;47:38–49.
Article CAS PubMed Google Scholar
Zaret KS, Carroll JS. Pioneer transcription factors: establishing competence for gene expression. Genes Dev. 2011;25:2227–41.
Article CAS PubMed PubMed Central Google Scholar
Cirillo LA, Lin FR, Cuesta I, Friedman D, Jarnik M, Zaret KS. Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. Mol Cell. 2002;9:279–89.
Article CAS PubMed Google Scholar
Lupien M, Eeckhoute J, Meyer CA, Wang Q, Zhang Y, Li W, et al. FoxA1 translates epigenetic signatures into enhancer-driven lineage-specific transcription. Cell. 2008;132:958–70.
Article CAS PubMed PubMed Central Google Scholar
Sekiya T, Muthurajan UM, Luger K, Tulin AV, Zaret KS. Nucleosome-binding affinity as a primary determinant of the nuclear mobility of the pioneer transcription factor FoxA. Genes Dev. 2009;23:804–9.
Article CAS PubMed PubMed Central Google Scholar
Li G, Ruan X, Auerbach RK, Sandhu KS, Zheng M, Wang P, et al. Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation. Cell. 2012;148:84–98.
Article CAS PubMed PubMed Central Google Scholar
Sérandour AA, Avner S, Percevault F, Demay F, Bizot M, Lucchetti-Miganeh C, et al. Epigenetic switch involved in activation of pioneer factor FOXA1-dependent enhancers. Genome Res. 2011;21:555–65.
Article PubMed PubMed Central CAS Google Scholar
Xu J, Pope SD, Jazirehi AR, Attema JL, Papathanasiou P, Watts JA, et al. Pioneer factor interactions and unmethylated CpG dinucleotides mark silent tissue-specific enhancers in embryonic stem cells. Proc Natl Acad Sci. 2007;104:12377–82.
Article CAS PubMed PubMed Central Google Scholar
Koche RP, Smith ZD, Adli M, Gu H, Ku M, Gnirke A, et al. Reprogramming factor expression initiates widespread targeted chromatin remodeling. Cell Stem Cell. 2011;8:96–105.
Article CAS PubMed PubMed Central Google Scholar
Soufi A, Donahue G, Zaret KS. Facilitators and impediments of the pluripotency reprogramming Factors’ initial engagement with the genome. Cell. 2012;151:994–1004.
Article CAS PubMed PubMed Central Google Scholar
Ang Y-S, Tsai S-Y, Lee D-F, Monk J, Su J, Ratnakumar K, et al. Wdr5 mediates self-renewal and reprogramming via the embryonic stem cell core transcriptional network. Cell. 2011;145:183–97.
Article CAS PubMed PubMed Central Google Scholar
Papp B, Plath K. Epigenetics of reprogramming to induced pluripotency. Cell. 2013;152:1324–43.
Article CAS PubMed PubMed Central Google Scholar
Ghisletti S, Barozzi I, Mietton F, Polletti S, De Santa F, Venturini E, et al. Identification and characterization of enhancers controlling the inflammatory gene expression program in macrophages. Immunity. 2010;32:317–28.
Article CAS PubMed Google Scholar
Aksoy I, Jauch R, Chen J, Dyla M, Divakar U, Bogu GK, et al. Oct4 switches partnering from Sox2 to Sox17 to reinterpret the enhancer code and specify endoderm. EMBO J. 2013;32:938–53.
Article CAS PubMed PubMed Central Google Scholar
Garber M, Yosef N, Goren A, Raychowdhury R, Thielke A, Guttman M, et al. A high-throughput chromatin immunoprecipitation approach reveals principles of dynamic gene regulation in mammals. Mol Cell. 2012;47:810–22.
Article CAS PubMed Google Scholar
Tian B, Nowak DE, Brasier AR. A TNF-induced gene expression program under oscillatory NF-κB control. BMC Genomics. 2005;6:137.
Article PubMed PubMed Central CAS Google Scholar
Ashall L, Horton CA, Nelson DE, Paszek P, Harper CV, Sillitoe K, et al. Pulsatile stimulation determines timing and specificity of NF-κB-dependent transcription. Science. 2009;324:242–6.
Article CAS PubMed PubMed Central Google Scholar
Natoli G. NF-κB and chromatin: ten years on the path from basic mechanisms to candidate drugs. Immunol Rev. 2012;246:183–92.
Article PubMed CAS Google Scholar
Saccani S, Pantano S, Natoli G. p38-dependent marking of inflammatory genes for increased NF-κB recruitment. Nat Immunol. 2002;3:69–75.
Article CAS PubMed Google Scholar
Ramirez-Carrozzi VR, Nazarian AA, Li CC, Gore SL, Sridharan R, Imbalzano AN, et al. Selective and antagonistic functions of SWI/SNF and Mi-2β nucleosome remodeling complexes during an inflammatory response. Genes Dev. 2006;20:282–96.
Article CAS PubMed PubMed Central Google Scholar
Natoli G. Control of NF-kappaB-dependent transcriptional responses by chromatin organization. Cold Spring Harb Perspect Biol. 2009;1:a000224.
Article PubMed PubMed Central CAS Google Scholar
Ramirez-Carrozzi VR, Braas D, Bhatt DM, Cheng CS, Hong C, Doty KR, et al. A unifying model for the selective regulation of inducible transcription by CpG islands and nucleosome remodeling. Cell. 2009;138:114–28.
Article CAS PubMed PubMed Central Google Scholar
Rao NAS, McCalman MT, Moulos P, Francoijs K-J, Chatziioannou A, Kolisis FN, et al. Coactivation of GR and NFKB alters the repertoire of their binding sites and target genes. Genome Res. 2011;21:1404–16.
Article CAS PubMed PubMed Central Google Scholar
Raskatov JA, Meier JL, Puckett JW, Yang F, Ramakrishnan P, Dervan PB. Modulation of NF-κB-dependent gene transcription using programmable DNA minor groove binders. Proc Natl Acad Sci. 2012;109:1023–8.
Article CAS PubMed Google Scholar
Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38:576–89.
Article CAS PubMed PubMed Central Google Scholar
Vahedi G, Takahashi H, Nakayamada S, Sun H, Sartorelli V, Kanno Y, et al. STATs shape the active enhancer landscape of T cell populations. Cell. 2012;151:981–93.
Article CAS PubMed PubMed Central Google Scholar
Huber MA, Azoitei N, Baumann B, Grünert S, Sommer A, Pehamberger H, et al. NF-κB is essential for epithelial-mesenchymal transition and metastasis in a model of breast cancer progression. J Clin Invest. 2004;114:569–81.
Article CAS PubMed PubMed Central Google Scholar
Naugler WE, Karin M. NF-κB and cancer — identifying targets and mechanisms. Curr Opin Genet Dev. 2008;18:19–26.
Article CAS PubMed PubMed Central Google Scholar
Kawata M, Koinuma D, Ogami T, Umezawa K, Iwata C, Watabe T, et al. TGF-β-induced epithelial-mesenchymal transition of A549 lung adenocarcinoma cells is enhanced by pro-inflammatory cytokines derived from RAW 264.7 macrophage cells. J Biochem (Tokyo). 2012;151:205–16.
Article CAS Google Scholar
Maier HJ, Schmidt-Straßburger U, Huber MA, Wiedemann EM, Beug H, Wirth T. NF-κB promotes epithelial–mesenchymal transition, migration and invasion of pancreatic carcinoma cells. Cancer Lett. 2010;295:214–28.
Article CAS PubMed Google Scholar
MacArthur S, Li X-Y, Li J, Brown JB, Chu HC, Zeng L, et al. Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions. Genome Biol. 2009;10:1–26.
Article CAS Google Scholar
Lu R, Markowetz F, Unwin RD, Leek JT, Airoldi EM, MacArthur BD, et al. Systems-level dynamic analyses of fate change in murine embryonic stem cells. Nature. 2009;462:358–62.
Article CAS PubMed PubMed Central Google Scholar
Lieber M, Smith B, Szakal A, Nelson-Rees W, Todaro G. A continuous tumor-cell line from a human lung carcinoma with properties of type II alveolar epithelial cells. Int J Cancer. 1976;17:62–70.
Article CAS PubMed Google Scholar
Kumar M, Allison DF, Baranova NN, Wamsley JJ, Katz AJ, Bekiranov S, et al. NF-κB regulates mesenchymal transition for the induction of Non-small cell lung cancer initiating cells. PLoS One. 2013;8:e68597.
Article CAS PubMed PubMed Central Google Scholar
Neph S, Vierstra J, Stergachis AB, Reynolds AP, Haugen E, Vernot B, et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature. 2012;489:83–90.
Article CAS PubMed PubMed Central Google Scholar
Meissner A, Gnirke A, Bell GW, Ramsahoye B, Lander ES, Jaenisch R. Reduced representation bisulfite sequencing for comparative high-resolution DNA methylation analysis. Nucleic Acids Res. 2005;33:5868–77.
Article CAS PubMed PubMed Central Google Scholar
Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007;448:553–60.
Article CAS PubMed PubMed Central Google Scholar
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5:621–8.
Article CAS PubMed Google Scholar
Feng X, Grossman R, Stein L. PeakRanger: A cloud-enabled peak caller for ChIP-seq data. BMC Bioinformatics. 2011;12:139.
Article PubMed PubMed Central Google Scholar
Boyle AP, Davis S, Shulha HP, Meltzer P, Margulies EH, Weng Z, et al. High-resolution mapping and characterization of open chromatin across the genome. Cell. 2008;132:311–22.
Article CAS PubMed PubMed Central Google Scholar
Machanick P, Bailey TL. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics. 2011;27:1696–7.
Article CAS PubMed PubMed Central Google Scholar
Gupta S, Stamatoyannopoulos JA, Bailey TL, Noble WS. Quantifying similarity between motifs. Genome Biol. 2007;8:R24.
Article PubMed PubMed Central CAS Google Scholar
Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 2005;15:1034–50.
Article CAS PubMed PubMed Central Google Scholar
Wu H, Caffo B, Jaffee H, Feinberg A, Irizarry R. Redefining CpG Islands using a hideen markov model. Johns Hopkins Univ Dept Biostat Work Pap. 2009.
Gardiner-Garden M, Frommer M. CpG Islands in vertebrate genomes. J Mol Biol. 1987;196:261–82.
Article CAS PubMed Google Scholar
Stadler MB, Murr R, Burger L, Ivanek R, Lienert F, Schöler A, et al. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature. 2011;480:490–5.
CAS PubMed Google Scholar
Jee J, Rozowsky J, Yip KY, Lochovsky L, Bjornson R, Zhong G, et al. ACT: aggregation and correlation toolbox for analyses of genome tracks. Bioinformatics. 2011;27:1152–4.
Article CAS PubMed PubMed Central Google Scholar
Jain A, Nandakumar K, Ross A. Score normalization in multimodal biometric systems. Pattern Recognit. 2005;38:2270–85.
Article Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in python. J Mach Learn Res. 2011;12:2825–30.
Google Scholar
Baldi P, Brunak S, Chauvin Y, Andersen CAF, Nielsen H. Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics. 2000;16:412–24.
Article CAS PubMed Google Scholar
Newton MA, Quintana FA, den Boon JA, Sengupta S, Ahlquist P. Random-Set methods identify distinct aspects of the enrichment signal in gene-Set analysis. Ann Appl Stat. 2007;1:85–106.
Article Google Scholar
Lee DD, Seung HS. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401:788–91.
Article CAS PubMed Google Scholar
Lin C-J. Projected gradient methods for nonnegative matrix factorization. Neural Comput. 2007;19:2756–79.
Article PubMed Google Scholar
Cieslik M, Bekiranov S. Combinatorial epigenetic patterns as quantitative predictors of chromatin biology. BMC Genomics. 2014;15:76.
Article PubMed PubMed Central CAS Google Scholar
Fournier-Viger P, Tseng VS. Mining top-K non-redundant association rules. In Found Intell Syst. Berlin Heidelberg, Germany: Springer; 2012:31–40.
Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106.
Article CAS PubMed PubMed Central Google Scholar
Newcombe RG. Improved confidence intervals for the difference between binomial proportions based on paired data. Stat Med. 1998;17:2635–50.
Article CAS PubMed Google Scholar
Boyes J, Felsenfeld G. Tissue-specific factors additively increase the probability of the all-or-none formation of a hypersensitive site. EMBO J. 1996;15:2496–507.
CAS PubMed PubMed Central Google Scholar
Song L, Zhang Z, Grasfeder LL, Boyle AP, Giresi PG, Lee B-K, et al. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Res. 2011;21:1757–67.
Article CAS PubMed PubMed Central Google Scholar
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, et al. The accessible chromatin landscape of the human genome. Nature. 2012;489:75–82.
Article CAS PubMed PubMed Central Google Scholar
Biggin MD. Animal transcription networks as highly connected, quantitative continua. Dev Cell. 2011;21:611–26.
Article CAS PubMed Google Scholar
Guccione E, Martinato F, Finocchiaro G, Luzi L, Tizzoni L, Dall’ Olio V, et al. Myc-binding-site recognition in the human genome is determined by chromatin context. Nat Cell Biol. 2006;8:764–70.
Article CAS PubMed Google Scholar
Birney E, Stamatoyannopoulos JA, Dutta A, Guigó R, Gingeras TR, Margulies EH, et al. Identification and analysis of functional elements in 1 % of the human genome by the ENCODE pilot project. Nature. 2007;447:799–816.
Article CAS PubMed Google Scholar
Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13:484–92.
Article CAS PubMed Google Scholar
Aryee MJ, Liu W, Engelmann JC, Nuhn P, Gurel M, Haffner MC, et al. DNA methylation alterations exhibit intraindividual stability and interindividual heterogeneity in prostate cancer metastases. Sci Transl Med. 2013;5:169ra10.
Article PubMed PubMed Central CAS Google Scholar
Filion GJ, van Bemmel JG, Braunschweig U, Talhout W, Kind J, Ward LD, et al. Systematic protein location mapping reveals five principal chromatin types in drosophila cells. Cell. 2010;143:212–24.
Article CAS PubMed PubMed Central Google Scholar
Bonn S, Zinzen RP, Girardot C, Gustafson EH, Perez-Gonzalez A, Delhomme N, et al. Tissue-specific analysis of chromatin state identifies temporal signatures of enhancer activity during embryonic development. Nat Genet. 2012;44:148–56.
Article CAS PubMed Google Scholar
Ku M, Jaffe JD, Koche RP, Rheinbay E, Endoh M, Koseki H, et al. H2A.Z landscapes and dual modifications in pluripotent and multipotent stem cells underlie complex genome regulatory functions. Genome Biol. 2012;13:R85.
Article CAS PubMed PubMed Central Google Scholar
Gifford CA, Meissner A. Epigenetic obstacles encountered by transcription factors: reprogramming against all odds. Curr Opin Genet Dev. 2012;22:409–15.
Article CAS PubMed PubMed Central Google Scholar
Gifford CA, Ziller MJ, Gu H, Trapnell C, Donaghey J, Tsankov A, et al. Transcriptional and epigenetic dynamics during specification of human embryonic stem cells. Cell. 2013;153(5):1149–63.
Article CAS PubMed PubMed Central Google Scholar
Ostuni R, Piccolo V, Barozzi I, Polletti S, Termanini A, Bonifacio S, et al. Latent enhancers activated by stimulation in differentiated cells. Cell. 2013;152:157–71.
Article CAS PubMed Google Scholar
Arvey A, Agius P, Noble WS, Leslie C. Sequence and chromatin determinants of cell-type–specific transcription factor binding. Genome Res. 2012;22:1723–34.
Article CAS PubMed PubMed Central Google Scholar
Pique-Regi R, Degner JF, Pai AA, Gaffney DJ, Gilad Y, Pritchard JK. Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data. Genome Res. 2011;21:447–55.
Article CAS PubMed PubMed Central Google Scholar
Won K-J, Ren B, Wang W. Genome-wide prediction of transcription factor binding sites using an integrated model. Genome Biol. 2010;11:R7.
Article PubMed PubMed Central CAS Google Scholar
Mercer EM, Lin YC, Benner C, Jhunjhunwala S, Dutkowski J, Flores M, et al. Multilineage priming of enhancer repertoires precedes commitment to the B and myeloid cell lineages in hematopoietic progenitors. Immunity. 2011;35:413–25.
Article CAS PubMed PubMed Central Google Scholar
Heintzman ND, Hon GC, Hawkins RD, Kheradpour P, Stark A, Harp LF, et al. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature. 2009;459:108–12.
Article CAS PubMed PubMed Central Google Scholar
Rada-Iglesias A, Bajpai R, Swigut T, Brugmann SA, Flynn RA, Wysocka J. A unique chromatin signature uncovers early developmental enhancers in humans. Nature. 2011;470:279–83.
Article CAS PubMed Google Scholar
Zentner GE, Tesar PJ, Scacheri PC. Epigenetic signatures distinguish multiple classes of enhancers with distinct cellular functions. Genome Res. 2011;21:1273–83.
Article CAS PubMed PubMed Central Google Scholar
Calo E, Wysocka J. Modification of enhancer chromatin: What, How, and Why? Mol Cell. 2013;49:825–37.
Article CAS PubMed Google Scholar
Creyghton MP, Cheng AW, Welstead GG, Kooistra T, Carey BW, Steine EJ, et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc Natl Acad Sci U S A. 2010;107:21931–6.
Article CAS PubMed PubMed Central Google Scholar
Kundaje A, Kyriazopoulou-Panagiotopoulou S, Libbrecht M, Smith CL, Raha D, Winters EE, et al. Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements. Genome Res. 2012;22:1735–47.
Article CAS PubMed PubMed Central Google Scholar
Bargaje R, Alam MP, Patowary A, Sarkar M, Ali T, Gupta S, et al. Proximity of H2A.Z containing nucleosome to the transcription start site influences gene expression levels in the mammalian liver and brain. Nucleic Acids Res. 2012;40:8965–78.
Article CAS PubMed PubMed Central Google Scholar
Wong MM, Cox LK, Chrivia JC. The chromatin remodeling protein, SRCAP, is critical for deposition of the histone variant H2A.Z at promoters. J Biol Chem. 2007;282:26132–9.
Article CAS PubMed Google Scholar
Creyghton MP, Markoulaki S, Levine SS, Hanna J, Lodato MA, Sha K, et al. H2AZ is enriched at polycomb complex target genes in ES cells and is necessary for lineage commitment. Cell. 2008;135:649–61.
Article CAS PubMed PubMed Central Google Scholar
Strahl BD, Allis CD. The language of covalent histone modifications. Nature. 2000;403:41–5.
Article CAS PubMed Google Scholar
Wang Z, Zang C, Rosenfeld JA, Schones DE, Barski A, Cuddapah S, et al. Combinatorial patterns of histone acetylations and methylations in the human genome. Nat Genet. 2008;40:897–903.
Article CAS PubMed PubMed Central Google Scholar
Wang Z, Zang C, Cui K, Schones DE, Barski A, Peng W, et al. Genome-wide mapping of HATs and HDACs reveals distinct functions in active and inactive genes. Cell. 2009;138:1019–31.
Article CAS PubMed PubMed Central Google Scholar
Li Z, Gadue P, Chen K, Jiao Y, Tuteja G, Schug J, et al. Foxa2 and H2A.Z mediate nucleosome depletion during embryonic stem cell differentiation. Cell. 2012;151:1608–16.
Article CAS PubMed PubMed Central Google Scholar
Brunet J-P, Tamayo P, Golub TR, Mesirov JP. Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci. 2004;101:4164–9.
Article CAS PubMed PubMed Central Google Scholar
Devarajan K. Nonnegative matrix factorization: An analytical and interpretive tool in computational biology. PLoS Comput Biol. 2008;4:e1000029.
Article PubMed PubMed Central CAS Google Scholar
Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA. A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007;130:77–88.
Article CAS PubMed PubMed Central Google Scholar
Gelman A. Scaling regression inputs by dividing by two standard deviations. Stat Med. 2008;27:2865–73.
Article PubMed Google Scholar
Hu G, Cui K, Northrup D, Liu C, Wang C, Tang Q, et al. H2A.Z facilitates access of active and repressive complexes to chromatin in embryonic stem cell self-renewal and differentiation. Cell Stem Cell. 2013;12:180–92.
Article CAS PubMed Google Scholar
Ravasi T, Suzuki H, Cannistraci CV, Katayama S, Bajic VB, Tan K, et al. An atlas of combinatorial transcriptional regulation in mouse and man. Cell. 2010;140:744–52.
Article CAS PubMed Google Scholar
Wilson NK, Foster SD, Wang X, Knezevic K, Schütte J, Kaimakis P, et al. Combinatorial transcriptional control in blood stem/progenitor cells: Genome-wide analysis of Ten major transcriptional regulators. Cell Stem Cell. 2010;7:532–44.
Article CAS PubMed Google Scholar
Yip KY, Cheng C, Bhardwaj N, Brown JB, Leng J, Kundaje A, et al. Classification of human genomic regions based on experimentally determined binding sites of more than 100 transcription-related factors. Genome Biol. 2012;13:R48.
Article CAS PubMed PubMed Central Google Scholar
Massoumi R, Chmielarska K, Hennecke K, Pfeifer A, Fässler R. Cyld inhibits tumor cell proliferation by blocking Bcl-3-dependent NF-κB signaling. Cell. 2006;125:665–77.
Article CAS PubMed Google Scholar
Palmer S, Chen YH. Bcl-3, a multifaceted modulator of NF-κB-mediated gene transcription. Immunol Res. 2008;42:210–8.
Article CAS PubMed Google Scholar
Whitington T, Frith MC, Johnson J, Bailey TL. Inferring transcription factor complexes from ChIP-seq data. Nucleic Acids Res. 2011;39:e98.
Article CAS PubMed PubMed Central Google Scholar
Lee B-K, Bhinge AA, Battenhouse A, McDaniell RM, Liu Z, Song L, et al. Cell-type specific and combinatorial usage of diverse transcription factors revealed by genome-wide binding studies in multiple human cells. Genome Res. 2012;22:9–24.
Article CAS PubMed PubMed Central Google Scholar
Agrawal R, Imieliński T, Swami A. Mining association rules between sets of items in large databases. SIGMOD Rec. 1993;22:207–16.
Article Google Scholar
Agalioti T, Lomvardas S, Parekh B, Yie J, Maniatis T, Thanos D. Ordered recruitment of chromatin modifying and general transcription factors to the IFN-β promoter. Cell. 2000;103:667–78.
Article CAS PubMed Google Scholar
Hernandez JM, Floyd DH, Weilbaecher KN, Green PL, Boris-Lawrie K. Multiple facets of junD gene expression are atypical among AP-1 family members. Oncogene. 2008;27:4757–67.
Article CAS PubMed PubMed Central Google Scholar
Zerbini LF, Wang Y, Cho J-Y, Libermann TA. Constitutive activation of nuclear factor κB p50/p65 and Fra-1 and JunD is essential for deregulated interleukin 6 expression in prostate cancer. Cancer Res. 2003;63:2206–15.
CAS PubMed Google Scholar
Faure AJ, Schmidt D, Watt S, Schwalie PC, Wilson MD, Xu H, et al. Cohesin regulates tissue-specific expression by stabilising highly occupied cis-regulatory modules. Genome Res. 2012;22(11):2163–75.
Article CAS PubMed PubMed Central Google Scholar
Everett LJ, Lay JL, Lukovac S, Bernstein D, Steger DJ, Lazar MA, et al. Integrative genomic analysis of CREB defines a critical role for transcription factor networks in mediating the fed/fasted switch in liver. BMC Genomics. 2013;14:337.
Article CAS PubMed PubMed Central Google Scholar
Chun RF, Semmes OJ, Neuveut C, Jeang K-T. Modulation of Sp1 Phosphorylation by Human Immunodeficiency Virus Type 1 Tat. J Virol. 1998;72:2615–29.
CAS PubMed PubMed Central Google Scholar
Lee JS, Galvin KM, Shi Y. Evidence for physical interaction between the zinc-finger transcription factors YY1 and Sp1. Proc Natl Acad Sci U S A. 1993;90:6145–9.
Article CAS PubMed PubMed Central Google Scholar
Toualbi-Abed K, Daniel F, Güller MC, Legrand A, Mauriz J-L, Mauviel A, et al. Jun D cooperates with p65 to activate the proximal κB site of the cyclin D1 promoter: role of PI3K/PDK-1. Carcinogenesis. 2008;29:536–43.
Article CAS PubMed Google Scholar
Frietze S, Wang R, Yao L, Tak YG, Ye Z, Gaddis M, et al. Cell type-specific binding patterns reveal that TCF7L2 can be tethered to the genome by association with GATA3. Genome Biol. 2012;13:R52.
Article CAS PubMed PubMed Central Google Scholar
Huang S, Li X, Yusufzai TM, Qiu Y, Felsenfeld G. USF1 recruits histone modification complexes and is critical for maintenance of a chromatin barrier. Mol Cell Biol. 2007;27:7991–8002.
Article CAS PubMed PubMed Central Google Scholar
Schotte R, Dontje W, Nagasawa M, Yasuda Y, Bakker AQ, Spits H, et al. Synergy between IL-15 and Id2 promotes the expansion of human NK progenitor cells, which Can Be counteracted by the E protein HEB required to drive T cell development. J Immunol. 2010;184:6670–9.
Article CAS PubMed Google Scholar
Hollenhorst PC, McIntosh LP, Graves BJ. Genomic and biochemical insights into the specificity of ETS transcription factors. Annu Rev Biochem. 2011;80:437–71.
Article CAS PubMed Google Scholar
Dickinson LA, Trauger JW, Baird EE, Dervan PB, Graves BJ, Gottesfeld JM. Inhibition of Ets-1 DNA binding and ternary complex formation between Ets-1, NF-κB, and DNA by a designed DNA-binding ligand. J Biol Chem. 1999;274:12765–73.
Article CAS PubMed Google Scholar
Apostolou E, Thanos D. Virus infection induces NF-κB-dependent interchromosomal associations mediating monoallelic IFN-β gene expression. Cell. 2008;134:85–96.
Article CAS PubMed Google Scholar
Samstein RM, Arvey A, Josefowicz SZ, Peng X, Reynolds A, Sandstrom R, et al. Foxp3 exploits a Pre-existent enhancer landscape for regulatory T cell lineage specification. Cell. 2012;151:153–66.
Article CAS PubMed PubMed Central Google Scholar
John S, Sabo PJ, Thurman RE, Sung M-H, Biddie SC, Johnson TA, et al. Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat Genet. 2011;43:264–8.
Article CAS PubMed Google Scholar
Angelov D, Lenouvel F, Hans F, Müller CW, Bouvet P, Bednar J, et al. The histone octamer is invisible when NF-κB binds to the nucleosome. J Biol Chem. 2004;279:42374–82.
Article CAS PubMed Google Scholar
Papp B, Plath K. Reprogramming to pluripotency: stepwise resetting of the epigenetic landscape. Cell Res. 2011;21:486–501.
Article CAS PubMed PubMed Central Google Scholar
Lee J, Sayed N, Hunter A, Au KF, Wong WH, Mocarski ES, et al. Activation of innate immunity is required for efficient nuclear reprogramming. Cell. 2012;151:547–58.
Article CAS PubMed PubMed Central Google Scholar
Feng B, Cheng S, Pear WS, Liou H-C. NF-kB inhibitor blocks B cell development at two checkpoints. Med Immunol. 2004;3:1.
Article PubMed PubMed Central Google Scholar
Wietek C, Miggin SM, Jefferies CA, O’Neill LAJ. Interferon regulatory factor-3-mediated activation of the interferon-sensitive response element by toll-like receptor (TLR) 4 but Not TLR3 requires the p65 subunit of NF-κ. J Biol Chem. 2003;278:50923–31.
Article CAS PubMed Google Scholar
Lee H, Herrmann A, Deng J-H, Kujawski M, Niu G, Li Z, et al. Persistently activated Stat3 maintains constitutive NF-κB activity in tumors. Cancer Cell. 2009;15:283–93.
Article CAS PubMed PubMed Central Google Scholar
Ainbinder E, Revach M, Wolstein O, Moshonov S, Diamant N, Dikstein R. Mechanism of rapid transcriptional induction of tumor necrosis factor alpha-responsive genes by NF-κB. Mol Cell Biol. 2002;22:6354–62.
Article CAS PubMed PubMed Central Google Scholar
Cuellar-Partida G, Buske FA, McLeay RC, Whitington T, Noble WS, Bailey TL. Epigenetic priors for identifying active transcription factor binding sites. Bioinformatics. 2012;28:56–62.
Article CAS PubMed Google Scholar
Ramsey SA, Knijnenburg TA, Kennedy KA, Zak DE, Gilchrist M, Gold ES, et al. Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites. Bioinformatics. 2010;26:2071–5.
Article CAS PubMed PubMed Central Google Scholar
Henikoff S, Henikoff JG, Sakai A, Loeb GB, Ahmad K. Genome-wide profiling of salt fractions maps physical properties of chromatin. Genome Res. 2009;19:460–9.
Article CAS PubMed PubMed Central Google Scholar
Hayden MS, Ghosh S. Shared principles in NF-κB signaling. Cell. 2008;132:344–62.
Article CAS PubMed Google Scholar
Carmody RJ, Ruan Q, Palmer S, Hilliard B, Chen YH. Negative regulation of toll-like receptor signaling by NF-{kappa}B p50 ubiquitination blockade. Science. 2007;317:675–8.
Article CAS PubMed Google Scholar
Brenne A-T, Fagerli U-M, Shaughnessy Jr JD, Våtsveen TK, Rø TB, Hella H, et al. High expression of BCL3 in human myeloma cells is associated with increased proliferation and inferior prognosis. Eur J Haematol. 2009;82:354–63.
Article CAS PubMed PubMed Central Google Scholar
Iyer NG, Özdag H, Caldas C. p300/CBP and cancer. Oncogene. 2004;23:4225–31.
Article CAS PubMed Google Scholar
Junion G, Spivakov M, Girardot C, Braun M, Gustafson EH, Birney E, et al. A transcription factor collective defines cardiac cell fate and reflects lineage history. Cell. 2012;148:473–86.
Article CAS PubMed Google Scholar
Panne D. The enhanceosome. Curr Opin Struct Biol. 2008;18:236–42.
Article CAS PubMed Google Scholar
Mirny LA. Nucleosome-mediated cooperativity between transcription factors. Proc Natl Acad Sci. 2010;107:22534–9.
Article CAS PubMed PubMed Central Google Scholar
Bosscher KD, Berghe WV, Vermeulen L, Plaisance S, Boone E, Haegeman G. Glucocorticoids repress NF-κB-driven genes by disturbing the interaction of p65 with the basal transcription machinery, irrespective of coactivator levels in the cell. Proc Natl Acad Sci. 2000;97:3919–24.
Article PubMed PubMed Central Google Scholar
Bosscher KD, Berghe WV, Haegeman G. The interplay between the glucocorticoid receptor and nuclear factor-κB or activator protein-1: Molecular mechanisms for gene repression. Endocr Rev. 2003;24:488–522.
Article PubMed CAS Google Scholar
Li Z, White P, Tuteja G, Rubins N, Sackett S, Kaestner KH. Foxa1 and Foxa2 regulate bile duct development in mice. J Clin Invest. 2009;119:1537–45.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

This research has been partially funded by the University of Virginia startup grant to SB.

Author information

Authors and Affiliations

Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, Virginia, USA
Marcin Cieślik & Stefan Bekiranov
Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, Michigan, 48109, USA
Marcin Cieślik

Authors

Marcin Cieślik
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Bekiranov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Stefan Bekiranov.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MC and SB designed the study. MC performed the analyses, interpreted the results, and wrote the manuscript draft. SB supervised the research and edited the manuscript. Both authors read and approved the final manuscript.

Additional files

Additional file 1:

Supplemental Material (Discussion, Methods, Figures, and Tables). (PDF 2793 kb)

Additional file 2:

NF-κB Target Genes (Gene Name, RefSeq Id, Ensembl Id), log₂Fold Change (TNF stimulated vs unstimulated), Differential Expression Significance Call (TRUE/FALSE) and NF-κB Peak Detected after TNF Stimulation (TRUE/FALSE) (Data). (CSV 24 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Cieślik, M., Bekiranov, S. Genome-wide predictors of NF-κB recruitment and transcriptional activity. BioData Mining 8, 37 (2015). https://doi.org/10.1186/s13040-015-0071-3

Download citation

Received: 13 June 2014
Accepted: 18 November 2015
Published: 26 November 2015
DOI: https://doi.org/10.1186/s13040-015-0071-3

Genome-wide predictors of NF-κB recruitment and transcriptional activity

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Computational modeling of chromatin accessibility identified important epigenomic regulators

MKL1 defines the H3K4Me3 landscape for NF-κB dependent inflammatory response

Determinants of transcription factor regulatory range

Background

Methods

Peak calling

Motif analysis

Site conservation

Known NF-κB targets

CpG islands

DNA methylation

Regulatory sites definition

Regulatory sites classification

Enrichment profiles

ChIP-seq data scaling

Logistic regression

Epigenetic mark correlations

Epigenetic mark interactions

Association between continuous levels and binary response

Combinatorial NMF “codes” of epigenetic marks

Combinatorial chromatin clusters of transcription factors

Linking genes, TSS-proximal and TSS-distal sites

Differential expression

Statistical analysis of proportions

Results

p65 is recruited to “latent” and “primed” enhancers

Recruitment to hyper-methylated distal regulatory sites (statistical enrichment analysis)

Rapid recruitment to inaccessible heterochromatic regions

Differences in histone modification levels at distinct classes of p65 binding sites (comparison of distributions)

Pioneering recruitment of p65 to latent and epigenetically silent enhancers

Epigenetic marks are quantitative predictors of p65 recruitment

Univariate predictive modeling of p65 recruitment (logistic regression analysis)

Context-dependent roles of epigenetic marks in p65 recruitment

Synergism and correlation between predictive chromatin features

Multivariate predictive modeling using non-negative matrix factorization (NMF)

Signals of p65-targeting “histone codes” (logistic regression analysis using NMF “code” weights as predictors)

p65 site selection is guided by the presence of specific TFs and overall occupancy

Pre-bound TFs recruit p65 to specific regulatory regions (association analysis)

Cooperative p65 recruitment by clusters of TFs

Use of a “cooperativity matrix” for the identification of recruitment synergism

Strong synergism between BCL3 and MAX in p65 recruitment

Association rules for the identification of cooperative TF clusters (optimization of combinations of TFs that recruit p65)

Sites occupied by CREB1, RAD21, JUND, and FOXA2 are preferentially bound by p65

Cooperative clusters are consistent with the pioneer-constitutive-inducible sequence of enhancer assembly

p65-mediated induction of pre-assembled promoters

Discussion

NF-κB is recruited to chromatin like a pioneer TF

NF-κB induces expression from enhancers and low-occupancy promoters

Epigenetic codes predict p65 recruitment

Recruitment of p65 by pre-bound transcription factors

Enhanceosome assembly, p65 recruitment and clusters of cooperative TFs

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Additional files

Additional file 1:

Additional file 2:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation