Skip to main content

Genome-wide analysis of mono-, di- and trimethylation of histone H3 lysine 4 in Arabidopsis thaliana

Abstract

Background

Post-translational modifications of histones play important roles in maintaining normal transcription patterns by directly or indirectly affecting the structural properties of the chromatin. In plants, methylation of histone H3 lysine 4 (H3K4me) is associated with genes and required for normal plant development.

Results

We have characterized the genome-wide distribution patterns of mono-, di- and trimethylation of H3K4 (H3K4me1, H3K4me2 and H3K4me3, respectively) in Arabidopsis thaliana seedlings using chromatin immunoprecipitation and high-resolution whole-genome tiling microarrays (ChIP-chip). All three types of H3K4me are found to be almost exclusively genic, and two-thirds of Arabidopsis genes contain at least one type of H3K4me. H3K4me2 and H3K4me3 accumulate predominantly in promoters and 5' genic regions, whereas H3K4me1 is distributed within transcribed regions. In addition, H3K4me3-containing genes are highly expressed with low levels of tissue specificity, but H3K4me1 or H3K4me2 may not be directly involved in transcriptional activation. Furthermore, the preferential co-localization of H3K4me3 and H3K27me3 found in mammals does not appear to occur in plants at a genome-wide level, but H3K4me2 and H3K27me3 co-localize at a higher-than-expected frequency. Finally, we found that H3K4me2/3 and DNA methylation appear to be mutually exclusive, but surprisingly, H3K4me1 is highly correlated with CG DNA methylation in the transcribed regions of genes.

Conclusions

H3K4me plays widespread roles in regulating gene expression in plants. Although many aspects of the mechanisms and functions of H3K4me appear to be conserved among all three kingdoms, we observed significant differences in the relationship between H3K4me and transcription or other epigenetic pathways in plants and mammals.

Background

Post-translational modifications of histones play important roles in maintaining normal transcription patterns by directly or indirectly affecting the structural properties of the chromatin. Histone modifications are highly complex due to the large number of residues that can be modified as well as the variety of modification types (for example, methylation, acetylation, phosphorylation and ubiquitination, and so on) [1]. In addition, in the case of lysine methylation, a lysine residue can be mono-, di- or trimethylated with potentially different effects on chromatin structure [2–4]. Some histone modifications can directly alter chromatin structure. For example, acetylation of specific residues in the globular core domains of histones weakens the histone-DNA interactions, resulting in a relatively 'open' chromatin structure that facilitates transcription [5, 6]. In contrast, other modifications (such as lysine methylation on the amino-terminal tail of H3) do not grossly affect chromatin structure per se, but interact with additional factors. For example, several groups of proteins have been shown to preferentially bind histone H3 methylated at lysine 4 (H3K4me): the human chromatin remodeling and assembly factor hCHD1 (human homolog of Chromodomain helicase DNA binding protein 1) binds H3K4me through its chromodomain [7, 8], the chromatin remodeling complex NURF (Nucleosome remodeling factor) binds H3K4me through the PHD (plant homeodomain) domain of its large subunit BPTF (Bromodomain PHD finger transcription factor) [9], the H3K9me3 and H3K36me3 demethylase JMJD2A (Jumonji domain containing 2A) binds H3K4me (and H4K20me3) through its Tudor domain [10, 11], and members of the ING (Inhibitor of growth) family of tumor suppressor proteins bind H3K4me through the PHD domain [12, 13].

Four lysine residues on histone H3 were found to be methylated in Arabidopsis thaliana by mass spectrometry studies (H3K4, H3K9, H3K27 and H3K36) [14, 15]. Di-methylation of histone H3 lysine 9 (H3K9me2) is required for the transcriptional silencing of transposons and other repetitive sequences [16, 17], whereas H3K27me3 is primarily involved in the developmental repression of endogenous genes [18–21]. Recent genome-wide profiling studies in Arabidopsis have shown that H3K9me2 is highly enriched in the pericentromeric heterochromatin where transposons and other repeats cluster [22–25], whereas H3K27me3 is mostly distributed in the transcribed regions of a large number of euchromatic genes and bound by the chromodomain-containing protein LIKE HETEROCHROMATIN PROTEIN-1 (LHP1) [23, 26, 27]. H3K36me is required for normal plant development, but the genome-wide distribution of this modification and its role in transcriptional regulation remain unclear [28–31]. Finally, H3K4me2 is primarily distributed in endogenous genes but not transcriptionally silent transposons, as shown by a previous study of a 1-Mb heterochromatic region in Arabidopsis [22].

Only one H3K4 methyltransferase (SET1; SET domain containing 1) has been identified in yeast (Saccharomyces cerevisiae), and it has been proposed the differential methylation of H3K4 can be attributed to the kinetics of the dissociation of SET1 from the elongating RNA polymerase [32]. Multiple putative H3K4 methyltransferases homologous to SET1 have been identified in Arabidopsis [33–36]. Several lines of evidence suggest that in Arabidopsis distinct H3K4 methyltransferase complexes may also contribute to the differential accumulation of H3K4me1, H3K4me2 and H3K4me3 at specific loci. For example, loss of the H3K4 methyltransferase ATX1 (Arabidopsis homolog of Trithorax 1) leads to a mild reduction in global H3K4me3 level and eliminates H3K4me3 at specific loci, but has no detectable effect on H3K4me2 [37]. In contrast, the loss of a closely related H3K4 methyltransferase, ATX2, results in locus-specific defects in H3K4me2 but does not appear to affect H3K4me3 [38]. Examination of H3K4me levels at several genes revealed that the types of H3K4me present at individual genes may differ significantly [38, 39]. Interestingly, the atx1 mutant exhibits several developmental abnormalities, whereas the atx2 mutant is phenotypically normal [38–40]. Furthermore, results from transcriptional profiling studies indicated that ATX1 and ATX2 likely regulate two largely non-overlapping sets of genes [38]. It therefore appears that there may be significant differences in the mechanism, localization and function H3K4me1, H3K4me2 and H3K4me3.

Here we report a genome-wide analysis of H3K4me1, H3K4me2 and H3K4me3 in Arabidopsis using chromatin immunoprecipitation (ChIP) and whole-genome tiling microarrays (ChIP-chip). We found that all three types of H3K4me are distributed exclusively within genes and their promoters, and that approximately two-thirds of genes contain at least one type of H3K4me. In addition, H3K4me3, H3K4me2 and H3K4me1 are distributed with a 5'-to-3' gradient along genes, where H3K4me3 and H3K4me2 are enriched in the promoters and 5' end of transcribed regions with H3K4me3 distributed slightly upstream of H3K4me2, and H3K4me1 is depleted in promoters but enriched in the transcribed regions with an apparent 3' bias. Interestingly, we found that genes associated with different combinations of H3K4me are expressed at different levels and with different degrees of tissue specificity. Furthermore, genome-wide comparisons between H3K4me and other epigenetic marks revealed preferential co-localization between H3K4me2 and H3K27me3, and between H3K4me1 and CG DNA methylation in the transcribed regions of genes. Finally, the relationship between H3K4me and DNA methylation was further examined by genome-wide profiling of H3K4me in a DNA methylation mutant. The results suggested that H3K4me and DNA methylation may not directly interfere with each other in Arabidopsis, and that these two epigenetic pathways interact primarily through transcription.

Results and discussion

Genome-wide profiling of H3K4me1, H3K4me2 and H3K4me3

Arabidopsis chromatin enriched for H3K4me was isolated by ChIP using antibodies that specifically recognize H3K4me1, H3K4me2 and H3K4me3 (Figure S1 in Additional data file 1). As a control, nucleosomal DNA was isolated by ChIP using an antibody against histone H3 regardless of its modifications. H3K4me ChIP samples were compared to the control nucleosomal DNA by hybridization to Affymetrix whole-genome tiling microarrays that represent approximately 97% of the sequenced Arabidopsis genome at 35-bp resolution.

H3K4me1, H3K4me2 and H3K4me3 regions identified here are highly consistent with results from recently published studies [38] (Figure S2 in Additional data file 1). In addition, real-time PCR validations were performed at a number of randomly chosen loci, all of which yielded results consistent with the ChIP-chip data (Figure S3 in Additional data file 1). Finally, only 0.10%, 0.66% and 0.57% of the chloroplast genome was falsely identified as containing H3K4me1, H3K4me2 and H3K4me3, respectively. Taken together, these results indicate that the ChIP-chip data here provide an accurate representation of the genome-wide distribution of H3K4me with a relatively low false positive rate.

H3K4me1, H3K4me2 and H3K4me3 accumulate exclusively in genes

A total of 15,475 (7.77 Mb) H3K4me1, 12,781 (7.17 Mb) H3K4me2 and 15,894 (14.48 Mb) H3K4me3 regions were identified as described above, representing 6.45%, 6.0% and 12.1% of the sequenced nuclear genome, respectively. All three types of H3K4me are highly enriched in the gene-rich euchromatin and absent from pericentromeric heterochromatin regions where transposons and other repetitive sequences cluster (Figure 1a). Such a euchromatic distribution may largely reflect the fact that H3K4me1, H3K4me2 and H3K4me3 localize almost exclusively in genes: 96.7%, 93.3% and 95.7% of all H3K4me1, H3K4me2 and H3K4me3 regions, respectively, are in or overlap with transcribed regions of genes or their promoters (defined as the 200-bp regions upstream of transcription start sites). Only a small fraction of the remaining H3K4me1, H3K4me2 and H3K4me3 regions (0.6%, 1.3% and 1.5% of total, respectively) overlap with intergenic repetitive sequences such as transposons. The distribution of HK4me in a representative eukaryotic region is shown in Figure 1b.

Figure 1
figure 1

Distribution of H3K4me in the Arabidopsis genome. (a) Chromosomal distribution of H3K4me. Top row: the total length of repetitive sequences (y-axis, left-side scale) and number of genes per 100 kb (y-axis, right-side scale). Bottom panels: chromosomal distribution of H3K4me1, H3K4me2 and H3K4me3. X-axis: chromosomal position; y-axis: the total length of genomic regions containing H3K4me1, H3K4me2 and H3K4me3 per 100 kb, respectively. Arrows indicate the heterochromatic knob on chromosome 4. (b) Local distribution of H3K4me1, H3K4me2, H3K4me3, other epigenetic marks (DNA methylation, H3K9me2, H3K27me3, nucleosome density, small RNAs) and transcription activity in an approximately 40-kb euchromatic region on chromosome 1. Repetitive sequences are shown as filled red boxes on top. Individual genes are shown in open red boxes (arrows indicate direction of transcription; filled light blue boxes, exons; light blue lines, introns). Distribution of H3K4me on the gene labeled by a red asterisk is enlarged and shown in detail at the bottom.

Differential distribution of H3K4me1, H3K4me2 and H3K4me3 within genes

A total of 18,233 genes (approximately 68.0% of all annotated genes) were found to contain H3K4me in their promoters and/or transcribed regions, including 8,571 with H3K4me1, 10,396 with H3K4me2 and 14,712 with H3K4me3. The distribution patterns of H3K4me at the 5' regions of genes were determined by aligning genes by their transcription start sites, and the percentage of genes containing H3K4me in their promoters and the 5' transcribed regions was determined. Similarly, the distribution patterns of H3K4me at the 3' regions of genes were determined by aligning genes by the 3' end of their transcribed regions. These analyses were performed on a set of 5,809 genes that meet the following two criteria. First, they are located 1 kb or more away from the upstream and downstream genes such that ambiguity introduced by neighboring genes can be minimized. Second, they are longer than 1 kb so that there is sufficient gene space to determine the distribution of H3K4me. We further classified the 5,809 genes into four groups according to their length: long genes (>4 kb, 691 genes), intermediate genes (3 to 4 kb, 828 genes; 2 to 3 kb, 1,768 genes) and short genes (1 to 2 kb, 2,522 genes).

The distribution patterns of H3K4me on long genes are shown in Figure 2a. H3K4me1 is present at relatively low level at the 5' and 3' termini of transcribed regions, but is enriched in the internal regions with a slight 3' bias. In contrast, H3K4me2 and H3K4me3 are both enriched in the 5' end with H3K4me3 distributed slightly upstream of H3K4me2. Both H3K4me2 and H3K4me3 are also enriched in the promoters (200 bp upstream of transcription start sites) and 5' flanking regions (200 to approximately 400 bp upstream of transcription start sites), but are absent in the 3' half of the transcribed regions or the 3' flanking regions of the long genes.

Figure 2
figure 2

Distribution of H3K4me relative to genes. (a) Distribution of H3K4me at the 5' and 3' ends of genes. 'Isolated' genes are divided into four groups according to their length (see text for details). Genes belonging to each length group were aligned at the transcription start sites, and the percentage of genes containing H3K4me in their promoters or 5' ends is determined at 200-bp intervals (left y-axis). Similarly, genes belonging to each length group were aligned at the end of transcribed regions, and the percentage of genes containing H3K4me in their 3' ends or downstream flanking regions is determined at 200-bp intervals (right y-axis). The first and last 500 bp, 1 kb, 1.5 kb and 2 kb are shown for genes that are 1 to 2 kb, 2 to 3 kb, 3 to 4 kb and >4 kb in length, respectively. (b) Distribution of H3K4me across genes. Each gene (thick horizontal bar) was divided into 20 intervals (5% each interval), and the 1-kb regions upstream and downstream of each gene (thin horizontal bars) were divided into 50-bp intervals. The percentage of genes with H3K27me3 in each interval was graphed (y-axis). (c) Relationship between gene length and H3K4me. Genes are divided into eight categories according to the combination of H3K4me (see text for details), and the percentage of genes within each length group that are associated with a particular combination of H3K4me is shown (y-axis). (d) Length distribution of genes associated with different combinations of H3K4me. X-axis: gene length in kb (200 bp per bin); y-axis: the percentage of genes associated with a particular combination of H3K4me that are of the corresponding length. A small number of genes longer than 8 kb are not shown.

A comparison of the distribution patterns of H3K4me on long genes and intermediate or short genes revealed several common features as well as some interesting differences. First, as gene length decreases, significantly smaller fractions of genes were found to contain H3K4me1, but the relative position of H3K4me1 in genes (that is, internal regions with a 3' bias) remains similar. Second, the distribution patterns of both H3K4me2 and H3K4me3 at the 5' ends of short or intermediate genes are largely similar to those on long genes, although the shortest genes seem to contain a lower level of H3K4me3 at the 5' end. Third, as gene length decreases, significantly more genes were found to contain H3K4me2 and H3K4me3 in their 3' regions. For example, in the last 200 bp, 10.8- and 13.3-fold more short genes contain H3K4me2 and H3K4me3 than long genes, respectively.

In order to obtain a more continuous view of the distribution of H3K4me, we analyzed the average distribution levels of H3K4me across entire genes. To do this, we divided the transcribed region of each gene into 20 bins (5% of the gene length per bin), and divided the 1-kb upstream and downstream flanking regions of each gene into 20 bins (50 bp per bin). The percentage of genes containing H3K4me in each bin was then determined (Figure 2b). Consistent with the results described above, H3K4me1 is highly enriched within the transcribed regions, but it is present at very low levels in promoters and 3' flanking regions. In addition, H3K4me1 is present at significantly higher levels and spans broader regions on longer genes. In contrast, H3K4me2 and H3K4me3 are enriched in promoters and the 5' half of transcribed regions, at comparable levels on genes with different lengths. Although H3K4me2 and H3K4me3 extend further towards the 3' end on shorter genes relative to gene length, the absolute positions remain virtually constant: regardless of gene length, the highest levels of H3K4me2 and H3K4me3 were found at approximately 600 to 800 bp and 400 to 600 bp downstream of transcription start sites, respectively (Figure 2a). In addition, for genes in all the length groups, H3K4me2 and H3K4me3 appear to be enriched (that is, present at the same or higher levels as they are at transcription start sites) downstream of transcription start sites for approximately 1.5 kb and 1 kb, respectively (Figure 2a).

The observation that H3K4me2 and H3K4me3 appear to cover the 5' regions of genes for a relatively constant length suggests that the length of a given gene may affect the association of this gene with different types of H3K4me, in particular H3K4me1. For example, while all three types of H3K4me are positively correlated with gene length (Figure 2b), such a relationship is significantly more pronounced for H3K4me1. To further study the relationship between gene length and H3K4me, we classified the 5,809 genes into 8 categories based on the 8 possible combinations of their associated H3K4me: H3K4me1 only (me1+me2-me3-), H3K4me2 only (me1-me2+me3-), H3K4me3 only (me1-me2-me3+), H3K4me1 and H3K4me2 but no H3K4me3 (me1+me2+me3-), H3K4me1 and H3K4me3 but not H3K4me2 (me1+me2-me3+), H3K4me2 and H3K4me3 but not H3K4me2 (me1-me2+me3+), H3K4me1, H3K4me2 and H3K4me3 (me1+me2+me3+), and no H3K4me (me1-me2-me3-). The frequencies of occurrences of these combinations within each length group were then determined. As shown in Figure 2c, all combinations that include H3K4me1 (regardless of H3K4me2 and H3K4me3) showed a strong positive correlation with gene length, and all combinations of H3K4me2 and H3K4me3 (in the absence of H3K4me1) showed a negative correlation with gene length. In addition, genes associated with H3K4me1 (me1+me2-me3-, me1+me2+me3-, me1+me2-me3+, me1+me2+me3+) are generally longer than average, with me1+me2-me3+ and me1+me2+me3+ genes being significantly longer and including very few genes shorter than 2 kb (Figure 2d). In summary, by every measure, longer genes show higher levels of H3K4me1.

The distribution patterns of H3K4me2 and H3K4me3 described here are similar to results from analyzing genes on chromosomes 4 and 10 in rice [41]. That is, in both species, H3K4me2 and H3K4me3 are enriched in the promoters and the 5' ends of transcribed regions, with H3K4me3 peaking slightly upstream of H3K4me2 (at approximately 400 to 600 bp and approximately 600 to 800 bp downstream of transcription start sites, respectively; Figure 2a). These results suggest that H3K4me2 and H3K4me3 may be involved in both transcription initiation and the early stages of transcription elongation. In contrast, the internal distribution of H3K4me1 observed here suggests that H3K4me1 might be primarily involved in the elongation step during the transcription of longer genes. Alternatively, the apparent preferential accumulation of H3K4me1 in the transcribed regions may be because this modification is reduced at gene ends (that is, H3K4 is preferentially di- or trimethylated at the 5' ends and unmethylated at the 3' ends).

Association of different combinations of H3K4me1, H3K4me2 and H3K4me3 with differential gene expression patterns

To further test the relationship between H3K4me and transcription, we compared the expression level and tissue specificity of genes associated with different combinations of H3K4me, using a previously published expression profiling dataset [42]. Of the 5,809 genes described above, 5,479 were analyzed here, as expression data were available for these genes. As shown in Figure 3a, me1+me2-me3+, me1+me2+me3+ and me1-me2-me3+ genes are highly expressed, whereas me1+me2-me3-, me1-me2+me3- and me1+me2+me3- genes are expressed at very low levels. The me1-me2+me3+ group includes genes with a wide range of expression levels and seems to be enriched for moderately expressed genes. In addition, me1+me2-me3+, me1+me2+me3+ and me1-me2-me3+ genes exhibit very low levels of tissue specificity, while me1+me2-me3-, me1-me2+me3- and me1+me2+me3- genes are highly tissue specific (Figure 3b). Taken together, these results suggest that H3K4me3 is associated with and likely plays important roles in active transcription. H3K4me1 and H3K4me2, in the absence of H3K4me3, are preferentially associated with tissue-specific genes that are generally not expressed at the developmental stage assayed in this study. These results are consistent with previous reports that although H3K4me2 is generally associated with genes in Arabidopsis, its presence does not always correlate with active transcription [37].

Figure 3
figure 3

Genes with different expression levels and patterns are associated with different combinations of H3K4me. (a) Distribution of expression levels of genes associated with different combinations of H3K4me. X-axis: gene expression level determined in a previous study (log2 scale) [42]. Y-axis: the percentage of genes with corresponding H3K4me combination and expression level. (b) The degree of tissue-specific expression of genes associated with different combinations of H3K4me, as measured by entropy (x-axis). Y-axis: the percentage of genes with corresponding H3K4me combination and entropy values.

Relationship between H3K4me and H3K27me3

In Drosophila, the Trithorax (TRX) family of H3K4 methyltransferases and the Enhancer of Zeste (E(z)) family of H3K27 methyltransferases function antagonistically to activate or repress a largely overlapping set of genes, respectively [43, 44]. Interestingly, many genes are associated with both H3K4me and H3K27me3 in mammalian stem cells, and such a 'bivalent' histone modification has been suggested to play an important role in stem cell renewal and differentiation [45]. Similarly, the co-existence and antagonistic functions of H3K4me3 and H3K27me3 have been described at the FLC and AGAMOUS genes in Arabidopsis [38, 39, 46–48]. We have indeed detected H3K4me2, H3K4me3 and H3K27me3 at the FLC gene. However, we found that AGAMOUS contains a low level of H3K4me2 but no significant level of H3K4me3. This apparent discrepancy is likely due to the different tissues used in the experiments: young seedlings were used in this studywhereas a previous study used mature rosettes (Z Avramova, personal communication).

We have previously found that H3K27me3 is associated with 4,000 to 5,000 tissue-specific genes in their repressed state in Arabidopsis [26]. In order to test whether a preferential association of H3K4me with H3K27me3 exists that could indicate a functional connection, we first determined the fraction of genes with each combination of H3K4me that are also associated with H3K27me3. As shown in Table 1, we found that me1-me2-me3- and me1-me2+me3- genes are associated with H3K27me3 more frequently than expected. In addition, the association frequencies of me1+me2-me3-, me1+me2+me3- and me1-me2+me3+ genes with H3K27me3 are all lower than expected. Finally, me1-me2-me3+, me1+me2-me3+, and me1+me2+me3+ genes are even more depleted of H3K27me3. It should be noted that the differences in transcription levels cannot fully account for the differential association of H3K4me genes with H3K27me3. For example, the me1-me2-me3- and me1-me2+me3- genes are significantly more frequently associated with H3K27me3 than me1+me2-me3- and me1+me2+me3- genes, but these four categories of genes are expressed at very similar levels (Figure 3). The relationship between H3K4me and H3K27me3 was further examined by directly testing whether they co-localize to the same genomic regions. To do this, we determined the presence of each type of H3K4me in H3K27me3-containing genomic regions. As a control, we also determined the presence of H3K4me in a set of randomly chosen regions with the same length and genomic distributions of H3K27me3-containing regions. As shown in Table 2, whereas H3K4me1 and H3K4me3 are significantly depleted in H3K27me3-containing regions, H3K4me2 was found to overlap with H3K27me3 slightly more frequently than with random control regions.

Table 1 Co-localization of H3K4me and H3K27me3 in genes
Table 2 Co-localization of H3K4me and H3K27me3 in the same genome regions

It should be noted that the starting materials in this study (young seedlings) included many distinct cell types. It is likely that some genes are associated with H3K4me3 when they are expressed in some cell types, but are associated with H3K27m3 elsewhere when they are transcriptionally repressed. It is therefore possible that the low frequency of co-localization between H3K4me3 and H3K27me3 described here may still represent an overestimate. It is also possible, however, that co-localization of H3K4me3 and H3K27me3 at a given gene only occurs in specific cell types or during certain developmental stages. If this is the case, our results generated using mixed cell types from a single development stage could represent a gross underestimate of the prevalence of bivalent chromatin modification in plants. Future studies at cell-specific levels should more directly address the exact extent to which plant genes are bivalently modified. In any event, our results seem to indicate a mutually exclusive relationship between H3K4me3 and H3K27me3 at many genes in Arabidopsis seedlings. In animals, the H3K4 demethylase JARID1A (Jumonji, AT rich interactive domain 1A)/RBP2 (Retinol binding protein 2) is recruited to genomic targets through its interaction with the H3K27me3 methyltransferase complex Polycomb repressive complex (PRC) 2, where RBP2 mediates transcriptional repression by demethylating H3K4me3 to H3K4me2 (and to a lesser extent, H3K4me2 to H3K4me1) [49, 50]. In addition, the H3K4me3-specific demethylase JARID1D interacts with Ring6a (Really interesting new gene 6a)/MBLR (Mel18 and Bmi1-like RING finger protein), which is closely related to the PRC1 components Bmi1 (B Lymphoma Mo-MLV insertion region 1) and Mel18 [51]. Interestingly, two Arabidopsis RING finger proteins, AtRING1a and AtRING1b, have been recently found to interact with the H3K27me3 methyltransferase CURLY LEAF and the H3K27me3-binding protein LIKE HETEROCHROMATIN PROTEIN1, and are required for the transcriptional repression of H3K27me3 target genes [52]. The general mutual exclusion between H3K4me3 and H3K27me3 as well as the more frequent overlap of H3K4me2 and H3K27me3 suggest that similar mechanisms might also function in plants. That is, plant H3K4me3 demethylase(s) may function in transcriptional repression by interacting with PRC1 and/or PCR2. If this is the case, a fraction of the H3K4me2 in the Arabidopsis genome could be the demethylation product of H3K4me3.

We also observed that H3K4me1 tended not to co-localize with H3K27me3. One contributing factor could be the differential distribution patterns of these histone modifications along genes: H3K4me1 tends to be present at the 3' half of long genes, whereas H3K27me3 does not exhibit similar preferences for either location within genes or gene length (Figure S4 in Additional data file 1). Furthermore, H3K4me1 was present more frequently on ubiquitously expressed housekeeping genes, while H3K27m3 was more frequently present on tissue-specific genes.

Relationship between H3K4me and DNA methylation

Cytosine DNA methylation is an epigenetic silencing mechanism important for the developmental regulation of endogenous genes and the transcriptional silencing of transposons [53–56]. A mechanistic relationship between DNA methylation and H3K4me has been described in mammals, where the DNA methyltransferase (DNMT) homolog DNMT3L specifically interacts with histone H3 containing unmethylated lysine 4 [57]. That DNMT3L also binds and stimulates the activity of the de novo DNA methyltransferase DNMT3A suggests that H3 with unmethylated K4 may play a role in targeting de novo DNA methylation in mammals [57–59]. However, a distinct small interfering RNA (siRNA)-directed pathway is responsible for de novo DNA methylation in plants [60–62], and an interaction between DNA methyltransferase and histone has not been reported.

Three DNA methylation pathways have been described in plants: METHYLTRANSFERASE 1 (MET1) is a homolog of mammalian DNMT1 and primarily functions in maintaining DNA methylation in the CG sequence context ('CG methylation') [63–66]. The DOMAIN REARRANGED METHYLASE (DRM) (homologous to mammalian DNMT3) interacts with the siRNA pathway and is required for de novo DNA methylation in all sequence contexts as well as the maintenance of DNA methylation in the CHH context (H = A, C or T; 'CHH methylation') [60–62]. The CHROMOMETHYLASE3 is specific to plant genomes and interacts with the H3K9me2 pathway to maintain DNA methylation in the CHG sequence context ('CHG methylation') [67, 68].

The genome-wide distribution of DNA methylation in Arabidopsis has been determined by a number of studies using microarray analyses or ultra-high-throughput deep sequencing of bisulfite treated DNA [22, 25, 69–77]. Results from these studies are largely consistent: CG, CHG and CHH methylation is highly enriched in transposons and other repetitive sequences, suggesting that the RNA interference, H3K9me2 and DNA methylation pathways function together in the transcriptional repression at these loci. DNA methylation is generally depleted in the promoters and 5' ends of endogenous genes. However, over one-third of Arabidopsis genes contain DNA methylation exclusively in the CG sequence context that is enriched in the 3' half of their transcribed regions (termed 'body-methylation'). Most body-methylated genes are expressed at moderate to high levels, and it is therefore unclear whether CG methylation alone in the transcribed regions of genes plays a direct and significant repressive role in transcription.

In order to determine the relationship between DNA methylation and H3K4me in Arabidopsis, we compared DNA methylation levels in genomic regions containing H3K4me to the whole-genome average of DNA methylation. As shown in Table 3, CHG and CHH methylation is significantly depleted in genomic regions containing H3K4me1, H3K4me2 or H3K4me3. CG methylation is also significantly depleted in H3K4me2- and H3K4me3-containing regions. In stark contrast, we found that CG methylation is highly enriched in H3K4me1-containing regions (Table 3). In addition, nearly two-thirds of H3K4me1-containing regions (8,841 of 14,599, approximately 60.6%) with two or more CG dinucleotides are methylated at two or more CG sites, compared to approximately 7.0% (842 of 12,100) and approximately 11.7% (1,750 of 14,918) for H3K4me2- and H3K4me3-containing regions, respectively.

Table 3 The percentage of cytosine residues that are methylated in CG, CHG or CHH sequence contexts in H3K4me-containing genomic regions*

The low level of CHG and CHH methylation in H3K4me-containing regions can be explained by the virtual absence of siRNAs and H3K9me2 within actively transcribed endogenous genes. The lack of CG methylation in H3K4me2- and H3K4me3-containing regions could be due to an active mutual exclusion mechanism (for example, MET1 may be discouraged from chromatin containing H3K4m2 or H3K4me3) similar to what was recently described between DNA methylation and the deposition of the histone variant H2A.Z [78], or simply the differential localization of DNA methylation and H3K4me2/H3K4me3 relative to genes (a 5' bias for H3K4me2/H3K4me3 and a 3' bias for DNA methylation). The high level of CG methylation in H3K4me1-containing regions was unexpected. It is possible that CG methylation and H3K4me1 interact with each other and therefore co-localize at the 3' transcribed regions of genes. It is also possible that the overlap of these two epigenetic marks merely reflects their preferential localization in the similar regions of highly expressed genes. In either case, these results indicate that CG methylation per se and H3K4me1 do not appear to interfere with each other. Finally, genomic regions free of H3K4me frequently lack DNA methylation, suggesting that the absence of H3K4me alone is insufficient to trigger DNA methylation.

Ectopic H3K4me in met1 is associated with transcriptional de-repression

In order to test whether direct mechanistic links exist between DNA methylation and H3K4me (that is, whether DNA methylation per se excludes H3K4me2/H3K4me3 and whether gene body methylation facilitates H3K4me1), we determined the genome-wide distribution of H3K4me in the met1 mutant by ChIP-chip. Previous studies have shown that loss of MET1 eliminates CG methylation as well as substantial fractions of CHG and CHH methylation, resulting in massive transcriptional reactivation of transposons [71, 72, 74, 76, 77].

All three types of H3K4me were found to be present at much higher levels in the pericentromeric heterochromatin regions in met1 (Figure 4). A closer examination revealed that hyper-H3K4me in met1 is almost always associated with ectopic over-expression of transposons or pseudogenes (Figure 4). However, the loss of DNA methylation does not appear to directly trigger hyper-H3K4me. In contrast to the transcription-independent ectopic accumulation of H2A.Z in DNA hypomethylated regions in the met1 mutant [78], no major change in H3K4me was observed in genomic regions that are DNA-hypomethylated but not transcribed. This suggests that the ectopic transcriptional activity resulting from the loss of DNA methylation, but not the loss of DNA methylation per se, is associated with hyper-H3K4me. In addition, at nearly all genes, a complete loss of gene body methylation in met1 had no significant effect on H3K4me1, H3K4me2 or H3K4me3 (Figure 4), suggesting that CG methylation in genes is dispensable for the normal accumulation of H3K4me1.

Figure 4
figure 4

Comparisons of H3K4me accumulated in wild-type Arabidopsis (Wt, green) and the met1 mutant (light brown). Left: chromosome-level changes in H3K4me, showing the ectopic accumulation of H3K4me in the pericentromeric heterochromatin. Chromosome 5 is shown as an example (Wt, green; met1, light brown). X-axis: chromosome position; y-axis: the percentage of H3K4me on chromosome 5 in the corresponding region (in 100 kb bins). Right: local changes in DNA methylation, H3K4me and transcription in a euchromatic region (top right) and a heterochromatic region (bottom right) on chromosome 5. The five genes shown in the euchromatic region likely encode cellular proteins and their expression patterns are unaffected in the met1 mutant. These are (from left to right): At5g56210, WPP-DOMAIN INTERACTING PROTEIN 2; At5g56220, nucleoside-triphosphatase; At5g56230, prenylated rab acceptor (PRA1) family protein; At5g56240, unknown protein. The six genes shown in the heterochromatic region are all transposon-encoded genes. These are (from left to right): At5g32925, CACTA-like transposase; At5g32950, CACTA-like transposase, At5g32975, similar to En/Spm-like transposon protein; At5g33000, Transposable element gene; At5g33025, gypsy-like retrotransposon; At5g33050, gypsy-like retrotransposon. Note that the overexpression of At5g32950 and At5g33050 is associated with ectopic accumulation of H3K4me.

Conclusions

Our genome-wide analysis of H3K4me1, H3K4me2 and H3K4me3 led to several interesting results. First, a large number of genes were found to contain H3K4me: at a single developmental stage, approximately two-thirds of all Arabidopsis genes contain at least one type of H3K4me. This suggests that H3K4me may be required for the normal expression or a large number of genes in plants. Second, H3K4me1, H3K4me2 and H3K4me3 are enriched in different regions in their target genes. H3K4me2 and H3K4me3 are distributed in the promoters and 5' regions with H3K4me3 slightly more upstream, whereas H3K4me1 is mostly located within the transcribed regions. Our H3K4me3 results are highly consistent with those recently published by van Nocker and colleagues [47]. Importantly, very similar distribution patterns of H3K4me1, H3K4me2 and H3K4me3 were also found in yeast, human and other plants (for example, H3K4me2 and H3K4me3 in rice) [32, 41, 79–81], which suggests that many aspects of the mechanisms and functions of H3K4me may be highly conserved during evolution. Third, we found that genes with different expression levels and tissue specificity are associated with different assortments of H3K4me1, H3K4me2 and H3K4me3, suggesting that the three types of H3K4me may have different effects on chromatin structure and transcription. In particular, whereas H3K4me3 appears to be generally associated with actively transcribed genes, our results do not support a direct role of H3K4me1 and H3K4me2 in transcriptional activation: H3K4me1 and H3K4me2 do not appear to have an additive effect on H3K4me3 with regard to transcription levels and, in the absence of H3K4me3, they are not preferentially associated with active transcription. Interestingly, our observation that H3K4me2 (but not H3K4me1 or H3K4me3) often overlaps with H3K27me3 raises the possibility that the accumulation of H3K4me2 at some loci in the Arabidopsis genome might result from demethylation of H3K4me3 by histone demethylases associated with PcG complexes. Fourth, unlike in mammalian stem cells, H3K4me3 and H3K27me3 do not appear to preferentially co-localize on a genome-wide level in Arabidopsis. A second significant difference between plants and mammals is that, in mammals, H3K4me3 is present at active promoters as well as a large number of 'poised' promoters [82], whereas in plants, the presence of H3K4me3 is usually correlated with active transcription. Finally, we observed strong negative correlations between H3K4me2/H3K4me3 and all three types of DNA methylation, and between H3K4me1 and CHG and CHH DNA methylation. However, the loss of DNA methylation does not generally trigger hyper-H3K4me in the corresponding genomic region, indicating that DNA methylation per se may not inhibit H3K4me. Our results do suggest that DNA methylation may interfere with H3K4me indirectly through transcriptional repression, as ectopic transcription was observed in the vast majority of the cases where DNA hypomethylation and hyper-H3K4me occur at the same genes. Interestingly, H3K4me1 is highly correlated with the CG methylation that exists within the transcribed regions of genes. Although the retention of H3K4m1 in the met1 mutant indicates that CG DNA methylation is not required for the accumulation of H3K4me1, it is possible that H3K4me1 might play a role in the colonization of CG DNA methylation within the transcribed regions of genes.

Materials and methods

Arabidopsis thaliana plants (accession Col) were grown on soil under continuous light for 3 weeks, and the aerial part of the seedlings was harvested. The met1-3 mutant plants were grown under the same conditions and harvested at a similar developmental stage. Chromatin was fragmented to 300 to 1,200 bp (mostly 600 to 800 bp) by sonication, and ChIP was performed as previously described using antibodies purchased from Abcam (anti-H3K4me1, ab8895; anti-H3K4me2, ab7766; anti-H3K4me3, ab8580; anti-H3, ab1791) (Cambridge, MA, USA) [26]. The specificities of anti-H3K4me antibodies were validated by dot blot analysis (Figure S1 in Additional data file 1). ChIP samples were amplified, labeled, and hybridized to microarrays as previously described [26, 72]. Four biological replicates were performed for H3K4me1 and H3K4me3, and eight biological replicates were performed for H3K4me2. For each H3K4me ChIP, an H3 ChIP was performed to isolate nucleosomal control DNA. Microarray hybridization intensities from probes that match a unique genomic region were analyzed using Tilemap with the Hidden Markov model option, as previously described [83]. All raw microarray data (CEL files) have been deposited in Gene Expression Omnibus [GEO:GSE13613]. Processed data showing the enrichment of H3K4me can be viewed online [84]. The gene expression data used here were from a previous comprehensive transcriptional profiling study (data for 7- to 14-day-old seedlings were used here for analysis of gene expression levels, and data for all tissue types and developmental stages were used here to analyze tissue specificity) [42]. The gene annotations used here are according to TAIR7. Real-time PCR validation of ChIP-chip results was performed using the SYBR Green I Master kit (Roche; Indianapolis, IN, USA) on a Roche Light Cycler 480. The PCR parameters are: 1 cycle of 10 minutes at 95°C, 40 cycles of 10 s at 95°C, 10 s at 60°C, and 20 s at 72°C. PCR primer sequences are listed in Table S1 in Additional data file 1.

Additional data files

The following additional data are available with the online version of this paper: Figures S1 to S4 and Table S1 (included in Additional data file 1).

Abbreviations

ATX:

Arabidopsis homolog of Trithorax

ChIP:

chromatin immunoprecipitation

DNMT:

DNA methyltransferase

H3K4me:

H3 methylated at lysine 4

JARID:

Jumonji, AT rich interactive domain

MET1:

METHYLTRANSFERASE 1

PHD:

plant homeodomain

PRC:

Polycomb repressive complex

RBP:

Retinol binding protein

SET1:

SET domain containing 1

siRNA:

small interfering RNA.

References

  1. Bhaumik SR, Smith E, Shilatifard A: Covalent modifications of histones during development and disease pathogenesis. Nat Struct Mol Biol. 2007, 14: 1008-1016. 10.1038/nsmb1337.

    Article  PubMed  CAS  Google Scholar 

  2. Schneider J, Wood A, Lee JS, Schuster R, Dueker J, Maguire C, Swanson SK, Florens L, Washburn MP, Shilatifard A: Molecular regulation of histone H3 trimethylation by COMPASS and the regulation of gene expression. Mol Cell. 2005, 19: 849-856. 10.1016/j.molcel.2005.07.024.

    Article  PubMed  CAS  Google Scholar 

  3. Santos-Rosa H, Schneider R, Bannister AJ, Sherriff J, Bernstein BE, Emre NC, Schreiber SL, Mellor J, Kouzarides T: Active genes are tri-methylated at K4 of histone H3. Nature. 2002, 419: 407-411. 10.1038/nature01080.

    Article  PubMed  CAS  Google Scholar 

  4. Wood A, Shukla A, Schneider J, Lee JS, Stanton JD, Dzuiba T, Swanson SK, Florens L, Washburn MP, Wyrick J, Bhaumik SR, Shilatifard A: Ctk complex-mediated regulation of histone methylation by COMPASS. Mol Cell Biol. 2007, 27: 709-720. 10.1128/MCB.01627-06.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  5. Williams SK, Truong D, Tyler JK: Acetylation in the globular core of histone H3 on lysine-56 promotes chromatin disassembly during transcriptional activation. Proc Natl Acad Sci USA. 2008, 105: 9000-9005. 10.1073/pnas.0800057105.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  6. Xu F, Zhang K, Grunstein M: Acetylation in histone H3 globular domain regulates gene expression in yeast. Cell. 2005, 121: 375-385. 10.1016/j.cell.2005.03.011.

    Article  PubMed  CAS  Google Scholar 

  7. Sims RJ, Chen CF, Santos-Rosa H, Kouzarides T, Patel SS, Reinberg D: Human but not yeast CHD1 binds directly and selectively to histone H3 methylated at lysine 4 via its tandem chromodomains. J Biol Chem. 2005, 280: 41789-41792. 10.1074/jbc.C500395200.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  8. Flanagan JF, Mi LZ, Chruszcz M, Cymborowski M, Clines KL, Kim Y, Minor W, Rastinejad F, Khorasanizadeh S: Double chromodomains cooperate to recognize the methylated histone H3 tail. Nature. 2005, 438: 1181-1185. 10.1038/nature04290.

    Article  PubMed  CAS  Google Scholar 

  9. Wysocka J, Swigut T, Xiao H, Milne TA, Kwon SY, Landry J, Kauer M, Tackett AJ, Chait BT, Badenhorst P, Wu C, Allis CD: A PHD finger of NURF couples histone H3 lysine 4 trimethylation with chromatin remodelling. Nature. 2006, 442: 86-90.

    PubMed  CAS  Google Scholar 

  10. Huang Y, Fang J, Bedford MT, Zhang Y, Xu RM: Recognition of histone H3 lysine-4 methylation by the double tudor domain of JMJD2A. Science. 2006, 312: 748-751. 10.1126/science.1125162.

    Article  PubMed  CAS  Google Scholar 

  11. Kim J, Daniel J, Espejo A, Lake A, Krishna M, Xia L, Zhang Y, Bedford MT: Tudor, MBT and chromo domains gauge the degree of lysine methylation. EMBO Rep. 2006, 7: 397-403.

    PubMed  CAS  PubMed Central  Google Scholar 

  12. Pena PV, Davrazou F, Shi X, Walter KL, Verkhusha VV, Gozani O, Zhao R, Kutateladze TG: Molecular mechanism of histone H3K4me3 recognition by plant homeodomain of ING2. Nature. 2006, 442: 100-103.

    PubMed  CAS  PubMed Central  Google Scholar 

  13. Shi X, Hong T, Walter KL, Ewalt M, Michishita E, Hung T, Carney D, Pena P, Lan F, Kaadige MR, Lacoste N, Cayrou C, Davrazou F, Saha A, Cairns BR, Ayer DE, Kutateladze TG, Shi Y, Cote J, Chua KF, Gozani O: ING2 PHD domain links histone H3 lysine 4 methylation to active gene repression. Nature. 2006, 442: 96-99. 10.1038/nature05140.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  14. Johnson L, Mollah S, Garcia BA, Muratore TL, Shabanowitz J, Hunt DF, Jacobsen SE: Mass spectrometry analysis of Arabidopsis histone H3 reveals distinct combinations of post-translational modifications. Nucleic Acids Res. 2004, 32: 6511-6518. 10.1093/nar/gkh992.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  15. Zhang K, Sridhar VV, Zhu J, Kapoor A, Zhu JK: Distinctive core histone post-translational modification patterns in Arabidopsis thaliana. PLoS ONE. 2007, 2: e1210-10.1371/journal.pone.0001210.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Jackson JP, Lindroth AM, Cao X, Jacobsen SE: Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature. 2002, 416: 556-560. 10.1038/nature731.

    Article  PubMed  CAS  Google Scholar 

  17. Malagnac F, Bartee L, Bender J: An Arabidopsis SET domain protein required for maintenance but not establishment of DNA methylation. EMBO J. 2002, 21: 6842-6852. 10.1093/emboj/cdf687.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  18. Schubert D, Clarenz O, Goodrich J: Epigenetic control of plant development by Polycomb-group proteins. Curr Opin Plant Biol. 2005, 8: 553-561. 10.1016/j.pbi.2005.07.005.

    Article  PubMed  CAS  Google Scholar 

  19. Lindroth AM, Shultis D, Jasencakova Z, Fuchs J, Johnson L, Schubert D, Patnaik D, Pradhan S, Goodrich J, Schubert I, Jenuwein T, Khorasanizadeh S, Jacobsen SE: Dual histone H3 methylation marks at lysines 9 and 27 required for interaction with CHROMOMETHYLASE3. EMBO J. 2004, 23: 4286-4296. 10.1038/sj.emboj.7600430.

    Article  PubMed  CAS  Google Scholar 

  20. Kinoshita T, Harada JJ, Goldberg RB, Fischer RL: Polycomb repression of flowering during early plant development. Proc Natl Acad Sci USA. 2001, 98: 14156-14161. 10.1073/pnas.241507798.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  21. Chanvivattana Y, Bishopp A, Schubert D, Stock C, Moon YH, Sung ZR, Goodrich J: Interaction of Polycomb-group proteins controlling flowering in Arabidopsis. Development. 2004, 131: 5263-5276. 10.1242/dev.01400.

    Article  PubMed  CAS  Google Scholar 

  22. Lippman Z, Gendrel AV, Black M, Vaughn MW, Dedhia N, McCombie WR, Lavine K, Mittal V, May B, Kasschau KD, Carrington JC, Doerge RW, Colot V, Martienssen R: Role of transposable elements in heterochromatin and epigenetic control. Nature. 2004, 430: 471-476. 10.1038/nature02651.

    Article  PubMed  CAS  Google Scholar 

  23. Turck F, Roudier F, Farrona S, Martin-Magniette ML, Guillaume E, Buisine N, Gagnot S, Martienssen RA, Coupland G, Colot V: Arabidopsis TFL2/LHP1 specifically associates with genes marked by trimethylation of histone H3 lysine 27. PLoS Genet. 2007, 3: e86-10.1371/journal.pgen.0030086.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Bernatavichute YV, Zhang X, Cokus S, Pellegrini M, Jacobsen SE: Genome-wide association of histone H3 lysine nine methylation with CHG DNA methylation in Arabidopsis thaliana. PLoS ONE. 2008, 3: e3156-10.1371/journal.pone.0003156.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Zhang X: The epigenetic landscape of plants. Science. 2008, 320: 489-492. 10.1126/science.1153996.

    Article  PubMed  CAS  Google Scholar 

  26. Zhang X, Clarenz O, Cokus S, Bernatavichute YV, Pellegrini M, Goodrich J, Jacobsen SE: Whole-genome analysis of histone H3 lysine 27 trimethylation in Arabidopsis. PLoS Biol. 2007, 5: e129-10.1371/journal.pbio.0050129.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Zhang X, Germann S, Blus BJ, Khorasanizadeh S, Gaudin V, Jacobsen SE: The Arabidopsis LHP1 protein colocalizes with histone H3 Lys27 trimethylation. Nat Struct Mol Biol. 2007, 14: 869-871. 10.1038/nsmb1283.

    Article  PubMed  CAS  Google Scholar 

  28. Zhao Z, Yu Y, Meyer D, Wu C, Shen WH: Prevention of early flowering by expression of FLOWERING LOCUS C requires methylation of histone H3 K36. Nat Cell Biol. 2005, 7: 1256-1260. 10.1038/ncb1329.

    Article  PubMed  Google Scholar 

  29. Kim SY, He Y, Jacob Y, Noh YS, Michaels S, Amasino R: Establishment of the vernalization-responsive, winter-annual habit in Arabidopsis requires a putative histone H3 methyl transferase. Plant Cell. 2005, 17: 3301-3310. 10.1105/tpc.105.034645.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  30. Xu L, Zhao Z, Dong A, Soubigou-Taconnat L, Renou JP, Steinmetz A, Shen WH: Di- and tri- but not monomethylation on histone H3 lysine 36 marks active transcription of genes involved in flowering time regulation and other processes in Arabidopsis thaliana. Mol Cell Biol. 2008, 28: 1348-1360. 10.1128/MCB.01607-07.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  31. Dong G, Ma DP, Li J: The histone methyltransferase SDG8 regulates shoot branching in Arabidopsis. Biochem Biophys Res Commun. 2008, 373: 659-664. 10.1016/j.bbrc.2008.06.096.

    Article  PubMed  CAS  Google Scholar 

  32. Liu CL, Kaplan T, Kim M, Buratowski S, Schreiber SL, Friedman N, Rando OJ: Single-nucleosome mapping of histone modifications in S. cerevisiae. PLoS Biol. 2005, 3: e328-10.1371/journal.pbio.0030328.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Alvarez-Venegas R, Avramova Z: SET-domain proteins of the Su(var)3-9, E(z) and trithorax families. Gene. 2002, 285: 25-37. 10.1016/S0378-1119(02)00401-8.

    Article  PubMed  CAS  Google Scholar 

  34. Baumbusch LO, Thorstensen T, Krauss V, Fischer A, Naumann K, Assalkhou R, Schulz I, Reuter G, Aalen RB: The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes. Nucleic Acids Res. 2001, 29: 4319-4333. 10.1093/nar/29.21.4319.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  35. Springer NM, Napoli CA, Selinger DA, Pandey R, Cone KC, Chandler VL, Kaeppler HF, Kaeppler SM: Comparative analysis of SET domain proteins in maize and Arabidopsis reveals multiple duplications preceding the divergence of monocots and dicots. Plant Physiol. 2003, 132: 907-925. 10.1104/pp.102.013722.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  36. Zhao Z, Shen WH: Plants contain a high number of proteins showing sequence similarity to the animal SUV39H family of histone methyltransferases. Ann N Y Acad Sci. 2004, 1030: 661-669. 10.1196/annals.1329.077.

    Article  PubMed  CAS  Google Scholar 

  37. Alvarez-Venegas R, Avramova Z: Methylation patterns of histone H3 Lys 4, Lys 9 and Lys 27 in transcriptionally active and inactive Arabidopsis genes and in atx1 mutants. Nucleic Acids Res. 2005, 33: 5199-5207. 10.1093/nar/gki830.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  38. Saleh A, Alvarez-Venegas R, Yilmaz M, Le O, Hou G, Sadder M, Al-Abdallat A, Xia Y, Lu G, Ladunga I, Avramova Z: The highly similar Arabidopsis homologs of trithorax ATX1 and ATX2 encode proteins with divergent biochemical functions. Plant Cell. 2008, 20: 568-579. 10.1105/tpc.107.056614.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  39. Pien S, Fleury D, Mylne JS, Crevillen P, Inze D, Avramova Z, Dean C, Grossniklaus U: ARABIDOPSIS TRITHORAX1 dynamically regulates FLOWERING LOCUS C activation via histone 3 lysine 4 trimethylation. Plant Cell. 2008, 20: 580-588. 10.1105/tpc.108.058172.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  40. Alvarez-Venegas R, Pien S, Sadder M, Witmer X, Grossniklaus U, Avramova Z: ATX-1, an Arabidopsis homolog of trithorax, activates flower homeotic genes. Curr Biol. 2003, 13: 627-637. 10.1016/S0960-9822(03)00243-4.

    Article  PubMed  CAS  Google Scholar 

  41. Li X, Wang X, He K, Ma Y, Su N, He H, Stolc V, Tongprasit W, Jin W, Jiang J, Terzaghi W, Li S, Deng XW: High-resolution mapping of epigenetic modifications of the rice genome uncovers interplay between DNA methylation, histone methylation, and gene expression. Plant Cell. 2008, 20: 259-276. 10.1105/tpc.107.056879.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  42. Schmid M, Davison TS, Henz SR, Pape UJ, Demar M, Vingron M, Scholkopf B, Weigel D, Lohmann JU: A gene expression map of Arabidopsis thaliana development. Nat Genet. 2005, 37: 501-506. 10.1038/ng1543.

    Article  PubMed  CAS  Google Scholar 

  43. Papp B, Muller J: Histone trimethylation and the maintenance of transcriptional ON and OFF states by trxG and PcG proteins. Genes Dev. 2006, 20: 2041-2054. 10.1101/gad.388706.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  44. Beisel C, Buness A, Roustan-Espinosa IM, Koch B, Schmitt S, Haas SA, Hild M, Katsuyama T, Paro R: Comparing active and repressed expression states of genes controlled by the Polycomb/Trithorax group proteins. Proc Natl Acad Sci USA. 2007, 104: 16615-16620. 10.1073/pnas.0701538104.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  45. Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, Cuff J, Fry B, Meissner A, Wernig M, Plath K, Jaenisch R, Wagschal A, Feil R, Schreiber SL, Lander ES: A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell. 2006, 125: 315-326. 10.1016/j.cell.2006.02.041.

    Article  PubMed  CAS  Google Scholar 

  46. Saleh A, Al-Abdallat A, Ndamukong I, Alvarez-Venegas R, Avramova Z: The Arabidopsis homologs of trithorax (ATX1) and enhancer of zeste (CLF) establish 'bivalent chromatin marks' at the silent AGAMOUS locus. Nucleic Acids Res. 2007, 35: 6290-6296. 10.1093/nar/gkm464.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  47. Oh S, Park S, van Nocker S: Genic and global functions for Paf1C in chromatin modification and gene expression in Arabidopsis. PLoS Genet. 2008, 4: e1000077-10.1371/journal.pgen.1000077.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Saleh A, Alvarez-Venegas R, Avramova Z: Dynamic and stable histone H3 methylation patterns at the Arabidopsis FLC and AP1 loci. Gene. 2008, 423: 43-47. 10.1016/j.gene.2008.06.022.

    Article  PubMed  CAS  Google Scholar 

  49. Pasini D, Hansen KH, Christensen J, Agger K, Cloos PA, Helin K: Coordinated regulation of transcriptional repression by the RBP2 H3K4 demethylase and Polycomb-Repressive Complex 2. Genes Dev. 2008, 22: 1345-1355. 10.1101/gad.470008.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  50. Christensen J, Agger K, Cloos PA, Pasini D, Rose S, Sennels L, Rappsilber J, Hansen KH, Salcini AE, Helin K: RBP2 belongs to a family of demethylases, specific for tri-and dimethylated lysine 4 on histone 3. Cell. 2007, 128: 1063-1076. 10.1016/j.cell.2007.02.003.

    Article  PubMed  CAS  Google Scholar 

  51. Lee MG, Norman J, Shilatifard A, Shiekhattar R: Physical and functional association of a trimethyl H3K4 demethylase and Ring6a/MBLR, a polycomb-like protein. Cell. 2007, 128: 877-887. 10.1016/j.cell.2007.02.004.

    Article  PubMed  CAS  Google Scholar 

  52. Xu L, Shen WH: Polycomb silencing of KNOX genes confines shoot stem cell niches in Arabidopsis. Curr Biol. 2008, 18: 1966-1971. 10.1016/j.cub.2008.11.019.

    Article  PubMed  CAS  Google Scholar 

  53. Zhang X, Jacobsen SE: Genetic analyses of DNA methyltransferases in Arabidopsis thaliana. Cold Spring Harb Symp Quant Biol. 2006, 71: 439-447. 10.1101/sqb.2006.71.047.

    Article  PubMed  CAS  Google Scholar 

  54. Zilberman D: The evolving functions of DNA methylation. Curr Opin Plant Biol. 2008, 11: 554-559. 10.1016/j.pbi.2008.07.004.

    Article  PubMed  CAS  Google Scholar 

  55. Henderson IR, Jacobsen SE: Epigenetic inheritance in plants. Nature. 2007, 447: 418-424. 10.1038/nature05917.

    Article  PubMed  CAS  Google Scholar 

  56. Chan SW, Henderson IR, Jacobsen SE: Gardening the genome: DNA methylation in Arabidopsis thaliana. Nat Rev Genet. 2005, 6: 351-360. 10.1038/nrg1601.

    Article  PubMed  CAS  Google Scholar 

  57. Ooi SK, Qiu C, Bernstein E, Li K, Jia D, Yang Z, Erdjument-Bromage H, Tempst P, Lin SP, Allis CD, Cheng X, Bestor TH: DNMT3L connects unmethylated lysine 4 of histone H3 to de novo methylation of DNA. Nature. 2007, 448: 714-717. 10.1038/nature05987.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  58. Chen T, Ueda Y, Xie S, Li E: A novel Dnmt3a isoform produced from an alternative promoter localizes to euchromatin and its expression correlates with active de novo methylation. J Biol Chem. 2002, 277: 38746-38754. 10.1074/jbc.M205312200.

    Article  PubMed  CAS  Google Scholar 

  59. Suetake I, Shinozaki F, Miyagawa J, Takeshima H, Tajima S: DNMT3L stimulates the DNA methylation activity of Dnmt3a and Dnmt3b through a direct interaction. J Biol Chem. 2004, 279: 27816-27823. 10.1074/jbc.M400181200.

    Article  PubMed  CAS  Google Scholar 

  60. Cao X, Jacobsen SE: Role of the Arabidopsis DRM methyltransferases in de novo DNA methylation and gene silencing. Curr Biol. 2002, 12: 1138-1144. 10.1016/S0960-9822(02)00925-9.

    Article  PubMed  CAS  Google Scholar 

  61. Chan SW, Zilberman D, Xie Z, Johansen LK, Carrington JC, Jacobsen SE: RNA silencing genes control de novo DNA methylation. Science. 2004, 303: 1336-10.1126/science.1095989.

    Article  PubMed  CAS  Google Scholar 

  62. Zilberman D, Cao X, Jacobsen SE: ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science. 2003, 299: 716-719. 10.1126/science.1079695.

    Article  PubMed  CAS  Google Scholar 

  63. Finnegan EJ, Peacock WJ, Dennis ES: Reduced DNA methylation in Arabidopsis thaliana results in abnormal plant development. Proc Natl Acad Sci USA. 1996, 93: 8449-8454. 10.1073/pnas.93.16.8449.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  64. Ronemus MJ, Galbiati M, Ticknor C, Chen J, Dellaporta SL: Demethylation-induced developmental pleiotropy in Arabidopsis. Science. 1996, 273: 654-657. 10.1126/science.273.5275.654.

    Article  PubMed  CAS  Google Scholar 

  65. Kankel MW, Ramsey DE, Stokes TL, Flowers SK, Haag JR, Jeddeloh JA, Riddle NC, Verbsky ML, Richards EJ: Arabidopsis MET1 cytosine methyltransferase mutants. Genetics. 2003, 163: 1109-1122.

    PubMed  CAS  PubMed Central  Google Scholar 

  66. Saze H, Scheid OM, Paszkowski J: Maintenance of CpG methylation is essential for epigenetic inheritance during plant gametogenesis. Nat Genet. 2003, 34: 65-69. 10.1038/ng1138.

    Article  PubMed  CAS  Google Scholar 

  67. Bartee L, Malagnac F, Bender J: Arabidopsis cmt3 chromomethylase mutations block non-CG methylation and silencing of an endogenous gene. Genes Dev. 2001, 15: 1753-1758. 10.1101/gad.905701.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  68. Lindroth AM, Cao X, Jackson JP, Zilberman D, McCallum CM, Henikoff S, Jacobsen SE: Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation. Science. 2001, 292: 2077-2080. 10.1126/science.1059745.

    Article  PubMed  CAS  Google Scholar 

  69. Tran RK, Henikoff JG, Zilberman D, Ditt RF, Jacobsen SE, Henikoff S: DNA methylation profiling identifies CG methylation clusters in Arabidopsis genes. Curr Biol. 2005, 15: 154-159. 10.1016/j.cub.2005.01.008.

    Article  PubMed  CAS  Google Scholar 

  70. Vaughn MW, Tanurd Ic M, Lippman Z, Jiang H, Carrasquillo R, Rabinowicz PD, Dedhia N, McCombie WR, Agier N, Bulski A, Colot V, Doerge RW, Martienssen RA: Epigenetic natural variation in Arabidopsis thaliana. PLoS Biol. 2007, 5: e174-10.1371/journal.pbio.0050174.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Zilberman D, Gehring M, Tran RK, Ballinger T, Henikoff S: Genome-wide analysis of Arabidopsis thaliana DNA methylation uncovers an interdependence between methylation and transcription. Nat Genet. 2007, 39: 61-69. 10.1038/ng1929.

    Article  PubMed  CAS  Google Scholar 

  72. Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, Chen H, Henderson IR, Shinn P, Pellegrini M, Jacobsen SE, Ecker JR: Genome-wide high-resolution mapping and functional analysis of DNA methylation in Arabidopsis. Cell. 2006, 126: 1189-1201. 10.1016/j.cell.2006.08.003.

    Article  PubMed  CAS  Google Scholar 

  73. Reinders J, Delucinge Vivier C, Theiler G, Chollet D, Descombes P, Paszkowski J: Genome-wide, high-resolution DNA methylation profiling using bisulfite-mediated cytosine conversion. Genome Res. 2008, 18: 469-476. 10.1101/gr.7073008.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  74. Mathieu O, Reinders J, Caikovski M, Smathajitt C, Paszkowski J: Transgenerational stability of the Arabidopsis epigenome is coordinated by CG methylation. Cell. 2007, 130: 851-862. 10.1016/j.cell.2007.07.007.

    Article  PubMed  CAS  Google Scholar 

  75. Zilberman D, Henikoff S: Genome-wide analysis of DNA methylation patterns. Development. 2007, 134: 3959-3965. 10.1242/dev.001131.

    Article  PubMed  CAS  Google Scholar 

  76. Cokus SJ, Feng S, Zhang X, Chen Z, Merriman B, Haudenschild CD, Pradhan S, Nelson SF, Pellegrini M, Jacobsen SE: Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature. 2008, 452: 215-219. 10.1038/nature06745.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  77. Lister R, O'Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Millar AH, Ecker JR: Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008, 133: 523-536. 10.1016/j.cell.2008.03.029.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  78. Zilberman D, Coleman-Derr D, Ballinger T, Henikoff S: Histone H2A.Z and DNA methylation are mutually antagonistic chromatin marks. Nature. 2008, 456: 125-129. 10.1038/nature07324.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  79. Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim TK, Koche RP, Lee W, Mendenhall E, O'Donovan A, Presser A, Russ C, Xie X, Meissner A, Wernig M, Jaenisch R, Nusbaum C, Lander ES, Bernstein BE: Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007, 448: 553-560. 10.1038/nature06008.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  80. Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K: High-resolution profiling of histone methylations in the human genome. Cell. 2007, 129: 823-837. 10.1016/j.cell.2007.05.009.

    Article  PubMed  CAS  Google Scholar 

  81. Pokholok DK, Harbison CT, Levine S, Cole M, Hannett NM, Lee TI, Bell GW, Walker K, Rolfe PA, Herbolsheimer E, Zeitlinger J, Lewitter F, Gifford DK, Young RA: Genome-wide map of nucleosome acetylation and methylation in yeast. Cell. 2005, 122: 517-527. 10.1016/j.cell.2005.06.026.

    Article  PubMed  CAS  Google Scholar 

  82. Guenther MG, Levine SS, Boyer LA, Jaenisch R, Young RA: A chromatin landmark and transcription initiation at most promoters in human cells. Cell. 2007, 130: 77-88. 10.1016/j.cell.2007.05.042.

    Article  PubMed  CAS  PubMed Central  Google Scholar 

  83. Ji H, Wong WH: TileMap: create chromosomal map of tiling array hybridizations. Bioinformatics. 2005, 21: 3629-3636. 10.1093/bioinformatics/bti593.

    Article  PubMed  CAS  Google Scholar 

  84. UCSC Genome Bioinformatics. [http://epigenomics.mcdb.ucla.edu/H3K4m1m2m3/]

Download references

Acknowledgements

XZ was supported by a Faculty Research Grant (JR-040) from the University of Georgia. YVB was supported by USPHS National Research Service Award GM07104. Jacobsen lab research was supported by NIH grant GM60398. SEJ is an investigator of the Howard Hughes Medical Institute.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Xiaoyu Zhang or Steven E Jacobsen.

Additional information

Authors' contributions

XZ, YVB and SEJ designed the experiments. YVB and XZ performed the experiments. XZ, MP and SEJ analyzed the data. SC contributed reagents/materials/analysis tools. XZ wrote the paper.

Xiaoyu Zhang, Yana V Bernatavichute contributed equally to this work.

Electronic supplementary material

13059_2008_2212_MOESM1_ESM.pdf

Additional data file 1: Figure S1: dot blot analysis showing the specificity of antibodies used here. Figure S2: comparison of the H3K4me distribution patterns determined here and those reported in a recent locus-specific study [38]. Figure S3: real-time PCR validation of H3K4me ChIP-chip results. Figure S4: length distribution of H3K27me3 target genes. Table S1: real-time PCR primer sequences. (PDF 2 MB)

Authors’ original submitted files for images

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, X., Bernatavichute, Y.V., Cokus, S. et al. Genome-wide analysis of mono-, di- and trimethylation of histone H3 lysine 4 in Arabidopsis thaliana. Genome Biol 10, R62 (2009). https://doi.org/10.1186/gb-2009-10-6-r62

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/gb-2009-10-6-r62

Keywords