A Genome-Wide Survey of Imprinted Genes in Rice Seeds Reveals Imprinting Primarily Occurs in the Endosperm

Genomic imprinting causes the expression of an allele depending on its parental origin. In plants, most imprinted genes have been identified in Arabidopsis endosperm, a transient structure consumed by the embryo during seed formation. We identified imprinted genes in rice seed where both the endosperm and embryo are present at seed maturity. RNA was extracted from embryos and endosperm of seeds obtained from reciprocal crosses between two subspecies Nipponbare (Japonica rice) and 93-11 (Indica rice). Sequenced reads from cDNA libraries were aligned to their respective parental genomes using single-nucleotide polymorphisms (SNPs). Reads across SNPs enabled derivation of parental expression bias ratios. A continuum of parental expression bias states was observed. Statistical analyses indicated 262 candidate imprinted loci in the endosperm and three in the embryo (168 genic and 97 non-genic). Fifty-six of the 67 loci investigated were confirmed to be imprinted in the seed. Imprinted loci are not clustered in the rice genome as found in mammals. All of these imprinted loci were expressed in the endosperm, and one of these was also imprinted in the embryo, confirming that in both rice and Arabidopsis imprinted expression is primarily confined to the endosperm. Some rice imprinted genes were also expressed in vegetative tissues, indicating that they have additional roles in plant growth. Comparison of candidate imprinted genes found in rice with imprinted candidate loci obtained from genome-wide surveys of imprinted genes in Arabidopsis to date shows a low degree of conservation, suggesting that imprinting has evolved independently in eudicots and monocots.

Published in the journal: . PLoS Genet 7(6): e32767. doi:10.1371/journal.pgen.1002125
Category: Research Article
doi: 10.1371/journal.pgen.1002125


Genomic imprinting causes the expression of an allele depending on its parental origin. In plants, most imprinted genes have been identified in Arabidopsis endosperm, a transient structure consumed by the embryo during seed formation. We identified imprinted genes in rice seed where both the endosperm and embryo are present at seed maturity. RNA was extracted from embryos and endosperm of seeds obtained from reciprocal crosses between two subspecies Nipponbare (Japonica rice) and 93-11 (Indica rice). Sequenced reads from cDNA libraries were aligned to their respective parental genomes using single-nucleotide polymorphisms (SNPs). Reads across SNPs enabled derivation of parental expression bias ratios. A continuum of parental expression bias states was observed. Statistical analyses indicated 262 candidate imprinted loci in the endosperm and three in the embryo (168 genic and 97 non-genic). Fifty-six of the 67 loci investigated were confirmed to be imprinted in the seed. Imprinted loci are not clustered in the rice genome as found in mammals. All of these imprinted loci were expressed in the endosperm, and one of these was also imprinted in the embryo, confirming that in both rice and Arabidopsis imprinted expression is primarily confined to the endosperm. Some rice imprinted genes were also expressed in vegetative tissues, indicating that they have additional roles in plant growth. Comparison of candidate imprinted genes found in rice with imprinted candidate loci obtained from genome-wide surveys of imprinted genes in Arabidopsis to date shows a low degree of conservation, suggesting that imprinting has evolved independently in eudicots and monocots.


Maternal and paternal alleles, inherited in plants and animals after fertilization, are usually equivalently expressed during the developmental cycle. In plants and in placental animals, a subset of genes is preferentially transcribed depending on the gender of the parent from which the gene originates and they are defined as imprinted genes [1]. Mono-allelic expression and also preferential expression of parental alleles has been observed for imprinted genes. A combination of epigenetic processes including DNA and histone methylation and demethylation are involved in repressing expression from one parental allele and enabling expression from the other in plants and animals [2][16].

In animals, many hundreds of imprinted genes have been identified that are thought to be involved in various functions including regulation of nutrient transfer from the foetal placenta to the embryo, in embryo growth and in adult brain development [17][22]. In flowering plants, disruption of imprinting alters seed development [23][25]. The majority of plant imprinted genes have been found in seeds of the model plant Arabidopsis where they manifest their biased expression in the endosperm [1][3]. The endosperm is a terminal seed tissue with a nutritive function that forms following double fertilization in the ovule of the flower [1][3], [26]. In plants, the endosperm is triploid as nuclei contain a 2 maternal∶1 paternal genome complement resulting from the fusion of the diploid maternal central cell nucleus with a sperm cell. The other product of double fertilization is a diploid zygote that develops into an embryo, the progenitor of the seedling. Embryo nuclei contain a 1 maternal∶1 paternal genome complement (Figure 1A). Until recently, only 21 imprinted genes had been identified in flowering plants from studies in eudicot Arabidopsis and in monocot cereals rice and maize [2]. With the exception of one maternally expressed maize embryo gene, Mee1 [27], the remainder were imprinted in the endosperm. Transcriptomic surveys of Arabidopsis seeds have significantly expanded the set of imprinted genes in eudicot endosperm to over 170 candidates but additional embryo imprinted genes have not been identified [28, Wolff et al. 2011 unpublished].

Seed development and imprinting in rice seeds.
Fig. 1. Seed development and imprinting in rice seeds.
(A) A cartoon showing parental contributions to embryo and endosperm tissues following double fertilization in rice ovules. In the triploid endosperm the maternal (m) to paternal (p) genome ratio is 2∶1 and in the diploid embryo a 1m∶1p ratio is observed. (B–D) Sections of developing rice seeds at indicated days after fertilization (DAF) showing embryo (em) and endosperm (en) morphology and the proximity of these tissues. Embryo differentiation is complete 10 DAF. Starch and protein accumulation continue in the endosperm until grain maturation (∼20–25 DAF). Endosperm was harvested for transcriptome analyses at 5 DAF and embryos at 6 DAF. Scale bars are all 100 µm. (E) Os10g05750 is a maternally expressed gene found in the embryo and endosperm. RT-PCR sequencing chromatographs show that maternal expression is maintained during endosperm development. In the embryo imprinting is transient as bi-allelic expression is evident at 8 and 10 DAF in the Nip (m)×93-11 (p) cross (arrows indicate double peaks for G and A). Some paternal expression is evident in the embryo at 10 DAF in the 93-11 (m)×Nip (p) cross. Os10g05750 is expressed in a range of other plant tissues (Figure S6). (F) Temporal changes in imprinting status for four endosperm imprinted genes. The bars provide a schematic representation of the relative expression levels of maternal (m; red) or paternal (p; blue) alleles for the four genes based on RT-PCR sequencing chromatographs (Figure S3). Dotted bars indicate low level of expression. All four genes become uniparentally expressed. Expression in sperm cells is indicated by (+), absence by (−) and (?) indicates uncertain from microarray analyses in triplicate (see Materials and Methods). (G) Allele specific RT-PCR confirming the dynamics of imprinting for Os02g55560 in endosperm obtained from reciprocal crosses between Nip and 93-11. The RT-PCR products were cut with EcoRV. The Nip+93-11 lane is a digestion control containing equal amounts of parental Nip and 93-11 RT-PCR products.

In plants, nutrient allocation is considered to be the driving force for the evolution of imprinting in the endosperm according to the parental conflict theory [29]. This theory predicts that excess dosage of paternal alleles promotes larger seeds while an excess of maternal alleles produces small seeds. These predictions have held true in interploidy crosses that alter maternal and paternal gene dosage in Arabidopsis [23]. Some of the Arabidopsis imprinted genes have roles controlling endosperm cell proliferation and growth [14], [30], [31] but for most of the newly identified genes their roles in endosperm formation and nutrient allocation are unknown [28]. The recently identified Arabidopsis imprinted genes have a diverse range of putative functions suggesting they have the potential to influence conflict between maternal and paternal genomes at a range of molecular regulatory levels [28]. Dilkes and Comai [32] have suggested imprinting may play a wider role in plants by regulating dosage sensitive gene expression. The significance of the parental conflict theory in self-fertilizing plants like Arabidopsis has also been questioned and it has been proposed that imprinting may serve to promote hybridity in outcross situations [33], [34].

Different seed development profiles are observed in plants. The endosperm is transient and consumed by the embryo during seed development in Arabidopsis. By contrast, in rice and maize, the endosperm is persistent and contributes to the bulk of the mature seed acting as a nutrient source for the embryo at germination. Comparisons of imprinted genes found in such developmentally distinct seed types of different evolutionary origin, should provide further insight to the conservation, role and importance of imprinted genes in seed development. Prior to his study, only seven locus-specific imprinted genes had been reported in rice and maize [35][42]. The two known rice imprinted genes, OsFIE1 and OsMADS87, were found using comparative sequence similarity with the maize imprinted gene FIE 1 and the Arabidopsis imprinted gene PHERES1, respectively [41], [42].

We have conducted a transcriptomic analysis of imprinted genes in the embryo and endosperm of rice, an important food crop, and one in which both of these tissues persist in the mature seed. We report the identification of 262 candidate loci with parentally biased expression in the endosperm and the experimental verification of 56 of these loci. Imprinting in rice, like Arabidopsis, primarily occurs in the endosperm and is rare in the embryo. Only one gene was found to be maternally expressed, albeit transiently in the embryo, and it was also maternally expressed in the endosperm. A comparison of rice candidate imprinted loci with the reported set of candidate Arabidopsis imprinted genes indicates that these two species share very few imprinted genes in common, suggesting that imprinting is likely to have evolved independently in monocots and eudicots. Interestingly, a small number of candidate imprinted genes that share high sequence homology in rice and Arabidopsis are associated with epigenetic regulation, including DNA methylation, histone modification and small RNA pathways. The identified rice loci provide a comprehensive platform to further explore imprinted gene function and imprinting mechanisms in cereal seed development.


Identification of imprinted loci in seeds from reciprocal crosses of Indica and Japonica rice

We examined imprinting in endosperm and embryos of seeds derived from reciprocal crosses between Nipponbare (Nip), a Japonica rice and 93-11, a Chinese Indica rice subspecies. The genomic sequences for these two subspecies are publically available and reciprocal crosses result in viable progeny (Figure S1). Cytological analysis of seed growth in selfed parents and in reciprocal crosses indicated that they followed previously described seed developmental profiles (Figure 1B–1D) [43]. Endosperm was nuclear in seeds 2 days after fertilization (DAF), became cellular at 3 DAF and embryo morphogenesis was complete at 10 DAF. Some variation in the timing of rice seed development was observed in the intervening stages (Figure S1A). Unlike Arabidopsis, rice seeds are large. They attain maximum length soon after fertilization. The embryo and endosperm are found in distinct compartments in the rice seed and they can be manually isolated as pure fractions since their boundaries are readily separable and washing of the embryo is sufficient to remove all traces of endosperm (Figure S1B). Endosperm for transcriptomic analyses was harvested at 5 DAF (Figure 1C) when seeds had nearly attained their maximum length; maternal nucellar tissue was absent and milky endosperm could readily be extracted from the top of the seed. Embryos for transcriptomic analysis were harvested at 6 DAF when the first to third leaf primordia were developing. Embryos of the same size and morphological stage were collected for RNA isolation to minimize stage variation (Figure S1B).

We conducted a genome-wide survey of gene expression in the harvested endosperm and embryo tissues using the Illumina high-throughput short read sequencing platform to identify loci that were exclusively or preferentially expressed from maternal and paternal genomes. In the endosperm, a total of 4.1 million reads could be aligned uniquely to both genomes at publicly reported SNP loci (SNP reads), covering a total of 116,291 SNPs with a median read count of 35 reads per SNP location (Table S1). In the embryo, read mapping resulted in 9.9 million reads across 168,250 SNPs and a median read count at SNP loci of 59 (Table S1). SNP reads were required to align to a unique position in both genomes covering exactly one publicly reported SNP. Throughout we considered a genomic feature expressed if it showed at least 10 uniquely aligned reads and progressed a feature for analysis if it contained at least 10 SNP reads. The use of SNP reads allows the allelic assignment of expressed regions to their parental genome of origin and the characterization of any allelic bias of expression of that region. Using this approach we were able to analyse allelic expression of approximately 50% and 70% of annotated and expressed genes across the endosperm and embryo transcriptome respectively (Table 1). Of those that we were not able to analyse approximately 20% were not annotated as containing SNPs and the remainder displayed insufficient read density across SNP loci in these datasets.

Tab. 1. Summary of imprinted candidates identified in rice embryo and endosperm by transcriptome analyses and those confirmed by PCR.
Summary of imprinted candidates identified in rice embryo and endosperm by transcriptome analyses and those confirmed by PCR.
: Three embryo genes that met significance and four additional genes with suggestive evidence were analysed. Only one gene proved to be imprinted and it arose from the four additional genes analysed (Figure 1E). This gene is also imprinted in the endosperm.

We analysed allelic expression bias of both overlapping physical windows (1 kb width every 0.5 kb) tiled across the genome and publicly annotated features relevant to the transcriptome including cDNAs, introns and intergenic regions (Table S2). This redundant strategy allowed us to maximize exploration of imprinted transcripts from annotated and unannotated regions of the genome.

Physical windows were considered for analysis if they included at least 10 SNP reads in each cross. The relative allelic proportions of SNP reads within windows were observed to globally agree with the expected 1∶1 maternal to paternal allelic contributions expected in the embryo transcriptome (Figure 1A, Figure S2A and S2B). Similarly, across the endosperm transcriptome allelic expression was observed to closely approximate the expected 2∶1 maternal to paternal ratio (Figure 1A, Figure S2C and S2D). This evident discrimination in global patterns of the endosperm and embryo transcriptomes suggested that the contamination of maternal tissues in both endosperm and embryo was neglible and specific parental contributions could be determined in this dataset. Evidence for imprinting was determined through statistical analysis of observed to expected allelic contributions to expression and loci were considered putatively imprinted at P≤0.05 after a Pearson chi-square test.

Analysis of the embryo identified 56 windows with evidence of imprinting (55 maternal, 1 paternal) whereas in the endosperm, 1,220 windows showed evidence of imprinting (24% maternal, 76% paternal; Table S2). Allelic analysis of expression also allowed the identification of subspecies bias of expression, for example more than 2,500 windows showed increased expression of the 93-11 allele in endosperm (Table S2). These subspecies expression biases are potentially of interest for the characterisation of the genetic regulation of expression in these subspecies, however these biases were not further investigated in this study.

Imprinted physical windows showed no evidence of physical clustering throughout the genome. The median distance between significant windows was 1.1 Mb and only 4.5% (58) of all window candidates were within 10 kb of another putatively imprinted window. We observed that the majority (97%) of all significant windows mapped either to within annotated gene features or within several kilobases upstream or downstream of annotated genes. The 3% of remaining window candidate loci (41) could be classified as intergenic. They were located several kilobases away from annotated transcription, and 64% (26) of these involved significance at a single window or region of 1 kb; with the remainder involving two or at most three consecutive windows. Given the strong overlap of candidate imprinted loci with annotated features, we decided to focus our analysis on annotated features for imprinting status, however, all physical window candidate loci were investigated for overlap with imprinted candidates detected by the annotated feature analysis.

SNP reads were mapped to annotated gene features including cDNAs, introns and intergenic regions (Table S2). This mapping utilised all SNP reads analysed in the physical window analysis where reads were allocated to cDNA and intron regions as appropriate when they overlapped annotated features, and intergenic regions when they did not overlap annotated features. In addition, we considered all annotated transcript isoforms for all genes, analysing each transcript isoform independently for imprinting status. This reallocation accounted for all SNP reads analysed in the physical window analysis. Statistical analysis of the expression of annotated features identified 262 candidate loci with evidence of a parental expression bias including 177 maternally expressed and 85 paternally expressed candidate loci (Table 1, Tables S2 and S3). The majority of these candidate loci (165) coincided with annotated cDNAs, whereas the remaining loci (97) were localized to intergenic regions (75) or introns (22) (Table 1, Tables S2 and S3). These intergenic loci might represent non-coding RNAs as identified in mammals [17], [18] or various types of mis-annotated transcription. A search of miRBase (http://www.mirbase.org/) with the sequences of these intergenic loci did not detect any matches to known miRNAs. Most of the biased transcription annotated as intronic are possibly unannotated alternative splicing events as suggested by their close proximity to annotated transcripts and the observation that the flanking exon sequences show similar modes of parental bias.

In contrast to the endosperm, only three maternally expressed cDNAs were identified in the embryo transcriptome, despite the extra sequencing depth (Table 1, Tables S2 and S3). To explore whether the candidate imprinted endosperm genes are also imprinted in the embryo, expression of all 165 endosperm cDNAs showing a parental bias were examined in the embryo transcriptome (Table S3). We found that 64 cDNAs (39%) were detected in the embryo transcriptome with less than 10 SNP reads in at least one cross; thus suggesting that these genes are minimally expressed in the embryo and their expression was too low to confidently examine their imprinting status in the embryo in this dataset. Another 85 cDNAs showed no evidence of parentally biased expression including 15 which appeared to show subspecies specific expression. Only one, cDNA Os08g08960 displayed statistically significant evidence of maternally biased expression in both endosperm and embryo transcriptomes (Table S3).

A comparison of the two analytical strategies for both embryo and endosperm transcriptomes revealed that the majority of putatively imprinted physical windows overlapped with candidates suggested through the annotated features analysis. There was, however, a portion of the windows analysis (36%) not represented in the annotated features analysis, and a much smaller subset of the annotated feature candidates (7%) not captured in the physical windows analysis. In all but three cases, these analysis-specific loci overlapped annotated gene features in the rice genome. On closer inspection it was observed that some biased windows lost evidence of imprinting when summarised into larger features such as cDNA. This could suggest the presence of noise over smaller numbers of SNP loci that could be eliminated by analysis of a larger region with more SNP evidence. It could also suggest that in some cases differentially biased expression of windows within one gene may correspond to parent of origin-specific alternative splicing as found in animals [44]. As mentioned previously, our analysis of annotated features considered all annotated alternate splice variants. We extended this in a modification of the physical windows analysis to search for evidence of unannotated alternative splicing differentiated by parent of origin in the endosperm and embryo transcriptomes. This analysis identified nine candidates in the endosperm and none in the embryo that showed evidence of a novel phenomenon (Table 1 and Table S3). Multiple parentally biased transcripts of differing sequence and genic structure appeared to be arising from the same locus in the nine identified endosperm candidates as a result of alternative splicing and we termed this parent of origin-specific alternative splicing.

In summary, the collective in silico transcriptome analysis of the embryo and endosperm tissues of hybrid F1 seeds derived from reciprocal crosses identified 93 maternally and 72 paternally biased genes, in addition to 97 parentally biased non-genic transcripts in endosperm. The analysis also identified nine putative cases of parent of origin-specific alternate splicing. By contrast, parentally biased gene expression was restricted to only three maternally expressed genes in the embryo (Table 1). This tissue-specific profile of imprinting was also reflected in analysis of physical windows tiled across the genome which identified 20-fold more putatively imprinted loci in the endosperm relative to the embryo. The majority of putative imprinted loci identified through the window analysis were co-located with annotated transcripts either as cDNA or as non-genic regions associated with a transcript. A small subset of unannotated intergenic loci identified in both the analysis of physical windows and annotated features may be non-coding RNAs, but they do not match any currently known non-coding RNA sequences. Since the profiling utilized poly A+ RNA they are unlikely to be small RNAs. These putative intergenic transcripts will require further characterization to confirm their transcription and explore their function.

Identified imprinted loci are not clustered

Most of the candidate rice imprinted genes and intergenic loci identified are not organized in physically co-localized clusters in the rice genome of the type found in mammals [45]. There are six pairs of candidate imprinted genes located close to each other that we term micro-clusters (Cluster 1: Os01g12860-Os01g12890; Cluster 2: Os02g29140-Os02g29150; Cluster 3: Os03g27450-Os03g27460; Cluster 4: Os05g05780-Os05g05790; Cluster 5: Os06g33640-Os06g33690; Cluster 6: Os12g32150-Os12g32170). Only one of these appears to contain a duplicated gene (Cluster 5). Whether these micro-clusters are regulated in a similar manner to mammalian clusters via an imprinting control region remains to be determined. We found no significant evidence for enrichment of transposons or repeats around the candidate imprinted rice genes relative to non-imprinted genes (based on the annotation in http://rice.plantbiology.msu.edu/; Tables S4 and S5). However, given the frequent distribution of repeats around many rice genes, and the currently less advanced annotation of transposons and repeat-like elements in the rice genome, we cannot rule out the possibility that some of these elements play a role in influencing imprinting in the rice transcriptome.

Os10g05750 is maternally expressed in the embryo and endosperm

The three putative embryo imprinted gene candidates were further examined for maternal allele-specific gene expression in embryos by RT-PCR using gene specific primers and sequencing across SNPs (Tables S2 and S3). Imprinting was not confirmed in any of these statistically predicted candidates. We then tested four additional embryo candidates that were originally statistically excluded from analysis because they showed parentally-biased expression in one cross, but had low SNP read coverage in the other cross (Table S3). One of these, Os10g05750 was confirmed to be maternally expressed in both the embryo and endosperm (Figure 1E). Os10g05750 is a homolog of olive Ole e 1 which encodes an allergenic protein thought to control pollen tube emergence and guidance [46]. The transcript abundance of this gene is in the mid-range of imprinted transcripts detected in the endosperm (Table S3), thus it is unlikely that it results from a major endosperm contaminant. Analysis of Os10g05750 expression in the embryo and endosperm at 6, 8 and 10 DAF showed persistence of expression of the maternal allele in the endosperm. However, biallelic expression of Os10g05750 was observed in embryos of the Nip×93-11 cross at 8 DAF and some paternal expression was observed at 10 DAF in the reciprocal cross (Figure 1E). Maternal expression of the Os10g05750 gene in the embryo during seed development is therefore transient.

Confirmation of maternally and paternally expressed endosperm loci

The remainder of our analyses focused on candidate imprinted loci identified in the endosperm. Parent-specific gene expression biases observed in endosperm formed a continuum similar to that found for the transcriptomic analysis of imprinting in the mouse brain [18] (Table S6). For experimental confirmation, we primarily selected loci with 90% or greater expression bias from one parental allele (bias of 0.9 and above in Tables S3 and S6). Within the 165 cDNAs with parental expression bias in the endosperm, 121 imprinted genes comprising 62 maternally expressed and 59 paternally expressed gene candidates met the selection criteria of strong parental expression bias (Table S6). From this subset, we randomly selected 60 putatively imprinted endosperm loci for validation by RT-PCR sequencing across SNPs. These loci included 54 from the set of 165 expressed genes including OsFIE1 (Os08g04290), a maternally expressed rice endosperm gene that we had previously identified [41], and six other loci showing putative imprinting within introns and intergenic regions. In sharp contrast to the single imprinted gene identified in the embryo, we confirmed a total of 22 predicted maternally expressed genes and 27 predicted paternally expressed genes in the endosperm (Table 2 and Figure S3). We also confirmed maternal expression of an intergenic region between two non-imprinted genes Os01g69110 and Os01g69120 where there is currently no annotated ORF (Table 2 and Figure S4). Os04g20774 produced transcripts with preferentially maternal reads from introns and paternal reads from coding regions and this was confirmed using exon and intron primers (Table 2; Figure S3). Maternally expressed transcription within the intron of another putatively imprinted gene, Os07g42390.1 was also confirmed (Table 2 and Figure S3). Collectively, these results provide support that the methods utilized appear robust for the identification of parent of origin-specific gene expression.

Tab. 2. Imprinted loci confirmed by RT-PCR and sequencing across SNPs in reciprocal crosses.
Imprinted loci confirmed by RT-PCR and sequencing across SNPs in reciprocal crosses.
: Mat is maternal and Pat is paternal expression bias.

Parent of origin-biased alternative splicing and variable transcript termination result in biased parental transcripts arising from one gene

We selected Os01g70060, one of the nine candidate loci showing evidence of novel alternative splicing patterns to closely examine the types of maternal and paternal transcript isoforms generated from this gene using 3′-end race and RT-PCR. Figure 2 and Figure S5 show the range of transcript isoforms produced from Os01g70060, which encodes a putative DUF-domain containing protein. Correctly spliced paternally biased transcripts were detected with nine different polyadenylation sites within the small region downstream of the stop codon of this gene. In addition, a range of truncated, maternally biased polyadenylated transcripts were also formed. They contained the 250 bp exon and they terminated at various positions in the adjoining large intron of the gene. A total of 18 different termination sites were identified located in the 5′ end of the 2.4 kb intron. In some cases, additional alternative splicing occurred within this intron (Figure 2). There is a stop codon just inside the large intron, thus the truncated maternally biased transcripts containing the 5′ portion of the gene are likely to produce a small protein. These data provide a clear example of parent of origin-biased transcript isoforms arising from the same plant gene. The other eight candidate loci remain to be examined as does the mechanism giving rise to this phenomenon.

Different parentally biased transcripts arising from the Os01g70060 gene as a result of novel splicing and polyadenylation.
Fig. 2. Different parentally biased transcripts arising from the Os01g70060 gene as a result of novel splicing and polyadenylation.
A cartoon of the gene structure is shown with exons in green. The structure of the paternally biased transcript detected is shown with blue exons and the various maternally biased transcripts generated from the gene are shown with red exons. The positions of the primers used to detect transcripts are shown by arrows with the primer names as indicated. Further details are provided in Materials and Methods and confirmation of parental bias using RT-PCR and sequencing is provided in Figure S5. The transcripts terminate at a range of different positions as indicated in the text, and the transcripts are polyadenylated.

Temporal expression of imprinted genes during endosperm formation and transcript presence in other plant tissues

We examined whether transcripts from the imprinted loci were seed-specific in expression or if they were also expressed in other parts of the rice plant by RT-PCR. Maternally and paternally expressed loci were clearly not all silenced in other rice tissues, nor were all maternally expressed genes restricted to the endosperm of rice. Expression analyses conducted on 46 confirmed imprinted endosperm loci (Figure S6) indicated that most paternally expressed endosperm loci were expressed in various tissues in the rice plant (17/21), while approximately half of the examined maternal loci (13/25) were specifically expressed in the endosperm at 5 DAF (Figure S6). These genes may have functions in the development of a number of plant tissues.

The imprinted genes in the endosperm were originally identified using endosperm harvested from seeds at 5 DAF. We also examined the imprinting status of 43 confirmed genes at various stages of endosperm development (3.5, 6, 8, and 10 DAF) to assess whether imprinting was persistent during seed development. In 39 examined cases biased expression was evident at 3.5 or 5 DAF and persisted in the endosperm until 10 DAF (Figure S3). In four cases (Os06g33640, Os09g03500, Os12g32170, and Os12g40520) imprinting was lost in later stages of endosperm development as expression became biallelic in at least one cross (Figure S3). We also examined the expression of these imprinted genes in isolated rice sperm cells by microarray which revealed that 37 of the maternally expressed endosperm genes examined are also expressed in sperm cells. This suggests that the paternal allele might be silenced after fertilization (Table 2 and Table S3, and Materials and Methods).

We noted that four genes (Os02g55560, Os04g39560, Os04g42250 and Os06g40490) were biallelically expressed or incompletely imprinted in the endosperm at 3.5 DAF and became uniparentally expressed at later stages of endosperm development in both crosses (Figure 1F and Figure S3). Figure 1F summarizes the data obtained for these genes showing that two become maternally expressed and two become paternally expressed (RT-PCR sequencing traces supporting these data are provided in Figure S3). We confirmed the stage specific imprinting of Os02g55560 by allele-specific RT-PCR (Figure 1G). We also subcloned and sequenced RT-PCR products from the endosperm RNAs of both crosses for Os02g55560, obtaining 21 maternal and 11 paternal SNPs (21m: 11p) in the Nip×93-11 cross and 19m: 20p paternal SNPs in the 93-11×Nip cross at 3.5 DAF. By contrast, maternal SNPs were almost exclusively detected in 5 DAF endosperm from the Nip×93-11 cross (38m: 0p) and the 93-11×Nip cross (45m: 2p). Of the two genes Os02g55560 and 0s04g39560 that become maternally biased in the endosperm at later stages of seed development, 0s04g39560 (Figure 1F) is also expressed in sperm cells (Table 2 and ), thus paternal message carry over may be a possibility at 3.5DAF [47]. Alternatively, the paternal allele may be silenced after fertilization.

In the case of the two genes that become paternally expressed in endosperm, Os06g40490 is expressed in sperm and Os04g42250 is not (Figure 1F and Table 2 and Table S3). If the latter gene is epigenetically silenced in sperm it may be activated upon fertilization or during early endosperm development. Collectively these data indicate that there are dynamic changes in imprinted gene expression during rice endosperm development. These may involve selective silencing or activation of parental alleles after fertilization during endosperm development for some genes.

Putative functions of rice imprinted genes and their imprinted homologs in other plants

Database comparisons of the 165 candidate imprinted rice genic endosperm loci revealed that 52 (∼30%) were of unknown function. The remainder were enriched in a range of putative regulatory processes including DNA and RNA binding and signal transduction, and cellular component organization (Figure S7 and Table 2 and Table S3). For example, several rice parentally biased endosperm genes are homologous to those regulating mRNA levels in other species, including CCR4-NOT [48], Pumilio [49] and LEUNIG [50]. Chromatin remodelling genes, including OsFIE1 (Os08g04290) [41] the homolog of the maize imprinted Polycomb gene Zmfie1 [38], a putative H3 K9 methyltransferase gene (Os03g20430) and an SSXT type gene homologous to a component of the SWI/SNF complex [51] were also identified.

Comparisons of our 165 candidate rice imprinted loci with the known cereal imprinted loci in maize and rice [35][42] revealed that only the Polycomb-group genes Zmfie1 and OsFIE1 (Os08g04290) were in common, with both showing endosperm specific expression [38], [41]. OsMADS87 (Os03g38610) a previously reported rice imprinted homolog of Arabidopsis PHERES 1 [42] showed no evidence of imprinting in our dataset because very low expression of OsMADS87 was observed in our hybrid endosperm with a total of only 6 reads from both reciprocal crosses.

The 165 putative rice imprinted genes found here were also compared with two recent datasets of candidate Arabidopsis imprinted genes identified by seed transcriptome sequencing [28, Wolff et al. 2011 unpublished] and a third dataset of Arabidopsis genes predicted to be imprinted by genome methylation analysis [52]. We found a total of only 27 rice candidate imprinted genes with significant homology to the three candidate Arabidopsis imprinted gene datasets (Table S7). Some of the imprinted genes with high sequence homology between rice and Arabidopsis include C3HC4 ring finger genes, YUCCA10 potentially involved in auxin biosynthesis, rice AUXIN RESPONSE FACTOR 18, a MYB gene and protein kinases (Table S7). Nine of the genes with high homology are implicated in epigenetic regulation, potentially involved in small RNA, chromatin remodelling, and DNA methylation functions (Table 3). Examples of the latter include ARGONAUTE (AGO) and DsRNA BINDING (DRB) protein genes potentially associated with the small RNA pathway [53]. H3K9 methyltransferase genes (SUVH) [54] and PICKLE-like genes [55] are potential chromatin remodelling factors, and a paternally biased rice homolog (Os04g22240) of imprinted Arabidopsis VARIANT IN METHYLATION 5 (VIM5) [28] was also detected (Table 3). Not all 27 candidate imprinted genes with high homology between rice and Arabidopsis have been experimentally verified. However, from current data, it is interesting to note that parental expression biases appear to be retained in 20 cases (Table 3 and Table S7).

Tab. 3. Candidate imprinted gene homologs in rice and Arabidopsis may have roles in epigenetic regulation.1,2
Candidate imprinted gene homologs in rice and <i>Arabidopsis</i> may have roles in epigenetic regulation.<em class=&quot;ref&quot;>1</em><sup>,</sup><em class=&quot;ref&quot;>2</em>
: Imprinting has not been experimentally tested.


Genomic imprinting primarily occurs in rice endosperm with transcripts arising from genic and non-genic regions

Our approach to identify imprinted genes in reciprocal crosses between Nipponbare (Nip) and 93-11 Indica rice relied on the use of SNP reads to enable allelic assignment of expressed sequences to their genome of origin and the characterization of any allelic bias in expression. Using this approach we have been able to analyse allelic expression of approximately 50% and 70% of annotated and expressed genes across the endosperm and embryo transcriptome, respectively. This resulted in the identification of 262 candidate loci in the endosperm and only three in the embryo. Given that 50% of the endosperm expressed genes and 30% of the embryo expressed genes either lacked SNPs or were expressed at a level too low to be characterized, we estimate that there might be at least an additional 260 loci in the endosperm, bringing the total closer to 520 candidate imprinted loci in the rice endosperm; and potentially additional loci in the embryo if imprinting is more prevalent in earlier stages of embryo development than those examined here. By contrast, there are 750 candidate genes estimated to be imprinted in Arabidopsis [28].

Subsequent validation of our in silico data led to the confirmation of only one gene in the embryo. This Ole e 1 homologue [46] was found to be maternally expressed in the embryo, and it was also maternally expressed in the endosperm, although imprinting of this gene was lost during later embryo development. It may be that we have missed other imprinted embryo genes because our analyses were conducted using embryos at 6 DAF. Nonetheless, we conclude that imprinting in the embryo of rice occurs at low frequency and is transient. This is supported by the observation that imprinted genes have not been found in rice seedlings [56].

By contrast, 262 loci with evidence of a parental expression bias were found in the endosperm, 177 maternally expressed and 85 paternally expressed. The majority, 165, coincided with annotated cDNAs and 97 with non-genic loci. The non-genic loci localized to introns in 22 instances, and in 75 cases to intergenic regions. Verification of the imprinting status of 92% (55/60) of candidate imprinted loci examined provided us with confidence that the method is robust, and importantly confirms the notion that in rice seed, like Arabidopsis, the majority of the imprinted genes are expressed in endosperm. Evidence that biased transcripts arise from intron and intergenic loci was also obtained amongst the 60 examined candidates. While the intergenic regions may be unannotated regions, it is tempting to speculate that intergenic regions encode non-coding RNAs and further characterization of this class is required.

While most of the genes examined remain persistently imprinted during rice endosperm development, we have observed dynamic changes in imprinted gene expression, including gain or loss of imprinting. Similar cases are also found in mouse, where the imprinting status of some genes depends on the stage of development [18]. Similarly, our analyses indicate that the establishment of imprinting for some rice genes may not be predetermined in gametes, and reprogramming of epigenetic marks may be effected soon after fertilization.

We found that parentally biased transcript isoforms can arise from a gene via alternative splicing and differential transcript termination. The observed transcripts are polyadenylated. Nine such putative candidate loci were identified and one investigated in detail. A similar phenomenon has also been documented at a mouse locus where the maternal allele produces functional full-length transcripts whereas the paternal transcripts are alternatively spliced in a novel manner and truncated. The production of paternally truncated transcripts was found to depend on the status of DNA methylation within the maternal allele [44]. In the case examined here in detail in rice, full-length paternally biased transcripts were formed whereas various maternally derived transcripts were truncated and the mechanisms are unknown.

In summary, we have observed that in rice seed, imprinting primarily occurs in the endosperm from both genic and non-genic regions and we have observed novel features pertaining to imprinted gene expression during endosperm development after fertilization.

Genomic imprinting in monocot and eudicot seeds

Rice and Arabidopsis are developmentally distinct seed types. In rice and other cereals, the endosperm persists in the mature seed and is utilized by the embryo during germination. During cereal seed development, significant translocation of photosynthetic products from the vegetative parts of the plant occurs during endosperm differentiation and grain filling. By contrast, in Arabidopsis, the endosperm is transient and consumed by the developing embryo, which stores reserves for germination. Nevertheless, in both types of seeds, imprinting is predominant in the endosperm. The identified imprinted genes are not physically clustered in the genomes of these monocot and eudicot plants as found in animals, but a few paired micro-clusters are found in rice and Arabidopsis [Wolff et al. 2011 unpublished].

The candidate imprinted genes identified in rice and Arabidopsis have diverse functions which suggest that they may be involved in controlling endosperm and seed development at different regulatory levels [28]. However, only 27 of the identified 165 candidate rice imprinted genes display significant sequence homology with the candidate Arabidopsis imprinted genes found by transcriptome analyses [28, Wolff et al. 2011 unpublished] and predicted by methylation analyses in the Arabidopsis genome [52]. This contrasts with the situation observed in animals where conservation of imprinted genes and their expression modes are more frequently observed [57].

In Arabidopsis, imprinting appears to be a consequence of global epigenetic reprogramming during the events of sexual reproduction [52], [58]. A combination of the RETINOBLASTOMA pathway and DEMETER (DME) activity in the central cell results in a genome-wide hypomethylation of central cell DNA and the activation of maternally expressed alleles [9], [52], [58]. CG hypomethylation results from DME activity which removes methylcytosines from DNA and accompanying down-regulation of METHYLTRANSFERASE 1(MET1) and VIM5 activity. The latter functions to recruit DNA methyltransferases to hemi-methylated DNA [59], [28] The non-expressed parental allele can be silenced by DNA methylation, histone K27 tri-methylation mediated by the Polycomb group (PcG) complex, or both [7][14].

Genome hypomethylation has also been found in rice endosperm [60], however, a DME homologue is not evident in rice [60]. The observed hypomethylation and imprinting in rice endosperm may be due to the activity of other members of this family or by other currently unknown mechanisms [60]. Our analyses indicate that VIM5 homologues are paternally expressed in both rice and Arabidopsis endosperm (Table 3). Imprinting of the putative rice VIM5 remains to be confirmed, however, this observation suggests that some mechanisms regulating imprinting may be common in the endosperm of both species.

Analyses of Arabidopsis imprinted gene expression in mutant backgrounds has revealed that imprinting in a subset of maternally expressed genes is not controlled by DME, FIE (a member of the PcG complex) or MET1 suggesting additional unknown mechanisms for allele silencing and activation [28]. It has been hypothesized, but not yet proven, that these maternal transcripts may be transferred or deposited into the endosperm from surrounding maternal seed tissues. Some of the rice candidate imprinted genes have homology to this set of putative maternally deposited Arabidopsis genes (Table 3 and Table S7).

The low level of homology in candidate imprinted genes in rice and Arabidopsis suggests imprinted genes are likely to have evolved independently after the divergence of monocots and eudicots. It is possible that the development of these distinct seed types may require different sets of imprinted genes, which may contribute to reproductive isolation [34]. Testing the roles of the identified parentally biased genes in both the regulation of imprinting and in rice endosperm development will provide further insights into the molecular basis for and functional significance of imprinting in plants.

Materials and Methods

Reciprocal crosses in rice (Oryza sativa) subspecies and RNA preparation

The Japonica rice subspecies Nipponbare (Nip) and the Chinese Indica subspecies 93-11 were grown in the glasshouse at 26°C to 28°C under natural light supplemented with artificial light to achieve a 16 hour photoperiod. Emasculation of Nip florets was carried out by cutting mature florets in half to remove anthers and this was conducted early morning before anther dehiscence and exertion. A hot water treatment was used for early morning emasculation of 93-11 and the panicles were immersed in water at 43°C for 8 minutes. Reciprocal crosses between the emasculated subspecies were carried out after emasculation. Fertilization in rice occurs one hour after pollination. Nip×93-11 indicates Nip female pollinated with 93-11, and the reciprocal cross 93-11×Nip indicates 93-11 female crossed with Nip pollen. Seed development in reciprocal crosses and selfed subspecies parents was monitored cytologically and compared to previously described stages of seed development by Itoh et al. [43]. These data are presented in the text and Figure S1.

Endosperm samples from reciprocal crosses for transcriptome sequencing were harvested at 5 days after fertilization (DAF) by cutting the tip of the seed at the opposite end to that containing the embryo and gently removing a small portion of endosperm. Endosperm from 50 seeds for each cross was pooled for RNA extraction. Embryos for transcriptome sequencing were manually extracted at 6 DAF to minimize contamination with endosperm as separation from the endosperm compartment is established; they were washed three times in RNAlater and then size selected to obtain comparable developmental stages for RNA extraction. 600 embryos were harvested from each cross. RNA was isolated using the QIAGEN RNeasy Plant mini kit with DNase I treatment according to the manufacturer's protocol. 40 µg of total RNA from endosperm for each reciprocal cross, and 30 µg of embryo RNA for each reciprocal cross were sent to AGRF (Australian Genome Research Facility http://www.agrf.org.au), where libraries were constructed for each sample and sequenced using the Illumina platform (see below).

Additional material was subsequently harvested utilizing the above procedures for verification of imprinted genes and for developmental analyses. Endosperm samples for developmental analysis of gene expression were harvested at 3.5, 5, 6, 8 and 10 DAF, and embryos at 6, 8 and 10 DAF. Other rice tissues harvested for gene expression analyses included husk of developing seed, anther, mature ovule, root, flag leaf, stem and endosperm from both subspecies.

Sequencing data generation

The generation of sequence libraries and sequence data was completed in accordance with recommended Illumina GAII protocols (http://www.illumina.com) by AGRF (Australian Genome Research Facility). Briefly, total RNA samples were tested for quality using the Agilent BioAnalyzer. Poly-A containing RNAs were purified and reverse transcribed to cDNAs using random primers. Double-stranded cDNAs were synthesized and these fragments were end-repaired and extended through ligation with a single ‘A’ base and then Illumina adaptor sequences. After PCR amplification, the sequencing libraries were size-selected on an agarose gel and purified for sequencing. The endosperm sequencing libraries were sequenced to a length of 36 base pairs, while the embryo library was sequenced to a length of 75 base pairs. Raw sequence data were generated by the Illumina analysis pipeline (version 1.3). For endosperm, one sequencing library was generated for each reciprocal cross. For embryos, one library was also constructed for each reciprocal cross.

Sequence data analysis

Sequence data was pre-processed to trim off any 3′ adaptor sequence present and exclude sequences that were less than 18 bases in length or >50% repetitive in nature. A non-redundant sequence dataset consisting of unique sequences and associated total read counts for each sample was used in subsequent analyses (Table S1).

SOAP [61] was used to align the reads to the assembled Japonica (Nip) and 93-11 genomes [62], [63]. Reads were filtered to those that aligned perfectly to a single location in one genome, and to the other with exactly a single mismatch corresponding to a SNP reported in the public SNP repository (http://rice.plantbiology.msu.edu/cgi-bin/gbrowse/rice/) (Table S1). For direct comparability to the endosperm, only the first 36 bases of 75 base embryo reads were analysed. This maintained a similar risk of reads being excluded from consideration due to the presence of more than one SNP in the read. Even with this conservative approach, the embryo data were observed to have greater sequencing depth and SNP coverage than the endosperm data (Table 1 and Table S1) which is indicative of the greater yield of sequences now possible in rapidly emerging sequencing chemistries. On the basis of SNPs detected, reads were assigned to either the maternal or paternal genome in each cross. Read counts were normalised between crosses for discrepancies in total read count by conversion to reads per million and multiplication by a fixed constant to return values to integer counts similar in scale to the observed data.

Two approaches were used to complete genome-wide scans of putatively maternal or paternal reads. One approach used overlapping windows of 1 kb (every 0.5 kb) to summarise read counts across the whole genome, and the other summarised reads into annotated genic features including cDNAs, introns and intergenic regions. In both approaches only those features showing a total normalized read count of at least 10 reads for both crosses were considered (Table S2). Parental expression bias (B) of window and genic features were calculated from totals of normalised read counts across each feature. For example, maternal bias was calculated as shown below.Where Bmat is maternal expression bias, Rmat and Rpat are the total normalized read count across the feature allocated to the maternal genome and paternal genome respectively, and Re is the expected read count as calculated from proportioning the sum of Rmat and Rpat into the expected maternal∶paternal contributions (2∶1 in the endosperm; 1∶1 in the embryo) (Figure S2). Statistical significance (−log10P) was assessed using a Pearson chi-square test (degrees of freedom = 1). Deviation from expected contributions was considered statistically significant at −log10P≥1.3 (P≥0.05). Putatively imprinted features were identified as those that showed concordant statistically significant bias in both crosses associated with the gender and not the subspecies of the parent. Subspecies dependent features were identified as those that showed significant bias concordant between crosses and associated with the subspecies of the parent and not gender. Where evidence from both crosses was combined, single bias and −log10P values were calculated as the minimum (least significant) of the two crosses.

Genome-wide scans for involvement of alternate splicing were completed in two ways. Firstly, the genic feature analysis considered all annotated alternate transcript isoforms. Secondly, a modification of the sliding window analysis allowed us to investigate the dataset for the involvement of novel, unannotated alternate splicing in transcribed regions. In both cases, the data was screened for transcripts that contained genic features or parts of genic features displaying a mixture of imprinting status. All candidates proposed by these methods were closely inspected with regard to read distribution relative to reported annotations. In cases where annotated alternate transcripts shared identical support (read counts), these were collapsed and compound annotated (Table S3). Similarly, in cases where intergenic regions were implicated and it was not possible to unambiguously determine functional association with a single neighbouring gene, these findings were collapsed and compound annotated (Table S3).

Functional characterization of imprinted features

Putatively biased genic features in both embryo and endosperm were investigated for proximity to transposons and repeats with the rice genome and differential GC content (Tables S4 and S5). Transposon and repeat annotation were obtained from a public rice genome browser (http://rice.plantbiology.msu.edu/cgi-bin/gbrowse/rice/). At the time of download this annotation contained predictions of 264,562 repeats and transposons including MITE-like (43.7%), transposon (22.2%), retrotransposon LINE-like (0.1%), retrotransposon SINE-like (2.7%), retrotransposon unclassified (22.1%) and unclassified repetitive elements (5.4%). The analysis investigated both proximity and density of transposons and repeats to imprinted cDNAs relative to similar measures for all annotated cDNAs in the genome. Genic features were annotated with gene function ontology terms (GO terms) and GO enrichment analysis was performed using a hypergeometric test for enrichment of terms relative to expected frequencies. Expected frequencies were calculated from the annotation of all transcripts found to contain at least 10 reads in the dataset.

Identified imprinted rice candidates were investigated for sequence homology to previously reported candidates from maize [35][40] and Arabidopsis [28], [52, Wolff et al. 2011 unpublished] using blast [64]. Candidates from maize and Arabidopsis were aligned to the rice genome (Osa1, Release 6.1) and high-scoring (E-value<1e-10) blasts hits were ordered by increasing E-value. If any of the rice candidates were identified amongst the blast hits its highest (most significant) rank was recorded. Similarly, candidates from this study in rice were aligned to the maize (B73 RefGen_v2) and Arabidopsis genome (TAIR9). Consequently, the rank of 1 means that this is the best match reported in the target genome. Subsequent rank numbers can be interpreted as a measure of the number of genes (rank - 1) with better sequence homology in the target genome than the match reported. This approach not only identifies sequence homology but indicates the specificity of the homology relative to other possible sequence matches throughout the genome which is particularly relevant for comparisons in large gene families.

Gene expression and confirmation of imprinting with RT-PCR sequencing

Primers for RT-PCR sequencing and gene expression were designed to be flanking at least one SNP and, if possible, at least one intron. About 0.5 µg total RNA from various tissues was used for cDNA synthesis using an Invitrogen Superscript III Reverse Transcriptase kit. The 20 ul PCR mix contained 2 ul of 10× buffer, 1.3 ul of MgCl2 (25 mM), 1 ul of dNTPs (5 mM), 0.5 ul of each primer (20 uM) and 1 unit Taq DNA polymerase (Perkin Elmer). The Thermal cycler programme includes an initial incubation of 3 minutes at 95°C, followed by 27–35 cycles, depending on the experimentally determined level of gene expression, where each cycle consisted of 20 s at 95°C, 20 s at 55°C and 1 min at 72°C, followed by 2 min at 72°C and 1 min at 25°C. PCR products were visualised on a 1.5% agarose gel with ethidium bromide. RT-PCR products generated from F1 endosperm and embryo were submitted for direct sequencing. RT-PCR products for Os02g55560 were subcloned using pGEM-T Easy Vector System (Promega) and then sequenced. Several imprinted genes were also confirmed by allelic RT-PCR.

Confirmation of parent of origin-specific alternative splicing of Os01g70060

For a map of the gene and locations of primers utilized see Figure 2 and Figure S5. A GeneRacer Kit (Invitrogen) was initially used to examine the 3′ end of Os01g70060 transcripts from the endosperm of reciprocal crosses between Nip and 93-11. After reverse transcription using the GeneRacer Oligo dT Primer according to the manufacturer recommended protocol, primers from exon 3 (3eRace), intron 3 (3iRace) and exon 8 (8eRace) were used with GeneRacer 3′ Primer to PCR-amplify the RT products. These RT-PCR products were directly sequenced or subcloned and sequenced. Additional pairs of primers were designed from intron 3 or exons to detect detailed information regarding the parent of origin of the transcripts. These primers include a pair from intron 3 (3iF and 3iR), a pair from exon 2 and exon 3 (2eF and 3eR), and a pair from exon 3 and exon 8 (3eF and 8eR). Primers 3eF and 3iR-1 were used to detect transcripts proceeding from exon 3 into the adjacent intron 3. Note that 3eF overlaps with 3eRace. 3iRace is complementary to 3iR-1. For additional information refer to text.

Sperm isolation, oligonucleotide microarray hybridization, and data collection

Sperm cells were isolated from mature anthers of field-grown rice (Oryza sativa subspecies Japonica, cultivar ‘Katy’). A centrifugation-based separation method was used for isolating sperm cells from mature pollen grains [65]. Total RNA was purified using the RNeasy plant mini kit according to manufacturer's instructions (Qiagen, http://www.qiagen.com/). For sperm cell RNA, we could not accurately determine concentration because of limited material. All accumulated isolated sperm RNA and 100 ng total RNA of seedlings and mature pollen were used for probe preparation for each of the three biological/replicates performed. Amplified sperm RNAs and pollen RNAs were used to test specificity of sperm isolation with qPCR. An 82-fold difference for Ory s 1 (a sperm specific gene) [66] transcripts and 15 fold difference for GCS1 (HAP2) (a specific sperm gene) [67] homolog transcripts between pollen and sperm cells confirmed the specificity of sperm cells.

Since the amount of starting total RNA was low (in the range of 10–100 ng per sample for sperm), the Affymetrix GeneChip Two-Cycle cDNA Synthesis Kit (Affymetrix, Santa Clara, CA, USA, http://www.affymetrix.com/) was used for target preparation with signal amplification. A mixture containing 15 µg of fragmented cRNA was hybridized to the Affymetrix 57K Rice Genome GeneChip at 45°C for 16 h. Subsequent washing and staining steps were performed on a GeneChip Fluidics Station 450 and the chips were scanned on a GeneChip Scanner 3000. The microarray data generated from all chips met quality control criteria set by Affymetrix. Materials are archived on NCBI Gene Expression Omnibus (GEO) as a data set within experimental series GSE17002. For determination of sperm expression, data was background subtracted, where background was considered to be 5% of signal and normalised using the dChip MBEI method [68]. Expression levels were averaged for the three replicates. We used the dChip protocol to determine the presence or absence calls in sperm cells (http://biosun1.harvard.edu/complab/dchip/). In this approach, P≤0.05 was declared present (P), marginal (M) was defined as 0.05>P≤0.065 and p>0.065 was considered absent (A). Expression status of imprinted genes in sperm can be seen in Table S3.

Supporting Information

Attachment 1

Attachment 2

Attachment 3

Attachment 4

Attachment 5

Attachment 6

Attachment 7

Attachment 8

Attachment 9

Attachment 10

Attachment 11

Attachment 12

Attachment 13

Attachment 14


1. FeilRBergerF 2007 Convergent evolution of genomic imprinting in plants and mammals. Trends Genet 23 192 9

2. RaissigMTBarouxCGrossniklausU 2011 Regulation and Flexibility of Genomic Imprinting during Seed Development. Plant Cell 10.1105/tpc.110.081018. [Epub ahead of print]

3. KöhlerCWeinhofer-MolischI 2010 Mechanisms and evolution of genomic imprinting in plants. Heredity 105 57 63

4. IderaabdullahFYVigneauSBartolomeiMS 2008 Genomic imprinting mechanisms in mammals. Mutat Res 647 77 85

5. ChoiYGehringMJohnsonLHannonMHaradaJJ 2002 DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in Arabidopsis. Cell 110 33 42

6. GehringMHuhJHHsiehTFPentermanJChoiY 2006 DEMETER DNA glycosylase establishes MEDEA Polycomb gene self-imprinting by allele-specific demethylation. Cell 124 495 506

7. KinoshitaTMiuraAChoiYKinoshitaYCaoX 2004 One-way control of FWA imprinting in Arabidopsis endosperm by DNA methylation. Science 303 521 523

8. JullienPEKinoshitaTOhadNBergerF 2006 Maintenance of DNA methylation during the Arabidopsis life cycle is essential for parental imprinting. Plant Cell 18 1360 1372

9. JullienPEMosqunaAIngouffMSakataTOhadN 2008 Retinoblastoma and its binding partner MSI1 control imprinting in Arabidopsis. PLoS Biol 6 e194 doi:10.1371/journal.pbio.0060194

10. JullienPEKatzAOlivaMOhadNBergerF 2006 Polycomb group complexes self-regulate imprinting of the Polycomb group gene MEDEA in Arabidopsis. Curr Biol 16 486 492

11. KöhlerCPageDRGagliardiniVGrossniklausU 2005 The Arabidopsis thaliana MEDEA Polycomb group protein controls expression of PHERES1 by parental imprinting. Nat Genet 37 28 30

12. BarouxCGagliardiniVPageDRGrossniklausU 2006 Dynamic regulatory interactions of Polycomb group genes: MEDEA autoregulation is required for imprinted gene expression in Arabidopsis. Genes Dev 20 1081 1086

13. MakarevichGVillarCBErilovaAKöhlerC 2008 Mechanism of PHERES1 imprinting in Arabidopsis. J Cell Sci 121 906 912

14. Fitz GeraldJNHuiPSBergerF 2009 Polycomb group-dependent imprinting of the actin regulator AtFH5 regulates morphogenesis in Arabidopsis thaliana. Development 136 3399 3404

15. Gutiérrez-MarcosJFCostaLMDal PràMScholtenSKranzE 2006 Epigenetic asymmetry of imprinted genes in plant gametes. Nat Genet 38 876 8

16. HaunWJSpringerNM 2008 Maternal and paternal alleles exhibit differential histone methylation and acetylation at maize imprinted genes. Plant J 56 903 12

17. BabakTDevealeBArmourCRaymondCClearyMA 2008 Global survey of genomic imprinting by transcriptome sequencing. Curr Biol 18 1735 41

18. GreggCZhangJWeissbourdBLuoSSchrothGP 2010 High-Resolution Analysis of Parent-of-Origin Allelic Expression in the Mouse Brain. Science 329 643 648

19. TyckoBMorisonIM 2002 Physiological functions of imprinted genes. Journal of Cellular Physiology 192 245 58

20. WoodAJOakeyRJ 2006 Genomic Imprinting in Mammals: Emerging Themes and Established Theories. PLoS Genet 2 e147 doi:10.1371/journal.pgen.0020147

21. HirasawaRFeilR 2010 Genomic imprinting and human disease. Essays Biochem 48 187 200

22. FrostJMMooreGE 2010 The importance of imprinting in the human placenta. PLoS Genet 6 e1001015 doi:10.1371/journal.pgen.1001015

23. ScottRJSpielmanMBaileyJDickinsonHG 1998 Parent-of-origin effects on seed development in Arabidopsis thaliana. Development 125 3329 41

24. LinBY 1984 Ploidy barrier to endosperm development in maize. Genetics 107 103 115

25. BirchlerJA 1993 Dosage analysis of maize endosperm development. Ann Rev Genet 27 181 204

26. DrewsGNYadegariR 2002 Development and function of the angiosperm female gametophyte. Annu Rev Genet 36 99 124

27. JahnkeSScholtenS 2009 Epigenetic resetting of a gene imprinted in plant embryos. Curr Biol 19 1677 1681

28. HsiehTFShinJUzawaRSilvaPCohenS 2011 Inaugural Article: Regulation of imprinted gene expression in Arabidopsis endosperm. Proc Natl Acad Sci U S A 108 1755 1762

29. HaigDWestobyM 1989 Parent specific gene expression and the triploid endosperm. Am Nature 134 147 155

30. ChaudhuryAMMingLMillerCCraigSDennisES 1997 Fertilization-independent seed development in Arabidopsis thaliana. Proc Natl Acad Sci USA 94 4223 4228

31. GrossniklausUVielle-CalzadaJPHoeppnerMAGaglianoWB 1998 Maternal control of embryogenesis by MEDEA, a polycomb group gene in Arabidopsis. Science 280 446 450

32. DilkesBComaiL 2004 A Differential Dosage Hypothesis for Parental Effects in Seed Development. The Plant Cell 16 3174 3180

33. JosefssonCDilkesBComaiL 2006 Parent-dependent loss of gene silencing during interspecies hybridization. Curr Biol 16 1322 1328

34. KinoshitaT 2007 Reproductive barrier and genomic imprinting in the endosperm of flowering plants. Genes Genet Syst 82 177 86

35. Gutierrez-MarcosJFPenningtonPDCostaLMDickinsonHG 2003 Imprinting in the endosperm: a possible role in preventing wide hybridization. Philos Trans R Soc Lond B Biol Sci 358 1105 11 Review

36. GuoMRupeMADanilevskayaONYangXHuZ 2003 Genome-wide mRNA profiling reveals heterochronic allelic variation and a new imprinted gene in hybrid maize endosperm. Plant J 36 30 44

37. StuparRMHermansonPJSpringerNM 2007 Nonadditive expression and parent-of-origin effects identified by microarray and allele-specific expression profiling of maize endosperm. Plant Physiol 145 411 25

38. DanilevskayaONHermonPHantkeSMuszynskiMGKolliparaK 2003 Duplicated fie genes in maize: expression pattern and imprinting suggest distinct functions. Plant Cell 15 425 438

39. Gutiérrez-MarcosJFCostaLMBiderre-PetitCKhbayaBO'SullivanDM 2004 maternally expressed gene1 is a novel maize endosperm transfer cell-specific gene with a maternal parent-of-origin pattern of expression. Plant Cell 16 1288 1301

40. HermonPSrilunchangKOZouJDresselhausTDanilevskayaON 2007 Activation of the imprinted Polycomb group Fie1 gene in maize endosperm requires demethylation of the maternal allele. Plant Mol Biol 64 387 395

41. LuoMPlattenDChaudhuryAPeacockWJDennisES 2009 Expression, imprinting, and evolution of rice homologs of the polycomb group genes. Mol Plant 2 711 23

42. IshikawaROhnishiTKinoshitaYEiguchiMKurataN 2010 Rice interspecies hybrids show precocious or delayed developmental transitions in the endosperm without change to the rate of syncytial nuclear division. Plant J 65 798 806

43. ItohJNonomuraKIkedaKYamakiSInukaiY 2005 Rice plant development: from zygote to spikelet. Plant Cell Physiol 46 23 47

44. WoodAJSchulzRWoodfineKKoltowskaKBeecheyCV 2008 Regulation of alternative polyadenylation by genomic imprinting. Genes Dev 22 1141 6

45. ThorvaldsenJLBartolomeiMS 2007 SnapShot: Imprinted gene clusters. Cell 130 958

46. de Dios AlchéJM'rani-AlaouiMCastroAJRodríguez-GarcíaMI 2004 Ole e 1, the major allergen from olive (Olea europaea L.) pollen, increases its expression and is released to the culture medium during in vitro germination. Plant Cell Physiol 45 1149 57

47. BayerMNawyTGiglioneCGalliMMeinnelT 2009 Paternal control of embryonic patterning in Arabidopsis thaliana. Science 323 1485 8

48. DenisCLChenJ 2003 The CCR4-NOT complex plays diverse roles in mRNA metabolism. Prog Nucleic Acid Res Mol Biol 73 221 50

49. SpassovDJurecicR 2003 The PUF family of RNA-binding proteins: does evolutionarily conserved structure equal conserved function? IUBMB Life 55 359 366

50. GonzalezDBowenAJCarrollTSConlanRS 2007 The transcription corepressor LEUNIG interacts with the histone deacetylase HDA19 and mediator components MED14 (SWP) and CDK8 (HEN3) to repress transcription. Mol Cell Biol 27 5306 5315

51. KatoHTjernbergAZhangWKrutchinskyANAnW 2002 SYT associates with human SNF/SWI complexes and the C-terminal region of its fusion partner SSX1 targets histones. J Biol Chem 277 5498 505

52. GehringMBubbKLHenikoffS 2009 Extensive demethylation of repetitive elements during seed development underlies gene imprinting. Science 324 1447 1451

53. XieZJohansenLKGustafsonAMKasschauKDLellisAD 2004 Genetic and Functional Diversification of Small RNA Pathways in Plants. PLoS Biol 2 e104 doi:10.1371/journal.pbio.0020104

54. JacksonJPJohnsonLJasencakovaZZhangXPerezBurgosL 2004 Dimethylation of histone H3 lysine 9 is a critical mark for DNA methylation and gene silencing in Arabidopsis thaliana. Chromosoma 112 308 315

55. OgasJKaufmannSHendersonJSomervilleC 1999 PICKLE is a CHD3 chromatin-remodeling factor that regulates the transition from embryonic to vegetative development in Arabidopsis. Proc Natl Acad Sci U S A 96 13839 44

56. HeGZhuXEllingAAChenLWangX 2010 Global epigenetic and transcriptional trends among two rice subspecies and their reciprocal hybrids. The Plant Cell 22 17 33

57. RenfreeMBAgerEIShawGPaskAJ 2008 Genomic imprinting in marsupial placentation. Reproduction 136 523 31

58. HsiehTFIbarraCASilvaPZemachAEshed-WilliamsL 2009 Genome-wide demethylation of Arabidopsis endosperm. Science 324 1451 1454

59. KraftEBostickMJacobsenSECallisJ 2008 ORTH/VIM proteins that regulate DNA methylation are functional ubiquitin E3 ligases. Plant J 56 704 715

60. ZemachAKimMYSilvaPRodriguesJADotsonB 2010 Local DNA hypomethylation activates genes in rice endosperm. Proc Natl Acad Sci U S A 107 18729 34

61. LiRLiYKristiansenKWangJ 2008 SOAP: short oligonucleotide alignment program. Bioinformatics 24 713 4

62. International Rice Genome Sequencing Project 2005 The map-based sequence of the rice genome. Nature 436 793 800

63. YuJHuSWangJWongGKLiS 2002 A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Sciense 296 79 92

64. AltschulSFGishWMillerWMyersEWLipmanDJ 1990 Basic local alignment search tool. J Mol Biol 215 403 10

65. GouXPWangSHChenF 1999 Isolation and cytological observation of viable sperm cells of rice. Acta Botanica Sinica 41 669 674

66. XuHTheerakulpisutPGouldingNSuphiogluCSinghMB 1995 Cloning, expression and immunological characterization of Ory s 1, the major allergen of rice pollen. Gene 164 255 9

67. von BesserKFrankACJohnsonMAPreussD 2006 Arabidopsis HAP2 (GCS1) is a sperm-specific gene required for pollen tube guidance and fertilization. Development 133 4761 9

68. LiC 2008 Automating dChip: toward reproducible sharing of microarray data analysis. BMC Bioinformatics 9 231

Genetika Reprodukční medicína

Článek vyšel v časopise

PLOS Genetics

2011 Číslo 6

Nejčtenější v tomto čísle
Zapomenuté heslo

Nemáte účet?  Registrujte se

Zapomenuté heslo

Zadejte e-mailovou adresu, se kterou jste vytvářel(a) účet, budou Vám na ni zaslány informace k nastavení nového hesla.


Nemáte účet?  Registrujte se