Existing Elaeis guineensis cultivars lack sufficient genetic diversity due to extensive breeding. Harnessing variation in wild crop relatives is necessary to expand the breadth of agronomically valuable traits. Using restriction site-associated DNA (RAD) sequencing, we examine the natural diversity of wild populations of the American oil palm (Elaeis oleifera), a sister species of the cultivated African oil palm (Elaeis guineensis). We genotyped 192 wild E. oleifera palms collected from seven Latin American countries along with four cultivated E. guineensis palms. Palms from Honduras, Costa Rica, Panama and Colombia are panmictic and genetically similar. Genomic patterns of diversity suggest that these populations likely originated from the Amazon Basin. Despite evidence of a genetic bottleneck and high inbreeding in these populations, there is considerable genetic and phenotypic variation for agronomically valuable traits. Genome-wide association analysis revealed several candidate genes associated with fatty acid composition as well as vegetative and yield-related traits. These observations provide valuable insight into the geographic distribution of diversity, phenotypic variation and its genetic architecture, which will guide the choice of wild genotypes for crop improvement.
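As an illustration of how inbreeding can be quantified from SNP genotypes such as those produced by RAD sequencing, the sketch below computes a per-locus inbreeding coefficient, F = 1 - Ho/He, where Ho is the observed heterozygosity and He is the heterozygosity expected under Hardy-Weinberg equilibrium. This is one common estimator and is not necessarily the one used in the study. The function name, the 0/1/2 genotype coding and the example genotypes are assumptions made for illustration only.

```python
def inbreeding_coefficient(genotypes):
    """Per-locus inbreeding coefficient F = 1 - Ho/He for one biallelic SNP.

    genotypes: list of alternate-allele counts per palm (0, 1 or 2),
    with None for a missing call. This coding is an assumption of the sketch.
    Returns None when F is undefined (no calls, or a monomorphic locus).
    """
    called = [g for g in genotypes if g is not None]
    n = len(called)
    if n == 0:
        return None

    # Alternate-allele frequency across the 2n sampled chromosomes.
    p = sum(called) / (2 * n)

    # Expected heterozygosity under Hardy-Weinberg equilibrium.
    expected_het = 2 * p * (1 - p)
    if expected_het == 0:
        return None  # monomorphic locus: F is undefined

    # Observed fraction of heterozygous palms.
    observed_het = sum(1 for g in called if g == 1) / n

    return 1 - observed_het / expected_het


# Hypothetical genotypes, not data from the study.
only_homozygotes = [0, 0, 2, 2]        # no heterozygotes at all
hardy_weinberg_like = [0, 1, 1, 2]     # heterozygotes at the expected rate
f_inbred = inbreeding_coefficient(only_homozygotes)
f_random = inbreeding_coefficient(hardy_weinberg_like)
```

Values of F near 1 indicate a strong deficit of heterozygotes, as expected under high inbreeding, whereas values near 0 are consistent with random mating. In practice the estimate would be averaged over many loci rather than read from a single SNP.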
Evaluation of transcriptome data in combination with quantitative trait locus (QTL) information has been applied in many crops to study the expression of genes responsible for specific phenotypes. In oil palm, the mesocarp oil extracted from E. oleifera × E. guineensis interspecific hybrids is known to have a lower palmitic acid (C16:0) content than that of pure African palms. The present study demonstrates the effectiveness of transcriptome data in revealing the expression profiles of genes in the fatty acid (FA) and triacylglycerol (TAG) biosynthesis pathways in interspecific hybrids. The transcriptome assembly yielded 43,920 putative genes, of which a large proportion were homologous to known genes in public databases. Most of the genes encoding key enzymes involved in the FA and TAG synthesis pathways were identified. Of these, 27, including two candidate genes located within the QTL associated with C16:0 content, showed differential expression between developmental stages, populations and/or palms with contrasting C16:0 content. Further evaluation using quantitative real-time PCR revealed differential expression patterns that were generally consistent with those observed in the transcriptome data. Our results also suggest that different isoforms are likely to be responsible for some of the variation observed in the FA composition of interspecific hybrids.
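One simple way to check whether quantitative real-time PCR results agree with transcriptome data is to express both as log2 fold changes between two conditions and count the genes whose direction of change matches. The sketch below is a minimal illustration under stated assumptions: the qPCR fold change uses the ΔΔCt approach with an assumed amplification efficiency of about 100%, the transcriptome fold change uses a pseudocount of 1, and all function names, gene labels and values are hypothetical. It is not the analysis pipeline of the study.

```python
import math


def rnaseq_log2_fold_change(expr_a, expr_b, pseudocount=1.0):
    """log2(A/B) from normalized expression values; the pseudocount avoids log(0)."""
    return math.log2((expr_a + pseudocount) / (expr_b + pseudocount))


def qpcr_log2_fold_change(ct_target_a, ct_ref_a, ct_target_b, ct_ref_b):
    """log2(A/B) by the delta-delta-Ct approach.

    Assumes roughly 100% amplification efficiency, so one Ct cycle
    corresponds to a two-fold difference in template abundance.
    """
    delta_ct_a = ct_target_a - ct_ref_a  # target normalized to reference gene, condition A
    delta_ct_b = ct_target_b - ct_ref_b  # same for condition B
    return -(delta_ct_a - delta_ct_b)    # lower Ct means higher expression


def direction_agreement(rnaseq_lfc, qpcr_lfc):
    """Fraction of shared genes whose fold changes have the same sign."""
    shared = [gene for gene in rnaseq_lfc if gene in qpcr_lfc]
    if not shared:
        return None
    agree = sum(1 for gene in shared if rnaseq_lfc[gene] * qpcr_lfc[gene] > 0)
    return agree / len(shared)


# Hypothetical values for three placeholder genes, not data from the study.
rnaseq = {"gene1": 2.0, "gene2": -1.0, "gene3": 0.5}
qpcr = {"gene1": 3.0, "gene2": 0.4, "gene3": 1.0}
agreement = direction_agreement(rnaseq, qpcr)
```

Sign agreement is a deliberately coarse criterion. A correlation between the two sets of log2 fold changes would give a finer-grained comparison when enough genes are assayed.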