Domesticated cannabinoid synthases amid a wild mosaic cannabis pangenome

成果类型:
Article
署名作者:
Lynch, Ryan C.; Padgitt-Cobb, Lillian K.; Garfinkel, Andrea R.; Knaus, Brian J.; Hartwick, Nolan T.; Allsing, Nicholas; Aylward, Anthony; Bentz, Philip C.; Carey, Sarah B.; Mamerto, Allen; Kitony, Justine K.; Colt, Kelly; Murray, Emily R.; Duong, Tiffany; Chen, Heidi I.; Trippe, Aaron; Harkess, Alex; Crawford, Seth; Vining, Kelly; Michael, Todd P.
署名单位:
Salk Institute; Oregon State University; HudsonAlpha Institute for Biotechnology; University of California System; University of California San Diego; University of California System; University of California San Diego
刊物名称:
Nature
ISSN/ISSBN:
0028-2339
DOI:
10.1038/s41586-025-09065-0
发表日期:
2025-07-24
关键词:
variant call format genome arabidopsis alignment provides BIAS thioesterases performance generation DISCOVERY
摘要:
Cannabis sativa is a globally important seed oil, fibre and drug-producing plant species. However, a century of prohibition has severely restricted development of breeding and germplasm resources, leaving potential hemp-based nutritional and fibre applications unrealized. Here we present a cannabis pangenome, constructed with 181 new and 12 previously released genomes from a total of 144 biological samples including both male (XY) and female (XX) plants. We identified widespread regions of the cannabis pangenome that are surprisingly diverse for a single species, with high levels of genetic and structural variation, and propose a novel population structure and hybridization history. Across the ancient heteromorphic X and Y sex chromosomes, we observed a variable boundary at the sex-determining and pseudoautosomal regions as well as genes that exhibit male-biased expression, including genes encoding several key flowering regulators. Conversely, the cannabinoid synthase genes, which are responsible for producing cannabidiol acid and delta-9-tetrahydrocannabinolic acid, contained very low levels of diversity, despite being embedded within a variable region with multiple pseudogenized paralogues, structural variation and distinct transposable element arrangements. Additionally, we identified variants of acyl-lipid thioesterase genes that were associated with fatty acid chain length variation and the production of the rare cannabinoids, tetrahydrocannabivarin and cannabidivarin. We conclude that the C. sativa gene pool remains only partially characterized, the existence of wild relatives in Asia is likely and its potential as a crop species remains largely unrealized.