Systematic analysis of the R2R3-MYB family of transcription factors in Camellia sinensis: evidence for species-specific catechin biosynthesis regulation
Tea from Camellia sinensis is one of the most popular beverages worldwide, lauded for its charming flavors and health-promoting properties. C. sinensis produces an abundance of specialized metabolites, which makes it an excellent model for digging into the genetic regulation of plant-specific metabolite biosynthesis. The most abundant health-promoting metabolites in tea are galloylated catechins, and the most bioactive of the galloylated catechins, epigallocatechin gallate (EGCG), is exclusively found in C. sinensis. The R2R3-MYB transcription factor family regulates metabolism of phenylpropanoids, the precursors to catechins, in various plant lineages. However, the transcriptional regulation of galloylated catechin biosynthesis remains elusive. Species-expanded or specific MYB transcription factors may regulate species-specific metabolite biosynthesis. This study mined the R2R3-MYB transcription factors associated with galloylated catechin biosynthesis in C. sinensis. A total of 118 R2R3-MYB proteins, classified into 38 subgroups, were identified. R2R3-MYB subgroups specific to or expanded in C. sinensis were hypothesized to be essential to evolutionary diversification of tea-specific metabolites. Notably, nine of these R2R3-MYB genes were expressed preferentially in apical buds and young leaves, exactly where galloylated catechins accumulate. Three putative R2R3-MYB genes displayed strong correlation with key galloylated catechin biosynthesis genes, suggesting a role in regulating biosynthesis of epicatechin gallate (ECG) and EGCG. Overall, this study paves the way to reveal the transcriptional regulation of galloylated catechins in C. sinensis.