Genome-wide identification of poplar malectin/malectin-like domain-containing proteins and in-silico expression analyses find novel candidates for signaling and regulation of wood development
Abstract

Background: The malectin domain (MD) is a ligand-binding protein motif found in both prokaryotes and eukaryotes. It is particularly abundant in Viridiplantae, where it occurs either as a single domain (MD, PF11721) or as a tandemly duplicated domain (PF12819) called the malectin-like domain (MLD). In herbaceous plants, MD- or MLD-containing proteins (MD proteins) are known to regulate development, reproduction, and resistance to various stresses. However, their functions in woody plants have not yet been studied. To unravel their potential role in wood development, we carried out a genome-wide identification of MD proteins in the model tree species black cottonwood (Populus trichocarpa) and analyzed their in-silico expression and co-expression networks.

Results: P. trichocarpa had 146 MD genes assigned to 14 different clades, two of which were specific to the genus Populus. Of these genes, 87% were located on chromosomes, and the rest were associated with scaffolds. Based on their protein domain organization, and in agreement with their exon-intron structures, the identified MD genes could be classified into five superclades with the following domain architectures: leucine-rich repeat (LRR)-MD-protein kinase (PK), MLD-LRR-PK, MLD-PK (CrRLK1L), MLD-LRR, and MD-Kinesin. Whereas the majority of MD genes were highly expressed in leaves, particularly under stress conditions, eighteen showed a peak of expression during secondary wall formation. Their co-expression networks suggested signaling functions in cell wall integrity, pathogen-associated molecular pattern, calcium, reactive oxygen species (ROS), and hormone pathways.

Conclusion: P. trichocarpa MD genes exhibit a variety of domain organizations and include genes apparently specific to Populus, as well as genes potentially involved in signaling pathways that regulate secondary wall formation.