The non-neutral, multi-level, phenotypic impact of synonymous mutations revealed through heterologous gene expression in human cells.
Redundancy in the genetic code allows for differences in transcription and/or translation efficiency between mRNA molecules carrying synonymous polymorphisms, with potential phenotypic impact at the molecular and at the organismal level. A combination of neutral and selective processes determines the global genome codon usage preferences, as well as local differences between genes within a genome and between positions along a single gene. The relative contribution of evolutionary forces at shaping codon usage bias in eukaryotes is a matter of debate, especially in mammals. The main riddle remains understanding the sharp contrast between the strong molecular impact of gene expression differences arising from codon usage preferences and the thin evidence for codon usage selection at the organismal level. Here we report a multiscale analysis of the consequences of alternative codon usage on heterologous gene expression in human cells. We generated synonymous versions of the shble antibiotic resistance gene, fused to a fluorescent reporter, and expressed independently them in human HEK293 cells. We analysed: i) mRNA-to-DNA and protein-to-mRNA ratios for each shble version; ii) cellular fluorescence, using flow cytometry, as a proxy for single cell-level construct expression; and iii) real-time cell proliferation in absence or presence of antibiotic, as a proxy for the cellular fitness. Our results show that differences in codon usage preferences in our focal gene strongly impacted the molecular and the cellular phenotype: i) they elicited large differences in mRNA and in protein levels, as well in mRNA-to-protein ratio; ii) they introduced splicing events not predicted by current algorithms; iii) they lead to reproducible phenotypic heterogeneity as different multimodal distributions of cellular fluorescence EGFP; iv) they resulted in a trade-off between burden of heterologous expression and antibiotic resistance. While certain codon usage-related variables monotonically correlated with protein expression, other variables (e.g. CpG content or mRNA folding energy) displayed a bell-like behaviour. We interpret that codon usage preferences strongly shape the molecular and cellular phenotype in human cells through a direct impact on gene expression.