Rules for designing protein fold switches and their implications for the folding code
We have engineered switches between the three most common small folds, 3a, 4b+a, and a/b plait, referred to here as A, B, and S, respectively. Mutations were introduced into the natural S protein until sequences were created that have a stable S-fold in their longer (~90 amino acid) form and have an alternative fold (either A or B) in their shorter (56 amino acid) form. Five sequence pairs were designed and key structures were determined using NMR spectroscopy. Each protein pair is 100% identical in the 56 amino acid region of overlap. Several rules for engineering switches emerged. First, designing one sequence with good native state interactions in two folds requires care but is feasible. Once this condition is met, fold populations are determined by the stability of the embedded A- or B-fold relative to the S-fold and the conformational propensities of the ends that are generated in the switch to the embedded fold. If the stabilities of the embedded fold and the longer fold are similar, conformation is highly sensitive to mutation so that even a single amino acid substitution can radically shift the population to the alternative fold. The results provide insight into why dimorphic sequences can be engineered and sometimes exist in nature, while most natural protein sequences populate single folds. Proteins may evolve toward unique folds because dimorphic sequences generate interactions that destabilize and can produce aberrant functions. Thus two-state behavior may result from nature's negative design rather than being an inherent property of the folding code.