Corpus Statistics for Empirical Translation Studies
In recent years a number of authors have made good use of statistical texts in empirical translation studies. These tests are well established in the scientific literature but have only recently been applied to the comparison of original and translated texts for the identification of the characteristics of “translationese.” There has also been interest in the comparison between professional and student translations and between machine and human translation. In this chapter, various statistical tests are examined in the context of real-world empirical studies in translation: analysis of variance and Tukey’s “honestly significant difference” test, the chi-squared test and the G-statistic, and the visualization techniques of hierarchical cluster analysis and principal component analysis. The chapter finishes with a discussion of the linguistic features chosen or found to characterize the original and translated texts.