Duplicitous Diabolos
This paper briefly discusses markup, metadata and evaluation issues that arise when projects do not include a critical edition adjudicating different variants, but instead incorporate multiple, full diplomatic transcriptions. When used naively, such corpora will cause duplicate results that are hard to discern in quantitative studies, and in cases of incomplete, unexact or fragmentary parallel witnesses, substantially complicate the decision about what users actually want to have. Using a case study on Coptic manuscripts, the paper suggests that as a provisional strategy, documents should be partitioned as finely grained as necessary such that each section's parallel witness status is encoded, and that for each parallel set, it can be useful to define a redundancy metadatum which identifies the 'best' candidate for quantitative study among the available choices.