Evaluation of cell segmentation methods without reference segmentations
AbstractCell segmentation is a cornerstone of many bioimage informatics studies. Inaccurate segmentation introduces computational error in downstream cellular analysis. Evaluating the segmentation results is thus a necessary step for developing the segmentation methods as well as choosing the most appropriate one for a certain kind of tissue or image. The evaluation process has typically involved comparison of segmentations to those generated by humans, which can be expensive and subject to unknown bias. We present here an approach that seeks to evaluate cell segmentation methods without relying upon comparison to results from humans. For this, we defined a number of segmentation quality metrics that can be applied to multichannel fluorescence images. We calculated these metrics for 11 previously-described segmentation methods applied to datasets from 5 multiplexed microscope modalities covering 5 tissues. Using principal component analysis to combine the metrics we defined an overall cell segmentation quality score and ranked the segmentation methods. A Reproducible Research Archive containing all data and code will be made available upon publication at http://hubmap.scs.cmu.edu.