Label Aggregation for Crowdsourced Triplet Similarity Comparisons

2021 ◽  
pp. 176-185
Author(s):  
Jiyi Li ◽  
Lucas Ryo Endo ◽  
Hisashi Kashima
Mathematics ◽  
2021 ◽  
Vol 9 (8) ◽  
pp. 875
Author(s):  
Jesus Cerquides ◽  
Mehmet Oğuz Mülâyim ◽  
Jerónimo Hernández-González ◽  
Amudha Ravi Shankar ◽  
Jose Luis Fernandez-Marquez

Over the last decade, hundreds of thousands of volunteers have contributed to science by collecting or analyzing data. This public participation in science, also known as citizen science, has contributed to significant discoveries and led to publications in major scientific journals. However, little attention has been paid to data quality issues. In this work, we argue that determining the accuracy of data obtained by crowdsourcing is a fundamental question, and we point out that, for many real-life scenarios, mathematical tools and processes for evaluating data quality are missing. We propose a probabilistic methodology for evaluating the accuracy of labels obtained by crowdsourcing in citizen science. The methodology builds on an abstract probabilistic graphical model formalism, which is shown to generalize several existing label aggregation models. We show how to make practical use of the methodology through a comparison of data obtained from different citizen science communities analyzing the earthquake that took place in Albania in 2019.
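As a point of reference for the kind of label aggregation model such a probabilistic formalism generalizes, the following is a minimal sketch of a classic Dawid-Skene-style EM aggregator (not the authors' framework); the function name, variable names, and toy data are hypothetical.

```python
# Minimal sketch of a Dawid-Skene-style EM label aggregator: it infers a
# posterior over true labels together with a per-annotator confusion matrix.
# This is a generic baseline, not the paper's probabilistic graphical model.
import numpy as np

def dawid_skene(votes, n_classes, n_iter=50):
    """votes: (n_items, n_annotators) int array, -1 marks a missing vote."""
    n_items, n_annotators = votes.shape
    # Initialize the posterior over true labels from per-item vote counts.
    post = np.zeros((n_items, n_classes))
    for i in range(n_items):
        for v in votes[i]:
            if v >= 0:
                post[i, v] += 1.0
    post /= post.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # M-step: class prior and each annotator's confusion matrix.
        prior = post.mean(axis=0)
        conf = np.full((n_annotators, n_classes, n_classes), 1e-6)
        for i in range(n_items):
            for a, v in enumerate(votes[i]):
                if v >= 0:
                    conf[a, :, v] += post[i]
        conf /= conf.sum(axis=2, keepdims=True)

        # E-step: recompute the posterior over true labels.
        log_post = np.tile(np.log(prior), (n_items, 1))
        for i in range(n_items):
            for a, v in enumerate(votes[i]):
                if v >= 0:
                    log_post[i] += np.log(conf[a, :, v])
        log_post -= log_post.max(axis=1, keepdims=True)
        post = np.exp(log_post)
        post /= post.sum(axis=1, keepdims=True)
    return post

# Toy usage: 4 items, 3 annotators, 2 classes; -1 marks a missing vote.
votes = np.array([[0, 0, 1],
                  [1, 1, 1],
                  [0, -1, 0],
                  [1, 0, 1]])
print(dawid_skene(votes, n_classes=2).argmax(axis=1))
```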


2020 ◽  
Vol 34 (04) ◽  
pp. 4667-4674 ◽  
Author(s):  
Shikun Li ◽  
Shiming Ge ◽  
Yingying Hua ◽  
Chunhui Zhang ◽  
Hao Wen ◽  
...  

Typically, learning a deep classifier from a massive set of cleanly annotated instances is effective but impractical in many real-world scenarios. An alternative is collecting and aggregating multiple noisy annotations for each instance to train the classifier. Inspired by this, this paper proposes to learn a deep classifier from multiple noisy annotators via a coupled-view learning approach, where the learning view from data is represented by deep neural networks for data classification and the learning view from labels is described by a Naive Bayes classifier for label aggregation. Such coupled-view learning is converted into a supervised learning problem under the mutual supervision of the aggregated and predicted labels, and can be solved via alternating optimization to update labels and refine the classifiers. To alleviate the propagation of incorrect labels, a small-loss metric is proposed to select reliable instances in both views. A co-teaching strategy with a class-weighted loss is further leveraged in deep classifier learning: two networks with different learning abilities teach each other, so the diverse errors introduced by noisy labels can be filtered out by the peer networks. With these strategies, our approach learns a robust data classifier that is less prone to overfitting label noise. Experimental results on synthetic and real data demonstrate the effectiveness and robustness of the proposed approach.
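To illustrate the small-loss selection idea used in co-teaching (not the paper's full coupled-view algorithm), the following is a minimal PyTorch sketch of one mini-batch update; the function name and the `forget_rate` parameter are hypothetical.

```python
# Minimal sketch of a generic co-teaching update with small-loss selection:
# each network trains only on the instances the *other* network finds easiest,
# which helps filter out the diverse errors introduced by noisy labels.
# This is a generic illustration, not the paper's coupled-view method.
import torch
import torch.nn.functional as F

def co_teaching_step(net_a, net_b, opt_a, opt_b, x, y, forget_rate):
    """One update on a mini-batch (x, y) whose labels y may be noisy."""
    n_keep = max(1, int((1.0 - forget_rate) * len(y)))

    # Per-instance losses for both networks on the same batch.
    loss_a = F.cross_entropy(net_a(x), y, reduction="none")
    loss_b = F.cross_entropy(net_b(x), y, reduction="none")

    # Small-loss selection: indices of the easiest instances for each network.
    idx_for_b = torch.argsort(loss_a)[:n_keep]  # chosen by A, used to train B
    idx_for_a = torch.argsort(loss_b)[:n_keep]  # chosen by B, used to train A

    # Each network updates on the instances selected by its peer.
    opt_a.zero_grad()
    loss_a[idx_for_a].mean().backward()
    opt_a.step()

    opt_b.zero_grad()
    loss_b[idx_for_b].mean().backward()
    opt_b.step()
```

The `forget_rate` controls how many high-loss (likely mislabeled) instances are dropped per batch; in practice it is typically ramped up over the first epochs as the networks become more reliable.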


2019 ◽  
Vol 78 (23) ◽  
pp. 33357-33374
Author(s):  
Jiaye Li ◽  
Hao Yu ◽  
Leyuan Zhang ◽  
Guoqiu Wen