Broad coverage emotion annotation

Author(s):  
Diana Santos ◽  
Alberto Simões ◽  
Cristina Mota

In this paper we present the emotion annotation of publicly available Portuguese corpora totalling 1.5 billion words. We motivate the annotation process and detail the decisions made. The resource is evaluated by applying it to three different areas: studying Lusophone literature, obtaining paraphrases, and comparing genres.
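The abstract does not spell out the tagging mechanism, but emotion annotation at this scale is typically lexicon-driven. A minimal sketch of lexicon-based emotion tagging, assuming a word-to-emotion lexicon (the Portuguese entries and labels below are invented for illustration, not the authors' resource):

```python
# Minimal sketch of lexicon-based emotion tagging; the lexicon entries
# and emotion labels below are invented for illustration.
EMOTION_LEXICON = {
    "alegria": "joy",
    "medo": "fear",
    "tristeza": "sadness",
}

def annotate_emotions(tokens):
    """Pair every token with its emotion label, or None if unlisted."""
    return [(tok, EMOTION_LEXICON.get(tok.lower())) for tok in tokens]

print(annotate_emotions("A alegria venceu o medo".split()))
# [('A', None), ('alegria', 'joy'), ('venceu', None), ('o', None), ('medo', 'fear')]
```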

2020 ◽  
Author(s):  
Mikołaj Morzy ◽  
Bartłomiej Balcerzak ◽  
Adam Wierzbicki

BACKGROUND: With the rapidly accelerating spread of false medical information on the Web, establishing the credibility of online sources of medical information has become a pressing necessity. The sheer number of websites offering questionable medical information, presented as reliable and actionable advice with possibly harmful effects, imposes an additional requirement on potential solutions: they have to scale to the size of the problem. Machine learning is one such solution which, when properly deployed, can be an effective tool in fighting medical disinformation on the Web.

OBJECTIVE: We present a comprehensive framework for designing and curating machine learning training datasets for online medical information credibility assessment. We show how the annotation process should be constructed and which pitfalls should be avoided. Our main objective is to provide researchers from the medical and computer science communities with guidelines on how to construct datasets for machine learning models for various areas of medical information wars.

METHODS: The key component of our approach is an active annotation process. We begin by outlining an annotation protocol for the curation of a high-quality training dataset, which can then be augmented and rapidly extended by employing the human-in-the-loop paradigm in machine learning training. To circumvent the cold start problem of insufficient gold-standard annotations, we propose a pre-processing pipeline consisting of representation learning, clustering, and re-ranking of sentences, which accelerates training and optimizes the human resources involved in annotation.

RESULTS: We collected over 10,000 annotations of sentences related to selected subjects (psychiatry, cholesterol, autism, antibiotics, vaccines, steroids, birth methods, food allergy testing) for less than $7,000, employing 9 highly qualified annotators (certified medical professionals), and we release this dataset to the general public. We developed an active annotation framework for more efficient annotation of non-credible medical statements. The results of the qualitative analysis support our claims of the efficacy of the presented method.

CONCLUSIONS: A very diverse set of incentives drives the widespread dissemination of medical disinformation on the Web. An effective strategy for countering this spread is to use machine learning to automatically establish the credibility of online medical information. This, however, requires a thoughtful design of the training pipeline. In this paper we present a comprehensive framework for active annotation. In addition, we publish a large curated dataset of medical statements labelled as credible, non-credible, or neutral.
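A minimal sketch of the cold-start pipeline described in the METHODS section, under the assumption of TF-IDF sentence representations and k-means clustering (the abstract does not fix these choices): embed the sentences, cluster them, and send each cluster's most central members to the annotators first.

```python
# Cold-start sketch: embed sentences, cluster, and rank each cluster's
# most central members for annotation first. TF-IDF, k-means, and the
# cluster counts are our assumptions, not the paper's exact setup.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

def rank_for_annotation(sentences, n_clusters=5, per_cluster=2):
    vecs = TfidfVectorizer().fit_transform(sentences)   # sparse matrix
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(vecs)
    picks = []
    for c in range(n_clusters):
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(
            vecs[members].toarray() - km.cluster_centers_[c], axis=1)
        picks.extend(members[np.argsort(dists)[:per_cluster]].tolist())
    return picks   # sentence indices to send to annotators first
```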


Author(s):  
Srinivasan Sridhar ◽  
Nazmul Kazi ◽  
Indika Kahanda ◽  
Bernadette McCrory

Background: The demand for psychiatry is increasing each year, yet limited research has explored computational methods for improving psychiatrists' work experience and reducing their daily workload. There is currently no validated tool or procedure for annotating mental health transcripts to generate "gold-standard" data. The purpose of this paper was to characterize the annotation process for mental health transcripts and determine how it can be improved, with attention to human factors, so as to obtain more reliable results.

Method: Three expert clinicians were recruited to evaluate the transcripts; each was asked to fully annotate two of them. An additional five subjects (aged 20-40) were recruited at random for a pilot study of five transcripts, divided into two phases: phase 1 (annotation without training) and phase 2 (annotation with training). Kappa statistics were used to measure inter-rater reliability and accuracy between subjects.

Results: The inter-rater reliability between expert clinicians on the two transcripts was 0.26 (CI 0.19 to 0.33) and 0.49 (CI 0.42 to 0.57), respectively. In the pilot phases, the mean inter-rater reliability between subjects was higher in phase 2, with a training transcript (k = 0.35, CI 0.052 to 0.625), than in phase 1, without one (k = 0.29, CI 0.128 to 0.451). After training, accuracy among subjects improved significantly for transcript A (p = 0.04) but not for transcript B (p = 0.10).

Conclusion: This study focused on understanding the annotation process for mental health transcripts, which will be applied in training machine learning models. Through this exploratory study, we identified the categorical labels that should be included for transcript annotation and established the importance of training the annotators. The recommended data collection process will help psychiatric clinicians and researchers develop more accurate artificial intelligence models for fully or semi-automated transcript annotation.
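For reference, a minimal sketch of the reliability measure the study relies on: Cohen's kappa between two annotators, as implemented in scikit-learn. The category labels are invented, and the confidence intervals reported above would require an additional bootstrap or analytic step not shown here.

```python
# Cohen's kappa between two annotators over the same items; the labels
# below are invented toy data, not the study's annotation categories.
from sklearn.metrics import cohen_kappa_score

rater_a = ["anxiety", "mood", "mood", "sleep", "anxiety", "mood"]
rater_b = ["anxiety", "mood", "sleep", "sleep", "anxiety", "mood"]
print(cohen_kappa_score(rater_a, rater_b))  # chance-corrected agreement
```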


Author(s):  
He Hu ◽  
Xiaoyong Du

Online tagging is crucial for the acquisition and organization of web knowledge. In this paper we present TYG (Tag-as-You-Go), a web browser extension for online tagging of personal knowledge on standard web pages. We investigate an approach that combines a K-medoid-style clustering algorithm with user input to achieve semi-automatic web page annotation. The annotation process supports user-defined tagging schemas and includes an automatic mechanism, built on clustering, that groups similar HTML DOM nodes into clusters corresponding to the user's specification. TYG is a prototype system illustrating the proposed approach. Experiments with TYG show that our approach achieves both efficiency and effectiveness in real-world annotation scenarios.
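A rough sketch of the clustering step the abstract describes, using a simple K-medoid-style loop over toy DOM-node features; tree depth and text length are our simplification of whatever features TYG actually uses.

```python
# K-medoid-style grouping of toy DOM-node features (depth, text length);
# a simplified stand-in for TYG's actual clustering over HTML DOM nodes.
import random

def k_medoids(points, k, dist, iters=20):
    medoids = random.sample(points, k)
    for _ in range(iters):
        clusters = {m: [] for m in medoids}
        for p in points:                     # assign to nearest medoid
            clusters[min(medoids, key=lambda m: dist(p, m))].append(p)
        medoids = [min(c, key=lambda cand: sum(dist(cand, q) for q in c))
                   for c in clusters.values() if c]  # recompute medoids
    return clusters

nodes = [(2, 10), (2, 12), (5, 300), (5, 280), (3, 40)]
manhattan = lambda a, b: abs(a[0] - b[0]) + abs(a[1] - b[1])
print(k_medoids(nodes, 2, manhattan))
```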


2005 ◽  
Vol 31 (1) ◽  
pp. 71-106 ◽  
Author(s):  
Martha Palmer ◽  
Daniel Gildea ◽  
Paul Kingsbury

The Proposition Bank project takes a practical approach to semantic representation, adding a layer of predicate-argument information, or semantic role labels, to the syntactic structures of the Penn Treebank. The resulting resource can be thought of as shallow, in that it does not represent coreference, quantification, and many other higher-order phenomena, but also broad, in that it covers every instance of every verb in the corpus and allows representative statistics to be calculated. We discuss the criteria used to define the sets of semantic roles used in the annotation process and to analyze the frequency of syntactic/semantic alternations in the corpus. We describe an automatic system for semantic role tagging trained on the corpus and discuss the effect on its performance of various types of information, including a comparison of full syntactic parsing with a flat representation and the contribution of the empty “trace” categories of the treebank.
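For readers unfamiliar with the resource, a sketch of what a single PropBank-style proposition looks like as a data structure; the frame shown is a standard illustrative example, not a sentence from the paper.

```python
# One PropBank-style proposition: a verb instance with a roleset id and
# numbered arguments plus a modifier. A textbook-style example, not
# drawn from the Penn Treebank text discussed above.
proposition = {
    "predicate": "give",
    "roleset": "give.01",          # sense: transfer
    "args": {
        "ARG0": "John",            # giver
        "ARG1": "a book",          # thing given
        "ARG2": "Mary",            # recipient
        "ARGM-TMP": "yesterday",   # temporal modifier
    },
}
```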


2021 ◽  
pp. 016555152110065
Author(s):  
Rahma Alahmary ◽  
Hmood Al-Dossari

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior work has used machine learning algorithms. Recently, SA research has shown promising performance using deep learning approaches. However, deep learning is data-hungry and requires large datasets for training, which makes data annotation time-consuming. In this research, we propose a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset, reducing the human effort and time spent on the annotation process. We created a dataset of Saudi dialect tweets for training and testing the classifier. The dataset produced by the semiautomatic model was then used to train and test deep learning classifiers for Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested were a convolutional neural network (CNN), long short-term memory (LSTM), and bidirectional long short-term memory (Bi-LSTM), with a support vector machine (SVM) as the baseline. Overall, the deep learning classifiers outperformed the SVM, with the CNN achieving the highest performance, followed by Bi-LSTM and then LSTM. The proposed semiautomatic annotation approach is usable and promising for increasing speed and saving time and effort in the annotation process.
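A minimal sketch of the semiautomatic annotation idea: fit an NB classifier on a small hand-labelled seed set, then auto-label new text, keeping only confident predictions. The feature extraction, confidence threshold, and toy data are our assumptions, not the paper's settings.

```python
# Semiautomatic annotation sketch: fit NB on a labelled seed set, then
# keep only confident automatic labels. Seed data, features, and the
# 0.8 threshold are our assumptions, not the paper's settings.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

seed_texts = ["great service", "awful delay", "love it", "terrible food"]
seed_labels = ["pos", "neg", "pos", "neg"]
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(seed_texts, seed_labels)

def auto_annotate(texts, threshold=0.8):
    probs = model.predict_proba(texts)
    labels = model.classes_[probs.argmax(axis=1)]
    return [(t, lab) for t, lab, p in zip(texts, labels, probs.max(axis=1))
            if p >= threshold]     # uncertain items go back to humans

print(auto_annotate(["awful terrible delay", "love great food"]))
```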


2012 ◽  
Author(s):  
Felipe Rodrigues ◽  
Richard Semolini ◽  
Norton Trevisan Roman ◽  
Ana Maria Monteiro

This paper describes TSeg, a Java application that allows for both manual and automatic segmentation of a source text into basic units of annotation. TSeg provides a straightforward way to approach this task through a clear point-and-click interface. Once text segmentation is finished, the application outputs an XML file that may be used as input to more problem-specific annotation software. TSeg thus separates the identification of basic units of annotation from the task of annotating those units, making it possible for both problems to be addressed in isolation, thereby reducing the cognitive load on the user and preventing potential damage to the overall outcome of the annotation process.
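A sketch of the kind of XML output described above; the element and attribute names are our guesses, since the abstract does not specify TSeg's actual schema.

```python
# Possible shape of a segmenter's XML output: one <segment> element per
# basic unit, with character offsets. Element and attribute names are
# guesses, not TSeg's documented schema.
import xml.etree.ElementTree as ET

def segments_to_xml(text, spans):
    root = ET.Element("segments")
    for i, (start, end) in enumerate(spans):
        seg = ET.SubElement(root, "segment",
                            id=str(i), start=str(start), end=str(end))
        seg.text = text[start:end]
    return ET.tostring(root, encoding="unicode")

print(segments_to_xml("Hello world. Bye.", [(0, 12), (13, 17)]))
# <segments><segment id="0" start="0" end="12">Hello world.</segment>...
```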


Data ◽  
2020 ◽  
Vol 5 (3) ◽  
pp. 60
Author(s):  
Nasser Alshammari ◽  
Saad Alanazi

This article outlines a novel data descriptor that provides the Arabic natural language processing community with a dataset dedicated to named entity recognition of diseases. The dataset comprises more than 60,000 words, annotated manually by two independent annotators using the inside–outside (IO) annotation scheme. To ensure the reliability of the annotation process, the inter-annotator agreement rate was calculated and reached 95.14%. Given the lack of research dedicated to studying Arabic multi-annotation schemes, a distinguishing and novel aspect of this dataset is the inclusion of six additional annotation schemes, which bridge this gap by allowing researchers to explore and compare the effects of these schemes on the performance of Arabic named entity recognizers. These annotation schemes are IOE, IOB, BIES, IOBES, IE, and BI. Additionally, five linguistic features, including part-of-speech tags, stopwords, gazetteers, lexical markers, and the presence of the definite article, are provided for each record in the dataset.
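As an illustration of how one scheme is derived from another, a minimal sketch converting IO tags to IOB: a token that starts a new entity (the first entity token, or one following a different tag) receives a B- prefix. The DIS tag is an illustrative placeholder, not necessarily the dataset's label.

```python
# IO -> IOB conversion: the first token of an entity (or of a new entity
# of a different type) gets B- instead of I-. "DIS" is a placeholder tag.
def io_to_iob(tags):
    iob, prev = [], "O"
    for tag in tags:
        if tag.startswith("I-") and prev != tag:
            iob.append("B-" + tag[2:])   # entity starts here
        else:
            iob.append(tag)
        prev = tag
    return iob

print(io_to_iob(["I-DIS", "I-DIS", "O", "I-DIS"]))
# ['B-DIS', 'I-DIS', 'O', 'B-DIS']
```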


Author(s):  
Vasiliki Simaki ◽  
Carita Paradis ◽  
Maria Skeppstedt ◽  
Magnus Sahlgren ◽  
Kostiantyn Kucher ◽  
...  

The aim of this study is to explore the possibility of identifying speaker stance in discourse, to provide an analytical resource for it, and to evaluate the level of agreement across annotators. We also explore to what extent language users agree about which stances are expressed in natural language use, or whether their interpretations diverge. To perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed, based on previous work on speaker stance in the literature. A corpus of opinionated texts, the Brexit Blog Corpus (BBC), was compiled. An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations was set up, and the data were independently annotated by two annotators. The annotation procedure, the annotation agreements, and the co-occurrence of more than one stance in the utterances are described and discussed. The careful, analytical annotation process returned satisfactory inter- and intra-annotation agreement scores, resulting in a gold-standard corpus, the final version of the BBC.
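A minimal sketch of how the co-occurrence of stances in utterances might be counted, assuming each utterance carries a set of stance labels; the label names below are placeholders, not necessarily the study's ten categories.

```python
# Counting stance co-occurrence: how often do two stance labels appear
# on the same utterance? The label sets below are invented placeholders.
from collections import Counter
from itertools import combinations

utterance_labels = [
    {"prediction", "uncertainty"},
    {"agreement"},
    {"prediction", "volition", "uncertainty"},
]
pair_counts = Counter(pair for labels in utterance_labels
                      for pair in combinations(sorted(labels), 2))
print(pair_counts.most_common())
# [(('prediction', 'uncertainty'), 2), ...]
```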

