Evaluation of two-view geometry methods with automatic ground-truth generation

Many scientific studies deal with person counting and density estimation from single images. Recently, convolutional neural networks (CNNs) have been applied for these tasks. Even though often better results are reported, it is often not clear where the improvements are resulting from, and if the proposed approaches would generalize. Thus, the main goal of this paper was to identify the critical aspects of these tasks and to show how these limit state-of-the-art approaches. Based on these findings, we show how to mitigate these limitations. To this end, we implemented a CNN-based baseline approach, which we extended to deal with identified problems. These include the discovery of bias in the reference data sets, ambiguity in ground truth generation, and mismatching of evaluation metrics w.r.t. the training loss function. The experimental results show that our modifications allow for significantly outperforming the baseline in terms of the accuracy of person counts and density estimation. In this way, we get a deeper understanding of CNN-based person density estimation beyond the network architecture. Furthermore, our insights would allow to advance the field of person density estimation in general by highlighting current limitations in the evaluation protocols.

Download Full-text

Analysing political events on Twitter

ACM SIGIR Forum ◽

10.1145/3458537.3458542 ◽

2019 ◽

Vol 53 (1) ◽

pp. 38-39

Author(s):

Anjie Fang

Keyword(s):

Topic Model ◽

Ground Truth ◽

Topic Modelling ◽

Time Dimension ◽

Social Scientists ◽

Political Events ◽

Community Classification ◽

Ground Truth Generation ◽

Twitter Users ◽

Modelling Approach

Recently, political events, such as elections, have raised a lot of discussions on social media networks, in particular, Twitter. This brings new opportunities for social scientists to address social science tasks, such as understanding what communities said or identifying whether a community has an influence on another. However, identifying these communities and extracting what they said from social media data are challenging and non-trivial tasks. We aim to make progress towards understanding 'who' (i.e. communities) said 'what' (i.e. discussed topics) and 'when' (i.e. time) during political events on Twitter. While identifying the 'who' can benefit from Twitter user community classification approaches, 'what' they said and 'when' can be effectively addressed on Twitter by extracting their discussed topics using topic modelling approaches that also account for the importance of time on Twitter. To evaluate the quality of these topics, it is necessary to investigate how coherent these topics are to humans. Accordingly, we propose a series of approaches in this thesis. First, we investigate how to effectively evaluate the coherence of the topics generated using a topic modelling approach. The topic coherence metric evaluates the topical coherence by examining the semantic similarity among words in a topic. We argue that the semantic similarity of words in tweets can be effectively captured by using word embeddings trained using a Twitter background dataset. Through a user study, we demonstrate that our proposed word embedding-based topic coherence metric can assess the coherence of topics like humans [1, 2]. In addition, inspired by the precision at k metric, we propose to evaluate the coherence of a topic model (containing many topics) by averaging the top-ranked topics within the topic model [3]. Our proposed metrics can not only evaluate the coherence of topics and topic models, but also can help users to choose the most coherent topics. Second, we aim to extract topics with a high coherence from Twitter data. Such topics can be easily interpreted by humans and they can assist to examine 'what' has been discussed and 'when'. Indeed, we argue that topics can be discussed in different time periods (see [4]) and therefore can be effectively identified and distinguished by considering their time periods. Hence, we propose an effective time-sensitive topic modelling approach by integrating the time dimension of tweets (i.e. 'when') [5]. We show that the time dimension helps to generate topics with a high coherence. Hence, we argue that 'what' has been discussed and 'when' can be effectively addressed by our proposed time-sensitive topic modelling approach. Next, to identify 'who' participated in the topic discussions, we propose approaches to identify the community affiliations of Twitter users, including automatic ground-truth generation approaches and a user community classification approach. We show that the mentioned hashtags and entities in the users' tweets can indicate which community a Twitter user belongs to. Hence, we argue that they can be used to generate the ground-truth data for classifying users into communities. On the other hand, we argue that different communities favour different topic discussions and their community affiliations can be identified by leveraging the discussed topics. Accordingly, we propose a Topic-Based Naive Bayes (TBNB) classification approach to classify Twitter users based on their words and discussed topics [6]. We demonstrate that our TBNB classifier together with the ground-truth generation approaches can effectively identify the community affiliations of Twitter users. Finally, to show the generalisation of our approaches, we apply our approaches to analyse 3.6 million tweets related to US Election 2016 on Twitter [7]. We show that our TBNB approach can effectively identify the 'who', i.e. classify Twitter users into communities. To investigate 'what' these communities have discussed, we apply our time-sensitive topic modelling approach to extract coherent topics. We finally analyse the community-related topics evaluated and selected using our proposed topic coherence metrics. Overall, we contribute to provide effective approaches to assist social scientists towards analysing political events on Twitter. These approaches include topic coherence metrics, a time-sensitive topic modelling approach and approaches for classifying the community affiliations of Twitter users. Together they make progress to study and understand the connections and dynamics among communities on Twitter. Supervisors : Iadh Ounis, Craig Macdonald, Philip Habel The thesis is available at http://theses.gla.ac.uk/41135/

Download Full-text

Density based semi-automatic labeling on multi-feature representations for ground truth generation: Application to handwritten character recognition

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.106953 ◽

2021 ◽

Vol 220 ◽

pp. 106953

Author(s):

Papangkorn Inkeaw ◽

Piyachat Udomwong ◽

Jeerayut Chaijaruwanich

Keyword(s):

Character Recognition ◽

Ground Truth ◽

Handwritten Character Recognition ◽

Feature Representations ◽

Handwritten Character ◽

Ground Truth Generation

Download Full-text

Enhanced Algorithm of Automated Ground Truth Generation and Validation for Lane Detection System by $\text{M}^{2}\text{BMT}$

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2016.2594055 ◽

2017 ◽

Vol 18 (4) ◽

pp. 996-1005 ◽

Cited By ~ 3

Author(s):

Apurba Das ◽

N Siva Srinivasa Murthy ◽

Upendra Suddamalla

Keyword(s):

Detection System ◽

Ground Truth ◽

Lane Detection ◽

Ground Truth Generation

Download Full-text

BYANJON: A Ground Truth Preparation System for Online Handwritten Bangla Documents

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3464379 ◽

2021 ◽

Vol 20 (6) ◽

pp. 1-16

Author(s):

Shibaprasad Sen ◽

Ankan Bhattacharyya ◽

Ram Sarkar ◽

Kaushik Roy

Keyword(s):

Extraction Procedure ◽

Ground Truth ◽

Word Segmentation ◽

Text Line ◽

Line Extraction ◽

Manual Intervention ◽

Ground Truth Generation ◽

Class Labels ◽

Text Line Extraction ◽

Ground Truth Information

The work reported in this article deals with the ground truth generation scheme for online handwritten Bangla documents at text-line, word, and stroke levels. The aim of the proposed scheme is twofold: firstly, to build a document level database so that future researchers can use the database to do research in this field. Secondly, the ground truth information will help other researchers to evaluate the performance of their algorithms developed for text-line extraction, word extraction, word segmentation, stroke recognition, and word recognition. The reported ground truth generation scheme starts with text-line extraction from the online handwritten Bangla documents, then words extraction from the text-lines, and finally segmentation of those words into basic strokes. After word segmentation, the basic strokes are assigned appropriate class labels by using modified distance-based feature extraction procedure and the MLP ( Multi-layer Perceptron ) classifier. The Unicode for the words are then generated from the sequence of stroke labels. XML files are used to store the stroke, word, and text-line levels ground truth information for the corresponding documents. The proposed system is semi-automatic and each step such as text-line extraction, word extraction, word segmentation, and stroke recognition has been implemented by using different algorithms. Thus, the proposed ground truth generation procedure minimizes huge manual intervention by reducing the number of mouse clicks required to extract text-lines, words from the document, and segment the words into basic strokes. The integrated stroke recognition module also helps to minimize the manual labor needed to assign appropriate stroke labels. The freely available and can be accessed at https://byanjon.herokuapp.com/ .

Download Full-text

Is Crowdsourcing for Optical Flow Ground Truth Generation Feasible?

Lecture Notes in Computer Science - Computer Vision Systems ◽

10.1007/978-3-642-39402-7_20 ◽

2013 ◽

pp. 193-202 ◽

Cited By ~ 6

Author(s):

Axel Donath ◽

Daniel Kondermann

Keyword(s):

Optical Flow ◽

Ground Truth ◽

Ground Truth Generation

Download Full-text

Exploit 18F-FDG enhanced urinary bladder in PET data for deep learning ground truth generation in CT scans

Medical Imaging 2018: Biomedical Applications in Molecular, Structural, and Functional Imaging ◽

10.1117/12.2292706 ◽

2018 ◽

Cited By ~ 1

Author(s):

Christina Gsaxner ◽

Birgit Pfarrkirchner ◽

Lydia Lindner ◽

Jürgen Wallner ◽

Jan Egger ◽

...

Keyword(s):

Deep Learning ◽

Urinary Bladder ◽

Ground Truth ◽

Ct Scans ◽

Ground Truth Generation ◽

18F Fdg

Download Full-text

Phantom-based ground-truth generation for cerebral vessel segmentation and pulsatile deformation analysis

10.1117/12.2216675 ◽

2016 ◽

Cited By ~ 1

Author(s):

Daniel Schetelig ◽

Dennis Säring ◽

Till Illies ◽

Jan Sedlacik ◽

Fabian Kording ◽

...

Keyword(s):

Ground Truth ◽

Vessel Segmentation ◽

Deformation Analysis ◽

Cerebral Vessel ◽

Ground Truth Generation

Download Full-text

Ground Truth generation from automatic recognition of ground Control Points

2016 24th Signal Processing and Communication Application Conference (SIU) ◽

10.1109/siu.2016.7495837 ◽

2016 ◽

Author(s):

Onur Aydin ◽

Kaan Kalkan ◽

Levent Ozparlak

Keyword(s):

Ground Truth ◽

Automatic Recognition ◽

Ground Control ◽

Control Points ◽

Ground Control Points ◽

Ground Truth Generation

Download Full-text

Deep-Learning-Based Cerebral Artery Semantic Segmentation in Neurosurgical Operating Microscope Vision Using Indocyanine Green Fluorescence Videoangiography

Frontiers in Neurorobotics ◽

10.3389/fnbot.2021.735177 ◽

2022 ◽

Vol 15 ◽

Author(s):

Min-seok Kim ◽

Joon Hyuk Cha ◽

Seonhwa Lee ◽

Lihong Han ◽

Wonhyoung Park ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Indocyanine Green ◽

Blood Vessels ◽

Cerebral Artery ◽

Ground Truth ◽

Neural Network Models ◽

Indocyanine Green Fluorescence ◽

Surgical Assistance ◽

Ground Truth Generation

There have been few anatomical structure segmentation studies using deep learning. Numbers of training and ground truth images applied were small and the accuracies of which were low or inconsistent. For a surgical video anatomy analysis, various obstacles, including a variable fast-changing view, large deformations, occlusions, low illumination, and inadequate focus occur. In addition, it is difficult and costly to obtain a large and accurate dataset on operational video anatomical structures, including arteries. In this study, we investigated cerebral artery segmentation using an automatic ground-truth generation method. Indocyanine green (ICG) fluorescence intraoperative cerebral videoangiography was used to create a ground-truth dataset mainly for cerebral arteries and partly for cerebral blood vessels, including veins. Four different neural network models were trained using the dataset and compared. Before augmentation, 35,975 training images and 11,266 validation images were used. After augmentation, 260,499 training and 90,129 validation images were used. A Dice score of 79% for cerebral artery segmentation was achieved using the DeepLabv3+ model trained using an automatically generated dataset. Strict validation in different patient groups was conducted. Arteries were also discerned from the veins using the ICG videoangiography phase. We achieved fair accuracy, which demonstrated the appropriateness of the methodology. This study proved the feasibility of operating field view of the cerebral artery segmentation using deep learning, and the effectiveness of the automatic blood vessel ground truth generation method using ICG fluorescence videoangiography. Using this method, computer vision can discern blood vessels and arteries from veins in a neurosurgical microscope field of view. Thus, this technique is essential for neurosurgical field vessel anatomy-based navigation. In addition, surgical assistance, safety, and autonomous surgery neurorobotics that can detect or manipulate cerebral vessels would require computer vision to identify blood vessels and arteries.

Download Full-text