ground truth generation Latest Research Papers

Deep-Learning-Based Cerebral Artery Semantic Segmentation in Neurosurgical Operating Microscope Vision Using Indocyanine Green Fluorescence Videoangiography

Frontiers in Neurorobotics ◽

10.3389/fnbot.2021.735177 ◽

2022 ◽

Vol 15 ◽

Author(s):

Min-seok Kim ◽

Joon Hyuk Cha ◽

Seonhwa Lee ◽

Lihong Han ◽

Wonhyoung Park ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Indocyanine Green ◽

Blood Vessels ◽

Cerebral Artery ◽

Ground Truth ◽

Neural Network Models ◽

Indocyanine Green Fluorescence ◽

Surgical Assistance ◽

Ground Truth Generation

There have been few anatomical structure segmentation studies using deep learning. In those studies, the numbers of training and ground-truth images were small, and the accuracies were low or inconsistent. Surgical video anatomy analysis faces various obstacles, including a variable, fast-changing view, large deformations, occlusions, low illumination, and inadequate focus. In addition, it is difficult and costly to obtain a large and accurate dataset of anatomical structures, including arteries, in operative video. In this study, we investigated cerebral artery segmentation using an automatic ground-truth generation method. Indocyanine green (ICG) fluorescence intraoperative cerebral videoangiography was used to create a ground-truth dataset mainly for cerebral arteries and partly for cerebral blood vessels, including veins. Four different neural network models were trained using the dataset and compared. Before augmentation, 35,975 training images and 11,266 validation images were used. After augmentation, 260,499 training and 90,129 validation images were used. A Dice score of 79% for cerebral artery segmentation was achieved using the DeepLabv3+ model trained on the automatically generated dataset. Strict validation in different patient groups was conducted. Arteries were also discerned from veins using the ICG videoangiography phase. We achieved fair accuracy, which demonstrated the appropriateness of the methodology. This study proved the feasibility of cerebral artery segmentation in the operating field view using deep learning, and the effectiveness of the automatic blood vessel ground-truth generation method using ICG fluorescence videoangiography. Using this method, computer vision can discern blood vessels, and arteries from veins, in a neurosurgical microscope field of view. Thus, this technique is essential for neurosurgical field vessel anatomy-based navigation. In addition, surgical assistance, safety, and autonomous surgery neurorobotics that can detect or manipulate cerebral vessels would require computer vision to identify blood vessels and arteries.
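The abstract reports its result as a Dice score. As a minimal sketch of how that overlap measure is commonly computed for a binary segmentation mask, the example below uses two tiny hand-made masks; the function name `dice_score`, the mask values, and the convention for two empty masks are assumptions for illustration and are not taken from the paper.

```python
def dice_score(pred, truth):
    """Dice = 2 * |P intersect T| / (|P| + |T|) for binary masks given as nested lists of 0/1."""
    inter = 0
    pred_sum = 0
    truth_sum = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            inter += 1 if (p and t) else 0
            pred_sum += 1 if p else 0
            truth_sum += 1 if t else 0
    if pred_sum + truth_sum == 0:
        # Both masks empty: treated as perfect agreement (a convention, assumed here).
        return 1.0
    return 2.0 * inter / (pred_sum + truth_sum)


# Hypothetical 3x4 masks: 1 marks an artery pixel, 0 marks background.
truth = [[0, 1, 1, 0],
         [0, 1, 1, 0],
         [0, 0, 1, 0]]
pred = [[0, 1, 1, 0],
        [0, 0, 1, 1],
        [0, 0, 1, 0]]

print(dice_score(pred, truth))  # 4 shared pixels, 5 + 5 labelled pixels -> 0.8
```

In practice the score is usually averaged over many frames or patients; how the study aggregated it is not stated in the abstract.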

BYANJON: A Ground Truth Preparation System for Online Handwritten Bangla Documents

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3464379 ◽

2021 ◽

Vol 20 (6) ◽

pp. 1-16

Author(s):

Shibaprasad Sen ◽

Ankan Bhattacharyya ◽

Ram Sarkar ◽

Kaushik Roy

Keyword(s):

Extraction Procedure ◽

Ground Truth ◽

Word Segmentation ◽

Text Line ◽

Line Extraction ◽

Manual Intervention ◽

Ground Truth Generation ◽

Class Labels ◽

Text Line Extraction ◽

Ground Truth Information

The work reported in this article deals with a ground truth generation scheme for online handwritten Bangla documents at the text-line, word, and stroke levels. The aim of the proposed scheme is twofold: first, to build a document-level database that future researchers can use for research in this field; second, to provide ground truth information that will help other researchers evaluate the performance of their algorithms for text-line extraction, word extraction, word segmentation, stroke recognition, and word recognition. The reported ground truth generation scheme starts with text-line extraction from the online handwritten Bangla documents, followed by word extraction from the text-lines, and finally segmentation of those words into basic strokes. After word segmentation, the basic strokes are assigned appropriate class labels using a modified distance-based feature extraction procedure and an MLP (Multi-layer Perceptron) classifier. The Unicode for each word is then generated from the sequence of stroke labels. XML files are used to store the stroke-, word-, and text-line-level ground truth information for the corresponding documents. The proposed system is semi-automatic, and each step (text-line extraction, word extraction, word segmentation, and stroke recognition) is implemented using a different algorithm. Thus, the proposed ground truth generation procedure greatly reduces manual intervention by reducing the number of mouse clicks required to extract text-lines and words from the document and to segment the words into basic strokes. The integrated stroke recognition module also helps to minimize the manual labor needed to assign appropriate stroke labels. The system is freely available and can be accessed at https://byanjon.herokuapp.com/ .
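The abstract says that ground truth is stored in XML at the text-line, word, and stroke levels, and that each word's Unicode is derived from its stroke-label sequence. The sketch below shows one way such a nested record could be assembled with Python's standard `xml.etree.ElementTree`. The element and attribute names, the stroke labels (`S_KA`, `S_AA`, `S_LA`), and the one-label-to-one-character lookup table are all hypothetical; the actual BYANJON schema and label set are not given in the abstract.

```python
import xml.etree.ElementTree as ET

# Hypothetical stroke-label -> Unicode table (illustration only, not the system's label set).
STROKE_TO_CHAR = {"S_KA": "\u0995", "S_AA": "\u09BE", "S_LA": "\u09B2"}


def build_ground_truth(doc_id, lines):
    """Build a document > textline > word > stroke tree.

    `lines` is a list of text-lines; each text-line is a list of words;
    each word is a list of stroke labels.
    """
    root = ET.Element("document", id=doc_id)
    for line_idx, words in enumerate(lines):
        line_el = ET.SubElement(root, "textline", index=str(line_idx))
        for word_idx, strokes in enumerate(words):
            # Word-level Unicode is generated from the sequence of stroke labels.
            text = "".join(STROKE_TO_CHAR[label] for label in strokes)
            word_el = ET.SubElement(line_el, "word", index=str(word_idx), unicode=text)
            for stroke_idx, label in enumerate(strokes):
                ET.SubElement(word_el, "stroke", index=str(stroke_idx), label=label)
    return root


# One text-line containing one word made of three strokes.
root = build_ground_truth("doc-001", [[["S_KA", "S_AA", "S_LA"]]])
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

Keeping the three levels nested in one file means an evaluation script can read text-line, word, or stroke ground truth from the same document without a separate index.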

Automated Ground Truth Generation for Learning-Based Crack Detection on Concrete Surfaces

Applied Sciences ◽

10.3390/app112210966 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10966

Author(s):

Hsiang-Chieh Chen ◽

Zheng-Ting Li

Keyword(s):

Deep Learning ◽

Crack Detection ◽

Ground Truth ◽

Experimental Results ◽

Training Data ◽

Generation Process ◽

Detection Model ◽

Ground Truth Generation ◽

Labeling Approach ◽

The Cost

This article introduces an automated data-labeling approach for generating crack ground truths (GTs) within concrete images. The main algorithm includes generating first-round GTs, pre-training a deep learning-based model, and generating second-round GTs. On the basis of the generated second-round GTs of the training data, a learning-based crack detection model can be trained in a self-supervised manner. The pre-trained deep learning-based model is effective for crack detection after it is re-trained using the second-round GTs. The main contribution of this study is the proposal of an automated GT generation process for training a crack detection model at the pixel level. Experimental results show that the second-round GTs are similar to manually marked labels. Accordingly, the cost of implementing learning-based methods is reduced significantly because data labeling by humans is not required.

Density based semi-automatic labeling on multi-feature representations for ground truth generation: Application to handwritten character recognition

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.106953 ◽

2021 ◽

Vol 220 ◽

pp. 106953

Author(s):

Papangkorn Inkeaw ◽

Piyachat Udomwong ◽

Jeerayut Chaijaruwanich

Keyword(s):

Character Recognition ◽

Ground Truth ◽

Handwritten Character Recognition ◽

Feature Representations ◽

Handwritten Character ◽

Ground Truth Generation

Evaluation of Unsupervised Entity and Event Salience Estimation

The International FLAIRS Conference Proceedings ◽

10.32473/flairs.v34i1.128482 ◽

2021 ◽

Vol 34 (1) ◽

Author(s):

Jiaying Lu ◽

Jinho D Choi

Keyword(s):

Evaluation Process ◽

Ground Truth ◽

Generation Process ◽

The Novel ◽

Evaluation Protocol ◽

Ground Truth Generation ◽

Dependency Parser ◽

Syntactic Dependency ◽

Event Definition

Salience Estimation aims to predict term importance in documents. Because few human-annotated datasets exist and the notion of salience is subjective, previous studies typically generate pseudo-ground truth for evaluation. However, our investigation reveals that the evaluation protocol proposed by prior work is difficult to replicate, which has led to few follow-up studies. Moreover, the evaluation process is problematic: the entity linking tool used for entity matching is very noisy, while ignoring event arguments in event evaluation leads to inflated performance. In this work, we propose a light yet practical entity and event salience estimation evaluation protocol, which incorporates a more reliable syntactic dependency parser. Furthermore, we conduct a comprehensive analysis of popular entity and event definition standards, and present our own definition for the Salience Estimation task to reduce noise during the pseudo-ground truth generation process. In addition, we construct dependency-based heterogeneous graphs to capture the interactions of entities and events. The empirical results show that both baseline methods and the novel GNN method utilizing the heterogeneous graph consistently outperform the previous SOTA model in all proposed metrics.

Critical Aspects of Person Counting and Density Estimation

Journal of Imaging ◽

10.3390/jimaging7020021 ◽

2021 ◽

Vol 7 (2) ◽

pp. 21

Author(s):

Roland Perko ◽

Manfred Klopschitz ◽

Alexander Almer ◽

Peter M. Roth

Keyword(s):

Density Estimation ◽

Network Architecture ◽

Reference Data ◽

State Of The Art ◽

Limit State ◽

Ground Truth ◽

Data Sets ◽

Ground Truth Generation ◽

Baseline Approach ◽

Critical Aspects

Many scientific studies deal with person counting and density estimation from single images. Recently, convolutional neural networks (CNNs) have been applied to these tasks. Even though better results are often reported, it is often unclear where the improvements come from and whether the proposed approaches would generalize. Thus, the main goal of this paper was to identify the critical aspects of these tasks and to show how they limit state-of-the-art approaches. Based on these findings, we show how to mitigate these limitations. To this end, we implemented a CNN-based baseline approach, which we extended to deal with the identified problems. These include bias in the reference data sets, ambiguity in ground truth generation, and a mismatch between the evaluation metrics and the training loss function. The experimental results show that our modifications significantly outperform the baseline in terms of the accuracy of person counts and density estimation. In this way, we gain a deeper understanding of CNN-based person density estimation beyond the network architecture. Furthermore, our insights can help advance the field of person density estimation in general by highlighting current limitations in the evaluation protocols.
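One of the issues named in the abstract is a mismatch between evaluation metrics and the training loss. A toy illustration, assuming a pixel-wise mean squared error as the training loss and an absolute count error as the evaluation metric (the abstract names neither explicitly), shows how the two can rank predictions in opposite order. The 3x3 maps and function names below are invented for the example.

```python
def pixel_mse(pred, truth):
    """Mean squared error over all pixels of two equally sized density maps."""
    n_pixels = sum(len(row) for row in truth)
    squared = sum((p - t) ** 2
                  for pred_row, truth_row in zip(pred, truth)
                  for p, t in zip(pred_row, truth_row))
    return squared / n_pixels


def count_error(pred, truth):
    """Absolute difference between the person counts (sums) of two density maps."""
    return abs(sum(map(sum, pred)) - sum(map(sum, truth)))


# Ground truth: two people, each represented by one unit of density.
truth = [[0.0, 1.0, 0.0],
         [0.0, 0.0, 0.0],
         [0.0, 1.0, 0.0]]

# Prediction A: correct count, but both people placed one pixel off.
shifted = [[1.0, 0.0, 0.0],
           [0.0, 0.0, 0.0],
           [0.0, 0.0, 1.0]]

# Prediction B: predicts nobody at all.
empty = [[0.0, 0.0, 0.0] for _ in range(3)]

print(pixel_mse(shifted, truth), count_error(shifted, truth))  # higher MSE, zero count error
print(pixel_mse(empty, truth), count_error(empty, truth))      # lower MSE, count off by two
```

Here the empty prediction has the lower pixel-wise loss yet the worse count, so a model optimized only for the pixel loss is not necessarily optimized for the reported counting metric.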

Cooperative Raw Sensor Data Fusion for Ground Truth Generation in Autonomous Driving

2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) ◽

10.1109/itsc45102.2020.9294477 ◽

2020 ◽

Author(s):

Egon Ye ◽

Philip Spiegel ◽

Matthias Althoff

Keyword(s):

Data Fusion ◽

Ground Truth ◽

Autonomous Driving ◽

Sensor Data ◽

Sensor Data Fusion ◽

Ground Truth Generation

GROUND TRUTH GENERATION AND DISPARITY ESTIMATION FOR OPTICAL SATELLITE IMAGERY

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b2-2020-127-2020 ◽

2020 ◽

Vol XLIII-B2-2020 ◽

pp. 127-134

Author(s):

M. Cournet ◽

E. Sarrazin ◽

L. Dumas ◽

J. Michel ◽

J. Guinet ◽

...

Keyword(s):

Deep Learning ◽

Satellite Imagery ◽

Learning Difficulties ◽

Ground Truth ◽

Disparity Estimation ◽

Learning Network ◽

Ground Truth Generation ◽

Global Matching ◽

Disparity Maps ◽

Deep Learning Network

Abstract. Several 3D reconstruction pipelines are being developed around the world for satellite imagery. Most of them implement their own versions of Semi-Global Matching (SGM) as an option for the matching step. However, deep learning-based solutions already outperform every SGM-derived algorithm on the KITTI and Middlebury stereo datasets. These deep learning-based solutions, however, need huge quantities of ground truth for training. This implies that the generation of ground truth stereo datasets from satellite imagery and lidar is of great interest to the scientific community. Such datasets aim to reduce the potential transfer learning difficulties that could arise from training on datasets such as Middlebury or KITTI. In this work, we present a new ground truth generation pipeline. It produces stereo-rectified images and ground truth disparity maps from satellite imagery and lidar. We also assess the rectification and disparity accuracies of these outputs. Finally, we train a deep learning network on our preliminary ground truth dataset.

Semi-Automatic Cloud-Native Video Annotation for Autonomous Driving

Applied Sciences ◽

10.3390/app10124301 ◽

2020 ◽

Vol 10 (12) ◽

pp. 4301

Author(s):

Sergio Sánchez-Carballido ◽

Orti Senderos ◽

Marcos Nieto ◽

Oihana Otaegui

Keyword(s):

High Performance ◽

Ground Truth ◽

Autonomous Driving ◽

Video Annotation ◽

Driver Assistance ◽

Driver Assistance Systems ◽

Innovative Solution ◽

Ground Truth Generation ◽

Reliable Design ◽

High Performance Computing Cluster

An innovative solution named Annotation as a Service (AaaS) has been specifically designed to integrate heterogeneous video annotation workflows into containers and take advantage of a cloud-native, highly scalable, and reliable design based on Kubernetes workloads. Using the AaaS as a foundation, the execution of automatic video annotation workflows is addressed in the broader context of a semi-automatic video annotation business logic for ground truth generation for Autonomous Driving (AD) and Advanced Driver Assistance Systems (ADAS). The document presents design decisions, innovative developments, and tests conducted to provide scalability to this cloud-native ecosystem for semi-automatic annotation. The solution has proven to be efficient and resilient on an AD/ADAS scale, specifically in an experiment with 25 TB of input data to annotate, 4000 concurrent annotation jobs, and 32 worker nodes forming a high performance computing cluster with a total of 512 cores and 2048 GB of RAM. Automatic pre-annotations with the proposed strategy reduce the time of human participation in the annotation by up to 80%, and by 60% on average.

Multi-Stream Networks and Ground Truth Generation for Crowd Counting

International journal of electrical and computer engineering systems ◽

10.32985/ijeces.11.1.4 ◽

2020 ◽

Vol 11 (1) ◽

pp. 33-41

Author(s):

Rodolfo Quispe ◽

Darwin Ttito ◽

Adín Rivera ◽

Helio Pedrini

Keyword(s):

Neural Network ◽

Network Architecture ◽

Receptive Fields ◽

Ground Truth ◽

Scene Analysis ◽

Stream Networks ◽

Single Image ◽

Crowd Counting ◽

Ground Truth Generation ◽

Density Map

Crowd scene analysis has received a lot of attention recently due to a wide variety of applications, e.g., forensic science, urban planning, surveillance and security. In this context, a challenging task is crowd counting [1–6], whose main purpose is to estimate the number of people present in a single image. A multi-stream convolutional neural network is developed and evaluated in this paper, which receives an image as input and produces a density map that represents the spatial distribution of people in an end-to-end fashion. In order to address complex crowd counting issues, such as extremely unconstrained scale and perspective changes, the network architecture utilizes receptive fields with filters of different sizes for each stream. In addition, we investigate the influence of the two most common approaches to ground truth generation and propose a hybrid method based on tiny face detection and scale interpolation. Experiments conducted on two challenging datasets, UCF-CC-50 and ShanghaiTech, demonstrate that the use of our ground truth generation methods achieves superior results.
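The abstract refers to density maps as the ground truth for crowd counting without describing how they are built. A widely used construction places a normalized Gaussian at each annotated head position so that the map sums to the number of people. The sketch below implements that fixed-width variant in plain Python; whether it corresponds to either of the approaches compared in the paper is an assumption, and the head coordinates, image size, and `sigma` value are invented for illustration.

```python
import math


def density_map(height, width, heads, sigma=1.5):
    """Return a height x width density map with one unit of mass per head.

    `heads` is a list of (row, col) annotations. Each Gaussian is normalized
    over the image, so the whole map sums to len(heads).
    """
    dmap = [[0.0] * width for _ in range(height)]
    for head_y, head_x in heads:
        kernel = [[math.exp(-((y - head_y) ** 2 + (x - head_x) ** 2) / (2 * sigma ** 2))
                   for x in range(width)]
                  for y in range(height)]
        total = sum(map(sum, kernel))
        for y in range(height):
            for x in range(width):
                dmap[y][x] += kernel[y][x] / total
    return dmap


# Three hypothetical head annotations in a 16x16 image.
heads = [(3, 4), (10, 12), (11, 13)]
dmap = density_map(16, 16, heads)
estimated_count = sum(map(sum, dmap))
print(round(estimated_count, 6))
```

Because every kernel is normalized inside the image bounds, integrating the map recovers the annotated count. A scale-aware method would instead vary `sigma` per head, for example according to the apparent head size.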
