Edge Prior Multilayer Segmentation Network Based on Bayesian Framework

2020 ◽  
Vol 2020 ◽  
pp. 1-11 ◽  
Author(s):  
Chu He ◽  
Zishan Shi ◽  
Peizhang Fang ◽  
Dehui Xiong ◽  
Bokun He ◽  
...  

In recent years, neural-network-based methods have achieved excellent performance in image segmentation. However, segmentation around edge areas remains unsatisfactory when dealing with complex boundaries. This paper proposes an edge prior semantic segmentation architecture based on a Bayesian framework. The entire framework is composed of three network structures: a likelihood network and an edge prior network at the front, followed by a constraint network. The likelihood network produces a rough segmentation result, which is then refined using edge prior information, including the edge map and the edge distance. For the constraint network, a modified domain transform method is proposed, in which the diffusion direction is revised through a newly defined distance map and additional constraint conditions. Experiments comparing the proposed approach with several contrastive methods show that it performs well, outperforming FCN by 0.0209 in average accuracy on the ESAR data set.
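The modified domain transform at the heart of the constraint network builds on recursive edge-aware filtering. As a rough illustration of the underlying idea (a minimal 1-D sketch of the standard domain transform recursion, not the authors' modified version), smoothing weights shrink wherever the guide signal jumps, so a noisy segmentation score diffuses within regions but not across edges:

```python
import numpy as np

def domain_transform_filter_1d(signal, guide, sigma_s=8.0, sigma_r=0.3):
    """Edge-aware recursive smoothing (one forward + one backward pass).

    `guide` supplies the edge information: a large jump in the guide shrinks
    the smoothing weight, so the filtered signal does not diffuse across edges.
    """
    # Domain-transform distances: 1 + (sigma_s / sigma_r) * |gradient of guide|
    d = 1.0 + (sigma_s / sigma_r) * np.abs(np.diff(guide, prepend=guide[0]))
    a = np.exp(-np.sqrt(2.0) / sigma_s)     # base feedback coefficient
    w = a ** d                              # per-sample weight, tiny at edges
    out = signal.astype(float).copy()
    for k in range(1, len(out)):            # forward pass
        out[k] = (1 - w[k]) * out[k] + w[k] * out[k - 1]
    for k in range(len(out) - 2, -1, -1):   # backward pass
        out[k] = (1 - w[k + 1]) * out[k] + w[k + 1] * out[k + 1]
    return out
```

Here `guide` plays the role of the edge prior: a step in it blocks diffusion, so the rough label estimate stays sharp at the boundary while noise is smoothed inside each region.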

Healthcare ◽  
2021 ◽  
Vol 9 (7) ◽  
pp. 885
Author(s):  
Yoanda Alim Syahbana ◽  
Yokota Yasunari ◽  
Morita Hiroyuki ◽  
Aoki Mitsuhiro ◽  
Suzuki Kanade ◽  
...  

The detection of nystagmus using video oculography suffers accuracy problems when patients who complain of dizziness have difficulty fully opening their eyes. Pupil detection and tracking under this condition affect the accuracy of the nystagmus waveform. In this research, we design a pupil detection method using a pattern matching approach that approximates the pupil with a Mexican hat-type ellipse pattern, in order to deal with the aforementioned problem. We evaluate the performance of the proposed method, in comparison with that of a conventional Hough transform method, on eye movement videos retrieved from Gifu University Hospital. The performance results show that the proposed method can detect and track the pupil position even when only 20% of the pupil is visible, whereas the conventional Hough transform only performs well when 90% of the pupil is visible. We also evaluate the proposed method on the Labelled Pupils in the Wild (LPW) data set. The results show that the proposed method achieves a Mean Square Error (MSE) of 1.47, which is much lower than the MSE of 9.53 obtained by the conventional Hough transform method. We conducted expert validation by consulting three medical specialists regarding the nystagmus waveform; the specialists agreed that the waveform can be evaluated clinically without contradicting their diagnoses.
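The Mexican hat-style pattern matching can be sketched as correlating a zero-mean Laplacian-of-Gaussian template against the inverted eye image and taking the peak response as the pupil center. This is an illustrative circular approximation, not the authors' elliptical pattern or code:

```python
import numpy as np

def mexican_hat_kernel(size, sigma):
    """Zero-mean Laplacian-of-Gaussian ('Mexican hat') template."""
    ax = np.arange(size) - size // 2
    xx, yy = np.meshgrid(ax, ax)
    r2 = (xx ** 2 + yy ** 2) / (2 * sigma ** 2)
    k = (1 - r2) * np.exp(-r2)      # positive center, negative surround
    return k - k.mean()             # zero mean: ignores constant background

def match(image, kernel):
    """Brute-force correlation; returns the center of the best-matching window."""
    h, w = kernel.shape
    H, W = image.shape
    best, pos = -np.inf, (0, 0)
    for i in range(H - h + 1):
        for j in range(W - w + 1):
            score = np.sum(image[i:i + h, j:j + w] * kernel)
            if score > best:
                best, pos = score, (i + h // 2, j + w // 2)
    return pos
```

Because the template is zero-mean, a partially occluded bright blob still produces a usable peak, which is the intuition behind the method's robustness at 20% pupil visibility.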


BMC Biology ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Daniele Raimondi ◽  
Antoine Passemiers ◽  
Piero Fariselli ◽  
Yves Moreau

Abstract Background Identifying variants that drive tumor progression (driver variants) and distinguishing these from variants that are a byproduct of the uncontrolled cell growth in cancer (passenger variants) is a crucial step for understanding tumorigenesis and precision oncology. Various bioinformatics methods have attempted to solve this complex task. Results In this study, we investigate the assumptions on which these methods are based, showing that the different definitions of driver and passenger variants influence the difficulty of the prediction task. More importantly, we prove that the data sets have a construction bias which prevents machine learning (ML) methods from actually learning variant-level functional effects, despite their excellent performance. This effect results from the fact that in these data sets the driver variants map to a few driver genes, while the passenger variants spread across thousands of genes, so that merely learning to recognize driver genes provides almost perfect predictions. Conclusions To mitigate this issue, we propose a novel data set that minimizes this bias by ensuring that all genes covered by the data contain both driver and passenger variants. As a result, we show that the tested predictors experience a significant drop in performance, which should not be considered poorer modeling but rather a correction of unwarranted optimism. Finally, we propose a weighting procedure to completely eliminate gene effects on such predictions, thus precisely evaluating the ability of predictors to model the functional effects of single variants, and we show that this task is indeed still open.
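The weighting idea can be illustrated with a small sketch (our own simplification, not the paper's exact procedure): give each variant a weight inversely proportional to how often its (gene, label) pair occurs, so that within every gene, driver and passenger variants carry equal total weight and gene identity alone becomes uninformative:

```python
from collections import Counter

def gene_balanced_weights(genes, labels):
    """Per-variant weights such that, within every gene, driver (1) and
    passenger (0) variants carry equal total weight; a gene-identity-only
    predictor then scores no better than chance under these weights.
    Illustrative sketch, not the paper's implementation.
    """
    counts = Counter(zip(genes, labels))
    return [1.0 / counts[(g, y)] for g, y in zip(genes, labels)]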


Geophysics ◽  
2006 ◽  
Vol 71 (5) ◽  
pp. C81-C92 ◽  
Author(s):  
Helene Hafslund Veire ◽  
Hilde Grude Borgos ◽  
Martin Landrø

Effects of pressure and fluid saturation can have the same degree of impact on seismic amplitudes and differential traveltimes in the reservoir interval; thus, they are often inseparable by analysis of a single stacked seismic data set. In such cases, time-lapse AVO analysis offers an opportunity to discriminate between the two effects. Quantifying the uncertainty in these estimations is necessary for utilizing information about pressure- and saturation-related changes in reservoir modeling and simulation. One way of analyzing uncertainties is to formulate the problem in a Bayesian framework, where the solution is represented by a probability density function (PDF), providing estimates of the uncertainties as well as direct estimates of the properties. A stochastic model for estimating pressure and saturation changes from time-lapse seismic AVO data is investigated within a Bayesian framework. Well-known rock physics relationships are used to set up a prior stochastic model, and PP reflection coefficient differences are used to establish a likelihood model linking reservoir variables and time-lapse seismic data. The methodology incorporates correlation between the different variables of the model as well as spatial dependencies for each variable. In addition, possible bottlenecks causing large uncertainties in the estimations can be identified through sensitivity analysis of the system. The method has been tested on 1D synthetic data and on field time-lapse seismic AVO data from the Gullfaks Field in the North Sea.
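The Bayesian formulation can be sketched on a coarse grid: with a Gaussian likelihood linking an observed PP reflection-coefficient difference to pressure and saturation changes, and a Gaussian prior, the posterior PDF is obtained by pointwise multiplication and normalization. The linear forward model and all coefficients below are invented placeholders standing in for the rock-physics relationships, not the paper's model:

```python
import numpy as np

# Hypothetical linearized forward model: dR = a * dP + b * dS.  The
# coefficients and noise level are illustrative placeholders only.
a, b, noise_sd = 0.8, 1.5, 0.05

def grid_posterior(dR_obs, dP_grid, dS_grid):
    """Posterior PDF over (pressure change, saturation change) on a grid:
    Gaussian likelihood times zero-mean Gaussian prior, normalized to 1."""
    P, S = np.meshgrid(dP_grid, dS_grid, indexing="ij")
    pred = a * P + b * S
    loglike = -0.5 * ((dR_obs - pred) / noise_sd) ** 2   # Gaussian likelihood
    prior = np.exp(-0.5 * (P ** 2 + S ** 2) / 0.5 ** 2)  # Gaussian prior
    post = np.exp(loglike) * prior
    return post / post.sum()                             # normalized PDF
```

The single observation constrains only the combination a*dP + b*dS, so the posterior is a ridge along that line; the prior selects a unique maximum, which is exactly the kind of ambiguity-plus-regularization the abstract's sensitivity analysis is meant to expose.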


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Rajit Nair ◽  
Santosh Vishwakarma ◽  
Mukesh Soni ◽  
Tejas Patel ◽  
Shubham Joshi

Purpose The novel coronavirus disease (COVID-19), which first appeared in December 2019 in the city of Wuhan, China, rapidly spread around the world and became a pandemic. It has had a devastating impact on daily life, public health and the global economy. Positive cases must be identified as soon as possible to avoid further dissemination of the disease and to provide swift care to affected patients. The need for supportive diagnostic instruments has increased, as no specific automated toolkits are available. The latest results from radiology imaging techniques indicate that these images provide valuable details on the COVID-19 virus. Advanced artificial intelligence (AI) technologies coupled with radiological imagery can help diagnose this condition accurately and help compensate for the lack of specialist doctors in isolated areas. In this research, a new paradigm for automatic detection of COVID-19 from bare chest X-ray images is presented. The proposed model, DarkCovidNet, is designed to provide accurate diagnostics for both binary classification (COVID vs no findings) and multi-class classification (COVID vs no findings vs pneumonia). The implemented model achieved average precisions of 98.46% and 91.352% for binary and multi-class classification, respectively, and average accuracies of 98.97% and 87.868%. The DarkNet model, originally used as the classifier of the "you only look once" real-time object detection method, serves as the basis; a total of 17 convolutional layers with different filters on each layer have been implemented. This platform can be used by radiologists to verify their initial screening and can also be used for screening patients through the cloud. Design/methodology/approach This study uses the CNN-based Darknet-19 model, which acts as the platform for the real-time object detection system; its architecture is designed for detecting objects in real time.
This study has developed the DarkCovidNet model based on the Darknet architecture, with fewer layers and filters. Before discussing the DarkCovidNet model, consider the Darknet architecture and its functionality: it typically consists of 19 convolution layers and 5 max-pool layers. Findings The work discussed in this paper is used to diagnose various radiology images and to develop a model that can accurately predict or classify the disease. The data set used in this work consists of COVID-19 and non-COVID-19 images taken from various sources. The deep learning model DarkCovidNet is applied to this data set and shows significant performance in both binary and multi-class classification: an average accuracy of 98.97% for the binary detection of COVID-19, and an average accuracy of 87.868% for the multi-class classification of COVID-19, no findings and pneumonia. Research limitations/implications One significant limitation of this work is the limited number of chest X-ray images used, while the number of patients with COVID-19 is increasing rapidly. In the future, the model will be trained on a larger data set generated from local hospitals, and its performance on that data will be checked. Originality/value Deep learning technology has made significant changes in the field of AI by generating good results, especially in pattern recognition. A typical CNN structure has a convolution layer that extracts features from the input with the filters it applies, a pooling layer to reduce the size for computational performance and a fully connected layer, which is a neural network. A CNN model is created by combining one or more such layers, and its internal parameters are adjusted to accomplish a particular task, such as classification or object recognition.
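The layer bookkeeping described above, convolutions that preserve spatial size ('same' padding) interleaved with five 2x2 max-pool layers as in DarkNet-19, can be traced with a trivial sketch; the input size of 256 is an assumption for illustration:

```python
def darknet_feature_sizes(input_size=256, pools=5):
    """Trace the spatial size through a DarkNet-19-style stack: convolutions
    keep the spatial size ('same' padding), while each of the five max-pool
    layers halves it.  Illustrative sketch, not the authors' code.
    """
    sizes = [input_size]
    for _ in range(pools):
        sizes.append(sizes[-1] // 2)   # 2x2 max pool with stride 2
    return sizes
```

So a 256x256 X-ray passes through feature maps of 256, 128, 64, 32, 16 and finally 8 pixels on a side before classification.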


2020 ◽  
Vol 12 (21) ◽  
pp. 3630
Author(s):  
Jin Liu ◽  
Haokun Zheng

Object detection and recognition in aerial and remote sensing images has become a hot topic in the field of computer vision in recent years. As these images are usually taken from a bird's-eye view, the targets often have varied shapes and are densely arranged, so using an oriented bounding box to mark a target is the mainstream choice. However, general detection methods are designed for horizontal box annotations, and methods adapted to detect oriented bounding boxes have high computational complexity. In this paper, we propose a method called ellipse field network (EFN) to organically integrate semantic segmentation and object detection. It predicts the probability distribution of the target and obtains accurate oriented bounding boxes through a post-processing step. We tested our method on the HRSC2016 and DOTA data sets, achieving mAP values of 0.863 and 0.701, respectively. We also tested the performance of EFN on natural images and obtained a mAP of 84.7 on the VOC2012 data set. These extensive experiments demonstrate that EFN achieves state-of-the-art results on aerial image tests and obtains a good score on natural images.
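The post-processing step, turning a predicted probability field into an oriented box, can be approximated by moment analysis: the weighted mean of the field gives the center, and the dominant eigenvector of its second-moment matrix gives the orientation. This is a generic sketch of moment-based orientation estimation, not EFN's actual post-processing code:

```python
import numpy as np

def oriented_axis_from_prob(prob):
    """Center and major-axis direction of a target from a probability map,
    via weighted second-order moments (an assumed, generic post-processing
    sketch; the paper's exact procedure may differ)."""
    ys, xs = np.nonzero(prob > 0)
    w = prob[ys, xs]
    cy, cx = np.average(ys, weights=w), np.average(xs, weights=w)
    cov = np.cov(np.vstack([ys - cy, xs - cx]), aweights=w)
    evals, evecs = np.linalg.eigh(cov)
    major = evecs[:, np.argmax(evals)]   # (dy, dx) of the dominant direction
    return (cy, cx), major
```

Extending the center and major/minor axes by the corresponding standard deviations then yields an oriented rectangle around the predicted ellipse field.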


2020 ◽  
Vol 2020 ◽  
pp. 1-12
Author(s):  
Allan Ng’ang’a ◽  
Paula M. W. Musuva

The main objective of this research study is to enhance the functionality of an Android pattern lock application by determining whether the time elements of a touch operation, in particular time on dot (TOD) and time between dots (TBD), can be accurately used as a biometric identifier. The three hypotheses tested through this study were the following. H1: there is a correlation between the number of touch stroke features used and the accuracy of the touch operation biometric system. H2: there is a correlation between pattern complexity and accuracy of the touch operation biometric system. H3: there is a correlation between user training and accuracy of the touch operation biometric system. Convenience sampling and a within-subjects design involving repeated measures were used when testing an overall sample size of 12 subjects drawn from a university population, who provided a total of 2,096 feature-extracted data points. Analysis was done using the Dynamic Time Warping (DTW) algorithm. Through this study, it was shown that extracting a single touch stroke biometric feature, coupled with user training, yielded average accuracy levels of up to 82%. This helps build a case for introducing biometrics into smart devices with average processing capabilities, as they would be able to handle a biometric system without compromising overall system performance. For future work, it is recommended that other classification algorithms be applied to the existing data set and their results compared with those obtained with DTW.
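The DTW algorithm used to compare the TOD/TBD timing sequences is standard; a minimal version over 1-D sequences looks like this (illustrative only, with no warping-window constraint):

```python
import numpy as np

def dtw_distance(a, b):
    """Dynamic Time Warping distance between two 1-D sequences
    (minimal O(len(a) * len(b)) dynamic-programming version)."""
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])   # local mismatch cost
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]
```

Because DTW aligns sequences that differ in tempo, two attempts at the same unlock pattern drawn at slightly different speeds still produce a small distance, which is why it suits timing-based biometrics on low-power devices.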


Sensors ◽  
2020 ◽  
Vol 20 (7) ◽  
pp. 2140
Author(s):  
Tomasz Hachaj

This paper proposes a classifier designed for human facial feature annotation that is capable of running on relatively cheap, low power consumption autonomous microcomputer systems. An autonomous system is one that depends only on locally available hardware and software; for example, it does not use remote services available through the Internet. The proposed solution, which consists of a Histogram of Oriented Gradients (HOG) face detector and a set of neural networks, has average accuracy and average true positive and true negative ratios comparable to state-of-the-art deep neural network (DNN) architectures. However, contrary to DNNs, the proposed method can easily be implemented on a microcomputer with very limited RAM and without additional coprocessors. The proposed method was trained and evaluated on a large face data set of 200,000 images and compared with results obtained by other researchers. Further evaluation proves that it is possible to perform facial image attribute classification using the proposed algorithm on incoming video data captured by the microcomputer's RGB camera sensor. The obtained results can be easily reproduced, as both the data set and source code can be downloaded. The main and novel contribution of this research is the development and evaluation of the proposed facial image annotation algorithm and its implementation, which is easily portable between various hardware and operating systems (virtually the same code works on both high-end PCs and microcomputers under Windows and Linux) and is dedicated to low power consumption devices without coprocessors.
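The HOG front end computes, per image cell, a histogram of gradient orientations weighted by gradient magnitude. A minimal single-cell sketch (unsigned gradients, 9 bins, L2 normalization; these parameter choices are illustrative, not necessarily those used in the paper):

```python
import numpy as np

def hog_cell_histogram(cell, bins=9):
    """Histogram of Oriented Gradients for one cell, in its simplest
    unsigned-gradient form (illustrative sketch)."""
    gy, gx = np.gradient(cell.astype(float))             # per-axis derivatives
    mag = np.hypot(gx, gy)                               # gradient magnitude
    ang = np.degrees(np.arctan2(gy, gx)) % 180           # unsigned orientation
    hist = np.zeros(bins)
    idx = np.minimum((ang / (180 / bins)).astype(int), bins - 1)
    np.add.at(hist, idx.ravel(), mag.ravel())            # magnitude-weighted vote
    return hist / (np.linalg.norm(hist) + 1e-9)          # L2 normalization
```

Descriptors like this are cheap to compute on a microcomputer because they need only gradients, binning and a normalization, with no large matrix multiplications.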


Author(s):  
Arinan Dourado ◽  
Firat Irmak ◽  
Felipe Viana ◽  
Ali Gordon

Abstract The Coffin-Manson-Basquin-Halford (CMBH) model is a well-accepted strain-life relationship for modeling fatigue life as a function of applied strain. In this paper, we propose a non-stationary uncertainty model for the CMBH model, alongside a Bayesian framework for model calibration and estimation of confidence and prediction intervals. Using Inconel 617 coupon test data, we compare our approach to traditional stationary variance models. The proposed uncertainty model successfully captures the fact that the variance of fatigue life decreases as the applied strain decreases. Additionally, a discussion on how to use the proposed Bayesian framework to compensate for the lack of data by using prior information from similar alloys is presented, considering Hastelloy-X and Inconel 617 coupon data.
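For reference, the underlying strain-life relation expresses the total strain amplitude as the sum of an elastic (Basquin) and a plastic (Coffin-Manson) term in the number of reversals. The material constants below are rough illustrative values, not the calibrated Inconel 617 parameters from the paper:

```python
import numpy as np

# Coffin-Manson-Basquin strain-life relation in its standard textbook form.
# E: modulus (MPa), sf: fatigue strength coeff. (MPa), b: fatigue strength
# exponent, ef: fatigue ductility coeff., c: fatigue ductility exponent.
# All values are illustrative placeholders.
E, sf, b, ef, c = 211e3, 1200.0, -0.1, 0.6, -0.6

def strain_amplitude(N):
    """Total strain amplitude at 2N reversals: elastic + plastic terms."""
    return (sf / E) * (2 * N) ** b + ef * (2 * N) ** c
```

Since both exponents are negative, strain amplitude falls monotonically with life; the sketch reproduces only this mean trend, while the paper's contribution is the non-stationary variance model wrapped around it.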


2018 ◽  
Vol 7 (2.5) ◽  
pp. 1
Author(s):  
Khalil Khan ◽  
Nasir Ahmad ◽  
Irfan Uddin ◽  
Muhammad Ehsan Mazhar ◽  
Rehan Ullah Khan

Background and objective: A novel face parsing method is proposed in this paper which partitions a facial image into six semantic classes. Unlike previous approaches, which segmented a facial image into three or four classes, we extended the class labels to six. Materials and Methods: A data set of 464 images taken from the FEI, MIT-CBCL, Pointing'04 and SiblingsDB databases was annotated. A discriminative model was trained by extracting features from squared patches. The built model was tested with two different semantic segmentation approaches: pixel-based and super-pixel-based semantic segmentation (PB_SS and SPB_SS). Results: A pixel labeling accuracy (PLA) of 94.68% and 90.35% was obtained with the PB_SS and SPB_SS methods, respectively, on frontal images. Conclusions: A new method for face part parsing was proposed which efficiently segments a facial image into its constituent parts.
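The pixel labeling accuracy (PLA) metric reported above is simply the fraction of correctly labeled pixels; a one-function sketch:

```python
import numpy as np

def pixel_labeling_accuracy(pred, truth):
    """Pixel labeling accuracy (PLA): fraction of pixels whose predicted
    semantic class matches the ground-truth annotation."""
    pred, truth = np.asarray(pred), np.asarray(truth)
    return float((pred == truth).mean())
```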


Metals ◽  
2020 ◽  
Vol 10 (11) ◽  
pp. 1495
Author(s):  
Yiming Wang ◽  
Jing Han ◽  
Jun Lu ◽  
Lianfa Bai ◽  
Zhuang Zhao

Contour extraction of the molten pool, one of its basic visual morphological characteristics, plays an important role in on-line monitoring of welding quality, and the limitations of traditional edge detection algorithms give deep learning an increasingly important role in the target segmentation task. In this paper, a molten pool visual sensing system for a tungsten inert gas (TIG) welding process environment is established and the corresponding molten pool image data set is constructed. Based on a residual network, a multi-scale feature fusion semantic segmentation network, Res-Seg, is designed. To further improve the generalization ability of the network model, deep convolutional generative adversarial networks (DCGAN) are used to supplement the molten pool data set, followed by color and morphological data augmentation before network training. Comparison with traditional edge detection algorithms and other semantic segmentation networks verifies that the scheme has high accuracy and robustness in the actual welding environment. Moreover, a back propagation (BP) neural network is used to predict the weld width, with a fitting test carried out between the pixel width of the molten pool and the corresponding actual weld width. The average testing error is less than 0.2 mm, which meets the welding accuracy requirements.
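The final fitting step, mapping molten-pool pixel width to actual weld width, uses a BP (back propagation) neural network. A minimal one-hidden-layer regressor trained by backpropagation can sketch the idea (our own toy implementation and hyperparameters, not the authors' network):

```python
import numpy as np

def train_bp_regressor(x, y, hidden=8, lr=0.05, epochs=2000, seed=0):
    """Minimal one-hidden-layer back-propagation regressor, standing in for
    a BP network that maps pixel width to weld width.  Returns a predict fn."""
    rng = np.random.default_rng(seed)
    W1 = rng.normal(0, 0.5, (1, hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0, 0.5, (hidden, 1)); b2 = np.zeros(1)
    X, Y = x.reshape(-1, 1), y.reshape(-1, 1)
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)              # forward pass
        P = H @ W2 + b2
        dP = 2 * (P - Y) / len(X)             # gradient of mean squared error
        dW2 = H.T @ dP; db2 = dP.sum(0)
        dH = dP @ W2.T * (1 - H ** 2)         # back-propagate through tanh
        dW1 = X.T @ dH; db1 = dH.sum(0)
        W2 -= lr * dW2; b2 -= lr * db2        # gradient-descent updates
        W1 -= lr * dW1; b1 -= lr * db1
    return lambda t: (np.tanh(t.reshape(-1, 1) @ W1 + b1) @ W2 + b2).ravel()
```

In practice the camera-to-workpiece calibration makes the pixel-to-millimeter relation nearly linear, so even this small network (or a plain least-squares fit) can reach sub-0.2 mm residuals on clean data.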

