Comparison of rule-based and neural network models for negation detection in radiology reports

2020 ◽  
pp. 1-22 ◽  
Author(s):  
D. Sykes ◽  
A. Grivas ◽  
C. Grover ◽  
R. Tobin ◽  
C. Sudlow ◽  
...  

Abstract Using natural language processing, it is possible to extract structured information from raw text in the electronic health record (EHR) at reasonably high accuracy. However, accurately distinguishing negated from non-negated mentions of clinical terms remains a challenge. EHR text includes cases where diseases are stated not to be present or are only hypothesised, meaning a disease can be mentioned in a report without being reported as present. This makes tasks such as document classification and summarisation more difficult. We have developed the rule-based EdIE-R-Neg, part of an existing text mining pipeline called EdIE-R (Edinburgh Information Extraction for Radiology reports, https://www.ltg.ed.ac.uk/software/edie-r/) developed to process brain imaging reports, and two machine learning approaches: one using a bidirectional long short-term memory network and another using a feedforward neural network. These were developed on data from the Edinburgh Stroke Study (ESS) and tested on routine reports from NHS Tayside (Tayside). Both datasets consist of written reports from medical scans. These models are compared with two existing rule-based models: pyConText (Harkema et al. 2009. Journal of Biomedical Informatics 42(5), 839–851), a Python implementation of a generalisation of NegEx, and NegBio (Peng et al. 2017. NegBio: A high-performance tool for negation and uncertainty detection in radiology reports. arXiv:1712.05898), which identifies negation scopes through patterns applied to a syntactic representation of the sentence. On both the test set of the dataset from which our models were developed and the largely similar Tayside test set, the neural network models and our custom-built rule-based system outperformed the existing methods. EdIE-R-Neg scored highest on F1, particularly on the Tayside test set, from which no development data were used in these experiments, showing the power of custom-built rule-based systems for negation detection on datasets of this size. The performance gap between the machine learning models and EdIE-R-Neg on the Tayside test set was reduced by adding Tayside development data to the ESS training set, demonstrating the adaptability of the neural network models.
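
To make the rule-based approach concrete, the sketch below implements a NegEx-style trigger-and-scope check in Python. It is a deliberately minimal illustration under assumed trigger, terminator, and window values; it is not the EdIE-R-Neg, pyConText, or NegBio implementation.

# Minimal NegEx-style negation scoping (illustrative sketch only; the trigger,
# terminator and window values below are assumptions, not EdIE-R-Neg rules).
NEG_TRIGGERS = ["no evidence of", "negative for", "without", "no"]
TERMINATORS = {"but", "however", "although"}   # tokens that close a negation scope
SCOPE_WINDOW = 6                               # max tokens a trigger can negate

def negated_mentions(sentence, terms):
    """Return each clinical term with a flag for whether it falls in a negation scope."""
    tokens = sentence.lower().replace(",", " ").split()
    negated = set()
    for i in range(len(tokens)):
        for trigger in NEG_TRIGGERS:
            tt = trigger.split()
            if tokens[i:i + len(tt)] == tt:
                j = i + len(tt)
                while j < len(tokens) and j < i + len(tt) + SCOPE_WINDOW and tokens[j] not in TERMINATORS:
                    negated.add(j)
                    j += 1
    return {term: any(tokens[k] == term.lower() for k in negated) for term in terms}

print(negated_mentions("No evidence of acute infarct but chronic ischaemia is seen",
                       ["infarct", "ischaemia"]))
# -> {'infarct': True, 'ischaemia': False}

A production system such as EdIE-R-Neg typically layers many more rules (hypothetical mentions, list scopes, report sections) on top of this basic trigger-and-scope pattern.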

Author(s):  
Sai Van Cuong ◽  
M. V. Shcherbakov

This work addresses automatic (expert-free) forecasting of high-frequency time series. The efficiency of high-frequency time series forecasting using different statistical and machine learning models is investigated: classical statistical forecasting methods are compared with neural network models on 1000 synthetic sets of high-frequency data. The neural network models give better prediction results, but require more computation time than the statistical approaches.
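
As a rough illustration of the kind of comparison described (not the authors' 1000-series benchmark), the sketch below contrasts a naive statistical baseline with a small neural network on one synthetic high-frequency series; the lag length, network size, and signal are arbitrary assumptions.

import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
t = np.arange(5000)
y = np.sin(2 * np.pi * t / 50) + 0.1 * rng.standard_normal(t.size)    # synthetic signal

LAGS = 20                                                             # assumed window length
X = np.column_stack([y[i:len(y) - LAGS + i] for i in range(LAGS)])    # lag-feature matrix
target = y[LAGS:]
split = int(0.8 * len(target))
X_train, X_test, y_train, y_test = X[:split], X[split:], target[:split], target[split:]

naive_pred = X_test[:, -1]                                            # "repeat the last value" baseline

mlp = MLPRegressor(hidden_layer_sizes=(32,), max_iter=500, random_state=0)
mlp.fit(X_train, y_train)
mlp_pred = mlp.predict(X_test)

rmse = lambda p: float(np.sqrt(np.mean((p - y_test) ** 2)))
print("naive RMSE:", rmse(naive_pred), " MLP RMSE:", rmse(mlp_pred))

Timing mlp.fit against the baseline makes the accuracy-versus-compute trade-off reported above easy to reproduce on any synthetic series.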


Author(s):  
Dr. Karrupusamy P.

Language modelling, usually referred to as statistical language modelling, is a fundamental and core process of natural language processing. It is also vital to other tasks such as sentence completion, automatic speech recognition, statistical machine translation, and text generation, and the success of practical natural language processing relies heavily on the quality of the language model. Over the years the problem has drawn on research fields such as linguistics, psychology, speech recognition, data compression, neuroscience, and machine translation. As neural networks are a strong choice for high-quality language modelling, this paper presents an analysis of neural networks for modelling language. Using datasets such as the Penn Treebank, the Billion Word Benchmark, and WikiText, the neural network models are evaluated on word error rate, perplexity, and bilingual evaluation understudy (BLEU) scores to identify the optimal model.
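
Perplexity, one of the metrics named above, is the exponentiated average negative log-probability a model assigns to the test tokens. The toy unigram example below shows the calculation; the smoothing scheme and tiny corpus are assumptions for illustration, and a neural language model would be scored the same way from its per-token probabilities.

import math
from collections import Counter

train = "the cat sat on the mat".split()
test = "the cat sat".split()

vocab = set(train) | {"<unk>"}
counts = Counter(train)
total = sum(counts.values())

def prob(w):
    # Add-one (Laplace) smoothed unigram probability; <unk> covers unseen words.
    return (counts.get(w, 0) + 1) / (total + len(vocab))

log_prob = sum(math.log(prob(w)) for w in test)
perplexity = math.exp(-log_prob / len(test))
print(f"perplexity = {perplexity:.2f}")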


Author(s):  
Hyun-il Lim

A neural network is a machine learning approach in which the connected nodes of a model are trained to predict the results of specific problems. The prediction model is trained using previously collected training data. In training neural network models, overfitting can occur from excessively dependent training on the data and from structural problems of the models. In this paper, we analyze the effect of DropConnect for controlling overfitting in neural networks, considered across DropConnect rates and the number of nodes used in designing the networks. The analysis results of this study help to understand the effect of DropConnect in neural networks, so that an effective neural network model can be designed by applying DropConnect with appropriate parameters.
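
For readers unfamiliar with the technique, the sketch below shows the core of DropConnect for a single fully connected layer: individual weights, rather than activations, are zeroed at random during training. The inverted rescaling by 1/(1-p) is one common convention and an assumption here, not necessarily the exact variant analysed in the paper.

import numpy as np

rng = np.random.default_rng(0)

def dropconnect_forward(x, W, b, p=0.5, training=True):
    """One fully connected ReLU layer with DropConnect applied to the weight matrix."""
    if training:
        mask = rng.random(W.shape) >= p        # keep each weight with probability (1 - p)
        W_eff = (W * mask) / (1.0 - p)         # rescale so the expected pre-activation is unchanged
    else:
        W_eff = W                              # full weights at inference time
    return np.maximum(0.0, x @ W_eff + b)

x = rng.standard_normal((4, 8))                # batch of 4 inputs with 8 features
W = rng.standard_normal((8, 16)) * 0.1
b = np.zeros(16)
print(dropconnect_forward(x, W, b, p=0.3).shape)   # (4, 16)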


Sensors ◽  
2020 ◽  
Vol 20 (13) ◽  
pp. 3642
Author(s):  
Alessandro Simeone ◽  
Elliot Woolley ◽  
Josep Escrig ◽  
Nicholas James Watson

Effectively cleaning equipment is essential for the safe production of food but requires a significant amount of time and resources such as water, energy, and chemicals. To optimize the cleaning of food production equipment, innovative technologies are needed to monitor the removal of fouling from equipment surfaces. In this work, optical and ultrasonic sensors are used to monitor the removal of food fouling materials with different physicochemical properties from a benchtop rig. Tailored signal and image processing procedures are developed to monitor the cleaning process, and a neural network regression model is developed to predict the amount of fouling remaining on the surface. The results show that the three dissimilar food fouling materials investigated were removed from the test section via different cleaning mechanisms, and the neural network models were able to predict the area and volume of fouling present during cleaning with accuracies as high as 98% and 97%, respectively. This work demonstrates that sensors and machine learning methods can be effectively combined to monitor cleaning processes.
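
As a small illustration of the image-processing side of such monitoring (the paper's actual pipeline, sensors, and thresholds are not reproduced here), the sketch below estimates the fouled fraction of a surface from a single grayscale frame by intensity thresholding; the assumption that fouling appears darker than the clean surface is purely illustrative.

import numpy as np

def fouling_area_fraction(frame, threshold=0.4):
    """frame: 2-D array of pixel intensities in [0, 1]; darker pixels are treated as fouling (assumed)."""
    fouled = frame < threshold
    return float(fouled.mean())

rng = np.random.default_rng(1)
frame = rng.uniform(0.6, 1.0, size=(64, 64))            # mostly bright, clean surface
frame[:16, :16] = rng.uniform(0.0, 0.3, size=(16, 16))  # a dark fouled patch
print(f"estimated fouled fraction: {fouling_area_fraction(frame):.2f}")   # ~0.06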


2019 ◽  
Vol 45 (2) ◽  
pp. 293-337 ◽  
Author(s):  
Hao Zhang ◽  
Richard Sproat ◽  
Axel H. Ng ◽  
Felix Stahlberg ◽  
Xiaochang Peng ◽  
...  

Machine learning techniques, including neural networks, have been applied to virtually every domain in natural language processing. One problem that has been somewhat resistant to effective machine learning solutions is text normalization for speech applications such as text-to-speech synthesis (TTS). In this application, one must decide, for example, that 123 is verbalized as one hundred twenty three in 123 pages but as one twenty three in 123 King Ave. For this task, state-of-the-art industrial systems depend heavily on hand-written language-specific grammars. We propose neural network models that treat text normalization for TTS as a sequence-to-sequence problem, in which the input is a text token in context, and the output is the verbalization of that token. We find that the most effective model, in accuracy and efficiency, is one where the sentential context is computed once and the results of that computation are combined with the computation of each token in sequence to compute the verbalization. This model allows for a great deal of flexibility in terms of representing the context, and also allows us to integrate tagging and segmentation into the process. These models perform very well overall, but occasionally they will predict wildly inappropriate verbalizations, such as reading 3 cm as three kilometers. Although rare, such verbalizations are a major issue for TTS applications. We thus use finite-state covering grammars to guide the neural models, either during training and decoding, or just during decoding, away from such “unrecoverable” errors. Such grammars can largely be learned from data.
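
The covering-grammar idea can be illustrated with a toy sketch: a small hand-written grammar enumerates the permitted verbalizations of a token, and the neural model is only allowed to choose among them, which rules out readings such as 3 cm becoming three kilometers. The scoring function below is a hand-coded stand-in for the real sequence-to-sequence model, and the grammar covers only the two examples from the abstract.

def covering_grammar(token):
    """Enumerate permitted verbalizations (toy grammar covering two example tokens)."""
    if token == "123":
        # Both readings are licensed; the model must pick one from context.
        return ["one hundred twenty three", "one twenty three"]
    if token == "3 cm":
        # Only the unit-faithful reading is licensed, so "three kilometers" is unreachable.
        return ["three centimeters"]
    return [token]

def neural_score(candidate, context):
    # Stand-in for the seq2seq model's score of `candidate` given `context`
    # (a real model would be learned; this toy prefers the address-style reading
    # when the context looks like a street address).
    return 1.0 if "Ave" in context and candidate == "one twenty three" else 0.0

def verbalize(token, context):
    candidates = covering_grammar(token)
    return max(candidates, key=lambda c: neural_score(c, context))

print(verbalize("123", "123 pages"))       # -> "one hundred twenty three"
print(verbalize("123", "123 King Ave"))    # -> "one twenty three"
print(verbalize("3 cm", "a 3 cm lesion"))  # -> "three centimeters"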


2020 ◽  
Vol 10 (2) ◽  
pp. 1-11
Author(s):  
Evangelos Katsamakas ◽  
Hao Sun

Crowdfunding is a novel and important economic mechanism for funding projects and promoting innovation in the digital economy. This article explores the most recent structured and unstructured data from a crowdfunding platform. It provides an in-depth exploration of the data using text analytics techniques such as sentiment analysis and topic modeling. It uses novel natural language processing to represent project descriptions and evaluates machine learning models, including neural network models, to predict project fundraising success. It discusses the findings of the performance evaluation and summarizes lessons for crowdfunding platforms and their users.
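
A minimal sketch of the kind of text-based success prediction described follows, using TF-IDF features over project descriptions and a linear classifier; the descriptions, labels, and model choice are toy assumptions, not the article's data or pipeline.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

descriptions = [
    "Smart water bottle with hydration tracking app",
    "Handmade leather journals for travellers",
    "Open-source home automation hub",
    "Board game about urban gardening",
]
funded = [1, 0, 1, 0]   # hypothetical labels: 1 = reached funding goal

model = make_pipeline(TfidfVectorizer(), LogisticRegression())
model.fit(descriptions, funded)
print(model.predict(["Smart home hub with open-source firmware"]))   # predicted outcome for a new description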


A series of neural network models used in the development of an aggregated digital twin of equipment as a cyber-physical system is presented. The twins of machining accuracy, chip formation and tool wear are examined in detail. On their basis, systems for stabilization of the chip formation process during cutting and for diagnosis of cutting tool wear are developed. Keywords: cyber-physical system; neural network model of equipment; big data; digital twin of chip formation; digital twin of tool wear; digital twin of nanostructured coating choice.


2021 ◽  
Vol 12 (6) ◽  
pp. 1-21
Author(s):  
Jayant Gupta ◽  
Carl Molnar ◽  
Yiqun Xie ◽  
Joe Knight ◽  
Shashi Shekhar

Spatial variability is a prominent feature of various geographic phenomena such as climatic zones, USDA plant hardiness zones, and terrestrial habitat types (e.g., forest, grasslands, wetlands, and deserts). However, current deep learning methods follow a spatial one-size-fits-all (OSFA) approach to train single deep neural network models that do not account for spatial variability. Quantification of spatial variability can be challenging due to the influence of many geophysical factors. In preliminary work, we proposed a spatial variability aware neural network (SVANN-I, formerly called SVANN) approach where weights are a function of location but the neural network architecture is location independent. In this work, we explore a more flexible SVANN-E approach where neural network architecture varies across geographic locations. In addition, we provide a taxonomy of SVANN types and a physics-inspired interpretation model. Experiments with aerial-imagery-based wetland mapping show that SVANN-I outperforms OSFA and SVANN-E performs the best of all.
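
A deliberately simplified sketch of the SVANN-I idea follows: the architecture is shared, but a separate set of weights is trained per geographic zone, and prediction routes each location to its zone's model. The two-zone split, synthetic features, and labels below are assumptions, not the paper's wetland data or deep architecture.

import numpy as np
from sklearn.neural_network import MLPClassifier

def zone_of(lon, lat):
    return 0 if lon < -93.0 else 1   # hypothetical split of the study area into two zones

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 4))                    # per-pixel spectral features (synthetic)
lonlat = np.column_stack([rng.uniform(-94, -92, 200), rng.uniform(44, 46, 200)])
y = (X[:, 0] + 0.5 * (lonlat[:, 0] > -93.0) > 0).astype(int)   # synthetic wetland labels

models = {}
for z in (0, 1):
    idx = np.array([zone_of(*p) for p in lonlat]) == z
    m = MLPClassifier(hidden_layer_sizes=(16,), max_iter=500, random_state=0)   # same architecture
    m.fit(X[idx], y[idx])                            # location-dependent weights
    models[z] = m

def predict(x, lon, lat):
    """Route each location to the model trained for its zone."""
    return models[zone_of(lon, lat)].predict(x.reshape(1, -1))[0]

print(predict(X[0], *lonlat[0]))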


2018 ◽  
Vol 8 (8) ◽  
pp. 1290 ◽  
Author(s):  
Beata Mrugalska

Increasing expectations of industrial system reliability require the development of more effective and robust fault diagnosis methods. The paper presents a framework for improving the quality of a neural model applied for fault detection purposes. In particular, the proposed approach starts with an adaptation of the modified quasi-outer-bounding algorithm to non-linear neural network models. Subsequently, its convergence is proven using the quadratic boundedness paradigm. The obtained algorithm is then equipped with a sequential D-optimum experimental design mechanism allowing gradual reduction of the neural model uncertainty. Finally, a robust fault detection framework is proposed in which the neural network uncertainty description provides adaptive thresholds.
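
The thresholding step can be illustrated with a small residual-based sketch: a fault is flagged when the measured output leaves an adaptive band derived from a per-sample uncertainty estimate of the model. The uncertainty profile and fault below are synthetic assumptions, a simplified stand-in for the paper's quadratic-boundedness and D-optimum design machinery.

import numpy as np

def detect_faults(measured, predicted, sigma, k=3.0):
    """Flag samples whose residual leaves the adaptive band predicted +/- k*sigma.

    sigma is a per-sample uncertainty estimate of the neural model's output."""
    residual = measured - predicted
    return np.abs(residual) > k * sigma

rng = np.random.default_rng(0)
predicted = np.sin(np.linspace(0, 10, 200))                      # neural model output (stand-in)
sigma = 0.05 + 0.02 * np.abs(np.cos(np.linspace(0, 10, 200)))    # assumed uncertainty profile
measured = predicted + rng.normal(0, 0.03, 200)
measured[150:160] += 0.5                                         # injected fault
print(np.where(detect_faults(measured, predicted, sigma))[0])    # indices around 150-159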

