Acquisition of a Knowledge Dictionary from Training Examples Including Multiple Values

Author(s): Shigeaki Sakurai, Yumi Ichimura, Akihiro Suyama, Ryohei Orihara
2001, Vol 14 (1), pp. 79-92
Author(s): A. Nakashima, A. Hirabayashi, H. Ogawa

2021, Vol 3 (2), pp. 299-317
Author(s): Patrick Schrempf, Hannah Watson, Eunsoo Park, Maciej Pajak, Hamish MacKinnon, ...

Training medical image analysis models traditionally requires large amounts of expertly annotated imaging data, which is time-consuming and expensive to obtain. One solution is to automatically extract scan-level labels from radiology reports. Previously, we showed that, by extending BERT with a per-label attention mechanism, we can train a single model to perform automatic extraction of many labels in parallel. However, if we rely on purely data-driven learning, the model sometimes fails to learn critical features or learns the correct answer via simplistic heuristics (e.g., that "likely" indicates positivity), and thus fails to generalise to rarer cases which have not been learned or where the heuristics break down (e.g., "likely represents prominent VR space or lacunar infarct", which indicates uncertainty over two differential diagnoses). In this work, we propose template creation for data synthesis, which enables us to inject expert knowledge about unseen entities from medical ontologies and to teach the model rules on how to label difficult cases by producing relevant training examples. Using this technique alongside domain-specific pre-training for our underlying BERT architecture, i.e. PubMedBERT, we improve F1 micro from 0.903 to 0.939 and F1 macro from 0.512 to 0.737 on an independent test set for 33 labels in head CT reports for stroke patients. Our methodology offers a practical way to combine domain knowledge with machine learning for text classification tasks.
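The per-label attention extension mentioned above can be pictured with a short sketch. The following is a minimal illustration (not the authors' released code), assuming PyTorch and HuggingFace Transformers; the encoder checkpoint name, label count and initialisation scale are assumptions made for illustration. Each label gets its own attention query over the token embeddings and its own binary logit, so all labels are extracted in parallel from a single encoder pass.

```python
# Minimal sketch of a per-label attention head on top of a BERT-style encoder.
# Checkpoint name, num_labels and init scale are illustrative assumptions.
import torch
import torch.nn as nn
from transformers import AutoModel

class PerLabelAttentionClassifier(nn.Module):
    def __init__(self,
                 encoder_name="microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract",
                 num_labels=33):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(encoder_name)
        hidden = self.encoder.config.hidden_size
        # One learned attention query per label, plus a per-label scoring vector.
        self.label_queries = nn.Parameter(torch.randn(num_labels, hidden) * 0.02)
        self.label_weights = nn.Parameter(torch.randn(num_labels, hidden) * 0.02)
        self.label_bias = nn.Parameter(torch.zeros(num_labels))

    def forward(self, input_ids, attention_mask):
        tokens = self.encoder(input_ids=input_ids,
                              attention_mask=attention_mask).last_hidden_state  # (B, T, H)
        # Score every label query against every token, masking padding tokens.
        scores = torch.einsum("lh,bth->blt", self.label_queries, tokens)
        scores = scores.masked_fill(attention_mask.unsqueeze(1) == 0, float("-inf"))
        weights = torch.softmax(scores, dim=-1)                  # (B, L, T)
        pooled = torch.einsum("blt,bth->blh", weights, tokens)   # one vector per label
        # Independent binary logit per label (multi-label classification).
        return torch.einsum("blh,lh->bl", pooled, self.label_weights) + self.label_bias
```

The logits would typically be trained with a per-label binary cross-entropy loss, one term per report label.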


Author(s): Xinyi Li, Liqiong Chang, Fangfang Song, Ju Wang, Xiaojiang Chen, ...

This paper focuses on a fundamental question in Wi-Fi-based gesture recognition: "Can we use the knowledge learned from some users to perform gesture recognition for others?". This problem is also known as cross-target recognition. It arises in many practical deployments of Wi-Fi-based gesture recognition, where it is prohibitively expensive to collect training data from every single user. We present CrossGR, a low-cost cross-target gesture recognition system. As a departure from existing approaches, CrossGR does not require prior knowledge (such as who is currently performing a gesture) of the target user. Instead, CrossGR employs a deep neural network to extract user-agnostic but gesture-related Wi-Fi signal characteristics to perform gesture recognition. To provide sufficient training data for an effective deep learning model, CrossGR employs a generative adversarial network to automatically generate a large volume of synthetic training data from a small set of real-world examples collected from a small number of users. This strategy allows CrossGR to minimize user involvement and the associated cost of collecting training examples for building an accurate gesture recognition system. We evaluate CrossGR by applying it to gesture recognition across 10 users and 15 gestures. Experimental results show that CrossGR achieves an accuracy of over 82.6% (up to 99.75%). We demonstrate that CrossGR delivers comparable recognition accuracy while using an order of magnitude fewer training samples collected from end users when compared to state-of-the-art recognition systems.
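The data-augmentation step can be sketched as follows. This is a minimal conditional-GAN sketch in PyTorch, not the CrossGR architecture; the feature dimensionality, network sizes and the one-hot gesture conditioning are illustrative assumptions. The generator learns to produce synthetic gesture samples that the discriminator cannot distinguish from the small set of real ones, which can then pad out the training set.

```python
# Minimal conditional GAN sketch for augmenting a small set of Wi-Fi gesture
# samples. FEAT_DIM, NOISE_DIM and layer sizes are illustrative assumptions.
import torch
import torch.nn as nn

FEAT_DIM, NOISE_DIM, N_GESTURES = 256, 64, 15

class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(NOISE_DIM + N_GESTURES, 256), nn.ReLU(),
            nn.Linear(256, FEAT_DIM))
    def forward(self, z, onehot_gesture):
        return self.net(torch.cat([z, onehot_gesture], dim=1))

class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(FEAT_DIM + N_GESTURES, 256), nn.LeakyReLU(0.2),
            nn.Linear(256, 1))
    def forward(self, x, onehot_gesture):
        return self.net(torch.cat([x, onehot_gesture], dim=1))

def train_step(gen, disc, real, labels, opt_g, opt_d, bce=nn.BCEWithLogitsLoss()):
    onehot = torch.nn.functional.one_hot(labels, N_GESTURES).float()
    z = torch.randn(real.size(0), NOISE_DIM)

    # Discriminator: tell real samples from generated ones.
    fake = gen(z, onehot).detach()
    d_loss = bce(disc(real, onehot), torch.ones(real.size(0), 1)) + \
             bce(disc(fake, onehot), torch.zeros(real.size(0), 1))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator: try to fool the discriminator.
    fake = gen(z, onehot)
    g_loss = bce(disc(fake, onehot), torch.ones(real.size(0), 1))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()
```

Once trained, the generator can be sampled per gesture class to expand the training set before fitting the recognition network.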


2001, Vol 10 (04), pp. 613-637
Author(s): M. M. West, T. L. McCluskey

In this paper we describe a project (IMPRESS) in which machine learning (ML) tools were created and utilised for the validation of an Air Traffic Control domain theory written in first-order logic. During the project, novel techniques were devised for the automated revision of general clause-form theories using training examples. These techniques were combined in an algorithm that focuses on the parts of a theory involving ordinal sorts and applies geometrical revision operators to repair faulty components. While we illustrate the feasibility of applying ML to this area, we conclude that to be effective it must be tailored to the application at hand and used in mixed-initiative mode within a tools environment. The method is illustrated with experimental results obtained during the project.
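The idea of a geometrical revision operator over an ordinal sort can be illustrated with a small sketch (not the IMPRESS tool): a clause that constrains a numeric attribute to an interval is widened just enough to admit positive training examples it wrongly rejects, and the revision is kept only if no negative example becomes covered. The attribute and example values below are assumptions for illustration.

```python
# Sketch of a geometrical revision operator on an interval constraint.
# Attribute, bounds and example values are illustrative assumptions.

def revise_interval(lo, hi, false_negatives, negatives):
    """Widen [lo, hi] to admit wrongly rejected positives, unless a negative slips in."""
    new_lo = min([lo] + [v for v in false_negatives if v < lo])
    new_hi = max([hi] + [v for v in false_negatives if v > hi])
    if any(new_lo <= v <= new_hi for v in negatives):
        return lo, hi          # revision rejected: it would cover a negative example
    return new_lo, new_hi      # revision accepted

# Example: a numeric bound in a clause, repaired from training examples.
lo, hi = 1000.0, 3000.0                  # current clause: 1000 <= attribute <= 3000
false_negatives = [950.0, 3100.0]        # positives the clause currently rejects
negatives = [600.0, 4200.0]              # examples the clause must keep excluding
print(revise_interval(lo, hi, false_negatives, negatives))   # -> (950.0, 3100.0)
```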


2014, Vol 989-994, pp. 1088-1092
Author(s): Chen Guang Zhang, Yan Zhang, Xia Huan Zhang

In this paper, a novel interactive medical image segmentation method called SMOPL is proposed. The method requires marking only a few pixels in the foreground region. To do this, SMOPL characterizes the inherent correlations between foreground and background pixels as Hilbert-Schmidt independence. By simultaneously maximizing this independence and minimizing the smoothness of labels on an instance neighbour graph, SMOPL obtains sufficiently smooth confidences for both the positive and negative classes in the absence of negative training examples. A segmentation is then obtained by assigning each pixel the label with the greatest confidence. Experiments on real-world medical images show that SMOPL robustly produces high-quality segmentations from positive label examples alone.
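As a rough illustration of the overall pipeline (pixels as nodes of a neighbour graph, confidences smoothed outward from positive seeds only), the following NumPy sketch uses plain graph label propagation as a stand-in for SMOPL's Hilbert-Schmidt independence objective; the features, graph parameters and final threshold are assumptions.

```python
# Simplified stand-in for SMOPL: kNN graph over per-pixel features, confidences
# propagated from positive seeds only. Parameters are illustrative assumptions.
import numpy as np

def knn_affinity(features, k=8, sigma=0.1):
    """Dense symmetric kNN affinity matrix over per-pixel feature vectors."""
    d2 = ((features[:, None, :] - features[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2 * sigma ** 2))
    keep = np.argsort(-W, axis=1)[:, :k]
    mask = np.zeros_like(W, dtype=bool)
    np.put_along_axis(mask, keep, True, axis=1)
    W = np.where(mask | mask.T, W, 0.0)
    np.fill_diagonal(W, 0.0)
    return W

def propagate(W, positive_seeds, n_pixels, alpha=0.9, iters=50):
    """Smooth a foreground confidence over the graph from positive seeds only."""
    P = W / (W.sum(1, keepdims=True) + 1e-12)    # row-normalised transition matrix
    f = np.zeros(n_pixels)
    f[positive_seeds] = 1.0
    seed = f.copy()
    for _ in range(iters):
        f = alpha * P @ f + (1 - alpha) * seed   # smoothness + fidelity to the seeds
    return f

# Toy usage: 100 "pixels" with 3-d features, five of them marked as foreground.
feats = np.random.rand(100, 3)
W = knn_affinity(feats)
fg = propagate(W, positive_seeds=[0, 1, 2, 3, 4], n_pixels=100)
labels = (fg > np.median(fg)).astype(int)  # crude threshold in place of SMOPL's two-class confidences
```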


2013, Vol 655-657, pp. 1787-1790
Author(s): Sheng Chen Yu, Li Min Sun, Yang Xue, Hui Guo, Xiao Ju Wang, ...

An intrusion detection algorithm based on a support vector machine (SVM) with pre-extracted support vectors is proposed, combining the center distance ratio with the classification algorithm. Given proper thresholds, the pre-extracted support vectors can substitute for the full set of training examples. This reduces the size of the dataset and improves the SVM's detection rate and training time. Experimental results show that an intrusion detection system (IDS) based on an SVM with pre-extracted support vectors needs less training time while achieving the same detection performance.
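The pre-extraction step can be sketched with the centre-distance ratio: samples roughly equidistant from the two class centres lie near the decision boundary and are kept as candidate support vectors, while the rest are discarded before SVM training. The sketch below is not the paper's exact algorithm; it uses NumPy and scikit-learn, and the threshold and synthetic data are illustrative assumptions.

```python
# Sketch of pre-extracting likely support vectors via the centre-distance ratio.
# Threshold and synthetic data are illustrative assumptions.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

c0, c1 = X[y == 0].mean(axis=0), X[y == 1].mean(axis=0)   # class centres
d0 = np.linalg.norm(X - c0, axis=1)
d1 = np.linalg.norm(X - c1, axis=1)
ratio = np.minimum(d0, d1) / np.maximum(d0, d1)           # 1.0 = equidistant

threshold = 0.7                                           # assumed cut-off
keep = ratio >= threshold                                 # boundary-near samples only
print(f"kept {keep.sum()} of {len(X)} training examples")

clf = SVC(kernel="rbf", gamma="scale").fit(X[keep], y[keep])
print("accuracy on full set:", clf.score(X, y))
```

Raising the threshold keeps fewer samples and trains faster, at the risk of discarding true support vectors; the paper's point is that a properly chosen threshold preserves detection performance.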


2011, pp. 989-989
Author(s): William Uther, Dunja Mladenić, Massimiliano Ciaramita, Bettina Berendt, Aleksander Kołcz, ...
