scholarly journals Exploiting hierarchy in medical concept embedding*

JAMIA Open ◽  
2021 ◽  
Vol 4 (1) ◽  
Author(s):  
Anthony Finch ◽  
Alexander Crowell ◽  
Mamta Bhatia ◽  
Pooja Parameshwarappa ◽  
Yung-Chieh Chang ◽  
...  

Abstract Objective To construct and publicly release a set of medical concept embeddings for codes following the ICD-10 coding standard which explicitly incorporate hierarchical information from medical codes into the embedding formulation. Materials and Methods We trained concept embeddings using several new extensions to the Word2Vec algorithm using a dataset of approximately 600,000 patients from a major integrated healthcare organization in the Mid-Atlantic US. Our concept embeddings included additional entities to account for the medical categories assigned to codes by the Clinical Classification Software Revised (CCSR) dataset. We compare these results to sets of publicly released pretrained embeddings and alternative training methodologies. Results We found that Word2Vec models which included hierarchical data outperformed ordinary Word2Vec alternatives on tasks which compared naïve clusters to canonical ones provided by CCSR. Our Skip-Gram model with both codes and categories achieved 61.4% normalized mutual information with canonical labels in comparison to 57.5% with traditional Skip-Gram. In models operating on two different outcomes, we found that including hierarchical embedding data improved classification performance 96.2% of the time. When controlling for all other variables, we found that co-training embeddings improved classification performance 66.7% of the time. We found that all models outperformed our competitive benchmarks. Discussion We found significant evidence that our proposed algorithms can express the hierarchical structure of medical codes more fully than ordinary Word2Vec models, and that this improvement carries forward into classification tasks. As part of this publication, we have released several sets of pretrained medical concept embeddings using the ICD-10 standard which significantly outperform other well-known pretrained vectors on our tested outcomes.

2017 ◽  
Vol 13 (1) ◽  
pp. 155014771668368 ◽  
Author(s):  
Charissa Ann Ronao ◽  
Sung-Bae Cho

Human activity recognition has been gaining more and more attention from researchers in recent years, particularly with the use of widespread and commercially available devices such as smartphones. However, most of the existing works focus on discriminative classifiers while neglecting the inherent time-series and continuous characteristics of sensor data. To address this, we propose a two-stage continuous hidden Markov model framework, which also takes advantage of the innate hierarchical structure of basic activities. This kind of system architecture not only enables the use of different feature subsets on different subclasses, which effectively reduces feature computation overhead, but also allows for varying number of states and iterations. Experiments show that the hierarchical structure dramatically increases classification performance. We analyze the behavior of the accelerometer and gyroscope signals for each activity through graphs, and with added fine tuning of states and training iterations, the proposed method is able to achieve an overall accuracy of up to 93.18%, which is the best performance among the state-of-the-art classifiers for the problem at hand.


2021 ◽  
Vol 2021 ◽  
pp. 1-6
Author(s):  
Qiang Cai ◽  
Fenghai Li ◽  
Yifan Chen ◽  
Haisheng Li ◽  
Jian Cao ◽  
...  

Along with the strong representation of the convolutional neural network (CNN), image classification tasks have achieved considerable progress. However, majority of works focus on designing complicated and redundant architectures for extracting informative features to improve classification performance. In this study, we concentrate on rectifying the incomplete outputs of CNN. To be concrete, we propose an innovative image classification method based on Label Rectification Learning (LRL) through kernel extreme learning machine (KELM). It mainly consists of two steps: (1) preclassification, extracting incomplete labels through a pretrained CNN, and (2) label rectification, rectifying the generated incomplete labels by the KELM to obtain the rectified labels. Experiments conducted on publicly available datasets demonstrate the effectiveness of our method. Notably, our method is extensible which can be easily integrated with off-the-shelf networks for improving performance.


2003 ◽  
Vol 2 (1) ◽  
pp. 31-39 ◽  
Author(s):  
Frank van Ham ◽  
Jarke J. van Wijk

Beamtrees are a new method for the visualization of large hierarchical data sets, such as directory structures and organization structures. Nodes are shown as stacked circular beams such that both the hierarchical structure as well as the size of nodes are depicted. The dimensions of beams are calculated using a variation of the treemap algorithm. Both a two-dimensional and a three-dimensional variant are presented. A small user study indicated that beamtrees are significantly more effective than nested treemaps and cushion treemaps for the extraction of global hierarchical information.


2020 ◽  
Vol 38 (15_suppl) ◽  
pp. e18069-e18069
Author(s):  
Gboyega Adeboyeje ◽  
Kaushal Desai ◽  
Shahed Iqbal ◽  
Jinghua He ◽  
Matthew J. Monberg

e18069 Background: Historically, recurrence in ovarian cancer (OC) following first-line (1L) chemotherapy (CT) occurs in up to 80% of patients within 2 years. The Clinical Classification Software (CCS) systematically classifies thousands of ICD-9 codes into a smaller number of clinically meaningful categories. We sought to use CCS and other routinely collected variables to differentiate the clinical and demographic profiles of patients with good prognosis (GP) versus poor prognosis (PP) in the United States (US). Methods: This was a retrospective cohort study of newly diagnosed (FIGO stage II - IV), treatment-naïve patients, ≥ 66 years, who received 4-10 cycles of platinum-based 1L CT between Jan 2009 - Dec 2015 using the SEER-Medicare database, a nationally representative cancer registry. Patient were assumed to have progressed to a subsequent line of therapy following a gap between consecutive CT cycles ≥ 63 days. Patients were classified as GP if alive ≥4 years with no further treatment following 1L CT; PP was defined as receipt of ≥2L CT within 12 months of initial 1L CT. Demographic and prognostic characteristics were assessed during a 6-month baseline period prior to initiation of 1L CT. We assessed clinically meaningful differences in baseline characteristics with absolute standardized differences (ASD) using a threshold of 0.1 (indicating negligible difference between two cohorts). Results: There were a total of 2,262 patients (mean age: 74.6 ±6.2 years) including 251 GP (11%) and 209 PP (9%) patients (table below). PP patients were significantly more likely to be older than 70 years, and present at stage IV, liver disease and ascites, and anemia at diagnosis. PP patients were also less likely to have primary debulking surgery. Conclusions: Approximately one tenth of OC patients received no further treatment 4 years after the initial treatment with contemporary standard of care. GP may be differentiated from PP on the basis of commonly used clinical characteristics such as stage and also specific comorbidities such as liver disease and ascites. [Table: see text]


PLoS ONE ◽  
2017 ◽  
Vol 12 (7) ◽  
pp. e0175508 ◽  
Author(s):  
Wei-Qi Wei ◽  
Lisa A. Bastarache ◽  
Robert J. Carroll ◽  
Joy E. Marlo ◽  
Travis J. Osterman ◽  
...  

Perception ◽  
1994 ◽  
Vol 23 (5) ◽  
pp. 489-504 ◽  
Author(s):  
Ruth Kimchi

A distinction has previously been proposed between global properties, defined by their position in the hierarchical structure of the stimulus, and wholistic/configural properties defined as a function of interrelations among component parts. The processing consequences of this distinction were examined in five experiments. In experiments 1–4 configural properties (closure and intersection) were pitted against component properties (line orientation and direction of curvature) and the results showed that discrimination and classification performance was dominated by the configural properties. In experiment 5 the relative perceptual dominance of type of property (configural/nonconfigural) and level of pattern structure (global/local) was examined. The results showed that classifications based on the configural property of closure were not affected at all by the level of globality at which this property varied. Global advantage was observed only with classifications based on line orientation. Taken together, the present results suggest that configural properties dominate discrimination and classification of visual forms, whereas the perceptual advantage of the global level of structure depends critically on the type of properties present at the global and local levels. These findings are also discussed in relation to findings on texture perception, and it is suggested that the perceptual system may be characterized by a predisposition for configural properties.


2021 ◽  
Vol 15 ◽  
pp. 174830262110449
Author(s):  
Kai-Jun Hu ◽  
He-Feng Yin ◽  
Jun Sun

During the past decade, representation based classification method has received considerable attention in the community of pattern recognition. The recently proposed non-negative representation based classifier achieved superb recognition results in diverse pattern classification tasks. Unfortunately, discriminative information of training data is not fully exploited in non-negative representation based classifier, which undermines its classification performance in practical applications. To address this problem, we introduce a decorrelation regularizer into the formulation of non-negative representation based classifier and propose a discriminative non-negative representation based classifier for pattern classification. The decorrelation regularizer is able to reduce the correlation of representation results of different classes, thus promoting the competition among them. Experimental results on benchmark datasets validate the efficacy of the proposed discriminative non-negative representation based classifier, and it can outperform some state-of-the-art deep learning based methods. The source code of our proposed discriminative non-negative representation based classifier is accessible at https://github.com/yinhefeng/DNRC .


2020 ◽  
Author(s):  
Ying Bi ◽  
Bing Xue ◽  
Mengjie Zhang

IEEE Feature extraction is essential for solving image classification by transforming low-level pixel values into high-level features. However, extracting effective features from images is challenging due to high variations across images in scale, rotation, illumination, and background. Existing methods often have a fixed model complexity and require domain expertise. Genetic programming with a flexible representation can find the best solution without the use of domain knowledge. This paper proposes a new genetic programming-based approach to automatically learning informative features for different image classification tasks. In the new approach, a number of image-related operators, including filters, pooling operators and feature extraction methods, are employed as functions. A flexible program structure is developed to integrate different functions and terminals into a single tree/solution. The new approach can evolve solutions of variable depths to extract various numbers and types of features from the images. The new approach is examined on 12 different image classification tasks of varying difficulty and compared with a large number of effective algorithms. The results show that the new approach achieves better classification performance than most benchmark methods. The analysis of the evolved programs/solutions and the visualisation of the learned features provide deep insights on the proposed approach.


Author(s):  
Jianrong Qiu ◽  
David B. Logan ◽  
Jennifer Oxley ◽  
Christopher Lowe

This paper examines the effects of vehicular and operational characteristics on bus roadworthiness. The analysis was based on annual bus inspection data in Victoria, Australia, between 2014 and 2017, consisting of 17,630 inspections of 6,447 vehicles run by 252 operators. A multilevel modeling approach was employed to account for the hierarchical data structure where inspections are nested within vehicles and vehicles within operators. The results offered insights into the effects on bus roadworthiness of characteristics attributable to inspections, vehicles, and operators. The probability of failing an inspection was found to be positively associated with vehicle age and odometer reading. Vehicle make played an important role in roadworthiness outcome, with the performance of different makes varying significantly. Small operators carried the highest risk of failure and large operators the lowest, irrespective of the location of operation. The multilevel analysis revealed that 28.9% of the variation in inspection outcomes occurred across operators and 5.2% across vehicles, which verified the presence of the hierarchical structure. The findings from this study provide safety regulators with solid research evidence to formulate policies aimed at enhancing bus roadworthiness.


2019 ◽  
Vol 27 (5) ◽  
pp. 295-306
Author(s):  
Esteban Jaramillo-Cabrera ◽  
Eduardo F Morales ◽  
Jose Martinez-Carranza

Recent advances in deep learning, in particular in convolutional neural networks (CNNs), have been widely used in robotics for object classification and action recognition, among others, with very high performance. Nevertheless, this high performance, mostly in classification tasks, is rarely accompanied by reasoning processes that consider the relationships between objects, actions, and effects. In this article, we used three CNNs to classify objects, actions, and effects that were trained with the CERTH-SOR3D dataset that has more than 20,000 RGB-D videos. This dataset involves 14 objects, 13 actions, and in this article was augmented with seven effects. The probabilistic vector output of each trained CNN was combined into a Bayesian network (BN) to capture the relationships between objects, actions, and effects. It is shown that by probabilistically combining information from the three classifiers, it is possible to improve the classification performance of each CNN or to level the same performance with less training data. In particular, the recognition performance improved from 71.2% to 79.7% for actions, 85.0%–86.7% for objects, and 77.0%–82.1% for effects. In the article, it is also shown that with missing information, the model can still produce reasonable classification performance. In particular, the system can be used for reasoning purposes in robotics, as it can make action planning with information from object and effects or it can predict effects with information from objects and actions.


Sign in / Sign up

Export Citation Format

Share Document