Converging Semantic Knowledge and Deep Learning for Medical Coding

Author(s):  
Nuria Garcia-Santa ◽  
Beatriz San Miguel ◽  
Takanori Ugai

The field of medical coding enables to assign codes of medical classifications such as the international classification of diseases (ICD) to clinical notes, which are medical reports about patients' conditions written by healthcare professionals in natural language. These texts potentially include medical terms that define diagnosis, symptoms, drugs, treatments, etc., and the use of spontaneous language is challenging for automatic processing. Medical coding is usually performed manually by human medical coders becoming time-consuming and prone to errors. This research aims at developing new approaches that combine deep learning elements together with traditional technologies. A semantic-based proposal supported by a proprietary knowledge graph (KG), neural network implementations, and an ensemble model to resolve the medical coding are presented. A comparative discussion between the proposals where the advantages and disadvantages of each one is analysed. To evaluate approaches, two main corpus have been used: MIMIC-III and private de-identified clinical notes.

Author(s):  
Dimitra Petroudi ◽  
Athanasios Zekios

The introduction of information systems in health progressively led tï coding systems. The purposes of these systems are: recording causes of death, coding diseases and procedures, etc. The most important medical coding system in our days is ICD (International Classification of Diseases). Other coding systems that health professionals use are: SNOMED, LOINC, MeSH, UMLS, DSM, DRG and HCPCS. There are also many Nursing Classification Systems, such as: NANDA, NIC, NOC, ICNP, Omaha and HHCC. This chapter describes these coding systems and their advantages.


2022 ◽  
pp. 115-127
Author(s):  
Sagar Sudhir Dhobale ◽  
Sharda Bapat

ICD (international classification of diseases) is a system developed by the WHO in which every unique diagnosis and procedure has a unique code. It provides a standardized way to represent medical information and makes it sharable and comparable across different hospitals and countries. Currently, the task of assigning ICD codes to patient discharge summaries is performed manually by medical coders. Manual coding is costly, time consuming, and inefficient for huge data. So, the healthcare industry requires automated solutions to make the medical coding more efficient, accurate, and consistent. In this study, the automated ICD-9 coding is approached as a multi-label text classification problem. A deep learning system is presented to assign ICD-9 codes automatically to the patient discharge summaries. Convolutional neural networks and word2vec model are combined to automatically extract features from the input text. The best model has achieved 83.28% accuracy. The results of this research prove the usability of deep learning for multi-label text classification and medical coding.


1981 ◽  
Vol 26 (4) ◽  
pp. 240-243 ◽  
Author(s):  
J. Hoenig

There is a fundamental difference between nosology and a statistical classification, and the two should not be confused. The discipline of nosology uses scientific methods to arrive at a classification of psychiatric disorders and is concerned with the validity of its entities. A statistical classification aims to attain the widest compliance in spite of differences in the theoretical orientation of its users. It must therefore be atheoretical, and must represent a widely negotiated agreement between its future users. The most important statistical classification is the “International Classification of Diseases, Injuries and Causes of Death” (ICD-9) endorsed by the member states of the World Health Organization. The DSM III (Diagnostic and Statistical Manual), a newly accepted classification of the American Psychiatric Association, departs in many ways from the ICD-9, and Canada will have to decide whether adherence to ICD-9 should continue, or be replaced by the adoption of DSM III. Advantages and disadvantages of the DSM III are briefly discussed.


2021 ◽  
Vol 21 (S9) ◽  
Author(s):  
Shuyuan Hu ◽  
Fei Teng ◽  
Lufei Huang ◽  
Jun Yan ◽  
Haibo Zhang

Abstract Background Clinical notes are unstructured text documents generated by clinicians during patient encounters, generally are annotated with International Classification of Diseases (ICD) codes, which give formatted information about the diagnosis and treatment. ICD code has shown its potentials in many fields, but manual coding is labor-intensive and error-prone, lead to researches of automatic coding. Two specific challenges of this task are (1) given an annotated clinical notes, the reasons behind specific diagnoses and treatments are  implicit; (2) explainability is important for practical automatic coding method, the method should not only explain its prediction output but also have explainable internal mechanics. This study aims to develop an explainable CNN approach to address these two challenges. Method Our key idea is that for the automatic ICD coding task, the presence of informative snippets in the clinical text that correlated with each code plays an important role in the prediction of codes, and an informative snippet can be considered as a local and low-level feature. We infer that there exists a correspondence between a convolution filter and a local and low-level feature. Base on the inference, we come up with the Shallow and Wide Attention convolutional Mechanism (SWAM) to improve the CNN-based models’ ability to learn local and low-level features for each label. Results We evaluate our approach on MIMIC-III, an open-access dataset of ICU medical records. Our approach substantially outperforms previous results on top-50 medical code prediction on MIMIC-III dataset, the precision of the worst-performing 10% labels in previous works is increased from 0% to 53% on average. We attribute this improvement to SWAM, by which the wide architecture with attention mechanism gives the model ability to more extensively learn the unique features of different codes, and we prove it by an ablation experiment. Besides, we perform manual analysis of the performance imbalance between different codes, and preliminary conclude the characteristics that determine the difficulty of learning specific codes. Conclusions Our main contributions can be summarized into the following three: (1) We present local and low-level features, a.k.a. informative snippets play an important role in the automatic ICD coding task, and the informative snippets extracted from the clinical text provide explanations for each code. (2) We propose that there exists a correspondence between a convolution filter and a local and low-level feature. A combination of wide and shallow convolutional layer and attention layer can help the CNN-based models better learn local and low-level features. (3) We improved the precision of the worst-performing 10% labels from 0 to 53% on average.


2021 ◽  
Vol 21 (S6) ◽  
Author(s):  
James E. Harrison ◽  
Stefanie Weber ◽  
Robert Jakob ◽  
Christopher G. Chute

Abstract Background The International Classification of Diseases (ICD) has long been the main basis for comparability of statistics on causes of mortality and morbidity between places and over time. This paper provides an overview of the recently completed 11th revision of the ICD, focusing on the main innovations and their implications. Main text Changes in content reflect knowledge and perspectives on diseases and their causes that have emerged since ICD-10 was developed about 30 years ago. Changes in design and structure reflect the arrival of the networked digital era, for which ICD-11 has been prepared. ICD-11’s information framework comprises a semantic knowledge base (the Foundation), a biomedical ontology linked to the Foundation and classifications derived from the Foundation. ICD-11 for Mortality and Morbidity Statistics (ICD-11-MMS) is the primary derived classification and the main successor to ICD-10. Innovations enabled by the new architecture include an online coding tool (replacing the index and providing additional functions), an application program interface to enable remote access to ICD-11 content and services, enhanced capability to capture and combine clinically relevant characteristics of cases and integrated support for multiple languages. Conclusions ICD-11 was adopted by the World Health Assembly in May 2019. Transition to implementation is in progress. ICD-11 can be accessed at icd.who.int.


2015 ◽  
Vol 61 (2) ◽  
pp. 39-44 ◽  
Author(s):  
V A Peterkova ◽  
O V Vasyukova

This paper concerns classification of obesity in the children and adolescents, one of the debatable issues in modern pediatrics and pediatric endocrinology. The historical sketch of various classifications of obesity in the children and adolescents accepted in this country and abroad is presented with special reference to the advantages and disadvantages of each variant. The authors emphasize the difficulty of developing a unified classification of the multifactor disease being considered. A new classification of obesity in the children and adolescents is proposed that takes into consideration the etiological aspects, complications, co-morbid conditions, and the degree of obesity. The possible variants of diagnosis formulation taking account of the present-day international classification of diseases are discussed.


2020 ◽  
Vol 10 (15) ◽  
pp. 5262 ◽  
Author(s):  
Elias Moons ◽  
Aditya Khanna ◽  
Abbas Akkasi ◽  
Marie-Francine Moens

In this survey, we discuss the task of automatically classifying medical documents into the taxonomy of the International Classification of Diseases (ICD), by the use of deep neural networks. The literature in this domain covers different techniques. We will assess and compare the performance of those techniques in various settings and investigate which combination leverages the best results. Furthermore, we introduce an hierarchical component that exploits the knowledge of the ICD taxonomy. All methods and their combinations are evaluated on two publicly available datasets that represent ICD-9 and ICD-10 coding, respectively. The evaluation leads to a discussion of the advantages and disadvantages of the models.


2021 ◽  
Vol 21 (S2) ◽  
Author(s):  
Weidong Bao ◽  
Hongfei Lin ◽  
Yijia Zhang ◽  
Jian Wang ◽  
Shaowu Zhang

Abstract Background Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text. Methods In this paper, we propose a hybrid capsule network model. Specifically, we use bi-directional LSTM (Bi-LSTM) with forwarding and backward directions to merge the information from both sides of the sequence. The label embedding framework embeds the text and labels together to leverage the label information. We then use a dynamic routing algorithm in the capsule network to extract valuable features for medical code prediction task. Results We applied our model to the task of automatic medical codes assignment to clinical notes and conducted a series of experiments based on MIMIC-III data. The experimental results show that our method achieves a micro F1-score of 67.5% on MIMIC-III dataset, which outperforms the other state-of-the-art methods. Conclusions The proposed model employed the dynamic routing algorithm and label embedding framework can effectively capture the important features across sentences. Both Capsule networks and domain knowledge are helpful for medical code prediction task.


Author(s):  
Jessica W. M. Wong ◽  
Friedrich M. Wurst ◽  
Ulrich W. Preuss

Abstract. Introduction: With advances in medicine, our understanding of diseases has deepened and diagnostic criteria have evolved. Currently, the most frequently used diagnostic systems are the ICD (International Classification of Diseases) and the DSM (Diagnostic and Statistical Manual of Mental Disorders) to diagnose alcohol-related disorders. Results: In this narrative review, we follow the historical developments in ICD and DSM with their corresponding milestones reflecting the scientific research and medical considerations of their time. The current diagnostic concepts of DSM-5 and ICD-11 and their development are presented. Lastly, we compare these two diagnostic systems and evaluate their practicability in clinical use.


Author(s):  
Timo D. Vloet ◽  
Marcel Romanos

Zusammenfassung. Hintergrund: Nach 12 Jahren Entwicklung wird die 11. Version der International Classification of Diseases (ICD-11) von der Weltgesundheitsorganisation (WHO) im Januar 2022 in Kraft treten. Methodik: Im Rahmen eines selektiven Übersichtsartikels werden die Veränderungen im Hinblick auf die Klassifikation von Angststörungen von der ICD-10 zur ICD-11 zusammenfassend dargestellt. Ergebnis: Die diagnostischen Kriterien der generalisierten Angststörung, Agoraphobie und spezifischen Phobien werden angepasst. Die ICD-11 wird auf Basis einer Lebenszeitachse neu organisiert, sodass die kindesaltersspezifischen Kategorien der ICD-10 aufgelöst werden. Die Trennungsangststörung und der selektive Mutismus werden damit den „regulären“ Angststörungen zugeordnet und können zukünftig auch im Erwachsenenalter diagnostiziert werden. Neu ist ebenso, dass verschiedene Symptomdimensionen der Angst ohne kategoriale Diagnose verschlüsselt werden können. Diskussion: Die Veränderungen im Bereich der Angsterkrankungen umfassen verschiedene Aspekte und sind in der Gesamtschau nicht unerheblich. Positiv zu bewerten ist die Einführung einer Lebenszeitachse und Parallelisierung mit dem Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Schlussfolgerungen: Die entwicklungsbezogene Neuorganisation in der ICD-11 wird auch eine verstärkte längsschnittliche Betrachtung von Angststörungen in der Klinik sowie Forschung zur Folge haben. Damit rückt insbesondere die Präventionsforschung weiter in den Fokus.


Sign in / Sign up

Export Citation Format

Share Document