Templated Text Synthesis for Expert-Guided Multi-Label Extraction from Radiology Reports

2021 ◽  
Vol 3 (2) ◽  
pp. 299-317
Author(s):  
Patrick Schrempf ◽  
Hannah Watson ◽  
Eunsoo Park ◽  
Maciej Pajak ◽  
Hamish MacKinnon ◽  
...  

Training medical image analysis models traditionally requires large amounts of expertly annotated imaging data, which is time-consuming and expensive to obtain. One solution is to automatically extract scan-level labels from radiology reports. Previously, we showed that, by extending BERT with a per-label attention mechanism, we can train a single model to perform automatic extraction of many labels in parallel. However, if we rely on pure data-driven learning, the model sometimes fails to learn critical features or learns the correct answer via simplistic heuristics (e.g., that “likely” indicates positivity), and thus fails to generalise to rarer cases which have not been learned or where the heuristics break down (e.g., “likely represents prominent VR space or lacunar infarct”, which indicates uncertainty over two differential diagnoses). In this work, we propose template creation for data synthesis, which enables us to inject expert knowledge about unseen entities from medical ontologies, and to teach the model rules on how to label difficult cases, by producing relevant training examples. Using this technique alongside domain-specific pre-training for our underlying BERT architecture, i.e., PubMedBERT, we improve F1 micro from 0.903 to 0.939 and F1 macro from 0.512 to 0.737 on an independent test set for 33 labels in head CT reports for stroke patients. Our methodology offers a practical way to combine domain knowledge with machine learning for text classification tasks.
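The template-synthesis idea can be illustrated with a minimal sketch. The findings, certainty phrases, and template wording below are hypothetical stand-ins (the paper's actual templates and label set are not reproduced here); the point is the mechanism: expanding slot-filled templates into labelled training examples, including examples that encode the differential-diagnosis rule the abstract mentions.

```python
import itertools

# Hypothetical slot values; real templates would draw findings from a
# medical ontology and cover all 33 report labels.
FINDINGS = {"acute infarct": "ischaemia", "subdural haematoma": "haemorrhage"}
CERTAINTY = {"definite evidence of": "positive",
             "no evidence of": "negative",
             "possible": "uncertain"}

def synthesise_examples():
    """Expand template/slot combinations into (text, label, value) triples."""
    examples = []
    # Single-finding templates: one certainty phrase per finding.
    for finding, label in FINDINGS.items():
        for phrase, value in CERTAINTY.items():
            examples.append((f"There is {phrase} {finding}.", label, value))
    # Differential-diagnosis rule: two findings joined by "or" should be
    # labelled "uncertain" for BOTH labels, not "positive" for either.
    for (f1, l1), (f2, l2) in itertools.combinations(FINDINGS.items(), 2):
        text = f"Likely represents {f1} or {f2}."
        examples.append((text, l1, "uncertain"))
        examples.append((text, l2, "uncertain"))
    return examples
```

Synthetic examples generated this way would be mixed into the real training reports, letting the model see rule-governed cases that are rare in natural data.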

2022 ◽  
Author(s):  
Jakob Nikolas Kather ◽  
Narmin Ghaffari Laleh ◽  
Sebastian Foersch ◽  
Daniel Truhn

The text-guided diffusion model GLIDE (Guided Language to Image Diffusion for Generation and Editing) is the state of the art in text-to-image generative artificial intelligence (AI). GLIDE has rich representations, but medical applications of this model have not been systematically explored. If GLIDE had useful medical knowledge, it could be used for medical image analysis tasks, a domain in which AI systems are still highly engineered towards a single use case. Here we show that the publicly available GLIDE model has reasonably strong representations of key topics in cancer research and oncology, in particular the general style of histopathology images and multiple facets of diseases, pathological processes and laboratory assays. However, GLIDE seems to lack useful representations of the style and content of radiology data. Our findings demonstrate that domain-agnostic generative AI models can learn relevant medical concepts without explicit training. Thus, GLIDE and similar models might be useful for medical image processing tasks in the future, particularly with additional domain-specific fine-tuning.


Algorithms ◽  
2021 ◽  
Vol 14 (7) ◽  
pp. 212
Author(s):  
Youssef Skandarani ◽  
Pierre-Marc Jodoin ◽  
Alain Lalande

Deep learning methods are the de facto solutions to a multitude of medical image analysis tasks. Cardiac MRI segmentation is one such application, which, like many others, requires a large amount of annotated data so that a trained network can generalize well. Unfortunately, having medical experts manually curate a large number of images is both slow and expensive. In this paper, we set out to explore whether expert knowledge is a strict requirement for the creation of annotated data sets on which machine learning can successfully be trained. To do so, we gauged the performance of three segmentation models, namely U-Net, Attention U-Net, and ENet, trained with different loss functions on expert and non-expert ground truth for cardiac cine-MRI segmentation. Evaluation was done with classic segmentation metrics (Dice index and Hausdorff distance) as well as clinical measurements, such as the ventricular ejection fractions and the myocardial mass. The results reveal that the generalization performance of a segmentation neural network trained on non-expert ground truth data is, for all practical purposes, as good as that of one trained on expert ground truth data, particularly when the non-expert receives a decent level of training, highlighting an opportunity for the efficient and cost-effective creation of annotations for cardiac data sets.
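The two families of evaluation measures the abstract names can be sketched briefly. This is a generic illustration, not the paper's evaluation code: the Dice index compares two binary masks, and the ejection fraction is the standard clinical formula from end-diastolic and end-systolic volumes (the volume values below are made up).

```python
import numpy as np

def dice_index(pred, truth):
    """Dice similarity coefficient between two binary segmentation masks."""
    pred, truth = np.asarray(pred, bool), np.asarray(truth, bool)
    inter = np.logical_and(pred, truth).sum()
    denom = pred.sum() + truth.sum()
    # Convention: two empty masks are a perfect match.
    return 2.0 * inter / denom if denom else 1.0

def ejection_fraction(edv, esv):
    """Ventricular ejection fraction (%) from end-diastolic and
    end-systolic volumes: EF = (EDV - ESV) / EDV * 100."""
    return 100.0 * (edv - esv) / edv
```

Clinical measures such as the ejection fraction can agree between expert and non-expert masks even when the voxel-level Dice index differs, which is why the paper reports both kinds of metric.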


Author(s):  
Uga Sproģis ◽  
Matīss Rikters

We present the Latvian Twitter Eater Corpus - a set of tweets in the narrow domain related to food, drinks, eating and drinking. The corpus has been collected over a time span of more than 8 years and includes over 2 million tweets accompanied by additional useful data. We also separate out two sub-corpora: question-and-answer tweets and sentiment-annotated tweets. We analyse the contents of the corpus and demonstrate use cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models using the data from the corpus.
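Separating a question sub-corpus from a tweet collection can be done with a simple surface heuristic, as sketched below. The criterion shown (presence of a question mark) is an assumption for illustration; the corpus authors' actual selection rules may differ.

```python
def split_subcorpora(tweets):
    """Split a list of tweet records into a question sub-corpus and the rest.
    Heuristic sketch: a tweet containing '?' is treated as a question."""
    questions = [t for t in tweets if "?" in t["text"]]
    rest = [t for t in tweets if "?" not in t["text"]]
    return questions, rest
```

Question tweets paired with their replies could then serve as training data for a domain-specific question-answering model, and a manually labelled subset as data for sentiment analysis.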


2017 ◽  
Author(s):  
Marilena Oita ◽  
Antoine Amarilli ◽  
Pierre Senellart

Deep Web databases, whose content is presented as dynamically-generated Web pages hidden behind forms, have mostly been left unindexed by search engine crawlers. In order to automatically explore this mass of information, many current techniques assume the existence of domain knowledge, which is costly to create and maintain. In this article, we present a new perspective on form understanding and deep Web data acquisition that does not require any domain-specific knowledge. Unlike previous approaches, we do not perform the various steps in the process (e.g., form understanding, record identification, attribute labeling) independently but integrate them to achieve a more complete understanding of deep Web sources. Through information extraction techniques and using the form itself for validation, we reconcile input and output schemas in a labeled graph which is further aligned with a generic ontology. The impact of this alignment is threefold: first, the resulting semantic infrastructure associated with the form can assist Web crawlers when probing the form for content indexing; second, attributes of response pages are labeled by matching known ontology instances, and relations between attributes are uncovered; and third, we enrich the generic ontology with facts from the deep Web.


2016 ◽  
Author(s):  
Maia A. Smith ◽  
Cydney Nielsen ◽  
Fong Chun Chan ◽  
Andrew McPherson ◽  
Andrew Roth ◽  
...  

Inference of clonal dynamics and tumour evolution has fundamental importance in understanding the major clinical endpoints in cancer: development of treatment resistance, relapse and metastasis. DNA sequencing technology has made measuring clonal dynamics through mutation analysis accessible at scale, facilitating computational inference of informative patterns of interest. However, currently no tools allow biomedical experts to meaningfully interact with the often complex and voluminous dataset to inject domain knowledge into the inference process. We developed an interactive, web-based visual analytics software suite called E-scape which supports dynamically linked, multi-faceted views of cancer evolution data. Developed using R and the JavaScript D3.js library, the suite includes three tools: TimeScape and MapScape for visualizing population dynamics over time and space, respectively, and CellScape for visualizing evolution at single-cell resolution. The tool suite integrates phylogenetic, clonal prevalence, mutation and imaging data to generate intuitive, dynamically linked views of data which update in real time as a function of user actions. The system supports visualization of both point mutations and copy number alterations, rendering how mutations distribute across clones in both bulk and single-cell experiment data in multiple representations including phylogenies, heatmaps, growth trajectories, spatial distributions and mutation tables. E-scape is open source and is freely available to the community at large.


Author(s):  
Mohan Sridharan ◽  
Tiago Mota

Our architecture uses non-monotonic logical reasoning with incomplete commonsense domain knowledge, and incremental inductive learning, to guide the construction of deep network models from a small number of training examples. Experimental results in the context of a robot reasoning about the partial occlusion of objects and the stability of object configurations in simulated images indicate an improvement in reliability and a reduction in computational effort in comparison with an architecture based just on deep networks.


2005 ◽  
Vol 19 (2) ◽  
pp. 57-77 ◽  
Author(s):  
Gregory J. Gerard

Most database textbooks on conceptual modeling do not cover domain-specific patterns. The texts emphasize notation, apparently assuming that notation enables individuals to correctly model domain-specific knowledge acquired from experience. However, the domain knowledge acquired may not aid in the construction of conceptual models if it is not structured to support conceptual modeling. This study uses the Resources Events Agents (REA) pattern as an example of a domain-specific pattern that can be encoded as a knowledge structure for conceptual modeling of accounting information systems (AIS), and tests its effects on the accuracy of conceptual modeling in a familiar business setting. Fifty-three undergraduate and forty-six graduate students completed recall tasks designed to measure REA knowledge structure. The accuracy of participants' conceptual models was positively related to REA knowledge structure. Results suggest it is insufficient to know only conceptual modeling notation, because structured knowledge of domain-specific patterns reduces design errors.
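The structure of the REA pattern can be made concrete with a small sketch. The entity and event names below are illustrative examples, not taken from the study's materials; the pattern itself pairs economic events (which affect resources) with the agents who participate in them, and links give-and-take events through duality.

```python
from dataclasses import dataclass

@dataclass
class Resource:
    name: str          # what is exchanged (e.g., inventory, cash)

@dataclass
class Agent:
    name: str          # who participates (e.g., customer, clerk)

@dataclass
class EconomicEvent:
    name: str
    resource: Resource  # stock-flow: the resource the event affects
    provider: Agent     # agent giving up the resource
    recipient: Agent    # agent receiving the resource

# Duality: a revenue-cycle "give" event is paired with a "take" event.
sale = EconomicEvent("Sale", Resource("Inventory"),
                     Agent("Sales Clerk"), Agent("Customer"))
receipt = EconomicEvent("Cash Receipt", Resource("Cash"),
                        Agent("Customer"), Agent("Cashier"))
duality = (sale, receipt)
```

A modeler who has internalized this structure can map a narrative business case onto resources, events, and agents directly, which is the knowledge-structure effect the study measures.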


Author(s):  
Sebastian Günther

Internal DSLs are a special kind of DSL that uses an existing programming language as its host. In this chapter, the author explains an iterative development process for internal DSLs. The goals of this process are: (1) to give developers a familiar environment in which they can use known and proven development steps, techniques, tools, and host languages, (2) to provide a set of repeatable, iterative steps that support the continuous adaptation and evolution of the domain knowledge and the DSL implementation, and (3) to apply design principles that help to develop DSLs with essential properties and to use host-language-independent design patterns to plan and communicate the design and implementation of the DSL. The process consists of three development steps (analysis, language design, and language implementation) and applies four principles: open form, agile and test-driven development, design pattern knowledge, and design principle knowledge.
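What "using an existing programming language as the host" means in practice can be shown with a minimal sketch (the recipe domain and method names are invented for illustration). Plain host-language constructs, here Python method chaining, give the syntax of a small embedded language.

```python
class Recipe:
    """Tiny internal DSL embedded in Python: chained method calls read
    like domain statements while remaining ordinary host-language code."""
    def __init__(self, name):
        self.name = name
        self.steps = []

    def add(self, ingredient, amount):
        self.steps.append(f"add {amount} {ingredient}")
        return self   # returning self enables the fluent, chainable style

    def bake(self, minutes):
        self.steps.append(f"bake {minutes} min")
        return self

# A DSL "program": reads like a recipe, executes as normal Python.
bread = Recipe("bread").add("flour", "500g").add("water", "300ml").bake(40)
```

Because the DSL is just host-language code, the familiar toolchain (editor, debugger, test framework) applies unchanged, which is exactly goal (1) of the process described above.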


Science ◽  
2018 ◽  
Vol 362 (6419) ◽  
pp. 1140-1144 ◽  
Author(s):  
David Silver ◽  
Thomas Hubert ◽  
Julian Schrittwieser ◽  
Ioannis Antonoglou ◽  
Matthew Lai ◽  
...  

The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play. In this paper, we generalize this approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games. Starting from random play and given no domain knowledge except the game rules, AlphaZero convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.
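The tabula-rasa self-play loop at the core of this approach can be sketched generically. This is a structural illustration only: the stand-in random move choice below replaces AlphaZero's MCTS-guided network policy, and the game interface (the function parameters) is hypothetical.

```python
import random

def self_play_episode(initial, legal_moves, apply_move, is_terminal, outcome):
    """Skeleton of one self-play game: play to the end with the current
    policy (here uniform random, standing in for the MCTS-guided network)
    and label every visited (state, move) pair with the final outcome z,
    producing training targets for the value and policy heads."""
    state, history = initial, []
    while not is_terminal(state):
        move = random.choice(legal_moves(state))   # policy stand-in
        history.append((state, move))
        state = apply_move(state, move)
    z = outcome(state)                             # +1 / 0 / -1 in practice
    return [(s, m, z) for s, m in history]
```

Repeating such episodes, training the network on the collected triples, and then generating new games with the improved network is the reinforcement loop that needs no domain knowledge beyond the game rules.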


Author(s):  
Gour C. Karmakar ◽  
Laurence Dooley ◽  
Mahbubhur Rahman Syed

This chapter provides a comprehensive overview of fuzzy logic-based image segmentation techniques. Fuzzy image segmentation techniques outperform conventional techniques because they can evaluate imprecise data and are more robust in noisy environments. Fuzzy clustering methods need the number of clusters to be set prior to segmentation and are sensitive to the initialization of cluster centers. Fuzzy rule-based segmentation techniques can incorporate domain expert knowledge, manipulate numerical as well as linguistic data, and draw partial inferences using fuzzy IF-THEN rules; they have been applied intensively in medical imaging. These rules are, however, application-domain specific and very difficult to define, either manually or automatically, in a way that can complete the segmentation alone. Fuzzy geometry and thresholding-based image segmentation techniques are best suited to bimodal images; they can be applied to multimodal images, but they do not produce good results for images that contain a significant number of overlapping pixels between background and foreground regions. A few techniques based on fuzzy integrals and soft computing have been published and appear to offer considerable promise.
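The fuzzy clustering family the chapter surveys is typified by fuzzy c-means, whose membership update can be sketched as below for 1-D intensity data. This is the standard FCM membership formula, u_ik = 1 / Σ_j (d_ik / d_ij)^(2/(m-1)), not code from the chapter; note how the cluster centers must be supplied up front, which is the initialization sensitivity mentioned above.

```python
import numpy as np

def fcm_memberships(data, centers, m=2.0):
    """Fuzzy c-means membership update for 1-D intensities.
    Returns an (N, C) matrix: each row gives a pixel's degrees of
    belonging to the C clusters, and sums to 1."""
    # Distances from each pixel to each center (epsilon avoids 0/0).
    d = np.abs(data[:, None] - centers[None, :]) + 1e-12
    power = 2.0 / (m - 1.0)
    # u[i, k] = 1 / sum_j (d[i, k] / d[i, j]) ** power
    u = 1.0 / np.sum((d[:, :, None] / d[:, None, :]) ** power, axis=2)
    return u
```

Unlike hard thresholding, these graded memberships let a pixel belong partially to both background and foreground, which is what makes fuzzy methods better behaved on noisy or overlapping intensity distributions.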

