Identifying QT prolongation from ECG impressions using a general-purpose Natural Language Processor

2009 · Vol 78 · pp. S34–S42
Author(s):  
Joshua C. Denny ◽  
Randolph A. Miller ◽  
Lemuel Russell Waitman ◽  
Mark A. Arrieta ◽  
Joshua F. Peterson
2021 · Vol 27 (6) · pp. 763–778
Author(s):  
Kenneth Ward Church ◽  
Zeyu Chen ◽  
Yanjun Ma

Abstract: The previous Emerging Trends article (Church et al., 2021. Natural Language Engineering 27(5), 631–645) introduced deep nets to poets. "Poets" is an imperfect metaphor, intended as a gesture toward inclusion. The future of deep nets will benefit from reaching out to a broad audience of potential users, including people with little or no programming skill and little interest in training models. That paper focused on inference: the use of pre-trained models, as is, without fine-tuning. The goal of this paper is to make fine-tuning more accessible to a broader audience. Since fine-tuning is more challenging than inference, the examples in this paper require modest programming skills, as well as access to a GPU. Fine-tuning starts with a general-purpose base (foundation) model and uses a small training set of labeled data to produce a model for a specific downstream application. There are many examples of fine-tuning in natural language processing (e.g., question answering with SQuAD and the GLUE benchmark), as well as in vision and speech.
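The abstract's recipe (a frozen general-purpose base plus a small labeled set for the downstream task) can be sketched in miniature. This is not the paper's code; it is an illustrative toy in which a fixed random projection stands in for a pre-trained base model and only a small logistic-regression head is trained:

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen "base model": a fixed projection standing in for a pre-trained
# network. During fine-tuning of a head, these weights are NOT updated.
W_base = rng.normal(size=(10, 4))

def base_features(x):
    return np.tanh(x @ W_base)

# Tiny labeled training set for the downstream task (binary labels).
X = rng.normal(size=(32, 10))
y = (X[:, 0] > 0).astype(float)

# Trainable task head: logistic regression on top of the frozen features.
w, b = np.zeros(4), 0.0

def loss(w, b):
    p = 1 / (1 + np.exp(-(base_features(X) @ w + b)))
    return -np.mean(y * np.log(p + 1e-9) + (1 - y) * np.log(1 - p + 1e-9))

initial = loss(w, b)
for _ in range(200):              # plain gradient descent, head parameters only
    H = base_features(X)
    p = 1 / (1 + np.exp(-(H @ w + b)))
    w -= 0.5 * (H.T @ (p - y) / len(y))
    b -= 0.5 * np.mean(p - y)

final = loss(w, b)                # training loss drops as the head adapts
```

In real fine-tuning the base is a large pre-trained network (and some or all of its layers may also be updated at a small learning rate), but the division of labor shown here, generic frozen representation plus small task-specific trainable part, is the same.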


Author(s):  
Michaela Regneri ◽  
Marcus Rohrbach ◽  
Dominikus Wetzel ◽  
Stefan Thater ◽  
Bernt Schiele ◽  
...  

Recent work has shown that integrating visual information into text-based models can substantially improve model predictions, but so far only visual information extracted from static images has been used. In this paper, we consider the problem of grounding sentences that describe actions in visual information extracted from videos. We present a general-purpose corpus that aligns high-quality videos with multiple natural language descriptions of the actions portrayed in the videos, together with an annotation of how similar the action descriptions are to each other. Experimental results demonstrate that a text-based model of similarity between actions improves substantially when combined with visual information from videos depicting the described actions.


Radiographics · 2010 · Vol 30 (7) · pp. 2039–2048
Author(s):  
Bao H. Do ◽  
Andrew Wu ◽  
Sandip Biswal ◽  
Aya Kamaya ◽  
Daniel L. Rubin

2019 · Vol 2 (1)
Author(s):  
Graham Neubig ◽  
Patrick Littell ◽  
Chian-Yu Chen ◽  
Jean Lee ◽  
Zirui Li ◽  
...  

Language documentation is inherently a time-intensive process; transcription, glossing, and corpus management consume a significant portion of documentary linguists’ work. Advances in natural language processing can help to accelerate this work, using the linguists’ past decisions as training material, but questions remain about how to prioritize human involvement. In this extended abstract, we describe the beginnings of a new project that will attempt to ease this language documentation process through the use of natural language processing (NLP) technology. It is based on (1) methods to adapt NLP tools to new languages, based on recent advances in massively multilingual neural networks, and (2) backend APIs and interfaces that allow linguists to upload their data (§2). We then describe our current progress on two fronts: automatic phoneme transcription, and glossing (§3). Finally, we briefly describe our future directions (§4).


Author(s):  
Yudong Zhang ◽  
Wenhao Zheng ◽  
Ming Li

Semantic feature learning for natural language and programming language is a preliminary step in addressing many software mining tasks. Many existing methods leverage lexical and syntactic information to learn features for textual data. However, such information is inadequate to represent the full semantics of either a text sentence or a code snippet. This motivates us to propose a new approach that learns semantic features for both languages by extracting three levels of information from textual data: global, local, and sequential. For tasks involving both modalities, we project both types of data into a uniform feature space so that the complementary knowledge between them can be exploited in their representations. In this paper, we build a novel, general-purpose feature learning framework called UniEmbed that uniformly learns comprehensive semantic representations for both natural language and programming language. Experimental results on three real-world software mining tasks show that UniEmbed outperforms state-of-the-art models in feature learning and demonstrate the capacity and effectiveness of our model.
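The abstract does not specify UniEmbed's architecture, but the core idea of projecting two modalities of different dimensionality into one shared space, where a single similarity measure applies across them, can be sketched generically. The projection matrices here are random stand-ins for what would in practice be learned encoders:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical dimensionalities: text features, code features, shared space.
text_dim, code_dim, shared_dim = 300, 128, 64

# Per-modality projections into the shared space (learned in a real system;
# random here purely for illustration).
P_text = rng.normal(size=(text_dim, shared_dim)) / np.sqrt(text_dim)
P_code = rng.normal(size=(code_dim, shared_dim)) / np.sqrt(code_dim)

def embed(features, P):
    """Project modality-specific features into the shared space, L2-normalized."""
    z = features @ P
    return z / np.linalg.norm(z, axis=-1, keepdims=True)

text_vec = embed(rng.normal(size=(1, text_dim)), P_text)
code_vec = embed(rng.normal(size=(1, code_dim)), P_code)

# Once both modalities live in the same space, one cosine similarity
# compares a sentence against a code snippet directly.
similarity = float(text_vec @ code_vec.T)
```

The value of a shared space is exactly this: downstream tasks that mix text and code (e.g., code search from a natural language query) reduce to nearest-neighbor lookups under a single metric.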

