Towards a General-Purpose Linguistic Annotation Backend

Graham Neubig; Patrick Littell; Chian-Yu Chen; Jean Lee; Zirui Li; Yu-Hsiang Lin; Yuyan Zhang

doi:10.33011/computel.v2i.437

Towards a General-Purpose Linguistic Annotation Backend

Mapping Intimacies ◽

10.33011/computel.v2i.437 ◽

2019 ◽

Vol 2 (1) ◽

Author(s):

Graham Neubig ◽

Patrick Littell ◽

Chian-Yu Chen ◽

Jean Lee ◽

Zirui Li ◽

...

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

General Purpose ◽

Language Documentation ◽

Training Material ◽

Future Directions ◽

Recent Advances ◽

Linguistic Annotation

Language documentation is inherently a time-intensive process; transcription, glossing, and corpus management consume a significant portion of documentary linguists’ work. Advances in natural language processing can help to accelerate this work, using the linguists’ past decisions as training material, but questions remain about how to prioritize human involvement. In this extended abstract, we describe the beginnings of a new project that will attempt to ease this language documentation process through the use of natural language processing (NLP) technology. It is based on (1) methods to adapt NLP tools to new languages, based on recent advances in massively multilingual neural networks, and (2) backend APIs and interfaces that allow linguists to upload their data (§2). We then describe our current progress on two fronts: automatic phoneme transcription, and glossing (§3). Finally, we briefly describe our future directions (§4).

Recent advances in processing negation

Natural Language Engineering ◽

10.1017/s1351324920000534 ◽

2020 ◽

pp. 1-10

Author(s):

Roser Morante ◽

Eduardo Blanco

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Future Directions ◽

Rule Based ◽

Computational Approaches ◽

Recent Advances ◽

Linguistic Phenomenon

Negation is a complex linguistic phenomenon present in all human languages. It can be seen as an operator that transforms an expression into another expression whose meaning is in some way opposed to the original expression. In this article, we survey previous work on negation with an emphasis on computational approaches. We start by defining negation and two important concepts: scope and focus of negation. Then, we survey work in natural language processing that considers negation primarily as a means to improve the results in some task. We also provide information about corpora containing negation annotations in English and other languages, which usually include a combination of annotations of negation cues, scopes, foci, and negated events. We continue the survey with a description of automated approaches to process negation, ranging from early rule-based systems to systems built with traditional machine learning and neural networks. Finally, we conclude with some reflections on current progress and future directions.

Optimization of Recurrent Neural Networks on Natural Language Processing

Proceedings of the 2019 8th International Conference on Computing and Pattern Recognition ◽

10.1145/3373509.3373573 ◽

2019 ◽

Cited By ~ 2

Author(s):

Jingyu Huang ◽

Yunfei Feng

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Recurrent Neural Networks

A Survey on Bias in Deep NLP

Applied Sciences ◽

10.3390/app11073184 ◽

2021 ◽

Vol 11 (7) ◽

pp. 3184

Author(s):

Ismael Garrido-Muñoz ◽

Arturo Montejo-Ráez ◽

Fernando Martínez-Santiago ◽

L. Alfonso Ureña-López

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Natural Language Processing ◽

Probability Distribution ◽

Natural Language ◽

Network Design ◽

Language Processing ◽

Deep Neural Networks ◽

Learning Processes ◽

Relevant Issue

Deep neural networks are hegemonic approaches to many machine learning areas, including natural language processing (NLP). Thanks to the availability of large corpora collections and the capability of deep architectures to shape internal language mechanisms in self-supervised learning processes (also known as “pre-training”), versatile and performing models are released continuously for every new network design. These networks, somehow, learn a probability distribution of words and relations across the training collection used, inheriting the potential flaws, inconsistencies and biases contained in such a collection. As pre-trained models have been found to be very useful approaches to transfer learning, dealing with bias has become a relevant issue in this new scenario. We introduce bias in a formal way and explore how it has been treated in several networks, in terms of detection and correction. In addition, available resources are identified and a strategy to deal with bias in deep NLP is proposed.

Recent Advances in Conversational Intelligent Tutoring Systems

AI Magazine ◽

10.1609/aimag.v34i3.2485 ◽

2013 ◽

Vol 34 (3) ◽

pp. 42-54 ◽

Cited By ~ 54

Author(s):

Vasile Rus ◽

Sidney D’Mello ◽

Xiangen Hu ◽

Arthur Graesser

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Intelligent Tutoring Systems ◽

Intelligent Tutoring ◽

Individual Student ◽

Learning Progressions ◽

Tutoring Systems ◽

Recent Advances ◽

Processing Techniques

We report recent advances in intelligent tutoring systems with conversational dialogue. We highlight progress in terms of macro- and microadaptivity. Macroadaptivity refers to a system’s capability to select appropriate instructional tasks for the learner to work on. Microadaptivity refers to a system’s capability to adapt its scaffolding while the learner is working on a particular task. The advances in macro- and microadaptivity that are presented here were made possible by the use of learning progressions, deeper dialogue and natural language processing techniques, and by the use of affect-enabled components. Learning progressions and deeper dialogue and natural language processing techniques are key features of DeepTutor, the first intelligent tutoring system based on learning progressions. These improvements extend the bandwidth of possibilities for tailoring instruction to each individual student, which is needed for maximizing engagement and, ultimately, learning.

Natural Language Processing with Subsymbolic Neural Networks

Neural Network Perspectives on Cognition and Adaptive Robotics ◽

10.1201/9780367813239-8 ◽

2019 ◽

pp. 120-139

Author(s):

Risto Miikkulainen

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Prediction of Emergency Department Hospital Admission Based on Natural Language Processing and Neural Networks

Methods of Information in Medicine ◽

10.3414/me17-01-0024 ◽

2017 ◽

Vol 56 (05) ◽

pp. 377-389 ◽

Cited By ~ 21

Author(s):

Xingyu Zhang ◽

Joyce Kim ◽

Rachel E. Patzer ◽

Stephen R. Pitts ◽

Aaron Patzer ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Emergency Department ◽

Logistic Regression ◽

Natural Language Processing ◽

Natural Language ◽

Hospital Admission ◽

Language Processing ◽

Predictive Accuracy ◽

Free Text

Objective: To describe and compare logistic regression and neural network modeling strategies to predict hospital admission or transfer following initial presentation to Emergency Department (ED) triage with and without the addition of natural language processing elements.

Methods: Using data from the National Hospital Ambulatory Medical Care Survey (NHAMCS), a cross-sectional probability sample of United States EDs from 2012 and 2013 survey years, we developed several predictive models with the outcome being admission to the hospital or transfer vs. discharge home. We included patient characteristics immediately available after the patient has presented to the ED and undergone a triage process. We used this information to construct logistic regression (LR) and multilayer neural network models (MLNN) which included natural language processing (NLP) and principal component analysis from the patient’s reason for visit. Ten-fold cross validation was used to test the predictive capacity of each model and receiver operating curves (AUC) were then calculated for each model.

Results: Of the 47,200 ED visits from 642 hospitals, 6,335 (13.42%) resulted in hospital admission (or transfer). A total of 48 principal components were extracted by NLP from the reason for visit fields, which explained 75% of the overall variance for hospitalization. In the model including only structured variables, the AUC was 0.824 (95% CI 0.818-0.830) for logistic regression and 0.823 (95% CI 0.817-0.829) for MLNN. Models including only free-text information generated AUC of 0.742 (95% CI 0.731-0.753) for logistic regression and 0.753 (95% CI 0.742-0.764) for MLNN. When both structured variables and free text variables were included, the AUC reached 0.846 (95% CI 0.839-0.853) for logistic regression and 0.844 (95% CI 0.836-0.852) for MLNN.

Conclusions: The predictive accuracy of hospital admission or transfer for patients who presented to ED triage overall was good, and was improved with the inclusion of free text data from a patient’s reason for visit regardless of modeling approach. Natural language processing and neural networks that incorporate patient-reported outcome free text may increase predictive accuracy for hospital admission.
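
For readers unfamiliar with the AUC figures quoted in this abstract, the following is a minimal sketch of how an AUC value can be computed from predicted scores. It uses the rank-based definition (the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half). The function name `auc` and the toy labels and scores are illustrative assumptions; they are not the study's code or data.

```python
def auc(labels, scores):
    """Rank-based AUC: fraction of (positive, negative) pairs in which
    the positive example gets the higher score; ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up toy data: 1 = admitted, 0 = discharged (hypothetical).
toy_labels = [1, 0, 1, 0, 0, 1]
toy_scores = [0.9, 0.4, 0.6, 0.7, 0.2, 0.8]
toy_auc = auc(toy_labels, toy_scores)
print(round(toy_auc, 3))
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking. The pairwise loop is quadratic in the number of cases, so a sort-based implementation would be preferable for a sample the size of the one described above.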

Word Recognition as a First Step Towards Natural Language Processing with Artificial Neural Networks

Konnektionismus in Artificial Intelligence und Kognitionsforschung - Informatik-Fachberichte ◽

10.1007/978-3-642-76070-9_27 ◽

1990 ◽

pp. 221-225 ◽

Cited By ~ 1

Author(s):

Renate Deffner ◽

Klaus Eder ◽

Hans Geiger

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Natural Language Processing ◽

Word Recognition ◽

Natural Language ◽

Language Processing ◽

Artificial Neural

Advances in Computational Linguistics and Text Processing Frameworks

Advances in Computer and Electrical Engineering - Handbook of Research on Engineering Innovations and Technology Management in Organizations ◽

10.4018/978-1-7998-2772-6.ch012 ◽

2020 ◽

pp. 217-244

Author(s):

Ayush Srivastav ◽

Hera Khan ◽

Amit Kumar Mishra

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Computational Linguistics ◽

Language Processing ◽

Text Processing ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Part Of Speech

The chapter provides an eloquent account of the major methodologies and advances in the field of Natural Language Processing. The most popular models used over time for Natural Language Processing are discussed, along with their applications to specific tasks. The chapter begins with the fundamental concepts of regex and tokenization. It provides insight into text preprocessing and its methodologies, such as Stemming and Lemmatization and Stop Word Removal, followed by Part-of-Speech tagging and Named Entity Recognition. Further, the chapter elaborates on the concept of Word Embedding, its various types, and some common frameworks such as word2vec, GloVe, and fastText. A brief description of classification algorithms used in Natural Language Processing is provided next, followed by Neural Networks and their advanced forms, such as Recursive Neural Networks and Seq2seq models, that are used in Computational Linguistics. A brief description of chatbots and Memory Networks concludes the chapter.
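
As a rough illustration of the first preprocessing steps this abstract names (regex-based tokenization and stop word removal), here is a minimal sketch using only Python's standard `re` module. The regular expression, the tiny `STOP_WORDS` set, and the helper names `tokenize` and `remove_stop_words` are assumptions made for this example; they are not taken from the chapter.

```python
import re

# Illustrative stop word list only; real lists are much longer.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "to", "in"}


def tokenize(text):
    """Lowercase the text and return runs of letters or digits,
    keeping a simple apostrophe suffix such as "don't"."""
    return re.findall(r"[a-z0-9]+(?:'[a-z]+)?", text.lower())


def remove_stop_words(tokens):
    """Drop tokens that appear in the stop word set."""
    return [t for t in tokens if t not in STOP_WORDS]


sentence = "The chapter begins with the fundamental concepts of regex and tokenization."
tokens = tokenize(sentence)
content_tokens = remove_stop_words(tokens)
print(content_tokens)
```

Stemming, lemmatization, tagging, and entity recognition usually rely on language-specific resources, so they are left out of this sketch.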

A Matter of Perspective

Legal Regulations, Implications, and Issues Surrounding Digital Data - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-7998-3130-3.ch010 ◽

2020 ◽

pp. 182-202

Author(s):

Katie Miller

Keyword(s):

Artificial Intelligence ◽

Neural Networks ◽

Human Rights ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Equal Opportunity ◽

Facial Recognition ◽

Phone Service

The challenge presented is an age when some decisions are made by humans, some are made by AI, and some are made by a combination of AI and humans. For the person refused housing, a phone service, or employment, the experience is the same, but the ability to understand what has happened and obtain a remedy may be very different if the discrimination is attributable to, or contributed to by, an AI system. If we are to preserve the policy intentions of our discrimination, equal opportunity, and human rights laws, we need to understand how discrimination arises in AI systems; how design in AI systems can mitigate such discrimination; and whether our existing laws are adequate to address discrimination in AI. This chapter endeavours to provide this understanding. In doing so, it focuses on narrow but advanced forms of artificial intelligence, such as natural language processing, facial recognition, and cognitive neural networks.

Artificial Neural Networks and Natural Language Processing

Encyclopedia of Library and Information Science, Fourth Edition ◽

10.1081/e-elis4-120008648 ◽

2017 ◽

pp. 279-292

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Artificial Neural
