Natural Language Processing and Futures Studies

Because the input for Futures Studies is to a very high degree formulated as written words and texts, methods which automate the processing of texts can substantially help Futures Studies. At Shaping Tomorrow, we have developed a software system using Natural Language Processing (NLP), a subfield of Artificial Intelligence, which automatically analyzes publicly available texts and extracts future-relevant data from theses texts. This process can be used to study the futures. This article discusses this software system, explains how it works with a detailed example, and shows real-life applications and visualizations of the resulting data. The current state of this method is just the first step; a number of technological improvements and their possible benefits are explained. The implications of using this software system for the field of Futures Studies are mostly positive, but there are also a number of caveats.

Download Full-text

Designing and Validating an Annotation Model of Dynamic Modality for English and Spanish: Issues and Problems

10.29007/pc58 ◽

2018 ◽

Author(s):

Julia Lavid ◽

Marta Carretero ◽

Juan Rafael Zamorano

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Large Scale ◽

Reliability Study ◽

Annotation Scheme ◽

High Degree ◽

Difficult Cases

In this paper we set forth an annotation model for dynamic modality in English and Spanish, given its relevance not only for contrastive linguistic purposes, but also for its impact on practical annotation tasks in the Natural Language Processing (NLP) community. An annotation scheme is proposed, which captures both the functional-semantic meanings and the language-specific realisations of dynamic meanings in both languages. The scheme is validated through a reliability study performed on a randomly selected set of one hundred and twenty sentences from the MULTINOT corpus, resulting in a high degree of inter-annotator agreement. We discuss our main findings and give attention to the difficult cases as they are currently being used to develop detailed guidelines for the large-scale annotation of dynamic modality in English and Spanish.

Download Full-text

Automation of Software System Development Using Natural Language Processing and Two-Level Grammar

Radical Innovations of Software and Systems Engineering in the Future - Lecture Notes in Computer Science ◽

10.1007/978-3-540-24626-8_15 ◽

2004 ◽

pp. 219-233 ◽

Cited By ~ 4

Author(s):

Beum-Seuk Lee ◽

Barrett R. Bryant

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

System Development ◽

Software System

Download Full-text

Anomaly detection of software system logs based on natural language processing

2018 International Conference on Image and Video Processing, and Artificial Intelligence ◽

10.1117/12.2513857 ◽

2018 ◽

Author(s):

Mengying Wang ◽

Lili Guo ◽

Lele Xu

Keyword(s):

Natural Language Processing ◽

Anomaly Detection ◽

Natural Language ◽

Language Processing ◽

Software System ◽

System Logs

Download Full-text

Clustering by Similarity of Brazilian Legal Documents Using Natural Language Processing Approaches

10.5772/intechopen.99875 ◽

2021 ◽

Author(s):

Raphael Souza de Oliveira ◽

Erick Giovani Sperandio Nascimento

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Legal Proceedings ◽

Legal Documents ◽

Current State ◽

Document Frequency ◽

Reference File ◽

Judicial Proceedings ◽

Cosine Distance

The Brazilian legal system postulates the expeditious resolution of judicial proceedings. However, legal courts are working under budgetary constraints and with reduced staff. As a way to face these restrictions, artificial intelligence (AI) has been tackling many complex problems in natural language processing (NLP). This work aims to detect the degree of similarity between judicial documents that can be achieved in the inference group using unsupervised learning, by applying three NLP techniques, namely term frequency-inverse document frequency (TF-IDF), Word2Vec CBoW, and Word2Vec Skip-gram, the last two being specialized with a Brazilian language corpus. We developed a template for grouping lawsuits, which is calculated based on the cosine distance between the elements of the group to its centroid. The Ordinary Appeal was chosen as a reference file since it triggers legal proceedings to follow to the higher court and because of the existence of a relevant contingent of lawsuits awaiting judgment. After the data-processing steps, documents had their content transformed into a vector representation, using the three NLP techniques. We notice that specialized word-embedding models—like Word2Vec—present better performance, making it possible to advance in the current state of the art in the area of NLP applied to the legal sector.

Download Full-text

Natural language interfaces to databases

The Knowledge Engineering Review ◽

10.1017/s0269888900005476 ◽

1990 ◽

Vol 5 (4) ◽

pp. 225-249 ◽

Cited By ~ 52

Author(s):

Ann Copestake ◽

Karen Sparck Jones

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

State Of The Art ◽

Central Process ◽

Current State ◽

Natural Language Question ◽

The One ◽

Language Question ◽

And Task

AbstractThis paper reviews the current state of the art in natural language access to databases. This has been a long-standing area of work in natural language processing. But though some commercial systems are now available, providing front ends has proved much harder than was expected, and the necessary limitations on front ends have to be recognized. The paper discusses the issues, both general to language and task-specific, involved in front end design, and the way these have been addressed, concentrating on the work of the last decade. The focus is on the central process of translating a natural language question into a database query, but other supporting functions are also covered. The points are illustrated by the use of a single example application. The paper concludes with an evaluation of the current state, indicating that future progress will depend on the one hand on general advances in natural language processing, and on the other on expanding the capabilities of traditional databases.

Download Full-text

Natural Language Processing and Machine Learning Techniques Help Achieve a Better Medical Practice

Medical Applications of Intelligent Data Analysis - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-4666-1803-9.ch016 ◽

2012 ◽

pp. 237-254

Author(s):

Oana Frunza ◽

Diana Inkpen

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Medical Practice ◽

Clinical Decision Making ◽

Medical Information ◽

Real Life ◽

Data Sets ◽

Intelligent Tools

This book chapter presents several natural language processing (NLP) and machine learning (ML) techniques that can help achieve a better medical practice by means of extracting relevant medical information from the wealth of textual data. The chapter describes three major tasks: building intelligent tools that can help in the clinical decision making, tools that can automatically identify relevant medical information from the life-science literature, and tools that can extract semantic relations between medical concepts. Besides introducing and describing these tasks, methodological settings accompanied by representative results obtained on real-life data sets are presented.

Download Full-text

OR13-07 Crystal Bone: Personalized, Short-Term Fracture Risk Prediction with Natural Language Processing Methods

Journal of the Endocrine Society ◽

10.1210/jendso/bvaa046.018 ◽

2020 ◽

Vol 4 (Supplement_1) ◽

Author(s):

Yasmeen Almog ◽

Angshu Rai ◽

Anirban Mishra ◽

Amanda Moulaison ◽

Ross Powell ◽

...

Keyword(s):

Natural Language Processing ◽

High Risk ◽

Natural Language ◽

Fracture Risk ◽

Risk Prediction ◽

Language Processing ◽

Assessment Tools ◽

Short Term ◽

Fracture Risk Prediction ◽

Very High

Abstract Fragility fractures due to osteoporosis are common and are associated with significant clinical, personal, and economic burden. Even after a fragility fracture, osteoporosis remains widely underdiagnosed and undertreated. Common fracture risk assessment tools, such as FRAX1 and Garvan,2 confer risk over the long term but do not provide short-term risk estimates necessary to identify very high-risk patients likely to fracture in the next 1–2 years. Furthermore, these tools utilize cross-sectional data representing a subset of all available clinical risk factors for risk prediction. Thus, these methods are generalized across patient populations and may not fully utilize patient histories commonly found in electronic health records (EHRs) that contain temporal information for thousands of unique features. The Optum® de-identified EHR dataset (2007–2018) provides an opportunity to use historical medical data to generate short-term, personalized fracture risk predictions for individual patients. We used the Optum® dataset to develop Crystal Bone, a method that applies machine learning techniques commonly used in natural language processing to the temporal nature of patient histories in order to predict fracture risk over a 1- to 2-year timeframe. Specifically, we repurposed deep-learning models typically applied to language-based prediction tasks in which the goal is to learn the meanings of words and sentences to classify them. Crystal Bone uses context-based embedding techniques to learn an equivalent “semantic” meaning of various medical events. Similar to how language models predict the next word in a given sentence or the topic of an overall document, Crystal Bone can predict that a patient’s future trajectory may contain a fracture or that the “signature” of the patient’s overall journey is similar to that of a typical fracture patient. We applied Crystal Bone to two datasets, one enriched for fracture patients and one representative of a typical hospital system. In both datasets, when predicting likelihood of fracture in the next 1–2 years, Crystal Bone had an area under the receiver operating characteristic (AUROC) score ranging from 72% to 83% on a test (hold-out) dataset. These results suggest performance similar to that of FRAX and Garvan, which have 10-year fracture risk prediction AUROC scores of 64.4% +/- 3.7%.3 Our results suggest that it is possible to use each patient’s unique medical history as it changes over time to predict patients at risk for fracture in 1–2 years. Furthermore, it is theoretically possible to integrate a model like Crystal Bone directly into an EHR system, enabling “hands-off” fracture risk prediction, which could lead to improved identification of patients at very high risk for fracture. 1Kanis JA, Osteoporos Int. 2012;23:2239–56. 2Rubin KH, J Bone Miner Res. 2013;28:1701–17. 3Leslie WD, Osteoporos Int. 2014;25:1–21.

Download Full-text

A Systematic Literature Review of Natural Language Processing: Current State, Challenges and Risks

Proceedings of the Future Technologies Conference (FTC) 2020, Volume 1 - Advances in Intelligent Systems and Computing ◽

10.1007/978-3-030-63128-4_49 ◽

2020 ◽

pp. 634-647

Author(s):

Eghbal Ghazizadeh ◽

Pengxiang Zhu

Keyword(s):

Natural Language Processing ◽

Literature Review ◽

Natural Language ◽

Language Processing ◽

Systematic Literature Review ◽

Current State

Download Full-text

A Brief Survey of Question Answering Systems

International Journal of Artificial Intelligence & Applications ◽

10.5121/ijaia.2021.12501 ◽

2021 ◽

Vol 12 (5) ◽

pp. 01-07

Author(s):

Michael Caballero

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Question Answering ◽

Open Domain ◽

Knowledge Based ◽

Current State ◽

Introductory Overview ◽

Building Systems ◽

Question Answering Systems

Question Answering (QA) is a subfield of Natural Language Processing (NLP) and computer science focused on building systems that automatically answer questions from humans in natural language. This survey summarizes the history and current state of the field and is intended as an introductory overview of QA systems. After discussing QA history, this paper summarizes the different approaches to the architecture of QA systems -- whether they are closed or open-domain and whether they are text-based, knowledge-based, or hybrid systems. Lastly, some common datasets in this field are introduced and different evaluation metrics are discussed.

Download Full-text

Relationship Between Artificial Intelligence and the Engagement Variable

10.53902/jpssr.2021.01.000515 ◽

2021 ◽

Vol 1 (3) ◽

Author(s):

García Navarro C

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Real Life ◽

New Technique ◽

Final Conclusion ◽

New Techniques ◽

A New Technique ◽

The One

The purpose of this research is to learn how engagement has been measured so far and what new techniques will be used to measure it in the future. To this end, firstly, there is a review of all the current research in engagement has been conducted, in addition to a review of the current traditional techniques used to measure it. Secondly, the concept of Artificial Intelligence has been analyzed and how one of its most common techniques (Natural Language Processing) is starting to be used as a new technique to measure engagement. Once the traditional and new techniques had been presented, a theoretical differentiation was made between them in order to test the benefits of the latter. The main conclusions were that Artificial Intelligence is increasing its fields of action, specifically in the psychology of organizations. In this field, the new techniques allow companies to save time in the administration and the conduction of surveys. Moreover, the data reported by AI is less biased than the one that comes from surveys, since the data is collected directly and these techniques do not bias the employee when answering the items. As a final conclusion, it is proposed that a study be carried out to compare the results of both techniques in real-life companies.

Download Full-text