Computer Science Meets Education: Natural Language Processing for Automatic Grading of Open-Ended Questions in eBooks

We investigated how Natural Language Processing (NLP) algorithms could automatically grade answers to open-ended inference questions in web-based eBooks. This is a component of research on making reading more motivating to children and to increasing their comprehension. We obtained and graded a set of answers to open-ended questions embedded in a fiction novel written in English. Computer science students used a subset of the graded answers to develop algorithms designed to grade new answers to the questions. The algorithms utilized the story text, existing graded answers for a given question and publicly accessible databases in grading new responses. A computer science professor used another subset of the graded answers to evaluate the students’ NLP algorithms and to select the best algorithm. The results showed that the best algorithm correctly graded approximately 85% of the real-world answers as correct, partly correct, or wrong. The best NLP algorithm was trained with questions and graded answers from a series of new text narratives in another language, Slovenian. The resulting NLP algorithm model was successfully used in fourth-grade language arts classes for providing feedback to student answers on open-ended questions in eBooks.

Download Full-text

1243-P: Novel Use of Natural Language Processing to Identify Reasons for Insulin Discontinuation in Patients with T2DM: A Real-World Evidence Study

Diabetes ◽

10.2337/db19-1243-p ◽

2019 ◽

Vol 68 (Supplement 1) ◽

pp. 1243-P

Author(s):

JIANMIN WU ◽

FRITHA J. MORRISON ◽

ZHENXIANG ZHAO ◽

XUANYAO HE ◽

MARIA SHUBINA ◽

...

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Real World ◽

Real World Evidence

Download Full-text

UGLEO: A WEB BASED INTELLIGENCE CHATBOT FOR STUDENT ADMISSION PORTAL USING MEGAHAL STYLE

Jurnal Ilmiah Informatika Komputer ◽

10.35760/ik.2018.v23i3.2373 ◽

2018 ◽

Vol 23 (3) ◽

pp. 175-191

Author(s):

Anneke Annassia Putri Siswadi ◽

Avinanta Tarigan

Keyword(s):

Markov Chain ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Information Need ◽

Web Based ◽

Markov Chain Method ◽

Information Center

To fulfill the prospective student's information need about student admission, Gunadarma University has already many kinds of services which are time limited, such as website, book, registration place, Media Information Center, and Question Answering’s website (UG-Pedia). It needs a service that can serve them anytime and anywhere. Therefore, this research is developing the UGLeo as a web based QA intelligence chatbot application for Gunadarma University's student admission portal. UGLeo is developed by MegaHal style which implements the Markov Chain method. In this research, there are some modifications in MegaHal style, those modifications are the structure of natural language processing and the structure of database. The accuracy of UGLeo reply is 65%. However, to increase the accuracy there are some improvements to be applied in UGLeo system, both improvement in natural language processing and improvement in MegaHal style.

Download Full-text

Using NLP for Fact Checking: A Survey

Designs ◽

10.3390/designs5030042 ◽

2021 ◽

Vol 5 (3) ◽

pp. 42

Author(s):

Eric Lazarski ◽

Mahmood Al-Khassaweneh ◽

Cynthia Howard

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Computer Science ◽

Language Processing ◽

The Internet ◽

Fake News ◽

Fact Checking ◽

The Many ◽

Human Powered ◽

The Web

In recent years, disinformation and “fake news” have been spreading throughout the internet at rates never seen before. This has created the need for fact-checking organizations, groups that seek out claims and comment on their veracity, to spawn worldwide to stem the tide of misinformation. However, even with the many human-powered fact-checking organizations that are currently in operation, disinformation continues to run rampant throughout the Web, and the existing organizations are unable to keep up. This paper discusses in detail recent advances in computer science to use natural language processing to automate fact checking. It follows the entire process of automated fact checking using natural language processing, from detecting claims to fact checking to outputting results. In summary, automated fact checking works well in some cases, though generalized fact checking still needs improvement prior to widespread use.

Download Full-text

INDRA-IPM: interactive pathway modeling using natural language with automated assembly

Bioinformatics ◽

10.1093/bioinformatics/btz289 ◽

2019 ◽

Vol 35 (21) ◽

pp. 4501-4503 ◽

Cited By ~ 9

Author(s):

Petar V Todorov ◽

Benjamin M Gyori ◽

John A Bachman ◽

Peter K Sorger

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Web Service ◽

Language Processing ◽

Source Code ◽

Supplementary Information ◽

Expression Data ◽

Supplementary Data ◽

Automated Assembly ◽

Web Based

Abstract Summary INDRA-IPM (Interactive Pathway Map) is a web-based pathway map modeling tool that combines natural language processing with automated model assembly and visualization. INDRA-IPM contextualizes models with expression data and exports them to standard formats. Availability and implementation INDRA-IPM is available at: http://pathwaymap.indra.bio. Source code is available at http://github.com/sorgerlab/indra_pathway_map. The underlying web service API is available at http://api.indra.bio:8000. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Semi-Automatic De-identification of Hospital Discharge Summaries with Natural Language Processing: A Case-Study of Performance and Real-World Usability

2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) ◽

10.1109/ithings-greencom-cpscom-smartdata.2017.169 ◽

2017 ◽

Author(s):

Ioan Calapodescu ◽

David Rozier ◽

Svetlana Artemova ◽

Jean-Luc Bosson

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Hospital Discharge ◽

Language Processing ◽

Real World ◽

Discharge Summaries

Download Full-text

Web-based models for natural language processing

ACM Transactions on Speech and Language Processing ◽

10.1145/1075389.1075392 ◽

2005 ◽

Vol 2 (1) ◽

pp. 3 ◽

Cited By ~ 51

Author(s):

Mirella Lapata ◽

Frank Keller

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Web Based

Download Full-text

PNS266 LANDSCAPE ANALYSIS OF IMPACT OF MACHINE LEARNING, NATURAL LANGUAGE PROCESSING, ARTIFICIAL INTELLIGENCE AND BLOCKCHAIN TECHNOLOGY ON LEVERAGING REAL WORLD EVIDENCE (RWE)

Value in Health ◽

10.1016/j.jval.2019.04.1621 ◽

2019 ◽

Vol 22 ◽

pp. S332

Author(s):

M. Garg

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Real World ◽

Landscape Analysis ◽

Blockchain Technology ◽

Real World Evidence

Download Full-text

Simulation and Visualization of Concept Algebra in MATLAB

International Journal of Software Science and Computational Intelligence ◽

10.4018/ijssci.2014010103 ◽

2014 ◽

Vol 6 (1) ◽

pp. 30-55 ◽

Cited By ~ 7

Author(s):

Xiaoyu Lin ◽

Yingxu Wang

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Mathematical Models ◽

Language Processing ◽

Real World ◽

Formal Knowledge ◽

Design And Implementation ◽

Visualization Software ◽

Formal Concepts ◽

Algebraic Operations

Concept algebra (CA) is a denotational mathematics for formal knowledge manipulation and natural language processing. In order to explicitly demonstrate the mathematical models of formal concepts and their algebraic operations in CA, a simulation and visualization software is developed in the MATLAB environment known as the Visual Simulator of Concept Algebra (VSCA). This paper presents the design and implementation of VSCA and the theories underpinning its development. Visual simulations for the sets of reproductive and compositional operations of CA are demonstrated by real-world examples throughout the elaborations of CA and VSCA.

Download Full-text

Natural Language Processing and Biological Methods

Encyclopedia of Artificial Intelligence ◽

10.4018/978-1-59904-849-9.ch171 ◽

2011 ◽

pp. 1173-1178

Author(s):

Gemma Bel Enguix ◽

M. Dolores Jiménez López

Keyword(s):

Natural Language Processing ◽

Molecular Biology ◽

Natural Language ◽

Computer Science ◽

Genetic Code ◽

20Th Century ◽

Language Processing ◽

Dna Sequences ◽

Theoretical Computer Science ◽

Automated Generation

During the 20th century, biology—especially molecular biology—has become a pilot science, so that many disciplines have formulated their theories under models taken from biology. Computer science has become almost a bio-inspired field thanks to the great development of natural computing and DNA computing. From linguistics, interactions with biology have not been frequent during the 20th century. Nevertheless, because of the “linguistic” consideration of the genetic code, molecular biology has taken several models from formal language theory in order to explain the structure and working of DNA. Such attempts have been focused in the design of grammar-based approaches to define a combinatorics in protein and DNA sequences (Searls, 1993). Also linguistics of natural language has made some contributions in this field by means of Collado (1989), who applied generativist approaches to the analysis of the genetic code. On the other hand, and only from theoretical interest a strictly, several attempts of establishing structural parallelisms between DNA sequences and verbal language have been performed (Jakobson, 1973, Marcus, 1998, Ji, 2002). However, there is a lack of theory on the attempt of explaining the structure of human language from the results of the semiosis of the genetic code. And this is probably the only arrow that remains incomplete in order to close the path between computer science, molecular biology, biosemiotics and linguistics. Natural Language Processing (NLP) –a subfield of Artificial Intelligence that concerns the automated generation and understanding of natural languages— can take great advantage of the structural and “semantic” similarities between those codes. Specifically, taking the systemic code units and methods of combination of the genetic code, the methods of such entity can be translated to the study of natural language. Therefore, NLP could become another “bio-inspired” science, by means of theoretical computer science, that provides the theoretical tools and formalizations which are necessary for approaching such exchange of methodology. In this way, we obtain a theoretical framework where biology, NLP and computer science exchange methods and interact, thanks to the semiotic parallelism between the genetic code and natural language.

Download Full-text