natural language text
Recently Published Documents


TOTAL DOCUMENTS

234
(FIVE YEARS 77)

H-INDEX

17
(FIVE YEARS 1)

2022 ◽  
Vol 40 (1) ◽  
pp. 1-43
Author(s):  
Ruqing Zhang ◽  
Jiafeng Guo ◽  
Lu Chen ◽  
Yixing Fan ◽  
Xueqi Cheng

Question generation is an important yet challenging problem in Artificial Intelligence (AI), which aims to generate natural and relevant questions from various input formats, e.g., natural language text, structure database, knowledge base, and image. In this article, we focus on question generation from natural language text, which has received tremendous interest in recent years due to the widespread applications such as data augmentation for question answering systems. During the past decades, many different question generation models have been proposed, from traditional rule-based methods to advanced neural network-based methods. Since there have been a large variety of research works proposed, we believe it is the right time to summarize the current status, learn from existing methodologies, and gain some insights for future development. In contrast to existing reviews, in this survey, we try to provide a more comprehensive taxonomy of question generation tasks from three different perspectives, i.e., the types of the input context text, the target answer, and the generated question. We take a deep look into existing models from different dimensions to analyze their underlying ideas, major design principles, and training strategies We compare these models through benchmark tasks to obtain an empirical understanding of the existing techniques. Moreover, we discuss what is missing in the current literature and what are the promising and desired future directions.


Author(s):  
Kristin Stock ◽  
Christopher B. Jones ◽  
Shaun Russell ◽  
Mansi Radke ◽  
Prarthana Das ◽  
...  

2021 ◽  
Author(s):  
Baosheng Yin ◽  
Yifei Sun

Abstract As an important part of information extraction, relationship extraction aims to extract the relationships between given entities from natural language text. On the basis of the pre-training model R-BERT, this paper proposes an entity relationship extraction method that integrates entity dependency path and pre-training model, which generates a dependency parse tree by dependency parsing, obtains the dependency path of entity pair via a given entity, and uses entity dependency path to exclude such information as modifier chunks and useless entities in sentences. This model has achieved good F1 value performance on the SemEval2010 Task 8 dataset. Experiments on dataset show that dependency parsing can provide context information for models and improve performance.


2021 ◽  
Author(s):  
Ganesh N. Jorvekar ◽  
Mohit Gangwar

In recent years, the number of user comments and text materials has increased dramatically. Analysis of the emotions has drawn interest from researchers. Earlier research in the field of artificial-intelligence concentrate on identification of emotion and exploring the explanation the emotions can’t recognized or misrecognized. The association between the emotions leads to the understanding of emotion loss. In this Work we are trying to fill the gap between emotional recognition and emotional co-relation mining through social media reviews of natural language text. The association between emotions, represented as the emotional uncertainty and evolution, is mainly triggered by cognitive bias in the human emotion. Numerous types of features and Recurrent neural-network (RNN) as deep learning model provided to mine the emotion co-relation from emotion detection using text. The rule on conflict of emotions is derived on a symmetric basis. TF-IDF, NLP Features and Co-relation features has used for feature extraction as well as section and Recurrent Neural Network (RNN) and Hybrid deep learning algorithm for classification has used to demonstrates the entire research experiments. Finally evaluate the system performance with various existing system and show the effectiveness of proposed system.


2021 ◽  
Author(s):  
Johannes Lindén ◽  
Tingting Zhang ◽  
Stefan Forsström ◽  
Patrik Österberg

Information extraction is a task that can extract meta-data information from text. The research in this article proposes a new information extraction algorithm called GenerateIE. The proposed algorithm identifies pairs of entities and relations described in a piece of text. The extracted meta-data is useful in many areas, but within this research the focus is to use them in news-media contexts to provide the gist of the written articles for analytics and paraphrasing of news information. GenerateIE algorithm is compared with existing state of the art algorithms with two benefits. Firstly, the GenerateIE provides the co-referenced word as the entity instead of using he, she, it, etc. which is more beneficial for knowledge graphs. Secondly GenerateIE can be applied on multiple languages without changing the algorithm itself apart from the underlying natural language text-parsing. Furthermore, the performance of GenerateIE compared with state-of-the-art algorithms is not significantly better, but it offers competitive results.


2021 ◽  
Vol 3 (2) ◽  
pp. 1-4
Author(s):  
Kieran Tranter

This brief editorial focuses on the contribution in this volume titled ‘Machines Will Never Replace Humans!’ compiled by GPT-3. The brief text is provocative. It is provocative in demonstrating the potential efficiencies and complexities of machine-produced natural language text for ‘writing’ professions like law and the academy. It is further provocative as it reflects back the image and representation of the human within the digital. There is a denotive suggestion that humans are valuable and significant as lawyers because they possess intuition. There is a further suggestion that humans, or more precisely the imprint of humans in the digital, are televisual consumers of dated sitcoms, revealing the disconnect between existent digital archives and the totality of humanity.


Author(s):  
Dov Dori ◽  
Ahmad Jbara ◽  
Yongkai E. Yang ◽  
Andrew M. Liu ◽  
Charles M. Oman

Objective We define and demonstrate the use of OPM-TA—a model-based task analysis (TA) framework that uses object-process methodology (OPM) ISO 19450 as a viable alternative to traditional TA techniques. Background A variety of different TA methods exist in human factors engineering, and several of them are often applied successively for a broad task representation, making it difficult to follow. Method Using OPM-TA, we modeled how an International Space Station (ISS) astronaut would support extravehicular activities using the existing robotic arm workstation with a new control panel and an electronic procedure system. The modeling employed traditional TA methods and the new OPM-TA approach, enabling a comparison between them. Results While the initial stages of modeling with OPM-TA follow those of traditional TA, OPM-TA modeling yields an executable and logically verifiable model of the entire human–robot system. Both OPM’s hierarchical set of diagrams and the equivalent, automatically generated statements in a subset of natural language text specify how objects and processes relate to each other at increasingly detailed levels. The graphic and textual OPM modalities specify the system’s architecture, which enables its function and benefits its users. To verify the model logical correctness model, we executed it using OPM’s simulation capability. Conclusion OPM-TA was able to unify traditional TA methods and expand their capabilities. The formal yet intuitive OPM-TA approach fuses and extends traditional TA methods, which are not amenable to simulation. It therefore can potentially become a widely used means for TA and human–machine procedure development and testing.


2021 ◽  
Vol 39 (3) ◽  
pp. 121-128
Author(s):  
Chulho Kim

Natural language processing (NLP) is a computerized approach to analyzing text that explores how computers can be used to understand and manipulate natural language text or speech to do useful things. In healthcare field, these NLP techniques are applied in a variety of applications, ranging from evaluating the adequacy of treatment, assessing the presence of the acute illness, and the other clinical decision support. After converting text into computer-readable data through the text preprocessing process, an NLP can extract valuable information using the rule-based algorithm, machine learning, and neural network. We can use NLP to distinguish subtypes of stroke or accurately extract critical clinical information such as severity of stroke and prognosis of patients, etc. If these NLP methods are actively utilized in the future, they will be able to make the most of the electronic health records to enable optimal medical judgment.


2021 ◽  
Vol 8 ◽  
Author(s):  
Zhi Hong ◽  
J. Gregory Pauloski ◽  
Logan Ward ◽  
Kyle Chard ◽  
Ben Blaiszik ◽  
...  

Researchers worldwide are seeking to repurpose existing drugs or discover new drugs to counter the disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). A promising source of candidates for such studies is molecules that have been reported in the scientific literature to be drug-like in the context of viral research. However, this literature is too large for human review and features unusual vocabularies for which existing named entity recognition (NER) models are ineffective. We report here on a project that leverages both human and artificial intelligence to detect references to such molecules in free text. We present 1) a iterative model-in-the-loop method that makes judicious use of scarce human expertise in generating training data for a NER model, and 2) the application and evaluation of this method to the problem of identifying drug-like molecules in the COVID-19 Open Research Dataset Challenge (CORD-19) corpus of 198,875 papers. We show that by repeatedly presenting human labelers only with samples for which an evolving NER model is uncertain, our human-machine hybrid pipeline requires only modest amounts of non-expert human labeling time (tens of hours to label 1778 samples) to generate an NER model with an F-1 score of 80.5%—on par with that of non-expert humans—and when applied to CORD’19, identifies 10,912 putative drug-like molecules. This enriched the computational screening team’s targets by 3,591 molecules, of which 18 ranked in the top 0.1% of all 6.6 million molecules screened for docking against the 3CLPro protein.


2021 ◽  
Vol 20 (2) ◽  
pp. 29-35
Author(s):  
Mussa Omar ◽  
Abdulrhman Alsheky ◽  
Balha Faiz

Extracting entities from natural language text to design conceptual models of the entity relationships is not trivial and novice designers and students can find it especially difficult. Researchers have suggested linguistic rules/guidelines for extracting entities from natural language text. Unfortunately, while these guidelines are often correct they can, also, be invalid. There is no rule that is true at all times. This paper suggests novel rules based on the machine learning classifiers, the RIPPER, the PART and the decision trees. Performance comparison was made between the linguistic and the machine learning rules. The results shows that there was a dramatic improvement when machine learning rules were used.


Sign in / Sign up

Export Citation Format

Share Document