Spatio-temporal Semantic Analysis of Safety Production Accidents in Grain Depot based on Natural Language Processing

Author(s):  
Xie Wang ◽  
Yun Cao ◽  
Bo Mao
2021 ◽  
Vol 47 (05) ◽  
Author(s):  
NGUYỄN CHÍ HIẾU

Knowledge graphs have been applied in many fields in recent years, such as search engines, semantic analysis, and question answering. However, there are many obstacles to building knowledge graphs, including methodologies, data, and tools. This paper introduces a novel methodology for building a knowledge graph from heterogeneous documents. We use natural language processing and deep learning methods to build this graph. The knowledge graph can be used in question answering systems and information retrieval, especially in the computing domain.
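As a toy illustration of the kind of structure such a methodology produces (the entities and relations below are invented for illustration, not taken from the paper), a knowledge graph can be stored as subject–predicate–object triples and queried for question answering:

```python
from collections import defaultdict

# A knowledge graph as a set of (subject, predicate, object) triples,
# indexed by subject for fast lookup during question answering.
class KnowledgeGraph:
    def __init__(self):
        self.triples = set()
        self.by_subject = defaultdict(list)

    def add(self, subj, pred, obj):
        # Deduplicate: each fact is stored once.
        if (subj, pred, obj) not in self.triples:
            self.triples.add((subj, pred, obj))
            self.by_subject[subj].append((pred, obj))

    def query(self, subj, pred):
        # Return all objects linked to `subj` by `pred`.
        return [o for p, o in self.by_subject[subj] if p == pred]

kg = KnowledgeGraph()
kg.add("Python", "is_a", "programming language")
kg.add("Python", "used_for", "natural language processing")
kg.add("spaCy", "written_in", "Python")

print(kg.query("Python", "used_for"))  # ['natural language processing']
```

A question answering front end would map a parsed question such as "What is Python used for?" onto a `query("Python", "used_for")` call against this store.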


2020 ◽  
Author(s):  
Michael Prendergast

Abstract – A Verification Cross-Reference Matrix (VCRM) is a table that depicts the verification methods for requirements in a specification. Usually requirement labels are rows, available verification methods are columns, and an “X” in a cell indicates the use of a verification method for that requirement. Verification methods include Demonstration, Inspection, Analysis, and Test, and sometimes Certification, Similarity, and/or Analogy. VCRMs enable acquirers and stakeholders to quickly understand how a product’s requirements will be tested. Maintaining consistency of very large VCRMs can be challenging, and inconsistent verification methods can result in a large set of uncoordinated “spaghetti tests”. Natural language processing algorithms that can identify similarities between requirements offer promise in addressing this challenge. This paper applies and compares four natural language processing algorithms for automatically populating VCRMs from natural language requirements: (a) Naïve Bayesian inference, (b) nearest neighbor by weighted Dice similarity, (c) nearest neighbor with latent semantic analysis similarity, and (d) an ensemble method combining the first three approaches. The VCRMs used for this study cover slot machine technical requirements derived from gaming regulations of Australia and New Zealand, the province of Nova Scotia (Canada), the state of Michigan (United States), and recommendations from the International Association of Gaming Regulators (IAGR).
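A minimal sketch of the nearest-neighbor-by-Dice-similarity idea: score a new requirement against labelled requirements and inherit the verification method of the closest match. Token-level, unweighted Dice is shown for brevity (the paper's variant weights tokens), and the requirement texts below are invented examples, not drawn from the study's regulations:

```python
# Nearest-neighbour assignment of verification methods by Dice similarity.
def dice(a_tokens, b_tokens):
    # Dice coefficient: 2|A ∩ B| / (|A| + |B|) over token sets.
    a, b = set(a_tokens), set(b_tokens)
    if not a and not b:
        return 0.0
    return 2 * len(a & b) / (len(a) + len(b))

def tokenize(text):
    return text.lower().split()

# Labelled requirements: (requirement text, verification method).
labelled = [
    ("The machine shall display the current credit balance", "Demonstration"),
    ("The cabinet shall withstand a 50 N impact without damage", "Test"),
    ("The RNG algorithm shall be documented in the design dossier", "Inspection"),
]

def predict_method(new_req):
    scored = [(dice(tokenize(new_req), tokenize(text)), method)
              for text, method in labelled]
    return max(scored)[1]  # method of the most similar labelled requirement

print(predict_method("The machine shall display the total amount wagered"))
# Demonstration
```

Weighting (e.g. down-weighting boilerplate tokens like "shall") and the other two algorithms slot into the same predict-by-neighbor loop.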


2021 ◽  
Vol 12 ◽  
Author(s):  
Changcheng Wu ◽  
Junyi Li ◽  
Ye Zhang ◽  
Chunmei Lan ◽  
Kaiji Zhou ◽  
...  

Nowadays, most courses in massive open online course (MOOC) platforms are xMOOCs, which are based on the traditional instruction-driven principle. The course lecture is still the key component of the course. Thus, analyzing the lectures of xMOOC instructors would be helpful for evaluating course quality and providing feedback to instructors and researchers. The current study aimed to portray the lecture styles of instructors in MOOCs from the perspective of natural language processing. Specifically, 129 course transcripts were downloaded from two major MOOC platforms. Two semantic analysis tools (Linguistic Inquiry and Word Count and Coh-Metrix) were used to extract semantic features including self-reference, tone, affect, cognitive words, cohesion, complex words, and sentence length. On the basis of student comments, course video review, and the results of cluster analysis, we found four different lecture styles: “perfect,” “communicative,” “balanced,” and “serious.” Significant differences were found between the different lecture styles within different disciplines for note taking, discussion posts, and overall course satisfaction. Future studies could use fine-grained log data to verify the results of our study and explore how the results of natural language processing can be used to improve instructors' lectures in both MOOCs and traditional classes.
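Two of the named features, self-reference and sentence length, are simple enough to sketch directly. The transcript text below is invented, and style labels would come from clustering many such feature vectors, which is not reproduced here:

```python
import re

# Extract two of the semantic features named in the study -- self-reference
# rate and mean sentence length -- from a lecture transcript.
SELF_WORDS = {"i", "me", "my", "mine", "myself", "we", "us", "our"}

def features(transcript):
    # Split on terminal punctuation to approximate sentence boundaries.
    sentences = [s for s in re.split(r"[.!?]+", transcript) if s.strip()]
    words = re.findall(r"[a-z']+", transcript.lower())
    self_ref = sum(w in SELF_WORDS for w in words) / len(words)
    mean_len = len(words) / len(sentences)
    return {"self_reference": round(self_ref, 3),
            "mean_sentence_length": round(mean_len, 2)}

sample = "I think we should start with an example. Gradient descent minimizes a loss."
print(features(sample))
```

A clustering step (e.g. k-means over such vectors across all 129 transcripts) would then group instructors into the style clusters the study reports.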


Author(s):  
Dr. Kamlesh Sharma ◽  
Nidhi Garg ◽  
Arun Pandey ◽  
Daksh Yadav ◽  
...  

Plagiarism is the act of using another person’s words, ideas, or information without giving credit to that person and presenting them as one’s own. With the development of technology in recent years, acts of plagiarism have increased significantly. Fortunately, plagiarism detection techniques are available and are improving day by day at detecting attempts to plagiarize content in education. Software such as Turnitin, iThenticate, or SafeAssign is available on the market and does a good job in this context. But the problem is not fully solved yet: these tools still do not detect the rephrasing of another writer's statements in other words. This paper primarily focuses on detecting plagiarism in a suspicious document based on the meaning and linguistic variation of its content. The techniques used for this purpose are based on natural language processing. In this paper, we present how semantic analysis and syntax-driven parsing can be used to detect plagiarism.
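A toy sketch of why meaning-based matching catches rephrasings that surface matching misses: normalize synonyms to a canonical form before measuring overlap. The synonym map here is hand-made for illustration; a real system would draw on a lexical resource such as WordNet and add syntactic parsing on top:

```python
# Toy meaning-based matching: map synonyms to a canonical word before
# measuring overlap, so simple rephrasings still match.
SYNONYMS = {
    "rapid": "fast", "quick": "fast",
    "large": "big", "huge": "big",
    "automobile": "car", "vehicle": "car",
}

def normalize(text):
    words = text.lower().replace(".", "").split()
    return {SYNONYMS.get(w, w) for w in words}

def overlap(a, b):
    # Jaccard similarity over synonym-normalised word sets.
    na, nb = normalize(a), normalize(b)
    return len(na & nb) / len(na | nb)

original = "The rapid automobile was huge"
rephrased = "The quick vehicle was large"
print(overlap(original, rephrased))  # 1.0 after synonym normalisation
```

Without the normalization step the two sentences share only "the" and "was", so a purely lexical detector would score them as largely dissimilar.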


Reusing code with or without modification is a common practice in building large codebases of system software such as Linux, gcc, and the JDK. This practice is referred to as software cloning or forking. Developers often find it difficult to carry bug fixes across a large code base when porting it from one language to another. Many approaches exist for identifying software clones within the same language, but these do not help developers involved in porting; hence there is a need for a cross-language clone detector. This paper uses a natural language processing (NLP) approach based on latent semantic analysis, which relies on singular value decomposition, to find cross-language clones in neighboring languages covering all four clone types. It takes code (C, C++, or Java) as input and matches all neighboring code clones in a static repository in terms of the frequency of matched lines.
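The core of latent semantic analysis can be sketched briefly: build a term-document matrix, reduce it with a truncated singular value decomposition, and compare documents by cosine similarity in the latent space. The code snippets below are invented stand-ins (a real detector would tokenize source code properly rather than splitting on whitespace):

```python
import numpy as np

# LSA core: term-document matrix -> truncated SVD -> cosine in latent space.
docs = [
    "for i in range n sum += a[i]",              # accumulation loop, style 1
    "for int i = 0 i < n i++ total += a[i]",     # accumulation loop, style 2
    "print hello world print goodbye world",     # unrelated snippet
]

vocab = sorted({w for d in docs for w in d.split()})
# Rows are terms, columns are documents; entries are raw term counts.
X = np.array([[d.split().count(w) for d in docs] for w in vocab], dtype=float)

U, s, Vt = np.linalg.svd(X, full_matrices=False)
k = 2                                    # keep the top-k latent dimensions
doc_vecs = (np.diag(s[:k]) @ Vt[:k]).T   # one k-dim vector per document

def cosine(u, v):
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

print(cosine(doc_vecs[0], doc_vecs[1]))  # the two loops: close to 1.0
print(cosine(doc_vecs[0], doc_vecs[2]))  # loop vs. print: close to 0.0
```

Truncating to `k` dimensions is what lets LSA score documents as similar even when they share few exact tokens, which is the property the paper exploits across languages.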


Author(s):  
Alexey Kopaygorodsky ◽  
I. Khayrullin ◽  
E. Khayrullina

The article discusses the use of semantic analysis and natural language processing methods to support research into, and forecasting of, the innovative development of the energy infrastructure of the Russian Federation. Existing methods and approaches to monitoring technological solutions and innovative scientific developments are considered. To automate monitoring, the authors propose natural language processing (NLP) methods. Semantic analysis and knowledge integration are based on a system of ontologies. The paper presents the main methods and approaches to building an infrastructure for processing open Big Data. Applying the proposed methods improves the quality of scientific research in this area.


2015 ◽  
Vol 24 (01) ◽  
pp. 183-193 ◽  
Author(s):  
D. Mowery ◽  
B. R. South ◽  
M. Kvist ◽  
H. Dalianis ◽  
S. Velupillai

Summary Objectives: We present a review of recent advances in clinical Natural Language Processing (NLP), with a focus on semantic analysis and key subtasks that support such analysis. Methods: We conducted a literature review of clinical NLP research from 2008 to 2014, emphasizing recent publications (2012-2014), based on PubMed and ACL proceedings as well as relevant referenced publications from the included papers. Results: Significant articles published within this time-span were included and are discussed from the perspective of semantic analysis. Three key clinical NLP subtasks that enable such analysis were identified: 1) developing more efficient methods for corpus creation (annotation and de-identification), 2) generating building blocks for extracting meaning (morphological, syntactic, and semantic subtasks), and 3) leveraging NLP for clinical utility (NLP applications and infrastructure for clinical use cases). Finally, we provide a reflection upon most recent developments and potential areas of future NLP development and applications. Conclusions: There has been an increase of advances within key NLP subtasks that support semantic analysis. Performance of NLP semantic analysis is, in many cases, close to that of agreement between humans. The creation and release of corpora annotated with complex semantic information models has greatly supported the development of new tools and approaches. Research on non-English languages is continuously growing. NLP methods have sometimes been successfully employed in real-world clinical tasks. However, there is still a gap between the development of advanced resources and their utilization in clinical settings. A plethora of new clinical use cases are emerging due to established health care initiatives and additional patient-generated sources through the extensive use of social media and other devices.


1988 ◽  
Vol 11 (1-2) ◽  
pp. 69-87 ◽  
Author(s):  
H. Jäppinen ◽  
T. Honkela ◽  
H. Hyötyniemi ◽  
A. Lehtola

In this paper we describe a multilevel model for natural language processing. The distinct computational strata are motivated by invariant linguistic properties which are progressively uncovered from utterances. We examine each level in detail. The processes are morphological analysis, dependency parsing, logico-semantic analysis and query adaptation. Both linguistic and computational aspects are discussed. In addition to theory, we consider certain engineering viewpoints important and discuss them briefly.
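The staged architecture described above can be sketched as a pipeline in which each level consumes the previous level's output. The stage internals below are trivial placeholders; only the layering (morphological analysis, dependency parsing, logico-semantic analysis, query adaptation) follows the model:

```python
# The multilevel architecture as a pipeline of stages, each uncovering one
# more layer of structure from the utterance. Stage bodies are placeholders.
def morphological_analysis(utterance):
    # Real stage: lemmas and morphological features per word.
    return [{"form": w, "lemma": w.lower()} for w in utterance.split()]

def dependency_parse(tokens):
    # Real stage: head/dependent links; here every word hangs off the first.
    return [{"token": t, "head": 0} for t in tokens]

def logico_semantic_analysis(tree):
    # Real stage: predicate-argument structure from the dependency tree.
    return {"predicate": tree[0]["token"]["lemma"],
            "arguments": [n["token"]["lemma"] for n in tree[1:]]}

def query_adaptation(logical_form):
    # Real stage: map the logical form onto a target query language.
    return (logical_form["predicate"], tuple(logical_form["arguments"]))

def pipeline(utterance):
    result = utterance
    for stage in (morphological_analysis, dependency_parse,
                  logico_semantic_analysis, query_adaptation):
        result = stage(result)
    return result

print(pipeline("List Finnish parsers"))  # ('list', ('finnish', 'parsers'))
```

The point of the layering is that each stratum depends only on the invariants exposed by the one below it, so a stage can be replaced without disturbing the others.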

