An Innovative AI-Based System for Corruption Risks Assessment Among Corporate Managers to Support Open Source Analysis

This paper presents a Natural Language Processing system that searches for available information and analyzes it in order to reconstruct the corporate chain and monitor the corruption risk of people in command positions. Today, the Internet and other open sources offer the greatest opportunity for finding information, since content about corporate managers is continuously posted and updated there. Given the sheer volume of information, an intelligent system that can independently find, analyze, and synthesize information about a set of target subjects is highly advantageous. The paper describes a predictive model, based on Machine Learning and Artificial Intelligence techniques, that determines whether a news item about an individual (searched for during a due diligence process) contains information about a crime, investigation, conviction, fraud, corruption, or sanction relating to that individual. Methods based on Artificial Neural Networks and Support Vector Machines (SVM) are introduced, applied to this task, and compared with one another. The results show that the architecture based on an SVM with a TF-IDF matrix and text pre-processing outperforms the other approaches discussed in the paper, and that it retains high accuracy and precision when predicting on new data.
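To make the winning configuration concrete, the following is a minimal sketch of a TF-IDF plus linear SVM text classifier in plain Python. It is not the authors' implementation: the toy headlines, the stop-word list, the labels (+1 for adverse news, -1 for neutral news), the Pegasos-style training loop without a bias term, and every function name are assumptions made for illustration only.

```python
import math
import random
import re
from collections import Counter

# Hypothetical stop-word list; the paper's actual pre-processing is not specified here.
STOP_WORDS = {"a", "after", "and", "for", "in", "of", "the", "to"}

def preprocess(text):
    """Lowercase, keep alphabetic tokens, and drop a few stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]

def fit_idf(docs):
    """Smoothed inverse document frequency for every term in the training docs."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def tfidf(doc, idf):
    """L2-normalised sparse TF-IDF vector; terms unseen in training are dropped."""
    tf = Counter(term for term in doc if term in idf)
    vec = {term: count * idf[term] for term, count in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {term: v / norm for term, v in vec.items()}

def score(w, x):
    """Dot product between the sparse weight vector and a sparse document vector."""
    return sum(w.get(term, 0.0) * v for term, v in x.items())

def train_linear_svm(X, y, lam=0.01, epochs=50, seed=0):
    """Stochastic subgradient descent on the L2-regularised hinge loss (no bias)."""
    rng = random.Random(seed)
    w, step = {}, 0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            step += 1
            eta = 1.0 / (lam * step)              # decaying learning rate
            violated = y[i] * score(w, X[i]) < 1.0
            for term in w:                        # shrinkage from the regulariser
                w[term] *= 1.0 - 1.0 / step
            if violated:                          # hinge-loss subgradient step
                for term, v in X[i].items():
                    w[term] = w.get(term, 0.0) + eta * y[i] * v
    return w

def predict(w, x):
    """Return +1 (adverse news) for a positive score, otherwise -1 (neutral)."""
    return 1 if score(w, x) > 0 else -1

# Invented toy corpus, for illustration only.
train_texts = [
    "Executive convicted of fraud after bribery investigation",
    "Court imposes sanction for corruption and fraud",
    "Prosecutors open corruption investigation into bribery claims",
    "Director opens new factory and hires engineers",
    "Company reports strong quarterly earnings growth",
    "Chairman announces charity partnership and new factory",
]
y = [1, 1, 1, -1, -1, -1]

docs = [preprocess(t) for t in train_texts]
idf = fit_idf(docs)
X = [tfidf(d, idf) for d in docs]
w = train_linear_svm(X, y)

for headline in ["Regulator confirms fraud and corruption sanction",
                 "Company hires engineers for new factory"]:
    print(headline, "->", predict(w, tfidf(preprocess(headline), idf)))
```

In practice a library implementation of TF-IDF vectorisation and SVM training would normally replace the hand-written functions; the sketch only shows how the pre-processing, weighting, and classification steps fit together.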


The Experience of Developing a Large-Scale Natural Language Processing System: Critique

The Kluwer International Series in Engineering and Computer Science - Natural Language Processing: The PLNLP Approach
DOI: 10.1007/978-1-4615-3170-8_7
1993
pp. 77-89
Cited By ~ 2

Author(s): Stephen Richardson, Lisa Braden-Harder

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Large Scale, Processing System, Natural Language Processing System


Face Detection and Natural Language Processing System Using Artificial Intelligence

Lecture Notes in Networks and Systems - Inventive Communication and Computational Technologies
DOI: 10.1007/978-981-15-0146-3_73
2020
pp. 773-780

Author(s): H. S. Avani, Ayushi Turkar, C. D. Divya

Keyword(s): Artificial Intelligence, Natural Language Processing, Natural Language, Face Detection, Language Processing, Processing System, Natural Language Processing System


Natural Language Processing: System Evaluation

Encyclopedia of Language & Linguistics
DOI: 10.1016/b0-08-044854-2/00932-9
2006
pp. 518-523

Author(s): M. Maybury

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, System Evaluation, Natural Language Processing System


Model of process and model of natural language processing system

IOP Conference Series Materials Science and Engineering
DOI: 10.1088/1757-899x/878/1/012028
2020
Vol 878
pp. 012028

Author(s): M Zhekova, G Totkov

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, Natural Language Processing System


Moonstone: a novel natural language processing system for inferring social risk from clinical narratives

Journal of Biomedical Semantics
DOI: 10.1186/s13326-019-0198-0
2019
Vol 10 (1)
Cited By ~ 3

Author(s): Mike Conway, Salomeh Keyhani, Lee Christensen, Brett R. South, Marzieh Vali, ...

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, Social Risk, Natural Language Processing System


The PUNDIT natural-language processing system

[1989] Proceedings. The Annual AI Systems in Government Conference
DOI: 10.1109/aisig.1989.47330
2003
Cited By ~ 3

Author(s): L. Hirschman, M. Palmer, J. Dowding, D. Dahl, M. Linebarger, ...

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, Natural Language Processing System


Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

DOI: 10.18653/v1/d18-2
2018

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, Empirical Methods, Natural Language Processing System


Repurposing the Clinical Record: Can an Existing Natural Language Processing System De-identify Clinical Notes?

Journal of the American Medical Informatics Association
DOI: 10.1197/jamia.m2862
2009
Vol 16 (1)
pp. 37-39
Cited By ~ 20

Author(s): F. P. Morrison, L. Li, A. M. Lai, G. Hripcsak

Keyword(s): Natural Language Processing, Natural Language, Language Processing, Processing System, Clinical Record, Clinical Notes, Natural Language Processing System


Learning predictive models of drug side-effect relationships from distributed representations of literature-derived semantic predications

Journal of the American Medical Informatics Association
DOI: 10.1093/jamia/ocy077
2018
Vol 25 (10)
pp. 1339-1350
Cited By ~ 5

Author(s): Justin Mower, Devika Subramanian, Trevor Cohen

Keyword(s): Machine Learning, Language Processing, Side Effect, Cross Validation, Processing System, Biomedical Literature, Supervised Machine Learning, Machine Learning Techniques, Drug Side Effect, Natural Language Processing System

Abstract

Objective: The aim of this work is to leverage relational information extracted from biomedical literature using a novel synthesis of unsupervised pretraining, representational composition, and supervised machine learning for drug safety monitoring.

Methods: Using ≈80 million concept-relationship-concept triples extracted from the literature using the SemRep Natural Language Processing system, distributed vector representations (embeddings) were generated for concepts as functions of their relationships utilizing two unsupervised representational approaches. Embeddings for drugs and side effects of interest from two widely used reference standards were then composed to generate embeddings of drug/side-effect pairs, which were used as input for supervised machine learning. This methodology was developed and evaluated using cross-validation strategies and compared to contemporary approaches. To qualitatively assess generalization, models trained on the Observational Medical Outcomes Partnership (OMOP) drug/side-effect reference set were evaluated against a list of ≈1100 drugs from an online database.

Results: The employed method improved performance over previous approaches. Cross-validation results advance the state of the art (AUC 0.96; F1 0.90 and AUC 0.95; F1 0.84 across the two sets), outperforming methods utilizing literature and/or spontaneous reporting system data. Examination of predictions for unseen drug/side-effect pairs indicates the ability of these methods to generalize, with over tenfold label support enrichment in the top 100 predictions versus the bottom 100 predictions.

Discussion and Conclusion: Our methods can assist the pharmacovigilance process using information from the biomedical literature. Unsupervised pretraining generates a rich relationship-based representational foundation for machine learning techniques to classify drugs in the context of a putative side effect, given known examples.
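The results above are reported as AUC and F1. As a reading aid, here is a minimal sketch of how these two metrics can be computed from classifier scores. The labels and scores are invented toy values, not data from the study, and the 0.5 decision threshold and the function names are assumptions.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve: the fraction of (positive, negative) pairs
    in which the positive example is scored higher (ties count as half)."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def f1_score(labels, scores, threshold=0.5):
    """Harmonic mean of precision and recall at a fixed decision threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for l, p in zip(labels, preds) if l == 1 and p == 1)
    fp = sum(1 for l, p in zip(labels, preds) if l == 0 and p == 1)
    fn = sum(1 for l, p in zip(labels, preds) if l == 1 and p == 0)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

# Hypothetical drug/side-effect pairs: 1 = known association, 0 = no association.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]

print("AUC:", round(roc_auc(labels, scores), 3))
print("F1: ", round(f1_score(labels, scores), 3))
```

AUC depends only on how the scores rank positives against negatives, while F1 depends on the chosen threshold, which is why the two numbers are usually reported together.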
