Neural Network Based Text Mining to Discover Enterprise Networks

Author(s):  
Hajlaoui, Kafil
2021 ◽  
Vol 5 (2) ◽  
pp. 109-118
Author(s):  
Euis Saraswati ◽  
Yuyun Umaidah ◽  
Apriade Voutama

Coronavirus disease (Covid-19) or commonly called coronavirus. This virus spreads very quickly and even almost infects the whole world, including Indonesia. A large number of cases and the rapid spread of this virus make people worry and even fear the increasing spread of the Covid-19 virus. Information about this virus has also been spread on various social media, one of which is Twitter. Various public opinions regarding the Covid-19 virus are also widely expressed on Twitter. Opinions on a tweet contain positive or negative sentiments. Sentiments of sentiment contained in a tweet can be used as material for consideration and evaluation for the government in dealing with the Covid-19 virus. Based on these problems, a sentiment analysis classification is needed to find out public opinion on the Covid-19 virus. This research uses Artificial Neural Network (ANN) algorithm with the Backpropagation method. The results of this test get 88.62% accuracy, 91.5% precision, and 95.73% recall. The results obtained show that the ANN model is quite good for classifying text mining.


2013 ◽  
Vol 278-280 ◽  
pp. 1972-1975
Author(s):  
Guo Hong Zhou

Today, numbers of documents that firms must process and with the extensive amounts of information available on the Internet, an automated intelligent method is needed to sort through all available documents or sites. It would be beneficial to start with grouping similar documents or sites together based on similarities. Classifying documents and sorting them into categories could be beneficial since in most cases, no one user would be interested in all the different categories of documents at the same time. Classifying a large number of documents would also make it easier to locate a specific document. In this paper, neural network computing based text mining (TM) and its application will be discussed.


2021 ◽  
Vol 17 (5) ◽  
pp. e1008967
Author(s):  
Chun-Nan Hsu ◽  
Chia-Hui Chang ◽  
Thamolwan Poopradubsil ◽  
Amanda Lo ◽  
Karen A. William ◽  
...  

Antibodies are widely used reagents to test for expression of proteins and other antigens. However, they might not always reliably produce results when they do not specifically bind to the target proteins that their providers designed them for, leading to unreliable research results. While many proposals have been developed to deal with the problem of antibody specificity, it is still challenging to cover the millions of antibodies that are available to researchers. In this study, we investigate the feasibility of automatically generating alerts to users of problematic antibodies by extracting statements about antibody specificity reported in the literature. The extracted alerts can be used to construct an “Antibody Watch” knowledge base containing supporting statements of problematic antibodies. We developed a deep neural network system and tested its performance with a corpus of more than two thousand articles that reported uses of antibodies. We divided the problem into two tasks. Given an input article, the first task is to identify snippets about antibody specificity and classify if the snippets report that any antibody exhibits non-specificity, and thus is problematic. The second task is to link each of these snippets to one or more antibodies mentioned in the snippet. The experimental evaluation shows that our system can accurately perform the classification task with 0.925 weighted F1-score, linking with 0.962 accuracy, and 0.914 weighted F1 when combined to complete the joint task. We leveraged Research Resource Identifiers (RRID) to precisely identify antibodies linked to the extracted specificity snippets. The result shows that it is feasible to construct a reliable knowledge base about problematic antibodies by text mining.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Ayoub Bagheri ◽  
T. Katrien J. Groenhof ◽  
Folkert W. Asselbergs ◽  
Saskia Haitjema ◽  
Michiel L. Bots ◽  
...  

Background and Objective. Electronic health records (EHRs) contain free-text information on symptoms, diagnosis, treatment, and prognosis of diseases. However, this potential goldmine of health information cannot be easily accessed and used unless proper text mining techniques are applied. The aim of this project was to develop and evaluate a text mining pipeline in a multimodal learning architecture to demonstrate the value of medical text classification in chest radiograph reports for cardiovascular risk prediction. We sought to assess the integration of various text representation approaches and clinical structured data with state-of-the-art deep learning methods in the process of medical text mining. Methods. We used EHR data of patients included in the Second Manifestations of ARTerial disease (SMART) study. We propose a deep learning-based multimodal architecture for our text mining pipeline that integrates neural text representation with preprocessed clinical predictors for the prediction of recurrence of major cardiovascular events in cardiovascular patients. Text preprocessing, including cleaning and stemming, was first applied to filter out the unwanted texts from X-ray radiology reports. Thereafter, text representation methods were used to numerically represent unstructured radiology reports with vectors. Subsequently, these text representation methods were added to prediction models to assess their clinical relevance. In this step, we applied logistic regression, support vector machine (SVM), multilayer perceptron neural network, convolutional neural network, long short-term memory (LSTM), and bidirectional LSTM deep neural network (BiLSTM). Results. We performed various experiments to evaluate the added value of the text in the prediction of major cardiovascular events. The two main scenarios were the integration of radiology reports (1) with classical clinical predictors and (2) with only age and sex in the case of unavailable clinical predictors. In total, data of 5603 patients were used with 5-fold cross-validation to train the models. In the first scenario, the multimodal BiLSTM (MI-BiLSTM) model achieved an area under the curve (AUC) of 84.7%, misclassification rate of 14.3%, and F1 score of 83.8%. In this scenario, the SVM model, trained on clinical variables and bag-of-words representation, achieved the lowest misclassification rate of 12.2%. In the case of unavailable clinical predictors, the MI-BiLSTM model trained on radiology reports and demographic (age and sex) variables reached an AUC, F1 score, and misclassification rate of 74.5%, 70.8%, and 20.4%, respectively. Conclusions. Using the case study of routine care chest X-ray radiology reports, we demonstrated the clinical relevance of integrating text features and classical predictors in our text mining pipeline for cardiovascular risk prediction. The MI-BiLSTM model with word embedding representation appeared to have a desirable performance when trained on text data integrated with the clinical variables from the SMART study. Our results mined from chest X-ray reports showed that models using text data in addition to laboratory values outperform those using only known clinical predictors.


Author(s):  
Lean Yu ◽  
Shouyang Wang ◽  
Kin Keung Lai

With the rapid increase of the huge amount of online information, there is a strong demand for Web text mining which helps people discover some useful knowledge from Web documents. For this purpose, this chapter first proposes a back-propagation neural network (BPNN)-based Web text mining system for decision support. In the BPNN-based Web text mining system, four main processes, Web document search, Web text processing, text feature conversion, and BPNN-based knowledge discovery, are involved. Particularly, BPNN is used as an intelligent learning agent that learns about underlying Web documents. In order to scale the individual intelligent agent with the large number of Web documents, we then provide a multi-agent-based neural network system for Web text mining in a parallel way. For illustration purpose, a simulated experiment is performed. Experiment results reveal that the proposed multi-agent neural network system is an effective solution to large scale Web text mining.


2021 ◽  
Author(s):  
Mourad Ellouze ◽  
Seifeddine Mechti ◽  
Moez Krichen ◽  
vinayakumar R ◽  
Lamia Hadrich Belguith

This paper proposes an architecture taking advantage of artificial intelligence and text mining techniques in order to: (i) detect paranoid people by classifying their set of tweets into two classes (Paranoid/not-Paranoid), (ii) ensure the surveillance of these people by classifying their tweets about Covid-19 into two classes (person with normal behavior, person with inappropriate behavior). These objectives are achieved using an approach that takes advantage of different information related to the textual part, user and tweets for features selection task and deep neural network for the classification task. We obtained as an F-score rate 70% for the detection of paranoid people and 73% for the detection of the behavior of these people towards Covid-19. The obtained results are motivating and encouraging researchers to improve them given the interest and the importance of this research axis.


Sign in / Sign up

Export Citation Format

Share Document