Graph-Based Topic Extraction from Vector Embeddings of Text Documents: Application to a Corpus of News Articles

Opinion mining and sentiment analysis are valuable to extract the useful subjective information out of text documents. Predicting the customer’s opinion on amazon products has several benefits like reducing customer churn, agent monitoring, handling multiple customers, tracking overall customer satisfaction, quick escalations, and upselling opportunities. However, performing sentiment analysis is a challenging task for the researchers in order to find the users sentiments from the large datasets, because of its unstructured nature, slangs, misspells and abbreviations. To address this problem, a new proposed system is developed in this research study. Here, the proposed system comprises of four major phases; data collection, pre-processing, key word extraction, and classification. Initially, the input data were collected from the dataset: amazon customer review. After collecting the data, preprocessing was carried-out for enhancing the quality of collected data. The pre-processing phase comprises of three systems; lemmatization, review spam detection, and removal of stop-words and URLs. Then, an effective topic modelling approach Latent Dirichlet Allocation (LDA) along with modified Possibilistic Fuzzy C-Means (PFCM) was applied to extract the keywords and also helps in identifying the concerned topics. The extracted keywords were classified into three forms (positive, negative and neutral) by applying an effective machine learning classifier: Convolutional Neural Network (CNN). The experimental outcome showed that the proposed system enhanced the accuracy in sentiment analysis up to 6-20% related to the existing systems.

Download Full-text

An Improved B-hill Climbing Optimization Technique for Solving the Text Documents Clustering Problem

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405614666180903112541 ◽

2020 ◽

Vol 16 (4) ◽

pp. 296-306 ◽

Cited By ~ 3

Author(s):

Laith Mohammad Abualigah ◽

Essam Said Hanandeh ◽

Ahamad Tajudin Khader ◽

Mohammed Abdallh Otair ◽

Shishir Kumar Shandilya

Keyword(s):

Optimization Technique ◽

Document Clustering ◽

Text Clustering ◽

Hill Climbing ◽

Text Documents ◽

Clustering Problem ◽

Text Document ◽

Text Information ◽

Amount Of Knowledge ◽

The Hill

Background: Considering the increasing volume of text document information on Internet pages, dealing with such a tremendous amount of knowledge becomes totally complex due to its large size. Text clustering is a common optimization problem used to manage a large amount of text information into a subset of comparable and coherent clusters. Aims: This paper presents a novel local clustering technique, namely, β-hill climbing, to solve the problem of the text document clustering through modeling the β-hill climbing technique for partitioning the similar documents into the same cluster. Methods: The β parameter is the primary innovation in β-hill climbing technique. It has been introduced in order to perform a balance between local and global search. Local search methods are successfully applied to solve the problem of the text document clustering such as; k-medoid and kmean techniques. Results: Experiments were conducted on eight benchmark standard text datasets with different characteristics taken from the Laboratory of Computational Intelligence (LABIC). The results proved that the proposed β-hill climbing achieved better results in comparison with the original hill climbing technique in solving the text clustering problem. Conclusion: The performance of the text clustering is useful by adding the β operator to the hill climbing.

Download Full-text

Link-based multi-verse optimizer for text documents clustering

Applied Soft Computing ◽

10.1016/j.asoc.2019.106002 ◽

2020 ◽

Vol 87 ◽

pp. 106002 ◽

Cited By ~ 4

Author(s):

Ammar Kamal Abasi ◽

Ahamad Tajudin Khader ◽

Mohammed Azmi Al-Betar ◽

Syibrah Naim ◽

Sharif Naser Makhadmeh ◽

...

Keyword(s):

Text Documents

Download Full-text

Learning emotional word embeddings for sentiment analysis

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-201993 ◽

2021 ◽

pp. 1-13

Author(s):

Qingtian Zeng ◽

Xishi Zhao ◽

Xiaohui Hu ◽

Hua Duan ◽

Zhongying Zhao ◽

...

Keyword(s):

Sentiment Analysis ◽

Language Processing ◽

State Of The Art ◽

Research Problem ◽

Emotional Word ◽

Classification Model ◽

Data Sets ◽

Word Embeddings ◽

Real World Data ◽

Text Documents

Word embeddings have been successfully applied in many natural language processing tasks due to its their effectiveness. However, the state-of-the-art algorithms for learning word representations from large amounts of text documents ignore emotional information, which is a significant research problem that must be addressed. To solve the above problem, we propose an emotional word embedding (EWE) model for sentiment analysis in this paper. This method first applies pre-trained word vectors to represent document features using two different linear weighting methods. Then, the resulting document vectors are input to a classification model and used to train a text sentiment classifier, which is based on a neural network. In this way, the emotional polarity of the text is propagated into the word vectors. The experimental results on three kinds of real-world data sets demonstrate that the proposed EWE model achieves superior performances on text sentiment prediction, text similarity calculation, and word emotional expression tasks compared to other state-of-the-art models.

Download Full-text

Mechanisms of Drug Resistance in the Pathogenesis of Epilepsy: Role of Neuroinflammation. A Literature Review

Brain Sciences ◽

10.3390/brainsci11050663 ◽

2021 ◽

Vol 11 (5) ◽

pp. 663

Author(s):

Elena D. Bazhanova ◽

Alexander A. Kozlov ◽

Anastasia V. Litovchenko

Keyword(s):

Drug Resistance ◽

Premature Death ◽

Resistance Mechanisms ◽

Biomedical Literature ◽

Drug Resistant ◽

Psychological Consequences ◽

Text Documents ◽

Neural Connections ◽

Inflammatory Processes ◽

Drug Resistant Epilepsy

Epilepsy is a chronic neurological disorder characterized by recurring spontaneous seizures. Drug resistance appears in 30% of patients and it can lead to premature death, brain damage or a reduced quality of life. The purpose of the study was to analyze the drug resistance mechanisms, especially neuroinflammation, in the epileptogenesis. The information bases of biomedical literature Scopus, PubMed, Google Scholar and SciVerse were used. To obtain full-text documents, electronic resources of PubMed Central and Research Gate were used. The article examines the recent research of the mechanisms of drug resistance in epilepsy and discusses the hypotheses of drug resistance development (genetic, epigenetic, target hypothesis, etc.). Drug-resistant epilepsy is associated with neuroinflammatory, autoimmune and neurodegenerative processes. Neuroinflammation causes immune, pathophysiological, biochemical and psychological consequences. Focal or systemic unregulated inflammatory processes lead to the formation of aberrant neural connections and hyperexcitable neural networks. Inflammatory mediators affect the endothelium of cerebral vessels, destroy contacts between endothelial cells and induce abnormal angiogenesis (the formation of “leaky” vessels), thereby affecting the blood–brain barrier permeability. Thus, the analysis of pro-inflammatory and other components of epileptogenesis can contribute to the further development of the therapeutic treatment of drug-resistant epilepsy.

Download Full-text

Increasing the Reliability of Full Text Documents Based on the Use of Mechanisms for Extraction of Statistical and Semantic Links of Elements

2020 International Conference on Information Science and Communications Technologies (ICISCT) ◽

10.1109/icisct50599.2020.9351397 ◽

2020 ◽

Author(s):

Jumanov Isroil ◽

Karshiev Khusan

Keyword(s):

Full Text ◽

Text Documents

Download Full-text

Text Document Summarization Using POS tagging for Kannada Text Documents

2021 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence) ◽

10.1109/confluence51648.2021.9377106 ◽

2021 ◽

Author(s):

Jayashree R ◽

Basavaraj S Anami ◽

Poornima B K

Keyword(s):

Text Documents ◽

Document Summarization ◽

Pos Tagging ◽

Text Document

Download Full-text

Jigsaw: Supporting Investigative Analysis through Interactive Visualization

Information Visualization ◽

10.1057/palgrave.ivs.9500180 ◽

2008 ◽

Vol 7 (2) ◽

pp. 118-132 ◽

Cited By ~ 198

Author(s):

John Stasko ◽

Carsten Görg ◽

Zhicheng Liu

Keyword(s):

Interactive Visualization ◽

Sense Making ◽

Text Documents ◽

Potential Interest

Investigative analysts who work with collections of text documents connect embedded threads of evidence in order to formulate hypotheses about plans and activities of potential interest. As the number of documents and the corresponding number of concepts and entities within the documents grow larger, sense-making processes become more and more difficult for the analysts. We have developed a visual analytic system called Jigsaw that represents documents and their entities visually in order to help analysts examine them more efficiently and develop theories about potential actions more quickly. Jigsaw provides multiple coordinated views of document entities with a special emphasis on visually illustrating connections between entities across the different documents.

Download Full-text

Graph-Based Topic Extraction from Vector Embeddings of Text Documents: Application to a Corpus of News Articles

Topic Extraction from Text Documents Using Multiple-Cause Networks

Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents

Convolutional Neural Network for Customer’s Opinion on Amazon Products

An Improved B-hill Climbing Optimization Technique for Solving the Text Documents Clustering Problem

Link-based multi-verse optimizer for text documents clustering

Learning emotional word embeddings for sentiment analysis

Mechanisms of Drug Resistance in the Pathogenesis of Epilepsy: Role of Neuroinflammation. A Literature Review

Increasing the Reliability of Full Text Documents Based on the Use of Mechanisms for Extraction of Statistical and Semantic Links of Elements

Text Document Summarization Using POS tagging for Kannada Text Documents

Jigsaw: Supporting Investigative Analysis through Interactive Visualization

Export Citation Format