Text Categorization Based on Condensed Graph

2021 ◽  
Vol 27 (3) ◽  
pp. 138-146
Author(s):  
A. O. Korney ◽  
◽  
E. N. Kryuchkova ◽  

The resonant world events of2020 led to an increase in the amount of information on the Internet, including criminal, fake news, and fake negative reviews. False negative information can spread very quickly, and methods are needed to suppress this process. The development of effective algorithms for automatic text analysis is especially relevant today. The most important subtasks include thematic catesorization, sentiment analysis, includins ABSA (aspect-based sentiment analysis). The paper proposes a combined semantic-statistical alsorithm for the aspect analysis of larse texts, based on the use of a semantic graph. The aspect extraction method contains the phases of selectins a set of sisnificant words, calculatins the weishts of the vertices of the semantic sraph by the relaxation method, filterins aspects based on the sradient method. The method proposed allows to extract domain-dependent aspect terms from trainins data. Different aspect term sets extracted from different domains have the same statistical features, and in the same time lexical diversity and structure are taken into account.

2020 ◽  
Vol 34 (05) ◽  
pp. 8600-8607
Author(s):  
Haiyun Peng ◽  
Lu Xu ◽  
Lidong Bing ◽  
Fei Huang ◽  
Wei Lu ◽  
...  

Target-based sentiment analysis or aspect-based sentiment analysis (ABSA) refers to addressing various sentiment analysis tasks at a fine-grained level, which includes but is not limited to aspect extraction, aspect sentiment classification, and opinion extraction. There exist many solvers of the above individual subtasks or a combination of two subtasks, and they can work together to tell a complete story, i.e. the discussed aspect, the sentiment on it, and the cause of the sentiment. However, no previous ABSA research tried to provide a complete solution in one shot. In this paper, we introduce a new subtask under ABSA, named aspect sentiment triplet extraction (ASTE). Particularly, a solver of this task needs to extract triplets (What, How, Why) from the inputs, which show WHAT the targeted aspects are, HOW their sentiment polarities are and WHY they have such polarities (i.e. opinion reasons). For instance, one triplet from “Waiters are very friendly and the pasta is simply average” could be (‘Waiters’, positive, ‘friendly’). We propose a two-stage framework to address this task. The first stage predicts what, how and why in a unified model, and then the second stage pairs up the predicted what (how) and why from the first stage to output triplets. In the experiments, our framework has set a benchmark performance in this novel triplet extraction task. Meanwhile, it outperforms a few strong baselines adapted from state-of-the-art related methods.


Author(s):  
А. Mukasheva

The purpose of this article is to study one of the methods of social networks analysis – text sentiment analysis. Today, social media has become a big data base that social network analysis is used for various purposes – from setting up targeted advertising for a cosmetics store to preventing riots at the state level. There are various methods for analyzing social networks such as graph method, text sentiment analysis, audio, and video object analysis. Among them, sentiment analysis is widely used for political, social, consumer research, and also for cybersecurity. Since the analysis of the sentiment of the text involves the analysis of the emotional opinions expressed in the text, the first step is to define the term opinion. An opinion can be simple, that is, a positive, negative or neutral emotion towards a particular object or its aspect. Comparison is also an opinion, but devoid of emotional connotation. To work with simple opinions, the first task of text sentiment analysis is to classify the text. There are three levels of classifications: classification at the text level, at the level of a sentence, and at the aspect level of the object. After classifying the text at the desired level, the next task is to extract structured data from unstructured information. The problem can be solved using the five-tuple method. One of the important elements of a tuple is the aspect in which an opinion is usually expressed. Next, aspect-based sentiment analysis is applied, which involves identifying aspects of the desired object and assessing the polarity of mood for each aspect. This task is divided into two sub-tasks such as aspect extraction and aspect classification. Sentiment analysis has limitations such as the definition of sarcasm and difficulty of working with abbreviated words.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Lixue Zou ◽  
Xiwen Liu ◽  
Wray Buntine ◽  
Yanli Liu

PurposeFull text of a document is a rich source of information that can be used to provide meaningful topics. The purpose of this paper is to demonstrate how to use citation context (CC) in the full text to identify the cited topics and citing topics efficiently and effectively by employing automatic text analysis algorithms.Design/methodology/approachThe authors present two novel topic models, Citation-Context-LDA (CC-LDA) and Citation-Context-Reference-LDA (CCRef-LDA). CC is leveraged to extract the citing text from the full text, which makes it possible to discover topics with accuracy. CC-LDA incorporates CC, citing text, and their latent relationship, while CCRef-LDA incorporates CC, citing text, their latent relationship and reference information in CC. Collapsed Gibbs sampling is used to achieve an approximate estimation. The capacity of CC-LDA to simultaneously learn cited topics and citing topics together with their links is investigated. Moreover, a topic influence measure method based on CC-LDA is proposed and applied to create links between the two-level topics. In addition, the capacity of CCRef-LDA to discover topic influential references is also investigated.FindingsThe results indicate CC-LDA and CCRef-LDA achieve improved or comparable performance in terms of both perplexity and symmetric Kullback–Leibler (sKL) divergence. Moreover, CC-LDA is effective in discovering the cited topics and citing topics with topic influence, and CCRef-LDA is able to find the cited topic influential references.Originality/valueThe automatic method provides novel knowledge for cited topics and citing topics discovery. Topic influence learnt by our model can link two-level topics and create a semantic topic network. The method can also use topic specificity as a feature to rank references.


2022 ◽  
Vol 24 (3) ◽  
pp. 0-0

In this digital era, people are very keen to share their feedback about any product, services, or current issues on social networks and other platforms. A fine analysis of these feedbacks can give a clear picture of what people think about a particular topic. This work proposed an almost unsupervised Aspect Based Sentiment Analysis approach for textual reviews. Latent Dirichlet Allocation, along with linguistic rules, is used for aspect extraction. Aspects are ranked based on their probability distribution values and then clustered into predefined categories using frequent terms with domain knowledge. SentiWordNet lexicon uses for sentiment scoring and classification. The experiment with two popular datasets shows the superiority of our strategy as compared to existing methods. It shows the 85% average accuracy when tested on manually labeled data.


2022 ◽  
pp. 255-263
Author(s):  
Chirag Visani ◽  
Vishal Sorathiya ◽  
Sunil Lavadiya

The popularity of the internet has increased the use of e-commerce websites and news channels. Fake news has been around for many years, and with the arrival of social media and modern-day news at its peak, easy access to e-platform and exponential growth of the knowledge available on social media networks has made it intricate to differentiate between right and wrong information, which has caused large effects on the offline society already. A crucial goal in improving the trustworthiness of data in online social networks is to spot fake news so the detection of spam news becomes important. For sentiment mining, the authors specialise in leveraging Facebook, Twitter, and Whatsapp, the most prominent microblogging platforms. They illustrate how to assemble a corpus automatically for sentiment analysis and opinion mining. They create a sentiment classifier using the corpus that can classify between fake, real, and neutral opinions in a document.


Sign in / Sign up

Export Citation Format

Share Document