semantic mining
Recently Published Documents


TOTAL DOCUMENTS

70
(FIVE YEARS 15)

H-INDEX

7
(FIVE YEARS 1)

2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Meijing Li ◽  
Tianjie Chen ◽  
Keun Ho Ryu ◽  
Cheng Hao Jin

Semantic mining is always a challenge for big biomedical text data. Ontology has been widely proved and used to extract semantic information. However, the process of ontology-based semantic similarity calculation is so complex that it cannot measure the similarity for big text data. To solve this problem, we propose a parallelized semantic similarity measurement method based on Hadoop MapReduce for big text data. At first, we preprocess and extract the semantic features from documents. Then, we calculate the document semantic similarity based on ontology network structure under MapReduce framework. Finally, based on the generated semantic document similarity, document clusters are generated via clustering algorithms. To validate the effectiveness, we use two kinds of open datasets. The experimental results show that the traditional methods can hardly work for more than ten thousand biomedical documents. The proposed method keeps efficient and accurate for big dataset and is of high parallelism and scalability.


2021 ◽  
Author(s):  
Jianyuan Wang ◽  
Xinyue Liu ◽  
Qianggang Cao ◽  
Biao Leng
Keyword(s):  

Author(s):  
Tsung-Yi Chen ◽  
Yuh-Min Chen ◽  
Meng-Che Tsai

Enterprises need to obtain information about not only specific customer preferences, but also, more importantly, customers' psychological characteristics that significantly influence their consumption behaviors and response to intelligent-based marketing activities. If enterprises want to implement more precise intelligent selling activities for customers, customers' personality information will serve as a highly valued reference. The automatic detection method proposed in this study is based on techniques such as text semantic mining and machine learning to conduct personality type prediction on the target by collecting and analyzing the target's social media data. In the test, 5,858 statuses were obtained, 815 of which were labeled, with 122 effective tags. In general, when n = 5, the labeling rate can reach 60-80%. The status property classifier (SPC) proposed in this study can predict the personality type (PT) of the user publishing the status set with a high degree of accuracy by conducting text semantic mining on the status set.


2020 ◽  
Vol 539 ◽  
pp. 461-486 ◽  
Author(s):  
Hongbin Zhang ◽  
Renzhong Wu ◽  
Tian Yuan ◽  
Ziliang Jiang ◽  
Song Huang ◽  
...  

2020 ◽  
Vol 12 (16) ◽  
pp. 6294
Author(s):  
Chenyu Zheng

Global cities act as influential hubs in the networked world. Their city brands, which are projected by the global news media, are becoming sustainable resources in various global competitions and cooperations. This study adopts the research paradigm of computational social science to assess and compare the city brand attention, positivity, and influence of ten Globalization and World Cities Research Network (GaWC) Alpha+ global cities, along with their dimensional structures, based on combining the cognitive and affective theoretical perspectives on the frameworks of the Anholt global city brand dimension system, the big data of global news knowledge graph in Google’s Global Database of Events, Language, and Tone (GDELT), and the technologies of word-embedding semantic mining and clustering analysis. The empirical results show that the overall values and dimensional structures of city brand influence of global cities form distinct levels and clusters, respectively. Although global cities share a common structural characteristic of city brand influence of the dimensions of presence and potential being most prominent, Western and Eastern global cities differentiate in the clustering of dimensional structures of city brand attention, positivity, and influence. City brand attention is more important than city brand positivity in improving the city brand influence of global cities. The preferences of the global news media over global city brands fits the nature of global cities.


2020 ◽  
Vol 29 (15) ◽  
pp. 2050248
Author(s):  
Hamed Jelodar ◽  
Yongli Wang ◽  
Ahamdreza Vajdi ◽  
Mahdi Rabbani ◽  
Ruxin Zhao ◽  
...  

Question-answering (QA) websites supply a quickly growing source of useful information in numerous areas. These platforms present novel opportunities for online users to supply solutions, they also pose numerous challenges with the ever-growing size of the QA community. QA sites supply platforms for users to cooperate in the form of asking questions or giving answers. Stack Overflow is a massive source of information for both industry and academic practitioners, and its analysis can supply useful insights. Topic modeling of Stack Overflow is very beneficial for pattern discovery and behavior analysis in programming knowledge. In this paper, we propose a framework based on the Latent Dirichlet Allocation (LDA) algorithm and fuzzy rules for question topic mining and recommending highlight latent topics in a community question-answering (CQA) forum of developer community. We consider a real dataset and use 170,091 programmer questions in the R language forum from the Stack Overflow website. Our result shows that LDA topic models via novel fuzzy rules can play an effective role for extracting meaningful concepts and semantic mining in question-answering forums in developer communities.


Sign in / Sign up

Export Citation Format

Share Document