topic clustering
Recently Published Documents


TOTAL DOCUMENTS

54
(FIVE YEARS 22)

H-INDEX

6
(FIVE YEARS 4)

2021 ◽  
Vol 2021 ◽  
pp. 1-12
Author(s):  
Yue Li ◽  
Yan Zhou ◽  
Xiangying Ma ◽  
Yunxing Zhang

Due to the demand for safety and convenience in traveling, self-driving technology has developed very fast in the past decades. In this paper, a novel technology forecasting model is developed. The topic-based text mining and expert judgment approaches are combined to forecast the technology trends efficiently and accurately. To improve the reliability of the results, multidimensional information including scientific papers, patents, and industry data is considered. Then, the model is utilized to forecast the development trends of self-driving technology in China. Data ranging from 2002 to 2019 are adopted with proper data cleaning. Topic clustering for papers and patents is performed, and the hierarchical structures are constructed. On this basis, the results of technology’s evolution based on papers and patents are compared and the development trends are obtained. With these results, it is speculated that technology on “Decision” will be the next hotspot in patents. The research results of this paper will provide reference and guidance for Chinese enterprises and government in decision-making on self-driving technology.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Huilin Song

Smart government is an important part of the smart world. The use of big data analysis technology can effectively improve the government’s ability of fine management. Taking China’s bike-sharing industry as the research object, we study the relationship between public-derived big data and industrial policy. First, a feature-enhanced short text clustering method is proposed to perform topic clustering on publicly derived big data. Second, keyword extraction based on word frequency is used to quantify the text of industrial policy. Finally, time is taken as the main line to analyze the co-occurrence of clustering topics and keywords. The results show that (1) the feature enhancement method we proposed can effectively improve the clustering effect. (2) There is a great correlation between the industrial policy and the information mined by Weibo, but there is an obvious lag. Rational use of public-derived big data will effectively help the industrial policy to be released in a better and faster way.


Author(s):  
Ari Silburt ◽  
Anja Subasic ◽  
Evan Thompson ◽  
Carmeline Dsilva ◽  
Tarec Fares
Keyword(s):  

2021 ◽  
Vol 29 (1) ◽  
Author(s):  
Nur Atiqah Sia Abdullah ◽  
Nur Ida Aniza Rusli

With the explosive growth of social media, the online community can freely express their opinions without disclosing their identities. People with hidden agendas can easily post fake opinions to discredit target products, services, politicians, or organizations. With these big data, monitoring opinions and distilling their sentiments remain a formidable task because of the proliferation of diverse sites with a large volume of opinions that are portrayed in multilingual. Therefore, this paper aims to provide a systematic literature review on multilingual sentiment analysis, which summarises the common languages supported in multilingual sentiment analysis, pre-processing techniques, existing sentiment analysis approaches, and evaluation models that have been used for multilingual sentiment analysis. By following the systematic literature review, the findings revealed, most of the models supported two languages, and English is seen as the most used language in sentiment analysis studies. None of the reviewed literature has catered the combination of languages for English, Chinese, Malay, and Hindi language on multilingual sentiment analysis. The common pre-processing techniques for the multilingual domain are tokenization, normalization, capitalization, N-gram, and machine translation. Meanwhile, the sentiment analysis classification techniques for multilingual sentiment are hybrid sentiment analysis, which includes localized language analysis, unsupervised topic clustering, and then followed by multilingual sentiment analysis. In terms of evaluation, most of the studies used precision, recall, and accuracy as the benchmark for the results.


2020 ◽  
Author(s):  
Qiulin Wang ◽  
Lihua Song ◽  
Li Zhuang ◽  
Jiangwen Su

Sign in / Sign up

Export Citation Format

Share Document