text mining Latest Research Papers

Automated Text Classification of Maintenance Data of Higher Education Buildings Using Text Mining and Machine Learning Techniques

Journal of Architectural Engineering ◽

10.1061/(asce)ae.1943-5568.0000522 ◽

2022 ◽

Vol 28 (1) ◽

Author(s):

Sungil Hong ◽

Junghyun Kim ◽

Eunhwa Yang

Keyword(s):

Higher Education ◽

Machine Learning ◽

Text Mining ◽

Text Classification ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Automated Text Classification

Evaluation of the synergy degree of industrial de-capacity policies based on text mining: A case study of China's coal industry

Resources Policy ◽

10.1016/j.resourpol.2021.102547 ◽

2022 ◽

Vol 76 ◽

pp. 102547

Author(s):

Dandan Liu ◽

Delu Wang

Keyword(s):

Text Mining ◽

Coal Industry

Application of Informetrics on Financial Network Text Mining Based on Affective Computing

Information Processing & Management ◽

10.1016/j.ipm.2021.102822 ◽

2022 ◽

Vol 59 (2) ◽

pp. 102822

Author(s):

Anzhong Huang ◽

Yuling Zhang ◽

Jianping Peng ◽

Hong Chen

Keyword(s):

Text Mining ◽

Affective Computing ◽

Financial Network

Recycling behaviour: Mapping knowledge domain through bibliometrics and text mining

Journal of Environmental Management ◽

10.1016/j.jenvman.2021.114160 ◽

2022 ◽

Vol 303 ◽

pp. 114160

Author(s):

Alessandro Concari ◽

Gerjo Kok ◽

Pim Martens

Keyword(s):

Text Mining ◽

Knowledge Domain

PENERAPAN TEXT MINING UNTUK MELAKUKAN CLUSTERING DATA TWEET AKUN BLIBLI PADA MEDIA SOSIAL TWITTER MENGGUNAKAN K-MEANS CLUSTERING

Jurnal Gaussian ◽

10.14710/j.gauss.v10i4.30409 ◽

2022 ◽

Vol 10 (4) ◽

pp. 583-593

Author(s):

Syiva Multi Fani ◽

Rukun Santoso ◽

Suparti Suparti

Keyword(s):

Social Media ◽

Text Mining ◽

Virtual Networks ◽

Number Of Clusters ◽

Silhouette Coefficient ◽

Twitter Account ◽

Computer Based ◽

Twitter Users ◽

Clustering Data ◽

Coefficient Method

Social media is computer-based technology that facilitates the sharing of ideas, thoughts, and information through the building of virtual networks and communities. Twitter is one of the most popular social media in Indonesia which has 78 million users. Businesses rely heavily on Twitter for advertising. Businesses can use these types of tweet content as a means of advertising to Twitter users by Knowing the types of tweet content that are mostly retweeted by their followers . In this study, the application of Text Mining to perform clustering using the K-means clustering method with the best number of clusters obtained from the Silhouette Coefficient method on the @bliblidotcom Twitter tweet data to determine the types of tweet content that are mostly retweeted by @bliblidotcom followers. Tweets with the most retweets and favorites are discount offers and flash sales, so Blibli Indonesia could use this kind of tweet to conduct advertising on social media Twitter because the prize quiz tweets are liked by the @bliblidotcom Twitter account followers.

The Epilepsy Ontology: a community-based ontology tailored for semantic interoperability and text-mining

10.21203/rs.3.rs-1259791/v1 ◽

2022 ◽

Author(s):

Astghik Sargsyan ◽

Philipp Wegner ◽

Stephan Gebel ◽

Shounak Baksi ◽

Geena Mariya Jose ◽

...

Keyword(s):

Text Mining ◽

Knowledge Exchange ◽

Biomedical Ontology ◽

Formal Ontology ◽

Basic Formal Ontology ◽

Community Members ◽

Domain Specific ◽

Complex Disorder ◽

Precise Understanding ◽

Structured Knowledge

Abstract Motivation: Epilepsy is a multi-faceted complex disorder that requires a precise understanding of the classification, diagnosis, treatment, and disease mechanism governing it. Although scattered resources are available on epilepsy, comprehensive and structured knowledge is missing. In contemplation to promote multidisciplinary knowledge exchange and facilitate advancement in clinical management, especially in pre-clinical research, a disease-specific ontology is necessary. The presented ontology is designed to enable better interconnection between scientific community members in the epilepsy domain.Results: The Epilepsy Ontology (EPIO) is an assembly of structured knowledge on various aspects of epilepsy, developed according to Basic Formal Ontology (BFO) and Open Biological and Biomedical Ontology (OBO) Foundry principles. Concepts and definitions are collected from the latest International League against Epilepsy (ILAE) classification, domain-specific ontologies, and scientific literature. This ontology consists of 1,879 classes and 28,151 axioms (2,171 declaration axioms, 2,219 logical axioms) from several aspects of epilepsy. This ontology is intended to be used for data management and text mining purposes.

ANALISIS KECENDERUNGAN LAPORAN MASYARAKAT PADA “LAPORGUB..!” PROVINSI JAWA TENGAH MENGGUNAKAN TEXT MINING DENGAN FUZZY C-MEANS CLUSTERING

Jurnal Gaussian ◽

10.14710/j.gauss.v10i4.33101 ◽

2022 ◽

Vol 10 (4) ◽

pp. 544-553

Author(s):

Ratna Kurniasari ◽

Rukun Santoso ◽

Alan Prahutama

Keyword(s):

Text Mining ◽

Cluster Center ◽

Text Data ◽

Fuzzy C Means ◽

Word Cloud ◽

Silhouette Coefficient ◽

Degree Of Membership ◽

Fuzzy C Means Clustering ◽

Hard Clustering ◽

The Government

Effective communication between the government and society is essential to achieve good governance. The government makes an effort to provide a means of public complaints through an online aspiration and complaint service called “LaporGub..!”. To group incoming reports easier, the topic of the report is searched by using clustering. Text Mining is used to convert text data into numeric data so that it can be processed further. Clustering is classified as soft clustering (fuzzy) and hard clustering. Hard clustering will divide data into clusters strictly without any overlapping membership with other clusters. Soft clustering can enter data into several clusters with a certain degree of membership value. Different membership values make fuzzy grouping have more natural results than hard clustering because objects at the boundary between several classes are not forced to fully fit into one class but each object is assigned a degree of membership. Fuzzy c-means has an advantage in terms of having a more precise placement of the cluster center compared to other cluster methods, by improving the cluster center repeatedly. The formation of the best number of clusters is seen based on the maximum silhouette coefficient. Wordcloud is used to determine the dominant topic in each cluster. Word cloud is a form of text data visualization. The results show that the maximum silhouette coefficient value for fuzzy c-means clustering is shown by the three clusters. The first cluster produces a word cloud regarding road conditions as many as 449 reports, the second cluster produces a word cloud regarding covid assistance as many as 964 reports, and the third cluster produces a word cloud regarding farmers fertilizers as many as 176 reports. The topic of the report regarding covid assistance is the cluster with the most number of members.

Text visualization for geological hazard documents via text mining and natural language processing

Earth Science Informatics ◽

10.1007/s12145-021-00732-0 ◽

2022 ◽

Author(s):

Ying Ma ◽

Zhong Xie ◽

Gang Li ◽

Kai Ma ◽

Zhen Huang ◽

...

Keyword(s):

Natural Language Processing ◽

Text Mining ◽

Natural Language ◽

Language Processing ◽

Geological Hazard ◽

Text Visualization

Analysis of sebaceous gland carcinoma associated genes using network analysis to identify potentially actionable genes

South Asian Journal of Experimental Biology ◽

10.38150/sajeb.11(6).p634-645 ◽

2022 ◽

Vol 11 (6) ◽

pp. 634-645

Author(s):

Nimita Kant ◽

Perumal Jayaraj ◽

Chitra

Keyword(s):

Network Analysis ◽

Text Mining ◽

Cell Fate ◽

Protein Interaction ◽

Sebaceous Gland ◽

Ppi Network ◽

Sebaceous Gland Carcinoma ◽

Protein Protein Interaction ◽

Pubmed Database ◽

Gland Carcinoma

Eyelid sebaceous gland carcinoma (SGC) is a rare but life-threatening condi-tion. However, there is limited computational research associated with un-derlying protein interactions specific to eyelid sebaceous gland carcinoma. The aim of our study is to identify and analyse the genes associated with eyelid sebaceous gland carcinoma using text mining and to develop a protein-protein interaction network to predict significant biological pathways using bioinformatics tool. Genes associated with eyelid sebaceous gland carcinoma were retrieved from the PubMed database using text mining with key terms ‘eyelid’, ‘sebaceous gland carcinoma’ and excluding the genes for ‘Muir-Torre Syndrome’. The interaction partners were identified using STRING. Cytoscape was used for visualization and analysis of the PPI network. Molec-ular complexes in the network were predicted using MCODE plug-in and ana-lyzed for gene ontology terms using DAVID. PubMed retrieval process identi-fied 79 genes related to eyelid sebaceous gland carcinoma. The PPI network associated with eyelid sebaceous gland carcinoma produced 79 nodes, 1768 edges. Network analysis using Cytoscape identified nine key genes and two molecular complexes to be enriched in the protein-protein interaction net-work. GO enrichment analysis identified biological processes cell fate com-mitment, Wnt signalling pathway, retinoic acid signalling and response to cytokines to be enriched in our network. Genes identified in the study might play a pivotal role in understanding the underlying molecular pathways in-volved in the development and progression of eyelid sebaceous gland carci-noma. Furthermore, it may aid in the identification of candidate biomarkers and therapeutic targets in the treatment of eyelid sebaceous gland carcino-ma.

Determining banking service attributes from online reviews: text mining and sentiment analysis

International Journal of Bank Marketing ◽

10.1108/ijbm-08-2021-0380 ◽

2022 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Divya Mittal ◽

Shiv Ratan Agrawal

Keyword(s):

Text Mining ◽

Customer Satisfaction ◽

Sentiment Analysis ◽

Banking Sector ◽

Online Reviews ◽

Socioeconomic Background ◽

Content Type ◽

Customer Reviews ◽

Banking Service ◽

The Interest Rate

PurposeThe current study employs text mining and sentiment analysis to identify core banking service attributes and customer sentiment in online user-generated reviews. Additionally, the study explains customer satisfaction based on the identified predictors.Design/methodology/approachA total of 32,217 customer reviews were collected across 29 top banks on bankbazaar.com posted from 2014 to 2021. In total three conceptual models were developed and evaluated employing regression analysis.FindingsThe study revealed that all variables were found to be statistically significant and affect customer satisfaction in their respective models except the interest rate.Research limitations/implicationsThe study is confined to the geographical representation of its subjects' i.e. Indian customers. A cross-cultural and socioeconomic background analysis of banking customers in different countries may help to better generalize the findings.Practical implicationsThe study makes essential theoretical and managerial contributions to the existing literature on services, particularly the banking sector.Originality/valueThis paper is unique in nature that focuses on banking customer satisfaction from online reviews and ratings using text mining and sentiment analysis.

text mining
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Automated Text Classification of Maintenance Data of Higher Education Buildings Using Text Mining and Machine Learning Techniques

Evaluation of the synergy degree of industrial de-capacity policies based on text mining: A case study of China's coal industry

Application of Informetrics on Financial Network Text Mining Based on Affective Computing

Recycling behaviour: Mapping knowledge domain through bibliometrics and text mining

PENERAPAN TEXT MINING UNTUK MELAKUKAN CLUSTERING DATA TWEET AKUN BLIBLI PADA MEDIA SOSIAL TWITTER MENGGUNAKAN K-MEANS CLUSTERING

The Epilepsy Ontology: a community-based ontology tailored for semantic interoperability and text-mining

ANALISIS KECENDERUNGAN LAPORAN MASYARAKAT PADA “LAPORGUB..!” PROVINSI JAWA TENGAH MENGGUNAKAN TEXT MINING DENGAN FUZZY C-MEANS CLUSTERING

Text visualization for geological hazard documents via text mining and natural language processing

Analysis of sebaceous gland carcinoma associated genes using network analysis to identify potentially actionable genes

Determining banking service attributes from online reviews: text mining and sentiment analysis

Export Citation Format

text miningRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Automated Text Classification of Maintenance Data of Higher Education Buildings Using Text Mining and Machine Learning Techniques

Evaluation of the synergy degree of industrial de-capacity policies based on text mining: A case study of China's coal industry

Application of Informetrics on Financial Network Text Mining Based on Affective Computing

Recycling behaviour: Mapping knowledge domain through bibliometrics and text mining

PENERAPAN TEXT MINING UNTUK MELAKUKAN CLUSTERING DATA TWEET AKUN BLIBLI PADA MEDIA SOSIAL TWITTER MENGGUNAKAN K-MEANS CLUSTERING

The Epilepsy Ontology: a community-based ontology tailored for semantic interoperability and text-mining

ANALISIS KECENDERUNGAN LAPORAN MASYARAKAT PADA “LAPORGUB..!” PROVINSI JAWA TENGAH MENGGUNAKAN TEXT MINING DENGAN FUZZY C-MEANS CLUSTERING

Text visualization for geological hazard documents via text mining and natural language processing

Analysis of sebaceous gland carcinoma associated genes using network analysis to identify potentially actionable genes

Determining banking service attributes from online reviews: text mining and sentiment analysis

text mining
Recently Published Documents