Data Mining Using K-Means Clustering Algorithm for Grouping Countries of Origin of Foreign Tourist

2019 ◽

Vol 1 (1) ◽

pp. 31-39

Author(s):

Ilham Safitra Damanik ◽

Sundari Retno Andani ◽

Dedi Sehendro

Keyword(s):

Data Mining ◽

Milk Production ◽

Clustering Algorithm ◽

Clustering Method ◽

Data Mining Techniques ◽

Low Level ◽

Fresh Milk ◽

Nutritional Needs ◽

High Level ◽

Level Cluster

Milk is an important part of the diet for meeting nutritional needs, for children and adults alike. Indonesia has many fresh-milk producers, but their output does not cover national demand. Data mining is a branch of computer science that is widely used in research, and clustering, one of its techniques, groups similar data together. Clustering tends to work better when more data are available. The data used here are provincial figures for Indonesia from 2000 to 2017, obtained from the Central Statistics Agency. The study clusters the provinces into two groups: high-level and low-level fresh-milk producers. Of the 27 fresh-milk production records, two provinces fall into the high-level cluster, namely West Java and East Java. The other 25, together with 7 provinces that were not included in the K-Means calculation, fall into the low-level cluster.
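
As a rough illustration of splitting one-dimensional production figures into a low and a high cluster with k-means (k = 2), here is a minimal Python sketch. The province labels and numbers are hypothetical placeholders, not the study's data, and initializing the centroids at the minimum and maximum is an assumption made for this example rather than the authors' procedure.

```python
def two_means_1d(values, max_iter=100):
    """Split 1-D values into a low and a high cluster with k-means (k=2)."""
    lo, hi = min(values), max(values)  # assumed initial centroids: the extremes
    labels = []
    for _ in range(max_iter):
        # Assignment step: each value joins its nearest centroid.
        labels = ["high" if abs(v - hi) < abs(v - lo) else "low" for v in values]
        low_vals = [v for v, lab in zip(values, labels) if lab == "low"]
        high_vals = [v for v, lab in zip(values, labels) if lab == "high"]
        # Update step: move each centroid to the mean of its members.
        new_lo = sum(low_vals) / len(low_vals)
        new_hi = sum(high_vals) / len(high_vals) if high_vals else hi
        if (new_lo, new_hi) == (lo, hi):
            break  # centroids stopped moving
        lo, hi = new_lo, new_hi
    return labels, (lo, hi)


# Hypothetical production figures (arbitrary units) for six made-up provinces.
production = {"A": 5, "B": 8, "C": 250, "D": 6, "E": 300, "F": 12}
labels, centroids = two_means_1d(list(production.values()))
clusters = dict(zip(production, labels))
```

With these placeholder figures, provinces C and E end up in the high cluster and the rest in the low cluster.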

K-MEANS CLUSTERING ALGORITHM FOR SERVICE DATA ANALYSIS BASED ON CUSTOMERS COMBINATION

Unes journal of Information System ◽

10.31933/ujis.3.1.001-007.2018 ◽

2018 ◽

Vol 3 (1) ◽

pp. 001

Author(s):

Zulhendra Zulhendra ◽

Gunadi Widi Nurcahyo ◽

Julius Santony

Keyword(s):

Data Mining ◽

Data Analysis ◽

Clustering Algorithm ◽

Customer Complaints ◽

Using Data ◽

Clustering Data ◽

Service Data ◽

Selection Of

This study applies a data mining technique, K-Means clustering. Data mining supports the analysis of large data sets; here the aim is to let Indocomputer understand and group its service data by customer complaint, using the Weka software. The K-Means clustering algorithm is used to predict or classify complaints about hardware damage at Indocomputer Payakumbuh. The results also show which laptop brands are most often brought in for service at Indocomputer Payakumbuh, which can serve as one recommendation to consumers choosing a laptop.

The Sustainable Development of Financial Topic Detection and Trend Prediction by Data Mining

Sustainability ◽

10.3390/su13147585 ◽

2021 ◽

Vol 13 (14) ◽

pp. 7585

Author(s):

Yunmei Liu ◽

Shuai Zhang ◽

Min Chen ◽

Yenchun Wu ◽

Zhengxian Chen

Keyword(s):

Data Mining ◽

Supply Chain ◽

Financial Institutions ◽

Clustering Algorithm ◽

Development Trend ◽

Research Field ◽

Trend Prediction ◽

Blockchain Technology ◽

Supply Chain Finance ◽

The Government

Blockchain technology is among the most cutting-edge technologies in financial technology and has attracted extensive attention from governments, financial institutions, and investors in many countries. Blockchain and finance, as an interdisciplinary, cross-technology, and cross-field topic, has certain limitations in both theory and application. Based on bibliometric data from Web of Science, this paper mines 759 papers on blockchain technology in the financial field by means of co-word analysis, a bi-clustering algorithm, and strategic coordinate analysis, in order to identify hot topics in the field and predict future development trends. The experiments identify ten research topics at the intersection of blockchain and finance: blockchain crowdfunding, Fintech, cryptocurrency, consensus mechanisms, the Internet of Things, digital finance, medical insurance, supply chain finance, smart contracts, and financial innovation. Among them, blockchain crowdfunding, Fintech, cryptocurrency, and supply chain finance are the key research directions. Finally, the paper analyzes the opportunities and risks of blockchain development in the financial field and puts forward targeted suggestions for the government and financial institutions.
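
Co-word analysis starts from counts of how often two keywords appear on the same paper. The Python sketch below shows only that counting step; the keyword lists are hypothetical, and the later bi-clustering and strategic coordinate steps mentioned in the abstract are not reproduced here.

```python
from collections import Counter
from itertools import combinations


def coword_counts(keyword_lists):
    """Count how many papers each unordered keyword pair co-occurs in."""
    counts = Counter()
    for keywords in keyword_lists:
        # Sort and deduplicate so each pair has a single canonical key.
        for pair in combinations(sorted(set(keywords)), 2):
            counts[pair] += 1
    return counts


# Hypothetical keyword lists for three made-up papers.
papers = [
    ["blockchain", "supply chain finance", "fintech"],
    ["blockchain", "fintech", "crowdfunding"],
    ["blockchain", "supply chain finance"],
]
counts = coword_counts(papers)
```

Pairs with high counts, such as ("blockchain", "fintech") in this toy input, are the ones a clustering step would tend to place in the same topic.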

Improved K-Means Clustering Algorithm for Big Data Mining under Hadoop Parallel Framework

Journal of Grid Computing ◽

10.1007/s10723-019-09503-0 ◽

2019 ◽

Vol 18 (2) ◽

pp. 239-250 ◽

Cited By ~ 3

Author(s):

Weijia Lu

Keyword(s):

Data Mining ◽

Big Data ◽

Clustering Algorithm ◽

Big Data Mining

Analysis on Network Clustering Algorithm of Data Mining Methods Based on Rough Set Theory

2011 Fourth International Symposium on Knowledge Acquisition and Modeling ◽

10.1109/kam.2011.85 ◽

2011 ◽

Author(s):

Xiao-rong Ye

Keyword(s):

Data Mining ◽

Set Theory ◽

Rough Set ◽

Clustering Algorithm ◽

Rough Set Theory ◽

Network Clustering ◽

Mining Methods

A Clustering Algorithm in Stream Data Using Strong Coreset

Journal of Interconnection Networks ◽

10.1142/s0219265921430118 ◽

2021 ◽

Author(s):

Manmohan Singh ◽

Rajendra Pamula ◽

Alok Kumar

Keyword(s):

Data Mining ◽

Clustering Algorithm ◽

Local Optimum ◽

Reduction Algorithm ◽

Stream Data ◽

Stream Data Mining ◽

Clustering Approach ◽

Approximation Guarantee ◽

Competitive Algorithms ◽

Learning Data

Clustering has many applications in machine learning, data mining, data compression, and pattern recognition. Existing techniques such as Lloyd's algorithm (often simply called k-means) converge only to a local optimum and offer no approximation guarantee. To overcome these shortcomings, this paper proposes an efficient k-means clustering approach for stream data mining. The coreset is a popular and fundamental concept for k-means clustering on stream data. In each step, reduction determines a coreset of inputs, and represents the error, where P represents the number of input points according to the nested property of coresets. Hence, a small reduction in the error of the final coreset gets n times more accurate. This motivated the authors to propose a new coreset-reduction algorithm. The proposed algorithm was run on the Covertype, Spambase, Census 1990, Bigcross, and Tower datasets. It outperforms competing algorithms such as StreamKM++, BICO (BIRCH meets Coresets for k-means clustering), and BIRCH (Balanced Iterative Reducing and Clustering using Hierarchies).
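
The local-optimum issue mentioned in the abstract can be seen on a tiny example. The Python sketch below is a plain version of Lloyd's iterations, not the paper's coreset-reduction algorithm; the four points and both sets of initial centers are made up for illustration.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def lloyd(points, centers, max_iter=100):
    """Run Lloyd's iterations from the given centers; return (centers, cost)."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # Assignment step: group each point with its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its old center).
        new_centers = [
            tuple(sum(coord) / len(members) for coord in zip(*members)) if members else old
            for members, old in zip(clusters, centers)
        ]
        if new_centers == centers:
            break  # converged: no center moved
        centers = new_centers
    cost = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, cost


# Corners of a 4 x 1 rectangle: the best 2-clustering splits left from right.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]
_, good_cost = lloyd(points, [(0.0, 0.5), (4.0, 0.5)])  # converges to cost 1.0
_, bad_cost = lloyd(points, [(2.0, 0.0), (2.0, 1.0)])   # stuck at cost 16.0
```

Both runs converge, but the second stays at a top/bottom split whose cost is sixteen times higher, which is the behaviour that initialization schemes and coreset-based methods aim to avoid.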

Application of Digital Mining Facing Information Fusion Technology in the Field of National Costume Culture Design

Mobile Information Systems ◽

10.1155/2021/3790413 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Min Yu ◽

Rongrong Cui

Keyword(s):

Data Mining ◽

Internet Of Things ◽

Data Processing ◽

Clustering Algorithm ◽

Laplacian Matrix ◽

Design System ◽

Design Effect ◽

Constraint Matrix ◽

Clothing Design ◽

Culture Design

To improve the design of minority clothing, this paper uses data mining and Internet of Things technologies to build an intelligent ethnic clothing design system that meets customer needs, based on the idea of human-computer interaction. For data processing, it uses a constrained spectral clustering algorithm that takes the Laplacian matrix and the constraint matrix as input and outputs a clustering indicator vector. The system's performance is then verified experimentally. The experiments indicate that the proposed system, built on the Internet of Things and data mining, can improve minority clothing design.

Efficient and Privacy-Preserving Multi-User Outsourced K-Means Clustering

Computer and Information Science ◽

10.5539/cis.v14n2p26 ◽

2021 ◽

Vol 14 (2) ◽

pp. 26

Author(s):

Na Li ◽

Lianguan Huang ◽

Yanling Li ◽

Meng Sun

Keyword(s):

Data Mining ◽

Big Data ◽

Clustering Algorithm ◽

Privacy Preserving ◽

Locality Sensitive Hashing ◽

Sensitive Information ◽

The Public ◽

Big Data Mining ◽

Euclidean Distances ◽

Computational Resources

In recent years, with the development of the Internet, the volume of data on the network has grown explosively. Big data mining aims to obtain useful information through data processing such as clustering and classification. Clustering is an important branch of big data mining and is popular because of its simplicity. A new trend for clients who lack storage and computational resources is to outsource the data and the clustering task to public cloud platforms. However, because datasets used for clustering may contain sensitive information (e.g., identity information, health information), simply outsourcing them to the cloud does not protect privacy, so clients tend to encrypt their databases before uploading them for clustering. In this paper, we focus on privacy protection and efficiency improvement for k-means clustering, and we propose a new privacy-preserving multi-user outsourced k-means clustering algorithm based on locality sensitive hashing (LSH). The algorithm encrypts the databases with the Paillier cryptosystem and uses LSH to prune unnecessary computations during clustering; that is, it does not need to compute the Euclidean distance between every data record and every cluster center. Finally, the theoretical and experimental results show that our algorithm is more efficient than most existing privacy-preserving k-means clustering schemes.
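
The Paillier cryptosystem is additively homomorphic: multiplying two ciphertexts yields an encryption of the sum of the plaintexts, which is what lets an untrusted server add encrypted values without seeing them. The Python sketch below is a textbook toy version with tiny hard-coded primes and fixed randomness, purely for illustration; it is not secure and is not the authors' protocol. It assumes Python 3.8 or later for `pow(x, -1, n)`.

```python
from math import gcd


def keygen(p, q):
    """Toy Paillier key generation from two small primes (insecure sizes)."""
    n = p * q
    lam = (p - 1) * (q - 1) // gcd(p - 1, q - 1)  # lcm(p - 1, q - 1)
    g = n + 1
    # mu is the inverse of L(g^lam mod n^2) modulo n, where L(x) = (x - 1) // n.
    mu = pow((pow(g, lam, n * n) - 1) // n, -1, n)
    return (n, g), (lam, mu)


def encrypt(pub, m, r):
    """Encrypt m with randomness r, which must be coprime to n."""
    n, g = pub
    return (pow(g, m, n * n) * pow(r, n, n * n)) % (n * n)


def decrypt(pub, priv, c):
    """Recover the plaintext from ciphertext c."""
    n, _ = pub
    lam, mu = priv
    return ((pow(c, lam, n * n) - 1) // n * mu) % n


pub, priv = keygen(11, 13)       # n = 143; real keys use large random primes
c1 = encrypt(pub, 20, r=7)       # r is fixed here only to keep the demo repeatable
c2 = encrypt(pub, 35, r=23)
c_sum = (c1 * c2) % (pub[0] ** 2)  # ciphertext product = encrypted sum
total = decrypt(pub, priv, c_sum)  # 20 + 35 = 55
```

In a k-means setting, this kind of encrypted addition is one way coordinate sums for centroid updates could be accumulated on the server side.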

Improved minimum-minimum roughness algorithm for clustering categorical data

International Journal of ADVANCED AND APPLIED SCIENCES ◽

10.21833/ijaas.2021.10.006 ◽

2021 ◽

Vol 8 (10) ◽

pp. 43-50

Author(s):

Truong et al.

Keyword(s):

Machine Learning ◽

Data Mining ◽

Hierarchical Clustering ◽

Categorical Data ◽

Clustering Algorithm ◽

Clustering Algorithms ◽

Experimental Results ◽

Data Sets ◽

Top Down ◽

Hierarchical Clustering Algorithm

Clustering is a fundamental technique in data mining and machine learning. Recently, many researchers have become interested in the problem of clustering categorical data, and several new approaches have been proposed. One of the successful and pioneering clustering algorithms is the Minimum-Minimum Roughness algorithm (MMR), a top-down hierarchical clustering algorithm that can handle the uncertainty in clustering categorical data. However, MMR tends to choose the category with less value leaf node with more objects, leading to undesirable clustering results. To overcome such shortcomings, this paper proposes an improved version of the MMR algorithm for clustering categorical data, called IMMR (Improved Minimum-Minimum Roughness). Experimental results on real data sets taken from UCI show that the IMMR algorithm outperforms MMR in clustering categorical data.

Clustering Algorithm for Data Mining using Posterior Probability-based Information Entropy

Journal of Digital Convergence ◽

10.14400/jdc.2014.12.12.293 ◽

2014 ◽

Vol 12 (12) ◽

pp. 293-301 ◽

Cited By ~ 2

Author(s):

In-Kyoo Park

Keyword(s):

Data Mining ◽

Information Entropy ◽

Posterior Probability ◽

Clustering Algorithm

Teknik Data Mining Dalam Clustering Produksi Susu Segar Di Indonesia Dengan Algoritma K-Means