Data-Mining Techniques: A New Approach to Identifying the Links among Hybrid Strains of Pleurotus with Culture Media

In this study, a data set of mycelial and cultural characteristics of hybrid strains of Pleurotus ostreatus and Pleurotus djamor was analyzed using three data-mining techniques: the K-medoids clustering algorithm, the PCA biplot and the association rules algorithm. The characteristics evaluated were maximum velocity, lag phase, biomass and exopolysaccharide content in the cultivation of 50 hybrid strains of Pleurotus ostreatus and 50 hybrid strains of Pleurotus djamor. Different mixtures of culture media supplemented with Ecuadorian agricultural products were used. The parameter data obtained in the experiments were grouped into four clusters, giving a view of which hybrid strains of Pleurotus were most strongly related to each measured characteristic. Data-mining tools showed that the hybrid strains cultivated on solid-culture media (M1 = malt extract agar and rice flour) and liquid-culture media (L1 = maltose, yeast extract and rice flour) showed the highest values for the mycelial and cultural characteristics. These results are good indicators for improving the industrial production of edible fungi by using rice flour in the cultivation, contributing to the mushroom market and the circular economy.

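Teknik Data Mining Dalam Clustering Produksi Susu Segar Di Indonesia Dengan Algoritma K-Means [Data Mining Techniques for Clustering Fresh Milk Production in Indonesia with the K-Means Algorithm]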

BRAHMANA: Jurnal Penerapan Kecerdasan Buatan ◽ 10.30645/brahmana.v1i1.5 ◽ 2019 ◽ Vol 1 (1) ◽ pp. 31-39

Author(s): Ilham Safitra Damanik ◽ Sundari Retno Andani ◽ Dedi Sehendro

Keyword(s): Data Mining ◽ Milk Production ◽ Clustering Algorithm ◽ Clustering Method ◽ Data Mining Techniques ◽ Low Level ◽ Fresh Milk ◽ Nutritional Needs ◽ High Level ◽ Level Cluster

Milk is an important part of the diet for meeting nutritional needs, for both children and adults. Indonesia has many producers of fresh milk, but their output is not sufficient to meet national demand. Data mining is a field of computer science that is widely used in research. One data mining technique is clustering, a method of grouping data. Clustering works better when a large amount of data is available. The data used are provincial data for Indonesia from 2000 to 2017, obtained from the Central Statistics Agency. The study produces two clusters of milk-producing regions: high-level producers and low-level producers. From 27 records of fresh milk production in Indonesia, two provinces fall in the high-level cluster, namely West Java and East Java. The other 25, together with 7 provinces that were not part of the K-Means clustering calculation, fall in the low-level cluster.
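
The two-cluster grouping described in this abstract can be illustrated with a minimal K-Means sketch on one-dimensional data. The function name `kmeans_1d`, the deterministic initialization, and the region names and production figures are all assumptions made for illustration; they are not the study's data or its exact procedure.

```python
def kmeans_1d(values, k=2, iters=100):
    """Plain Lloyd-style K-Means on 1-D data (k >= 2); returns (centroids, labels)."""
    srt = sorted(values)
    # Deterministic start (an assumption): spread centroids across the sorted data.
    centroids = [srt[(len(srt) - 1) * i // (k - 1)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        # Assignment step: each value joins its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Update step: each centroid moves to the mean of its members.
        new = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            new.append(sum(members) / len(members) if members else centroids[c])
        if new == centroids:
            break
        centroids = new
    return centroids, labels


# Hypothetical production figures in arbitrary units, NOT the study's data.
production = {"A": 5.0, "B": 7.5, "C": 6.0, "D": 250.0, "E": 310.0, "F": 4.0}
centroids, labels = kmeans_1d(list(production.values()), k=2)

# The cluster with the larger centroid is treated as the high-level group.
high = max(range(2), key=lambda c: centroids[c])
high_regions = [name for name, lab in zip(production, labels) if lab == high]
```

With these made-up figures, the two regions with much larger output end up alone in the high-level cluster, which mirrors the shape of the result reported in the abstract.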

Failure Analysis in University and Computer Science Contexts With Data Mining

10.5753/wei.2020.11132 ◽ 2020

Author(s): Daniela De Souza Gomes ◽ Marcos Henrique Fonseca Ribeiro ◽ Giovanni Ventorim Comarela ◽ Gabriel Philippe Pereira

Keyword(s): Data Mining ◽ Decision Making ◽ Failure Analysis ◽ Computer Science ◽ Educational Administration ◽ Intelligent Systems ◽ Data Set ◽ Data Mining Techniques ◽ Study Case ◽ Support Students

High failure rates are a worrying and relevant problem in Brazilian universities. From a data set of student transcripts, we performed a case study for both general and Computer Science contexts, in which data mining techniques were used to find patterns concerning failures. The knowledge acquired can be used for better educational administration and to build intelligent systems that support students’ decision making.

Privacy Preservation using (L, D) Inference Model Based on Dependency Identification Information Gain

International Journal of Engineering and Advanced Technology - Regular Issue ◽ 10.35940/ijeat.f1196.0986s319 ◽ 2019 ◽ Vol 8 (6S3) ◽ pp. 1170-1173

Keyword(s): Data Mining ◽ Information Gain ◽ Original Data ◽ Perturbation Approach ◽ Sensitive Information ◽ Functional Dependencies ◽ Inference Model ◽ Data Set ◽ Data Mining Techniques ◽ Original Dataset

With improvements in information processing and memory capacity, vast amounts of data are collected for various data analysis purposes. Data mining techniques are used to extract useful knowledge. In the process of extracting data with data mining techniques, the data can be exposed publicly, which leads to breaches of private data. Privacy-preserving data mining is used to protect sensitive information from unwanted or unsanctioned disclosure. In this paper, we analyse the problem of discovering similarity checks for functional dependencies in a given dataset, such that applying the (l, d) inference algorithm with generalization can anonymise the microdata without loss of utility. [8] This work presents a functional-dependency-based perturbation approach that hides sensitive information from the user by applying the (l, d) inference model to the dependency attributes based on Information Gain. The approach works on both categorical and numerical attributes. The perturbed data set does not affect the original dataset; it maintains the same or very comparable patterns as the original data set. Hence the utility of the application is always high compared to other data mining techniques. The accuracy of the original and perturbed datasets is compared and analysed using data mining classification algorithms.

A Hybrid Method for Prediction and Assessment Efficiency of Decision Making Units

International Journal of Decision Support System Technology ◽ 10.4018/jdsst.2013010104 ◽ 2013 ◽ Vol 5 (1) ◽ pp. 66-83 ◽ Cited By ~ 1

Author(s): Iman Rahimi ◽ Reza Behmanesh ◽ Rosnah Mohd. Yusuff

Keyword(s): Data Mining ◽ Decision Making ◽ Decision Rules ◽ Large Data ◽ Poultry Meat ◽ Small Data ◽ Data Set ◽ Data Mining Techniques ◽ Decision Making Units

The objective of this article is to evaluate and assess the efficiency of poultry meat farms, as a case study, with a new method. The poultry farm industry is one of the most important sub-sectors in comparison to others. The purpose of this study is the prediction and assessment of the efficiency of poultry farms as decision making units (DMUs). Although several methods have been proposed for solving this problem, the authors need a methodology that discriminates performance strongly. Their methodology comprises data envelopment analysis and data mining techniques such as artificial neural network (ANN), decision tree (DT), and cluster analysis (CA). As a case study, data for the analysis were collected from 22 poultry companies in Iran. Moreover, because the data set is small and data mining techniques call for large data sets, they employed the k-fold cross validation method to validate their model. Assessing the efficiency of each DMU, clustering the DMUs, applying the model and presenting decision rules results in a precise and accurate optimizing technique.
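
The k-fold cross validation mentioned in this abstract can be sketched as an index split over the 22 units. The value k = 5 and the helper name `k_fold_indices` are assumptions for illustration; the abstract does not state the number of folds or how the units were ordered.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    # Spread the remainder so that fold sizes differ by at most one.
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, test
        start += size


# 22 units as in the abstract; k = 5 is an assumed value.
folds = list(k_fold_indices(22, 5))
fold_sizes = [len(test) for _, test in folds]
```

In practice the indices would usually be shuffled before splitting, so that the folds do not depend on the order in which the units were recorded.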

Optimization of the promotion mix in the healthcare industry

International Journal of Pharmaceutical and Healthcare Marketing ◽ 10.1108/ijphm-03-2013-0008 ◽ 2015 ◽ Vol 9 (4) ◽ pp. 289-305

Author(s): Dominique Haughton ◽ Guangying Hua ◽ Danny Jin ◽ John Lin ◽ Qizhi Wei ◽ ...

Keyword(s): Data Mining ◽ Indirect Effects ◽ Directed Acyclic Graphs ◽ Optimization Process ◽ Sales Volume ◽ Healthcare Industry ◽ Direct And Indirect Effects ◽ Data Set ◽ Content Type ◽ Data Mining Techniques

Purpose – The purpose of this paper is to propose data mining techniques to model the return on investment from various types of promotional spending to market a drug and then use the model to draw conclusions on how the pharmaceutical industry might go about allocating promotion expenditures in a more efficient manner, potentially reducing costs to the consumer. The main contributions of the paper are two-fold. First, it demonstrates how to undertake a promotion mix optimization process in the pharmaceutical context and carry it through from the beginning to the end. Second, the paper proposes using directed acyclic graphs (DAGs) to help unravel the direct and indirect effects of various promotional media on sales volume. Design/methodology/approach – A synthetic data set was constructed to prototype the proposed data mining techniques, and two analysis approaches were investigated. Findings – The two methods were found to yield insights into the problem of the promotion mix in the context of the healthcare industry. First, a factor analysis followed by a regression analysis and an optimization algorithm applied to the resulting equation were used. Second, a DAG was used to unravel direct and indirect effects of promotional expenditures on new prescriptions. Research limitations/implications – The data are synthetic and do not incorporate any time autocorrelations. Practical implications – The promotion mix optimization process is demonstrated from the beginning to the end, and the issue of negative coefficients in promotion mix models is addressed. In addition, a method is proposed to identify direct and indirect effects on new prescriptions. Social implications – A better allocation of promotional expenditures has the potential for reducing the cost of healthcare to consumers. Originality/value – The contributions of the paper are two-fold. First, for the first time in the literature (to the best of the authors’ knowledge), the authors have undertaken a promotion mix optimization process and have carried it through from the beginning to the end. Second, the authors propose the use of DAGs to help unravel the effects of various promotion media on sales volume, notably direct and indirect effects.

Association Rules Mining Based on Adaptive Fuzzy Clustering Algorithm

Advanced Materials Research ◽ 10.4028/www.scientific.net/amr.998-999.842 ◽ 2014 ◽ Vol 998-999 ◽ pp. 842-845 ◽ Cited By ~ 1

Author(s): Jia Mei Guo ◽ Yin Xiang Pei

Keyword(s): Data Mining ◽ Association Rules ◽ Clustering Algorithm ◽ Original Data ◽ Data Set ◽ Association Rules Mining ◽ Fuzzy Association Rules ◽ Redundant Data ◽ Fuzzy Partitions ◽ Rules Extraction

Association rule extraction is one of the important goals of data mining and analysis. To address the information loss caused by crisp partitioning of numerical attributes, this article puts forward a fuzzy association rule mining method based on fuzzy logic. First, we use c-means clustering to generate fuzzy partitions and eliminate redundant data; then we map the original data set into fuzzy intervals; finally, we extract the fuzzy association rules from the fuzzy data set to provide a basis for proper decision-making. Results show that this method can effectively improve the efficiency of data mining and the semantic visualization and credibility of association rules.
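
To make the idea of a fuzzy partition concrete, the sketch below computes the standard fuzzy c-means membership of a single numeric value with respect to a set of cluster centers. It covers only the membership step, not the full clustering loop or the rule extraction, and the centers, the fuzzifier m = 2 and the function name are assumptions for illustration rather than details taken from the article.

```python
def fuzzy_memberships(x, centers, m=2.0):
    """Fuzzy c-means membership of value x in each cluster (centers distinct, m > 1)."""
    dists = [abs(x - c) for c in centers]
    # A value sitting exactly on a center belongs fully to that center.
    if 0.0 in dists:
        return [1.0 if d == 0.0 else 0.0 for d in dists]
    power = 2.0 / (m - 1.0)
    # u_i = 1 / sum_k (d_i / d_k)^(2 / (m - 1)); memberships sum to 1.
    return [1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists]


# Hypothetical centers for a numeric attribute, e.g. "low", "medium", "high".
centers = [20.0, 40.0, 60.0]
u = fuzzy_memberships(30.0, centers)
```

Unlike a crisp partition, a value halfway between two centers belongs partly to both clusters, which is the information a crisp cut would discard.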

DATA MINING TECHNIQUES FOR EDUCATIONAL DATA: A REVIEW

International Journal of Engineering Technologies and Management Research ◽ 10.29121/ijetmr.v5.i2.2018.641 ◽ 2020 ◽ Vol 5 (2) ◽ pp. 166-177 ◽ Cited By ~ 1

Author(s): Pragati Sharma ◽ Dr. Sanjiv Sharma

Keyword(s): Higher Education ◽ Data Mining ◽ Decision Making ◽ Educational Institutions ◽ Data Set ◽ Data Mining Techniques ◽ Use Of Data ◽ New Knowledge ◽ Hidden Patterns ◽ Educational Field

Recently, data mining has been gaining popularity among researchers. Data mining provides various techniques and methods for analysing data produced by applications in different domains. Similarly, educational mining provides a way of analysing educational data sets. Educational mining is concerned with developing methods for discovering knowledge from data that come from the educational field; it helps to extract hidden patterns and to discover new knowledge from large educational databases with the use of data mining techniques and tools. Knowledge extracted through educational mining can be used for decision making in higher educational institutions. This paper is based on a literature review of different data mining techniques, along with certain algorithms such as classification and clustering. This paper presents the effectiveness of mining techniques applied to educational data sets for higher education institutions.

A dynamic K-means clustering for data mining

Indonesian Journal of Electrical Engineering and Computer Science ◽ 10.11591/ijeecs.v13.i2.pp521-526 ◽ 2019 ◽ Vol 13 (2) ◽ pp. 521

Author(s): Md. Zakir Hossain ◽ Md. Nasim Akhtar ◽ R.B. Ahmad ◽ Mostafijur Rahman

Keyword(s): Data Mining ◽ Clustering Algorithm ◽ Large Data ◽ Threshold Value ◽ Specific Pattern ◽ Large Data Sets ◽ Data Sets ◽ Data Set ◽ Number Of Clusters ◽ Data Points

Data mining is the process of finding structure in large data sets. With this process, decision makers can make particular decisions for the further development of real-world problems. Several data clustering techniques are used in data mining for finding a specific pattern in data. The K-means method is one of the familiar clustering techniques for clustering large data sets. The K-means clustering method partitions the data set based on the assumption that the number of clusters is fixed. The main problem of this method is that if the number of clusters chosen is small, there is a higher probability of adding dissimilar items to the same group. On the other hand, if the number of clusters chosen is high, there is a higher chance of adding similar items to different groups. In this paper, we address this issue by proposing a new K-Means clustering algorithm. The proposed method performs data clustering dynamically. It initially calculates a threshold value as a centroid of K-Means, and the number of clusters is formed based on this value. At each iteration of K-Means, if the Euclidean distance between two points is less than or equal to the threshold value, then these two data points will be in the same group. Otherwise, the proposed method will create a new cluster with the dissimilar data point. The results show that the proposed method outperforms the original K-Means method.
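
The threshold rule in this abstract (join a group when the distance is within the threshold, otherwise open a new cluster) can be sketched loosely as follows. This is not the authors' algorithm: the abstract does not specify how the threshold is computed, so here it is passed in as a parameter, and comparing each point against running cluster centroids is an assumption, as is the function name.

```python
import math


def threshold_clustering(points, threshold):
    """A point joins the nearest cluster if it lies within `threshold` of that
    cluster's centroid; otherwise it starts a new cluster."""
    clusters = []   # list of lists of points
    centroids = []  # running mean of each cluster
    for p in points:
        best, best_dist = None, None
        for i, c in enumerate(centroids):
            d = math.dist(p, c)
            if best_dist is None or d < best_dist:
                best, best_dist = i, d
        if best is not None and best_dist <= threshold:
            clusters[best].append(p)
            members = clusters[best]
            # Recompute the centroid as the coordinate-wise mean.
            centroids[best] = tuple(sum(x) / len(members) for x in zip(*members))
        else:
            # Dissimilar point: it becomes the seed of a new cluster.
            clusters.append([p])
            centroids.append(tuple(p))
    return clusters


# Illustrative points and threshold, chosen by hand for this sketch.
points = [(0, 0), (0, 1), (10, 10), (10, 11), (0.5, 0.5)]
dynamic_clusters = threshold_clustering(points, threshold=2.0)
```

The number of clusters is not fixed in advance here; it grows whenever a point is farther than the threshold from every existing centroid, which is the dynamic behaviour the abstract describes.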

Modified Single Pass Clustering Algorithm Based on Median as a Threshold Similarity Value

Collaborative Filtering Using Data Mining and Analysis - Advances in Data Mining and Database Management ◽ 10.4018/978-1-5225-0489-4.ch002 ◽ 2017 ◽ pp. 24-48 ◽ Cited By ~ 1

Author(s): Mamta Mittal ◽ R. K. Sharma ◽ V.P. Singh ◽ Lalit Mohan Goyal

Keyword(s): Data Mining ◽ Clustering Algorithm ◽ Clustering Algorithms ◽ Data Mining Techniques ◽ Single Pass ◽ Hidden Patterns ◽ Data Objects

Clustering is one of the data mining techniques that investigate data resources for hidden patterns. Many clustering algorithms are available in the literature. This chapter emphasizes partitioning-based methods and is an attempt towards developing clustering algorithms that can efficiently detect clusters. Among partitioning-based methods, k-means and single pass clustering are popular clustering algorithms, but they have several limitations. To overcome the limitations of these algorithms, a Modified Single Pass Clustering (MSPC) algorithm is proposed in this work. It revolves around the proposition of a threshold similarity value. This is not a user-defined parameter; instead, it is a function of the data objects left to be clustered. In our experiments, this threshold similarity value is taken as the median of the pairwise distances of all data objects left to be clustered. To assess the performance of the MSPC algorithm, five experiments with the k-means, SPC and MSPC algorithms have been carried out on artificial and real datasets.
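
The threshold described in this abstract, the median of the pairwise distances among the objects still to be clustered, can be computed as in the sketch below. Euclidean distance, the function name and the sample points are assumptions for illustration; the rest of the MSPC procedure is not reproduced here.

```python
import math
from itertools import combinations
from statistics import median


def median_pairwise_distance(objects):
    """Median of the Euclidean distances over all unordered pairs (needs >= 2 objects)."""
    return median(math.dist(a, b) for a, b in combinations(objects, 2))


# Hypothetical objects still waiting to be clustered.
remaining = [(0, 0), (3, 4), (6, 8)]
threshold = median_pairwise_distance(remaining)
```

Because the threshold is recomputed from whatever objects remain, it adapts to the data instead of being fixed by the user, which is the property the abstract emphasizes.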

Study on Fuzzy Clustering Algorithm of Spatial Data Mining

Applied Mechanics and Materials ◽ 10.4028/www.scientific.net/amm.416-417.1244 ◽ 2013 ◽ Vol 416-417 ◽ pp. 1244-1250

Author(s): Ting Ting Zhao

Keyword(s): Data Mining ◽ Fuzzy Clustering ◽ Spatial Data ◽ Clustering Algorithm ◽ Spatial Clustering ◽ Rapid Development ◽ Spatial Database ◽ Spatial Data Mining ◽ Data Set ◽ Fuzzy Similarity

With the rapid development of space information crawl technology, the variety of spatial databases and the size of the data they hold increase continuously. How to extract valuable information from complicated spatial data has become an urgent issue. Spatial data mining provides a new approach to solving this problem. The paper introduces fuzzy clustering into the field of spatial data clustering, studies how fuzzy set theory can be applied to spatial data mining, and proposes a spatial clustering algorithm based on a fuzzy similarity matrix, the fuzzy similarity clustering algorithm. The algorithm not only overcomes the disadvantage that fuzzy clustering cannot process large data sets, but also provides a similarity measurement between objects.
