data mining technique
Recently Published Documents

Total documents: 848 (five years: 284)
H-index: 21 (five years: 5)

2022, pp. 24-56
Author(s): Rajab Ssemwogerere, Wamwoyo Faruk, Nambobi Mutwalibi

Classification is a data mining technique used to estimate the group membership of items on the basis of a common feature. It is useful for future planning and for discovering new knowledge about a specific dataset. An in-depth study of previous literature implementing data mining techniques in the design of recommender systems was performed. This chapter provides a broad study of how recommender systems are designed using various data mining classification techniques from machine learning, and examines their methodological decisions in four aspects: recommendation approaches, data mining techniques, recommendation types, and performance measures. The study focuses on selected classification methods and can support both researchers and students in computer science and machine learning in strengthening their knowledge of machine learning hypotheses and data mining.


2022, pp. 42-71
Author(s): Artemisa Rocha Dores, Andreia Geraldo, Helena Martins

Intervention in mental health calls for new solutions that merge solid theoretical foundations with the new possibilities provided by technological development. This chapter is structured around results from a data mining technique using VOSViewer, which organized the field into five clusters of published literature: (1) most affected populations, (2) mental illness/disorders and their impact, (3) the expansion of remote interventions, (4) ICT potential to overcome limitations, and (5) a positive approach to ICTs in mental health care. Solutions and recommendations are presented to overcome the issues identified, including how future interventions should consider old and new issues such as the ones raised by the COVID-19 pandemic. Computer-based or web-based interventions are presented as part of the revolution towards digital mental health or e-mental health. This approach has the potential to deconfine interventions, releasing them from traditional settings and reaching new populations. It also reinforces the path already started, from secondary to primary and primordial prevention, towards the modification of psychopathological trajectories.


Author(s): Kalyana Saravanan, Angamuthu Tamilarasi

Big data refers to collections of large volumes of data, and a common task is to extract similar data points from a large dataset. Clustering is an essential data mining technique for examining large volumes of data. Several techniques have been developed for handling big datasets. However, their high time consumption and space complexity tend to compromise accuracy. To improve clustering accuracy with less complexity, the Sørensen-Dice Indexing based Weighted Iterative X-means Clustering (SDI-WIXC) technique is introduced. SDI-WIXC groups similar data points with higher clustering accuracy and minimal time. First, a number of data points are collected from the big dataset. Then, along with a weight value, the dataset is partitioned into ‘X’ clusters. Next, based on the similarity measure, Weighted Iterative X-means Clustering (WIXC) is applied to cluster the data points. The Sørensen-Dice indexing process measures the similarity between each cluster's weight value and the data points. When a data point is found to be similar to a cluster's weight value, it is grouped into that cluster. In addition, the WIXC method improves the cluster assignments through repeated subdivision using a Bayesian probability criterion. This in turn helps to group all data points and hence improves the clustering accuracy. Experimental evaluation is carried out on factors such as clustering accuracy, clustering time, and space complexity with respect to the number of data points. The experimental results show that the proposed SDI-WIXC technique obtains high clustering accuracy with minimal time and space complexity.
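The abstract does not spell out how the Sørensen-Dice index is applied to weight values, so the following is only a minimal sketch of the standard set-based Sørensen-Dice coefficient, 2|A ∩ B| / (|A| + |B|). The function name `sorensen_dice` and the example feature sets are hypothetical and are not taken from the SDI-WIXC paper.

```python
def sorensen_dice(a: set, b: set) -> float:
    """Standard Sørensen-Dice coefficient between two sets (0.0 to 1.0)."""
    if not a and not b:
        # Convention chosen for this sketch: two empty sets count as identical.
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


# Hypothetical feature sets for a data point and a cluster representative.
point_features = {"a", "b", "c"}
cluster_features = {"b", "c", "d"}
similarity = sorensen_dice(point_features, cluster_features)
```

A point could then be assigned to whichever cluster yields the highest coefficient, which is one plausible reading of the grouping step described above.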


Author(s): Trisna Yuniarti, Dahliyah Hayati

The oil palm is the most productive plantation crop in Indonesia. Government strategies and policies related to oil palm plantations continue to be developed, given that the plantation area increases every year. Segmenting oil palm plantations by area, production, and productivity aims to identify groups of potential oil palm plantations across Indonesia. This segmentation can inform the strategies and policies formulated by the government. The segmentation method for grouping oil palm plantations uses the K-Means clustering data mining technique with 3 clusters specified. The data mining stages run from data collection through to representation; of the 34 data sets collected, only 25 could be processed further. The grouping produced three plantation segments: 72% of plantations in the low-potential group, 20% in the medium-potential group, and 8% in the high-potential group.
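The segmentation described above can be illustrated with a minimal sketch of K-Means (Lloyd's algorithm) with 3 clusters on the three features area, production, and productivity. The rows in `plantations` are invented, pre-scaled values for illustration only, not the study's data, and the deterministic initialisation from the first k rows is an assumption made to keep the example reproducible.

```python
import math


def kmeans(points, k, max_iters=100):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    # Deterministic start (assumption): the first k points are the initial centroids.
    centroids = [list(p) for p in points[:k]]
    labels = None
    for _ in range(max_iters):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical, already-scaled rows: (area, production, productivity).
plantations = [
    (1.0, 1.2, 1.1),
    (5.0, 5.1, 4.9),
    (9.0, 9.2, 9.1),
    (1.1, 0.9, 1.0),
    (5.2, 4.8, 5.0),
    (0.9, 1.0, 1.2),
]
labels, centroids = kmeans(plantations, k=3)

# Share of plantations per cluster, analogous to the percentages reported above.
shares = {c: labels.count(c) / len(labels) for c in range(3)}
```

With real data the three features have different units, so they would normally be standardised before clustering, and a randomised initialisation with several restarts is the more common choice.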


MAUSAM, 2021, Vol 67 (3), pp. 669-676
Author(s): KAVITA PABREJA, RATTAN K. DATTA

Data mining has been used extensively in various business and scientific applications over the last few years. It has been found to provide deep insight into the hidden facts in huge databases. Data mining is an interdisciplinary subfield of computer science that discovers patterns in large data sets by using methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. In this paper, a data mining technique is applied to the interpretation of weather forecasts for one of the most disastrous weather phenomena, namely the cloudburst. Every year, cloudbursts over hilly areas and coastal regions cause loss of lives and property. Forecasting and warning of these events is very difficult. There is no satisfactory technique for anticipating the occurrence of cloudbursts because of their small scale. A very fine network of radars is required to detect the likelihood of a cloudburst, and this would be prohibitively expensive. A cloudburst warning can only be provided with a small lead time, say a few hours in advance, based on the interpretation of the latest satellite imagery data, powerful radar (Doppler category) if available, or Model Output Statistics (MOS) models. Another dimension to forecasting this weather event has been identified by applying a clustering technique to primary data forecast by global and regional weather forecasting models. A recent cloudburst over Uttarakhand that caused huge losses has been analyzed using the k-means clustering technique of data mining. It has been observed that by mining Numerical Weather Prediction model forecast data, the signals of cloudburst formation can be found 3-4 days in advance.


Stroke remains one of the leading causes of death worldwide. It is usually associated with a build-up of fatty deposits inside the arteries, which increases the risk of blood clotting. The unannounced nature of the disease when it strikes has posed a major challenge in the health sector. Poor medical facilities, insufficient information on how to accurately diagnose stroke, and late identification of the disease by patients who are unaware of it are some of the reasons for the increasing mortality rate due to it. The application of data mining techniques in medicine has brought about positive developments in the diagnosis, prediction, and deeper understanding of healthcare data. This study reviews some of the predictive models developed using data mining approaches to predict patients at risk of developing stroke, so that other researchers can build on them.


Healthcare, 2021, Vol 9 (12), pp. 1652
Author(s): Hanan Aljuaid, Hanan A. Hosni Mahmoud

Epigenetic changes are a necessary characteristic of all cancer types. Tumor cells usually carry genetic changes as well as epigenetic alterations. Identifying similar epigenetic features among various cancer types is highly beneficial for discovering appropriate treatments, and the existence of epigenetic alteration profiles can aid this goal. In this paper, we propose a new technique applying data mining and clustering methodologies to the analysis of cancer epigenetic changes. The proposed technique aims to detect common patterns of epigenetic changes in various cancer types. We validated the new technique by detecting epigenetic patterns across seven cancer types and by determining epigenetic similarities among various cancer types. The experimental results demonstrate that common epigenetic patterns do exist across these cancer types. Additionally, epigenetic gene analysis performed on the associated genes found a strong relationship with the development of various types of cancer and indicated high risk across the studied cancer types. We used the frequent pattern data mining approach to represent cancer types compactly in the promoters for some epigenetic marks. Using the resulting frequent pattern item set, the most frequent items are identified and yield the group of bi-clusters of these patterns. Experimental results show that the proposed method has a success rate of 88% in detecting cancer types according to a specific epigenetic pattern.
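The frequent pattern step can be sketched, under simplifying assumptions, as counting item sets that occur in at least a minimum number of samples. This brute-force version is not the authors' algorithm; the function name `frequent_itemsets`, the `max_size` cap, and the placeholder item labels `markA`, `markB`, `markC` are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=3):
    """Count every item set up to max_size and keep those meeting min_support."""
    counts = Counter()
    for items in transactions:
        ordered = sorted(items)  # canonical order so each item set has one key
        for size in range(1, max_size + 1):
            for combo in combinations(ordered, size):
                counts[combo] += 1
    return {combo: n for combo, n in counts.items() if n >= min_support}


# Hypothetical samples, each described by the set of marks observed in it.
samples = [
    {"markA", "markB"},
    {"markA", "markB", "markC"},
    {"markA", "markC"},
]
patterns = frequent_itemsets(samples, min_support=2)
```

Enumerating all combinations is only workable for small item sets; Apriori or FP-growth prune the search and are the usual choices at scale.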

