Mining Rare Association Rules by Discovering Quasi-Functional Dependencies

In the context of anomaly detection, the data mining technique of extracting association rules can be used to identify rare rules which represent infrequent situations. A method to detect rare rules is to first infer the normal behavior of objects in the form of quasi-functional dependencies (i.e. functional dependencies that frequently hold), and then analyzing rare violations with respect to them. The quasi-functional dependencies are usually inferred from the current instance of a database. However, in several applications, the database is not static, but new data are added or deleted continuously. Thus, the anomalies have to be updated because they change over time. In this chapter, we propose an incremental algorithm to efficiently maintain up-to-date rules (i.e., functional and quasi-functional dependencies). The impact of the cardinality of the data set and the number of new tuples on the execution time is evaluated through a set of experiments on synthetic and real databases, whose results are here reported.

Download Full-text

An efficient data mining technique for discovering interesting association rules

Database and Expert Systems Applications. 8th International Conference, DEXA '97. Proceedings ◽

10.1109/dexa.1997.617409 ◽

2002 ◽

Cited By ~ 1

Author(s):

Show-Jane Yen ◽

A.L.P. Chen

Keyword(s):

Data Mining ◽

Association Rules ◽

Data Mining Technique ◽

Mining Technique ◽

Efficient Data

Download Full-text

Male and Female Partner-Caregivers’ Burden: Does It Get Worse Over Time?

The Gerontologist ◽

10.1093/geront/gny132 ◽

2018 ◽

Vol 59 (6) ◽

pp. 1103-1111 ◽

Cited By ~ 9

Author(s):

Joukje C Swinkels ◽

Marjolein I Broese van Groenou ◽

Alice de Boer ◽

Theo G van Tilburg

Keyword(s):

Caregiver Burden ◽

Process Model ◽

Female Partner ◽

General View ◽

Data Set ◽

Change Over Time ◽

Men And Women ◽

Wear And Tear ◽

The Impact ◽

Over Time

Abstract Background and Objectives The general view is that partner-caregiver burden increases over time but findings are inconsistent. Moreover, the pathways underlying caregiver burden may differ between men and women. This study examines to what degree and why partner-caregiver burden changes over time. It adopts Pearlin’s Caregiver Stress Process Model, as it is expected that higher primary and secondary stressors will increase burden and larger amounts of resources will lower burden. Yet, the impact of stressors and resources may change over time. The wear-and-tear model predicts an increase of burden due to a stronger impact of stressors and lower impact of resources over time. Alternatively, the adaptation model predicts a decrease of burden due to a lower impact of stressors and higher impact of resources over time. Research Design and Methods We used 2 observations with a 1-year interval of 279 male and 443 female partner-caregivers, derived from the Netherlands Older Persons and Informal Caregivers Survey Minimum Data Set. We applied multilevel regression analysis, stratified by gender. Results Adjusted for all predictors, caregiver burden increased over time for both men and women. For female caregivers, the impact of poor spousal health on burden increased and the impact of fulfillment decreased over time. Among male caregivers, the impact of predictors did not change over time. Discussion and Implications The increase of burden over time supports the wear-and-tear model, in particular for women. This study highlights the need for gender-specific interventions that are focused on enabling older partners to be better prepared for long-term partner-care.

Download Full-text

Privacy Preservation using (L, D) Inference Model Based on Dependency Identification Information Gain

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1196.0986s319 ◽

2019 ◽

Vol 8 (6S3) ◽

pp. 1170-1173

Keyword(s):

Data Mining ◽

Information Gain ◽

Original Data ◽

Perturbation Approach ◽

Sensitive Information ◽

Functional Dependencies ◽

Inference Model ◽

Data Set ◽

Data Mining Techniques ◽

Original Dataset

The improvement of an information processing and Memory capacity, the vast amount of data is collected for various data analyses purposes. Data mining techniques are used to get knowledgeable information. The process of extraction of data by using data mining techniques the data get discovered publically and this leads to breaches of specific privacy data. Privacypreserving data mining is used to provide to protection of sensitive information from unwanted or unsanctioned disclosure. In this paper, we analysis the problem of discovering similarity checks for functional dependencies from a given dataset such that application of algorithm (l, d) inference with generalization can anonymised the micro data without loss in utility. [8] This work has presented Functional dependency based perturbation approach which hides sensitive information from the user, by applying (l, d) inference model on the dependency attributes based on Information Gain. This approach works on both categorical and numerical attributes. The perturbed data set does not affects the original dataset it maintains the same or very comparable patterns as the original data set. Hence the utility of the application is always high, when compared to other data mining techniques. The accuracy of the original and perturbed datasets is compared and analysed using tools, data mining classification algorithm.

Download Full-text

An analysis on the impact of fluoride in human health (dental) using clustering data mining technique

International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012) ◽

10.1109/icprime.2012.6208374 ◽

2012 ◽

Cited By ~ 7

Author(s):

T. Balasubramanian ◽

R. Umarani

Keyword(s):

Data Mining ◽

Human Health ◽

Data Mining Technique ◽

Mining Technique ◽

Clustering Data ◽

The Impact

Download Full-text

Finding Persistent Strong Rules

Knowledge Discovery Practices and Emerging Applications of Data Mining - Advances in Data Mining and Database Management ◽

10.4018/978-1-60960-067-9.ch005 ◽

2010 ◽

pp. 85-107

Author(s):

Anthony Scime ◽

Karthik Rajasethupathy ◽

Kulathur S. Rajasethupathy ◽

Gregg R. Murray

Keyword(s):

Data Mining ◽

Association Rules ◽

Strong Association ◽

National Election ◽

Data Sets ◽

Rule Discovery ◽

Discovery Process ◽

Data Set ◽

Rule Sets ◽

Election Studies

Data mining is a collection of algorithms for finding interesting and unknown patterns or rules in data. However, different algorithms can result in different rules from the same data. The process presented here exploits these differences to find particularly robust, consistent, and noteworthy rules among much larger potential rule sets. More specifically, this research focuses on using association rules and classification mining to select the persistently strong association rules. Persistently strong association rules are association rules that are verifiable by classification mining the same data set. The process for finding persistent strong rules was executed against two data sets obtained from the American National Election Studies. Analysis of the first data set resulted in one persistent strong rule and one persistent rule, while analysis of the second data set resulted in 11 persistent strong rules and 10 persistent rules. The persistent strong rule discovery process suggests these rules are the most robust, consistent, and noteworthy among the much larger potential rule sets.

Download Full-text

The Observation Report of Red Blood Cell Morphology in Thailand Teenager by Using Data Mining Technique

Advances in Hematology ◽

10.1155/2014/493706 ◽

2014 ◽

Vol 2014 ◽

pp. 1-5 ◽

Cited By ~ 4

Author(s):

Sarawut Saichanma ◽

Sucha Chulsomlee ◽

Nonthaya Thangrua ◽

Pornsuri Pongsuchart ◽

Duangmanee Sanmun

Keyword(s):

Data Mining ◽

Blood Cell ◽

Hematological Parameters ◽

Medical Science ◽

Peripheral Blood Smear ◽

Medical Laboratory ◽

Data Mining Technique ◽

Data Set ◽

Mining Technique ◽

Laboratory Results

It is undeniable that laboratory information is important in healthcare in many ways such as management, planning, and quality improvement. Laboratory diagnosis and laboratory results from each patient are organized from every treatment. These data are useful for retrospective study exploring a relationship between laboratory results and diseases. By doing so, it increases efficiency in diagnosis and quality in laboratory report. Our study will utilize J48 algorithm, a data mining technique to predict abnormality in peripheral blood smear from 1,362 students by using 13 data set of hematological parameters gathered from automated blood cell counter. We found that the decision tree which is created from the algorithm can be used as a practical guideline for RBC morphology prediction by using 4 hematological parameters (MCV, MCH, Hct, and RBC). The average prediction of RBC morphology has true positive, false positive, precision, recall, and accuracy of 0.940, 0.050, 0.945, 0.940, and 0.943, respectively. A newly found paradigm in managing medical laboratory information will be helpful in organizing, researching, and assisting correlation in multiple disciplinary other than medical science which will eventually lead to an improvement in quality of test results and more accurate diagnosis.

Download Full-text

Association Rules Mining Based on Adaptive Fuzzy Clustering Algorithm

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.998-999.842 ◽

2014 ◽

Vol 998-999 ◽

pp. 842-845 ◽

Cited By ~ 1

Author(s):

Jia Mei Guo ◽

Yin Xiang Pei

Keyword(s):

Data Mining ◽

Association Rules ◽

Clustering Algorithm ◽

Original Data ◽

Data Set ◽

Association Rules Mining ◽

Fuzzy Association Rules ◽

Redundant Data ◽

Fuzzy Partitions ◽

Rules Extraction

Association rules extraction is one of the important goals of data mining and analyzing. Aiming at the problem that information lose caused by crisp partition of numerical attribute , in this article, we put forward a fuzzy association rules mining method based on fuzzy logic. First, we use c-means clustering to generate fuzzy partitions and eliminate redundant data, and then map the original data set into fuzzy interval, in the end, we extract the fuzzy association rules on the fuzzy data set as providing the basis for proper decision-making. Results show that this method can effectively improve the efficiency of data mining and the semantic visualization and credibility of association rules.

Download Full-text

Fuzzy data mining for discovering changes in association rules over time

2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291) ◽

10.1109/fuzz.2002.1006622 ◽

2003 ◽

Author(s):

Wai-Ho Au ◽

K.C.C. Chan

Keyword(s):

Data Mining ◽

Association Rules ◽

Fuzzy Data ◽

Over Time

Download Full-text

Inflation and Economic Growth: a Review of The International Literature

Comparative Economic Research ◽

10.1515/cer-2017-0019 ◽

2017 ◽

Vol 20 (3) ◽

pp. 41-56 ◽

Cited By ~ 3

Author(s):

Foluso A. Akinsola ◽

Nicholas M. Odhiambo

Keyword(s):

Economic Growth ◽

Developing Countries ◽

Negative Relationship ◽

Threshold Level ◽

Data Set ◽

Developed Economies ◽

The Impact ◽

The Relationship ◽

Over Time ◽

Country Specific

This paper surveys the existing literature on the relationship between inflation and economic growth in developed and developing countries, highlighting the theoretical and empirical indications. The study finds that the impact of inflation on economic growth varies from country to country and over time. The study also finds that the results from these studies depend on country‑specific characteristics, the data set used, and the methodology employed. On balance, the study finds overwhelming support in favour of a negative relationship between inflation and growth, especially in developed economies. However, there is still much controversy about the specific threshold level of inflation that is appropriate for growth. Most previous studies on this subject just assume a unidirectional causal relationsship between inflation and economic growth. To our knowledge, this may be the first review of its kind to survey, in detail, the existing research on the relationship between inflation and economic growth in developed and developing countries.

Download Full-text

Association Rules Analysis on FP-Growth Method in Predicting Sales

10.31227/osf.io/8m57c ◽

2017 ◽

Author(s):

Andysah Putera Utama Siahaan ◽

Mesran Mesran ◽

Andre Hasudungan Lubis ◽

Ali Ikhwan ◽

Supiyandi

Keyword(s):

Data Mining ◽

Association Rules ◽

Frequent Itemset ◽

Frequent Pattern ◽

Data Set ◽

Pattern Processing ◽

Large Databases ◽

Growth Method ◽

Association Rules Analysis ◽

A Company

Sales transaction data on a company will continue to increase day by day. Large amounts of data can be problematic for a company if it is not managed properly. Data mining is a field of science that unifies techniques from machine learning, pattern processing, statistics, databases, and visualization to handle the problem of retrieving information from large databases. The relationship sought in data mining can be a relationship between two or more in one dimension. The algorithm included in association rules in data mining is the Frequent Pattern Growth (FP-Growth) algorithm is one of the alternatives that can be used to determine the most frequent itemset in a data set.

Download Full-text