Using feature selection and association rule mining to evaluate digital courseware

Recent advances in science, technology, and medicine have led to the accumulation of huge amounts of medical data in digital repositories, where they are stored for future use. Mining medical data is challenging because the data are subject to many social concerns and ethical issues. Moreover, medical data are often unreliable: they contain many missing and misleading values and may sometimes be faulty. Thus, pre-processing tasks in medical data mining are of great importance, and the main focus is on feature selection, because the quality of the input determines the quality of the resulting data mining process. This paper describes a feature selection process in which a data set subjected to constraint-governed association rule mining and interestingness measures yields a small feature subset capable of producing better classification results. In the experimental study, the feature subset was reduced by more than 50% by applying syntax-governed and dimensionality-governed constraints, without loss of result quality. This approach yielded about 98% classification accuracy on the Breast Cancer Surveillance Consortium (BCSC) data set.

Association Rule-Based Feature Mining for Automated Fault Diagnosis of Rolling Bearing

Shock and Vibration ◽ 10.1155/2019/1518246 ◽ 2019 ◽ Vol 2019 ◽ pp. 1-12 ◽ Cited By ~ 1

Author(s): Yuan Li ◽ Jinjiang Wang ◽ Lixiang Duan ◽ Tangbo Bai ◽ Xuduo Wang ◽ ...

Keyword(s): Feature Selection ◽ Fault Diagnosis ◽ Association Rules ◽ Association Rule ◽ Association Rule Mining ◽ Data Matrix ◽ Fault Classification ◽ Rule Mining ◽ Bearing Fault ◽ New Association

Effective and efficient diagnosis methods are in high demand to improve system reliability. In contrast to conventional fault diagnosis methods, which take a forward approach (e.g., feature extraction, feature selection and fusion, and then fault diagnosis), this paper presents a new association rule mining method that takes an inverse approach, unearthing the underlying relation between labeled defects and extracted features for bearing fault analysis. Instead of the even-division discretization used in traditional association rule mining, the study presents an approach based on equal probability discretization. First, a series of features extracted from the signal data are discretized so that the data are distributed with equal probability across intervals, which avoids excessively concentrated or excessively scattered data. Next, the data matrix composed of arrays of discretized features and defect labels is used to generate association rules representing the relation between the features and the fault types. An experimental study on a bearing test shows that the proposed method can generate a series of underlying association rules for bearing fault diagnosis, and that the features selected by the method can be used directly to analyze bearing signals for fault classification and defect severity identification. As a feature selection method, it shows clear advantages over the traditional PCA, KPCA, and LLE dimension reduction methods.
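
The two steps described in this abstract, equal probability discretization followed by rule generation from features to fault labels, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy RMS values, and the defect labels below are all hypothetical, and rank-based equal-frequency binning is assumed as one plausible reading of "equal probability discretization".

```python
from collections import Counter

def equal_frequency_bins(values, n_bins):
    """Assign each value a bin index so that bins hold roughly equal counts.

    Binning is by rank, not by value range, so a few extreme values
    cannot leave most bins empty. Tied values may land in adjacent bins.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, idx in enumerate(order):
        bins[idx] = min(rank * n_bins // len(values), n_bins - 1)
    return bins

def rules_feature_to_label(bins, labels):
    """Return {(bin, label): (support, confidence)} for rules 'bin -> label'."""
    n = len(labels)
    pair_counts = Counter(zip(bins, labels))
    bin_counts = Counter(bins)
    return {
        (b, lab): (count / n, count / bin_counts[b])
        for (b, lab), count in pair_counts.items()
    }

# Hypothetical toy data: one vibration feature and one defect label per sample.
rms = [0.11, 0.12, 0.13, 0.14, 0.50, 0.52, 0.55, 0.60, 3.0, 3.1, 3.3, 9.0]
labels = ["normal"] * 4 + ["inner"] * 4 + ["outer"] * 4

bins = equal_frequency_bins(rms, n_bins=3)
rules = rules_feature_to_label(bins, labels)
for (b, lab), (sup, conf) in sorted(rules.items()):
    print(f"bin {b} -> {lab}: support={sup:.2f}, confidence={conf:.2f}")
```

With these skewed toy values, three equal-width intervals would crowd most samples into the lowest interval, whereas the rank-based bins each hold four samples. A feature whose bins predict a label with high confidence is a candidate for selection.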

Feature Selection for Large Scale Data Using Class Association Rule Mining

Journal of Convergence Information Technology ◽ 10.4156/jcit.vol6.issue11.42 ◽ 2011 ◽ Vol 6 (11) ◽ pp. 371-377

Author(s): J. Alamelu Mangai ◽ S. Sameen Fathima

Keyword(s): Feature Selection ◽ Association Rule ◽ Association Rule Mining ◽ Large Scale ◽ Rule Mining ◽ Large Scale Data ◽ Selection For ◽ Class Association Rule ◽ Scale Data

Correlation-Based Feature Selection for Association Rule Mining in Semantic Annotation of Mammographic Medical Images

Information Retrieval Technology - Lecture Notes in Computer Science ◽ 10.1007/978-3-319-12844-3_41 ◽ 2014 ◽ pp. 482-493 ◽ Cited By ~ 2

Author(s): Nirase Fathima Abubacker ◽ Azreen Azman ◽ Shyamala Doraisamy ◽ Masrah Azrifah Azmi Murad ◽ Mohamed Eltahir Makki Elmanna ◽ ...

Keyword(s): Feature Selection ◽ Association Rule ◽ Association Rule Mining ◽ Medical Images ◽ Semantic Annotation ◽ Rule Mining ◽ Correlation Based Feature Selection ◽ Selection For

Graph Based Feature Selection for Reduction of Dimensionality in Next-Generation RNA Sequencing Datasets

Algorithms ◽ 10.3390/a15010021 ◽ 2022 ◽ Vol 15 (1) ◽ pp. 21

Author(s): Consolata Gakii ◽ Paul O. Mireji ◽ Richard Rimiru

Keyword(s): Feature Selection ◽ Association Rule ◽ Association Rule Mining ◽ Principal Component ◽ Recursive Feature Elimination ◽ High Dimensional ◽ Rule Mining ◽ RNAseq Data ◽ Feature Selection Approach ◽ Feature Selection Techniques

Analysis of high-dimensional data, with more features than observations, places significant demands on computational cost and memory usage. Feature selection can be used to reduce the dimensionality of the data. We used a graph-based approach, principal component analysis (PCA), and recursive feature elimination to select features for classification from two lung cancer RNAseq datasets. The selected features were discretized for association rule mining, where support and lift were used to generate informative rules. Our results show that graph-based feature selection improved the performance of the sequential minimal optimization (SMO) and multilayer perceptron (MLP) classifiers on both datasets. In association rule mining, features selected using the graph-based approach outperformed those from the other two feature-selection techniques at a support of 0.5 and a lift of 2. The non-redundant rules reflect the inherent relationships between features. Biological features are usually related to functions in living systems, a relationship that cannot be deduced by feature selection and classification alone. Therefore, the graph-based feature-selection approach combined with rule mining is a suitable way of selecting features and finding associations between them in high-dimensional RNAseq data.
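
For readers unfamiliar with the two measures named in this abstract, the following sketch computes support and lift using their standard definitions. It is an illustration only, not the paper's pipeline: the function names and the discretized gene items (such as "geneA=high") are hypothetical.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def lift(transactions, antecedent, consequent):
    """Lift of the rule antecedent -> consequent.

    lift = support(A and C) / (support(A) * support(C)).
    A value above 1 means A and C co-occur more often than
    they would if they were independent.
    """
    s_a = support(transactions, antecedent)
    s_c = support(transactions, consequent)
    if s_a == 0 or s_c == 0:
        return 0.0  # rule is undefined on this data; treated as uninformative
    s_ac = support(transactions, set(antecedent) | set(consequent))
    return s_ac / (s_a * s_c)

# Hypothetical discretized expression levels, one set of items per sample.
transactions = [
    {"geneA=high", "geneB=high"},
    {"geneA=high", "geneB=high"},
    {"geneA=low", "geneB=low"},
    {"geneA=low", "geneB=low"},
]

rule_support = support(transactions, {"geneA=high", "geneB=high"})
rule_lift = lift(transactions, {"geneA=high"}, {"geneB=high"})
print(f"support={rule_support}, lift={rule_lift}")
```

On this toy data the rule geneA=high -> geneB=high has a support of 0.5 and a lift of 2, so it would sit exactly at the thresholds quoted in the abstract.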

A Novel Market Basket Analysis Using Adaptive Association Rule Mining Algorithm

International Journal of Scientific Research ◽ 10.15373/22778179/sep2012/9 ◽ 2012 ◽ Vol 1 (4) ◽ pp. 25-28

Author(s): M. Dhanabhakyam ◽ Dr. M. Punithavalli

Keyword(s): Association Rule ◽ Association Rule Mining ◽ Market Basket Analysis ◽ Rule Mining ◽ Market Basket ◽ Mining Algorithm

Study of Various Parallel Implementations of Association Rule Mining Algorithm

American Journal Of Advanced Computing ◽ 10.15864/ajac.v2i1.94 ◽ 2015 ◽ Vol 2 (1)

Author(s): Sarbani Dasgupta

Keyword(s): Association Rule ◽ Association Rule Mining ◽ Rule Mining ◽ Mining Algorithm ◽ Parallel Implementations
