Improving Modular Classification Rule Induction with G-Prism Using Dynamic Rule Term Boundaries

A typical predictive approach in data mining that produces If-Then knowledge for decision making is rule-based classification. Rule-based classification includes a large number of algorithms that fall under the categories of covering, greedy, rule induction, and associative classification. These approaches have shown promising results due to the simplicity of the models generated and the user’s ability to understand, and maintain them. Phishing is one of the emergent online threats in web security domains that necessitates anti-phishing models with rules so users can easily differentiate among website types. This paper critically analyses recent research studies on the use of predictive models with rules for phishing detection, and evaluates the applicability of these approaches on phishing. To accomplish our task, we experimentally evaluate four different rule-based classifiers that belong to greedy, associative classification and rule induction approaches on real phishing datasets and with respect to different evaluation measures. Moreover, we assess the classifiers derived and contrast them with known classic classification algorithms including Bayes Net and Simple Logistics. The aim of the comparison is to determine the pros and cons of predictive models with rules and reveal their actual performance when it comes to detecting phishing activities. The results clearly showed that eDRI, a recently greedy algorithm, not only generates useful models but these are also highly competitive with respect to predictive accuracy as well as runtime when they are employed as anti-phishing tools.

Download Full-text

P-Prism: A Computationally Efficient Approach to Scaling up Classification Rule Induction

Artificial Intelligence in Theory and Practice II - IFIP – The International Federation for Information Processing ◽

10.1007/978-0-387-09695-7_8 ◽

2008 ◽

pp. 77-86 ◽

Cited By ~ 4

Author(s):

Frederic T. Stahl ◽

Max A. Bramer ◽

Mo Adda

Keyword(s):

Scaling Up ◽

Rule Induction ◽

Classification Rule ◽

Computationally Efficient ◽

Efficient Approach

Download Full-text

FPGA kernels for classification rule induction

2016 39th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) ◽

10.1109/mipro.2016.7522163 ◽

2016 ◽

Author(s):

P. Skoda ◽

B. Medved Rogina

Keyword(s):

Rule Induction ◽

Classification Rule

Download Full-text

Towards a Computationally Efficient Approach to Modular Classification Rule Induction

Research and Development in Intelligent Systems XXIV ◽

10.1007/978-1-84800-094-0_27 ◽

2007 ◽

pp. 357-362 ◽

Cited By ~ 2

Author(s):

Frederic Stahl ◽

Max Bramer

Keyword(s):

Rule Induction ◽

Classification Rule ◽

Computationally Efficient ◽

Efficient Approach

Download Full-text

Scaling up classification rule induction through parallel processing

The Knowledge Engineering Review ◽

10.1017/s0269888912000355 ◽

2012 ◽

Vol 28 (4) ◽

pp. 451-478 ◽

Cited By ~ 3

Author(s):

Frederic Stahl ◽

Max Bramer

Keyword(s):

Data Mining ◽

Parallel Computing ◽

Scale Up ◽

Cost Effective ◽

Scaling Up ◽

Rule Induction ◽

Classification Rule ◽

Recorded Data ◽

Fast Increase

AbstractThe fast increase in the size and number of databases demands data mining approaches that are scalable to large amounts of data. This has led to the exploration of parallel computing technologies in order to perform data mining tasks concurrently using several processors. Parallelization seems to be a natural and cost-effective way to scale up data mining technologies. One of the most important of these data mining technologies is the classification of newly recorded data. This paper surveys advances in parallelization in the field of classification rule induction.

Download Full-text