A Discrete Artificial Bees Colony Inspired Biclustering Algorithm

Clustering of data in a large dimension space is of great interest in many data mining applications. In this paper, we propose a method for clustering of web usage data in a high-dimensional space based on a concept hierarchy model. In this method, the relationship present in the web usage data are mapped into a fuzzy proximity relation of user transactions. We also described an approach to present the preference set of URLs to a new user transaction based on the match score with the clusters. The study demonstrates that our approach is general and effective for mining the web data for web personalization.

Download Full-text

Discrete Artificial Bee Colony Optimization Algorithm for Financial Classification Problems

Trends in Developing Metaheuristics, Algorithms, and Optimization Approaches ◽

10.4018/978-1-4666-2145-9.ch004 ◽

2012 ◽

pp. 44-58

Author(s):

Yannis Marinakis ◽

Magdalene Marinaki ◽

Nikolaos Matsatsinis ◽

Constantin Zopounidis

Keyword(s):

Artificial Bee Colony ◽

Discrete Version ◽

Classification Models ◽

Classification Problems ◽

Financial Decisions ◽

Artificial Bee Colony Optimization ◽

Bee Colony ◽

Selection Step ◽

Benchmark Datasets ◽

Bee Colony Optimization

Nature-inspired methods are used in various fields for solving a number of problems. This study uses a nature-inspired method, artificial bee colony optimization that is based on the foraging behaviour of bees, for a financial classification problem. Financial decisions are often based on classification models, which are used to assign a set of observations into predefined groups. One important step toward the development of accurate financial classification models involves the selection of the appropriate independent variables (features) that are relevant to the problem. The proposed method uses a discrete version of the artificial bee colony algorithm for the feature selection step while nearest neighbour based classifiers are used for the classification step. The performance of the method is tested using various benchmark datasets from UCI Machine Learning Repository and in a financial classification task involving credit risk assessment. Its results are compared with the results of other nature-inspired methods.

Download Full-text

Traversal Pattern Mining in Web Usage Data

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch119 ◽

2008 ◽

pp. 2004-2021

Author(s):

Jenq-Foung Yao ◽

Yongqiao Xiao

Keyword(s):

Pattern Mining ◽

Pattern Discovery ◽

Web Usage Mining ◽

Sequential Patterns ◽

Web Usage ◽

Web Logs ◽

Frequent Episodes ◽

Browsing Behavior ◽

The Web ◽

Usage Data

Web usage mining is to discover useful patterns in the web usage data, and the patterns provide useful information about the user’s browsing behavior. This chapter examines different types of web usage traversal patterns and the related techniques used to uncover them, including Association Rules, Sequential Patterns, Frequent Episodes, Maximal Frequent Forward Sequences, and Maximal Frequent Sequences. As a necessary step for pattern discovery, the preprocessing of the web logs is described. Some important issues, such as privacy, sessionization, are raised, and the possible solutions are also discussed.

Download Full-text

Traversal Pattern Mining in Web Usage Data

Web Information Systems ◽

10.4018/978-1-59140-208-4.ch010 ◽

2004 ◽

pp. 335-358 ◽

Cited By ~ 2

Author(s):

Yongqiao Xiao ◽

Jenq-Foung (J.F.) Yao

Keyword(s):

Pattern Mining ◽

Pattern Discovery ◽

Web Usage Mining ◽

Sequential Patterns ◽

Web Usage ◽

Web Logs ◽

Frequent Episodes ◽

Browsing Behavior ◽

The Web ◽

Usage Data

Web usage mining is to discover useful patterns in the web usage data, and the patterns provide useful information about the user’s browsing behavior. This chapter examines different types of web usage traversal patterns and the related techniques used to uncover them, including Association Rules, Sequential Patterns, Frequent Episodes, Maximal Frequent Forward Sequences, and Maximal Frequent Sequences. As a necessary step for pattern discovery, the preprocessing of the web logs is described. Some important issues, such as privacy, sessionization, are raised, and the possible solutions are also discussed.

Download Full-text

WEB FARMING WITH CLICKSTREAM

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622008002971 ◽

2008 ◽

Vol 07 (02) ◽

pp. 291-308 ◽

Cited By ~ 18

Author(s):

JIA HU ◽

NING ZHONG

Keyword(s):

Behavior Analysis ◽

Web Mining ◽

User Interaction ◽

Purchasing Behavior ◽

Web Content ◽

Related Data ◽

Web Usage ◽

Proposed Model ◽

The Web ◽

Usage Data

In a commercial website or portal, Web information fusion is usually from the following two approaches, one is to integrate the Web content, structure, and usage data for surfing behavior analysis; the other is to integrate Web usage data with traditional customer, product, and transaction data for purchasing behavior analysis. In this paper, we propose a unified model based on Web farming technology for collecting clickstream logs in the whole user interaction process. We emphasize that collecting clickstream logs at the application layer will help to seamlessly integrate Web usage data with other customer-related data sources. In this paper, we extend the Web log standard to modeling clickstream format and Web mining to Web farming from passively collecting data and analyzing the customer behavior to actively influence the customer's decision making. The proposed model can be developed as a common plugin for most existing commercial websites and portals.

Download Full-text

Optimal PMU Placement using Binary Particle Swarm and Artificial Bee Colony with Channel Limitations and Redundancy.

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c5104.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 3881-3886

Keyword(s):

Particle Swarm Optimization ◽

Power System ◽

Artificial Bee Colony ◽

Particle Swarm ◽

Measurement Unit ◽

Binary Particle Swarm Optimization ◽

Swarm Optimization ◽

Artificial Bee Colony Optimization ◽

Bee Colony ◽

Bee Colony Optimization

Phasor Measurement Unit (PMU) being expensive and to be placed optimally, a meta-heuristic approach of Binary particle swarm optimization (BPSO) and Binary artificial bee colony optimization (BABC) is made for the optimal allocation of PMU in a power system. The PMU locations resulted are served by basic system conditions like network configuration, critical generators, and loads. The pattern of locations on including Zero-Injection Buess (ZIB) is also discussed. The redundancy in case of PMU loss is coined so as to obtain a complete observability of the power system. the channel limitations of device is also taken into consideration for better results in real-time systems. Optimal PMU locations for IEEE 30-bus and 14-bus systems with channel limits are compared with all above considerations. The number of PMU locations is reduced as channel limits increases. The simulated PMU locations are decreased with improved observability by Binary Artificial Bee Colony Optimization as compared to Binary Particle Swarm Optimization.

Download Full-text

Extraction of Target User Group from Web Usage Data Using Evolutionary Biclustering Approach

Trends in Developing Metaheuristics, Algorithms, and Optimization Approaches ◽

10.4018/978-1-4666-2145-9.ch015 ◽

2012 ◽

pp. 253-263

Author(s):

R. Rathipriya ◽

K. Thangavel ◽

J. Bagyamani

Keyword(s):

Data Mining ◽

Greedy Heuristic ◽

Data Mining Technique ◽

Web Usage ◽

Target User ◽

Global Optimizer ◽

User Groups ◽

Optimal Target ◽

Global Optimal ◽

Usage Data

Data mining extracts hidden information from a database that the user did not know existed. Biclustering is one of the data mining technique which helps marketing user to target marketing campaigns more accurately and to align campaigns more closely with the needs, wants, and attitudes of customers and prospects. The biclustering results can be tuned to find users’ browsing patterns relevant to current business problems. This paper presents a new application of biclustering to web usage data using a combination of heuristics and meta-heuristics algorithms. Two-way K-means clustering is used to generate the seeds from preprocessed web usage data, Greedy Heuristic is used iteratively to refine a set of seeds, which is fast but often yield local optimal solutions. In this paper, Genetic Algorithm is used as a global optimizer that can be coupled with greedy method to identify the global optimal target user groups based on their coherent browsing pattern. The performance of the proposed work is evaluated by conducting experiment on the msnbc, a clickstream dataset from UCI repository. Results show that the proposed work performs well in extracting optimal target users groups from the web usage data which can be used for focalized marketing campaigns.

Download Full-text

Extraction of Knowledge from Web Server Logs Using Web Usage Mining

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s3.2113 ◽

2019 ◽

Vol 8 (S3) ◽

pp. 12-15

Author(s):

B. Harika ◽

T. Sudha

Keyword(s):

Data Mining ◽

Web Mining ◽

Web Server ◽

Primary Source ◽

Web Usage Mining ◽

Data Mining Techniques ◽

Web Usage ◽

Browsing Behavior ◽

The Web ◽

Usage Data

Information on internet increases rapidly from day to day and the usage of the web also increases, thus there is the need to discover interesting patterns from web. The process used to extract and mine useful information from web documents by using Data Mining Techniques is called Web Mining. Web Mining is broadly classified in to three types namely Web Content Mining, Web Structure Mining and Web Usage Mining. In this paper our focus is mainly on Web Usage Mining, where we are applying the data mining techniques to analyse and discover interesting knowledge from the Web Usage data. The activities of the user are captured and stored at different levels such as server level, proxy level and user level called as Web Usage Data and the usage data stored at server side is Web Server Log, where it records the browsing behavior of users and their requests based on the user clicks. Web server Log is a primary source to perform Web Usage Mining. This paper also brings in to discussion of various existing pre-processing techniques and analysis of web log files and how clustering is applied to group the users based on the browsing behavior of users on their interested contents.

Download Full-text

Discrete Artificial Bee Colony Optimization Algorithm for Financial Classification Problems

International Journal of Applied Metaheuristic Computing ◽

10.4018/jamc.2011010101 ◽

2011 ◽

Vol 2 (1) ◽

pp. 1-17 ◽

Cited By ~ 3

Author(s):

Yannis Marinakis ◽

Magdalene Marinaki ◽

Nikolaos Matsatsinis ◽

Constantin Zopounidis

Keyword(s):

Artificial Bee Colony ◽

Classification Problem ◽

Discrete Version ◽

Classification Models ◽

Classification Problems ◽

Financial Decisions ◽

Artificial Bee Colony Optimization ◽

Bee Colony ◽

Benchmark Datasets ◽

Bee Colony Optimization

Nature-inspired methods are used in various fields for solving a number of problems. This study uses a nature-inspired method, artificial bee colony optimization that is based on the foraging behaviour of bees, for a financial classification problem. Financial decisions are often based on classification models, which are used to assign a set of observations into predefined groups. One important step toward the development of accurate financial classification models involves the selection of the appropriate independent variables (features) that are relevant to the problem. The proposed method uses a discrete version of the artificial bee colony algorithm for the feature selection step while nearest neighbour based classifiers are used for the classification step. The performance of the method is tested using various benchmark datasets from UCI Machine Learning Repository and in a financial classification task involving credit risk assessment. Its results are compared with the results of other nature-inspired methods.

Download Full-text

Usage Profile Generation from Web Usage Data Using Hybrid Biclustering Algorithm

Modeling Applications and Theoretical Innovations in Interdisciplinary Evolutionary Computation ◽

10.4018/978-1-4666-3628-6.ch016 ◽

2013 ◽

pp. 260-272 ◽

Cited By ~ 1

Author(s):

R. Rathipriya ◽

K. Thangavel ◽

J. Bagyamani

Keyword(s):

Web Mining ◽

User Profile ◽

Binary Particle Swarm Optimization ◽

Local Optimum ◽

Mutation Operator ◽

Swarm Optimization ◽

Web Usage ◽

Specific Subset ◽

Browsing Behavior ◽

Usage Data

Biclustering has the potential to make significant contributions in the fields of information retrieval, web mining, and so forth. In this paper, the authors analyze the complex association between users and pages of a web site by using a biclustering algorithm. This method automatically identifies the groups of users that show similar browsing patterns under a specific subset of the pages. In this paper, mutation operator from Genetic Algorithms is incorporated into the Binary Particle Swarm Optimization (BPSO) for biclustering of web usage data. This hybridization can increase the diversity of the population and help the particles effectively escape from the local optimum. It detects optimized user profile group according to coherent browsing behavior. Experiments are performed on a benchmark clickstream dataset to test the effectiveness of the proposed algorithm. The results show that the proposed algorithm has higher performance than existing PSO methods. The interpretation of this biclustering results are useful for marketing and sales strategies.

Download Full-text