Feature Selection Algorithm for High-dimensional Biomedical Data Using Information Gain and Improved Chemical Reaction Optimization

2021 ◽  
Vol 15 (8) ◽  
pp. 912-926
Author(s):  
Ge Zhang ◽  
Pan Yu ◽  
Jianlin Wang ◽  
Chaokun Yan

Background: There have been rapid developments in various bioinformatics technologies, which have led to the accumulation of a large amount of biomedical data. However, these datasets usually involve thousands of features and include much irrelevant or redundant information, which leads to confusion during diagnosis. Feature selection is a solution that consists of finding the optimal subset, which is known to be an NP problem because of the large search space. Objective: For the issue, this paper proposes a hybrid feature selection method based on an improved chemical reaction optimization algorithm (ICRO) and an information gain (IG) approach, which called IGICRO. Methods: IG is adopted to obtain some important features. The neighborhood search mechanism is combined with ICRO to increase the diversity of the population and improve the capacity of local search. Results: Experimental results of eight public available data sets demonstrate that our proposed approach outperforms original CRO and other state-of-the-art approaches.

2021 ◽  
Vol 18 (6) ◽  
pp. 7143-7160
Author(s):  
Shijing Ma ◽  
◽  
Yunhe Wang ◽  
Shouwei Zhang ◽  

<abstract><p>Chemical Reaction Optimization (CRO) is a simple and efficient evolutionary optimization algorithm by simulating chemical reactions. As far as the current research is concerned, the algorithm has been successfully used for solving a number of real-world optimization tasks. In our paper, a new real encoded chemical reaction optimization algorithm is proposed to boost the efficiency of the optimization operations in standard chemical reactions optimization algorithm. Inspired by the evolutionary operation of the differential evolution algorithm, an improved search operation mechanism is proposed based on the underlying operation. It is modeled to further explore the search space of the algorithm under the best individuals. Afterwards, to control the perturbation frequency of the search strategy, the modification rate is increased to balance between the exploration ability and mining ability of the algorithm. Meanwhile, we also propose a new population initialization method that incorporates several models to produce high-quality initialized populations. To validate the effectiveness of the algorithm, nine unconstrained optimization algorithms are used as benchmark functions. As observed from the experimental results, it is evident that the proposed algorithm is significantly better than the standard chemical reaction algorithm and other evolutionary optimization algorithms. Then, we also apply the proposed model to address the synthesis problem of two antenna array synthesis. The results also reveal that the proposed algorithm is superior to other approaches from different perspectives.</p></abstract>


2020 ◽  
Vol 3 (1) ◽  
pp. 58-63
Author(s):  
Y. Mansour Mansour ◽  
Majed A. Alenizi

Emails are currently the main communication method worldwide as it proven in its efficiency. Phishing emails in the other hand is one of the major threats which results in significant losses, estimated at billions of dollars. Phishing emails is a more dynamic problem, a struggle between the phishers and defenders where the phishers have more flexibility in manipulating the emails features and evading the anti-phishing techniques. Many solutions have been proposed to mitigate the phishing emails impact on the targeted sectors, but none have achieved 100% detection and accuracy. As phishing techniques are evolving, the solutions need to be evolved and generalized in order to mitigate as much as possible. This article presents a new emergent classification model based on hybrid feature selection method that combines two common feature selection methods, Information Gain and Genetic Algorithm that keep only significant and high-quality features in the final classifier. The Proposed hybrid approach achieved 98.9% accuracy rate against phishing emails dataset comprising 8266 instances and results depict enhancement by almost 4%. Furthermore, the presented technique has contributed to reducing the search space by reducing the number of selected features.


2015 ◽  
Vol 2015 ◽  
pp. 1-20 ◽  
Author(s):  
Yuyi Jiang ◽  
Zhiqing Shao ◽  
Yi Guo ◽  
Huanhuan Zhang ◽  
Kun Niu

An efficient DAG task scheduling is crucial for leveraging the performance potential of a heterogeneous system and finding a schedule that minimizes themakespan(i.e., the total execution time) of a DAG is known to be NP-complete. A recently proposed metaheuristic method, Chemical Reaction Optimization (CRO), demonstrates its capability for solving NP-complete optimization problems. This paper develops an algorithm named Double-Reaction-Structured Chemical Reaction Optimization (DRSCRO) for DAG scheduling on heterogeneous systems, which modifies the conventional CRO framework and incorporates CRO with the variable neighborhood search (VNS) method. DRSCRO has two reaction phases for super molecule selection and global optimization, respectively. In the molecule selection phase, the CRO as a metaheuristic algorithm is adopted to obtain a super molecule for accelerating convergence. For promoting the intensification capability, in the global optimization phase, the VNS algorithm with a new processor selection model is used as the initialization under the consideration of scheduling order and processor assignment, and the load balance neighborhood structure of VNS is also utilized in the ineffective reaction operator. The experimental results verify the effectiveness and efficiency of DRSCRO in terms ofmakespanand convergence rate.


Sign in / Sign up

Export Citation Format

Share Document