Design and Procedures for the Investigation Conducted

Selection Method ◽

Cancer Classification ◽

Experimental Setup ◽

Detailed Design

In this chapter, the design of each proposed case study model mentioned in Chapter 3 is presented with their different experimental procedures. The chapter includes the data preparation, suitable parameters and data pre-processing, and detailed design of two case studies. Case 1: examining the accuracy and efficiency (time complexity) of high-performance gene selection and cancer classification algorithms; Case 2: A two-stage hybrid multi-filter feature selection method for high colon-cancer classification. It shows the experimental setup and environment and the description of the hardware and software components used.

A Robust Gene selection Method for Microarray-based Cancer Classification

Cancer Informatics ◽

10.4137/cin.s3794 ◽

2010 ◽

Vol 9 ◽

pp. CIN.S3794 ◽

Cited By ~ 21

Author(s):

Xiaosheng Wang ◽

Osamu Gotoh

Keyword(s):

Gene Expression ◽

Feature Selection ◽

Gene Selection ◽

Information Gain ◽

Expression Profiles ◽

Gene Expression Profiles ◽

Molecular Classification ◽

Selection Method ◽

Chi Square

Gene selection is of vital importance in molecular classification of cancer using high-dimensional gene expression data. Because of the distinct characteristics inherent to specific cancerous gene expression profiles, developing flexible and robust feature selection methods is extremely crucial. We investigated the properties of one feature selection approach proposed in our previous work, which was the generalization of the feature selection method based on the depended degree of attribute in rough sets. We compared the feature selection method with the established methods: the depended degree, chi-square, information gain, Relief-F and symmetric uncertainty, and analyzed its properties through a series of classification experiments. The results revealed that our method was superior to the canonical depended degree of attribute based method in robustness and applicability. Moreover, the method was comparable to the other four commonly used methods. More importantly, the method can exhibit the inherent classification difficulty with respect to different gene expression datasets, indicating the inherent biology of specific cancers.

Prostate Cancer Classification Based on Best First Search and Taguchi Feature Selection Method

Image and Video Technology - Lecture Notes in Computer Science ◽

10.1007/978-3-030-34879-3_25 ◽

2019 ◽

pp. 325-336

Author(s):

Md Akizur Rahman ◽

Priyanka Singh ◽

Ravie Chandren Muniyandi ◽

Domingo Mery ◽

Mukesh Prasad

Keyword(s):

Prostate Cancer ◽

Feature Selection ◽

Selection Method ◽

Cancer Classification ◽

Best First Search ◽

Prostate Cancer Classification

Machine Learning in Cancer Research With Applications in Colon Cancer and Big Data Analysis - Advances in Medical Technologies and Clinical Practice ◽

Final Remarks for the Research With Advanced Machine Learning Methods in Colon Cancer Analysis

10.4018/978-1-7998-7316-7.ch007 ◽

2021 ◽

pp. 151-154

Keyword(s):

Colon Cancer ◽

Time Complexity ◽

High Performance ◽

Complexity Analysis ◽

Search Algorithm ◽

Cancer Treatments ◽

Cancer Dataset ◽

Performance Accuracy ◽

Research Questions

Generally, classification accuracy is very important to gene processing and selection and cancer classification. It is needed to achieve better cancer treatments and improve medical drug assignments. However, the time complexity analysis will enhance the application's significance. To answer the research questions in Chapter 1, several case studies have been implemented (see Chapters 4 and 5), each was essential to sustain the methodologies discussed in Chapter 3. The study used a colon-cancer dataset comprising 2000 genes. The best search algorithm, GA, showed high performance with a good efficient time complexity. However, both DTs and SVMs showed the best classification contribution with reference to performance accuracy and time efficiency. However, it is difficult to apply a completely fair comparative study because existing algorithms and methods were tested by different authors to reflect the effectiveness and powerful of their own methods.

New Hybrid Features Selection Method: A Case Study on Websites Phishing

Security and Communication Networks ◽

10.1155/2017/9838169 ◽

2017 ◽

Vol 2017 ◽

pp. 1-10 ◽

Cited By ~ 13

Author(s):

Khairan D. Rajab

Keyword(s):

Feature Selection ◽

Detection Rate ◽

Selection Method ◽

Decision Makers ◽

Features Selection ◽

Hybrid Features ◽

Pros And Cons ◽

New Feature

Phishing is one of the serious web threats that involves mimicking authenticated websites to deceive users in order to obtain their financial information. Phishing has caused financial damage to the different online stakeholders. It is massive in the magnitude of hundreds of millions; hence it is essential to minimize this risk. Classifying websites into “phishy” and legitimate types is a primary task in data mining that security experts and decision makers are hoping to improve particularly with respect to the detection rate and reliability of the results. One way to ensure the reliability of the results and to enhance performance is to identify a set of related features early on so the data dimensionality reduces and irrelevant features are discarded. To increase reliability of preprocessing, this article proposes a new feature selection method that combines the scores of multiple known methods to minimize discrepancies in feature selection results. The proposed method has been applied to the problem of website phishing classification to show its pros and cons in identifying relevant features. Results against a security dataset reveal that the proposed preprocessing method was able to derive new features datasets which when mined generate high competitive classifiers with reference to detection rate when compared to results obtained from other features selection methods.

An Enhancement in Cancer Classification Accuracy Using a Two-Step Feature Selection Method Based on Artificial Neural Networks with 15 Neurons

Symmetry ◽

10.3390/sym12020271 ◽

2020 ◽

Vol 12 (2) ◽

pp. 271 ◽

Author(s):

Md Akizur Rahman ◽

Ravie Chandren Muniyandi

Keyword(s):

Neural Network ◽

Feature Selection ◽

Classification Accuracy ◽

Selection Method ◽

Cancer Classification ◽

Neuron Network ◽

Artificial Neural ◽

Artificial Neural Network Ann ◽

Risk Of Cancer

An artificial neural network (ANN) is a tool that can be utilized to recognize cancer effectively. Nowadays, the risk of cancer is increasing dramatically all over the world. Detecting cancer is very difficult due to a lack of data. Proper data are essential for detecting cancer accurately. Cancer classification has been carried out by many researchers, but there is still a need to improve classification accuracy. For this purpose, in this research, a two-step feature selection (FS) technique with a 15-neuron neural network (NN), which classifies cancer with high accuracy, is proposed. The FS method is utilized to reduce feature attributes, and the 15-neuron network is utilized to classify the cancer. This research utilized the benchmark Wisconsin Diagnostic Breast Cancer (WDBC) dataset to compare the proposed method with other existing techniques, showing a significant improvement of up to 99.4% in classification accuracy. The results produced in this research are more promising and significant than those in existing papers.

Machine Learning in Cancer Research With Applications in Colon Cancer and Big Data Analysis - Advances in Medical Technologies and Clinical Practice ◽

Findings for the Conducted Investigations

10.4018/978-1-7998-7316-7.ch005 ◽

2021 ◽

pp. 117-141

Keyword(s):

Feature Selection ◽

Gene Selection ◽

Cancer Classification ◽

Classification Algorithms ◽

Third Phase ◽

Original Dataset ◽

Two Phases ◽

Selection Algorithms ◽

Gene Feature

This chapter focuses on the results produced from each case study experiment. For case one, the experiments were conducted in three phases. Phase one implemented GA, PSO, and IG as the gene/feature selection algorithms over the entire dataset. Phase =two2 utilised the original dataset to implement only the cancer classification algorithms without involving any gene/feature selection algorithms. Four recognised classification algorithms are employed: SVM, NB, GP, and DT. The third phase implemented the combined approach of gene selection and cancer classification algorithms. The results of these phases are presented in the next subsections. For case two, these experiments were implemented in two phases. Phase one implemented the classification algorithms over the features selected by the hybridised selection algorithms (GA+IG), whereas Phase two classified the features using the proposed two-stage multifilter selection system. In this section, the results are presented as follows

2018 2nd International Conference on BioSignal Analysis, Processing and Systems (ICBAPS) ◽

A Hybrid Filter-Wrapper Gene Selection Method for Cancer Classification

10.1109/icbaps.2018.8527392 ◽

2018 ◽

Author(s):

Osama Ahmad Alomari ◽

Ahamad Tajudin Khader ◽

Mohammed Azmi Al-Betar ◽

Zaid Abdi Alkareem Alyasseri

Keyword(s):

Gene Selection ◽

Selection Method ◽

Cancer Classification ◽

Hybrid Filter ◽

Gene Selection Method

Meta-analysis approach as a gene selection method in class prediction: does it improve model performance? A case study in acute myeloid leukemia

BMC Bioinformatics ◽

10.1186/s12859-017-1619-7 ◽

2017 ◽

Vol 18 (1) ◽

Author(s):

Putri W. Novianti ◽

Victor L. Jong ◽

Kit C. B. Roes ◽

Marinus J. C. Eijkemans

Keyword(s):

Myeloid Leukemia ◽

Gene Selection ◽

Meta Analysis ◽

Model Performance ◽

Selection Method ◽

Gene Selection Method ◽

Class Prediction ◽

Improve Model ◽

Improve Model Performance

An Improved Feature Selection Based on Effective Range for Classification

The Scientific World JOURNAL ◽

10.1155/2014/972125 ◽

2014 ◽

Vol 2014 ◽

pp. 1-8 ◽

Cited By ~ 4

Author(s):

Jianzhong Wang ◽

Shuang Zhou ◽

Yugen Yi ◽

Jun Kong

Keyword(s):

Feature Selection ◽

Gene Selection ◽

State Of The Art ◽

Selection Method ◽

Effective Range ◽

Inclusion Relation ◽

Statistical Feature ◽

Selection Approach ◽

Feature Selection Approach

Feature selection is a key issue in the domain of machine learning and related fields. The results of feature selection can directly affect the classifier’s classification accuracy and generalization performance. Recently, a statistical feature selection method named effective range based gene selection (ERGS) is proposed. However, ERGS only considers the overlapping area (OA) among effective ranges of each class for every feature; it fails to handle the problem of the inclusion relation of effective ranges. In order to overcome this limitation, a novel efficient statistical feature selection approach called improved feature selection based on effective range (IFSER) is proposed in this paper. In IFSER, an including area (IA) is introduced to characterize the inclusion relation of effective ranges. Moreover, the samples’ proportion for each feature of every class in both OA and IA is also taken into consideration. Therefore, IFSER outperforms the original ERGS and some other state-of-the-art algorithms. Experiments on several well-known databases are performed to demonstrate the effectiveness of the proposed method.

Study of Classification Accuracy of Microarray Data for Cancer Classification using Multivariate and Hybrid Feature Selection Method

IOSR Journal of Engineering ◽

10.9790/3021-0281112119 ◽

2012 ◽

Vol 02 (08) ◽

pp. 112-119 ◽

Author(s):

Sujata Dash

Keyword(s):

Feature Selection ◽

Microarray Data ◽

Classification Accuracy ◽

Selection Method ◽

Cancer Classification