Applications of Feature Selection and Regression Techniques in Materials Design

Feature selection is considered as an important preprocessing step to data mining and soft computing, whereas regression is a collection of methods to optimally assess the signal from a noisy output. Both seek to arrive at the dependence and relation between different attributes and a target material property. In the present chapter a flock of regression and feature selection techniques are discussed, and the kind of results that can be obtained with each of them has been illustrated with the help of a dataset on steel. The different methods are capable of abstracting data in different forms, thus revealing hidden knowledge from different perspectives. Choosing the most appropriate method depends on the application at hand and the kind of objective that one is looking for.

Download Full-text

CLASSIFICATION OF HIGH-DIMENSIONAL MICROARRAY DATA WITH A TWO-STEP PROCEDURE VIA A WILCOXON CRITERION AND MULTILAYER PERCEPTRON

International Journal of Computational Intelligence and Applications ◽

10.1142/s1469026811002969 ◽

2011 ◽

Vol 10 (01) ◽

pp. 1-14

Author(s):

VLADIMIR NIKULIN ◽

TIAN-HSIANG HUANG ◽

GEOFFREY J. MCLACHLAN

Keyword(s):

Data Mining ◽

Feature Selection ◽

High Dimensional ◽

Second Step ◽

Support Vector ◽

Step Procedure ◽

Leave One Out ◽

Natural Combination ◽

Feature Selection Techniques

The method presented in this paper is novel as a natural combination of two mutually dependent steps. Feature selection is a key element (first step) in our classification system, which was employed during the 2010 International RSCTC data mining (bioinformatics) Challenge. The second step may be implemented using any suitable classifier such as linear regression, support vector machine or neural networks. We conducted leave-one-out (LOO) experiments with several feature selection techniques and classifiers. Based on the LOO evaluations, we decided to use feature selection with the separation type Wilcoxon-based criterion for all final submissions. The method presented in this paper was tested successfully during the RSCTC data mining Challenge, where we achieved the top score in the Basic track.

Download Full-text

A Survey on Phishing Detection and The Importance of Feature Selection In Data Mining Classification Algorithms

Issue 4 - Journal of Science and Technology ◽

10.46243/jst.2020.v5.i6.pp11-18 ◽

2020 ◽

pp. 11-18

Keyword(s):

Data Mining ◽

Feature Selection ◽

Support Vector ◽

Classification Algorithms ◽

End User ◽

Preparation Methods ◽

Survey Paper ◽

Vector Machines ◽

Feature Selection Techniques ◽

Phishing Detection

: In this era of Internet, the issue of security of information is at its peak. One of the main threats in this cyber world is phishing attacks which is an email or website fraud method that targets the genuine webpage or an email and hacks it without the consent of the end user. There are various techniques which help to classify whether the website or an email is legitimate or fake. The major contributors in the process of detection of these phishing frauds include the classification algorithms, feature selection techniques or dataset preparation methods and the feature extraction that plays an important role in detection as well as in prevention of these attacks. This Survey Paper studies the effect of all these contributors and the approaches that are applied in the study conducted on the recent papers. Some of the classification algorithms that are implemented includes Decision tree, Random Forest , Support Vector Machines, Logistic Regression , Lazy K Star, Naive Bayes and J48 etc.

Download Full-text

Hybrid soft computing techniques for feature selection and parameter optimization in power quality data mining

Applied Soft Computing ◽

10.1016/j.asoc.2011.05.010 ◽

2011 ◽

Vol 11 (8) ◽

pp. 5485-5497 ◽

Cited By ~ 29

Author(s):

K. Manimala ◽

K. Selvi ◽

R. Ahila

Keyword(s):

Data Mining ◽

Feature Selection ◽

Power Quality ◽

Soft Computing ◽

Parameter Optimization ◽

Quality Data ◽

Soft Computing Techniques

Download Full-text

A literature review of feature selection techniques and applications: Review of feature selection in data mining

2014 IEEE International Conference on Computational Intelligence and Computing Research ◽

10.1109/iccic.2014.7238499 ◽

2014 ◽

Cited By ~ 23

Author(s):

S. Visalakshi ◽

V. Radha

Keyword(s):

Data Mining ◽

Feature Selection ◽

Literature Review ◽

Feature Selection Techniques

Download Full-text

FEATURE SELECTION FOR OPTIMIZATION OF WAVELET PACKET DECOMPOSITION IN RELIABILITY ANALYSIS OF SYSTEMS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213013600117 ◽

2013 ◽

Vol 22 (05) ◽

pp. 1360011 ◽

Cited By ~ 4

Author(s):

RANDALL WALD ◽

TAGHI M. KHOSHGOFTAAR ◽

JOHN C. SLOAN

Keyword(s):

Data Mining ◽

Feature Selection ◽

Wavelet Packet ◽

Vibration Signal ◽

Machine Learning Algorithms ◽

Wavelet Packet Decomposition ◽

Time Frequency ◽

Speed Up ◽

Frequency Domain Techniques ◽

Feature Selection Techniques

One of the most important types of signal found in the area of machine condition monitoring/prognostic health monitoring (MCM/PHM) is the vibration signal, a type of waveform. Many time-frequency domain techniques have been proposed to interpret such signals, including wavelet packet decomposition (WPD). Previous work has shown how to extend the WPD algorithm to operate on streaming signals, but the number of output variables becomes exponential in the number of levels of decomposition, hindering data mining in limited-memory environments. Feature selection techniques, well understood in other areas of data mining, can be used to greatly reduce the number of output variables and speed up the machine learning algorithms. This paper presents a case study comparing two versions of WPD both with and without feature selection, demonstrating that removing most of the features produced by the WPD does not impair its performance within the context of MCM/PHM.

Download Full-text

Interpretation function of dynamic of an underwater vehicle in non-stationary environment

MORSKIE INTELLEKTUAL`NYE TEHNOLOGII ◽

10.37220/mit.2021.54.4.100 ◽

2021 ◽

pp. 171-176

Author(s):

Ю.И. Нечаев ◽

Д.В. Никущенко

Keyword(s):

Data Mining ◽

Soft Computing ◽

Dynamic Environment ◽

Underwater Vehicle ◽

Underwater Vehicles ◽

Modern Theory ◽

Dynamic Visualization ◽

Software Complex ◽

Hidden Knowledge ◽

Urgent Computing

Рассматривается построение и анализ функций интерпретации моделей нестационарной динамики подводных объектов (ПО) новых поколений на основе функциональных пространств современной теории катастроф (СТК) [1] – [7]. Формальный аппарат концептуальных решений и принципов построения функций интерпретации реализован в нестационарной динамической среде в рамках принципа конкуренции. Процедуры функций интерпретации основаны на использовании различных моделей взаимодействия в зависимости от уровня действующих возмущений. Неопределенность и неполнота исходной информации в динамике взаимодействия ПО в нестационарной среде, определили подход к построению функций интерпретации при построении математического описания задач нестационарной динамики ПО на основе концепции мягких вычислений (Soft Computing) [7] и выявления «скрытых» знаний (Data Mining) [1]. Разработанные модели и алгоритмы интерпретации нестационарной динамики ПО реализованы в функциональном блоке моделирования многофункционального программного комплекса (МПК) динамической визуализации нестационарной динамики ПО в режиме экстренных вычислений (Urgent Computing – UC [6]. The construction and analysis of the interpretation functions of the models of unsteady dynamics of new generation an underwater vehicle (UV) based on the modern theory of disasters (STK) [1] - [7] are considered. The formal apparatus of conceptual solutions and principles of constructing interpretation functions is implemented in a non-stationary dynamic environment within the framework of the principle of competition. The procedures of the interpretation functions are based on the use of various interaction models depending on the level of acting disturbances. The uncertainty and incompleteness of the initial information on the dynamics of the interaction of underwater vehicles in a non-stationary environment determined the approach to constructing interpretation functions when constructing a mathematical description of the problems of non-stationary dynamics of underwater vehicles based on the concept of soft computing (Soft Computing) [7] and the identification of “hidden” knowledge (Data Mining) [1]. The developed models and algorithms for interpreting unsteady dynamics of submarines are implemented in the functional block for modeling a multifunctional software complex (MPC) for dynamic visualization of unsteady dynamics of underwater vehicles in emergency computing mode Urgent Computing [6].

Download Full-text

A Survey of Feature Selection Techniques in Intrusion Detection System: A Soft Computing Perspective

Advances in Intelligent Systems and Computing - Progress in Computing, Analytics and Networking ◽

10.1007/978-981-10-7871-2_75 ◽

2018 ◽

pp. 785-793 ◽

Cited By ~ 9

Author(s):

P. Ravi Kiran Varma ◽

V. Valli Kumari ◽

S. Srinivas Kumar

Keyword(s):

Feature Selection ◽

Intrusion Detection ◽

Soft Computing ◽

Intrusion Detection System ◽

Detection System ◽

System A ◽

Feature Selection Techniques

Download Full-text

Review On Feature Selection Techniques in Data Mining

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v5i11.187191 ◽

2017 ◽

Vol 5 (11) ◽

pp. 187-191

Author(s):

S. Ramadass ◽

◽

M.Gunasekaran .

Keyword(s):

Data Mining ◽

Feature Selection ◽

Feature Selection Techniques

Download Full-text

Success/Failure Prediction of Noninvasive Mechanical Ventilation in Intensive Care Units

Methods of Information in Medicine ◽

10.3414/me14-01-0015 ◽

2016 ◽

Vol 55 (03) ◽

pp. 234-241 ◽

Cited By ~ 6

Author(s):

Félix Martín-González ◽

Javier González-Robledo ◽

Fernando Sánchez-Hernández ◽

María Moreno-García

Keyword(s):

Data Mining ◽

Feature Selection ◽

Intensive Care ◽

Intensive Care Units ◽

Influential Factors ◽

Selection Methods ◽

Noninvasive Mechanical Ventilation ◽

Mining Methods ◽

The One ◽

Feature Selection Techniques

SummaryObjectives: This paper addresses the problem of decision-making in relation to the administration of noninvasive mechanical ventila tion (NIMV) in intensive care units.Methods: Data mining methods were employed to find out the factors influencing the success/failure of NIMV and to predict its results in future patients. These artificial intelligence-based methods have not been applied in this field in spite of the good results obtained in other medical areas.Results: Feature selection methods provided the most influential variables in the success/ failure of NIMV, such as NIMV hours, PaCO2 at the start, PaO2 / FiO2 ratio at the start, hematocrit at the start or PaO2 / FiO2 ratio after two hours. These methods were also used in the preprocessing step with the aim of improving the results of the classifiers. The algorithms provided the best results when the dataset used as input was the one containing the attributes selected with the CFS method. Conclusions: Data mining methods can be successfully applied to determine the most influential factors in the success/failure of NIMV and also to predict NIMV results in future patients. The results provided by classifiers can be improved by preprocessing the data with feature selection techniques.

Download Full-text

On the value of filter feature selection techniques in homogeneous ensembles effort estimation

Journal of Software Evolution and Process ◽

10.1002/smr.2343 ◽

2021 ◽

Author(s):

Mohamed Hosni ◽

Ali Idri ◽

Alain Abran

Keyword(s):

Feature Selection ◽

Effort Estimation ◽

Feature Selection Techniques

Download Full-text