A new approach to feature selection

Abstract: Textual Feature Selection (TFS) aims to extract the parts or segments of a text that are most relevant with respect to the information it expresses. The selected features are useful for automatic indexing, summarization, document categorization, knowledge discovery, and so on. Given the huge amount of electronic textual data published daily, many challenges related to the semantic aspect as well as processing efficiency must be addressed. In this paper, we propose a new approach to TFS based on Formal Concept Analysis. Specifically, we propose to extract textual features by exploring the regularities in a formal context where isolated points exist. We introduce the notion of N-composite isolated points: a set of N words to be considered as a single textual feature. We show that a small value of N (between 1 and 3) allows extracting significant textual features compared with existing approaches, even when the initial formal context is not completely covered.
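
To make the term "formal context" concrete, the following is a minimal sketch of the two standard derivation operators of Formal Concept Analysis. It assumes sentences act as objects and words as attributes; the names `extent`, `intent`, `is_formal_concept`, and the toy `context` are illustrative assumptions. It does not implement the N-composite isolated points proposed in the abstract.

```python
def extent(attributes, context):
    """Objects that have every attribute in `attributes`."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def intent(objects, context):
    """Attributes shared by every object in `objects`."""
    shared = set().union(*context.values())  # start from all attributes
    for obj in objects:
        shared &= context[obj]
    return shared


def is_formal_concept(objects, attributes, context):
    """A pair (A, B) is a formal concept when each side derives the other."""
    return (extent(attributes, context) == objects
            and intent(objects, context) == attributes)


# Hypothetical toy context: sentence id -> set of words it contains.
context = {
    "s1": {"formal", "concept", "analysis"},
    "s2": {"formal", "context"},
    "s3": {"feature", "selection"},
}

shared_by_formal = extent({"formal"}, context)      # sentences containing "formal"
common_words = intent(shared_by_formal, context)    # words those sentences share
```

Here the pair formed by sentences `s1` and `s2` with the word "formal" is a formal concept, because each side can be derived from the other.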

A New Approach for Spatio-Spectral Feature Selection for Sensors with Noisy and Overlapping Spectral Bands

IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2008.4780109 ◽

2008 ◽

Author(s):

Biliana S. Paskaleva ◽

Majeed M. Hayat ◽

Woo-Yong Jang ◽

Sanjay Krishna

Keyword(s):

Feature Selection ◽

Spectral Feature ◽

New Approach ◽

Spectral Bands ◽

Selection For ◽

Spectral Feature Selection

A New Approach for Wrapper Feature Selection Using Genetic Algorithm for Big Data

Proceedings in Adaptation, Learning and Optimization - Intelligent and Evolutionary Systems ◽

10.1007/978-3-319-27000-5_6 ◽

2015 ◽

pp. 75-83 ◽

Cited By ~ 2

Author(s):

Waad Bouaguel

Keyword(s):

Genetic Algorithm ◽

Feature Selection ◽

Big Data ◽

New Approach ◽

Wrapper Feature Selection

Vector space model for patent documents with hierarchical class labels

Journal of Information Science ◽

10.1177/0165551512437635 ◽

2012 ◽

Vol 38 (3) ◽

pp. 222-233 ◽

Cited By ~ 6

Author(s):

Yen-Liang Chen ◽

Yu-Ting Chiu

Keyword(s):

Feature Selection ◽

Vector Space ◽

Selection Process ◽

Vector Space Model ◽

Class Label ◽

Discriminative Ability ◽

New Approach ◽

Space Model ◽

Patent Documents ◽

Class Labels

A vector space model (VSM) composed of selected important features is a common way to represent documents, including patent documents. Patent documents have some special characteristics that make it difficult to apply traditional feature selection methods directly: (a) it is difficult to find common terms for patent documents in different categories; and (b) the class label of a patent document is hierarchical rather than flat. Hence, in this article we propose a new approach that includes a hierarchical feature selection (HFS) algorithm, which can be used to select more representative features with greater discriminative ability to represent a set of patent documents with hierarchical class labels. The performance of the proposed method is evaluated through application to two document sets with 2400 and 9600 patent documents, where we extract candidate terms from their titles and abstracts. The experimental results reveal that a VSM whose features are selected by a proportional selection process gives better coverage, while a VSM whose features are selected with a weighted-summed selection process gives higher accuracy.

New approach for automatic recognition of melanoma in profilometry: optimized feature selection using genetic algorithms

10.1117/12.310948 ◽

1998 ◽

Cited By ~ 1

Author(s):

Heinz Handels ◽

Th Ross ◽

J. Kreusch ◽

H. H. Wolff ◽

S. J. Poeppl

Keyword(s):

Genetic Algorithms ◽

Feature Selection ◽

Automatic Recognition ◽

New Approach

FEATURE SELECTION FOR SUPPORT VECTOR MACHINES USING GENETIC ALGORITHMS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213004001818 ◽

2004 ◽

Vol 13 (04) ◽

pp. 791-800 ◽

Cited By ~ 26

Author(s):

HOLGER FRÖHLICH ◽

OLIVIER CHAPELLE ◽

BERNHARD SCHÖLKOPF

Keyword(s):

Genetic Algorithms ◽

Feature Selection ◽

Support Vector Machines ◽

Cross Validation ◽

Support Vector ◽

Generalization Error ◽

New Approach ◽

Vector Machines ◽

Selection For ◽

Natural Way

The problem of feature selection is a difficult combinatorial task in Machine Learning and of high practical relevance, e.g. in bioinformatics. Genetic Algorithms (GAs) offer a natural way to solve this problem. In this paper we present a specialized Genetic Algorithm that explicitly takes into account the existing bounds on the generalization error for Support Vector Machines (SVMs). This new approach is compared to the traditional method of performing cross-validation and to other existing algorithms for feature selection.
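
As a rough illustration of how a genetic algorithm can search over feature subsets, here is a minimal sketch that evolves 0/1 feature masks. It is not the algorithm from the paper: the function `ga_select`, its parameters, and `toy_fitness` are assumptions made for illustration. In practice the fitness would be a cross-validated classifier score or an SVM generalization bound rather than the toy score used here.

```python
import random


def ga_select(n_features, fitness, pop_size=20, generations=30,
              mutation_rate=0.1, seed=0):
    """Search for a 0/1 feature mask that maximizes fitness(mask).

    Assumes n_features >= 2 and pop_size >= 3.
    """
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        next_pop = [ranked[0][:], ranked[1][:]]  # elitism: keep the two best
        while len(next_pop) < pop_size:
            # Tournament selection: best of three random individuals.
            p1 = max(rng.sample(pop, 3), key=fitness)
            p2 = max(rng.sample(pop, 3), key=fitness)
            cut = rng.randrange(1, n_features)   # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            next_pop.append(child)
        pop = next_pop
    best = max(pop, key=fitness)
    return best, fitness(best)


def toy_fitness(mask):
    """Hypothetical stand-in: features 0 and 2 help, every feature costs 0.25."""
    relevant = (0, 2)
    return sum(mask[i] for i in relevant) - 0.25 * sum(mask)


best_mask, best_score = ga_select(6, toy_fitness)
```

Because the two best masks are copied unchanged into each new generation, the best score never decreases from one generation to the next.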

A new approach to feature selection based on the Karhunen-Loeve expansion

Pattern Recognition ◽

10.1016/0031-3203(73)90025-3 ◽

1973 ◽

Vol 5 (4) ◽

pp. 335-352 ◽

Cited By ~ 99

Author(s):

Josef Kittler ◽

Peter C. Young

Keyword(s):

Feature Selection ◽

New Approach

OPTIMIZATION OF EVALUATION OF THE INFORMATIVITY OF MEDICAL INDICATORS ON THE BASIS OF THE HYBRID APPROACH

Transport development ◽

10.33082/td.2017.1-1.11 ◽

2017 ◽

pp. 108-115

Author(s):

Є.В. БОДЯНСЬКИЙ ◽

І.Г. ПЕРОВА ◽

Г.В. СТОЙКА

Keyword(s):

Data Mining ◽

Principal Component Analysis ◽

Feature Selection ◽

Hybrid Approach ◽

Principal Component ◽

Mining Area ◽

Extraction Methods ◽

Optimal Combination ◽

Information Quantity ◽

New Approach

Feature Selection is one of the most complicated and pressing tasks in the Data Mining area. Approaches to solving it are based on non-mathematical and presentative hypotheses. A new approach for evaluating the information quantity of medical features is presented, based on an optimal combination of Feature Selection and Feature Extraction methods. This approach yields an optimally reduced number of features with a linguistic interpretation of each one. A hybrid Feature Selection/Extraction system is proposed. The system is numerically simple and can perform Feature Selection/Extraction with any number of features using the standard method of principal component analysis and calculating the distance between the first principal component and all medical features.
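
The idea of ranking features by their distance to the first principal component can be sketched as follows. This is one possible reading, not the authors' system: it assumes Euclidean distance between each standardized feature column and the standardized first-component scores, and it approximates the component by power iteration. All function names and the toy data are illustrative assumptions.

```python
import math


def standardize(columns):
    """Scale each feature column to zero mean and unit variance."""
    scaled = []
    for col in columns:
        mean = sum(col) / len(col)
        std = math.sqrt(sum((x - mean) ** 2 for x in col) / len(col))
        if std == 0:
            raise ValueError("constant feature has no variance")
        scaled.append([(x - mean) / std for x in col])
    return scaled


def first_component_scores(columns, iterations=200):
    """Approximate PC1 by power iteration; return its score for each sample."""
    d, n = len(columns), len(columns[0])
    cov = [[sum(a * b for a, b in zip(columns[i], columns[j])) / n
            for j in range(d)] for i in range(d)]
    w = [1.0] * d  # assumed not orthogonal to the leading eigenvector
    for _ in range(iterations):
        w = [sum(cov[i][j] * w[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        w = [x / norm for x in w]
    return [sum(w[i] * columns[i][k] for i in range(d)) for k in range(n)]


def rank_by_distance_to_pc1(columns):
    """Return feature indices ordered from closest to farthest from PC1."""
    z = standardize(columns)
    scores = standardize([first_component_scores(z)])[0]

    def distance(col):
        # The sign of PC1 is arbitrary, so take the smaller of the two distances.
        plus = math.sqrt(sum((a - b) ** 2 for a, b in zip(col, scores)))
        minus = math.sqrt(sum((a + b) ** 2 for a, b in zip(col, scores)))
        return min(plus, minus)

    return sorted(range(len(z)), key=lambda i: distance(z[i]))


# Hypothetical data: one list per feature, one value per sample.
features = [
    [1, 2, 3, 4, 5],
    [2, 4, 6, 8, 10.5],
    [3, -1, 2, -2, 1],
]
ranking = rank_by_distance_to_pc1(features)
```

In this toy data the first two features are almost perfectly correlated and dominate the first component, so the third feature ends up farthest from it.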

A new approach for HIV-1 protease cleavage site prediction combined with feature selection

Journal of Biomedical Science and Engineering ◽

10.4236/jbise.2013.612144 ◽

2013 ◽

Vol 06 (12) ◽

pp. 1155-1160 ◽

Cited By ~ 1

Author(s):

Yao Yuan ◽

Hui Liu ◽

Guangtao Qiu

Keyword(s):

Feature Selection ◽

Cleavage Site ◽

New Approach ◽

Site Prediction ◽

Protease Cleavage Site ◽

Protease Cleavage ◽

Cleavage Site Prediction ◽

HIV-1

A new approach for text feature selection based on OWA operator

2010 5th International Symposium on Telecommunications ◽

10.1109/istel.2010.5734091 ◽

2010 ◽

Cited By ~ 3

Author(s):

Mohammad Ali Ghaderi ◽

Nasser Yazdani ◽

Behzad Moshiri ◽

Maryam Tayefeh Mahmoudi

Keyword(s):

Feature Selection ◽

OWA Operator ◽

New Approach ◽

Text Feature
