A PageRank-Based Method to Extract Fuzzy Expressions as Features in Supervised Classification Problems

Author(s):  
Pablo Carmona ◽  
Juan Luis Castro ◽  
Jesús Lozano ◽  
José Ignacio Suárez
Author(s):  
Tobias Scheffer

For many classification problems, unlabeled training data are inexpensive and readily available, whereas labeling training data imposes costs. Semi-supervised classification algorithms aim at utilizing information contained in unlabeled data in addition to the (few) labeled data.


Author(s):  
ALEXSEY LIAS-RODRÍGUEZ ◽  
GUILLERMO SANCHEZ-DIAZ

Typical testors are useful tools for feature selection and for determining feature relevance in supervised classification problems. Nowadays, computing all typical testors of a training matrix is very expensive; all reported algorithms have exponential complexity depending on the number of columns in the matrix. In this paper, we introduce the faster algorithm BR (Boolean Recursive), called fast-BR algorithm, that is based on elimination of gaps and reduction of columns. Fast-BR algorithm is designed to generate all typical testors from a training matrix, requiring a reduced number of operations. Experimental results using this fast implementation and the comparison with other state-of-the-art related algorithms that generate typical testors are presented.


2008 ◽  
Vol 41 (12) ◽  
pp. 3706-3719 ◽  
Author(s):  
Wing W.Y. Ng ◽  
Daniel S. Yeung ◽  
Michael Firth ◽  
Eric C.C. Tsang ◽  
Xi-Zhao Wang

Sensors ◽  
2021 ◽  
Vol 21 (19) ◽  
pp. 6661
Author(s):  
Lars Schmarje ◽  
Johannes Brünger ◽  
Monty Santarossa ◽  
Simon-Martin Schröder ◽  
Rainer Kiko ◽  
...  

Deep learning has been successfully applied to many classification problems including underwater challenges. However, a long-standing issue with deep learning is the need for large and consistently labeled datasets. Although current approaches in semi-supervised learning can decrease the required amount of annotated data by a factor of 10 or even more, this line of research still uses distinct classes. For underwater classification, and uncurated real-world datasets in general, clean class boundaries can often not be given due to a limited information content in the images and transitional stages of the depicted objects. This leads to different experts having different opinions and thus producing fuzzy labels which could also be considered ambiguous or divergent. We propose a novel framework for handling semi-supervised classifications of such fuzzy labels. It is based on the idea of overclustering to detect substructures in these fuzzy labels. We propose a novel loss to improve the overclustering capability of our framework and show the benefit of overclustering for fuzzy labels. We show that our framework is superior to previous state-of-the-art semi-supervised methods when applied to real-world plankton data with fuzzy labels. Moreover, we acquire 5 to 10% more consistent predictions of substructures.


Ciencia Unemi ◽  
2018 ◽  
Vol 11 (28) ◽  
pp. 8-17
Author(s):  
Pavel Novoa-Hernández ◽  
Dailín Cobos-Valdes ◽  
Eduardo Samaniego-Mena ◽  
Milvio Novoa-Pérez

En el presente trabajo se propone un nuevo modelo para la evaluación del riesgo biológico en procesos biofarmacéuticos. La propuesta extiende un modelo existente, aportando como principal novedad el tratamiento de las determinaciones de los niveles de consecuencia y probabilidad de riesgo, como problemas de clasificación supervisada. Específicamente, se obtuvieron modelos de clasificación basados en árboles de decisión que poseen como ventajas más importantes: 1) un número menor de indicadores para la determinación de consecuencias y probabilidades, 2) un orden de medición de los indicadores, basado en la importancia de los mismos. Con el objetivo de analizar las bondades del nuevo modelo, se consideraron tres casos de estudio relacionados con procesos farmacéuticos reales. En comparación con el modelo anterior, el nuevo ofrece resultados similares, pero facilitando notablemente el proceso de evaluación del riesgo biológico. AbstractA new model for assessing biological risk in biopharmaceutical process is proposed in the present work. This proposal extends an existing model including the handling of the consequence and probability levels computations as main novelty, and also as supervised classification problems. Specifically, two classification models based on decision trees were obtained, which gives as major advantages: 1) a lower number of indicators for the determination of consequence and probabilities, and 2) an order of measurement of the related indicators. In order to analyze the benefits of the new model, three real pharmaceutical processes were considered as cases studies. In comparison with the previous model, the new one offers similar results, but significantly facilitating the biological risk assessment process.


2021 ◽  
Vol 15 ◽  
Author(s):  
Charlotte Frenkel ◽  
Martin Lefebvre ◽  
David Bol

While the backpropagation of error algorithm enables deep neural network training, it implies (i) bidirectional synaptic weight transport and (ii) update locking until the forward and backward passes are completed. Not only do these constraints preclude biological plausibility, but they also hinder the development of low-cost adaptive smart sensors at the edge, as they severely constrain memory accesses and entail buffering overhead. In this work, we show that the one-hot-encoded labels provided in supervised classification problems, denoted as targets, can be viewed as a proxy for the error sign. Therefore, their fixed random projections enable a layerwise feedforward training of the hidden layers, thus solving the weight transport and update locking problems while relaxing the computational and memory requirements. Based on these observations, we propose the direct random target projection (DRTP) algorithm and demonstrate that it provides a tradeoff between accuracy and computational cost that is suitable for adaptive edge computing devices.


Author(s):  
Srijan Das ◽  
Arpita Dutta ◽  
Saurav Sharma ◽  
Sangharatna Godboley

Anomaly Detection is an important research domain of Pattern Recognition due to its effects of classification and clustering problems. In this paper, an anomaly detection algorithm is proposed using different primitive cost functions such as Normal Perceptron, Relaxation Criterion, Mean Square Error (MSE) and Ho-Kashyap. These criterion functions are minimized to locate the decision boundary in the data space so as to classify the normal data objects and the anomalous data objects. The authors proposed algorithm uses the concept of supervised classification, though it is very different from solving normal supervised classification problems. This proposed algorithm using different criterion functions has been compared with the accuracy of the Neural Networks (NN) in order to bring out a comparative analysis between them and discuss some advantages.


Sign in / Sign up

Export Citation Format

Share Document