Semi-Supervised learning with Collaborative Bagged Multi-label K-Nearest-Neighbors

Nesma Settouti; Khalida Douibi; Mohammed El Amine Bechar; Mostafa El Habib Daho; Meryem Saidi

doi:10.1515/comp-2019-0017

Semi-Supervised learning with Collaborative Bagged Multi-label K-Nearest-Neighbors

Open Computer Science ◽

10.1515/comp-2019-0017 ◽

2019 ◽

Vol 9 (1) ◽

pp. 226-242

Author(s):

Nesma Settouti ◽

Khalida Douibi ◽

Mohammed El Amine Bechar ◽

Mostafa El Habib Daho ◽

Meryem Saidi

Keyword(s):

Supervised Learning ◽

Real World ◽

Nearest Neighbors ◽

Unlabeled Data ◽

Manual Annotation ◽

K Nearest Neighbors ◽

Significant Attention

AbstractOver the last few years, Multi-label classification has received significant attention from researchers to solve many issues in many fields. The manual annotation of available datasets is time-consuming and need a huge effort from the expert, especially for Multi-label applications in which each example of learning is associated with many labels at once. To overcome the manual annotation drawback, and to take advantages from the large amounts of unlabeled data, many semi-supervised approaches were proposed in the literature to give more sophisticated and fast solutions to support the automatic labeling of the unlabeled data. In this paper, a Collaborative Bagged Multi-label K-Nearest-Neighbors (CobMLKNN) algorithm is proposed, that extend the co-Training paradigm by a Multi-label K-Nearest-Neighbors algorithm. Experiments on ten real-world Multi-label datasets show the effectiveness of CobMLKNN algorithm to improve the performance of MLKNN to learn from a small number of labeled samples by exploiting unlabeled samples.

Download Full-text

Tropical Balls and Its Applications to K Nearest Neighbor over the Space of Phylogenetic Trees

Mathematics ◽

10.3390/math9070779 ◽

2021 ◽

Vol 9 (7) ◽

pp. 779

Author(s):

Ruriko Yoshida

Keyword(s):

Supervised Learning ◽

Phylogenetic Trees ◽

Nearest Neighbor ◽

Nearest Neighbors ◽

High Dimensional ◽

Learning Method ◽

Dimensional Vector ◽

K Nearest Neighbor ◽

K Nearest Neighbors

A tropical ball is a ball defined by the tropical metric over the tropical projective torus. In this paper we show several properties of tropical balls over the tropical projective torus and also over the space of phylogenetic trees with a given set of leaf labels. Then we discuss its application to the K nearest neighbors (KNN) algorithm, a supervised learning method used to classify a high-dimensional vector into given categories by looking at a ball centered at the vector, which contains K vectors in the space.

Download Full-text

Multiple Noisy Label Distribution Propagation for Crowdsourcing

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/204 ◽

2019 ◽

Cited By ~ 1

Author(s):

Hao Zhang ◽

Liangxiao Jiang ◽

Wenqiang Xu

Keyword(s):

Supervised Learning ◽

Real World ◽

Effective Means ◽

Ground Truth ◽

Cost Effective ◽

Nearest Neighbors ◽

True Label ◽

Real World Datasets ◽

The Individual ◽

Label Distribution

Crowdsourcing services provide a fast, efficient, and cost-effective means of obtaining large labeled data for supervised learning. Ground truth inference, also called label integration, designs proper aggregation strategies to infer the unknown true label of each instance from the multiple noisy label set provided by ordinary crowd workers. However, to the best of our knowledge, nearly all existing label integration methods focus solely on the multiple noisy label set itself of the individual instance while totally ignoring the intercorrelation among multiple noisy label sets of different instances. To solve this problem, a multiple noisy label distribution propagation (MNLDP) method is proposed in this study. MNLDP first transforms the multiple noisy label set of each instance into its multiple noisy label distribution and then propagates its multiple noisy label distribution to its nearest neighbors. Consequently, each instance absorbs a fraction of the multiple noisy label distributions from its nearest neighbors and yet simultaneously maintains a fraction of its own original multiple noisy label distribution. Promising experimental results on simulated and real-world datasets validate the effectiveness of our proposed method.

Download Full-text

Graph Algorithm to Find Core Periphery Structures using Mutual K-nearest Neighbors

International Journal of Artificial Intelligence & Applications ◽

10.5121/ijaia.2021.12101 ◽

2021 ◽

Vol 12 (1) ◽

pp. 1-18

Author(s):

Divya Sardana ◽

Raj Bhatnagar Bhatnagar

Keyword(s):

Real World ◽

Biological Networks ◽

Metabolic Networks ◽

Graph Algorithm ◽

Nearest Neighbors ◽

Weighted Graphs ◽

K Nearest Neighbors ◽

Definition Of ◽

Affinity Measure ◽

Meso Scale

Core periphery structures exist naturally in many complex networks in the real-world like social, economic, biological and metabolic networks. Most of the existing research efforts focus on the identification of a meso scale structure called community structure. Core periphery structures are another equally important meso scale property in a graph that can help to gain deeper insights about the relationships between different nodes. In this paper, we provide a definition of core periphery structures suitable for weighted graphs. We further score and categorize these relationships into different types based upon the density difference between the core and periphery nodes. Next, we propose an algorithm called CP-MKNN (Core Periphery-Mutual K Nearest Neighbors) to extract core periphery structures from weighted graphs using a heuristic node affinity measure called Mutual K-nearest neighbors (MKNN). Using synthetic and real-world social and biological networks, we illustrate the effectiveness of developed core periphery structures.

Download Full-text

Cross Domain Adaptation of Crowd Counting with Model-Agnostic Meta-Learning

Applied Sciences ◽

10.3390/app112412037 ◽

2021 ◽

Vol 11 (24) ◽

pp. 12037

Author(s):

Xiaoyu Hou ◽

Jihui Xu ◽

Jinming Wu ◽

Huaiyu Xu

Keyword(s):

Feature Extraction ◽

Supervised Learning ◽

Density Estimation ◽

Real World ◽

Domain Adaptation ◽

Manual Annotation ◽

Crowd Counting ◽

Cross Domain ◽

Meta Learning ◽

Density Map

Counting people in crowd scenarios is extensively conducted in drone inspections, video surveillance, and public safety applications. Today, crowd count algorithms with supervised learning have improved significantly, but with a reliance on a large amount of manual annotation. However, in real world scenarios, different photo angles, exposures, location heights, complex backgrounds, and limited annotation data lead to supervised learning methods not working satisfactorily, plus many of them suffer from overfitting problems. To address the above issues, we focus on training synthetic crowd data and investigate how to transfer information to real-world datasets while reducing the need for manual annotation. CNN-based crowd-counting algorithms usually consist of feature extraction, density estimation, and count regression. To improve the domain adaptation in feature extraction, we propose an adaptive domain-invariant feature extracting module. Meanwhile, after taking inspiration from recent innovative meta-learning, we present a dynamic-β MAML algorithm to generate a density map in unseen novel scenes and render the density estimation model more universal. Finally, we use a counting map refiner to optimize the coarse density map transformation into a fine density map and then regress the crowd number. Extensive experiments show that our proposed domain adaptation- and model-generalization methods can effectively suppress domain gaps and produce elaborate density maps in cross-domain crowd-counting scenarios. We demonstrate that the proposals in our paper outperform current state-of-the-art techniques.

Download Full-text

PNS207 Patient and Sub-Population Simulation VIA Multidimensional Correlation Generation (MCG) and Weighted K Nearest Neighbors (KNN) Algorithms for Clinical Trials and Real-World DATA

Value in Health ◽

10.1016/j.jval.2020.08.1651 ◽

2020 ◽

Vol 23 ◽

pp. S675-S676

Author(s):

G. Wei ◽

Y. Wang ◽

M. Behnke ◽

E. Reiner ◽

...

Keyword(s):

Clinical Trials ◽

Real World ◽

Nearest Neighbors ◽

Real World Data ◽

K Nearest Neighbors ◽

World Data ◽

Population Simulation

Download Full-text

An Online Incremental Semi-Supervised Learning Method

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2010.p0593 ◽

2010 ◽

Vol 14 (6) ◽

pp. 593-605 ◽

Cited By ~ 5

Author(s):

Furao Shen ◽

◽

Hui Yu ◽

Youki Kamiya ◽

Osamu Hasegawa ◽

...

Keyword(s):

Supervised Learning ◽

Prior Knowledge ◽

Network Structure ◽

Real World ◽

Input Data ◽

Data Distribution ◽

Semisupervised Learning ◽

Unlabeled Data ◽

Learning Method ◽

Real World Data

Using labeled data and large amounts of unlabeled data, our proposed online incremental semisupervised learning automatically learns the topology of input data distribution without prior knowledge of numbers of nodes or network structure. Using labeled data, it labels generated nodes and divides a learned topology into substructures corresponding to classes. Node weights used as prototype vectors enable classification. New labeled or unlabeled data is added incrementally to the system during learning. Experimental results for artificial and real-world data show that this learning efficiently learns online incremental tasks even in noisy and non-stationary environments.

Download Full-text

Supervised Learning-Classification Using K-Nearest Neighbors (KNN)

Python® Machine Learning ◽

10.1002/9781119557500.ch9 ◽

2019 ◽

pp. 205-220 ◽

Cited By ~ 1

Keyword(s):

Supervised Learning ◽

Nearest Neighbors ◽

K Nearest Neighbors

Download Full-text

A Novel Density Peaks Clustering Algorithm Based on K Nearest Neighbors With Adaptive Merging Strategy

10.21203/rs.3.rs-95747/v1 ◽

2020 ◽

Author(s):

Xiaoning Yuan ◽

Hang Yu ◽

Jun Liang ◽

Bing Xu

Keyword(s):

Real World ◽

Clustering Algorithm ◽

Nearest Neighbors ◽

K Nearest Neighbors ◽

Density Peaks ◽

Density Peaks Clustering ◽

Cutoff Distance ◽

Real World Datasets ◽

Merging Strategy ◽

Selection Of

Abstract Recently the density peaks clustering algorithm (dubbed as DPC) attracts lots of attention. The DPC is able to quickly find cluster centers and complete clustering tasks. And the DPC is suitable for many clustering tasks. However, the cutoff distance 𝑑𝑑𝑐𝑐 is depends on human experience which will greatly affect the clustering results. In addition, the selection of cluster centers requires manual participation which will affect the clustering efficiency. In order to solve these problem, we propose a density peaks clustering algorithm based on K nearest neighbors with adaptive merging strategy (dubbed as KNN-ADPC). We propose a clusters merging strategy to automatically aggregate the over-segmented clusters. Additionally, the K nearest neighbors is adopted to divide points more reasonably. The KNN-ADPC only has one parameter and the clustering task can be conducted automatically without human involvement. The experiment results on artificial and real-world datasets prove the higher accuracy of KNN-ADPC compared with DBSCAN, K-means++, DPC and DPC-KNN.

Download Full-text