unsupervised method
Recently Published Documents


TOTAL DOCUMENTS

180
(FIVE YEARS 60)

H-INDEX

14
(FIVE YEARS 2)

2022 ◽  
Vol 176 ◽  
pp. 107344
Author(s):  
Xu Wang ◽  
Junwu Zhou ◽  
Qingkai Wang ◽  
Daoxi Liu ◽  
Jingmin Lian

Plants ◽  
2021 ◽  
Vol 10 (12) ◽  
pp. 2741
Author(s):  
Rahul Jamdade ◽  
Maulik Upadhyay ◽  
Khawla Al Shaer ◽  
Eman Al Harthi ◽  
Mariam Al Sallani ◽  
...  

Arabia is the largest peninsula in the world, with >3000 species of vascular plants. Not much effort has been made to generate a multi-locus marker barcode library to identify and discriminate the recorded plant species. This study aimed to determine the reliability of the available Arabian plant barcodes (>1500; rbcL and matK) at the public repository (NCBI GenBank) using the unsupervised and supervised methods. Comparative analysis was carried out with the standard dataset (FINBOL) to assess the methods and markers’ reliability. Our analysis suggests that from the unsupervised method, TaxonDNA’s All Species Barcode criterion (ASB) exhibits the highest accuracy for rbcL barcodes, followed by the matK barcodes using the aligned dataset (FINBOL). However, for the Arabian plant barcode dataset (GBMA), the supervised method performed better than the unsupervised method, where the Random Forest and K-Nearest Neighbor (gappy kernel) classifiers were robust enough. These classifiers successfully recognized true species from both barcode markers belonging to the aligned and alignment-free datasets, respectively. The multi-class classifier showed high species resolution following the two classifiers, though its performance declined when employed to recognize true species. Similar results were observed for the FINBOL dataset through the supervised learning approach; overall, matK marker showed higher accuracy than rbcL. However, the lower rate of species identification in matK in GBMA data could be due to the higher evolutionary rate or gaps and missing data, as observed for the ASB criterion in the FINBOL dataset. Further, a lower number of sequences and singletons could also affect the rate of species resolution, as observed in the GBMA dataset. The GBMA dataset lacks sufficient species membership. We would encourage the taxonomists from the Arabian Peninsula to join our campaign on the Arabian Barcode of Life at the Barcode of Life Data (BOLD) systems. Our efforts together could help improve the rate of species identification for the Arabian Vascular plants.


2021 ◽  
Vol 313 ◽  
pp. 125563
Author(s):  
Zhandong Yuan ◽  
Shengyang Zhu ◽  
Chao Chang ◽  
Xuancheng Yuan ◽  
Qinglai Zhang ◽  
...  

2021 ◽  
Vol 13 (23) ◽  
pp. 4874
Author(s):  
Jihan Alameddine ◽  
Kacem Chehdi ◽  
Claude Cariou

In this paper, we propose a true unsupervised method to partition large-size images, where the number of classes, training samples, and other a priori information is not known. Thus, partitioning an image without any knowledge is a great challenge. This novel adaptive and hierarchical classification method is based on affinity propagation, where all criteria and parameters are adaptively calculated from the image to be partitioned. It is reliable to objectively discover classes of an image without user intervention and therefore satisfies all the objectives of an unsupervised method. Hierarchical partitioning adopted allows the user to analyze and interpret the data very finely. The optimal partition maximizing an objective criterion provides the number of classes and the exemplar of each class. The efficiency of the proposed method is demonstrated through experimental results on hyperspectral images. The obtained results show its superiority over the most widely used unsupervised and semi-supervised methods. The developed method can be used in several application domains to partition large-size images or data. It allows the user to consider all or part of the obtained classes and gives the possibility to select the samples in an objective way during a learning process.


Sensors ◽  
2021 ◽  
Vol 21 (18) ◽  
pp. 6198
Author(s):  
Daniel de Matos Luna dos Santos ◽  
Ewaldo Eder Carvalho Santana ◽  
Paulo Fernandes da Silva Junior ◽  
Jonathan Araujo Queiroz ◽  
João Viana da Fonseca Neto ◽  
...  

In this paper, a bioinspired method in the magnetic field memory of the bees, applied in a rover of precision pollination, is presented. The method calculates sharpness features by entropy and variance of the Laplacian of images segmented by color in the HSV system in real-time. A complementary positioning method based on area feature extraction between active markers was developed, analyzing color characteristics, noise, and vibrations of the probe in time and frequency, through the lateral image of the probe. From the observed results, it can be seen that the unsupervised method does not require previous calibration of target dimensions, histogram, and distances involved in positioning. The algorithm showed less sensitivity in the extraction of sharpness characteristics regarding the number of edges and greater sensitivity to the gradient, allowing unforeseen operation scenarios, even in small sharpness variations, and robust response to variance local, temporal, and geophysical of the magnetic declination, not needing luminosity after scanning, with the two freedom of degrees of the rotation.


2021 ◽  
Vol 12 (10) ◽  
pp. 6284
Author(s):  
Mengyang Lu ◽  
Xin Liu ◽  
Chengcheng Liu ◽  
Boyi Li ◽  
Wenting Gu ◽  
...  

2021 ◽  
Author(s):  
Zuguang Gu ◽  
Daniel Huebschmann

Consensus partitioning is an unsupervised method widely used in high throughput data analysis for revealing subgroups and assigns stability for the classification. However, standard consensus partitioning procedures are weak to identify large numbers of stable subgroups. There are two main issues. 1. Subgroups with small differences are difficult to separate if they are simultaneously detected with subgroups with large differences. And 2. stability of classification generally decreases as the number of subgroups increases. In this work, we proposed a new strategy to solve these two issues by applying consensus partitionings in a hierarchical procedure. We demonstrated hierarchical consensus partitioning can be efficient to reveal more subgroups. We also tested the performance of hierarchical consensus partitioning on revealing a great number of subgroups with a DNA methylation dataset. The hierarchical consensus partitioning is implemented in the R package cola with comprehensive functionality for analysis and visualizations. It can also automate the analysis only with a minimum of two lines of code, which generates a detailed HTML report containing the complete analysis.


Sign in / Sign up

Export Citation Format

Share Document