A unified view of density-based methods for semi-supervised clustering and classification

2019 ◽  
Vol 33 (6) ◽  
pp. 1894-1952 ◽  
Author(s):  
Jadson Castro Gertrudes ◽  
Arthur Zimek ◽  
Jörg Sander ◽  
Ricardo J. G. B. Campello

Abstract Semi-supervised learning is drawing increasing attention in the era of big data, as the gap between the abundance of cheap, automatically collected unlabeled data and the scarcity of labeled data that are laborious and expensive to obtain is dramatically increasing. In this paper, we first introduce a unified view of density-based clustering algorithms. We then build upon this view and bridge the areas of semi-supervised clustering and classification under a common umbrella of density-based techniques. We show that there are close relations between density-based clustering algorithms and the graph-based approach for transductive classification. These relations are then used as a basis for a new framework for semi-supervised classification based on building-blocks from density-based clustering. This framework is not only efficient and effective, but it is also statistically sound. In addition, we generalize the core algorithm in our framework, HDBSCAN*, so that it can also perform semi-supervised clustering by directly taking advantage of any fraction of labeled data that may be available. Experimental results on a large collection of datasets show the advantages of the proposed approach both for semi-supervised classification as well as for semi-supervised clustering.

2020 ◽  
Vol 34 (6) ◽  
pp. 1984-1985
Author(s):  
Jadson Castro Gertrudes ◽  
Arthur Zimek ◽  
Jörg Sander ◽  
Ricardo J. G. B. Campello

2011 ◽  
Vol 121-126 ◽  
pp. 4675-4679
Author(s):  
Ming Wei Leng ◽  
Xiao Yun Chen ◽  
Jian Jun Cheng ◽  
Long Jie Li

In many data mining domains, labeled data are very expensive to generate, so the core problem of semi-supervised clustering is how to make the best use of labeled data to guide the clustering of unlabeled data. Most semi-supervised clustering algorithms require a certain amount of labeled data and have several parameters whose values must be set, and different values may produce different results. In view of this, a new algorithm, called semi-supervised clustering algorithm based on small size of labeled data, is presented. It uses a small amount of labeled data to expand the labeled dataset by labeling the k-nearest neighbors of the labeled points, and it requires only one parameter. We evaluate the algorithm on three UCI datasets and compare it with SSDBSCAN [4] and KNN; the experimental results confirm that its accuracy is close to that of the KNN classification algorithm.
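The label-expansion idea described above can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the function name `expand_labels`, the single expansion round, and the rule that a contested point takes the label of its closest seed are all assumptions made for the example.

```python
import math

def expand_labels(points, labels, k):
    """One round of label expansion (illustrative sketch).

    Each labeled seed passes its label to its k nearest unlabeled neighbors.
    Assumption: if two seeds claim the same point, the closer seed wins.
    """
    best = {}  # unlabeled index -> (distance to claiming seed, label)
    unlabeled = [i for i in range(len(points)) if i not in labels]
    for seed, label in labels.items():
        # Rank the unlabeled points by Euclidean distance to this seed.
        ranked = sorted(unlabeled, key=lambda i: math.dist(points[seed], points[i]))
        for i in ranked[:k]:
            d = math.dist(points[seed], points[i])
            if i not in best or d < best[i][0]:
                best[i] = (d, label)
    expanded = dict(labels)
    expanded.update({i: label for i, (_, label) in best.items()})
    return expanded

# Toy data: two well-separated groups with one labeled seed each.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.2), (5.0, 5.0), (5.1, 5.0), (4.9, 5.2)]
seeds = {0: "a", 3: "b"}
result = expand_labels(points, seeds, k=2)
print(result)
```

With k as the only parameter, a small seed set grows into a larger labeled set that a subsequent clustering or classification step could use.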


2019 ◽  
Vol 16 (12) ◽  
pp. 1348-1353
Author(s):  
Huanhuan Qu ◽  
Baixue Li ◽  
Jingyi Yang ◽  
Huaiwen Liang ◽  
Meixia Li ◽  
...  

Background: Disaccharide core 1 (Galβ1-3GalNAc) is a common O-glycan structure in nature. Biochemical studies have confirmed that the formation of the core 1 structure is an important initial step in O-glycan biosynthesis and that it is of great importance for the human body. Objective: Our study will provide meaningful and useful insights into O-glycan synthesis and bioassay. All the synthetic glycosides can be used as intermediate building blocks in the scheme developed for oligosaccharide construction. Methods: In this article, we first used chemical procedures to prepare core 1 and its derivative, and a novel disaccharide was efficiently synthesized. The structures of the synthesized compounds were elucidated and confirmed by 1H NMR, 13C NMR and MS. We then employed three human gut symbionts belonging to Bacteroidetes, a predominant phylum in the distal gut, as models to study the bioactivity of core 1 and its derivative on human gut microbiota. Results: According to our results, both core 1 and its derivative supported the growth of B. fragilis, the derivative especially so, but failed to support the growth of B. thetaiotaomicron and B. ovatus. Conclusion: This suggests that B. fragilis might have a specific glycohydrolase that cleaves the glycosidic bond to acquire the monosaccharide.


Author(s):  
Hazel Gray

This chapter sets out the analytical framework of political settlements and elaborates the framework to account for the socialist experiences of Tanzania and Vietnam in the 1960s and 1970s. A political settlement, as defined by Mushtaq Khan, is a combination of power and institutions that is mutually compatible and also sustainable in terms of economic and political viability. The chapter clarifies the core building blocks of the approach and sets out the main differences between political settlements and new institutional economics. The chapter then defines a socialist political settlement where productive rights are formally held by the collective and formal institutions protect common and collectively owned assets. The attempts to construct a socialist political settlement left important institutional, political, and economic legacies. These shaped incentives and constraints which influenced a number of critical processes at the heart of economic development—related to technological learning, accumulation for investment, and political stabilization.


2011 ◽  
Vol 291-294 ◽  
pp. 344-348
Author(s):  
Lin Lin ◽  
Shu Yan ◽  
Yi Nian

The hierarchical topology of wireless sensor networks can effectively reduce the energy consumed in communication. Clustering algorithms are the foundation for realizing a hierarchical structure, so they have been extensively researched. Building on the LEACH algorithm, a distance density based clustering algorithm (DDBC) is proposed, which jointly considers the distribution density of surrounding nodes and the remaining energy of each node to dynamically balance energy usage when selecting cluster heads. We analyzed the performance of DDBC by comparing it with other existing clustering algorithms in simulation experiments. Results show that the proposed method generates a stable number of cluster heads and balances the energy load effectively.
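The abstract does not give the DDBC selection rule, so the following is only a hypothetical sketch of how neighbor density and remaining energy might be combined when ranking cluster-head candidates. The score formula, the weight `w_density`, the neighborhood `radius`, and the function names are all assumptions introduced for illustration.

```python
import math

def head_scores(positions, energy, initial_energy, radius, w_density=0.5):
    """Hypothetical score: weighted mix of normalized neighbor count
    and residual-energy fraction (not the published DDBC formula)."""
    n = len(positions)
    # Local density: number of other nodes within `radius` of each node.
    counts = [
        sum(1 for j in range(n)
            if j != i and math.dist(positions[i], positions[j]) <= radius)
        for i in range(n)
    ]
    max_count = max(counts) or 1  # avoid division by zero when all nodes are isolated
    return [
        w_density * counts[i] / max_count
        + (1 - w_density) * energy[i] / initial_energy
        for i in range(n)
    ]

def select_heads(scores, num_heads):
    """Pick the indices of the highest-scoring nodes as cluster heads."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:num_heads]

# Toy network: four nodes close together and one isolated node.
positions = [(0, 0), (1, 0), (0, 1), (1, 1), (10, 10)]
energy = [0.9, 0.2, 0.8, 0.5, 1.0]
scores = head_scores(positions, energy, initial_energy=1.0, radius=1.5)
heads = select_heads(scores, num_heads=2)
print(heads)
```

In this toy run the isolated node has the most energy but no neighbors, so it is not chosen; nodes in the dense region with high remaining energy rank first, which is the kind of trade-off the abstract describes.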


2021 ◽  
Vol 25 (6) ◽  
pp. 1453-1471
Author(s):  
Chunhua Tang ◽  
Han Wang ◽  
Zhiwen Wang ◽  
Xiangkun Zeng ◽  
Huaran Yan ◽  
...  

Most density-based clustering algorithms suffer from difficult parameter setting, high time complexity, poor noise recognition, and weak clustering on datasets with uneven density. To solve these problems, this paper proposes the FOP-OPTICS algorithm (Finding of the Ordering Peaks Based on OPTICS), which is a substantial improvement of OPTICS (Ordering Points To Identify the Clustering Structure). The proposed algorithm finds the demarcation point (DP) in the Augmented Cluster-Ordering generated by OPTICS and uses the reachability-distance of the DP as the neighborhood radius eps of its corresponding cluster. It thereby overcomes the weakness of most algorithms in clustering datasets with uneven densities. By computing the distance of the k-nearest neighbor of each point, it reduces the time complexity of OPTICS; by calculating density-mutation points within the clusters, it can efficiently recognize noise. The experimental results show that FOP-OPTICS has the lowest time complexity and outperforms other algorithms in parameter setting and noise recognition.
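The k-nearest-neighbor distance mentioned above can be computed as in the sketch below. This brute-force version only illustrates the quantity itself; it is not the FOP-OPTICS procedure and makes no attempt at the efficiency the paper reports. The function name `k_distance` and the convention of excluding the point itself from its neighbors are assumptions.

```python
import math

def k_distance(points, k):
    """Distance from each point to its k-th nearest other point.

    Brute force for clarity; an index structure such as a k-d tree
    would be the usual choice for large datasets.
    """
    result = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        result.append(dists[k - 1])
    return result

# Three points in a dense run and one far-away point.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (10.0, 0.0)]
kd = k_distance(points, k=2)
print(kd)
```

A point whose k-distance is much larger than that of the others, like the last point here, lies in a sparse region, which is why this quantity is useful for separating dense clusters from noise.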


2021 ◽  
Vol 108 (1) ◽  
pp. 25-33
Author(s):  
Matthew Clauhs ◽  
Bryan Powell

The National Coalition for Core Arts Standards released standards for music education in 2014. These standards are guided by artistic processes and measured by performance standards specific to content areas and grade levels. As school districts in the United States adopt the Core Arts Standards for their music programs, it is imperative that modern band teachers demonstrate how their curriculum aligns with this new framework. Modern band is one approach to popular music education that is particularly well suited to address this new framework; the emphases of songwriting, improvising, critical listening, and group work in a learner-centered modern band class/ensemble are associated with a wide variety of standards. This article explores connections between popular music pedagogies and each of the processes in the Core Arts Standards and examines which standards may be most appropriate for modern band contexts.

