scholarly journals MultiKOC: Multi-One-Class Classifier Based K-Means Clustering

Algorithms ◽  
2021 ◽  
Vol 14 (5) ◽  
pp. 134
Author(s):  
Loai Abdallah ◽  
Murad Badarna ◽  
Waleed Khalifa ◽  
Malik Yousef

In the computational biology community there are many biological cases that are considered as multi-one-class classification problems. Examples include the classification of multiple tumor types, protein fold recognition and the molecular classification of multiple cancer types. In all of these cases the real world appropriately characterized negative cases or outliers are impractical to achieve and the positive cases might consist of different clusters, which in turn might lead to accuracy degradation. In this paper we present a novel algorithm named MultiKOC multi-one-class classifiers based K-means to deal with this problem. The main idea is to execute a clustering algorithm over the positive samples to capture the hidden subdata of the given positive data, and then building up a one-class classifier for every cluster member’s examples separately: in other word, train the OC classifier on each piece of subdata. For a given new sample, the generated classifiers are applied. If it is rejected by all of those classifiers, the given sample is considered as a negative sample, otherwise it is a positive sample. The results of MultiKOC are compared with the traditional one-class, multi-one-class, ensemble one-classes and two-class methods, yielding a significant improvement over the one-class and like the two-class performance.

Blood ◽  
2012 ◽  
Vol 119 (19) ◽  
pp. 4467-4475 ◽  
Author(s):  
Andreas Agathangelidis ◽  
Nikos Darzentas ◽  
Anastasia Hadzidimitriou ◽  
Xavier Brochet ◽  
Fiona Murray ◽  
...  

Abstract Mounting evidence indicates that grouping of chronic lymphocytic leukemia (CLL) into distinct subsets with stereotyped BCRs is functionally and prognostically relevant. However, several issues need revisiting, including the criteria for identification of BCR stereotypy and its actual frequency as well as the identification of “CLL-biased” features in BCR Ig stereotypes. To this end, we examined 7596 Ig VH (IGHV-IGHD-IGHJ) sequences from 7424 CLL patients, 3 times the size of the largest published series, with an updated version of our purpose-built clustering algorithm. We document that CLL may be subdivided into 2 distinct categories: one with stereotyped and the other with nonstereotyped BCRs, at an approximate ratio of 1:2, and provide evidence suggesting a different ontogeny for these 2 categories. We also show that subset-defining sequence patterns in CLL differ from those underlying BCR stereotypy in other B-cell malignancies. Notably, 19 major subsets contained from 20 to 213 sequences each, collectively accounting for 943 sequences or one-eighth of the cohort. Hence, this compartmentalized examination of VH sequences may pave the way toward a molecular classification of CLL with implications for targeted therapeutic interventions, applicable to a significant number of patients assigned to the same subset.


2001 ◽  
Vol 17 (Suppl 1) ◽  
pp. S316-S322 ◽  
Author(s):  
C.-H. Yeang ◽  
S. Ramaswamy ◽  
P. Tamayo ◽  
S. Mukherjee ◽  
R. M. Rifkin ◽  
...  

Author(s):  
Dan Luo ◽  
Xili Wang

Background: Semi-supervised learning in the machine learning community has received widespread attention. Semi-supervised learning can use a small number of tagged samples and a large number of untagged samples for efficient learning. Methods: In 2014, Kim proposed a new semi-supervised learning method: the minimax label propagation (MMLP) method. This method reduces time complexity to O (n), with a smaller computation cost and stronger classification ability than traditional methods. However, classification results are not accurate in large-scale image classifications. Thus, in this paper, we propose a semisupervised image classification method, which is an MMLP-based algorithm. The main idea is threefold: (1) Improving connectivity of image pixels by pixel sampling to reduce the image size, at the same time, reduce the diversity of image characteristics; (2) Using a recall feature to improve the MMLP algorithm; (3) through classification mapping, gaining the classification of the original data from the classification of the data reduction. Results: In the end, our algorithm also gains a minimax path from untagged samples to tagged samples. The experimental results proved that this algorithm is applicable to semi-supervised learning on small-size and that it can also gain better classification results for large-size image at the same time. Conclusion: In our paper, considering the connectivity of the neighboring matrix and the diversity of the characteristics, we used meanshift clustering algorithm, next we will use fuzzy energy clustering on our algorithm. We will study the function of these paths.


2020 ◽  
Vol 4 (2) ◽  
pp. 780-787
Author(s):  
Ibrahim Hassan Hayatu ◽  
Abdullahi Mohammed ◽  
Barroon Ahmad Isma’eel ◽  
Sahabi Yusuf Ali

Soil fertility determines a plant's development process that guarantees food sufficiency and the security of lives and properties through bumper harvests. The fertility of soil varies according to regions, thereby determining the type of crops to be planted. However, there is no repository or any source of information about the fertility of the soil in any region in Nigeria especially the Northwest of the country. The only available information is soil samples with their attributes which gives little or no information to the average farmer. This has affected crop yield in all the regions, more particularly the Northwest region, thus resulting in lower food production.  Therefore, this study is aimed at classifying soil data based on their fertility in the Northwest region of Nigeria using R programming. Data were obtained from the department of soil science from Ahmadu Bello University, Zaria. The data contain 400 soil samples containing 13 attributes. The relationship between soil attributes was observed based on the data. K-means clustering algorithm was employed in analyzing soil fertility clusters. Four clusters were identified with cluster 1 having the highest fertility, followed by 2 and the fertility decreases with an increasing number of clusters. The identification of the most fertile clusters will guide farmers on where best to concentrate on when planting their crops in order to improve productivity and crop yield.


Author(s):  
Antonio Pico ◽  
Laura Sanchez-Tejada ◽  
Ruth Sanchez-Ortiga ◽  
Rosa Camara ◽  
Cristina Lamas ◽  
...  

2018 ◽  
Vol 79 (8) ◽  
pp. 35-40
Author(s):  
M. I. Kuznetsova

One of the goals of the Russian language course in the primary school is the formation of the communicative literacy. The content of the course should be aimed at understanding the wealth of linguistic means by primary school children; the formation of the ability to detect a violation of linguistic norms and the inadequacy of the linguistic means used in the speech situation; the accumulation of the experience in choosing of linguistic means in accordance with the peculiarities of the speech situation; the creation of oral and written texts that meet the criteria of content, connectivity, compliance with the norms of the Russian literary language. The article considers the classification of exercises that contribute to the formation of communicative literacy. The author gives the examples of exercises where the student acts in different roles: the student is an observer of the speech situation and analyzes the adequacy of the choice of linguistic means; the student is a direct participant in the given speech situation and makes a choice of language facilities; the student is offered to create the speech situation himself, to independently construct an oral and written text.


1975 ◽  
Vol 14 (1) ◽  
pp. 149-152
Author(s):  
M.A. Behzad

Development Financing under Constraints, as the author himself puts it, is 'aimed to recapitulate the spirit in which the African Development Bank was founded, describe how it later functioned and why it functioned the way it did'. The study is an excellent attempt to highlight economic cooperation and integ¬ration and to discuss its rationale in view of the given constraints. The main idea behind the establishment of an institution, like the African Develop¬ment Bank (ADB), was necessarily an 'all-African Investment Bank' to promote development projects. The newly independent nations of Africa, lacking as they are in the basic infrastructure, are beset with difficulties in surviving as economically viable units. As such, the need for a pooling of resources and for technical know-how is particularly imperative


Sign in / Sign up

Export Citation Format

Share Document