gene shaving
Recently Published Documents


TOTAL DOCUMENTS

10
(FIVE YEARS 1)

H-INDEX

2
(FIVE YEARS 0)

PLoS ONE ◽  
2019 ◽  
Vol 14 (5) ◽  
pp. e0217027
Author(s):  
Md. Ashad Alam ◽  
Mohammd Shahjaman ◽  
Md. Ferdush Rahman ◽  
Fokhrul Hossain ◽  
Hong-Wen Deng

2016 ◽  
Vol 3 (2) ◽  
pp. 129-138
Author(s):  
Betha Nur Sari

In Indonesia, tuberculosis remains one of the major health problems unresolved. Indonesia is second ranked in the world as the country with the most tuberculosis cases. The purpose of this research is to study how K-means clustering applied to the treatment of tuberculosis patients data in order to identify the characteristics of tuberculosis patients. The results of K-means clustering validated by gene shaving and silhoutte coefficient. The experiment results indicate the optimum clusters value obtained from the K-mean clustering that has been validated by gene shaving and silhouette coefficient. K-means clustering divided four groups of tuberculosis patients based on their characteristics. There were divided at a category of disease (pulmonary TB, Extra Pulmonary TB and both), the age of the patient and the results of treatment of tuberculosis.


2013 ◽  
Vol 8-9 ◽  
pp. 508-515
Author(s):  
Raul Malutan ◽  
Pedro Gómez Vilda ◽  
Monica Borda

Data classification has an important role in analyzing high dimensional data. In this paper Gene Shaving algorithm was used for a previous supervised classification and once the cluster information was obtained, data was classified again with supervised algorithms like Support Vector Machine (SVM) and k-Nearest Neighbor (k-NN) for an optimal clustering. These algorithms have proven to be useful when the classes of the training data and the attributes of each class are well established. The algorithms were run on several data sets, observing that the quality of the obtained clusters is dependent on the number of clusters specified.


Author(s):  
Bradley M. Broom ◽  
Erik P. Sulman ◽  
Kim Anh Do ◽  
Mary E. Edgerton ◽  
Kenneth D. Aldape

2007 ◽  
Vol 5 ◽  
pp. 117693510700500
Author(s):  
K-A. Do ◽  
G.J. McLachlan ◽  
R. Bean ◽  
S. Wen

Researchers are frequently faced with the analysis of microarray data of a relatively large number of genes using a small number of tissue samples. We examine the application of two statistical methods for clustering such microarray expression data: EMMIX-GENE and GeneClust. EMMIX-GENE is a mixture-model based clustering approach, designed primarily to cluster tissue samples on the basis of the genes. GeneClust is an implementation of the gene shaving methodology, motivated by research to identify distinct sets of genes for which variation in expression could be related to a biological property of the tissue samples. We illustrate the use of these two methods in the analysis of Affymetrix oligonucleotide arrays of well-known data sets from colon tissue samples with and without tumors, and of tumor tissue samples from patients with leukemia. Although the two approaches have been developed from different perspectives, the results demonstrate a clear correspondence between gene clusters produced by GeneClust and EMMIX-GENE for the colon tissue data. It is demonstrated, for the case of ribosomal proteins and smooth muscle genes in the colon data set, that both methods can classify genes into co-regulated families. It is further demonstrated that tissue types (tumor and normal) can be separated on the basis of subtle distributed patterns of genes. Application to the leukemia tissue data produces a division of tissues corresponding closely to the external classification, acute myeloid meukemia (AML) and acute lymphoblastic leukemia (ALL), for both methods. In addition, we also identify genes specific for the subgroup of ALL-Tcell samples. Overall, we find that the gene shaving method produces gene clusters at great speed; allows variable cluster sizes and can incorporate partial or full supervision; and finds clusters of genes in which the gene expression varies greatly over the tissue samples while maintaining a high level of coherence between the gene expression profiles. The intent of the EMMIX-GENE method is to cluster the tissue samples. It performs a filtering step that results in a subset of relevant genes, followed by gene clustering, and then tissue clustering, and is favorable in its accuracy of ranking the clusters produced.


Sign in / Sign up

Export Citation Format

Share Document