Semi-supervised clustering techniques for categorization of text documents

At present, a huge quantity of data is recorded in a variety of forms such as text, image, video, and audio, and this quantity is expected to grow. The major text-related tasks, such as entity extraction, information extraction, entity relation modeling, and document summarization, are performed using text mining. This paper focuses on document clustering, a subtask of text mining, and measures the performance of different clustering techniques. It applies an enhanced feature selection method to the clustering of text documents, aiming to show that it produces better results than traditional feature selection.

Pseudo-Supervised Clustering for Text Documents

IEEE/WIC/ACM International Conference on Web Intelligence (WI'04) ◽ 10.1109/wi.2004.10138 ◽ 2005

Author(s): M. Maggini ◽ L. Rigutini ◽ M. Turchi

Keyword(s): Text Documents ◽ Supervised Clustering

Supervised Regression Clustering

International Journal of Business Analytics ◽ 10.4018/ijban.2016100102 ◽ 2016 ◽ Vol 3 (4) ◽ pp. 21-40 ◽ Cited By ~ 1

Author(s): Ali Fallah Tehrani ◽ Diane Ahrens

Keyword(s): Supervised Learning ◽ Data Analytics ◽ Apparel Industry ◽ Clustering Methods ◽ Specific Behavior ◽ Clustering Techniques ◽ Real Dataset ◽ Supervised Clustering ◽ Fashion Products

Clustering techniques typically group similar instances based on their individual attributes, on the assumption that similar instances have similar attribute characteristics. In contrast, clustering similar instances with respect to a specific behavior is framed as a supervised learning problem; for instance, identifying which fashion products behave similarly in terms of sales. Conventional clustering methods cannot handle this case, because they do not consider any response and they treat all attributes as equally important. Clustering instances with respect to responses, however, leads to better data analytics. In this research, the authors introduce an approach to supervised clustering and show its advantage in terms of data analytics as well as prediction. To verify the feasibility and performance of this approach, the authors conducted several experiments on a real dataset from the apparel industry.

Clustering techniques for thyroid nodules malignancy inference in the era of personalized medicine

Endocrine Abstracts ◽ 10.1530/endoabs.70.ep445 ◽ 2020

Author(s): Andrea Giani ◽ de Souza Patricia Borges ◽ Stefania Bartoletti ◽ Flavio Morselli ◽ Andrea Conti ◽ ...

Keyword(s): Personalized Medicine ◽ Thyroid Nodules ◽ Clustering Techniques

A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis

International Journal of Computer Sciences and Engineering ◽ 10.26438/ijcse/v5i12.101108 ◽ 2017 ◽ Vol 5 (12) ◽ pp. 101-108

Author(s): K. Chitra ◽ D. Maheswari

Keyword(s): Data Structure ◽ Data Stream ◽ Multidimensional Data ◽ Clustering Techniques ◽ Survival Study ◽ Data Stream Analysis

A State of Art Approaches on Energy Efficient Clustering Techniques in WSN

International Journal of Computer Sciences and Engineering ◽ 10.26438/ijcse/v7i3.5054 ◽ 2019 ◽ Vol 7 (3) ◽ pp. 50-54

Author(s): N. Thilagavathi ◽ Christy Wood ◽ V. Hemalakshumi ◽ V. Mathumiithaa

Keyword(s): Energy Efficient ◽ Clustering Techniques ◽ Energy Efficient Clustering ◽ State Of Art

Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents

International Journal of Computer Sciences and Engineering ◽ 10.26438/ijcse/v7i6.314318 ◽ 2019 ◽ Vol 7 (6) ◽ pp. 314-318

Author(s): Shaina . ◽ Naresh Kumar

Keyword(s): Text Documents

Examination of Clustering Techniques using Genetic Algorithm

International Journal of Computer Sciences and Engineering ◽ 10.26438/ijcse/v6i4.374378 ◽ 2018 ◽ Vol 6 (4) ◽ pp. 374-378

Author(s): S. Ramya ◽ N. Subha

Keyword(s): Genetic Algorithm ◽ Clustering Techniques

Systematic Defect Identification through Layout Snippet Clustering

ISTFA 2010: Conference Proceedings from the 36th International Symposium for Testing and Failure Analysis ◽ 10.31399/asm.cp.istfa2010p0320 ◽ 2010

Author(s): Wing Chiu Tam ◽ Osei Poku ◽ R. D. (Shawn) Blanton

Keyword(s): Design Process ◽ Integrated Circuit ◽ Yield Loss ◽ Defect Identification ◽ Clustering Techniques ◽ Dominant Component

Systematic defects due to design-process interactions are a dominant component of integrated circuit (IC) yield loss in nano-scaled technologies. Test structures do not adequately represent the product in terms of feature diversity and feature volume, and are therefore unable to identify all the systematic defects that affect the product. This paper describes a method that uses diagnosis to identify layout features that do not yield as expected. Specifically, clustering techniques are applied to layout snippets of diagnosis-implicated regions from (ideally) a statistically significant number of IC failures to identify feature commonalities. Experiments involving an industrial chip demonstrate the identification of possible systematic yield loss due to lithographic hotspots.

Semi-supervised clustering techniques for categorization of text documents

Comparative study of clustering techniques for short text documents

Research of Clustering Algorithms using Enhanced Feature Selection

Pseudo-Supervised Clustering for Text Documents

Supervised Regression Clustering

Clustering techniques for thyroid nodules malignancy inference in the era of personalized medicine

A Survival Study on Data Structure Based Clustering Techniques for Multidimensional Data Stream Analysis

A State of Art Approaches on Energy Efficient Clustering Techniques in WSN

Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents

Examination of Clustering Techniques using Genetic Algorithm

Systematic Defect Identification through Layout Snippet Clustering