Dimensionality Reduction by Combining Category Information and Latent Semantic Index for Text Categorization

Text categorization (TC) is a data mining technique for automatically classifying documents to one or more predefined categories. This paper will introduce the principles of TC, discuss common TC methods and steps, give an overview of the various types of TC systems, and discuss future trends. TC systems begin with a group of known categories and a set of training documents already assigned to a category, usually by a human expert. Depending on the system, the documents may undergo a process called dimensionality reduction, which reduces the number of words or features that the classifier evaluates during the learning process. The system then analyzes the documents and “learns” which words or features of each document caused it to be classified into a particular category. This is known as supervised learning, because it is based on human knowledge of the categories and their criteria. The learning process results in a classifier which can apply the rules it learned during the training phase to additional documents.

Download Full-text

Raising High-Degree Overlapped Character Bigrams into Trigrams for Dimensionality Reduction in Chinese Text Categorization

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-540-24630-5_72 ◽

2004 ◽

pp. 584-595

Author(s):

Dejun Xue ◽

Maosong Sun

Keyword(s):

Dimensionality Reduction ◽

Chinese Text ◽

Text Categorization ◽

High Degree

Download Full-text

Dimensionality Reduction by Semantic Mapping in Text Categorization

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30499-9_160 ◽

2004 ◽

pp. 1032-1037 ◽

Cited By ~ 2

Author(s):

Renato Fernandes Corrêa ◽

Teresa Bernarda Ludermir

Keyword(s):

Dimensionality Reduction ◽

Text Categorization ◽

Semantic Mapping

Download Full-text

The automatic detection of unattended social category information from unfamiliar faces

PsycEXTRA Dataset ◽

10.1037/e514472015-752 ◽

2014 ◽

Author(s):

Douglas Martin ◽

Rachel Swainson ◽

Gillian Slessor ◽

Jacqui Hutchison ◽

Diana Marosi

Keyword(s):

Automatic Detection ◽

Social Category ◽

Category Information ◽

Unfamiliar Faces

Download Full-text

Analysis on Research Paper Publication Recommendation System with Composition of Papers and Conferences Matrices

International Journal of Scientific Research in Science Engineering and Technology ◽

10.32628/ijsrset207330 ◽

2020 ◽

pp. 120-127

Author(s):

Htay Htay Win ◽

Aye Thida Myint ◽

Mi Cho Cho

Keyword(s):

Social Network ◽

Dimensionality Reduction ◽

Correspondence Analysis ◽

Recommendation System ◽

Research Paper ◽

Topic Modelling ◽

Research Papers ◽

The Right ◽

Publication Recommendation

For years, achievements and discoveries made by researcher are made aware through research papers published in appropriate journals or conferences. Many a time, established s researcher and mainly new user are caught up in the predicament of choosing an appropriate conference to get their work all the time. Every scienti?c conference and journal is inclined towards a particular ?eld of research and there is a extensive group of them for any particular ?eld. Choosing an appropriate venue is needed as it helps in reaching out to the right listener and also to further one’s chance of getting their paper published. In this work, we address the problem of recommending appropriate conferences to the authors to increase their chances of receipt. We present three di?erent approaches for the same involving the use of social network of the authors and the content of the paper in the settings of dimensionality reduction and topic modelling. In all these approaches, we apply Correspondence Analysis (CA) to obtain appropriate relationships between the entities in question, such as conferences and papers. Our models show hopeful results when compared with existing methods such as content-based ?ltering, collaborative ?ltering and hybrid ?ltering.

Download Full-text

Speech Emotion Recognition Based on Sparse Representation

Archives of Acoustics ◽

10.2478/aoa-2013-0055 ◽

2013 ◽

Vol 38 (4) ◽

pp. 465-470 ◽

Cited By ~ 11

Author(s):

Jingjie Yan ◽

Xiaolan Wang ◽

Weiyi Gu ◽

LiLi Ma

Keyword(s):

Dimensionality Reduction ◽

Emotion Recognition ◽

Least Squares ◽

Partial Least Squares ◽

Partial Least Squares Regression ◽

Speech Emotion Recognition ◽

Least Squares Regression ◽

Computer Science Pedagogy ◽

Reduction Methods ◽

Analysis Computer

Abstract Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.

Download Full-text