Classification of Semantic Paraphasias: Optimization of a Word Embedding Model

2019
Author(s): Katy McKinney-Bock, Steven Bedrick
2018, Vol 6 (3), pp. 67-78
Author(s): Tian Nie, Yi Ding, Chen Zhao, Youchao Lin, Takehito Utsuro

This article addresses the question of how to give an overview of the knowledge associated with a given query keyword, focusing on the concerns of users who search for web pages with that keyword. The Web search information needs of a given query keyword are collected through search engine suggests. For each query keyword, the authors collect up to around 1,000 suggests, many of which are redundant, and classify the redundant search engine suggests with a topic model. One limitation of topic-model-based classification, however, is that the granularity of the topics, i.e., the clusters of search engine suggests, is too coarse. To overcome this coarse-grained classification, the article further applies a word embedding technique to the web pages used during the training of the topic model, in addition to the text of the whole Japanese version of Wikipedia. The authors then examine the word-embedding-based similarity between search engine suggests, and classify the suggests within a single topic into finer-grained subtopics based on that similarity. Evaluation results show that the proposed approach performs well on the task of subtopic classification of search engine suggests.
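The subtopic step can be sketched as grouping suggests within one topic by the cosine similarity of their embedding vectors. A minimal illustration, using toy 2-d vectors in place of embeddings trained on the crawled web pages and Japanese Wikipedia; the greedy single-seed grouping and the 0.8 threshold are illustrative assumptions, not the paper's exact procedure:

```python
import math

def cosine(u, v):
    # Cosine similarity between two embedding vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def subtopics(suggests, vectors, threshold=0.8):
    # Greedy grouping: attach each suggest to the first subtopic whose
    # seed vector is similar enough, otherwise start a new subtopic.
    groups = []  # list of (seed_vector, member_suggests)
    for s, v in zip(suggests, vectors):
        for seed, members in groups:
            if cosine(seed, v) >= threshold:
                members.append(s)
                break
        else:
            groups.append((v, [s]))
    return [members for _, members in groups]

# Toy suggests with hand-made vectors: "price" and "cost" point the
# same way, "review" is orthogonal, so two subtopics emerge.
print(subtopics(["price", "cost", "review"],
                [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]))
# → [['price', 'cost'], ['review']]
```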


Sentiment classification is one of the best-known and most popular tasks in machine learning and natural language processing: an algorithm is developed to understand the opinion expressed about an entity, much as a human reader would. This article presents such a study. Concepts from natural language processing are used for text representation, and a novel word embedding model is then proposed for effective classification of the data. TF-IDF and the common bag-of-words (BoW) representation models are considered for representing the text data, and the importance of these models is discussed in the respective sections. The proposed model is tested on the IMDB dataset, using a 50% training / 50% testing split with three random shufflings of the data for evaluation.
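The two baseline representations mentioned above can be sketched in a few lines. A minimal pure-Python illustration on toy documents (standing in for the IMDB reviews), using raw term counts for BoW and the common tf × (log(N/df) + 1) weighting for TF-IDF; real systems vary in their smoothing and normalization choices:

```python
import math
from collections import Counter

def bow(doc, vocab):
    # Bag-of-words: raw count of each vocabulary term in the document.
    counts = Counter(doc.split())
    return [counts[w] for w in vocab]

def tfidf(docs):
    # TF-IDF over a small corpus: term frequency times inverse
    # document frequency (the +1 keeps ubiquitous terms nonzero).
    tokenized = [d.split() for d in docs]
    vocab = sorted({w for toks in tokenized for w in toks})
    n = len(docs)
    df = {w: sum(1 for toks in tokenized if w in toks) for w in vocab}
    idf = {w: math.log(n / df[w]) + 1.0 for w in vocab}
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        vectors.append([counts[w] / len(toks) * idf[w] for w in vocab])
    return vocab, vectors

docs = ["good movie", "bad movie", "good plot"]
vocab, vectors = tfidf(docs)
print(vocab)                 # → ['bad', 'good', 'movie', 'plot']
print(bow(docs[0], vocab))   # → [0, 1, 1, 0]
```

Note how "movie", appearing in two of the three documents, gets a lower IDF weight than the rarer "plot"; that down-weighting of common terms is the point of TF-IDF over plain BoW.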


2020
Author(s): Luiz Fernando Spillere de Souza, Alexandre Leopoldo Gonçalves

Text classification aims to extract knowledge from unstructured text. Word embedding is a representation technique that allows words with similar meanings to have similar representations, so that characteristics of a word's use and meaning are captured in the representation itself. The aim of this article is to analyze previously published work on the use of word embeddings applied to text classification and to propose a practical application that demonstrates their effectiveness. The study supports the effectiveness of word embeddings for text classification, reaching an accuracy of around 73%.
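A common way to turn word embeddings into a text classifier, in the spirit of the approach above, is to average the word vectors of a document and classify the result. A minimal sketch with toy 2-d vectors standing in for trained embeddings (e.g., word2vec or GloVe) and a nearest-centroid classifier; the article's actual embeddings and classifier are not specified here, so every name below is illustrative:

```python
import math

# Toy 2-d embeddings standing in for trained word vectors: similar
# words (hypothetically) sit close together in the space.
EMB = {"great": (0.9, 0.1), "excellent": (0.8, 0.2),
       "awful": (0.1, 0.9), "terrible": (0.2, 0.8)}

def doc_vector(text):
    # Represent a document as the average of its word embeddings.
    vecs = [EMB[w] for w in text.split() if w in EMB]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(c) / len(vecs) for c in zip(*vecs))

def train_centroids(docs, labels):
    # One centroid per class: the mean of its documents' vectors.
    by_label = {}
    for d, y in zip(docs, labels):
        by_label.setdefault(y, []).append(doc_vector(d))
    return {y: tuple(sum(c) / len(vs) for c in zip(*vs))
            for y, vs in by_label.items()}

def predict(centroids, text):
    # Assign the class whose centroid is nearest to the document vector.
    v = doc_vector(text)
    return min(centroids, key=lambda y: math.dist(v, centroids[y]))

centroids = train_centroids(["great excellent", "awful terrible"],
                            ["pos", "neg"])
print(predict(centroids, "excellent great"))  # → pos
```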

