Correlation Analysis and Text Classification of Chemical Accident Cases Based on Word Embedding

UTILIZAÇÃO PRÁTICA DE WORD EMBEDDING APLICADA À CLASSIFICAÇÃO DE TEXTO

10.48090/ciki.v1i1.899 ◽

2020 ◽

Author(s):

Luiz Fernando Spillere de Souza ◽

Alexandre Leopoldo Gonçalves

Keyword(s):

Text Classification ◽

Word Embedding ◽

Practical Application ◽

Accuracy Rate ◽

Unstructured Text ◽

Representation Technique ◽

Concept Of Word

Text classification aims to extract knowledge from unstructured text patterns. The concept of word incorporation is a representation technique that allows words with similar meanings to have a similar representation, in order to incorporate reasoning characteristics about their use and meaning. The aim of this article is to analyze the work already published on the use of embedded words applied to the classification of texts, to propose a practical application that demonstrates its effectiveness. This study contributes to proving the effectiveness of the use of word incorporation applied to text classification, having reached an accuracy rate of around 73%.

Download Full-text

A Brief Survey on Text Classification Using Various Machine Learning Techniques

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v8i1.521 ◽

2018 ◽

Vol 8 (1) ◽

pp. 14

Author(s):

Padmavathi .S ◽

M. Chidambaram

Keyword(s):

Machine Learning ◽

Text Classification ◽

Fixed Number ◽

Machine Learning Techniques ◽

Online Information ◽

Rule Based ◽

Learning Techniques ◽

Machine Learning Approach ◽

Rule Based Approach

Text classification has grown into more significant in managing and organizing the text data due to tremendous growth of online information. It does classification of documents in to fixed number of predefined categories. Rule based approach and Machine learning approach are the two ways of text classification. In rule based approach, classification of documents is done based on manually defined rules. In Machine learning based approach, classification rules or classifier are defined automatically using example documents. It has higher recall and quick process. This paper shows an investigation on text classification utilizing different machine learning techniques.

Download Full-text

Chemometric Analysis for the Classification of some Groups of Drugs with Divergent Pharmacological Activity on the Basis of some Chromatographic and Molecular Modeling Parameters

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207321666180129102149 ◽

2018 ◽

Vol 21 (2) ◽

pp. 125-137

Author(s):

Jolanta Stasiak ◽

Marcin Koba ◽

Marcin Gackowski ◽

Tomasz Baczek

Keyword(s):

Correlation Analysis ◽

Pharmacological Activity ◽

Correlation Coefficients ◽

Principal Component ◽

Cardiovascular Drugs ◽

New Drugs ◽

Analgesic Drugs ◽

Starting Point ◽

Chromatographic Parameters

Aim and Objective: In this study, chemometric methods as correlation analysis, cluster analysis (CA), principal component analysis (PCA), and factor analysis (FA) have been used to reduce the number of chromatographic parameters (logk/logkw) and various (e.g., 0D, 1D, 2D, 3D) structural descriptors for three different groups of drugs, such as 12 analgesic drugs, 11 cardiovascular drugs and 36 “other” compounds and especially to choose the most important data of them. Material and Methods: All chemometric analyses have been carried out, graphically presented and also discussed for each group of drugs. At first, compounds’ structural and chromatographic parameters were correlated. The best results of correlation analysis were as follows: correlation coefficients like R = 0.93, R = 0.88, R = 0.91 for cardiac medications, analgesic drugs, and 36 “other” compounds, respectively. Next, part of molecular and HPLC experimental data from each group of drugs were submitted to FA/PCA and CA techniques. Results: Almost all results obtained by FA or PCA, and total data variance, from all analyzed parameters (experimental and calculated) were explained by first two/three factors: 84.28%, 76.38 %, 69.71% for cardiovascular drugs, for analgesic drugs and for 36 “other” compounds, respectively. Compounds clustering by CA method had similar characteristic as those obtained by FA/PCA. In our paper, statistical classification of mentioned drugs performed has been widely characterized and discussed in case of their molecular structure and pharmacological activity. Conclusion: Proposed QSAR strategy of reduced number of parameters could be useful starting point for further statistical analysis as well as support for designing new drugs and predicting their possible activity.

Download Full-text

Detection and Prevention of Spam Mail with Semantics-based text classification of Collaborative and Content Filtering

Journal of Physics Conference Series ◽

10.1088/1742-6596/1770/1/012031 ◽

2021 ◽

Vol 1770 (1) ◽

pp. 012031

Author(s):

S. Prayla Shyry ◽

Y. Bevish Jinila

Keyword(s):

Text Classification ◽

Content Filtering

Download Full-text

Deep learning based multi-label text classification of UNGA resolutions

Proceedings of the 13th International Conference on Theory and Practice of Electronic Governance ◽

10.1145/3428502.3428604 ◽

2020 ◽

Author(s):

Francesco Sovrano ◽

Monica Palmirani ◽

Fabio Vitali

Keyword(s):

Deep Learning ◽

Text Classification

Download Full-text

Classification of the Global Tidal Types Based on Auto-correlation Analysis

Ocean Science Journal ◽

10.1007/s12601-019-0009-7 ◽

2019 ◽

Vol 54 (2) ◽

pp. 279-286

Author(s):

Sung-Hwa Lee ◽

You-Soon Chang

Keyword(s):

Correlation Analysis ◽

Auto Correlation

Download Full-text

Towards Classification of Personality Prediction Model: A Combination of BERT Word Embedding and MLSMOTE

10.1109/iccsai53272.2021.9609750 ◽

2021 ◽

Author(s):

Henry Lucky ◽

Roslynlia ◽

Derwin Suhartono

Keyword(s):

Prediction Model ◽

Word Embedding ◽

Personality Prediction

Download Full-text

Automate Labeling Of Bugs and Tickets Using Attention-Based Mechanism in Recurrent Neral Networks

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.38982 ◽

2021 ◽

Vol 9 (11) ◽

pp. 1411-1418

Author(s):

Ravi Kauthale

Keyword(s):

Neural Networks ◽

Text Classification ◽

Recurrent Neural Networks ◽

Support Systems ◽

The Neural Networks

Abstract: The aim here is to explore the methods to automate the labelling of the information that is present in bug trackers and client support systems. This is majorly based on the classification of the content depending on some criteria e.g., priority or product area. Labelling of the tickets is important as it helps in effective and efficient handling of the ticket and help is quicker and comprehensive resolution of the tickets. The main goal of the project is to analyze the existing methodologies used for automated labelling and then use a newer approach and compare the results. The existing methodologies are the ones which are based of the neural networks and without neural networks. In this project, a newer approach based on the recurrent neural networks which are based on the hierarchical attention paradigm will be used. Keywords: Automate Labeling, Recurrent Neural Networks, Hierarchical Attention, Multi-class Text Classification, GRU

Download Full-text