A Brief Survey on Text Classification Using Various Machine Learning Techniques

Author(s):  
Padmavathi .S ◽  
M. Chidambaram

Text classification has grown into more significant in managing and organizing the text data due to tremendous growth of online information. It does classification of documents in to fixed number of predefined categories. Rule based approach and Machine learning approach are the two ways of text classification. In rule based approach, classification of documents is done based on manually defined rules. In Machine learning based approach, classification rules or classifier are defined automatically using example documents. It has higher recall and quick process. This paper shows an investigation on text classification utilizing different machine learning techniques.

Author(s):  
P. Rama Santosh Naidu ◽  
K.Venkata Ramana ◽  
G. Lavanya Devi

In recent days Machine Learning has become major study aspect in various applications that includes medical care where convenient discovery of anomalies in ECG signals plays an important role in monitoring patient's condition regularly. This study concentrates on various MachineLearning techniques applied for classification of ECG signals which include CNN and RNN. In the past few years, it is being observed that CNN is playing a dominant role in feature extraction from which we can infer that machine learning techniques have been showing accuracy and progress in classification of ECG signals. Therefore, this paper includes Convolutional Neural Network and Recurrent Neural Network which is being classified into two types for better results from considerably increased depth.


2021 ◽  
Vol 7 (1) ◽  
pp. 51
Author(s):  
Rubén Pérez-Jove ◽  
Cristian R. Munteanu ◽  
Alejandro Pazos Sierra ◽  
José M. Vázquez-Naya

In the field of computer security, the possibility of knowing which specific version of an operating system is running behind a machine can be useful, to assist in a penetration test or monitor the devices connected to a specific network. One of the most widespread tools that better provides this functionality is Nmap, which follows a rule-based approach for this process. In this context, applying machine learning techniques seems to be a good option for addressing this task. The present work explores the strengths of different machine learning algorithms to perform operating system fingerprinting, using for that, the Nmap reference database. Moreover, some optimizations were applied to the method which brought the best results, random forest, obtaining an accuracy higher than 96%.


Author(s):  
Damian Alberto

The manual classification of a large amount of textual materials are very costly in time and personnel. For this reason, a lot of research has been devoted to the problem of automatic classification and work on the subject dates from 1960. A lot of text classification software has appeared. For some tasks, automatic classifiers perform almost as well as humans, but for others, the gap is still large. These systems are directly related to machine learning. It aims to achieve tasks normally affordable only by humans. There are generally two types of learning: learning “by heart,” which consists of storing information as is, and learning generalization, where we learn from examples. In this chapter, the authors address the classification concept in detail and how to solve different classification problems using different machine learning techniques.


Author(s):  
Ernesto Dufrechou ◽  
Pablo Ezzatti ◽  
Enrique S Quintana-Ortí

More than 10 years of research related to the development of efficient GPU routines for the sparse matrix-vector product (SpMV) have led to several realizations, each with its own strengths and weaknesses. In this work, we review some of the most relevant efforts on the subject, evaluate a few prominent routines that are publicly available using more than 3000 matrices from different applications, and apply machine learning techniques to anticipate which SpMV realization will perform best for each sparse matrix on a given parallel platform. Our numerical experiments confirm the methods offer such varied behaviors depending on the matrix structure that the identification of general rules to select the optimal method for a given matrix becomes extremely difficult, though some useful strategies (heuristics) can be defined. Using a machine learning approach, we show that it is possible to obtain unexpensive classifiers that predict the best method for a given sparse matrix with over 80% accuracy, demonstrating that this approach can deliver important reductions in both execution time and energy consumption.


Sign in / Sign up

Export Citation Format

Share Document