An Ensemble Approach to Multi-label Classification of Textual Data

There is growing interest in the study and practice of applying data science (DS) and machine learning (ML) to automate decision making in safety-critical industries. As an alternative or augmentation to human review, there are opportunities to explore these methods for classifying aviation operational events by root cause. This study seeks to apply a thoughtful approach to design, compare, and combine rule-based and ML techniques to classify events caused by human error in aircraft/engine assembly, maintenance or operation. Event reports contain a combination of continuous parameters, unstructured text entries, and categorical selections. A Human Factors approach to classifier development prioritizes the evaluation of distinct data features and entry methods to improve modeling. Findings, including the performance of tested models, led to recommendations for the design of textual data collection systems and classification approaches.

Download Full-text

An ensemble approach for the detection and classification of mixed pixels of remotely sensed images

2018 4th International Conference on Recent Advances in Information Technology (RAIT) ◽

10.1109/rait.2018.8389026 ◽

2018 ◽

Author(s):

Shabnam Choudhury ◽

Anik Chaudhuri ◽

Dipti Patra

Keyword(s):

Remotely Sensed ◽

Ensemble Approach ◽

Mixed Pixels ◽

Remotely Sensed Images

Download Full-text

News Classification Using Machine Learning

International Journal on Recent and Innovation Trends in Computing and Communication ◽

10.17762/ijritcc.v9i5.5464 ◽

2021 ◽

Vol 9 (5) ◽

pp. 23-27

Author(s):

SHWETA MAHAJAN

Keyword(s):

Machine Learning ◽

Social Media ◽

Performance Improvement ◽

Vital Role ◽

Learning Approach ◽

Entertainment Education ◽

Meaningful Information ◽

Textual Data ◽

Machine Learning Approach

There are plenty of social media webpages and platforms producing the textual data. These different kind of a data needs to be analysed and processed to extract meaningful information from raw data. Classification of text plays a vital role in extraction of useful information along with summarization, text retrieval. In our work we have considered the problem of news classification using machine learning approach. Currently we have a news related dataset which having various types of data like entertainment, education, sports, politics, etc. On this data we have applying classification algorithm with some word vectorizing techniques in order to get best result. The results which we got that have been compared on different parameters like Precision, Recall, F1 Score, accuracy for performance improvement.

Download Full-text

On the Utility of Incremental Feature Selection for the Classification of Textual Data Streams

Advances in Informatics - Lecture Notes in Computer Science ◽

10.1007/11573036_32 ◽

2005 ◽

pp. 338-348 ◽

Cited By ~ 21

Author(s):

Ioannis Katakis ◽

Grigorios Tsoumakas ◽

Ioannis Vlahavas

Keyword(s):

Feature Selection ◽

Data Streams ◽

Incremental Feature Selection ◽

Textual Data ◽

Selection For

Download Full-text

Emotion Mining Using Semantic Similarity

International Journal of Synthetic Emotions ◽

10.4018/ijse.2018070101 ◽

2018 ◽

Vol 9 (2) ◽

pp. 1-22 ◽

Cited By ~ 2

Author(s):

Rafiya Jan ◽

Afaq Alam Khan

Keyword(s):

Semantic Similarity ◽

Semantic Relatedness ◽

Semantic Features ◽

Emotion Detection ◽

Emotion Classification ◽

Textual Data ◽

Different Types ◽

Distributional Semantic Models ◽

Affective Information

Social networks are considered as the most abundant sources of affective information for sentiment and emotion classification. Emotion classification is the challenging task of classifying emotions into different types. Emotions being universal, the automatic exploration of emotion is considered as a difficult task to perform. A lot of the research is being conducted in the field of automatic emotion detection in textual data streams. However, very little attention is paid towards capturing semantic features of the text. In this article, the authors present the technique of semantic relatedness for automatic classification of emotion in the text using distributional semantic models. This approach uses semantic similarity for measuring the coherence between the two emotionally related entities. Before classification, data is pre-processed to remove the irrelevant fields and inconsistencies and to improve the performance. The proposed approach achieved the accuracy of 71.795%, which is competitive considering as no training or annotation of data is done.

Download Full-text

Ooredoo Rayek

International Journal of Technology Diffusion ◽

10.4018/ijtd.2020040105 ◽

2020 ◽

Vol 11 (2) ◽

pp. 66-81

Author(s):

Badia Klouche ◽

Sidi Mohamed Benslimane ◽

Sakina Rim Bennabi

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Text Mining ◽

Sentiment Analysis ◽

Experimental Results ◽

Support Vector ◽

Textual Data ◽

New Strategy ◽

Set Up

Sentiment analysis is one of the recent areas of emerging research in the classification of sentiment polarity and text mining, particularly with the considerable number of opinions available on social media. The Algerian Operator Telephone Ooredoo, as other operators, deploys in its new strategy to conquer new customers, by exploiting their opinions through a sentiments analysis. The purpose of this work is to set up a system called “Ooredoo Rayek”, whose objective is to collect, transliterate, translate and classify the textual data expressed by the Ooredoo operator's customers. This article developed a set of rules allowing the transliteration from Algerian Arabizi to Algerian dialect. Furthermore, the authors used Naïve Bayes (NB) and (Support Vector Machine) SVM classifiers to assign polarity tags to Facebook comments from the official pages of Ooredoo written in multilingual and multi-dialect context. Experimental results show that the system obtains good performance with 83% of accuracy.

Download Full-text

An Ensemble Approach to Transfer Learning for Classification of Habitat Mapping

2018 International Conference on Signal Processing and Information Security (ICSPIS) ◽

10.1109/cspis.2018.8642722 ◽

2018 ◽

Cited By ~ 1

Author(s):

Prajowal Manandhar ◽

Prashanth Reddy Marpu ◽

Zeyar Aung

Keyword(s):

Transfer Learning ◽

Habitat Mapping ◽

Ensemble Approach

Download Full-text

The Social Spiders in the Clustering of Texts

International Journal of Artificial Life Research ◽

10.4018/jalr.2012070101 ◽

2012 ◽

Vol 3 (3) ◽

pp. 1-14 ◽

Cited By ~ 7

Author(s):

Reda Mohamed Hamou ◽

Abdelmalek Amine ◽

Ahmed Chaouki Lokbani

Keyword(s):

Data Stream ◽

Combinatorial Problem ◽

Large Data ◽

Social Spiders ◽

Stream Flows ◽

The Social ◽

Textual Data ◽

Biomimetic Approach ◽

N Gram

In this paper the authors experiment and test a new biomimetic approach based on social spiders to solve a combinatorial problem ie the automatic classification of texts because a very large data stream flows and particularly on the web. Representation of textual data was performed by a method independent of the language ie n-gram characters and words because there is currently no method of learning that can directly represent unstructured data (text). To validate the classification, the authors used a measure of evaluation based on recall and precision (F-measure). During the experiment, the authors found a powerful visualization tool in social spiders that they exploit to make visual classification.

Download Full-text

Methods for Classification of Text Data: Can the Potential of Quantitative Analysis Be Applied to Qualitative Research?

Inter ◽

10.19181/inter.2021.13.2.5 ◽

2021 ◽

Vol 13 (2) ◽

pp. 81-96

Author(s):

Marina Aleksandrova

Keyword(s):

Social Sciences ◽

European Social Survey ◽

Research Field ◽

Item Nonresponse ◽

Text Data ◽

Social Survey ◽

The Social ◽

Textual Data ◽

Analysis Of Results

Text mining has developed rapidly in recent years. In this article we compare classification methods that are suitable for solving problems of predicting item nonresponse. The author builds reasoning about how the analysis of textual data can be implemented in a wider research field based on this material. The author considers a number of metrics adapted for textual analysis in the social sciences: accuracy, precision, recall, F1-score, and gives examples that can help a sociologist figure out which of them is worth paying attention depending on the task at hand (classify text data with equal accuracy, or more fully describe one of the classes of interest). The article proposes an analysis of results obtained by analyzing texts based on the materials of the European Social Survey (ESS).

Download Full-text

Classification of P300 Component Using a Riemannian Ensemble Approach

IFMBE Proceedings - XV Mediterranean Conference on Medical and Biological Engineering and Computing – MEDICON 2019 ◽

10.1007/978-3-030-31635-8_229 ◽

2019 ◽

pp. 1885-1889

Author(s):

Dominik Krzemiński ◽

Sebastian Michelmann ◽

Matthias Treder ◽

Lorena Santamaria

Keyword(s):

P300 Component ◽

Ensemble Approach

Download Full-text