Comparative analysis of machine learning-based classification models using sentiment classification of tweets related to COVID-19 pandemic

Terahertz time-domain spectroscopy is a useful technique for determining some physical characteristics of materials, and is based on selective frequency absorption of a broad-spectrum electromagnetic pulse. To investigate the potential of this technology to classify cocoa percentages in chocolates, the terahertz spectra (0.5–10 THz) of five chocolate samples (50%, 60%, 70%, 80% and 90% cocoa) were examined. The acquired data matrices were analyzed with MATLAB 2019b, from which the dielectric function and the absorbance curves were obtained, and were then classified using 24 mathematical classification models. The best result, a differentiation rate of around 93%, was obtained by the Gaussian SVM model with a kernel scale of 0.35 and a one-against-one multiclass method. It was concluded that combining the processing and classification of images obtained from terahertz time-domain spectroscopy with machine learning algorithms can successfully classify chocolates with different percentages of cocoa.
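The two settings named in the abstract, a Gaussian kernel with a kernel scale of 0.35 and a one-against-one multiclass method, can be illustrated with a minimal Python sketch. It is not the authors' MATLAB code. The function names are hypothetical, and the sketch assumes that the kernel scale is applied by dividing the inputs by it before the Gaussian kernel is evaluated.

```python
import math
from itertools import combinations


def gaussian_kernel(x, y, kernel_scale=0.35):
    """Gaussian (RBF) kernel, assuming inputs are divided by kernel_scale first."""
    sq_dist = sum(((a - b) / kernel_scale) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist)


def one_vs_one_pairs(labels):
    """Return every unordered pair of classes; one binary SVM is trained per pair."""
    return list(combinations(sorted(set(labels)), 2))


# The five cocoa percentages listed in the abstract.
cocoa_classes = [50, 60, 70, 80, 90]
pairs = one_vs_one_pairs(cocoa_classes)
print(len(pairs), "binary classifiers:", pairs)

# Identical inputs give the maximum kernel value of 1.0.
print(gaussian_kernel([0.1, 0.2], [0.1, 0.2]))
```

With five classes, a one-against-one scheme trains one binary classifier for each of the 10 class pairs and predicts by combining their votes.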

Breast Cancer Type Classification Using Machine Learning

Journal of Personalized Medicine ◽

10.3390/jpm11020061 ◽

2021 ◽

Vol 11 (2) ◽

pp. 61

Author(s):

Jiande Wu ◽

Chindo Hicks

Keyword(s):

Breast Cancer ◽

Gene Expression ◽

Machine Learning ◽

Triple Negative Breast Cancer ◽

Triple Negative ◽

Genomic Research ◽

Support Vector ◽

Cancer Type ◽

Classification Models

Background: Breast cancer is a heterogeneous disease defined by molecular types and subtypes. Advances in genomic research have enabled the use of precision medicine in the clinical management of breast cancer. A critical unmet medical need is distinguishing triple negative breast cancer, the most aggressive and lethal form of breast cancer, from non-triple negative breast cancer. Here we propose the use of a machine learning (ML) approach for classifying triple negative and non-triple negative breast cancer patients using gene expression data. Methods: We analyzed RNA-Sequence data from 110 triple negative and 992 non-triple negative breast cancer tumor samples from The Cancer Genome Atlas to select the features (genes) used in the development and validation of the classification models. We evaluated four classification models (Support Vector Machines, K-nearest neighbor, Naïve Bayes and Decision tree), using features selected at different threshold levels to train the models to classify the two types of breast cancer. For performance evaluation and validation, the proposed methods were applied to independent gene expression datasets. Results: Among the four ML algorithms evaluated, the Support Vector Machine algorithm classified breast cancer into triple negative and non-triple negative breast cancer more accurately, and with fewer misclassification errors, than the other three algorithms. Conclusions: The prediction results show that ML algorithms are efficient and can be used to classify breast cancer into triple negative and non-triple negative types.
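The sample counts in the Methods (110 triple negative, 992 non-triple negative) imply a strong class imbalance, which is worth keeping in mind when reading accuracy figures. The short Python sketch below computes the accuracy of a trivial classifier that always predicts the larger class. It is an illustrative calculation from the counts in the abstract, not part of the authors' evaluation, and the function name is hypothetical.

```python
def majority_baseline_accuracy(class_counts):
    """Accuracy obtained by always predicting the most frequent class."""
    return max(class_counts.values()) / sum(class_counts.values())


# Sample counts taken from the abstract.
tcga_counts = {"triple_negative": 110, "non_triple_negative": 992}

baseline = majority_baseline_accuracy(tcga_counts)
print(f"samples: {sum(tcga_counts.values())}, baseline accuracy: {baseline:.3f}")
```

A model evaluated on data with this class ratio would need to exceed roughly 90% accuracy to improve on that trivial baseline.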

An Automated Machine Learning Approach for Sentiment Classification of Bengali E-Commerce Sites

2019 IEEE 5th International Conference for Convergence in Technology (I2CT) ◽

10.1109/i2ct45611.2019.9033741 ◽

2019 ◽

Author(s):

Md. Golam Sarowar ◽

Mushfiqur Rahman ◽

Md. Nawab Yousuf Ali ◽

Omor Faruk Rakib

Keyword(s):

Machine Learning ◽

Sentiment Classification ◽

Learning Approach ◽

Machine Learning Approach ◽

Automated Machine Learning

Comparative Analysis of Machine Learning Algorithm for Classification of different Osteosarcoma types

10.1109/icccnt51525.2021.9579556 ◽

2021 ◽

Author(s):

Sanket Mahore ◽

Kalyani Bhole ◽

Shashikant Rathod

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Learning Algorithm ◽

Machine Learning Algorithm

The Potential of Machine Learning Algorithms for Sentiment Classification of Students’ Feedback on MOOC

10.1007/978-3-030-82199-9_2 ◽

2021 ◽

pp. 11-22

Author(s):

Maryam Edalati ◽

Ali Shariq Imran ◽

Zenun Kastrati ◽

Sher Muhammad Daudpota

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Sentiment Classification

Comparative analysis of machine learning classification of time series with fractal properties

2019 IEEE 8th International Conference on Advanced Optoelectronics and Lasers (CAOL) ◽

10.1109/caol46282.2019.9019416 ◽

2019 ◽

Author(s):

Tamara Radivilova ◽

Lyudmyla Kirichenko ◽

Bulakh Vitalii

Keyword(s):

Machine Learning ◽

Time Series ◽

Comparative Analysis ◽

Machine Learning Classification ◽

Fractal Properties

Sentiment Classification Of Movie Review And Twitter Data Using Machine Learning

International Journal of Computer & Organization Trends ◽

10.14445/22492593/ijcot-v9i3p301 ◽

2019 ◽

Vol 9 (3) ◽

pp. 1-8

Author(s):

Prafulla Mohapatra ◽

Rohit Kumar Singh ◽

Shashank Pandey ◽

PrashanthAnand Kumar ◽

Asha K N

Keyword(s):

Machine Learning ◽

Sentiment Classification ◽

Twitter Data

Sentiment classification of online Cantonese reviews by supervised machine learning approaches

International Journal of Web Engineering and Technology ◽

10.1504/ijwet.2009.032254 ◽

2009 ◽

Vol 5 (4) ◽

pp. 382 ◽

Cited By ~ 11

Author(s):

Ziqiong Zhang ◽

Qiang Ye ◽

Yijun Li ◽

Rob Law

Keyword(s):

Machine Learning ◽

Sentiment Classification ◽

Supervised Machine Learning ◽

Learning Approaches

Sentiment Classification of Bank Clients’ Reviews Written in the Polish Language

Acta Universitatis Lodziensis Folia oeconomica ◽

10.18778/0208-6018.353.03 ◽

2021 ◽

pp. 43-56

Author(s):

Adam Piotr Idczak

Keyword(s):

Logistic Regression ◽

Comparative Analysis ◽

Text Classification ◽

Sentiment Classification ◽

Bayes Classifier ◽

Text Documents ◽

Text Document ◽

Polish Language ◽

Common Problems

It is estimated that approximately 80% of all data gathered by companies are text documents. This article is devoted to one of the most common problems in text mining, i.e. text classification in sentiment analysis, which focuses on determining a document's sentiment. The lack of a defined structure in text makes this problem more challenging, and has led to the development of various techniques for determining a document's sentiment. This paper presents a comparative analysis of two sentiment classification methods: the naive Bayes classifier and logistic regression. The analysed texts are written in Polish and come from banks. Classification was conducted using a bag-of-n-grams approach, in which a text document is represented as a set of terms and each term consists of n words. The results show that logistic regression performed better.
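The bag-of-n-grams representation described above can be sketched in a few lines of Python. This is a minimal illustration, not the author's implementation: it assumes whitespace tokenisation and lowercasing, the function names are hypothetical, and the English sample sentence is invented for the example (the paper's texts are in Polish).

```python
from collections import Counter


def word_ngrams(text, n):
    """Split text on whitespace and return all runs of n consecutive words."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_ngrams(text, n):
    """Count how often each n-gram occurs; word order beyond n words is discarded."""
    return Counter(word_ngrams(text, n))


# Invented sample review, for illustration only.
review = "the bank service was very good very good"
bag = bag_of_ngrams(review, 2)
print(bag.most_common(3))
```

Each document's counts can then be used as the feature vector passed to a classifier such as naive Bayes or logistic regression.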

Concept of TF-IDF, Common Bag of Word and Word Embedding for Effective Sentiment Classification

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f4582.049620 ◽

2020 ◽

Vol 9 (4) ◽

pp. 2198-2201

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Sentiment Classification ◽

Word Embedding ◽

Text Representation ◽

Human Beings ◽

Text Data

Sentiment classification is one of the best-known and most popular domains of machine learning and natural language processing. An algorithm is developed to understand the opinion of an entity in a way similar to human beings. This research article presents such an algorithm. Concepts from natural language processing are used for text representation, and a novel word embedding model is then proposed for effective classification of the data. TF-IDF and Common BoW representation models were considered for representing the text data, and the importance of these models is discussed in the respective sections. The proposed model is tested using IMDB datasets. A split of 50% training and 50% testing, with three random shufflings of the datasets, is used to evaluate the model.
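As a rough illustration of two ideas in this abstract, the Python sketch below computes TF-IDF weights and generates three shuffled 50%/50% train/test splits. It is a sketch under stated assumptions, not the paper's method. The TF-IDF variant (relative term frequency times the natural log of inverse document frequency) is only one of several in common use, the function names and seed are hypothetical, and the two toy documents are invented.

```python
import math
import random
from collections import Counter


def tf_idf(docs):
    """docs is a list of token lists; returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


def half_splits(n_items, n_repeats=3, seed=0):
    """Return n_repeats (train, test) index pairs, each a shuffled 50/50 split."""
    rng = random.Random(seed)
    splits = []
    for _ in range(n_repeats):
        indices = list(range(n_items))
        rng.shuffle(indices)
        half = n_items // 2
        splits.append((indices[:half], indices[half:]))
    return splits


# Invented toy documents, for illustration only.
docs = [["good", "movie"], ["bad", "movie"]]
weights = tf_idf(docs)
print(weights[0])

splits = half_splits(10)
print(splits[0])
```

In this variant a term that appears in every document, such as "movie" here, receives a weight of zero, so only the terms that distinguish documents contribute to the representation.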
