Multi-Class Text Classification Using Machine Learning Models for Online Drug Reviews

News articles are important for providing timely, historic information. However, the Internet is replete with text that may contain irrelevant or unhelpful information, therefore means of processing it and distilling content is important and useful to human readers as well as information extracting tools. Some common questions we may want to answer are “what is this article about?” and “who wrote it?”. In this work we compare machine learning models for evaluating two common NLP tasks, topic and authorship attribution, on the 2017 Vox Media dataset. Additionally, we use the models to classify on a subsection, about ~20%, of the original text which show to be better for classification than the provided blurbs. Because of the large number of topics, we take into account topic overlap and address it via top-n accuracy and hierarchical groupings of topics. We also consider edge cases in authorship by classifying on inter-topic and intra-topic author distributions. Our results show that both topics and authors readily identifiable consistently perform best when using neural networks rather than support vector, random forests, or naive Bayes classifiers, although the latter methods perform acceptably.

Download Full-text

A comparative analysis of machine learning models for quality pillar assessment of SaaS services by multi-class text classification of users’ reviews

Future Generation Computer Systems ◽

10.1016/j.future.2019.06.022 ◽

2019 ◽

Vol 101 ◽

pp. 341-371 ◽

Cited By ~ 6

Author(s):

Muhammad Raza ◽

Farookh Khadeer Hussain ◽

Omar Khadeer Hussain ◽

Ming Zhao ◽

Zia ur Rehman

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Text Classification ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Text classification using Fuzzy TF-IDF and Machine Learning Models

Proceedings of the 4th International Conference on Big Data and Internet of Things ◽

10.1145/3372938.3372956 ◽

2019 ◽

Author(s):

Mariem Bounabi ◽

Karim El Moutaouakil ◽

Khalid Satori

Keyword(s):

Machine Learning ◽

Text Classification ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Machine learning models for bank reviews classification

Keldysh Institute Preprints ◽

10.20948/prepr-2021-50 ◽

2021 ◽

pp. 1-14

Author(s):

Natalya Dmitriyevna Badanina ◽

Vladimir Anatolievich Sudakov

Keyword(s):

Machine Learning ◽

Text Classification ◽

Training Sample ◽

Corpus Analysis ◽

Classification Models ◽

Learning Models ◽

Textual Information ◽

Internet Resources ◽

People’S Attitudes ◽

Machine Learning Models

Using the banking products and services review corpus, analysis is conducted to establish different text classification models. The paper explores different approaches to the processing of unstructured textual information. Based on the selected approaches, the review corpus on banking products and services received during the COVID-19 pandemic is analyzed. An automatic Internet resources parser has been developed to obtain the required training sample. Software has been developed that implemens basic methods for the classification models construction. This model can be used to create system for monitoring people’s attitudes to banking processes.

Download Full-text

Multi Faceted Text Classification using Supervised Machine Learning Models

10.31979/etd.7crd-u5pw ◽

2016 ◽

Author(s):

Abhiteja Gajjala

Keyword(s):

Machine Learning ◽

Text Classification ◽

Supervised Machine Learning ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Improving XGBoost with Imagination Sampling

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.2.1.holloway.1 ◽

2020 ◽

Vol 2 (1) ◽

pp. 3-6

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

General System ◽

Learning Models ◽

Starting Point ◽

Machine Learning Models

Imagination Sampling is the usage of a person as an oracle for generating or improving machine learning models. Previous work demonstrated a general system for using Imagination Sampling for obtaining multibox models. Here, the possibility of importing such models as the starting point for further automatic enhancement is explored.

Download Full-text

Development of Machine Learning Models to Predict Student Performance in Computer Literacy Courses

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v13i1.16863 ◽

2018 ◽

Vol 13 (1) ◽

pp. 21

Author(s):

George Anderson ◽

Oduronke T. Eyitayo

Keyword(s):

Machine Learning ◽

Student Performance ◽

Computer Literacy ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Experimental Comparison of Machine Learning Models in Malware Packing Detection

2020 21st Asia-Pacific Network Operations and Management Symposium (APNOMS) ◽

10.23919/apnoms50412.2020.9237007 ◽

2020 ◽

Author(s):

Jong-Wouk Kim ◽

Juhong Namgung ◽

Yang-Sae Moon ◽

Mi-Jung Choi

Keyword(s):

Machine Learning ◽

Experimental Comparison ◽

Learning Models ◽

Machine Learning Models

Download Full-text

Epigenetic Target Prediction with Accurate Machine Learning Models

10.26434/chemrxiv.13522313 ◽

2021 ◽

Author(s):

Norberto Sánchez-Cruz ◽

Jose L. Medina-Franco

Keyword(s):

Machine Learning ◽

Small Molecules ◽

Predictive Models ◽

Large Scale ◽

Target Prediction ◽

Quantitative Measure ◽

Learning Models ◽

Discovery Research ◽

Drug Discovery Research ◽

Machine Learning Models

<p>Epigenetic targets are a significant focus for drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents a large amount of structure-activity relationships that has not been exploited thus far for the development of predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. Through a systematic comparison of machine learning models trained on molecular fingerprints of different design, we built predictive models with high accuracy for the epigenetic target profiling of small molecules. The models were thoroughly validated showing mean precisions up to 0.952 for the epigenetic target prediction task. Our results indicate that the herein reported models have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as freely accessible and easy-to-use web application.</p>

Download Full-text